03. How AI Helps
These tools sort by the need they meet.
Seeing: describing images and surroundings
For blind and low-vision users, several apps turn a phone camera into a narrator. Be My Eyes, which has linked blind users with volunteers since 2012, added Be My AI on OpenAI's GPT-4, where you photograph something, hear a description, and ask follow-up questions in plain language, free for blind and low-vision users. Microsoft's Seeing AI, a free talking camera from 2017 and on Android since December 2023, narrates text, documents, barcodes, scenes, and currency, and can answer questions about a scanned document. Google's Lookout describes an image with or without a caption, and Guided Frame gives blind users spoken guidance to take their own photo. Apple's June 2026 update builds this in, so VoiceOver describes images across apps and a Live Recognition button answers questions about what the camera sees.
This is visual question answering, explained in multimodal models and built on the image recognition in computer vision.
Hearing: real-time captions and transcription
For deaf and hard-of-hearing users, phones caption the world in real time. Google's Live Transcribe writes out speech and key surrounding sounds in over 120 languages, and Google reports it helps more than a billion people. Live Caption adds automatic captions to almost any audio on Android. Apple's Live Captions do the same across its system, and in 2025 Apple brought them to the Apple Watch and to a connected braille display, so a deafblind user can read a conversation by touch. Sound Recognition and Music Haptics add non-audio cues, like a doorbell alert or music turned into taps.
The engine is automatic speech recognition, covered in speech and audio AI.
Speaking and atypical speech: synthetic and banked voices
For people who cannot speak, or are losing the ability, AI provides a voice. Apple's Live Speech reads typed text aloud, and Personal Voice uses on-device machine learning to recreate the user's own voice from recordings, so someone facing a condition like ALS can keep sounding like themselves, a setup Apple cut to about ten short phrases in under a minute in 2025. These pair with the augmentative and alternative communication (AAC) apps many nonspeaking people use.
A harder problem is speech the model cannot understand, since standard recognition is trained mostly on typical speech and stumbles on the atypical patterns of conditions like cerebral palsy, Parkinson's, or stroke. Google's Project Euphonia, launched in 2019, trained personalized models on samples from people with non-standard speech, which Google reports cut the median word error rate by an average of more than 80 percent, and shipped as the Project Relate app. The University of Illinois Speech Accessibility Project builds a shared, de-identified dataset of disordered speech with a coalition of Amazon, Apple, Google, Meta, and Microsoft, and Apple's 2024 Listen for Atypical Speech drew on that work.
Motor control: operating a device hands-free
For people with motor or physical disabilities, AI enables control with no keyboard, mouse, or touch. Apple's Eye Tracking, announced in 2024, lets a user navigate an iPhone or iPad with their eyes alone, using the front camera and on-device machine learning with no added hardware. Voice Control runs a device by spoken command and, in the June 2026 update, takes natural-language instructions. Head Tracking and Switch Control remain, and Apple has announced brain-computer interface support and, in 2026, eye-driven control of a compatible power wheelchair through Vision Pro. Availability varies by device and region.
Reading and cognition: simplifying text
For users with dyslexia, low vision, or other reading and cognitive differences, AI lowers the effort of reading. Microsoft's free Immersive Reader reads text aloud while highlighting each word, with adjustable spacing, syllable splitting, a focused line, and a picture dictionary, its design grounded in reading research. Apple's Accessibility Reader, announced in 2025, is a comparable system-wide reading mode. The same summarizing a chatbot does can turn dense text into plainer language for someone who needs it.