- Published on
AI Accessibility & Disability Tech 2026 Deep Dive - Be My Eyes · Seeing AI · Speechify · Otter Live Captions · VoiceItt · Glean · Apple Live Caption · Google TalkBack
- Authors

- Name
- Youngju Kim
- @fjvbn20031
Prologue — 1.3 Billion People and AI
WHO's 2024 "Global report on health equity for persons with disabilities" puts the number of people with a disability at roughly 1.3 billion globally, about 16 % of humanity. That includes vision, hearing, speech, cognitive/learning, motor/musculoskeletal, and mental health.
As of May 2026 this population sits between two pressures.
- Aging — Japan's share of people 65+ is roughly 29 %, Korea about 19 %, the US about 17 %. Aging tends to come with declines in vision, hearing, and cognition.
- Digital transformation — Paper to apps, counters to kiosks, in-person to video meetings. Without accessibility, social participation itself breaks.
Two things then converged on top.
- Multimodal LLMs like GPT-4o — Image to text, audio to text, and text to audio in a single model. Be My Eyes' Be My AI and Microsoft Seeing AI sit directly on this layer.
- Legal teeth — The US DOJ published a final ADA Title II digital accessibility rule in April 2024; the European Accessibility Act enters mandatory enforcement in June 2025. Korea's Anti-Discrimination Against and Remedies for Persons with Disabilities Act and Japan's Act for Eliminating Discrimination against Persons with Disabilities point the same way.
This article consolidates the leading sixty-plus tools across vision, hearing, speech, cognitive, motor, and AAC; the core standards; testing tooling; and the Korean and Japanese landscapes — all as of May 2026.
1. Why Disability Tech Matters in 2026
Assistive Technology (AT) was, through the 2010s, expensive and hardware-centric. A JAWS license ran 1,000 USD, a refreshable braille display 3,000-5,000 USD, a dedicated AAC device 7,000-12,000 USD.
Three things shifted in the 2020s.
- Smartphone ubiquity — iOS and Android ship VoiceOver, TalkBack, Live Caption, and Switch Control for free. Core functionality no longer requires special hardware.
- Multimodal AI — GPT-4o, Gemini 2.5, and Claude 3.7 process image, video, and audio in one model. Be My AI and Seeing AI rode that wave.
- Stronger legal teeth — US ADA Title III 2024 DOJ rule, EU EAA 2025 enforcement, Korea's anti-discrimination act, and Japan's information accessibility law amendments.
The intersection turns "tools only people with disabilities use" into "tools an aging society uses every day." Captions matter to deaf users and to commuters watching video on a silent subway alike.
2. Vision Accessibility — Two Axes: Human Volunteers and AI
Vision impairment (blind plus low vision) is around 250 million worldwide per WHO; full blindness is about 43 million.
In 2026 vision accessibility tools split along two axes.
- Human volunteers / remote assistants — Be My Eyes (volunteers), Aira (paid professionals), Glide (navigation guidance).
- AI-only processing — Be My AI (GPT-4o), Microsoft Seeing AI, Google Lookout, Apple Live Recognition, Envision AI.
The two axes are complementary. AI is fast and free but can hallucinate; humans are accurate but bounded by time, cost, and privacy. Be My Eyes is the most cited 2026 case study because it bundles both into a single app.
3. Be My Eyes + Be My AI — The Best-Known GPT-4o Application
Be My Eyes (bemyeyes.com) was founded in Denmark in 2015 by Hans Jørgen Wiberg. It connects blind users with sighted volunteers over video calls.
- Cumulative volunteers — Roughly 8 million registered as of May 2026.
- Cumulative users — About 700,000 blind and low-vision users.
- Languages — Around 180 supported.
In March 2023, a partnership with OpenAI launched Be My AI (originally "Virtual Volunteer") in beta. It was one of the first external applications of GPT-4 multimodal, and after GPT-4o shipped in 2024 both response latency and Korean/Japanese quality improved significantly.
How it works.
- The user takes a photo or captures a video frame.
- The image is sent to GPT-4o and returns a natural-language description.
- Follow-up questions are allowed ("How many calories in this can?", "What dosage is printed on this medication label?").
Limits.
- Hallucination — Users have reported cases of mis-read dosages on medication labels. Be My Eyes shows a "verify medical, legal, and financial decisions with a person" warning.
- Missing context — Because it only sees a single frame, dynamic situations (water running on a hot stove) can be missed.
4. Microsoft Seeing AI — Seven Years and Still Free
Seeing AI (seeingai.com) is Microsoft's free iOS app. Launched in the US in 2017, available in roughly 70 countries and 20 languages including Korean and Japanese as of 2024.
Channels available as of May 2026.
- Short Text — Read text in front of the camera immediately.
- Document — Multi-page document scanning, structure recognition.
- Product — Barcode recognition plus product info.
- Person — Face recognition, estimated age and emotion (or the registered name for known faces).
- Scene — Natural-language scene description.
- Currency — Identifies bill denominations (USD, EUR, GBP, with KRW and JPY added in 2024).
- Color — Identifies color in the camera view.
- Light — Indicates whether a light source is on via an audible cue.
- Photo Browser — Finds objects within the photo library.
Strengths — free, partial offline functionality, run by Microsoft with stable longevity. Drawbacks — iOS only (Android relies on OEM solutions), Korean and Japanese OCR somewhat trails English.
5. Google Lookout · Apple Live Recognition · Envision
A three-way comparison of OEM and third-party offerings.
- Google Lookout (Android) — Android-only. Food label mode, currency mode, document mode, quick text, and explore mode for ambient object narration. Korean and Japanese both supported.
- Apple Live Recognition (iOS 18+) — Integrated as part of VoiceOver. Announces text, people, doors, and furniture in the camera view. Door Detection requires LiDAR (iPhone Pro and iPad Pro).
- Envision AI (letsenvision.com) — Netherlands-based. Runs on iOS, Android, and Envision Glasses (built on Google Glass Enterprise Edition 2). Core features are text recognition, scene description, and "Ally Call" (remote helper).
Envision Glasses retail around 3,500 USD as of May 2026. Expensive, but hands-free, which makes them useful for navigation, cooking, and workplace scenarios.
6. OrCam · Aira · Glide — Specialized Assistive Tools
OrCam (orcam.com) — Israeli. A small camera plus speaker that attaches to glasses.
- OrCam MyEye 3 Pro — Around 4,500 USD. Magnet-mounted on glasses. Reads text on demand, recognizes faces, identifies colors and bills. Runs entirely on-device without the cloud.
- OrCam Read 3 — Around 1,990 USD. Pen-form factor. Point at a single line and it reads only that line. Popular with dyslexic and low-vision users.
Aira (aira.io) — US. Paid professional video-call vision assistance.
- Agents — Trained professionals guide live via your camera feed.
- Pricing — Monthly plans 29-200 USD.
- Public sponsorship — Walgreens, AT&T, Starbucks, and select airports provide "Site-in-Sight" free access on their premises.
Glide (glide.me) — A navigation guide for blind users. Camera plus AI provides audio cues for sidewalks, crosswalks, and stairs. Beta started in 2024; full service in select US cities by 2026.
7. Hearing Accessibility — The Live Caption Era
Hearing impairment, including hard-of-hearing, affects around 430 million people per WHO; roughly 150 million are at a level that affects communication.
The 2026 core of hearing accessibility is "live captions" — real-time conversion of surrounding sound into on-screen text.
OS-built-in.
- Apple Live Captions (iOS 16+, macOS Ventura+) — System-wide. Captions for FaceTime, Zoom, phone calls, and ambient audio. Available in English, Mandarin, Japanese, Korean, French, German, and more.
- Android Live Caption (Android 10+) — Introduced by Google in 2019. Spread beyond Pixel to many OEMs. Korean and Japanese support is limited to certain devices.
- Android Live Transcribe — Separate app. Supports 80+ languages, optimized for longer conversations.
Third-party.
- Google Sound Notifications (Android) — Converts alarms, doorbells, sirens, and dog barks into vibration plus visual alerts.
- Apple Sound Recognition (iOS) — Similar concept. Recognizes sirens, alarms, baby cries, and dog barks.
Live caption limits are speaker separation, non-standard pronunciation, and background noise. As of 2026 these systems are still not 100 % accurate — users need to know that going in.
8. Otter.ai · AVA · Innocaption — Meeting and Phone Captions
Otter.ai (otter.ai) — The shorthand for automated meeting transcription. Founded in 2016.
- Live Captions — Joins Zoom, Google Meet, and Microsoft Teams as a bot for real-time captioning with speaker diarization.
- Otter AI Chat — Ask post-meeting questions in natural language ("What action items were decided?").
- Pricing — Free (300 minutes monthly), Pro (16.99 USD monthly), Business (30 USD monthly).
From a deaf user's perspective, Otter has become — somewhat by accident — the most-used hearing assistive tool in workplaces. Drop the Otter bot into any meeting and every utterance shows up as text.
AVA (ava.me) — A live caption app designed specifically for deaf users.
- AVA Pro — Pair every participant's mic to lift speaker-separation accuracy.
- Group meetings — A QR code gathers participants so each speaker is diarized.
- Pricing — Free, Pro (29 USD monthly), Scribe (human captioner assist, around 3 USD per minute).
Innocaption (innocaption.com) — A US FCC-subsidized IP CTS (Internet Protocol Captioned Telephone Service). Deaf users register and use it free. Combines AI captions with human captioners for hybrid accuracy.
RogerVoice (rogervoice.com) — French. 100+ languages, automated captioned calls. Global service across Europe, Africa, and Asia.
9. VRS — Video Relay Service
VRS (Video Relay Service) in the US and Canada lets deaf users sign over video to an interpreter, who relays in voice to the other party — bidirectional real-time interpretation.
Four major US FCC-funded providers.
- Sorenson Communications — Market share leader. ntouch VP videophone, Sorenson Express app.
- Purple Communications — ZP4, ZP Mobile.
- ZVRS (CSDVRS) — Z5 endpoint.
- Convo Communications — Founded by deaf entrepreneurs. Developer-friendly tools like CodeOPS API.
Korea has a comparable service from KCC (the Korea Communications Commission) called "Telecommunications Relay Service" (sonmal-ieum center). Japan's Ministry of Internal Affairs launched its national "Telephone Relay Service" (電話リレーサービス) in July 2021, operated by the Nippon Foundation Telephone Relay Service.
10. Speech Accessibility — The New Wave of Non-Standard Speech Recognition
People with ALS, cerebral palsy, Down syndrome, or post-stroke speech changes whose speech departs from "standard" patterns had practically no usable voice-assistant access. This area is changing fastest in 2026.
VoiceItt (voiceitt.com) — Israeli. An app that learns "non-standard speech" and recognizes it.
- Personal model — User records around 200 words and phrases to train a personal model.
- Integrations — Joins Alexa, Google Home, Apple HomeKit, and Zoom captioning as a voice-input layer.
- Pricing — Free for individuals; enterprise and facility licensing separately.
Project Relate (Google) — Android app. Lets ALS and cerebral palsy users train on their own voice for better recognition. English first; Korean and Japanese are still beta as of 2026.
Talkitt — VoiceItt's former name. Same company.
The core of this category is "per-user personalization." Generic models drop in accuracy fast, so whichever vendor makes the personal-model training workflow smoothest wins.
11. Cognitive and Learning Disabilities — A Text-to-Speech Renaissance
Dyslexia affects an estimated 15-20 % of the US population; in Japan and Korea estimates run from about 5 %. ADHD and the autism spectrum are also major cognitive accessibility populations.
Speechify (speechify.com) — Founded in 2017 by Cliff Weitzman based on his own dyslexia experience.
- High-quality TTS — Licensed celebrity voices including Snoop Dogg and Gwyneth Paltrow.
- OCR plus read-aloud — Photograph a book and hear it spoken.
- PDF/EPUB/web sync — Highlights the position being read.
- Pricing — Free; Premium 139 USD yearly.
NaturalReader (naturalreaders.com) — Canadian. Adjacent space to Speechify. Free basic tier plus paid premium voices.
Microsoft Immersive Reader — Built into Edge, Word, OneNote, and Teams. Free. Multilingual decoder, picture dictionary, syllable splitting.
Read&Write (Texthelp) — UK. A K-12 and higher-ed market leader. Line focus, word prediction, vocabulary builder.
Glean (glean.co) — UK. Formerly Sonocent Audio Notetaker. Records lectures or meetings and syncs audio with notes. Solves the problem of learning-disabled students who can't keep up with notetaking during a lecture.
- 2024 rebrand — Sonocent to Glean, with a simplified UI.
- AI summaries (added 2025) — Auto-extracts key topics from recordings.
12. Motor Disabilities — Voice Control · Eye Tracking · Switch
Motor disabilities (spinal cord injury, ALS, cerebral palsy, post-stroke) make keyboard, mouse, and touch input difficult. The 2026 toolkit splits across four input channels.
- Voice control — Apple Voice Control, Android Voice Access, Dragon NaturallySpeaking.
- Eye tracking — Tobii Dynavox, Apple Eye Tracking (iPadOS 18), EyeWriter.
- Face and head tracking — Smyle Mouse, Cephable, Quha Zono (head mouse).
- Switch control — iOS Switch Control, Android Camera Switches.
Apple Voice Control (macOS, iOS) — Free. Voice commands like "Open Safari" and "Click Send" drive the entire UI. Multilingual including English, Mandarin, and Japanese; Korean still beta as of 2026.
Apple Eye Tracking (iPadOS 18+, select iPad Pro) — Front-camera-only eye tracking. Five-second dwell to click, automatic calibration. No extra hardware needed.
Tobii Dynavox (tobiidynavox.com) — Swedish. The standard for ALS and muscular atrophy. PCEye series (dedicated eye-tracking cameras) plus Tobii Dynavox I-Series (AAC-integrated devices). Pricing 4,000-15,000 USD.
Smyle Mouse (smylemouse.com) — US. Uses a standard laptop webcam to convert facial movement into mouse motion. Winks or smiles trigger clicks.
Cephable (cephable.com) — Camera-plus-AI converts head movement, facial expression, and voice into input channels. Integrates with Xbox and PlayStation consoles.
13. AAC — Augmentative and Alternative Communication
AAC (Augmentative and Alternative Communication) is the toolset for people who can't speak or struggle to. Used by autism-spectrum, cerebral-palsy, aphasia, and post-laryngectomy populations.
iPad/iPhone AAC apps.
- Proloquo2Go (AssistiveWare) — Around 250 USD. Picture-plus-text vocabulary board. Over 1,000 core words. English, Dutch, French, Spanish, German. Japanese in beta.
- Proloquo4Text (AssistiveWare) — Around 120 USD. Text-centric. Word prediction and saved phrases.
- TouchChat (Saltillo, PRC-Saltillo) — Around 230 USD. WordPower vocabulary system.
- TouchChat HD with WordPower — Enhanced vocabulary.
- CoughDrop (coughdrop.com) — Combined web/app AAC. Mixed free and paid.
- Snap Core First (Tobii Dynavox) — Dedicated device plus iPad. For teens and adults.
- GoTalk Now (Attainment Company) — iPad. Beginner-friendly.
The core design principle of AAC is the "vocabulary set." A core vocabulary (50-150 frequently used words) plus a fringe vocabulary (specialized words). Category structure, color, and icon choice all directly determine the user's cognitive load.
14. The Big Four Screen Readers
The basic tool for web and app accessibility is the screen reader — software that reads the screen aloud and lets blind users navigate by keyboard or gesture.
- JAWS (Freedom Scientific, jawshq.com) — US. Windows-only. Around 1,295 USD (Pro), with separate school licensing. The enterprise and government standard.
- NVDA (nvaccess.org) — Australian. Open source (GPL). Windows-only. Completely free. Surpassed JAWS in global user share around the mid-2010s (per WebAIM surveys).
- VoiceOver (Apple) — Free, built into macOS, iOS, iPadOS, watchOS, and tvOS.
- TalkBack (Google) — Free, built into Android. Default on Pixel, Samsung, and every other Android shipping.
- ChromeVox (Google) — Built into ChromeOS plus a Chrome extension. Strong with K-12 ChromeBook users.
- Narrator (Microsoft) — Free, built into Windows 11. Smaller share than JAWS/NVDA but meaningful because it's free.
WebAIM's "Screen Reader User Survey #10" (2023) reports desktop share roughly as JAWS 53 %, NVDA 31 %, VoiceOver 9 %, and Narrator 6 %. On mobile, iOS VoiceOver and Android TalkBack split the market.
15. Standards — WCAG 2.2 · EN 301 549 · Section 508
The international standard for web and app accessibility is W3C's WCAG (Web Content Accessibility Guidelines).
- WCAG 2.0 (2008) — The first widely adopted version.
- WCAG 2.1 (2018) — Added mobile, low-vision, and cognitive criteria.
- WCAG 2.2 (October 2023) — Added 9 new success criteria, including focus appearance, dragging movements, consistent help, and accessible authentication.
- WCAG 3.0 (draft) — A major shift to a scoring-based model. Still a Working Draft as of May 2026.
WCAG defines conformance levels A, AA, and AAA. The legal and contractual baseline is typically AA.
Regional standards.
- EN 301 549 (EU) — Public-sector ICT procurement standard. Incorporates WCAG 2.1 AA.
- Section 508 (US federal) — From the 1998 Rehabilitation Act amendments. Aligned with WCAG 2.0 AA.
- ADA Title III (US private sector) — DOJ final rule in April 2024. State and local government digital assets must meet WCAG 2.1 AA, with phased compliance starting April 2026.
- KWCAG 2.2 (Korea) — Korean Web Content Accessibility Guidelines. Revised in 2023. 24 inspection items.
- JIS X 8341-3:2016 (Japan) — Japanese Industrial Standard. Aligned with WCAG 2.0. A new revision is under discussion as of May 2026.
16. Automated Accessibility Testing Tools
CI/CD-compatible automated accessibility testers.
- axe-core (Deque Systems, github.com/dequelabs/axe-core) — The de facto standard. Apache 2.0. Around 90 rules. Strength is near-zero false positives.
- axe DevTools — Browser extension plus mobile plus CI. Deque's commercial offering.
- Lighthouse Accessibility (Google) — Built into Chrome DevTools. Layers extra rules on top of axe-core. Score-based report.
- WAVE (WebAIM, wave.webaim.org) — Visual overlay marking issues directly on the page. Popular as a learning tool.
- Pa11y (pa11y.org) — Open source CLI. Easy to plug into CI. Built on HTML CodeSniffer and Chrome.
- Tenon.io — API-first accessibility testing. SaaS.
- Microsoft Accessibility Insights (accessibilityinsights.io) — Free. Web, Windows, Android. "FastPass" (quick auto-scan) plus "Assessment" (full WCAG AA manual sweep).
- IBM Equal Access (ibm.github.io/able) — Open source. Rule set separate from axe.
- Siteimprove Accessibility — Danish. Enterprise governance tool.
A crucial limit — automated tools catch only about 30-40 % of WCAG violations. Keyboard accessibility, contrast, alt-text quality, and logical heading structure all need human judgment.
17. Accessibility Overlays — The AccessiBe and UserWay Debate
Overlay — A SaaS that drops a single line of JavaScript onto your page and claims that an AI-powered "accessibility widget" will automatically fix alt text, contrast, and keyboard access.
- AccessiBe (accessibe.com) — Israeli. The largest overlay vendor.
- UserWay (userway.org) — Another major overlay.
- EqualWeb, Audioeye — Same space.
The core controversies.
- Disability community criticism — Overlays conflict with real screen readers, the auto-generated alt text is unreliable, and the "disability mode" toggle itself is stigmatizing. The National Federation of the Blind passed a resolution discouraging AccessiBe use in 2021.
- Lawsuit data — Data from 2021-2024 (UsableNet, Seyfarth analyses) suggested overlay-using sites attracted more ADA lawsuits, not fewer.
- FTC settlement — accessiBe settled with the FTC in 2025 for 1 million USD over allegations of deceptive advertising and adjusted some marketing.
WebAIM's "WebAIM Million" 2024 report found that sites using overlays averaged more accessibility errors than sites that did not.
Bottom line — an overlay is not "one line for ADA compliance." Real solutions live in design systems, semantic HTML, and test automation.
18. Apple Accessibility · Google Euphonia · Microsoft AI for Accessibility
The accessibility R&D programs at the big three.
- Apple Accessibility — A dedicated team in Cupertino. Run by long-time leaders such as Sady Paulson and Sarah Herrlinger. iOS 17's Personal Voice (15-minute recording to synthesize your voice), Live Speech, and Point and Speak are flagship outputs.
- Google Project Euphonia (sites.research.google/euphonia) — Non-standard speech recognition R&D. Collected around 1 million utterances from ALS, Down syndrome, and cerebral-palsy users. Project Relate is a direct output.
- Microsoft AI for Accessibility (microsoft.com/en-us/ai/ai-for-accessibility) — A five-year, 25 million USD grant program (2018-2023) with follow-ons. Seeing AI, Hearing AI, Microsoft Accessibility Insights, and Soundscape are downstream.
- Meta Accessibility — Quest 3 and Vision OS accessibility, Facebook Automatic Alt Text (AAT), and Instagram captions.
- Amazon Alexa Accessibility — Alexa Show and Tell (voice shopping for blind users), Tap to Alexa (text input for deaf users), and sponsorship of the Voice First Accessibility conference.
19. Korean Accessibility — KWCAG · Saltlux · Asleep
Korea's accessibility ecosystem leans heavily on government.
- KWCAG 2.2 — Administered by NIA (National Information Society Agency). 24 inspection items. Mandatory for public-sector sites.
- Korean Web Accessibility Mark — Mandatory display on public-sector sites.
- WA (Web Accessibility) certification — Private-sector certification based on KWCAG.
- Anti-Discrimination Against and Remedies for Persons with Disabilities Act — Mandatory accessibility for public-sector and certain-sized private-sector sites.
- Mobile App Accessibility Guidelines — Revised in 2018.
Private-sector tools.
- NHN Toast Accessibility Audit (NHN Cloud) — Cloud-hosted inspection service.
- OpenWAX (Ministry of Science and ICT) — Free inspection tool. Chrome extension.
- K-WAH 4.5 — Tool from the National Information Society Agency.
Industry and research.
- Saltlux — AAC plus AI. Maeumdaero and related solutions for visually impaired and speech-impaired users.
- Asleep — Sleep monitoring via acoustic signal. Compatible with hearing assistive uses.
- Infobank — Voice ARS and accessibility calling.
- LG U+ Sonnu-ri Ring — Video calling plus sign-language interpretation for deaf users.
- Kakao Enterprise Clova Note — Meeting captions. Korea's answer to Otter.
- Naver Clova (Clova Note, Clova X) — Voice/text conversion plus LLM.
20. Japanese Accessibility — JIS X 8341-3 · Sourcenext · NEC
Japan's accessibility ecosystem.
- JIS X 8341-3:2016 — The official Japanese standard. Aligned with WCAG 2.0. 2026 revision under discussion.
- Act for Eliminating Discrimination against Persons with Disabilities (effective 2016, amended April 2024) — Extended the duty of reasonable accommodation to private-sector providers.
- MIC "Guidelines for Public Websites Operation" — Public-sector site guidelines from the Ministry of Internal Affairs and Communications.
- WAIC (Web Accessibility Infrastructure Committee) — Private-sector consortium. Publishes JIS X 8341-3 interpretation and translations.
Tools.
- Sourcenext — Voice recognition software. Strong in the Japanese dictation market.
- NEC PaperLab AAC — AAC device plus paper printing.
- Pen-Talk (Wakasa Inc.) — Pen-form OCR with audio output. Tailored to Japanese EPUB and textbook content.
- UD-Talk — Live meeting caption app for deaf users. Voice recognition plus translation. Used by broadcasters including NHK.
- Koesute — Voice synthesis from Tohoku University with Toast. Voice preservation for ALS patients.
- Stenocaptioner — Specialized captioning professional certification in Japan.
Sign language.
- NHK Sign Language CG — NHK R&D converting news automatically into sign-language CG.
- SureTalk (SoftBank) — Chat and call assist for deaf users.
21. The Evolution of Braille Displays
A refreshable braille display is hardware that lets blind users read text in braille. Pins rise and fall to form braille cells.
- HumanWare Brailliant BI 40X — 40 cells. Around 3,500 USD.
- Freedom Scientific Focus 40 Blue — 40 cells. Around 2,995 USD.
- APH Mantis Q40 — Braille plus QWERTY. Around 2,495 USD.
- Orbit Reader 20·40 — Around 700-1,300 USD. Budget option. Distributed via the US National Library Service (NLS) and partners.
- Dot Pad (Dot Inc., dotincorp.com) — Korean. A braille tablet (20 rows by 32 cells) capable of graphic output, converting visual material into braille graphics. Launched in 2023.
Braille displays remain expensive in 2026. The main driver is precision mechanical parts (miniature solenoids). The Dot Pad's arrival is a turning point toward next-generation graphic-capable devices.
22. Hearing Aids · Cochlear Implants · Auditory Prosthetics
Hardware for hearing assistance.
- Hearing aids — Amplify residual hearing. Prescription vs. OTC (US made OTC hearing aids legal in 2022).
- Cochlear implants — Stimulate the auditory nerve directly. Cochlear, MED-EL, and Advanced Bionics are the three major manufacturers.
- Bone-anchored — Baha, Ponto.
Smartphone integration.
- MFi Hearing Aids (Made for iPhone) — Phonak Audéo, ReSound, Starkey Genesis, and more.
- ASHA (Audio Streaming for Hearing Aids, Android) — Google's standard.
- LE Audio + Auracast — Bluetooth 5.2's new audio-broadcast standard. Hearing aids and headsets adopting through 2024-2026. Airports, lecture halls, and museums building infrastructure.
Cochlear prosthetics plus AI.
- Cochlear Smart Sound IQ 2 — Cochlear-implant processor with AI noise suppression.
- Phonak SmartSpeech Technology — Automatic environment classification for speech emphasis.
- Starkey Genesis AI — Combines motion detection, real-time translation, and heart-rate tracking.
23. AI Hallucination in Accessibility — Real Harm Cases
The biggest risk of AI accessibility tools is hallucination influencing medical or safety decisions.
- Medication label misread — User reports of Be My AI mis-reading dosage on a pill bottle label. One user posted on r/Blind in 2024 that "one tablet daily" came back as "two tablets daily."
- Expiration date error — Cases of Seeing AI misreading expiration-date digits.
- Person identification error — AI recognizing a non-family member as family.
- Traffic light color error — Color identification mode returning "green" for a red light. Very dangerous mid-crosswalk.
- Alt-text hallucination — Auto alt text inventing people or objects not in the photo. Facebook AAT moved to conservative phrasing like "2 people, outdoors" to reduce this.
Mitigations — Be My Eyes recommends verifying via the volunteer video-call channel. Microsoft Seeing AI shows a "verify medical and food-safety decisions with a person" warning for medication and food. WCAG 3.0 draft recommends confidence indicators on automatically generated content.
24. Design Systems and Accessibility — Inclusive Design
The culture on top of the tools. "Design with accessibility from the start" (Inclusive Design) beats retrofit patches by a large margin.
Major design systems and their accessibility.
- Material Design 3 (Google) — Contrast, touch targets, dynamic color.
- Apple Human Interface Guidelines — VoiceOver guidance, Dynamic Type, Reduce Motion.
- Microsoft Fluent 2 — Fluent Accessibility Toolkit. Keyboard, contrast, High Contrast Mode.
- GOV.UK Design System — UK government. A reference accessibility-first system.
- U.S. Web Design System (USWDS) — US federal. Section 508 compliant.
- Carbon Design System (IBM) — Open source. Integrates with axe.
Core principles.
- Semantic HTML first — A button is a button; div with onclick is not a substitute.
- ARIA is a last resort — "No ARIA is better than bad ARIA" (W3C maxim).
- Focus appearance — Every interactive element shows a clearly visible focus state.
- Respect reduced motion — The prefers-reduced-motion media query.
- Color and contrast — Never carry meaning by color alone; contrast at least 4.5:1.
25. Game Accessibility — Microsoft XAG · Naughty Dog
Games were late to accessibility but caught up fast after 2020.
Reference cases.
- The Last of Us Part II (Naughty Dog, 2020) — Over 60 accessibility options. Covers vision, hearing, and motor disabilities almost end-to-end.
- Xbox Adaptive Controller (Microsoft, 2018) — Large buttons and external switch input. Demolished the entry barrier to game controllers for motor-disabled users.
- PlayStation Access Controller (Sony, 2023) — Sony's PS5-side answer to the XAC.
- Forza Motorsport (2023) — Acoustic-cue navigation system (Sound Beacons) for blind users.
- God of War Ragnarök (2022) — Auto-aim and combat-assist modes.
Standards.
- Xbox Accessibility Guidelines (XAG) — Roughly 30 guidelines published by Microsoft. Built with AbleGamers and CanIPlayThat input.
- Game Accessibility Guidelines (gameaccessibilityguidelines.com) — UK non-profit. Basic, Intermediate, and Advanced tiers.
26. VR · AR · Metaverse Accessibility
VR and AR are a new accessibility frontier. Headset weight, field of view, motion sickness, and two-handed controller dependence.
2026 solutions.
- Apple Vision Pro VoiceOver — Voice-guides gaze plus hand gestures.
- Meta Quest field-of-view adjustment and captions — Built-in captioning on Quest 3.
- WebXR Accessibility Working Group — W3C-affiliated. Guideline drafts.
- XR Access (xraccess.org) — Non-profit based at Cornell Tech.
Accessibility in VR is still a standards-frontier area. How to describe a 3D space audibly for blind users, and how to visualize spatial audio for deaf users, are the central open questions.
27. Picking the Right Tool — A 2026 Decision Guide
[Tool Selection by Disability Type — 2026 Model]
[Vision]
Free + immediate use: VoiceOver(iOS), TalkBack(Android), Seeing AI(iOS), Lookout(Android)
Volunteer video call: Be My Eyes(free)
Professional video call: Aira(paid)
Glasses-form hardware: Envision Glasses, OrCam MyEye
Braille display: Brailliant, Focus 40, Orbit Reader, Dot Pad(graphic)
[Hearing]
Free live captions: Apple Live Captions, Android Live Caption, Live Transcribe
Meeting captions: Otter.ai, AVA, Microsoft Teams captions
Phone captions: Innocaption(US), RogerVoice
Hearing aids + phone: MFi Hearing Aids, ASHA, LE Audio Auracast
Video relay(sign): Sorenson, ZVRS, Convo / Korea KCC / Japan Nippon Foundation
[Speech]
Non-standard speech: VoiceItt, Project Relate(Google)
AAC picture vocabulary: Proloquo2Go, TouchChat, CoughDrop
AAC text: Proloquo4Text
AAC device: Tobii Dynavox Snap Core First
[Cognitive/Learning]
Dyslexia TTS: Speechify, NaturalReader
Study notes: Glean, Notion AI, Microsoft OneNote Immersive Reader
K-12/HE bundle: Read&Write(Texthelp)
[Motor]
Voice control: Apple Voice Control, Android Voice Access, Dragon
Eye tracking(budget): Apple Eye Tracking(iPadOS 18)
Eye tracking(pro): Tobii Dynavox PCEye
Face tracking: Smyle Mouse, Cephable
Switch: iOS Switch Control, Xbox Adaptive Controller
28. Self-Checklist — Seven Questions Before Picking a Tool
- Check OS-built-in first — Before buying an app, check whether VoiceOver, TalkBack, or Live Caption already covers it. Free and tightly integrated.
- Privacy — where does the data live — Apps that photograph medical info or medication labels send those images somewhere. Verify HIPAA (US), K-PIPA (Korea), and APPI (Japan) compliance.
- Offline behavior — Does it work without internet. Seeing AI Short Text runs offline.
- Language support — English coverage and Korean/Japanese accuracy are not the same. Test Korean OCR and STT quality with real use.
- Insurance and subsidies — Check public support: Korea's Assistive Device Subsidy, Japan's prosthetics subsidy, and US Medicare / VA programs.
- Maintenance — Will the company survive. Learn from Pear Therapeutics and Mindstrong's wind-downs.
- User community — Search r/Blind, r/deaf, AAC user groups, and similar for real-user feedback.
Epilogue — AI Is an Accessibility Assistant, Not a Replacement
In 2026 the most effective accessibility tools are hybrids of people and AI. Be My Eyes combines AI (fast) with volunteers (accurate). Innocaption combines AI captions with human captioners. AAC vocabulary design is owned by an SLP (speech-language pathologist); AI assists with word prediction.
AI alone carries hallucination, language bias, and the deeper risk that accessibility itself is under-represented in training data. Medication labels, traffic lights, and medical decisions should not be left to AI alone.
Even so, accessibility is overwhelmingly better than five years ago. In 2020 a 1,000 USD screen reader was standard; in 2026 free OS-built-in tools do almost the same job. The daily lives of 1.3 billion people are flattening a little more each year.
What's left is standards, certification, design systems, and an inclusive-design culture. The tools are already abundant. Adoption and enforcement are the next five years' agenda.
References
- Be My Eyes
- Be My AI
- Microsoft Seeing AI
- Google Lookout
- Apple Accessibility
- Envision AI
- OrCam
- Aira
- Glide
- Apple Live Captions
- Android Live Caption
- Otter.ai
- AVA
- Innocaption
- RogerVoice
- VoiceItt
- Google Project Relate
- Google Project Euphonia
- Speechify
- NaturalReader
- Microsoft Immersive Reader
- Texthelp Read&Write
- Glean
- Tobii Dynavox
- Smyle Mouse
- Cephable
- AssistiveWare Proloquo2Go
- PRC-Saltillo TouchChat
- CoughDrop
- NV Access NVDA
- Freedom Scientific JAWS
- WebAIM Screen Reader Survey
- WCAG 2.2
- WCAG 3.0 Working Draft
- EN 301 549
- Section 508
- DOJ ADA Title II Final Rule (2024)
- European Accessibility Act
- KWCAG 2.2 Korean Web Content Accessibility Guidelines
- JIS X 8341-3:2016
- WAIC Japan Web Accessibility Committee
- axe-core
- WAVE WebAIM
- Microsoft Accessibility Insights
- Pa11y
- WebAIM Million 2024
- NFB AccessiBe Resolution
- Microsoft AI for Accessibility
- Xbox Adaptive Controller
- Game Accessibility Guidelines
- Dot Pad
- UD-Talk
- MIC Telephone Relay Service Japan
- WHO Disability Report 2024