Tech & Gear

Apple’s New Accessibility Tools Bring AI Captions, Smarter VoiceOver, And Vision Pro Wheelchair Support

By Aimirul|
Share

Apple is giving its accessibility features a bigger AI upgrade this year, and honestly, this is the kind of tech update that matters beyond the usual spec-sheet flex.

According to Apple’s latest accessibility preview, new tools are coming later this year across iPhone, iPad, Mac, Apple TV, and Apple Vision Pro. The headline feature: AI-generated captions for videos that do not already have subtitles, powered by on-device speech recognition.

For Malaysian and SEA users, this could be very useful. Think about random gameplay clips, livestream VODs, esports interviews, anime-related videos, or creator content where captions are missing or poorly done. If Apple’s system works smoothly, it could make a lot more content easier to follow — especially for deaf and hard-of-hearing users, people watching without sound in public, or anyone dealing with noisy mamak-level background chaos.

The important bit is that Apple says this uses on-device processing. That matters because accessibility data can be sensitive. If your phone or Mac can process speech locally instead of constantly sending everything to the cloud, that is a better privacy direction, especially for users who depend on these features daily.

Apple is also updating VoiceOver with richer image descriptions through Image Explorer. This should help users understand what is inside an image with more detail, though Apple is also warning that these AI descriptions should not be treated as reliable in risky or dangerous situations. Good call, because AI image understanding can still confidently get things wrong.

Another neat addition: users will be able to ask questions about what the camera sees by pressing the Action button. That sounds especially practical for everyday tasks — identifying objects, checking signs, or getting context from the environment without needing a separate app flow.

Voice Control is getting natural language navigation too, which should make controlling devices by voice feel less robotic. Instead of memorising exact commands, users may be able to speak more naturally and still get around the system. Apple’s Accessibility Reader is also being improved to summarise more complex material, which could help with long articles, dense documents, or study content.

Vision Pro is getting some of the more specific updates. Apple’s headset will be able to connect its eye-tracking system to powered wheelchair drive systems, including Tolt and LUCI in the US. Connection can work via Bluetooth or through a wired setup using the Developer Strap. Vision Pro is still a very expensive device at US$3,499 — roughly around RM16,000 before local taxes, shipping, or reseller markups — so this is not exactly mass-market in Malaysia. But the direction is interesting: eye tracking is moving from “cool futuristic interface” into actual mobility and independence use cases.

Apple is also adding Vehicle Motion Cues to Vision Pro, aimed at reducing motion sickness when the headset is used inside a moving vehicle. Whether you should be wearing a Vision Pro in a car is another question, bro, but motion comfort is definitely something mixed-reality hardware needs to solve.

There are smaller accessibility updates too. Apple mentioned wider availability for the MagSafe-compatible Hikawa Grip & Stand through the Apple Store, Larger Text support on tvOS, and a new API that can bring sign language interpreters into FaceTime calls.

For SEA users, the big takeaway is not just “Apple added more AI.” It is that accessibility features are becoming more deeply built into the devices people already use. If this works well, it benefits not only disabled users, but also students, gamers, creators, parents, multilingual households, and anyone who needs tech to be more flexible.

Now the real test: how well these tools handle accents, noisy audio, mixed-language speech, and real-world Malaysian usage. Because if the AI captions can survive Manglish, tournament comms, and someone shouting “push mid lah” over bad mic audio, then we are cooking.

Source: The Verge

Tags

AppleAccessibilityApple IntelligenceVision Pro