Apple Vision Pro Live Captions

by Apple

Real-time captions in visionOS that overlay speech as floating text in your field of view.

TL;DR

Best for deaf and hard-of-hearing Vision Pro users in supported English / Spanish / French / German / Korean / Mandarin locales. Pricing: Free with Vision Pro ($3,499 headset).

Category: Desktop apps
Pricing: Free with Vision Pro ($3,499 headset)
Platforms: visionOS, Hardware

What it is

Live Captions on visionOS uses on-device speech recognition to surface captions over any audio source: phone calls, FaceTime, ambient room conversation, or app audio. It belongs to the same recognizer family as Live Captions on macOS and iOS.

Best for: Deaf and hard-of-hearing Vision Pro users in supported English / Spanish / French / German / Korean / Mandarin locales.
Watch out for: On-device but locale-restricted; FaceTime captioning still tagged "beta"; not a transcription product — captions are ephemeral, not saved.

Install / use

On visionOS, go to Settings > Accessibility > Live Captions.

Features

Speaker diarization: No
Word-level timestamps: No
Streaming / real-time: Yes
Languages supported: 6
HIPAA eligible: No

Apple Vision Pro Live Captions vs Whipscribe

Feature | Apple Vision Pro Live Captions | Whipscribe
Category | Desktop apps | Transcription APIs
Pricing | Free with Vision Pro ($3,499 headset) | Free beta
Speaker diarization | No | Yes
Word timestamps | No | Yes
Streaming | Yes | No
Languages | 6 | 99
Platforms | visionOS, Hardware | Web, API, MCP

Alternatives to Apple Vision Pro Live Captions

Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or upload a file and you'll have a transcript in seconds.