Gladia Realtime
Gladia's real-time streaming ASR API with multilingual code-switching.
Best for multilingual real-time agents and meeting captioning. Pricing: per-hour streaming (see Gladia pricing).
What it is
Gladia's Realtime endpoint provides streaming ASR with multilingual code-switching, speaker diarization, and word-level timestamps, which makes it suitable for voice agents and live captioning. It is distinct from the 'gladia' batch entry already in this directory: pricing and feature coverage differ. Pricing as listed: per-hour streaming (see Gladia pricing). Directory tags: commercial-api, streaming. Last vendor-page check: 2026-05-12.
Watch out for: Distinct from the 'gladia' batch entry already in the directory.
Install / use
wss://api.gladia.io/audio/text/audio-transcription
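A minimal sketch of how a client might drive this endpoint, assuming the protocol is one JSON configuration message followed by base64-encoded audio frames. The field names (`x_gladia_key`, `sample_rate`, `frames`), the chunk size, and the helper names are assumptions for illustration and are not confirmed by this listing; check Gladia's documentation for the actual message schema. The `send` callable is a stand-in for whatever WebSocket client you use.

```python
import base64
import json
from typing import Callable, Iterator

# Endpoint as given in this listing.
ENDPOINT = "wss://api.gladia.io/audio/text/audio-transcription"


def build_config(api_key: str, sample_rate: int = 16000) -> str:
    """Return the first message as JSON. Field names are assumed, not verified."""
    return json.dumps({"x_gladia_key": api_key, "sample_rate": sample_rate})


def iter_frames(pcm: bytes, chunk_bytes: int = 3200) -> Iterator[str]:
    """Yield one JSON message per audio chunk, base64-encoding the raw bytes."""
    for start in range(0, len(pcm), chunk_bytes):
        chunk = pcm[start:start + chunk_bytes]
        # "frames" is an assumed key name for the audio payload.
        yield json.dumps({"frames": base64.b64encode(chunk).decode("ascii")})


def stream_audio(send: Callable[[str], None], api_key: str, pcm: bytes) -> int:
    """Send the config message, then every audio frame; return frames sent."""
    send(build_config(api_key))
    count = 0
    for message in iter_frames(pcm):
        send(message)
        count += 1
    return count
```

With a real WebSocket connection opened against `ENDPOINT`, `send` would be that connection's send method. Keeping the transport as a parameter lets the message framing be checked offline, for example by passing `list.append` and inspecting the collected messages.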
Features
| Feature | Gladia Realtime |
|---|---|
| Speaker diarization | Yes |
| Word-level timestamps | Yes |
| Streaming / real-time | Yes |
| Languages supported | Not listed |
| HIPAA eligible | No |
Gladia Realtime vs Whipscribe
| Feature | Gladia Realtime | Whipscribe |
|---|---|---|
| Category | Transcription APIs | Transcription APIs |
| Pricing | per-hour streaming (see Gladia pricing) | free beta |
| Speaker diarization | Yes | Yes |
| Word timestamps | Yes | Yes |
| Streaming | Yes | No |
| Languages | — | 99 |
| Platforms | API | Web, API, MCP |
Alternatives to Gladia Realtime
Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or drop a file in the form below — you'll have a transcript in seconds.