AssemblyAI Realtime / Streaming

by AssemblyAI

AssemblyAI's WebSocket streaming endpoint for live captions and agents.

TL;DR

AssemblyAI's WebSocket streaming endpoint for live captions and agents.

Best for live captioning, real-time voice agents, contact-center streaming use cases. Pricing: from $0.15/hr (Streaming).

Category
Transcription APIs
License
Stars
Last push
Pricing
from $0.15/hr (Streaming)
Platforms
API

What it is

AssemblyAI Streaming (v3 WebSocket) is the real-time variant of AssemblyAI's ASR, returning partial and final transcripts as audio is streamed in. Used for live captions and voice agents. Distinct from the 'assemblyai' batch entry already in this directory; pricing and features differ per their public pricing page. Best fit: live captioning, real-time voice agents, contact-center streaming use cases. Caveats: distinct from the 'assemblyai' batch entry already in the directory. Pricing as listed: from $0.15/hr (Streaming). Feature flags from vendor docs: word-level timestamps, streaming, HIPAA-eligible under BAA. Directory tags: commercial-api, streaming. Last vendor-page check: 2026-05-12.

Best for: Live captioning, real-time voice agents, contact-center streaming use cases.
Watch out for: Distinct from the 'assemblyai' batch entry already in the directory.

Install / use

wss://streaming.assemblyai.com/v3/ws

Features

Speaker diarizationNo
Word-level timestampsYes
Streaming / real-timeYes
Languages supportedNone
HIPAA eligibleYes

AssemblyAI Realtime / Streaming vs Whipscribe

FeatureAssemblyAI Realtime / StreamingWhipscribe
CategoryTranscription APIsTranscription APIs
Pricingfrom $0.15/hr (Streaming)free beta
Speaker diarizationYes
Word timestampsYesYes
StreamingYesNo
Languages99
PlatformsAPIWeb, API, MCP

Alternatives to AssemblyAI Realtime / Streaming

Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or drop a file in the form below — you'll have a transcript in seconds.