Vonage Voice API (ASR Connector)

by Vonage

Vonage's CPaaS speech-to-text via the ASR connector (typically Deepgram-powered).

TL;DR

Best for CPaaS / IVR builders on Vonage Voice who need transcripts in their call flow. Pricing: Vonage Voice price + ASR per-minute.

Category: Transcription APIs
Pricing: Vonage Voice price + ASR per-minute
Platforms: API

What it is

Vonage's Voice API supports inline speech recognition through its NCCO ASR action, and it can also transcribe call recordings. The recognition itself is delivered through partner engines (historically Deepgram), so the vendor under the hood may change. It is best suited to IVR, voice-bot and call-routing flows where you already use Vonage for telephony. Pricing combines the Voice per-minute charge with a per-minute ASR charge on top.

Feature flags from vendor docs: word-level timestamps, streaming.
Directory tags: commercial-api, telephony.
Last vendor-page check: 2026-05-12.

Best for: CPaaS / IVR builders on Vonage Voice who need transcripts in their call flow.
Watch out for: ASR pricing is on top of Voice minute charges; vendor under the hood may change.
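
Because the ASR charge is added to the Voice charge, a rough cost estimate has two terms. The sketch below only illustrates that arithmetic. The function name and the rates are hypothetical placeholders, not Vonage's published prices, and it assumes ASR is billed only for the minutes in which recognition is active, which you should confirm against your own rate card.

```python
from decimal import Decimal


def estimate_call_cost(voice_minutes, asr_minutes, voice_rate, asr_rate):
    """Estimate total cost as Voice minutes plus ASR minutes.

    All arguments are passed through str() so that float inputs do not
    introduce binary rounding noise into the Decimal arithmetic.
    """
    voice_cost = Decimal(str(voice_minutes)) * Decimal(str(voice_rate))
    asr_cost = Decimal(str(asr_minutes)) * Decimal(str(asr_rate))
    return voice_cost + asr_cost


# Placeholder rates for illustration only; substitute your contract rates.
total = estimate_call_cost(
    voice_minutes=1000,   # total connected call time
    asr_minutes=400,      # portion of that time with ASR active (assumed)
    voice_rate="0.010",   # hypothetical price per Voice minute
    asr_rate="0.020",     # hypothetical price per ASR minute
)
print(f"Estimated total: {total}")
```

Keeping the two minute counts separate makes it easy to see how much of the bill comes from recognition rather than telephony.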

Install / use

Vonage Voice API: NCCO ASR action
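
As a minimal sketch of what such a call flow might look like, the Python below builds an NCCO that prompts the caller and then collects speech, and picks the highest-confidence transcript out of the resulting webhook payload. It assumes speech recognition is requested through the NCCO `input` action with `type` set to `["speech"]`, and that the webhook body carries a `speech.results` list of `text`/`confidence` entries. The helper names, the example URL and the prompt text are hypothetical, so check the field names against the current Vonage NCCO reference before relying on them.

```python
import json


def build_asr_ncco(event_url, language="en-US",
                   prompt="Please say what you are calling about."):
    """Return an NCCO (a list of actions) that prompts, then listens."""
    return [
        # Speak a prompt; bargeIn lets the caller start talking over it.
        {"action": "talk", "text": prompt, "bargeIn": True},
        # Collect speech and post the recognition result to event_url.
        {
            "action": "input",
            "type": ["speech"],
            "eventUrl": [event_url],
            "speech": {"language": language, "endOnSilence": 1.5},
        },
    ]


def best_transcript(payload):
    """Return the highest-confidence transcript text, or None if absent."""
    results = (payload.get("speech") or {}).get("results") or []
    if not results:
        return None
    # Confidence may arrive as a string, so normalise it with float().
    best = max(results, key=lambda r: float(r.get("confidence", 0)))
    return best.get("text")


# Hypothetical webhook URL; your answer webhook would return this JSON.
ncco = build_asr_ncco("https://example.com/webhooks/asr")
print(json.dumps(ncco, indent=2))
```

In a real deployment the JSON would be served from your answer webhook, and `best_transcript` would run inside the handler for the event URL to decide how to route the call.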

Features

Speaker diarization: No
Word-level timestamps: Yes
Streaming / real-time: Yes
Languages supported: Not listed
HIPAA eligible: No

Vonage Voice API (ASR Connector) vs Whipscribe

Feature | Vonage Voice API (ASR Connector) | Whipscribe
Category | Transcription APIs | Transcription APIs
Pricing | Vonage Voice price + ASR per-minute | Free beta
Speaker diarization | No | Yes
Word timestamps | Yes | Yes
Streaming | Yes | No
Languages | Not listed | 99
Platforms | API | Web, API, MCP

Alternatives to Vonage Voice API (ASR Connector)

Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running your own infrastructure, you can submit a URL or upload a file and get a transcript back in seconds.