Alibaba DAMO ASR

by Alibaba Cloud

Alibaba Cloud / DAMO Academy speech recognition with Paraformer non-autoregressive models.

TL;DR

Alibaba Cloud / DAMO Academy speech recognition with Paraformer non-autoregressive models.

Best for mandarin transcription with the option to self-host DAMO's open Paraformer checkpoints separately. Pricing: tiered · per-hour pricing in CNY/USD.

Category
Transcription APIs
License
Stars
Last push
Pricing
tiered · per-hour pricing in CNY/USD
Platforms
Web, Linux, Android, iOS

What it is

DAMO Academy is Alibaba's research arm; its Paraformer family of non-autoregressive Mandarin ASR models powers Alibaba Cloud's Intelligent Speech Interaction product. DAMO also publishes open-source variants via ModelScope, so teams can prototype on the same architecture they pay for in production. Strongest fit for e-commerce voice search, call-center analytics, and any workload already routed through Alibaba Cloud. Best fit when the buyer is mandarin transcription with the option to self-host damo's open paraformer checkpoints separately. The honest caveat: cross-region availability varies; english support is secondary to mandarin and cantonese. Developer-grade API with self-serve signup; validate accuracy on representative audio for the target dialect before committing volume.

Best for: Mandarin transcription with the option to self-host DAMO's open Paraformer checkpoints separately.
Watch out for: Cross-region availability varies; English support is secondary to Mandarin and Cantonese.

Install / use

alibabacloud.com/product/intelligent-speech-interaction

Features

Speaker diarizationNo
Word-level timestampsYes
Streaming / real-timeYes
Languages supportedNone
HIPAA eligibleNo

Alibaba DAMO ASR vs Whipscribe

FeatureAlibaba DAMO ASRWhipscribe
CategoryTranscription APIsTranscription APIs
Pricingtiered · per-hour pricing in CNY/USDfree beta
Speaker diarizationYes
Word timestampsYesYes
StreamingYesNo
Languages99
PlatformsWeb, Linux, Android, iOSWeb, API, MCP

Alternatives to Alibaba DAMO ASR

Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or drop a file in the form below — you'll have a transcript in seconds.