Yitu Speech
Yitu Tech speech recognition — Mandarin, dialects, and far-field microphone arrays.
Best for on-premise Mandarin transcription for finance, telecom, and government customers in mainland China. Pricing: enterprise · contact sales.
What it is
Yitu Tech is a Chinese AI vendor better known for computer vision, but it maintains a speech division that competes with iFLYTEK and Tencent for on-premise enterprise deals. The product targets banks, telcos, and public-sector buyers who require speech recognition to run inside their own data centre with no outbound traffic. It fits best when the buyer needs on-premise Mandarin transcription for finance, telecom, or government customers in mainland China. The honest caveat: the sales process is in Chinese, and there is no self-serve developer console. Procurement is via direct sales rather than a developer signup, so plan for a discovery call and a written proposal before the first transcript runs.
Watch out for: Sales process is in Chinese; no self-serve developer console.
Install / use
yitutech.com — contact sales for licensing
Features
| Feature | Yitu Speech |
|---|---|
| Speaker diarization | No |
| Word-level timestamps | No |
| Streaming / real-time | No |
| Languages supported | Mandarin and dialects |
| HIPAA eligible | No |
Yitu Speech vs Whipscribe
| Feature | Yitu Speech | Whipscribe |
|---|---|---|
| Category | Products | Transcription APIs |
| Pricing | enterprise · contact sales | free beta |
| Speaker diarization | — | Yes |
| Word timestamps | — | Yes |
| Streaming | — | No |
| Languages | — | 99 |
| Platforms | On-premise, Web | Web, API, MCP |
Alternatives to Yitu Speech
Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or drop a file in the form below — you'll have a transcript in seconds.