PaddleSpeech

by Baidu

Baidu's all-in-one speech toolkit on PaddlePaddle.

TL;DR

Baidu's all-in-one speech toolkit on PaddlePaddle.

Best for mandarin-first ASR + TTS + KWS in one toolkit. Pricing: free.

Category
Open source
License
Stars
Last push
Pricing
free
Platforms
Linux, macOS, Windows

What it is

ASR (Conformer/U2++), TTS (FastSpeech2/HiFiGAN), speaker, KWS, and translation. Apache-2.0.

Best for: Mandarin-first ASR + TTS + KWS in one toolkit.
Watch out for: PaddlePaddle ecosystem narrower than PyTorch globally.

Install / use

pip install paddlespeech

Features

Speaker diarizationYes
Word-level timestampsYes
Streaming / real-timeYes
Languages supported20
HIPAA eligibleNo

PaddleSpeech vs Whipscribe

FeaturePaddleSpeechWhipscribe
CategoryOpen sourceTranscription APIs
Pricingfreefree beta
Speaker diarizationYesYes
Word timestampsYesYes
StreamingYesNo
Languages2099
PlatformsLinux, macOS, WindowsWeb, API, MCP

Alternatives to PaddleSpeech

Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or drop a file in the form below — you'll have a transcript in seconds.