distil-whisper
by Hugging Face
Distilled Whisper: 6× faster, 49% smaller, within 1% WER of the teacher.
TL;DR
Best for English-only workloads where latency and cost matter more than multilingual coverage. Pricing: free.
| Field | Value |
|---|---|
| Category | Open source |
| License | MIT |
| Stars | ★ 4.1k |
| Last push | 2025-01-08 |
| Pricing | free |
| Platforms | Linux, macOS, GPU, CPU |
What it is
A Hugging Face distillation of Whisper that keeps most of the accuracy while cutting inference cost. Best when you know your audio is English and you want to serve many concurrent requests on modest hardware. MIT-licensed.
Best for: English-only workloads where latency and cost matter more than multilingual coverage.
Watch out for: English-only in v3; slightly higher WER (that is, slightly lower accuracy) than Whisper-large-v3 on long-form audio.
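Word error rate (WER), the metric behind the accuracy figures above, is the number of word substitutions, deletions, and insertions needed to turn the transcript into the reference, divided by the reference length. Lower is better. The sketch below is a minimal illustration of that definition, not the evaluation code used for the published numbers; the function name `word_error_rate` and the lowercase-and-split normalization are assumptions made for this example.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    # Assumed normalization: lowercase and split on whitespace only.
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(
                min(
                    prev[j] + 1,         # reference word missing from hypothesis
                    curr[j - 1] + 1,     # extra word in hypothesis
                    prev[j - 1] + cost,  # match or substitution
                )
            )
        prev = curr
    return prev[-1] / len(ref)
```

Under this definition, a "within 1% WER" claim is usually read as an absolute gap in percentage points: for example, a teacher at 10% WER and a distilled model at no more than 11%.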
Install / use
pip install transformers # then load distil-whisper/distil-large-v3
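As a minimal sketch of the "then load" step, the snippet below wraps the `transformers` automatic-speech-recognition pipeline around the `distil-whisper/distil-large-v3` checkpoint. It assumes PyTorch and ffmpeg are installed alongside `transformers`. The helper names `pipeline_kwargs` and `transcribe` and the 25-second chunk length are illustrative choices, not something this listing specifies.

```python
MODEL_ID = "distil-whisper/distil-large-v3"


def pipeline_kwargs(use_gpu: bool, chunk_length_s: int = 25) -> dict:
    """Build keyword arguments for transformers.pipeline (hypothetical helper)."""
    return {
        "task": "automatic-speech-recognition",
        "model": MODEL_ID,
        # Long audio is processed in chunks; 25 s is an assumed setting.
        "chunk_length_s": chunk_length_s,
        # device=0 selects the first CUDA GPU, device=-1 runs on the CPU.
        "device": 0 if use_gpu else -1,
    }


def transcribe(audio_path: str) -> dict:
    """Transcribe one audio file; returns a dict with the text and timestamps."""
    # Imported lazily so the heavy dependencies load only when transcribing.
    import torch
    from transformers import pipeline

    asr = pipeline(**pipeline_kwargs(use_gpu=torch.cuda.is_available()))
    # return_timestamps=True requests segment-level timestamps.
    return asr(audio_path, return_timestamps=True)
```

Calling `transcribe("meeting.wav")` (a placeholder file name) downloads the model weights on first use and returns a dictionary whose `"text"` entry holds the transcript.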
Features
| Feature | distil-whisper |
|---|---|
| Speaker diarization | No |
| Word-level timestamps | Yes |
| Streaming / real-time | No |
| Languages supported | 1 (English) |
| HIPAA eligible | No |
distil-whisper vs Whipscribe
| Feature | distil-whisper | Whipscribe |
|---|---|---|
| Category | Open source | Transcription APIs |
| Pricing | free | free beta |
| Speaker diarization | No | Yes |
| Word timestamps | Yes | Yes |
| Streaming | No | No |
| Languages | 1 | 99 |
| Platforms | Linux, macOS, GPU, CPU | Web, API, MCP |
How Whipscribe handles your audio
Upload
AI Transcripts
Edit
Export
Alternatives to distil-whisper
Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, we're one click away.