SUPERB
by Academic consortium (NTU + CMU + JHU + Meta)
Speech processing Universal PERformance Benchmark — 10 English speech tasks.
TL;DR
Speech processing Universal PERformance Benchmark — 10 English speech tasks.
Best for probing English SSL speech models across ASR, KWS, SID, ER, SD, QbE, IC, SF, PR, TTS-related tasks. Pricing: free.
Category
Open source
License
—
Stars
—
Last push
—
Pricing
free
Platforms
HuggingFace, GitHub
What it is
SUPERB defines 10 frozen-feature-extraction tasks on English speech corpora to probe self-supervised models. The standard SSL leaderboard before ML-SUPERB. Toolkit: Apache-2.0; component data licenses vary.
Best for: Probing English SSL speech models across ASR, KWS, SID, ER, SD, QbE, IC, SF, PR, TTS-related tasks.
Watch out for: Apache-2.0 toolkit · uses LibriSpeech/VoxCeleb/IEMOCAP/etc. (component licenses vary, IEMOCAP is research-only) · English-only. Cite: Yang et al., Interspeech 2021.
Watch out for: Apache-2.0 toolkit · uses LibriSpeech/VoxCeleb/IEMOCAP/etc. (component licenses vary, IEMOCAP is research-only) · English-only. Cite: Yang et al., Interspeech 2021.
Install / use
git clone https://github.com/s3prl/s3prl
Features
| Speaker diarization | Yes |
| Word-level timestamps | No |
| Streaming / real-time | No |
| Languages supported | 1 |
| HIPAA eligible | No |
SUPERB vs Whipscribe
| Feature | SUPERB | Whipscribe |
|---|---|---|
| Category | Open source | Transcription APIs |
| Pricing | free | free beta |
| Speaker diarization | Yes | Yes |
| Word timestamps | No | Yes |
| Streaming | No | No |
| Languages | 1 | 99 |
| Platforms | HuggingFace, GitHub | Web, API, MCP |
Alternatives to SUPERB
Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or drop a file in the form below — you'll have a transcript in seconds.