Looking at ML-SUPERB? Try this first.

Drop your audio. Transcript in seconds. 30 free min, then $2 = 200 min

ML-SUPERB

Name: ML-SUPERB
Author: Academic consortium (CMU + NTU + JHU + others)

by Academic consortium (CMU + NTU + JHU + others)

Multilingual SUPERB — 143 languages × multiple tasks for self-supervised speech models.

TL;DR

Multilingual SUPERB — 143 languages × multiple tasks for self-supervised speech models.

Best for probing multilingual self-supervised speech models (XLS-R, MMS, mHuBERT) across LID + ASR. Pricing: free.

What it is

ML-SUPERB is the multilingual extension of SUPERB — 143 languages across language identification and ASR tasks, designed for probing SSL speech encoders. Toolkit: Apache-2.0; component data licenses vary.

Best for: Probing multilingual self-supervised speech models (XLS-R, MMS, mHuBERT) across LID + ASR.
Watch out for: Apache-2.0 toolkit · underlying data licenses vary per corpus (Common Voice, MLS, Babel, etc.) · academic-only Babel subset. Cite: Shi et al., Interspeech 2023.

Install / use

git clone https://github.com/s3prl/s3prl  # ML-SUPERB benchmark suite

Features

Speaker diarization	No
Word-level timestamps	No
Streaming / real-time	No
Languages supported	143
HIPAA eligible	No

ML-SUPERB vs Whipscribe

Feature	ML-SUPERB	Whipscribe
Category	Open source	Transcription APIs
Pricing	free	free beta
Speaker diarization	No	Yes
Word timestamps	No	Yes
Streaming	No	No
Languages	143	99
Platforms	GitHub, HuggingFace	Web, API, MCP

Alternatives to ML-SUPERB

OpenAI Whisper

OpenAI

The reference open-source multilingual ASR model from OpenAI.

OSS · MIT ★ 98.1k

whisper.cpp

Georgi Gerganov

C/C++ port of Whisper — runs on anything, from a Raspberry Pi to Apple Silicon.

OSS · MIT ★ 48.8k

faster-whisper

SYSTRAN

4× faster than reference Whisper using CTranslate2 — production sweet spot.

OSS · MIT ★ 22.3k

Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or drop a file in the form below — you'll have a transcript in seconds.