SpeechBrain

by SpeechBrain consortium

PyTorch toolkit for ASR, speaker recognition, diarization, and speech enhancement.

TL;DR

Best for researchers building end-to-end speech pipelines with PyTorch idioms. Pricing: free.

Category: Open source
License: Apache-2.0
Pricing: free
Platforms: Linux, macOS, Windows

What it is

An all-in-one PyTorch speech toolkit with recipes for ASR, speaker ID, enhancement, and diarization. Licensed under Apache-2.0.

Best for: Researchers building end-to-end speech pipelines with PyTorch idioms.
Watch out for: Inference speed lags behind CTranslate2-based stacks such as faster-whisper.
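
To check the inference-speed caveat on your own hardware, one common measure is the real-time factor (RTF): processing time divided by audio duration, where lower is faster. The sketch below is a generic timing helper, not part of SpeechBrain; `real_time_factor`, `transcribe_fn`, and `audio_seconds` are hypothetical names, and you supply the audio length yourself.

```python
import time


def real_time_factor(transcribe_fn, audio_path, audio_seconds):
    """Return (rtf, result) for one call of transcribe_fn(audio_path).

    rtf is wall-clock processing time divided by the audio length in
    seconds; values below 1.0 mean faster than real time.
    """
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    start = time.perf_counter()
    result = transcribe_fn(audio_path)
    elapsed = time.perf_counter() - start
    return elapsed / audio_seconds, result
```

Pass the same clip to each stack's transcription callable and compare the RTF values. A single run includes model warm-up, so repeating the call a few times and discarding the first result gives a fairer comparison.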

Install / use

pip install speechbrain
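
After installing, a minimal transcription sketch might look like the following. It assumes SpeechBrain 1.0 or later (where inference classes live under `speechbrain.inference`) and uses a public pretrained English model from the Hugging Face Hub as an example. The `transcribe` wrapper, the `savedir` path, and the audio file name are illustrative choices, not something this page prescribes.

```python
# Assumption: a public pretrained English ASR model hosted on the
# Hugging Face Hub; swap in any other SpeechBrain ASR model identifier.
DEFAULT_MODEL = "speechbrain/asr-crdnn-rnnlm-librispeech"


def transcribe(path, source=DEFAULT_MODEL, savedir="pretrained_models/asr"):
    """Download (on first use) a pretrained model and transcribe one file."""
    # Imported inside the function so this module can be loaded even on
    # machines where speechbrain is not installed yet.
    # Older releases exposed the same class under speechbrain.pretrained.
    from speechbrain.inference.ASR import EncoderDecoderASR

    model = EncoderDecoderASR.from_hparams(source=source, savedir=savedir)
    return model.transcribe_file(path)


# Example call ("example.wav" is a placeholder path):
# print(transcribe("example.wav"))
```

The first call downloads the model weights into `savedir`, so it needs network access. Later calls reuse the cached files.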

Features

Speaker diarization: Yes
Word-level timestamps: Yes
Streaming / real-time: No
Languages supported: 50
HIPAA eligible: No

SpeechBrain vs Whipscribe

Feature | SpeechBrain | Whipscribe
Category | Open source | Transcription APIs
Pricing | free | free beta
Speaker diarization | Yes | Yes
Word timestamps | Yes | Yes
Streaming | No | No
Languages | 50 | 99
Platforms | Linux, macOS, Windows | Web, API, MCP

Alternatives to SpeechBrain

Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, you can submit a URL or upload a file and get a transcript back in seconds.