pyannote.audio

by pyannote / Hervé Bredin

The reference open diarization + speaker embedding toolkit.

TL;DR

The reference open diarization + speaker embedding toolkit.

Best for production diarization + voice-activity-detection pipelines. Pricing: free.

Category
Open source
License
Stars
Last push
Pricing
free
Platforms
Linux, macOS, Windows

What it is

The standard diarization toolkit, used by many open Whisper pipelines. MIT.

Best for: Production diarization + voice-activity-detection pipelines.
Watch out for: Pretrained weights gated behind HF user license (free for research/commercial with acceptance).

Install / use

pip install pyannote.audio

Features

Speaker diarizationYes
Word-level timestampsYes
Streaming / real-timeNo
Languages supported1
HIPAA eligibleNo

pyannote.audio vs Whipscribe

Featurepyannote.audioWhipscribe
CategoryOpen sourceTranscription APIs
Pricingfreefree beta
Speaker diarizationYesYes
Word timestampsYesYes
StreamingNoNo
Languages199
PlatformsLinux, macOS, WindowsWeb, API, MCP

Alternatives to pyannote.audio

Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or drop a file in the form below — you'll have a transcript in seconds.