Fish Speech

by fishaudio

Open zero-shot voice cloning + TTS.

TL;DR

Open zero-shot voice cloning + TTS.

Best for multilingual voice cloning paired with Whisper transcription. Pricing: free.

Category
Open source
License
Stars
Last push
Pricing
free
Platforms
Linux, macOS, Windows

What it is

Fish Audio's open TTS / voice-cloning stack. CC-BY-NC-SA-4.0.

Best for: Multilingual voice cloning paired with Whisper transcription.
Watch out for: License is BY-CC-NC-SA-4.0 — non-commercial.

Install / use

pip install fish-speech

Features

Speaker diarizationNo
Word-level timestampsYes
Streaming / real-timeNo
Languages supported8
HIPAA eligibleNo

Fish Speech vs Whipscribe

FeatureFish SpeechWhipscribe
CategoryOpen sourceTranscription APIs
Pricingfreefree beta
Speaker diarizationNoYes
Word timestampsYesYes
StreamingNoNo
Languages899
PlatformsLinux, macOS, WindowsWeb, API, MCP

Alternatives to Fish Speech

Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or drop a file in the form below — you'll have a transcript in seconds.