Bark (Suno)

by Suno

Open-source generative audio model from Suno — speech, music, and sound effects.

TL;DR

Best for researchers exploring expressive non-speech sounds (laughs, sighs, music) alongside text-to-speech. Pricing: free (MIT).

Category: Open source
License: MIT
Pricing: free (MIT)
Platforms: Linux, macOS, Windows

What it is

Bark is a transformer-based text-to-audio model from Suno that generates expressive speech with nonverbal cues, music, and ambient sound. The public release ships with curated speaker presets only: Suno disabled voice cloning for safety reasons, so the public model exposes no cloning interface. It is MIT-licensed.

Best for: Researchers exploring expressive non-speech sounds (laughs, sighs, music) alongside text-to-speech.
Watch out for: Voice cloning was intentionally disabled in the public release; only Suno-provided history prompts are supported.
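
The Suno-provided history prompts are selected by name. The following is a minimal sketch of a helper that builds and validates such a name, assuming the `v2/<lang>_speaker_<n>` naming convention from Bark's published speaker library. The language codes and the 0 to 9 index range are taken from that library rather than from this page, and `preset_name` is a hypothetical helper, not part of Bark.

```python
# Language codes assumed from Bark's published speaker library (13 languages).
SUPPORTED_LANGS = (
    "de", "en", "es", "fr", "hi", "it", "ja",
    "ko", "pl", "pt", "ru", "tr", "zh",
)


def preset_name(lang: str, index: int) -> str:
    """Build a history-prompt name such as 'v2/en_speaker_6'.

    Hypothetical helper: it only formats and validates the string and
    does not check that the preset actually exists in the installed model.
    """
    if lang not in SUPPORTED_LANGS:
        raise ValueError(f"unsupported language code: {lang!r}")
    if not 0 <= index <= 9:
        raise ValueError("speaker index must be between 0 and 9")
    return f"v2/{lang}_speaker_{index}"
```

Validating the name up front means a typo surfaces as a clear error before any model weights are loaded.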

Install / use
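
A minimal usage sketch, assuming the Python API described in the Bark repository README (`preload_models`, `generate_audio`, `SAMPLE_RATE`) and SciPy for writing the WAV file. The helpers `with_cue`, `as_song`, and `synthesize` are hypothetical names introduced here for illustration. The bracketed cue tags and the ♪ markers follow the README's prompt conventions, which the README itself describes as non-exhaustive. The first call is likely to download model weights, so expect it to be slow.

```python
# Install (per the Bark repository README; verify against the current README):
#   pip install git+https://github.com/suno-ai/bark.git


def with_cue(text: str, cue: str) -> str:
    """Prefix text with a nonverbal cue tag, e.g. '[laughs] Hello'."""
    return f"[{cue}] {text}"


def as_song(lyrics: str) -> str:
    """Wrap lyrics in music-note markers to nudge the model toward singing."""
    return f"♪ {lyrics} ♪"


def synthesize(prompt: str,
               out_path: str = "bark_generation.wav",
               preset: str = "v2/en_speaker_6") -> str:
    """Generate audio for `prompt` with a Suno-provided preset and save it.

    Imports are done lazily so the prompt helpers above stay usable
    on machines where Bark is not installed.
    """
    from bark import SAMPLE_RATE, generate_audio, preload_models
    from scipy.io.wavfile import write as write_wav

    preload_models()  # loads (and, if needed, downloads) the model weights
    audio = generate_audio(prompt, history_prompt=preset)
    write_wav(out_path, SAMPLE_RATE, audio)
    return out_path


if __name__ == "__main__":
    synthesize(with_cue("Well, that went better than expected.", "laughs"))
```

Keeping prompt construction separate from synthesis makes it easy to inspect the exact text sent to the model, which matters because cue tags influence the output only probabilistically.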

Features

Speaker diarization: No
Word-level timestamps: No
Streaming / real-time: No
Languages supported: 13
HIPAA eligible: No

Bark (Suno) vs Whipscribe

Feature | Bark (Suno) | Whipscribe
Category | Open source | Transcription APIs
Pricing | free (MIT) | free beta
Speaker diarization | No | Yes
Word timestamps | No | Yes
Streaming | No | No
Languages | 13 | 99
Platforms | Linux, macOS, Windows | Web, API, MCP

Alternatives to Bark (Suno)

Whipscribe is a managed faster-whisper + whisperX service. It is aimed at users who want transcripts from a URL or an uploaded file without running their own infrastructure.