Pipecat

by Daily.co

Open-source framework for voice and multimodal conversational AI agents.

TL;DR

Open-source framework for voice and multimodal conversational AI agents.

Best for developers building self-hosted voice agents with explicit control over the stack. Pricing: free.

Category
Open source
License
Stars
Last push
Pricing
free
Platforms
Linux, macOS, Windows, Cloud

What it is

Pipecat is an open-source Python framework from Daily.co for building realtime voice and multimodal AI agents. It composes pipeline stages — VAD, ASR, LLM, TTS, function calls — over WebRTC or WebSocket transport. Pipecat is the open alternative to Vapi or Retell for teams that want full ownership of the runtime, model choice, and cost model. BSD-2-Clause licensed.

Best for: Developers building self-hosted voice agents with explicit control over the stack.
Watch out for: DIY; you operate the media transport, model selection, and scaling.

Install / use

pip install pipecat-ai

Features

Speaker diarizationNo
Word-level timestampsNo
Streaming / real-timeNo
Languages supportedNone
HIPAA eligibleNo

Pipecat vs Whipscribe

FeaturePipecatWhipscribe
CategoryOpen sourceTranscription APIs
Pricingfreefree beta
Speaker diarizationYes
Word timestampsYes
StreamingNo
Languages99
PlatformsLinux, macOS, Windows, CloudWeb, API, MCP

Alternatives to Pipecat

Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or drop a file in the form below — you'll have a transcript in seconds.