TEN Framework

by Agora

Open-source framework by Agora for building realtime multimodal voice AI agents.

TL;DR

Open-source framework by Agora for building realtime multimodal voice AI agents.

Best for builders on Agora RTC who want an OSS voice/video agent runtime. Pricing: free.

Category
Open source
License
Stars
Last push
Pricing
free
Platforms
Linux, macOS, Cloud

What it is

TEN (The Embodied Network) is Agora's open-source framework for realtime multimodal agents over its global RTC SDK. It abstracts ASR, LLM, TTS, and video components into reusable extensions and is positioned alongside Pipecat and LiveKit Agents as an OSS alternative for voice-agent runtime. Apache-2.0 licensed.

Best for: Builders on Agora RTC who want an OSS voice/video agent runtime.
Watch out for: Newer project; ecosystem narrower than Pipecat or LiveKit Agents.

Features

Speaker diarizationNo
Word-level timestampsNo
Streaming / real-timeNo
Languages supportedNone
HIPAA eligibleNo

TEN Framework vs Whipscribe

FeatureTEN FrameworkWhipscribe
CategoryOpen sourceTranscription APIs
Pricingfreefree beta
Speaker diarizationYes
Word timestampsYes
StreamingNo
Languages99
PlatformsLinux, macOS, CloudWeb, API, MCP

Alternatives to TEN Framework

Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or drop a file in the form below — you'll have a transcript in seconds.