Transcription tools directory

Meta's free natural-language and speech understanding platform.

Vonage Voice API (ASR Connector)

Vonage

Vonage's CPaaS speech-to-text via the ASR connector (typically Deepgram-powered).

Vonage Voice price + ASR per-minute

Plivo Voice (Speech Recognition)

Plivo

Plivo's CPaaS speech recognition for IVR + call-recording workflows.

Plivo Voice + per-minute ASR

Bandwidth Voice (Transcription)

Bandwidth

Bandwidth's voice CPaaS with optional transcription on recordings and IVR.

per-minute (see Bandwidth pricing)

Play.HT (STT endpoints)

Play.HT

Play.HT's transcription endpoint as a counterpart to its TTS family.

per-minute (see Play.HT pricing)

Lemonfox.ai

Hosted Whisper API at low per-hour pricing for developers.

from $0.17/hr (Whisper)

Speakbot (Whisper API alt)

Speakbot

Hosted Whisper API with file-based and URL ingestion.

per-minute (see Speakbot pricing)

Voicegain

Deep-learning ASR you can deploy in your own cloud or use as managed SaaS.

per-minute SaaS + Edge license

Amazon Transcribe Streaming

per-Azure-Speech pricing (Fast variant)

Real-time streaming variant of Amazon Transcribe over HTTP/2 + WebSocket.

from $0.024/min

Azure Fast Transcription

Microsoft Azure

Azure Speech's batch-fast mode for short-turnaround transcription with predictable latency.

Google Cloud Speaker Diarization

Diarization layer for Google Cloud Speech-to-Text v2.

per-Speech v2 pricing

Deepgram Nova-3

from $0.0043/min (Nova-3, batch)

Deepgram's current-generation streaming + batch ASR model.

AssemblyAI Realtime / Streaming

AssemblyAI

AssemblyAI's WebSocket streaming endpoint for live captions and agents.

from $0.15/hr (Streaming)

Gladia Realtime

Gladia

Gladia's real-time streaming ASR API with multilingual code-switching.

per-hour streaming (see Gladia pricing)

OpenAI /audio/transcriptions (whisper-1, gpt-4o-transcribe)

from $0.006/min (whisper-1)

OpenAI's hosted Whisper + gpt-4o-transcribe models, batch endpoint.

OpenAI /audio/translations

from $0.006/min (whisper-1)

OpenAI's translate-to-English audio endpoint.

SambaNova (Whisper endpoints)

SambaNova

SambaNova's hosted Whisper-large-v3 endpoint on its RDU accelerator.

see SambaNova pricing

Together AI (Whisper)

Together AI

Together AI's hosted Whisper models among its open-model catalog.

per-Together-AI pricing

DeepInfra (Whisper)

DeepInfra

DeepInfra's hosted Whisper endpoint with per-second GPU pricing.

per-Deepinfra pricing

OVHcloud AI Speech-to-Text

OVHcloud

OVHcloud's managed speech-to-text inside its sovereign EU cloud.

per-OVHcloud pricing

Scaleway AI Inference

Scaleway

Scaleway's GPU inference platform commonly used for hosted Whisper.

per-Scaleway pricing (compute-based)

Alibaba Tongyi Audio (Qwen-Audio)

Alibaba Cloud

Alibaba's Tongyi multimodal model exposed for transcription + audio understanding.

tiered RMB-per-token / per-second

Baidu ERNIE Speech

Baidu

Baidu's ERNIE-aligned speech models inside ERNIE Bot Cloud.

tiered RMB-per-call

Huawei Pangu Speech

Huawei Cloud

Huawei's Pangu foundation models extended to speech for enterprise scenarios.

tiered RMB-per-call

Tencent Hunyuan (audio modality)

Tencent Cloud

Tencent's Hunyuan multimodal model with audio understanding endpoints.

tiered RMB-per-token

Naver HyperCLOVA X (audio)

Naver Cloud Platform

Naver's HyperCLOVA X foundation model with audio understanding.

Naver Cloud HyperCLOVA pricing

Kakao Kanana Speech

Kakao Enterprise

Kakao's Kanana foundation-model family with audio understanding.

Rev.ai Streaming

from $0.035/min (Streaming)

Rev.ai's WebSocket streaming endpoint for live transcripts.

Speechmatics (batch / language packs)

Speechmatics batch ASR with broad language pack catalog.

from $1.04/hr (Standard)

Hume EVI

Hume AI Inc.

Empathic voice interface with emotional-tone awareness.

pay-as-you-go

Otter.ai API

Otter.ai Inc.

Developer API access to Otter.ai's transcription engine.

Rasa Pro

Rasa Technologies GmbH

Open-source-anchored conversational AI for enterprise.

free / contact sales

Google Cloud Dialogflow

Google's conversational-AI platform for voice and chat agents.

usage-based

Amazon Lex

AWS conversational-AI platform for voice and text bots.

usage-based

Microsoft Bot Framework

Microsoft Corporation

Microsoft's open-source SDK and platform for conversational bots.

free SDK + Azure costs

Rev VoiceHub

Rev's enterprise transcription and recording API platform.

Trint API

Trint's transcription and translation API for newsrooms and media teams.

tiered · free quota + pay-as-you-go in CNY

iFlyTek Open Platform

iFlyTek

China's largest speech AI vendor — Mandarin, dialects, and 60+ languages via developer APIs.

Tencent Cloud ASR

Tencent Cloud

Tencent's cloud speech-to-text with one-sentence, sentence, and real-time APIs.

tiered · per-second pricing in CNY

Alibaba DAMO ASR

Alibaba Cloud

Alibaba Cloud / DAMO Academy speech recognition with Paraformer non-autoregressive models.

tiered · per-hour pricing in CNY/USD

Volcengine Speech

ByteDance Volcano Engine

ByteDance's Volcano Engine speech-to-text — short, long, and streaming Mandarin ASR.

tiered · pay-as-you-go in CNY

Mobvoi Speech

Mobvoi

Mobvoi (Chumen Wenwen) speech APIs — Mandarin recognition behind TicWatch and Volkswagen voice.

tiered · per-character pricing in CNY

NetEase Youdao ASR

NetEase Youdao

Youdao Cloud speech-to-text — Mandarin recognition behind Youdao Translator and dictionary pen.

Sogou ASR

Sogou (Tencent)

Sogou (Tencent-owned) speech-to-text — input-method-grade Mandarin recognition.

Reverie Language Technologies

Reverie Language Tech

Reverie's Indic speech recognition — 11 Indian languages from one of Reliance Jio's group companies.

Government of India (Digital India Bhashini Division)

Bhashini

Government of India's national language platform — public ASR APIs for 22 official languages.

free for non-commercial; commercial tiers TBD

Sarvam AI

Sarvam AI — full-stack Indian foundation models including Saaras / Saaransh speech APIs.

tiered · per-minute pricing with free tier

Tinkoff VoiceKit

Tinkoff (T-Bank)

Tinkoff VoiceKit — Russian-language ASR + TTS used inside Tinkoff Bank's contact centre.

tiered · per-second pricing in RUB

SoundHound

SoundHound AI

SoundHound Houndify — multilingual voice AI platform with embedded and cloud ASR.

tiered · contact sales for enterprise pricing

Lelapa AI

Lelapa AI — South African startup building Vulavula speech and language tools for African languages.

tiered · pay-as-you-go with free tier

Intella

Intella — Arabic speech-to-text API focused on MSA and major Arabic dialects.

tiered · per-hour pricing in USD with free tier

Alvenir Danish ASR

Alvenir

Alvenir — Danish-language speech-to-text product from a Copenhagen startup.

tiered · pay-as-you-go in EUR/DKK

AI-Loop

AI-Loop — multilingual African-language speech and NLP infrastructure.

tiered · pay-as-you-go in USD

Hume EVI

Hume AI

Empathic Voice Interface — voice AI that reads and responds to emotion in speech.

Dialogflow CX

Google Cloud's enterprise conversational AI platform with voice and chat channels.

Microsoft Bot Framework (Voice)

Microsoft's bot orchestration SDK with voice channels via Direct Line Speech.

IBM watsonx Assistant (Voice)

IBM

IBM's enterprise conversational AI platform with voice and contact-center integrations.

Vertex AI Conversation

Google Cloud's LLM-native conversational AI builder with voice support.

Twilio Voice Intelligence + Agents

Twilio

Twilio's ASR, voice intelligence, and ConversationRelay primitives for voice agents.

Plivo AI

Plivo

Voice AI agent capability layered on Plivo's CPaaS voice network.

Bandwidth Voice AI

Bandwidth

AI voice tooling layered on Bandwidth's tier-1 U.S. carrier network.

Telnyx Voice AI

Telnyx

AI inference and voice agents on Telnyx's own carrier and GPU stack.

Voximplant

CPaaS with serverless VoxEngine scenarios and AI voice integrations.

Daily.co Voice

Daily.co

WebRTC infrastructure for realtime voice and video AI agents.

Deepgram Voice Agent API

Single API for low-latency voice agents bundling Deepgram ASR + LLM + TTS.

AssemblyAI LeMUR Voice

AssemblyAI

AssemblyAI's LLM framework over its ASR for voice intelligence and agents.

ElevenLabs Conversational AI

ElevenLabs' end-to-end voice agent API with ASR, LLM, and premium TTS.

Azure AI Speech Voice Agent

Microsoft Azure's bundle of Speech SDK + Bot Framework for voice agents.

Ultravox Agents

Fixie.ai

Speech-native LLM and hosted agent runtime by Fixie.ai.

OpenAI Realtime Agents SDK

OpenAI's Agents SDK pattern over the Realtime API for voice-native assistants.

Anthropic Voice Agent Patterns

Anthropic

Reference patterns for building voice agents with Anthropic Claude models.

Cartesia Voice Agent stack

Cartesia

Cartesia's Sonic TTS plus partner ASR/LLM for low-latency voice agents.

Pollyo

AI-dubbing API for video platforms — backend OEM rather than a creator-facing app.

Camb.ai TTS

Camb.ai

Camb.ai's standalone text-to-speech surface — same MARS model that powers their dubbing.

freemium — API per-request, consumer apps free

iSpeech

TTS + STT API with consumer text-reader apps.

Desktop apps · 146 tracked

Native desktop applications for macOS, Windows, and Linux that transcribe files locally. All desktop apps →

MacWhisper

Jordi Bruin

Polished Mac app for Whisper — the default pick if you're on macOS.

SuperWhisper

Sindre Sorhus

Always-on system-wide dictation for macOS and iOS, powered by local Whisper.

Aiko

Sindre Sorhus

Free Mac App Store Whisper app — drag, drop, done.

Hindenburg PRO

Hindenburg Systems

Broadcast-style DAW built for journalists, podcasters, and audiobook producers.

Filmora AI

Wondershare

Wondershare's consumer video editor with AI captioning and short-form features.

Premiere Pro Auto-Transcribe

Adobe Premiere Pro's built-in speech-to-text and caption track.

Final Cut Pro Live Captions

Apple FCP's caption track with macOS dictation-based transcription assist.

DaVinci Resolve Auto-Caption

Blackmagic Design

Resolve Studio's built-in speech-to-text caption track.

Recut

Native macOS app that removes silences from videos before you import to your editor.

iZotope RX

Industry-standard audio repair suite — dialogue isolation, declip, dehum, denoise.

Adobe Audition AI

Adobe Audition with built-in Enhance Speech, multitrack, and spectral repair.

Audacity

Muse Group

Free open-source DAW with community plugins and a budding AI line.

Ocenaudio

Ocenaudio Team

Fast, simple cross-platform audio editor for clean-up work.

Hindenburg Smart Audio

Hindenburg Systems

Hindenburg's automatic leveling + Voice Profiler tuned for spoken word.

Adobe Speech Enhance (Premiere)

Adobe's Enhance Speech model exposed inside Premiere Pro.

Subtitle Edit

Nikse

Free open-source subtitle editor with broad format support.

Aegisub

Aegisub Project

Open-source advanced subtitle editor favored by anime fansubbers.

Subtitle Workshop

URUSoft

Veteran Windows subtitle editor with batch conversion.

Stenograph CATalyst

Stenograph LLC

Court reporter CAT (computer-aided transcription) software for stenographers.

Eclipse CAT (Advantage Software)

Advantage Software

Stenographic court-reporter CAT software — main competitor to Stenograph CATalyst.

Case CATalyst

Stenograph LLC

Court reporter CAT software historically marketed as Case CATalyst by Stenograph.

Sonocent Audio Notetaker (legacy)

Glean Education

Predecessor to Glean — audio note-taking software for students.

discontinued — see Glean

Express Scribe

NCH Software

Long-standing transcription playback software with foot pedal support.

free / Pro one-time license

f4transkript

audiotranskription.de

Academic transcription playback software popular in qualitative research.

license + subscription

MAXQDA Transcription

VERBI Software

Built-in transcription playback inside the MAXQDA qualitative analysis software.

license · academic discount

InqScribe

Inquirium LLC

Veteran transcription playback and subtitle authoring tool.

license $99

Transana

Wisconsin Center for Education Research

Qualitative analysis software for audio and video transcripts.

license · academic pricing

Wispr Flow

Wispr AI

AI-augmented dictation app — types into any text field on macOS and Windows.

free / from $12/mo

Nuance Dragon NaturallySpeaking

Classic Windows desktop dictation for general users.

license from $200

Nuance Dragon Home

Consumer-tier Dragon for personal Windows users.

license $200

Talon Voice

Programmable voice control for power users — Linux, macOS, Windows.

free / Patreon paid

Braina

Brainasoft

Windows AI voice assistant and dictation product.

free / Pro license

e-Speaking

Long-running Windows voice command and dictation utility.

license

SpeechPulse

Windows dictation utility built around Whisper.

license

Voice Finger

Windows accessibility utility for mouse and keyboard control by voice.

Aqua Voice

AI dictation app — type into any field on macOS and Windows.

Voice Control for Mac (easy)

Spokenly

macOS dictation app that uses Whisper locally.

$24.99 one-time

Front-end utilities making Apple Voice Control easier to configure.

Telestream MacCaption

Telestream

Mac authoring tool for broadcast captions and post-production workflows.

Telestream CaptionMaker

Telestream

Windows authoring tool for closed captions and subtitles in broadcast and OTT.

EZTitles

ELF Software

Professional subtitle authoring suite used across European broadcast and OTT.

Spot Subtitle Editor

Spot Software

Classic Windows subtitle authoring tool with deep cinema and broadcast support.

FAB Subtitler

FAB Subtitling

Subtitle preparation and broadcast playout suite used across European TV.

Newfor (Screen Subtitling)

Screen Subtitling Systems

UK teletext-era subtitle origination still used in legacy broadcast workflows.

Screen Subtitling Systems

Screen Poliscript

Subtitle preparation tool from Screen Subtitling for broadcast and OTT delivery.

Cheetah CaptionMaker (legacy)

Cheetah International (legacy)

Original Cheetah Systems caption authoring tool — name now used by Telestream.

service · contact sales

Logosys

Subtitle preparation suite from Italy's Logosys, used in European broadcast.

Caplav

Subtitle preparation and live re-speaking tool aimed at European broadcasters.

FoCal Subtitler

FoCal

Niche subtitle authoring tool used by smaller European subtitle houses.

Stenograph CATalyst (Case CATalyst)

Stenograph

Industry-standard CAT software for stenographers and CART captioners.

StenoCAT

Niche but durable CAT software for stenographers, with realtime captioning support.

Aiseesoft AI Translator

Aiseesoft

Desktop AI video translation suite for Windows + macOS — batch dub multiple files offline.

Voicemod

Real-time voice changer + AI voice cloning — desktop, Windows-first, streamer audience.

Balabolka

Ilya Morozov

Free Windows TTS reader — uses installed SAPI voices, scriptable, no telemetry.

Pocketalk

Pocketalk Corp.

Pocket-sized dedicated live-translation hardware — 84 languages, two-way voice.

Vasco Translator

Vasco Electronics

Handheld translator with lifetime free internet — 70+ languages, M3 + V4 models.

Travis Touch

Travis

Crowdfunded handheld translator — 105 languages, Kickstarter origin.

Boeleo W1

Boeleo

Pen-shaped scanning translator — point at printed text, get spoken translation.

Lingmo Translate One

Lingmo International

Wearable / handheld live translation hardware from Lingmo International.

Trados Studio (Multilingual)

RWS (Trados)

Enterprise CAT tool with subtitle + voice-over file-format support.

memoQ (Multilingual)

memoQ

Enterprise CAT tool with subtitle file + AV reference support.

Articulate Storyline 360

Articulate

Authoring tool for SCORM e-learning with closed-caption support per slide.

Adobe Captivate

Adobe's e-learning authoring tool with closed-caption support for SCORM courses.

iSpring Suite

iSpring Solutions

PowerPoint-based e-learning authoring with closed-caption and narration tools.

Camtasia (Auto-Captions)

TechSmith

TechSmith Camtasia desktop video editor with auto-caption generation.

Lumivero (formerly QSR International)

NVivo

Qualitative research software with auto-transcription for interviews and focus groups.

ATLAS.ti

ATLAS.ti GmbH

Qualitative analysis platform with AI auto-coding and transcription.

MAXQDA

VERBI Software

Qualitative + mixed-methods research with built-in transcription tooling.

QDA Miner

Provalis Research

Qualitative + mixed-methods coding tool from Provalis Research.

Quirkos

Visual qualitative coding for thematic analysis, with transcript import.

Pro Tools Speech-to-Text

Avid

Avid Pro Tools' built-in dialogue transcription clip-effect for post and ADR sessions.

subscription $34.99/mo

Logic Pro Auto Mix

$199.99 one-time (macOS) · subscription $4.99/mo (iPad)

Apple Logic Pro's AI mixing assistant — leans on Stem Splitter + transcript-aware vocal balancing.

Ableton Live (AI plugin ecosystem)

Ableton

Ableton Live as a host for third-party AI vocal/speech plugins — Sonible, Acon, Synchro Arts, etc.

Standard $449 one-time · Suite $749 one-time

FL Studio (AI plugin ecosystem)

Image-Line

FL Studio as a VST host for AI speech-enhancement plugins (smart:EQ, Clarity Vx, DeNoise).

Producer $199 one-time · Signature $299 one-time · All Plugins Bundle $499 one-time · lifetime free updates

Cubase Pro VocalChain + speech-align

Steinberg

Steinberg Cubase Pro with VocalChain, AudioWarp, and built-in hooks for VocAlign / Revoice Pro.

Cubase Pro 14 $579 one-time · Artist $329 one-time · Elements $99 one-time

Steinberg Nuendo

Steinberg

Steinberg's post-production DAW with AI-assisted DialogueDetective and ADR workflow.

Nuendo 14 $1,799 one-time · NEK $279 one-time

Reaper (AI plugin host)

Cockos

Cockos Reaper — cheap, scriptable, hosts every AI dialogue plugin via VST3/CLAP/JSFX.

Discounted $60 one-time · Commercial $225 one-time

PreSonus Studio One

PreSonus

Studio One with Stem Separation, Lyrics Display, and Vocal/Speech-aware tooling.

Pro+ subscription $14.95/mo · Pro perpetual $399 one-time

Synchro Arts Revoice Pro 5

Synchro Arts

Standalone + ARA dialogue/vocal alignment with AI Process Assist.

Revoice Pro 5 $599 one-time · subscription $19.99/mo

Synchro Arts VocAlign Project 6

Synchro Arts

Lighter VocAlign aimed at music producers for unison/double-track tightening.

$99 one-time

Synchro Arts VocAlign Pro

Synchro Arts

VocAlign Pro — timing + pitch alignment, used for ADR and lip-sync dubbing.

$259 one-time

Sound Radix Auto-Align Post 2

Sound Radix

Phase + timing alignment across multi-mic dialogue recordings.

$349 one-time

iZotope Dialogue Match

$499 one-time (often bundled in RX Post Production Suite)

Match production-dialogue tone to ADR recordings — EQ + reverb + ambience capture.

iZotope RX Dialogue Isolate (standalone)

bundled with RX Standard / Advanced / Post Production Suite

Source-separation module for stripping music + ambience away from dialogue.

Acon Digital Extract:Dialogue

Acon Digital

AI-based dialogue extraction plugin for film, broadcast, and podcast post.

$199 one-time

Acon Digital DeNoise

Acon Digital

Spectral noise reduction with adaptive AI noise-print learning.

$99 one-time

Waves Clarity Vx

Waves Audio

Neural-network dialogue noise reduction — single-knob, AAX/VST3/AU.

$99 one-time (Clarity Vx) · $299 one-time (Clarity Vx Pro)

Waves Clarity Vx DeReverb

Waves Audio

AI de-reverb companion to Clarity Vx — removes room reflections from spoken dialogue.

$99 one-time

CEDAR Studio

CEDAR Audio

Broadcast-grade dialogue restoration suite — DNS, DeClick, DeHiss, DeBuzz.

per-module licensing · contact CEDAR for quote

CEDAR DNS 2 / DNS 8D / DNS One

CEDAR Audio

CEDAR's Dialogue Noise Suppression hardware + plugin line — production-set staple.

per-module quote · standalone hardware also available

NUGEN Audio Halo Upmix / DialogueChecker

NUGEN Audio

NUGEN's upmix + loudness suite with speech-aware dialogue level checking.

Halo Upmix $499 one-time · LM-Correct / VisLM $399+ one-time

Klevgrand Brusfri

Klevgrand

Single-fingerprint noise reducer tuned for dialogue and vocals.

$59.99 one-time (desktop) · $19.99 (iOS)

Sonible smart:EQ 4

Sonible

AI-driven adaptive EQ with a 'speech' profile for podcast and dialogue tracks.

$129 one-time · subscription $9.90/mo (Sonible Studio)

Sonible smart:dynamics

Sonible

Compressor with AI gain-profile detection — speech / vocals / drums presets.

$129 one-time

Sonible pure:comp / pure:limit / pure:verb

Sonible

AI-presetting compressor / limiter / reverb trio aimed at podcasters.

$39 one-time per plugin

Accentize DeBleeder / DialogueEnhance / VoiceGate

Accentize

ML-based dialogue plugins for spill removal, enhancement, and noise-aware gating.

DialogueEnhance $179 one-time · DeBleeder $179 one-time · VoiceGate $109 one-time

CrumplePop AudioDenoise AI / EchoRemover

CrumplePop (Boris FX)

FCP / Premiere / DaVinci-native AI noise + echo removal for video editors.

AudioDenoise AI $99 one-time · EchoRemover $99 one-time · Bundles available

Zynaptiq UNVEIL

Zynaptiq

Real-time de-reverb plugin built on Zynaptiq's MAP (Mixed-signal Audio Processing) tech.

$499 one-time

Zynaptiq UNFILTER

Zynaptiq

Real-time linear-filter compensator — fixes muffled mics, comb filtering, telephone EQ.

$369 one-time

Soundtoys Little AlterBoy

Soundtoys

Vocal pitch + formant shifter — used as a quick speech-disguise / character voicer.

$99 one-time · often bundled in Soundtoys 5

Audionamix XTRAX STEMS / Speech Volume

Audionamix

Source-separation suite for music + speech-volume normalization for podcasts.

varies by product · contact Audionamix for quote

Soundwhale

Remote-collaboration DAW client for film/TV post — built-in AI dialogue tooling.

subscription $19.99/mo · Pro $39.99/mo

Avid Media Composer Phonetic Index

Avid

Media Composer's phonetic search + transcript-aware logging for NLE editors.

subscription $23.99/mo · annual $269.88/yr

Vegas Pro AI Audio Tools

MAGIX

MAGIX Vegas Pro with AI transcription, noise reduction, and smart-mask audio routing.

Edit subscription $19.99/mo · Pro perpetual $399 one-time

Lightworks AI Tools

LWKS

Lightworks NLE with AI-assisted transcription + caption workflow.

free tier · Create subscription $9.99/mo · Pro $23.99/mo

OBS Studio with AI Caption plugins

OBS Project + community plugins

OBS hosts third-party AI live-caption plugins (e.g. obs-localvocal) for streamers.

Streamlabs Desktop AI Tools

Streamlabs (Logitech)

Streamlabs (OBS fork) with AI Highlighter and stream-clip detection.

free · Ultra subscription $19/mo or $149/yr

NDI Tools + AI Captions

Vizrt (NDI)

NDI's free toolset with built-in caption + speech routing for production switchers.

Resolume Arena / Avenue (AI audio reactives)

Resolume

VJ software where AI speech-/audio-reactive plugins drive visuals from voice input.

Avenue 7 €299 one-time · Arena 7 €799 one-time

Apple GarageBand

Apple's free DAW — useful entry-point for podcasters before upgrading to Logic Pro.

Reason Studios (plugin host)

Reason Studios

Reason DAW as a VST3 / Rack Extension host for AI speech-cleanup plugins.

Reason 13 subscription $19.99/mo · perpetual $499 one-time

Bitwig Studio (AI plugin host)

Bitwig

Bitwig Studio with VST3/CLAP-hosted AI dialogue plugins — Sonible, Acon, Waves.

Bitwig Studio $399 one-time · Producer $199 · Essentials $99

Ardour (open-source DAW)

Ardour community

Cross-platform open-source DAW that hosts AI VST3/LV2 dialogue plugins.

free (build from source) · $1+/mo or $45+ one-time for binary downloads

Harrison Mixbus 32C

Harrison Audio

Console-style DAW (Harrison) with classic-mic preamp emulations + dialogue workflow.

Mixbus 32C $239 one-time

MAGIX Samplitude Pro X

MAGIX

MAGIX's pro DAW with object-oriented editing + speech-aware mastering chain.

$399 one-time

Steinberg WaveLab Pro

Steinberg

Mastering DAW with podcast-oriented loudness + speech-aware analysis tools.

WaveLab Pro 12 $579 one-time · Elements $99

TwistedWave

macOS / iOS / online audio editor used widely in podcast post — clean speech-edit UX.

macOS $79.90 one-time · iOS $9.99 one-time · Online subscription tiers

HairerSoft Amadeus Pro

HairerSoft

macOS-native multi-track editor with spectral repair and AU plugin hosting.

$59.99 one-time

iZotope RX Elements

$129 one-time (often included with audio interface bundles)

RX Elements — the entry-tier of the RX family, often bundled with audio interfaces.

Audio Design Desk

AI-assisted post-production editor that auto-syncs SFX + dialogue to picture.

Creator $4.99/mo · Pro $19.99/mo · Studio $39.99/mo

Klang:fabrik / Klang:apps

Klang:technologies

Immersive object-based mixing platform — speech routing in 360 audio.

enterprise · contact Klang for quote

Supertone Clear

Supertone

AI noise/reverb removal plugin from Supertone — broadcast-grade real-time dialogue cleanup.

subscription $9.99/mo · perpetual $179 one-time

Neutone FX / Morpho

Neutone (Qosmo)

Neural-audio research plugin platform — host AI speech / instrument models in any DAW.

Neutone FX free · Morpho subscription tiers

Vienna MIR Pro / MIR x

Vienna Symphonic Library

Convolution / room-simulation suite used for dialogue placement in immersive mixes.

MIR Pro 24 €395 one-time · MIR x bundled with selected libraries

FabFilter Pro-Q 4 (with AI Spectrum Match)

FabFilter

FabFilter's flagship EQ — Pro-Q 4 adds an AI Spectrum Match for dialogue / vocal matching.

$169 one-time (Pro-Q 4)

ElevenLabs Voice Changer (plugin path)

free tier · Creator $22/mo · Pro $99/mo

ElevenLabs' speech-to-speech voice conversion — wrapped as a desktop / plugin workflow.

Voicemod (real-time voice changer)

Voicemod

Real-time AI voice-changer routed into Discord, OBS, Zoom — system-level audio plugin.

free · Pro subscription $5+/mo (intro) up to $15/mo

Topaz Audio AI (planned / beta)

Topaz Labs

Topaz Labs' speech-enhancement engine — currently in invite beta.

beta · pricing tbd by Topaz Labs

iZotope Ozone 11 (with Master Assistant)

Standard $249 one-time · Advanced $499 one-time · subscription tiers

iZotope Ozone — AI Master Assistant with explicit speech / podcast targets.

iZotope Nectar 4 (vocal-aware chain)

Standard $249 one-time · Advanced $399 one-time

iZotope's vocal channel strip with Vocal Assistant — speech and vocal modes.

Voice.ai

Real-time AI voice changer with universal app compatibility.

free + Pro paid tier

MagicMic

iMyFone

iMyFone's real-time voice changer for Windows/macOS with 600+ effects.

Monthly $9.95, Yearly $19.95, Lifetime $39.95

Clownfish Voice Changer

Clownfish

Free Windows voice changer with system-wide audio interception.

AV Voice Changer Software

Audio4fun

Audio4fun's commercial voice changer suite for Windows.

Basic $29.95, Gold $59.95, Diamond $99.95 (one-time)

MorphVOX

Screaming Bee

Screaming Bee's veteran voice changer for gaming and online play.

Junior free, Pro $39.99 one-time

Panopreter

Panopreter Software

Windows TTS reader with batch file conversion.

Basic free, Plus $29.95 (one-time)

Apple Siri

Apple Intelligence Siri (Siri 2.0)

Apple's voice assistant across iPhone, iPad, Mac, Apple Watch, and HomePod.

Free with Apple device

Free with supported Apple device

The LLM-augmented Siri introduced with Apple Intelligence on iOS 18 / macOS Sequoia.

Microsoft Cortana

Deprecated · use Copilot instead

Microsoft's voice assistant — consumer surfaces deprecated; remnants live on in Teams.

Apple Vision Pro Live Captions

Free with Vision Pro ($3,499 headset)

Real-time captions in visionOS — overlay spoken speech as floating text in your field of view.

Android Live Caption

Free with supported Android device

System-wide on-device captioning of any audio on Android — calls, video, podcasts.

AirPods Pro Live Listen

Free with AirPods Pro ($249)

Use AirPods Pro as remote mic + real-time amplifier — hearing-assist mode.

AirPods Pro Hearing Aid

Free with AirPods Pro 2 ($249)

FDA-cleared hearing-aid mode in AirPods Pro 2 — clinical-grade hearing assistance.

Apple HomePod Voice (Siri on HomePod)

$99 mini · $299 full HomePod

Siri on HomePod / HomePod mini — voice-first smart-home control.

Apple TV Siri Remote Voice Search

Included with Apple TV ($129+)

Hold the Siri button on the Apple TV remote to search content by voice.

Apple Vision Pro Dictation

Free with Vision Pro ($3,499)

On-device dictation in visionOS for any text-input UI element.

Products · 981 tracked

End-user transcription products — meeting bots, editors, and turnkey workflows. All products →

Meeting-bot transcription product for Zoom/Meet/Teams.

AI from $0.25/min · human from $1.50/min

Human + AI transcription, highest accuracy tier on the market.

Descript

Audio/video editor that treats the transcript as the timeline — different product category.

free / from $12/mo

Enterprise-focused transcription + collaborative editor for newsrooms.

from $60/mo

Fireflies.ai

Meeting-bot transcription + CRM integrations, competitor to Otter.

Amazon Transcribe Call Analytics

from $0.03/min (post-call) + add-ons

Post-call and real-time call analytics on AWS with sentiment, talk-time, and issue detection.

Symbl.ai

Conversation intelligence API with transcripts, action items, and live agent assist.

per-minute (free tier + paid tiers)

Voci Technologies (V-Spark / V-Blaze)

Medallia (Voci)

Enterprise call-analytics ASR engine, on-prem-friendly, acquired by Medallia.

VoiceBase (LivePerson)

LivePerson (VoiceBase)

Conversation analytics API acquired by LivePerson, originally a transcription-first vendor.

Behavox

Compliance-focused voice + comms surveillance for regulated finance.

Cogito

Real-time agent assist analyzing voice paralinguistics for emotion + behavior coaching.

Hume EVI (Empathic Voice + Prosody)

Hume AI

Hume's voice-AI platform with prosody/expression analysis bundled with transcription.

per-minute (see Hume pricing)

Picovoice Cheetah

Free tier + per-user/Enterprise plans

On-device streaming speech-to-text optimized for embedded and edge.

Picovoice Leopard

Free tier + per-user/Enterprise plans

On-device offline speech-to-text from Picovoice — file-based, no cloud.

Cobalt Speech (Cubic / Diatheke)

Cobalt Speech & Language

On-prem ASR + dialog stack popular in defense, regulated finance, and accessibility.

Krisp API

Krisp

Krisp's noise-cancellation + transcription + AI meeting assistant API for embedding.

Twilio Voice Intelligence

Twilio

Twilio's call-analytics product on top of Twilio Voice — transcripts + language operators.

from $0.05/min (transcription) + per-operator add-ons

Nylas Notetaker

Nylas

API to drop a notetaker bot into Zoom/Meet/Teams meetings and return transcripts.

per-bot-minute + transcription minute (see Recall.ai pricing)

Recall.ai

Universal meeting bot API for Zoom/Meet/Teams/Webex with transcription + raw audio.

RingCentral RingSense

RingCentral

RingCentral's call-recording and conversation-intelligence layer.

per-seat add-on to RingCentral

Dialpad Ai (DialpadGPT)

Dialpad

Dialpad's in-house conversation intelligence on top of its UCaaS + contact-center.

Bundled in Dialpad tiers

Zoom AI Companion

Zoom

Zoom's bundled meeting AI: transcripts, summaries, smart compose.

Included with paid Zoom Workplace plans

Webex AI Assistant

Cisco Webex

Cisco Webex's meeting AI: transcripts, summaries, action items.

Bundled in Webex Suite tiers

Microsoft Teams Premium (Intelligent Recap)

$10/user/month (Teams Premium add-on, list)

Teams Premium's AI meeting features: intelligent recap, live captions, translations.

Google Meet AI (Gemini in Meet)

Included in Gemini for Workspace tiers

Gemini-powered take-notes-for-me, captions, and translation inside Google Meet.

Fireflies.ai API

Fireflies.ai

Fireflies' notetaker exposed as an API for meetings + custom audio uploads.

Bundled in Fireflies plans (Business / Enterprise)

Grain

Sales-focused meeting recorder with transcripts, clips, and coaching.

Free + paid plans (Starter / Business)

Fathom

Free AI meeting notetaker with transcripts, summaries, and CRM sync.

Free for individuals; paid Team tiers

tl;dv

AI meeting notetaker focused on Zoom + Meet with multilingual transcripts.

Free + paid Pro / Business plans

Supernormal

AI notetaker for Google Meet, Zoom, and Teams with customizable templates.

Free + paid Pro / Business / Enterprise

Avoma

Meeting lifecycle + conversation intelligence platform for revenue teams.

Free + paid Starter / Plus / Business / Enterprise

Fellow

Meeting management + AI notetaker with agendas, action items, and 1:1 templates.

Free + paid Pro / Business / Enterprise

Circleback

AI notetaker focused on premium meeting summaries and CRM workflows.

paid tiers (see Circleback pricing)

Spinach.ai

AI scrum master + meeting notetaker for engineering teams.

Free + paid plans

Otter Business / Enterprise

$20/user/mo (Business) and Enterprise (contact sales)

Otter.ai's team and enterprise tiers with OtterPilot for live meetings.

Rev Max / Enterprise

from $34.99/user/mo (Max) and Enterprise (contact sales)

Rev's enterprise plans bundling ASR, transcripts and Verbit-class managed services.

Enterprise transcription + captioning + ASR for legal, media, education, government.

from $0.10/min (machine) to $0.80/min (manual)

Scribie

Hybrid AI + human transcription with file-based ordering.

GoTranscript

Human transcription service with 119+ language pairs and multiple turnaround tiers.

from $0.84/min (human, standard tier)

TranscribeMe

Hybrid AI + human transcription with strong privacy and security positioning.

from $0.79/min (human verbatim) and lower for AI

SpeakWrite

US-based human transcription with same-day turnaround for legal and government.

per-word pricing (see speakwrite.com)

Happy Scribe

Dublin-based AI + human transcription, subtitles, and dubbing platform.

from $17/user/mo (Basic) and per-hour overage

Sonix

AI transcription + collaborative editor with multilingual support.

from $10/hr (Standard) + per-hour or subscription

Temi

Rev (Temi)

Rev's machine-only transcription product at a flat per-minute rate.

$0.25/min (machine-only)

Amberscript

Dutch AI + human transcription and subtitling platform with EU data residency.

from EUR 8/hr (machine) up to manual + certified tiers

Rev.com Podcast Transcription

$1.99/min (human) and $0.25/min (AI)

Rev.com's per-minute podcast transcription with English + Spanish coverage.

Descript Storyboard

Descript

Descript's video + audio editor with ASR-driven transcript editing.

from $24/user/mo (Hobbyist) up to Pro / Business

Trint Stories

Bundled in Trint subscription tiers

Trint's newsroom-focused transcript editor with multi-language workflows.

Observe.AI

Contact-center AI for agent assist, automated QA, and conversational intelligence.

CallMiner Eureka

CallMiner

Long-running conversation-analytics suite for contact-center operations and compliance.

Verint Speech Analytics

Verint

Verint's voice-of-customer + workforce-engagement analytics on calls.

NICE Enlighten AI

NICE

NICE's AI layer for CXone with proprietary contact-center speech models.

Level AI

Contact-center AI for auto-QA, agent assist and analytics on conversational signal.

Add-on to Talkdesk CCaaS plans

Talkdesk Copilot

Talkdesk

Talkdesk's AI layer for agent assist, automated summaries, and QA inside its CCaaS.

Genesys Cloud AI Experience

Genesys

Genesys Cloud's bundled AI tier with voice transcripts, summaries, and agent copilot.

Bundled in Genesys Cloud AI Experience tiers

Five9 AI (Aceyus + Inference)

Five9

Five9's AI capabilities for contact-center voice and digital channels.

Add-on to Five9 CCaaS plans

Convoso

Outbound contact-center platform with conversation analytics for sales calls.

Aircall AI

Aircall

Aircall's AI layer for transcription, summaries, and call insights.

Add-on to Aircall plans

Cresta

Real-time agent assist and contact-center AI built on proprietary LLMs.

Balto

Real-time playbook + coaching engine for contact-center agents.

Tethr

Conversation analytics platform focused on customer-effort and CX measurement.

Invoca

Revenue-execution platform analyzing inbound calls for marketing attribution + conversion.

Marchex

Call analytics + conversation AI for marketing attribution.

CallRail Conversation Intelligence

CallRail

CallRail's transcription + conversation analytics layer for inbound calls.

from $50/mo (Conversation Intelligence add-on, plus base CallRail plan)

Gridspace

Voice AI infrastructure with proprietary ASR + voice bots for enterprise contact centers.

Phonic

Voice and video research platform using ASR + NLP for qualitative analysis.

per-minute (see Phonely pricing)

Userbird Voice

Userbird

Voice + video feedback widget with auto-transcription for product teams.

Free + paid tiers

Phonely

AI voice agent platform for inbound and outbound phone calls.

Bland.ai

AI voice-agent platform for outbound + inbound calls at scale.

from $0.09/min

Vapi

Voice-AI infrastructure: turnkey assistants composed of STT + LLM + TTS providers.

$0.05/min (Vapi orchestration) + provider pass-through

Retell AI

Real-time voice-agent platform with low-latency conversational AI.

per-minute (see Retell pricing)

Synthflow

No-code voice-agent builder for SMB phone automation.

from $29/mo (Starter)

Voiceflow

Conversation-design platform for voice and chat agents.

Free + paid plans (Pro / Teams / Enterprise)

Regal AI

Regal

AI phone agents and contact-center for outbound sales.

Status: acquired — verify availability

Speechly (Roblox)

Roblox (Speechly)

Real-time speech-to-text + intent platform; acquired by Roblox in 2023.

Speak Ai

Qualitative-research transcription + NLP for media monitoring and research.

from $19/mo (Starter)

Notta

Multilingual transcription app with strong Asia-Pacific footprint.

Free + paid Pro / Business plans

Yandex Cloud SpeechSense

Yandex Cloud

Yandex Cloud's call-center analytics product layered on SpeechKit.

tiered RUB-per-second + analytics fee

interScriber

Swiss interview transcription tool for journalists and researchers.

per-minute (see interScriber pricing)

Noota

Meeting recorder for recruitment and sales with structured-output templates.

Free + paid Starter / Pro / Business

Vatic AI

Voice intelligence platform for transcripts, summaries and topic analysis.

Spitch

Swiss voice-AI platform with strong Swiss-German + multilingual coverage.

Intelligent Voice

Enterprise voice analytics + eDiscovery on call recordings.

SmartAction

Voice-AI IVR replacement for contact centers.

Interactions IVA

Interactions

Enterprise virtual assistant for contact centers with human-in-the-loop fallback.

Yactraq

Cloud-based ASR + speech-analytics service for contact centers.

VoiceIntelligence.AI

Voice-analytics platform spanning recording, transcription, scoring and BI.

AudioShake

AI stem separation + transcription for media, music, and podcast post-production.

Wordcab

Conversation-AI API focused on long-form transcription + summarization.

Azure Conversation Transcription

Microsoft Azure

Azure Speech's multi-speaker meeting transcription with channel and speaker ID.

per-Azure-Speech pricing

Speechmatics Flow

Speechmatics' voice agent API combining ASR, LLM, and TTS.

IBM watsonx Assistant (speech)

IBM

IBM watsonx Assistant's speech-in for chat/voice agents.

per IBM watsonx pricing

Twilio Media Streams + Partner ASR

Twilio

Twilio's media-streaming primitive piped to partner ASR vendors.

Twilio Voice + chosen ASR vendor

Deepgram Nova-3 Medical

per Deepgram pricing (medical SKU)

Deepgram's medical-tuned variant of Nova-3 for healthcare ASR.

Speechmatics (medical / regulated)

Speechmatics' enterprise ASR with on-prem option for healthcare.

Contact sales (enterprise)

Abridge

Clinical documentation: ambient transcription + structured medical notes.

Nuance DAX (Microsoft Dragon Copilot)

Microsoft (Nuance)

Ambient clinical documentation, now Microsoft Dragon Copilot.

Contact Microsoft Healthcare sales

Suki AI

Voice-enabled AI assistant for clinical documentation.

Augmedix

Hybrid human + AI medical scribe with real-time documentation.

DeepScribe

AI medical scribe converting clinical conversations into structured notes.

Ambience Healthcare

AI documentation platform for clinicians with multi-specialty coverage.

Add-on to Talkdesk CCaaS plans

Heidi Health

AI scribe popular outside the US for solo and small-practice clinicians.

Free + paid Pro plans

Freed

Lightweight AI scribe targeting independent US clinicians.

from $99/clinician/mo

Talkdesk QM Assist

Talkdesk

Automated quality-management scoring + coaching from Talkdesk.

Five9 Agent Assist

Five9

Five9's real-time agent-assist module for sentiment, next-best-action, and summary.

Add-on to Five9 CCaaS plans

Genesys Cloud Voice Bot Flows

Genesys

Genesys Cloud's built-in voice-bot builder for IVR replacement and self-service.

Bundled in Genesys Cloud tiers

Vonage AI Studio

Vonage

Vonage's no-code IVA builder for voice and chat agents.

Vonage AI Studio pricing

TranscribeCx (regional)

Various regional vendors

Smaller regional transcription vendors aggregated under a directory-level placeholder.

Varies by vendor

Tomedes Transcription

Tomedes

Tomedes translation agency's human transcription division.

per-minute (see Tomedes quote)

Lexikeet (regional human transcription)

Lexikeet

Regional human-transcription marketplaces for niche language pairs.

per-minute, quote-based

Podsqueeze

Podcast workflow tool with transcription, show notes, and clips.

from $9/mo (Starter)

Swell AI

Podcast content engine with transcription + summaries + repurposing.

from $29/mo

Deciphr

AI-powered podcast and video producer for transcripts + clips + content.

Captions

AI video editor with auto-captions and dubbing for short-form creators.

Free + paid Pro

Submagic

AI captions and short-clip editor for short-form creators.

Free + paid Basic / Pro / Business

Opus Clip

AI clipper that turns long videos into short-form clips with captions.

from $19/mo (Starter)

Vizard

Online AI video editor with auto-captions and clip generation.

Free + paid plans

Kapwing AI Subtitles

Kapwing

Kapwing's online editor with AI auto-subtitles for video creators.

Free + paid Pro

VEED.IO

Online video editor with AI subtitles, translation, and dubbing.

Rev Captions

from $1.50/min (human captions) and $0.25/min (AI)

Rev's captioning SKU for video accessibility and broadcast.

Capshion

AI captioning service for short-form video creators.

from $10/mo

SubtitleBee

Web tool generating styled subtitles for video creators.

from $11.99/mo

YouTube Studio Auto-Captions

YouTube

YouTube's built-in automatic captions for uploaded videos.

TikTok Auto-Captions

TikTok

TikTok's in-app automatic captions for uploaded short-form videos.

Instagram Auto-Captions

Instagram's in-app auto-captions for Reels and Stories.

Facebook Auto-Captions

Meta's Page-level auto-caption tool for uploaded video.

LinkedIn Auto-Captions

Microsoft (LinkedIn)

LinkedIn's native auto-captions for uploaded videos.

Bundled in StageTen plans

StageTen Captions

StageTen

Live-streaming production tool with built-in auto-captions.

AI-Media (LEXI)

Broadcast-grade live captioning for TV, government, and education.

Hardware purchase + service

Epiphan LiveScrypt

Epiphan Video

Live captioning appliance for events and broadcast.

Interprefy

Remote simultaneous interpretation platform with AI captions and human interpreters.

KUDO

Multilingual virtual meeting platform with human + AI interpretation.

Wordly

AI live translation and captioning for meetings and events.

Tactiq

Tactiq Pty Ltd.

Live in-browser transcription overlay for Google Meet, Zoom, and Teams.

free / from $12/user/mo

Granola

Granola Inc.

AI notepad that augments your own notes with meeting-context transcripts.

free trial / from $18/mo

Read.ai

Read AI Inc.

Meeting copilot with engagement analytics and AI-generated recaps.

free / from $19.75/mo

Sembly AI

Sembly AI Inc.

Multilingual AI meeting assistant with searchable team knowledge base.

free / from $9.99/user/mo

Spinach

Spinach.io Inc.

AI scrum-master that runs standups, retros, and engineering rituals.

MeetGeek

MeetGeek.ai

AI meeting assistant with library, analytics, and conversation insights.

Limitless (formerly Rewind)

Krisp Notes

Krisp Technologies Inc.

AI meeting notes added on top of Krisp's noise-cancellation app.

free / from $8/mo

Limitless

Wearable AI pendant that captures and transcribes in-person conversations.

hardware $99 + from $19/mo

Plaud Note

Plaud AI

AI voice recorder card that pairs with ChatGPT for transcription and summarization.

hardware $159 + free or $79/yr

Rewind

Rewind AI Inc.

macOS app that records and indexes everything seen and heard on a Mac.

from $19/mo

Briefly

Briefly Inc.

Quick AI-generated meeting summaries for Zoom, Meet, and Teams.

Magical (formerly Text Blaze).

Loopin

Loopin AI

AI notetaker focused on calendar-based summaries and team standups.

free / from $12/user/mo

Magical Notes

AI templates and meeting recaps inside the Magical productivity extension.

free / from $12/user/mo

Equal Time

Equal Time Inc.

AI meeting copilot that measures speaking time and inclusion metrics.

free / from $20/user/mo

Cogram

Cogram Ltd.

Enterprise AI notetaker with on-premise deployment and strong compliance posture.

Notiv

Notiv Pty Ltd.

Sales-call analysis and notetaker with focus on Pacific and APAC teams.

from $12/user/mo

Gong

Gong.io Inc.

Revenue intelligence platform that records, transcribes, and analyzes sales calls.

Chorus.ai

ZoomInfo Technologies

Conversation intelligence platform inside ZoomInfo's revenue OS.

Clari Copilot

Clari Inc.

Conversation intelligence inside the Clari revenue platform (formerly Wingman).

Salesloft Conversations

Salesloft Inc.

Call recording and intelligence inside the Salesloft revenue workflow platform.

Outreach Kaia

Outreach Corporation

Real-time sales assistant and conversation intelligence in Outreach.

free / from $39.99/user/mo

Apollo Conversations

Apollo.io Inc.

Call recording and AI insights inside Apollo's go-to-market platform.

from $99/user/mo

Trellus

Trellus AI

Real-time conversation coach for cold-call dialers.

Rilla

Rilla Voice Inc.

Virtual ride-along sales coach for in-home and field sales reps.

Refract

Refract.ai (Allego)

Conversation analytics and coaching for sales and customer-success teams.

ExecVision

Mediafly Inc.

Conversation intelligence and coaching platform inside Mediafly.

Allego

Allego Inc.

Sales enablement platform with conversation intelligence and onboarding video.

Mindtickle

Mindtickle Inc.

Sales readiness platform with call AI and rep skill analytics.

SetSail Technologies Inc.

SetSail

Revenue data platform that scores sales rep activity and signal capture.

Modjo

Modjo SAS

European conversation intelligence platform with deep CRM integration.

Salesken

Salesken Inc.

Real-time AI sales coaching during live calls.

Revenue Grid

Revenue Grid Inc.

Revenue intelligence with conversational AI and Salesforce inbox sidebar.

HubSpot Conversation Intelligence

Dialpad Ai Voice

Dialpad Inc.

Cloud phone system with real-time transcription and AI call summaries.

from $15/user/mo

HubSpot Inc.

Native call recording and AI insights inside HubSpot Sales Hub.

bundled from $90/user/mo

Einstein Conversation Insights

Salesforce Inc.

Salesforce's native AI for sales-call summarization and analysis.

Salesforce / Slack Technologies

Pipedrive AI

Pipedrive OÜ

AI sales assistant and CRM summarization inside Pipedrive.

from $59/user/mo

Zoho Zia

Zoho Corporation

Zoho's AI assistant with voice transcription across CRM, mail, and meetings.

from $45/user/mo

ClickUp Brain

ClickUp Inc.

AI summarization and assistant inside the ClickUp project-management suite.

$7/user/mo add-on

Notion AI Meeting Notes

Notion Labs Inc.

Notion's built-in AI summaries for transcribed meetings and notes.

$10/user/mo add-on

Mem

Mem Labs Inc.

Self-organizing AI notebook with meeting transcription and recall.

free / from $14.99/mo

Reflect Notes

Reflect Notes Inc.

Encrypted personal notes app with built-in OpenAI-powered AI features.

$10/mo or $100/yr

Tana

Tana Inc.

Outliner with AI nodes and meeting-capture workflows.

free / from $14/mo

Coda AI

Coda Project Inc.

AI assistant inside the Coda doc-and-database platform.

from $12/mo

Bardeen

Bardeen AI Inc.

Browser-based AI automation that includes meeting-bot summary playbooks.

free / from $20/user/mo

Zoom AI Companion

Zoom Communications Inc.

Native AI assistant across Zoom meetings, chat, and phone.

bundled with paid Zoom

Slack AI

Channel and thread summarization plus AI search across Slack workspaces.

$10/user/mo add-on

Loom AI

Atlassian / Loom Inc.

AI titles, summaries, and chapters for async video messages.

from $15/user/mo

Microsoft 365 Copilot in Teams

Microsoft Corporation

Microsoft Copilot inside Teams for live meeting summaries and recaps.

$30/user/mo

Gemini for Google Meet

Google LLC

Google Gemini for live captions, summaries, and Take Notes for Me in Meet.

bundled with Workspace

Slite AI

Slite SAS

Slite's AI assistant for asynchronous docs and meeting notes.

from $8/user/mo

Zoom Events AI Recap

Zoom Communications Inc.

AI recap and on-demand replay for Zoom Events and Webinars.

from $79/license/mo

ON24 Intelligence

ON24 Inc.

AI-driven webinar engagement and content intelligence.

BigMarker

BigMarker.com LLC

Webinar and virtual-event platform with AI session highlights.

from $79/mo

Goldcast

Goldcast Inc.

B2B event platform with Content Lab for AI clip generation.

Welcome

Welcome Inc.

Webinar and event platform with AI-generated recaps and clip library.

Banzai International Inc.

StreamYard

StreamYard (Hopin Inc.)

Browser-based live-streaming studio owned by Hopin.

free / from $20/mo

Restream

Restream Inc.

Multi-destination live-streaming platform with AI recording features.

free / from $16/mo

Demio

Webinar platform with built-in transcription and AI recap.

from $59/mo

Wistia AI

Wistia Inc.

AI captions, chapters, and SEO for marketing-video hosting.

from $24/mo

Lattice AI

Lattice Inc.

AI summaries and feedback inside the Lattice performance-management platform.

15Five

15Five Inc.

Performance management with AI-generated coaching insights.

Reclaim.ai

Reclaim Inc.

AI calendar that protects time for habits, meetings, and focus blocks.

free / from $14.99/user/mo

Colibri.ai

Real-time AI sales coach with conversation intelligence and CRM sync.

free / from $16/user/mo

Scribbl

Free Chrome-extension AI notetaker for Google Meet.

free / from $15/user/mo

Airgram

Airgram Inc.

Meeting notetaker with timeline scribbling and AI summaries.

free / from $18/user/mo

Noty.ai

AI meeting notes for Google Meet and Zoom with live captioning.

Vowel

Vowel Inc.

Meeting-platform-and-notetaker combo with built-in video calling.

free / from $20/user/mo

Grain

Grain Inc.

Conversation intelligence and clip-sharing for revenue teams.

Jamy

Jamy AI

Multilingual AI meeting notetaker with action-item automation.

free / from $15.99/user/mo

Laxis

Laxis Inc.

Meeting notetaker with research-interview templates and CRM workflows.

Clearword

Real-time AI meeting assistant with searchable highlights library.

free / from $25/user/mo

Fellow

Fellow.app Inc.

Meeting management platform with agendas, notes, and AI summaries.

free / from $11/user/mo

Hypercontext

Hypercontext Inc.

1:1 and team-meeting platform with AI-suggested talking points.

free / from $7/user/mo

Hive Notes

Hive Technology Inc.

Meeting notes inside Hive's project-management platform.

free / from $1/user/mo

Docket

Docket Inc.

AI meeting prep, agenda, and recap for sales and customer-success teams.

Attention

Attention Inc.

Real-time AI sales coach and CRM autopilot.

tldv Notes (formerly Lavender Notes)

Lavender.ai

AI meeting recap layer plus email outbound assistant.

from $29/user/mo

Sybill

Sybill Inc.

AI sales coach that scores reps on emotional and behavioral cues.

from $59/user/mo

Clari Groove

Clari Inc.

Sales engagement layer (formerly Groove) inside Clari's revenue platform.

NICE CXone Analytics

NICE Ltd.

Enterprise contact-center suite with conversation analytics.

Five9 Genius

Five9 Inc.

AI and conversation analytics inside the Five9 contact-center platform.

Dialpad Ai Contact Center

Dialpad Inc.

Cloud contact center with built-in real-time transcription and AI scorecards.

from $80/agent/mo

JustCall IQ

JustCall.io (SaaS Labs)

Conversation intelligence inside the JustCall cloud-phone platform.

from $19/user/mo

Zoom Revenue Accelerator

Zoom Communications Inc.

Conversation intelligence inside Zoom for revenue teams.

contact sales add-on

Clari Listen

Clari Inc.

Clari's revenue-team conversation-listening posture across signals.

Revenue.io

Revenue.io Inc.

Sales engagement and conversation intelligence platform.

Groove.co GenAI

Groove.co (Clari)

Groove sales engagement platform's generative-AI feature set.

Equilar BoardEdge

Equilar Inc.

Executive-meeting intelligence for IR and board-engagement teams.

Diligent Boards AI

Diligent Corporation

Board-meeting AI summaries and governance intelligence.

Nasdaq Boardvantage

Nasdaq Inc.

Board portal with AI-assisted minutes and meeting recap.

Synthflow AI

No-code voice-AI platform for inbound and outbound phone calls.

from $29/mo

Hiver Harvey AI

Hiver Technologies

Email-and-chat customer-service AI inside Hiver shared inbox.

from $34/user/mo

Front AI

Front App Inc.

AI inside Front's customer-communication platform.

from $19/user/mo

Intercom Fin

Intercom Inc.

Generative-AI agent that resolves customer-support conversations.

$0.99/resolution

Zendesk AI

Zendesk Inc.

Generative AI inside the Zendesk customer-service platform.

$50/agent/mo add-on

Service Cloud Einstein

Salesforce Inc.

Salesforce's AI for customer-service and contact centers.

Alex Anywhere

macOS AI agent that captures and summarizes any audio on the system.

Zetland Media (Good Tape)

Voicenotes

iOS / Android voice-first note-taking app with AI summaries.

from $10/mo

AudioPen

Voice-to-clean-text app for capturing rambling thoughts.

free / $99/yr

Good Tape

Online transcription service from a journalism nonprofit.

Transkriptor

Multilingual file transcription with summarization and meeting-bot.

free / from $4.99/mo

Happy Scribe

Automatic and human-corrected transcription and subtitling.

from €0.20/min

Sonix

Sonix Inc.

Automatic transcription, translation, and subtitling.

from $10/hr

Steno

Steno Agency Inc.

Court-reporting and legal deposition platform with AI transcripts.

Dragon Medical One

Microsoft (Nuance)

Cloud-based clinical speech recognition for clinical documentation.

Heidi Health

Heidi Health Pty Ltd.

AI medical scribe for clinicians in 30+ countries.

free / from $99/mo

Scribeberry

Scribeberry Inc.

Canadian AI medical scribe for primary care physicians.

Hubilo

Hubilo Softech Inc.

Virtual and hybrid event platform with AI recap.

vFairs

vFairs Inc.

Virtual event platform with AI session summaries.

Airmeet

Airmeet Inc.

Virtual and hybrid event platform with engagement and AI features.

from $4,000/event

Kaltura Events

Kaltura Inc.

Enterprise video platform with event broadcasting and AI captions.

ZoomInfo Engage

ZoomInfo Technologies

Sales engagement (cadence + dialer) inside ZoomInfo.

Scratchpad

Scratchpad Inc.

Lightning-fast revenue workspace for Salesforce updates and notes.

BrightHire

BrightHire Inc.

Interview intelligence for talent-acquisition teams.

Metaview

Metaview Ltd.

AI interviewer-notes platform for recruiting teams.

Hireflix

One-way video interview platform with AI transcription.

from $150/mo

hireEZ Vue

hireEZ Inc.

Outbound recruiting platform with AI sourcing and engagement.

Paradox Olivia

Paradox Inc.

Conversational-AI recruiter for high-volume hiring.

Kaltura Classroom

Kaltura Inc.

Lecture-capture and transcription for higher education.

Panopto

Panopto Inc.

Lecture-capture, video CMS, and AI transcription for education.

Echo360

Echo360 Inc.

Active-learning and lecture-capture platform for higher education.

Yoodli

Yoodli Inc.

AI public-speaking coach with private practice mode.

free / from $9/mo

Praktika

Praktika.ai

AI English-language tutor with voice conversations.

from $11.99/mo

Speak

Speakeasy Labs Inc.

AI English-speaking practice app backed by OpenAI.

from $19.99/mo

Riverside

Riverside.fm Inc.

Studio-quality remote recording for podcasts and video.

free / from $19/mo

SquadCast

SquadCast.fm (Descript)

Lossless remote-recording platform for podcasters.

free / pay-per-transcript

Zencastr

Zencastr Inc.

Browser-based podcast and video recording with AI features.

free / from $20/mo

Supercreator AI

Supercreator AI Ltd.

AI app for shorts/reels creators with auto-captioning.

free / from $19.99/mo

Rev Voice Recorder

Rev.com Inc.

Rev.com's iOS / Android voice recorder with one-tap transcription.

Just Press Record

Open Planet Software

iOS / Apple Watch voice recorder with on-device transcription.

$4.99 one-time

HubSpot AI Content Assistant

HubSpot Inc.

Generative-AI features across HubSpot marketing and sales tools.

bundled with HubSpot

Zoho Zia for Meetings

Zoho Corporation

Zoho's AI meeting summaries inside Zoho Meeting.

bundled from $1/host/mo

Microsoft Teams Premium

Microsoft Corporation

Premium Teams tier with intelligent recap and AI features.

$10/user/mo add-on

Workspace Duet AI

Google LLC

Original branding for Google's Workspace AI before Gemini rebrand.

bundled with Workspace

Krisp Meetings

Krisp Technologies Inc.

Krisp's combined audio-cleanup-plus-meeting-notes product.

bundled with Krisp

Nyota AI

AI meeting assistant with team-knowledge questions across past meetings.

from $19/mo

Krisp Federal

Krisp Technologies Inc.

FedRAMP-aligned tier of Krisp for US government use.

free / from $15.99/user/mo

Wudpecker

Wudpecker Oy

Privacy-first AI meeting notetaker.

Speak Ai

Speak Ai Inc.

Qualitative-research AI for transcribing and analyzing interviews.

from $19/mo

Dovetail

Dovetail Research

Research repository with AI analysis of qualitative data.

free / from $30/user/mo

Notably

Notably Inc.

Visual-canvas research-analysis tool with AI transcription.

free / from $25/mo

Condens

Condens GmbH

European qualitative-research analysis platform with AI features.

User Interviews

User Interviews Inc.

Participant-recruitment platform with AI session features.

Respondent

Respondent.io Inc.

B2B participant-recruitment platform for user research.

pay-per-participant

UserTesting

UserTesting Inc.

User-testing platform with AI insights and transcription.

Lookback

Lookback Inc.

Live and async user-testing platform for product teams.

from $25/mo

PlaybookUX

PlaybookUX Inc.

Remote-user-research platform with AI session analysis.

pay-per-session

Orum

Orum Inc.

Parallel dialer for outbound sales with conversation analytics.

Nooks

Nooks Inc.

AI-powered parallel dialer and sales platform.

Uniphore Technologies Inc.

Kixie PowerCall

Kixie Inc.

Cloud-phone and SMS platform with AI features for sales teams.

from $35/user/mo

Dialpad Meetings

Dialpad Inc.

Video-conferencing product from Dialpad with AI recap.

free / from $15/user/mo

Uniphore

Conversational AI for customer experience across voice and video.

Verint Customer Engagement

Verint Systems Inc.

Customer-engagement and contact-center AI platform.

Calabrio ONE

Calabrio Inc.

Workforce management and contact-center AI suite.

CallRail

CallRail Inc.

Call-tracking and AI conversation-intelligence for SMB.

from $50/mo

Ema

Ema Unlimited Inc.

Universal AI employee for enterprise workflows.

Decagon

Decagon AI Inc.

Generative-AI customer-support agents.

Sierra

Sierra.ai Inc.

Conversational-AI platform for consumer brands.

Instaminutes

AI meeting summarizer for short, accurate recaps.

free / from $9.99/mo

MeetingPulse

MeetingPulse Inc.

Audience-engagement and Q&A platform with AI summaries.

free / from $39/mo

Slido

Cisco Webex (Slido)

Audience-interaction platform with AI summary of meeting Q&A.

Airtame Rooms

Airtame ApS

Hardware-based meeting-room solution with AI features.

hardware + subscription

Logitech Rally Bar AI

Logitech International

Logitech meeting-room camera with AI features for hybrid meetings.

hardware purchase

Neat

Neat AS

All-in-one video bar and meeting-room hardware.

hardware purchase

Castmagic

Turn long-form audio into show notes, clips, tweets, and newsletters in one upload.

Podcastle

Podcastle AI

Browser DAW for podcasts with AI voice clones and one-click cleanup.

Cleanvoice.ai

Cleanvoice

Automatic remover of ums, mouth sounds, dead air, and stutters from podcast tracks.

Auphonic

Automatic audio leveling, loudness normalization, and noise reduction for podcast post.

Adobe Podcast Enhance

Free web tool that makes any voice recording sound like it was tracked in a studio.

Krisp

Real-time noise, voice, and echo cancellation on any call or recording.

Alitu

The Podcast Host

Hands-off podcast maker — drag in raw tracks, get a leveled, intro-stitched episode.

Resonate Recordings

Human-edited podcast post-production with optional AI assist.

Buzzsprout

Podcast host with built-in AI transcripts, magic mastering, and episode chapters.

Spotify for Podcasters

Spotify's free podcast host with AI Voice Translation and auto-transcripts.

Headliner

SpareMin

Turn podcast audio into shareable audiograms and waveform videos.

Wavve

Audiogram generator with templated designs for podcast promo clips.

Castos

Podcast host with WordPress integration and private feed support.

Transistor.fm

Transistor

Modern podcast host built for networks — unlimited shows on one account.

Captivate.fm

Captivate

Growth-focused podcast host with marketing tools built in.

Podbean

Podcast host with live streaming, monetization, and AI episode notes.

Spreaker

iHeartMedia

Podcast host and live audio platform owned by iHeart with a programmatic ad network.

Acast

Enterprise podcast host with monetization and global ad sales.

Megaphone

Spotify's enterprise podcast publishing and ad-insertion platform.

Simplecast

SiriusXM

Podcast host owned by SiriusXM with strong embeddable players.

Libsyn

The original podcast host, still running shows that started in the 2000s.

Pinecast

Indie-favorite podcast host with technical features and fair pricing.

RedCircle

Free podcast host with cross-promotion and host-read ad network.

Veed.io

Veed

Browser video editor with auto-subtitles in 100+ languages.

Pictory

Turn blog posts and long videos into branded short clips with auto-captions.

Vidcap

Bulk subtitle and caption generator for video creators.

Captions

Mobile-first captioning app for creators recording on a phone.

Munch

Long-form video → short-form clips with social-trend awareness.

Klap

AI shorts generator — paste a YouTube URL, get vertical clips with captions.

Vizard

AI clipping tool with face-tracking auto-reframe.

ChopCast

Repurpose long webinars and podcasts into clips, blog posts, and quote graphics.

Zubtitle

Add captions, headlines, and resize video for social — one upload.

Kapwing

Browser video editor with strong meme, subtitle, and team-collaboration tooling.

ScriptMe

AI transcription + caption studio with translation in 90+ languages.

Reduct.video

Reduct

Transcript-based video editing for research, user interviews, and journalism.

Vimeo Create

AI video creation and captioning inside the Vimeo platform.

Animoto AI

Animoto

Slide-based video creation with AI scripts and captioning.

Lumen5

Convert blog posts to social videos with auto-captions and stock B-roll.

InVideo AI

InVideo

Prompt-to-video generator with stock library and AI voiceover.

Steve.ai

AI text-to-animation and live-action video generator.

HeyGen

AI avatar video generator with talking-head cloning and 100+ language dubs.

Highlight by Marvin

Marvin

Find and clip the most highlight-worthy moments from podcast and interview footage.

Repurpose.io

Automated pipeline to publish one video to every social platform.

Tweet Hunter Repurpose

Lempire

AI-driven Twitter / X scheduling with video-clip-to-thread repurposing.

Crayo

AI clip-generator targeting faceless TikTok and Reels.

Rev Captions

Rev's human-grade captioning service with SRT/VTT/SCC delivery.

3Play Media

Enterprise captioning, transcription, and audio-description service.

Participatory Culture Foundation

Amara

Open subtitling platform run by the Participatory Culture Foundation.

Subly

Subtitle, translate, and edit captions for social video in 70+ languages.

Maestra AI

Maestra

Transcription, subtitling, voiceover, and dubbing in 125+ languages.

Notta Translate

Notta

Real-time meeting and video translation across 50+ languages.

Cielo24

Enterprise captioning and video data services for media and education.

StreamText

Live captioning delivery network for events and webinars.

SubtitleBee

Browser auto-subtitle and translation tool for social video.

Type Studio

Transcript-based video editor with multi-language subtitle output.

Auphonic Multitrack

Auphonic

Multi-track leveling and crosstalk reduction for podcast editors.

Castmagic Clips

Castmagic

Castmagic's short-form clip generator built on its podcast transcript output.

Castos Repurpose

Castos

Castos host's AI clip + show-note generator for hosted episodes.

Tweet Hunter Repurpose Mode

Lempire

Drop a video URL → get pre-drafted X threads in your voice.

Castmagic Newsletter

Castmagic

Auto-drafted podcast-driven email newsletters.

Limecraft

Production management with AI transcription for broadcast workflows.

TranscribeMe

Human-grade transcription service with HIPAA + enterprise options.

Scriptix

Speech-to-text platform aimed at broadcast and media in EU markets.

Noteit.ai

Noteit

Meeting bot with focused share-with-anyone summary links.

Happyscribe Subtitle Editor

Happyscribe

Frame-accurate subtitle editor inside the Happyscribe transcription product.

EZsub

Cloud subtitle workflow for educators and small media teams.

Checksub

AI subtitle and dubbing platform with team workflows.

Splento AI

Splento

Event video service with AI editing and captioning.

Vodalus

AI workflow for broadcast captioners and accessibility teams.

Rev Translated Captions

Rev's translated subtitle service across 16+ language pairs.

DubDub.ai

DubDub

AI dubbing platform with lip-sync for video translation.

Rask AI

AI video translation and dubbing platform.

ElevenLabs Dubbing

ElevenLabs' video dubbing layer on top of its voice model.

Blakify

AI voiceover platform with multi-language output for podcasts and videos.

Play.ht

PlayHT

AI voice generator favored for podcast and audiobook workflows.

WellSaid Labs

Enterprise AI voiceover platform with consented voice avatars.

Murf AI

Studio-grade AI voice generation with timeline editor.

Speechgen.io

Speechgen

Cheap AI voice generator with broad language coverage.

Studio-grade AI voice generation and cloning.

Vidnoz AI

Vidnoz

AI avatar video generator with talking-head templates.

Synthesia

Enterprise AI avatar video platform with translation.

DeepBrain AI Studios

DeepBrain AI

AI avatar video platform popular in APAC enterprises.

Elai.io

AI avatar video tool with text-to-video and templates.

Colossyan

Multi-avatar dialogue video for corporate L&D.

Veed Auto-Subtitles

Veed

Veed.io's standalone auto-subtitle workflow.

Kapwing Subtitler

Kapwing

Kapwing's stand-alone subtitle workflow.

Transkriptor

Affordable AI transcription with strong Turkish and European-language coverage.

SpeechTexter

Free browser-based dictation tool using browser speech APIs.

Podscribe

Podcast measurement and attribution platform — transcripts power the analytics.

Magellan AI

Podcast advertising intelligence built on transcript analysis.

Podchaser Insights

Acast

Podcast database with transcript-powered creator and listener intelligence.

Audiogram by Headliner

SpareMin

Headliner's automated audiogram workflow for podcast snippets.

Veed Live Captions

Veed

Veed's live caption layer for streaming and recording.

Loom AI Captions

Loom (Atlassian)

Loom's auto-caption layer for screen recordings.

Vidyo.ai

AI clipping and short-form video repurposing for creators.

YouTube Studio Auto-Captions

YouTube Creator Studio's built-in auto-caption track.

Vimeo AI Captions

Vimeo's auto-caption layer for hosted videos.

Panopto ASR

Panopto

Panopto's auto-captioning for higher-education and enterprise video.

Kaltura REACH

Kaltura

Kaltura's enterprise auto-captioning for video platforms.

Rev Live Captions

Rev's live captioning service for Zoom and Webex.

LEXI Live

AI-Media's live broadcast captioning engine.

Spinach.io

Spinach

AI standup notetaker for engineering teams.

Krisp AI Meeting Notes

Krisp's AI meeting-notes layer paired with its noise cancellation.

Rev AI Notetaker

Rev's meeting-bot notetaker built on Rev AI transcription.

Krisp Live Captions

Krisp's on-device live caption feature for calls.

Cogi

Mobile voice notetaker with retroactive recording.

Otter Meeting GenAI

Otter's question-and-answer layer over recorded meetings.

Krisp Voice Privacy

Krisp's voice privacy filter for call agents.

Fireflies Soundbites

Fireflies.ai

Fireflies' clip-extraction layer for meeting highlights.

Magic Mastering by Buzzsprout

Buzzsprout

Buzzsprout's one-click mastering for hosted episodes.

Blubrry

RawVoice

Long-running podcast host with WordPress PowerPress plugin.

Fusebox

Embeddable podcast player and player-network host.

RSS.com

Affordable podcast host with monetization and AI features.

Transistor Private Podcasts

Transistor

Transistor's private podcast feature for B2B and internal podcasts.

Supercast

Premium podcast subscription platform with private RSS.

Anchor (legacy)

Spotify's prior podcast brand, now redirected to Spotify for Podcasters.

Soundtrap

Spotify's browser DAW with collaborative podcast tools.

Soundwise

Audio learning and private podcast platform.

Fable Studio

AI tools for emotive voice and character dialogue.

Boombox Studio

Boombox.io

AI sound design and audio post platform.

Soundverse

AI music and sound bed generator for video creators.

Musicfy

AI music + voice generation aimed at creator workflows.

Suno

AI song generation with vocals and instruments from a prompt.

Udio

AI song generation with prompt-driven vocals and arrangement.

Soundraw

Royalty-free AI music for creators and video producers.

Videoleap

Lightricks

Lightricks' mobile video editor with AI features.

CapCut

ByteDance

ByteDance's free cross-platform editor with auto-captions and AI tools.

Lightcut

Ulike Tech

Mobile AI editor for travel and creator video.

Microsoft Clipchamp

Microsoft's browser video editor with auto-captions and Speaker Coach.

Wisecut

AI video editor that cuts silences and auto-captions long-form video.

Speakflow

Teleprompter app for creators recording on a phone or laptop.

Nuance Dragon Medical One

Cloud-based medical speech recognition for clinicians, owned by Microsoft.

Microsoft DAX Copilot

Ambient AI clinical documentation — listens to the visit, writes the note.

paid plans · contact sales

Iris Medical

Vendor-specific AI scribe — verify availability and BAA status before relying on it.

unknown · contact vendor

Nabla Copilot

Nabla Technologies

Ambient AI assistant for clinicians — Paris-founded, used in EU and US.

free trial / paid plans

Tali AI

Canadian AI scribe and medical voice assistant for primary care.

Robin Healthcare

AI medical scribe with virtual scribe overlay for orthopedics and specialty care.

Sayvant

Generative AI scribe targeted at emergency medicine and urgent care.

free for verified clinicians

Doximity Scribe

Doximity

Free AI medical scribe inside the Doximity clinician network app.

Mutuo Health

Mutuo Health Solutions

AI clinical documentation for Canadian and Australian healthcare.

paid plans · contact sales

Sully AI

AI clinical assistant that handles documentation, coding, and tasks for clinicians.

Lindy Health

Lindy

Healthcare-vertical instance of the Lindy AI agent platform.

paid plans

Noted AI

AI scribe and note-generation product for clinicians.

discontinued — see DAX Copilot

Saykara (legacy)

Nuance (Microsoft)

Early ambient AI scribe — acquired by Nuance in 2021.

Modernizing Medicine ScribeAI

Modernizing Medicine

AI scribing built into the ModMed EMA EHR for specialty practices.

EMA add-on · contact sales

eClinicalWorks Scribe (Sunoh.ai)

eClinicalWorks

AI medical scribe bundled with the eClinicalWorks EHR.

EHR add-on · contact sales

athenahealth AI Notes

athenahealth

AI documentation features inside the athenahealth EHR.

EHR add-on · contact sales

Epic Stage / In Basket AI

Epic Systems

AI documentation features built into the Epic EHR.

EHR add-on · contact sales

AVA Health (formerly AVA Speech)

AVA Inc.

Healthcare instance of AVA's live-captioning platform.

Veritone Legal (Illuminate)

Veritone Inc.

AI-powered legal transcription and evidence-search platform.

vTestify

Remote deposition and video testimony platform with built-in transcription.

per-deposition pricing

Otter for Education

edu discount · from $4.99/mo student

Education tier of Otter.ai — classroom and lecture transcription.

Glean (Sonocent successor)

Glean Education

Note-taking app for students that records the lecture and structures the notes.

from ~$10/mo · institutional plans

Notability Audio Recording

Ginger Labs

Audio recording inside the Notability note-taking app on Apple devices.

from $14.99/yr

AudioNote

Luminant Software

Linked-notes-and-audio app for iOS, Android, Windows, and Mac.

from $9.99 one-time / subscription

Ava for Education

AVA Inc.

AVA's classroom and group-discussion live-captioning product.

Microsoft Translator (Education)

Free Microsoft live-translation app with classroom mode for teachers.

Google Live Transcribe (Education)

Free Android live-transcription app used in education accessibility settings.

ATLAS.ti AI Transcription

VoicePad Pro

Mobile-Magic

Education and accessibility dictation product.

paid one-time

NVivo Transcription

Lumivero

AI transcription add-on for the NVivo qualitative analysis software.

credits · ~$0.90/min

ATLAS.ti Scientific Software GmbH

AI transcription inside the ATLAS.ti qualitative research software.

license · AI credits

Trint Education

edu pricing · contact sales

Education tier of the Trint AI transcription platform.

Sonix for Education

Sonix, Inc.

Education tier of Sonix's AI transcription platform.

edu pricing · per-hour credits

Ava

AVA Inc.

Live captioning app for deaf and hard-of-hearing users — group conversations.

Google Live Caption

System-wide live captions on Android, Chrome, and Pixel devices.

Microsoft Live Captions

free · built into Windows 11

Windows 11 system-wide live captioning for any audio on the device.

Apple Live Captions

System-wide live captions on iOS, iPadOS, and macOS.

RogerVoice

Live captioning and relay calling app for deaf and hard-of-hearing users.

XREAL / Nreal AR Captions

MMR Translate

Live captioning and translation product targeted at conferences and events.

per-event pricing

SpeakSee

Multi-microphone live captioning hardware + app for the deaf community.

hardware + subscription

InnoCaption

Free real-time captioned telephone service for hard-of-hearing US users.

free · FCC-funded

Olelo

Multilingual live captioning and translation product (verify availability).

contact vendor

Hello Eye / Hello AI

Hello Eye

Smart-glasses live captioning product (verify availability).

contact vendor

XREAL

AR-glasses live captioning experiences from XREAL (formerly Nreal).

hardware $399+

Sorenson Buzz Cards / Apps

Sorenson Communications

Apps and tools for the deaf community from Sorenson Communications.

free for verified deaf users

EEG Enterprises iCap / Falcon

EEG Enterprises (Ai-Media)

Broadcast captioning hardware, encoders, and AI captioning software.

Caption.Ed

Lecture-capture and live-captioning tool for higher education and workplaces.

Temi

Audio2Text (Speech-to-Text apps)

Rev's AI-only $0.25/min transcription product.

$0.25/min

Transcribe by Wreally

Wreally

Manual transcription playback tool for journalists and researchers.

from $20/yr

Whisper Memos

Apple-platform voice memo app that uses OpenAI Whisper for transcription.

from $2.99/mo

Generic name for several mobile audio-to-text apps.

iA Writer Dictation

iA Inc.

Voice dictation inside the iA Writer minimalist writing app.

from $9.99 one-time

Dragon Anywhere

Mobile dictation app from Nuance for iOS and Android.

from $14.99/mo

Speechnotes

Free browser-based voice typing notepad.

Dictation.io

Digital Inspiration

Free browser dictation tool with simple UI.

Apple Voice Control

Apple's accessibility voice control on macOS, iOS, and iPadOS.

Apple Dictation

Built-in OS dictation across iOS, iPadOS, and macOS.

Windows Speech Recognition

free · built into Windows

Built-in Windows dictation and OS voice control.

Windows Voice Access

free · built into Windows 11

Modern AI-driven dictation and voice OS control built into Windows 11.

Google Voice Typing (Docs)

free · built into Google Docs

Voice typing inside Google Docs and Google Slides.

Gboard Voice Typing

Voice typing in the Gboard keyboard on Android and iOS.

free · built into Gboard

ListNote

Khimaera

Free Android voice-typing notepad.

Voice Notebook

Web and Android voice typing tool with translation.

EzDictation

Free browser dictation utility.

AudioPen

Speak-to-edit app that turns ramble into clean structured text.

free / from $6.50/mo

Otter Bot Voice

Otter.ai's mobile voice-recording surface.

SpeakNotes

Mobile voice-to-text note app.

VoiceIn Voice Typing

Chrome extension that adds voice typing to any text field on any website.

free / Pro $14.99/mo

Web-based dictation notepad with cloud sync.

free / Pro $14.99/mo

TypeTalk

Mobile dictation utility (verify publisher).

Microsoft Dictate (Office 365)

Notta Dictate

Notta

Notta's dictation feature for voice typing on mobile.

from $13.99/mo

Readwise Voice Memos

Readwise

Voice-memo capture in the Readwise reading and knowledge product.

from $7.99/mo

Microsoft 365 subscription

Voice dictation inside Word, Outlook, PowerPoint, and OneNote.

Trint for Newsrooms

Trint's newsroom-targeted product tier.

Reduct Newsroom

Reduct.Video

Reduct.Video's newsroom tier for journalism teams.

ScribeUp

Web transcription service for journalists.

per-hour rates

Verbz

Mobile-first AI transcription app for journalists and creators.

free / Pro plans

Sonix for Newsrooms

Sonix, Inc.

Sonix's newsroom tier targeted at broadcast and digital news.

per-hour credits

Scribee

Mobile transcription product for journalists.

free + IAP

Descript for Journalism

Descript

Descript's journalism use case (existing tool entry for reference).

see descript entry

Recordr / Recordly

Mobile recording-and-transcription apps used by journalists.

Simbo AI

AI medical voice assistant — appointment booking and clinical voice tasks.

NoteTracker

Medical transcription compliance and audit tool.

VoiceTag

Medical voice macros and dictation utility (verify publisher).

varies

e-MDs Dictation

CompuGroup Medical

Dictation built into the e-MDs / CompuGroup ambulatory EHR.

Greenway Intergy Dictation

Greenway Health

Dictation features inside the Greenway Health EHRs.

Tebra (Kareo) Voice Notes

Tebra

Voice notes inside the Tebra (Kareo + PatientPop) EHR.

DrChrono AI Scribe Marketplace

DrChrono (EverHealth)

AI scribe partnerships inside the DrChrono EHR marketplace.

Praxis EMR (text-templates)

Infor-Med

Praxis EMR's concept-based note generation — adjacent to voice dictation.

EHR license

Ambient Clinical Intelligence (M*Modal)

Solventum (3M)

Solventum (3M) ambient clinical intelligence built on the M*Modal engine.

3M Fluency Direct

Solventum (3M)

Clinical front-end speech recognition product.

InScribe (Higher Ed)

InScribe

Higher-ed support platform with transcription features (verify).

Kaltura REACH

Kaltura

Education video platform's transcription and captioning add-on.

Panopto Captions

Panopto

Captioning and transcription built into the Panopto video platform for education.

YuJa Captioning

YuJa

AI captioning inside the YuJa video platform for higher education.

Echo360 Captioning

Echo360

Captioning inside the Echo360 lecture-capture platform.

Microsoft Teams Transcription

Zoom Live Transcription

Zoom Communications

Built-in live transcription and captioning in Zoom meetings.

free for paid Zoom plans

Microsoft 365 subscription

Built-in live transcription and meeting recap in Microsoft Teams.

Google Meet Captions

Olympus Dictation Management System

Built-in live captions and Gemini-powered notes in Google Meet.

Workspace subscription

Webex Real-time Captions

Cisco

Live captions and post-meeting transcript in Cisco Webex.

Webex subscription

Riverside.fm Transcripts

Riverside

Automatic transcripts inside the Riverside.fm podcast and video studio.

from $15/mo

Podcastle Transcription

Podcastle

Built-in transcription in the Podcastle podcast creation platform.

from $11.99/mo

Philips SpeechLive

Philips Speech

Cloud dictation and transcription workflow for legal and corporate dictation.

from ~$15/mo per seat

Olympus / OM System

Olympus's professional dictation workflow software for legal and medical.

Yitu Speech

Yitu Technology

Yitu Tech speech recognition — Mandarin, dialects, and far-field microphone arrays.

tiered · cloud API pay-as-you-go + on-premise license tiers

AmiVoice

Advanced Media

Advanced Media's AmiVoice — Japan's longest-running enterprise speech recognition family.

Fujitsu LiveTalk

Fujitsu

Fujitsu LiveTalk — Japanese real-time captioning for meetings and classrooms.

Selvas Selvy STT

Selvas AI

Selvas AI's Selvy speech recognition — Korean and English ASR for media, finance, and government.

Skit.ai

Skit.ai (formerly Vernacular.ai) — voice AI for collections and Indian-language call automation.

tiered · contact sales for current plans

Slang Labs

Slang Labs — in-app multilingual voice assistant SDK with Indic-language ASR.

MTS AI VoiceTech

MTS AI

MTS AI VoiceTech — Russian-language ASR and voice biometrics from telecom operator MTS.

Speech Technology Center (STC)

Speech Technology Center

STC (Sankt-Peterburg) — long-running Russian speech and biometrics vendor, formerly STC-innovations.

VoiceInteraction VoxSigma

VoiceInteraction

VoiceInteraction VoxSigma — Portuguese-strong multilingual broadcast transcription.

Vocapia VoxSigma

Vocapia Research

Vocapia Research VoxSigma — multilingual broadcast and call-centre transcription from LIMSI heritage.

Verbio

Verbio Technologies

Verbio Technologies — Spanish-strong multilingual ASR and voice biometrics from Barcelona.

Phonexia

Phonexia — Czech speech and voice-biometrics vendor focused on government and forensic use.

Tilde ASR

Tilde

Tilde — Baltic-language NLP vendor with Latvian, Lithuanian, and Estonian speech recognition.

Lingsoft

Lingsoft — Finnish language-services group offering Nordic-language ASR and dictation.

tiered · published per-minute and subscription plans on site

Speechmore

Speechmore — Italian-language transcription product for journalists and professionals.

Vocally

Vocally — French-first transcription tool for journalists and researchers.

tiered · per-minute pricing in EUR

VocTroLabs

VocTroLabs — speech and subtitling research lab from Universitat Politècnica de Catalunya.

Cedat 85

Cedat 85 — Italian speech-to-text and stenography vendor for parliaments and media.

tiered · per-minute pricing in USD

Rumi Arabic Speech

Rumi

Rumi — Arabic-first transcription product tuned for Egyptian, Levantine, and Gulf dialects.

Mawdoo3 / Mowjaz

Mawdoo3

Mawdoo3 — Jordan-based Arabic AI lab building Salma voice assistant and ASR research.

tiered · per-minute Arabic transcription pricing

Kalam.ai

Kalam — Arabic speech-to-text product for journalists and researchers.

VoxLab

VoxLab — Brazilian Portuguese speech recognition for journalists, agencies, and businesses.

tiered · per-minute pricing in BRL

TranscribeMe LATAM

TranscribeMe

TranscribeMe Latin-America operations — Spanish-and-Portuguese human + automatic transcription.

tiered · per-minute pricing in USD

Voiceitt

Voiceitt — non-standard-speech recognition for people with speech impairments.

tiered · subscription pricing on site

VoxQube

VoxQube — Urdu and South-Asian language transcription research and services.

Navana Tech

Navana Tech — Bengali-language conversational AI and ASR from Bangladesh.

ASR Yandex on-prem

Yandex Cloud

Yandex SpeechKit on-premise build — Russian-language ASR for isolated networks.

tiered · freemium with premium subscription on app stores

Tarteel AI

Tarteel — Qur'anic Arabic recitation recognition for memorisation and tajweed feedback.

Armada AI

Armada — Russian-language conversational and contact-centre AI with embedded ASR.

tiered · published per-minute pricing in EUR

Speakable PT

Speakable

Speakable — European-Portuguese-tuned transcription for Iberian newsrooms.

Bertin IT

Bertin IT MediaSpeech — French defence and intelligence multilingual transcription suite.

Syllable Health (multilingual)

Syllable

Syllable — patient-facing healthcare voicebots with Spanish, English, and Mandarin support.

Convai LATAM

LATAM voice-AI vendors

Convai-style Spanish voice automation tuned for Mexican and Central-American customer service.

Phonexia LATAM

Phonexia

Phonexia's Latin-American operations — Spanish-language voice biometrics and STT.

Vivoka

Vivoka — French embedded voice-AI vendor with offline multilingual ASR.

tiered · subscription plans in USD

Speak AI Arabic

Speak AI

Speak AI — Arabic-and-multilingual research-grade transcription with NLP insights.

Rakuten AIris

Rakuten

Rakuten AIris — Japanese-language speech recognition from Rakuten Institute of Technology.

tiered · per-minute pricing in JPY

Onsei

Onsei — Japanese-language web transcription for individual professionals and researchers.

Air.ai

Air AI

Conversational AI claimed to hold 10-40 minute human-like phone calls.

PolyAI

Enterprise voice assistants for contact centers in hospitality, banking, and retail.

Stack AI

No-code platform for building AI workflows including voice agents and assistants.

Replicant

Contact-center voice AI that autonomously resolves routine customer calls.

Hyro

Plug-and-play conversational AI for healthcare and enterprise voice + chat.

Kore.ai Voice

Kore.ai

Enterprise conversational AI platform with first-class voice and IVR support.

Cognigy Voice

Cognigy

Enterprise contact-center voice AI from Cognigy.AI conversational platform.

Mindsay

Conversational AI for travel and customer service automation.

Yellow.ai Voice

Yellow.ai

Voice AI from Yellow.ai's dynamic automation platform for enterprise CX.

Avaamo

Conversational AI platform for healthcare, banking, and enterprise voice + chat.

Druid Voice

Druid AI

Voice agent capability layered on the Druid AI conversational automation platform.

Inya.ai

Gnani.ai

Generative voice AI agents from Gnani.ai for enterprise CX.

SmartAction

AI-powered voice and chat agents for enterprise contact centers.

Aimee.ai

Aimee AI

Voice AI assistants for inbound and outbound business calls.

Toma

AI voice agents for auto dealerships and service-based businesses.

Helium AI

Helium

Voice AI agents for customer service and sales workflows.

MagicAI

AI agents including voice, chat, and content automation in one platform.

Charlie

11x.ai

AI sales development representative making outbound voice calls.

Sayhi

AI voice agents for customer support and lead conversion.

VoiceGenAI

Generative voice AI platform for outbound and inbound business calls.

Pragmatic Voice

Voice AI agents and contact-center automation.

Echo AI

Conversation intelligence and voice-AI insights for customer-experience teams.

Curious Thing

Voice AI for inbound and outbound customer engagement.

Spitch Voice

Spitch

Speech analytics, biometrics, and voice bots for European enterprises.

AnswerForce Voice AI

AnswerForce

Hybrid AI plus human virtual receptionists for small businesses.

Goodcall

AI phone agents for service-based small businesses.

Daily Bots

Daily.co

Hosted RTVI bots — Daily.co's managed runtime for Pipecat voice agents.

JustCall AI

JustCall

Sales-focused cloud phone with AI transcription, coaching, and agent assist.

CloudTalk AI

CloudTalk

AI features for CloudTalk's cloud-based call center software.

RingCentral RingSense (Voice)

RingCentral

AI conversation intelligence across RingCentral's UCaaS and CCaaS platform.

Rasa Pro Voice

Rasa

Commercial Rasa offering with CALM dialog and enterprise voice connectors.

Five9 Intelligent Virtual Agent

Five9

Five9's contact-center voice and chat virtual agent product.

NICE CXone Mpower

NICE

NICE's AI-orchestrated CX platform with voice virtual agents and Enlighten AI.

Genesys Cloud Voice AI

Genesys

Genesys Cloud's voicebot, agent assist, and AI experience orchestration.

Amazon Connect + Amazon Q in Connect

AWS Amazon Connect contact center with Q-powered agent assist and bots.

Interactions LLC

Managed conversational AI for enterprise voice and chat.

Uniphore Voice AI

Uniphore

Conversational AI and automation across contact-center voice and chat.

Openstream EVA

Openstream.ai

Multimodal conversational AI platform for enterprise voice and chat.

Tovie AI

Conversational AI platform with voice and chat for enterprises.

VoiceOwl.ai

VoiceOwl

Generative AI voice agents for B2B sales and marketing.

Rosie

Rosie.ai

AI answering service and virtual receptionist for small businesses.

Deepconverse

AI customer-service agents across voice, chat, and self-service.

Ada Voice

Ada

Ada's brand interaction platform extended to voice channels.

Forethought Voice

Forethought

Generative AI for customer support with voice and chat channels.

Convore

AI conversation agents for B2B revenue teams.

Voxalize

Voice AI agents tuned for non-English markets.

Swyx Voice (Cognigy AI Copilot)

Cognigy

Generative agent-assist layer for human contact-center agents.

Talkdesk Autopilot

Talkdesk

Talkdesk's generative AI voice-and-chat virtual agent for contact centers.

Thoughtful AI

Thoughtful Automation

Healthcare-focused AI agents for revenue-cycle and patient communications.

Infinitus Systems

Infinitus

Voice AI agents that handle benefits-verification and prior-auth calls in healthcare.

Kasisto KAI

Kasisto

Conversational AI platform purpose-built for banking and wealth management.

Rulai (Conversica family)

Conversica

Enterprise voice and chat AI agents from Conversica.

Salesforce Einstein Service Agent

Salesforce

Salesforce's autonomous AI agent for service, with voice and chat channels.

HubSpot Breeze AI (Voice surface)

HubSpot

HubSpot Breeze customer agents extending into voice channels.

AI-Media LEXI

Automatic live captioning for broadcast, news and live events — flagship of AI-Media's LEXI family.

AI-Media LEXI Local

On-prem variant of LEXI for stations that cannot send audio to the cloud.

AI-Media LEXI Tool

Web-based agent for live event captioning operators, paired with the LEXI engine.

AI-Media iCap Translate

Real-time caption translation overlay for the iCap broadcast caption network.

AI-Media iCap

Cloud caption delivery network for live broadcast, the transport layer behind LEXI.

AI-Media Alta

IP caption encoder for SMPTE 2110, NDI and SRT broadcast facilities.

AI-Media (EEG Enterprises)

EEG Falcon

Server-based automatic captioning appliance — the predecessor product line that became LEXI.

AI-Media (EEG Enterprises)

EEG enCaption

Self-contained automatic caption appliance for live linear TV — first-gen broadcast ASR.

AI-Media (EEG Enterprises)

EEG HD492 iCap Encoder

Classic SDI caption encoder used across thousands of US TV master controls.

EEG Lexi DR (Direct Recall)

Glossary-aware automatic captioning trained on a station's own proper-noun list.

ENCO enCaption

ENCO Systems

Automatic caption appliance for radio and TV — a different enCaption from EEG's.

Telestream Vantage Timed Text

Telestream

Server-side caption automation inside the Vantage media-processing platform.

1CapApp

Caption-delivery platform used by independent live captioners to bill, deliver and embed captions.

freemium · paid plans contact sales

Ava

Live captioning app for in-person meetings and small events, built for Deaf and hard-of-hearing users.

Captionsmart

AI live captioning for SaaS meetings and webinars with a focus on enterprise accessibility.

Cielo24 Compliance (ECC)

Cielo24

Accessibility-compliance tier of Cielo24, aimed at ADA / Section 508 / WCAG 2.1 deliverables.

service · contact sales

OOONA

Cloud subtitle authoring and project-management suite used by media-localisation vendors.

Limecraft Flow

Limecraft

Cloud media-logistics platform with built-in transcription and subtitle authoring.

MediaSilo Captions

Shift Media (MediaSilo)

Caption-review and proof functionality bolted onto the MediaSilo review-and-approval platform.

ZOO Subs

ZOO Digital

Cloud subtitle authoring suite from ZOO Digital, used across Hollywood OTT delivery.

Screen Subtitling Systems

Screen Polistream

Subtitle playout server for live and file-based broadcast distribution.

ENCO Comprompter

ENCO Systems

Newsroom prompter/script system that frequently feeds the captioning pipeline.

Inscriber CG (Ross Inscriber)

Ross Video

Broadcast character generator with caption overlay support — legacy of Inscriber Technology.

Switchboard Live Captions

Switchboard Live

Live captioning add-on for the Switchboard Live multistreaming platform.

BoxCast Live Captions

BoxCast

Automatic captions for the BoxCast live-streaming platform, popular with churches and schools.

Vbrick AI Captions

Vbrick

Enterprise video platform with built-in AI captioning, popular for internal communications.

Kaltura Live Captions

Kaltura

Caption support inside Kaltura's video platform for higher-ed and enterprise.

Brightcove Auto Captions

Brightcove

Automatic captioning service inside Brightcove's Video Cloud platform.

JW Player Captions

JW Player

Caption pipeline inside JW Player's video platform with optional ASR.

Cincopa AI Captions

Cincopa

Auto-caption feature inside the Cincopa video-hosting platform.

IBM Watson Captioning

IBM

Legacy IBM-branded captioning offering on top of Watson Speech to Text.

US Courts Audio/Video Conferencing System

Administrative Office of the US Courts

Federal courts' AV system that increasingly bolts on AI captions for hearings.

government internal

JAVS

Court audio/video recording platform increasingly paired with AI transcription.

For The Record (FTR)

For The Record

Court recording platform widely deployed across US, UK and Australian court systems.

Sorenson NXG (Caption Relay)

Sorenson

Sorenson's next-generation captioned-call and relay-services platform.

free for qualified US users (FCC TRS Fund)

accessiBe (audio/video angle)

accessiBe

WCAG overlay platform whose video module wraps third-party caption ASR.

AudioEye (captioning angle)

AudioEye

Accessibility platform that integrates captioning and transcript services.

UserWay (captioning angle)

UserWay

Accessibility widget vendor with optional caption and transcript services.

AI-Media Smart Lexi

LEXI variant tuned for enterprise events that prioritises high precision over speed.

AI-Media LEXI Toolkit

Glossary, profile and workflow management portal for LEXI customers.

Daily Live Captions

Daily

Built-in live captioning on the Daily video API platform.

Microsoft Translator Presenter mode

free · enterprise via Microsoft 365

Browser-based live caption + translation overlay for in-person presentations.

Microsoft Teams Live Captions

bundled · Microsoft 365 plans

Built-in live captioning inside Microsoft Teams meetings and live events.

Google Meet Live Captions

bundled · Google Workspace plans

Built-in live captions inside Google Meet, used widely for accessibility in education.

Webex Live Captions

Cisco

Live caption and translation features inside Cisco Webex Meetings.

bundled · Webex plans

Zoom Live Transcription

Zoom

Built-in Zoom live captioning across meetings and webinars.

bundled · Zoom plans

YouTube Live Auto Captions

YouTube

Auto-generated live captions inside YouTube Live streams.

free with YouTube Live

Facebook Live Captions

StreamYard (Hopin / Bending Spoons)

Auto-captioning support for live broadcasts on Facebook.

free with Facebook Live

LinkedIn Live Captions

Auto-captioning for LinkedIn Live and LinkedIn Events.

free with LinkedIn Live

StreamYard Captions

Live captioning add-on inside the StreamYard browser studio.

Restream Captions

Restream

Live captioning feature inside the Restream multistream and studio product.

Vimeo Live Captions

Auto and manual captioning inside Vimeo's Live and on-demand video products.

Wowza Captions

Wowza

Caption support inside Wowza's streaming-server ecosystem.

Haivision Captions

Haivision

Captioning passthrough on Haivision's broadcast and enterprise video products.

Interact-Streamer

Interact-AS

Live caption viewer used by interpreters and CART writers in European events.

Text-on-Top

Open-source live caption viewer used by Deaf communities and EU accessibility groups.

free / donation

HeyGen Video Translate

HeyGen

Translate a talking-head video into 175+ languages with lip-sync rebuild from the original speaker.

Camb.ai (MARS)

Camb.ai

Speech-to-speech translation in 140+ languages with voice cloning — pitched at live sports + media.

Speak.AI Dubbing

Speak Ai Inc.

Multilingual dubbing layered on top of Speak.AI's transcription + qualitative-analysis workflow.

Wavel

Wavel AI

Subtitle, dubbing, and voice-over in 70+ languages — pitched at marketing teams localizing video ads.

Synthesys Translate

Synthesys

AI dubbing and multilingual voice-over from Synthesys — overlay on their avatar + TTS stack.

Papercup

Human-in-the-loop AI dubbing pitched at premium media — Sky News, BBC Studios, Bloomberg.

Speechify Video Translate

Speechify

Speechify's dubbing add-on — translate video into 20+ languages keeping the original voice.

BlipCut

iMyFone (BlipCut)

Cloud video translation with lip sync, voice cloning, and subtitle export.

InVideo Translate

InVideo

InVideo's video-translation surface on top of its template-based video editor.

Vidqu

Video translator and AI dubbing tool with a freemium tier.

Translate.video

Translate, dub, transcribe, and subtitle videos in 75+ languages — single web workflow.

Words.io (Translated)

Translated

Human + AI translation marketplace from Translated — voice-over and dubbing add-ons.

Dubverse

India-built AI dubbing platform — Indic-language strength.

AI-Media Translate

Captioning + translation services for broadcast and enterprise — Lexi family extension.

3Play Translate

3Play Media

Subtitle translation as a managed service from 3Play Media.

Languify

Speech feedback + multilingual coaching app — pitched at interview / pitch prep.

Play.ai (Voice Agents)

Play (Play.ai)

Conversational voice agents built on PlayHT's voice stack — real-time + multilingual.

Resemble AI

AI voice cloning, multilingual TTS, and real-time voice conversion for media + games.

Listnr

Listnr AI

AI text-to-speech and podcast distribution — 142 languages, voice-cloning API.

Lovo (Genny)

Lovo AI

AI voice generator + video editor pitched at marketing teams — 100+ languages, 500+ voices.

Voxxos

Multilingual AI voice and dubbing platform — smaller-shop alternative to ElevenLabs.

NaturalReader

Natural Reader Inc.

TTS reader for documents, ebooks, and PDFs — long-running with classroom + accessibility deployments.

FakeYou

Community-trained character voices for parody and fan-fiction — Tortoise + custom models.

Microsoft Translator (Live)

Cross-device live conversation translation — share a room code, every device speaks its own language.

Google Translate (Conversation)

Two-language conversation mode in Google Translate — phone-on-the-table interpreter.

iTranslate Voice

iTranslate (Sonico)

Phone-to-phone voice translator — Apple Watch + AirPods integrations.

SayHi Translate

Amazon (SayHi)

Voice-first live translation app — Amazon-owned since 2018.

Vocre

myLanguage

Side-by-side voice translation app with a kid-friendly UI.

TripLingo

Business-travel-focused live translator with cultural-cue overlays.

Smartling (Voice + Multimedia)

Smartling

Enterprise translation-management platform with a multimedia + voice-over add-on.

Phrase (Multimedia)

Phrase

Phrase TMS / Strings with a multimedia localization module.

Lokalise (AV Localization)

Lokalise

Translation-management platform with an AV localization integration layer.

Crowdin (Audio + Video)

Crowdin

Crowdsourced localization platform with audio + video file support.

Mediasite

Sonic Foundry

Lecture-capture pioneer (Sonic Foundry) used in higher-ed, healthcare, and corporate training.

YuJa Enterprise Video Platform

YuJa

Lecture-capture, live-streaming, and video CMS with auto-captioning across 200+ universities.

YuJa Verity

YuJa

Accessibility + captioning add-on for YuJa's video platform.

BoxCast

Live-streaming platform for houses of worship, schools, and athletics with auto-captioning.

Verbit (Higher Ed)

AI + human transcription service marketed to universities for accessibility-grade captions.

Vbrick Rev

Vbrick

Enterprise video platform with transcription used by Fortune 500 L&D and government.

Brightcove for Education

Brightcove

Brightcove's enterprise video cloud configured for university and L&D delivery.

Wowza Streaming Cloud

Wowza Media Systems

Real-time live-stream infrastructure used under many education-video platforms.

Cattura

Cattura Video Solutions

Lecture-capture appliance vendor for universities and government, with caption pipeline.

Stream.Live

Live-stream studio used by educators, conferences, and faith organizations.

BombBomb

Async video messaging platform used heavily in corporate training and customer education.

Canvas Studio

Instructure

Instructure's video platform inside Canvas LMS with auto-captioning and inline comments.

Brightspace + ReadSpeaker

D2L / ReadSpeaker

D2L Brightspace LMS with ReadSpeaker text-to-speech and captioning integration.

Blackboard Ally

Anthology

Accessibility platform from Anthology that auto-captions and audio-describes course materials.

Moodle (AI Caption plugins)

Moodle HQ + community

Open-source LMS with community plugins for ASR captioning and AI transcription.

Sakai (Auto-caption integrations)

Apereo Foundation

Open-source higher-ed LMS with caption integrations via Kaltura and partner ASR.

Schoology Learning

PowerSchool

K-12-focused LMS from PowerSchool with caption-friendly media tools.

itslearning

European K-12 + higher-ed LMS with built-in media and caption support.

Anthology Ally

Anthology

Renamed Blackboard Ally, the accessibility + caption layer across Anthology's LMS portfolio.

Canvas Captions (LTI ecosystem)

Instructure (LTI partners)

Caption tools from the Canvas LTI partner ecosystem (3Play, Verbit, Cielo24, AI-Media).

Coursera Captions

Coursera

Coursera's in-platform machine + community captions across 50+ languages for MOOC video.

Udemy Auto-Captions

Udemy

Udemy's auto-caption pipeline for instructor-uploaded course video.

edX Captions

2U / edX

edX Open edX platform with caption tracks attached to every course video.

FutureLearn Captions

FutureLearn

FutureLearn's MOOC platform with auto + reviewed captions for university partners.

Khan Academy Captions

Khan Academy

Khan Academy's community + machine caption pipeline across 50+ languages.

Pluralsight Captions

Pluralsight

Pluralsight's caption + transcript layer on technology training video.

LinkedIn Learning Captions

Captions and synchronized transcripts on LinkedIn Learning's professional video library.

Udacity Captions

Udacity

Caption tracks across Udacity Nanodegree video content.

MasterClass Captions

MasterClass

Captions + downloadable transcripts on MasterClass celebrity-led courses.

Vyond

Animated training video tool used by corporate L&D, with caption support on output.

Synthesia for L&D

Synthesia

AI avatar video platform widely used for corporate training, with multi-language captioning.

Vimeo Enterprise Captions

Vimeo Enterprise's caption + transcript layer for corporate video portals.

Knowmore

Knowledge management + training video platform with auto-transcription.

WorkRamp

Corporate LMS used for sales-enablement, onboarding, and customer training.

360Learning

Collaborative LMS focused on internal trainers, with caption support on uploaded video.

Docebo

Enterprise LMS used by mid-to-large companies, with caption + AI content tools.

Dovetail

User-research repository with auto-transcription used by UX teams.

Reduct.video Research

Reduct.video

Transcript-based video editor positioned for UX and qualitative research teams.

Trint Research

Trint's enterprise transcription positioned for research and academic teams.

Rev for Researchers

Rev.com

Rev's transcription + caption services positioned for academic and market research.

Khanmigo

Khan Academy

Khan Academy's AI tutor that listens to learners and provides voice-based feedback.

Duolingo Max (Roleplay)

Duolingo

Duolingo's premium tier with AI Roleplay voice conversations and Explain My Answer.

Speak

Speakeasy Labs

AI English-tutor app backed by OpenAI's Startup Fund, focused on speaking practice.

Praktika

Praktika.ai

AI English tutor with avatar-based speaking practice across phone and web.

Talkpal AI

Talkpal

Multi-language AI language tutor with voice conversations.

Loora AI

AI English tutor with phone-call style practice and accent feedback.

Voxy

Enterprise English-language training platform used by global corporations.

Lingmo

Lingmo International

Real-time translation + transcription used by enterprises and educators.

ELSA Speak

ELSA Corp

AI pronunciation coach for English learners, with on-device speech scoring.

Verbit Higher Ed

Verbit's premium captioning + transcription program targeted at universities.

3Play Media Higher Ed

3Play Media

3Play Media's caption services positioned for university accessibility offices.

CaptionAccess

Live and post-production captioning service for higher-ed and conferences.

Rev for Education

Rev.com

Rev.com's education program for universities and K-12 districts.

AI-Media Education

AI-Media's iCap and LEXI live-caption services positioned for higher-ed.

FutureLearn Trainer

FutureLearn

FutureLearn's corporate-training arm with captioned course content for enterprise buyers.

Skillsoft Percipio

Skillsoft

Enterprise learning platform with captions and synchronized transcripts.

Cornerstone Content Anytime

Cornerstone OnDemand

Cornerstone's training-content subscription with captioned video and analytics.

Absorb LMS

Absorb Software

Mid-market LMS with caption support across courses and integrated video tools.

LearnWorlds

Course-creator platform with interactive video and auto-caption support.

Thinkific

Course-creator platform with caption support on course video.

Teachable

Course-creator platform with SRT-based captioning workflow.

Kajabi

All-in-one creator platform with caption-supported video hosting.

Intellum

Enterprise customer-education LMS with captioned video and analytics.

Skilljar

Customer-education LMS used by B2B SaaS, with caption-ready video and SCORM support.

Litmos (Francisco Partners)

Litmos

Mid-market corporate LMS with caption-ready video and SCORM authoring.

CYPHER Learning

AI-driven LMS for K-12, higher-ed, and business, with captioned content.

Instructure Canvas LMS

Instructure

The dominant North American higher-ed LMS, with caption hooks across the partner ecosystem.

Blackboard Learn Ultra

Anthology

Anthology's flagship higher-ed LMS, with Anthology Ally as the caption layer.

D2L Brightspace

D2L

Canadian-rooted higher-ed and K-12 LMS with broad caption-vendor integrations.

TalentLMS

Epignosis

SMB-focused corporate LMS with captioned video and SCORM/xAPI support.

EdApp by SafetyCulture

SafetyCulture

Microlearning platform with captioned mobile-first lessons.

Axonify

Frontline corporate training platform with captioned daily microlearning.

Saba Cloud

Cornerstone OnDemand

Cornerstone Saba enterprise LMS with captioned video and SCORM compliance.

Totara Learn

Totara Learning

Moodle-derived enterprise LMS used in regulated industries, with caption support.

Valamis

Learning experience platform with captioned video and integrated analytics.

Degreed

Learning experience platform aggregating courses with captioned source video.

EdCast (Cornerstone Xplor)

Cornerstone OnDemand

Cornerstone's LXP with captioned video and AI skill mapping.

OpenLearning

Australian MOOC + microcredential platform with captioned course video.

Government of India (MoE)

SWAYAM

India's national MOOC platform with captioned video across school and college courses.

IIT Madras / Government of India

NPTEL

IIT-led video lecture archive with subtitles across thousands of engineering courses.

Duolingo English Test

Duolingo

Voice-and-language proficiency test using on-device ASR for speaking sections.

Replika (Voice)

Luka

AI companion app with real-time voice conversations.

Lectio AI

AI lecture-notes assistant that transcribes class audio and generates study guides.

Fathom for Education

Fathom

Fathom's free Zoom transcription tool used by educators and student organizations.

Minerva Forum

Minerva Project

Active-learning platform from Minerva University with transcript-driven analytics.

Explain Everything

Promethean

Interactive whiteboard for educators with voice-narrated lessons.

Edpuzzle

Video-lesson platform with caption support and embedded quizzes.

Nearpod

Renaissance Learning

Interactive K-12 lesson platform with captioned video and audio responses.

Swivl

Classroom-recording hardware + Reflectivity coaching app with auto-transcription.

Web Captioner

Curt Grimes

Free in-browser real-time captioning powered by your browser's speech recognition.

Chrome Live Caption

Built-in Chrome feature that captions any audio playing in the browser, on-device.

Voice In Voice Typing

Speech Recognition Anywhere

Chrome extension that lets you dictate into any text field on any site.

free / Pro from $39/yr

TalkTyper

Chrome extension with custom voice commands across any site.

free / Pro one-time

TalkTyper

TalkTyper.com

Bookmarkable web page that converts your speech to text in the browser.

Speechnotes (web)

TTSReader Ltd

Free voice notepad with Google Speech accuracy and a Chrome extension counterpart.

free / Premium

SpeechTexter (web)

SpeechTexter

Free in-browser dictation with custom voice commands and 70+ languages.

Transcribe by Wreally (web)

Wreally

Browser transcription editor combining auto-transcription with foot-pedal-style controls.

from $20/yr (sub)

oTranscribe (web)

Elliot Bentley

Free open-source browser transcription pad — no upload, runs locally.

VoiceNote II

Vito Galante

Chrome app for taking voice notes via the browser's speech recognizer.

Note-taking app with built-in dictation across web, iOS, and Android.

free / Pro from $39/yr

TalkPad

Free in-browser dictation pad with autosave and quick share.

Talkie Dictation

Talkie

Chrome extension that speaks back what it transcribed for proofing.

NaturalReader (Chrome extension)

NaturalSoft Ltd

TTS plus dictation companion for the NaturalReader reading platform.

free / Premium from $9.99/mo

Read&Write by Texthelp

Texthelp

Literacy support toolbar with TTS, dictation, and word prediction across browsers.

free trial / Premium (institution-licensed)

Co:Writer Universal

Don Johnston Inc.

Word-prediction and dictation extension for struggling writers.

license-based (school)

ClaroRead Chrome

Claro Software

Accessibility extension with TTS, dictation, screen masking, and color overlays.

free trial / Premium

Google Docs Voice Typing

Microsoft 365 Dictate (web)

Built-in dictation inside Google Docs, accessed under Tools > Voice typing.

free (Google account)

included (Microsoft 365 sub)

Dictation built into Word, Outlook, OneNote in the browser.

Microsoft Word Transcribe

included (Microsoft 365 sub, 5 hours/mo)

Upload audio and get speaker-attributed transcripts inside Word on the web.

Apple Dictation (Safari)

System-wide dictation that works in every text field across macOS, iOS, and Safari.

free (Apple device)

Lex.page (voice)

Every (Lex)

AI writing tool with voice prompts and dictation in the browser.

free (signed-in) / API billed

Grammarly (voice typing)

Grammarly

Grammarly's browser extension pipes dictation into any text field with grammar checks.

free / Premium

AssemblyAI Playground

AssemblyAI

In-browser playground for AssemblyAI's Universal speech model.

Deepgram Playground

OpenAI Whisper Playground

Browser playground for testing Deepgram Nova STT and Aura TTS.

free / API billed

free (signed-in) / API billed

OpenAI's docs playground with file-upload and translation against the Whisper API.

HuggingFace ASR Spaces

Hugging Face

Hundreds of community-hosted browser demos of speech recognition models.

free (community-hosted)

Modal Whisper Demo

Modal Labs

Hosted browser demo of Whisper running on Modal's serverless GPUs.

free demo / Modal compute billed

Replicate Whisper (web demo)

Replicate

In-browser playground for Whisper variants hosted on Replicate.

free demo / Replicate compute billed

RunPod Whisper template

RunPod

One-click Whisper endpoints from a browser dashboard on RunPod GPUs.

GPU billed by minute

Sieve ASR demo

Sieve

Browser playground for Sieve's chained-video ASR pipelines.

free credits / pay-per-run

Whisper Web (Xenova)

Xenova / Hugging Face

Whisper running fully in the browser via Transformers.js — no server.

Whisper WebGPU

Xenova / Hugging Face

GPU-accelerated Whisper inference in the browser via WebGPU.

Whisper Turbo (web)

Fleek HQ / community

Faster on-device Whisper in the browser using a quantized turbo build.

Live Transcribe (web)

Browser version of Google's accessibility live-transcription app.

Caption.TV

Caption.tv

Browser overlay that captions any streaming video tab in real time.

Transcribe Online Free (screen audio)

TranscribeOnline.com

No-install browser site that captures tab audio and transcribes it.

free / Premium

SubtitleBee Live (web)

SubtitleBee

Browser-based live captioning paired with the SubtitleBee subtitle generator.

free trial / Premium

Stenogenius

Browser stenography pad for live captioning by trained CART providers.

per provider license

Speakly Voice Typing

Speakly

Chrome extension that turns voice into typed text in any field.

Scribe AI

Voice typing and clipboard helpers for productivity in Chrome.

Speakify

Chrome dictation with offline-style behavior using the browser engine.

Speech Pulse

Free Chrome speech-to-text with sentence segmentation and punctuation.

Mighty Voice

Chrome dictation extension with custom voice macros.

free trial / pay-as-you-go

Vocalmatic (web)

Vocalmatic

Browser audio-to-text service with a free trial transcription credit.

Audext

Browser transcription editor with sync-highlight playback.

from $5/hr

Txtify

Free browser transcription service with privacy-friendly pricing.

Live Caption for Meet (community)

Writeout AI

Writeout

Pay-per-minute Whisper API wrapper running in the browser.

per-minute usage

various

Community Chrome extensions adding richer captions to Google Meet.

Twitch Caption Helpers (community)

various

Community Chrome extensions adding live captions to Twitch streams.

YouTube Caption Downloader (community)

various

Chrome extensions that export YouTube auto-captions to SRT/TXT.

from $0.10/min (auto) / from $0.80/min (human)

Rev (Chrome extension)

Rev.com

Rev's Chrome extension for capturing audio from Zoom/Meet/Teams calls.

free trial / Rev sub

Scribie (Chrome)

Scribie

Browser companion for the Scribie human-transcription marketplace.

Konch.ai

Konch

Browser transcription with built-in collaboration for journalists.

free trial / Pro

Scribee

Quick browser transcription with the OpenAI Whisper backbone.

free starter / Pro from $12/mo

Good Tape (web)

Zetland (Good Tape)

Browser transcription built by Wired's parent for journalists.

Vidby

Browser video-translation and transcription service with 100+ languages.

per-minute pricing

FreeSubtitles AI

FreeSubtitles

Free browser subtitle generator using Whisper.

free trial / Pro from $9.99/mo

Transkriptor (web)

Transkriptor

Browser transcription with Turkish-language strength and team workspace.

Amberscript (web)

Amberscript

Dutch browser transcription and subtitling platform with human-review option.

free trial / Pro / human-grade tier

Verbit (web)

Browser transcription with human-review pipeline aimed at legal and education.

enterprise / per-minute

Interscriber

Browser transcription tool with structured interview templates.

free trial / Pro

Scribepad

Free browser dictation pad with Markdown shortcuts.

Speechmatics Realtime Demo

Browser demo of Speechmatics' real-time engine with live mic capture.

free demo / API billed

ElevenLabs Scribe Demo

free signed-in / API billed

In-browser demo of ElevenLabs' Scribe transcription model.

Gladia Playground

Gladia

Browser playground for Gladia's multilingual transcription API.

free credits / API billed

Krisp (Chrome extension)

Krisp

Browser companion for Krisp's noise-cancelling and transcription stack.

noScribe (web companion)

Kai Dröge

GUI for offline transcription on the browser/desktop with diarization.

Web Speech API demo (caption this page)

MDN / Mozilla

MDN's reference Web Speech API demo — bookmarkable for SEO research.

Soniox Realtime Demo

Soniox

Browser demo of Soniox's low-latency streaming speech engine.

free demo / API billed

Picovoice Web Demo

free / commercial license

Browser playground for Picovoice's on-device wake-word and ASR engines.

BandLab AI Tools

BandLab Technologies

Cloud DAW with one-click Vocal Cleanup, Splitter, Mastering, and Vocal Tuner.

free · paid tier (BandLab Membership)

Soundtrap (Spotify) — transcription

Storyteller subscription $14.99/mo

Spotify's web DAW with podcast transcription baked into the storyboard editor.

Audius (creator tools)

Audius

Decentralized audio platform — listed for creator-facing AI tagging/transcription experiments.

EditShare FLOW with AI metadata

EditShare

Media asset platform with AI face / speech / scene tagging across ingested footage.

enterprise · contact EditShare for quote

Voloco

Resonant Cavity

AI vocal-processor on iOS / Android + desktop — pitch correction + de-noise for creators.

free · Voloco Pro subscription $7.99/mo

Respeecher (voice-replace for ADR)

Respeecher

Speech-to-speech voice replacement used in film/TV ADR — cloud-side with DAW integrations.

subscription · contact for studio pricing

Promo Submit Master (PSM AI mastering)

Promo Submit Master

AI mastering for podcasts + broadcast — speech-aware loudness profiles.

freemium · per-master pricing

LANDR Mastering

LANDR

Pioneering AI mastering service — speech and music profiles for podcast + music releases.

free trial · subscription $4+/mo (Studio) up to $24/mo (Pro)

ElevenLabs Voice Clone

freemium — Starter $5/mo, Creator $22/mo, Pro $99/mo, Scale $330/mo (Professional Voice Clone requires paid)

Studio-grade Instant + Professional Voice Cloning with multilingual output (33+ languages).

PlayHT Voice Clone

PlayHT

Ultra-realistic instant + high-fidelity voice cloning with 142+ languages and 800+ stock voices.

freemium — Creator $39/mo, Unlimited $99/mo (Instant Clone on free, High-Fidelity Clone on paid)

Speechify Voice Clone

Speechify

One-take personal voice clone built into Speechify Studio.

freemium — Premium $11.58/mo (Voice Clone gated to Premium+)

WellSaid Labs

Enterprise-licensed AI voices for corporate L&D, training, and marketing narration.

Maker $44/mo, Creator $89/mo, Team $179/seat/mo, Enterprise custom

Speechki

Audiobook-grade TTS with audio engineers in the loop.

per-project — $0.045/minute (TTS) + audiobook tiers

BeyondWords

TTS + podcast pipeline for publishers — articles to audio at scale.

freemium — Standard $19/mo, Premium $49/mo, Enterprise custom

VoiceChanger.io

Free browser voice changer with effect-based transformations.

Creator $24/mo, Pro $60/mo, Studio $180/mo, Enterprise custom

Replica Studios

Game voice actors — licensed AI voices with SAG-AFTRA agreement.

Acapela Group

Long-running enterprise TTS — accessibility, AAC, signage, IVR.

from $29.99 per voice (desktop) / Enterprise contact sales

Cepstral

Long-running embedded TTS — IVR and telephony.

ReadSpeaker

Enterprise TTS for accessibility, e-learning, and document narration.

Nuance Vocalizer

Microsoft (Nuance)

Nuance (Microsoft) enterprise TTS — IVR and contact centers.

NeoSpeech

Embedded TTS engine for IVR and consumer products.

Cliptalk

AI voice generator for short-form social videos.

freemium — Pro $19/mo, Studio $79/mo, Enterprise custom

Camb.ai

MARS-7 multilingual TTS + dubbing platform with low-resource language support.

Altered Studio

Altered AI

AI voice generation for game audio and dubbing with speech-to-speech.

Creator $4.99/mo, Pro $35.99/mo, Studio custom

Audible Maven Beta

Amazon (Audible / ACX)

Amazon's AI-narrated audiobook tier with self-publishing pipeline.

free to ACX publishers (Beta)

Apple Books Digital Narration

free to participating publishers

Apple's AI-narrated audiobook tier for indie publishers.

Findaway Voices by Spotify

Spotify (Findaway)

Audiobook self-publishing pipeline with a 2024 AI-narration tier.

free + per-sale royalty split

Murf for Audiobooks

Murf AI

Murf's audiobook-specific workflow with chapter markers and ACX-compatible export.

Creator $29/mo, Business $99/mo

Lovo for Audiobooks

Lovo AI

Lovo's audiobook workflow with multi-character scripts and emotion control.

Basic $24/mo, Pro $48/mo, Pro+ $149/mo

BeyondWords for Audiobooks

BeyondWords

BeyondWords' audiobook narration tier — TTS plus distribution.

Premium $49/mo, Enterprise custom

NaturalReader Accessibility

NaturalReader

NaturalReader's enterprise tier for ADA / Section 508 web accessibility overlays.

ReadSpeaker docReader

ReadSpeaker

Document-narration plugin for accessible PDFs and Office files.

Standard £79+, Plus £150+, site licenses

ClaroRead

Claro Software

Assistive-tech TTS reader for dyslexic and visually-impaired users.

Voiceflow Voices

Voiceflow

Voiceflow's TTS for prototyping voice assistants and chatbots.

freemium — Pro $50/mo, Teams $125/mo, Enterprise custom

Uberduck

AI voice and music generation with a focus on rap and music vocals.

freemium — Creator $9.99/mo, Pro $24.99/mo

DeepZen

Emotional audiobook narration with licensed voice IP.

Respeecher

Speech-to-speech voice cloning for film, gaming, and broadcast.

Veritone Voice

Veritone

Enterprise voice cloning with licensed-talent marketplace.

Spotify (acquired Sonantic)

Voicery

Apple (acquired Voicery)

Neural TTS API — acquired by Apple in 2020, archived as a public product.

discontinued

Sonantic

Emotional voice synthesis for games — acquired by Spotify in 2022.

discontinued as standalone

TTS Monster

Twitch / streamer TTS donation tool with celebrity voice characters.

freemium — Pro $4.99/mo, Streamer $9.99/mo

Amazon Alexa

Free with compatible hardware

Amazon's cloud voice assistant, powering Echo devices and Alexa-enabled hardware.

Alexa+

$19.99/mo · free for Prime

Amazon's LLM-powered upgrade to Alexa with a more conversational ASR + reasoning stack.

Google Assistant

Google's voice assistant for Android, Nest speakers, and Wear OS.

Free with Google account

Gemini Live

Free tier · Gemini Advanced $19.99/mo

Google's conversational Gemini voice mode — successor surface for Assistant on Android.

Microsoft Copilot Voice

Free · Copilot Pro $20/mo

The voice mode of Microsoft Copilot — Cortana's successor in the Microsoft assistant lineage.

Samsung Bixby

Samsung

Samsung's voice assistant across Galaxy phones, TVs, and home appliances.

Free with Samsung hardware

Xiaomi XiaoAI

Xiaomi

Xiaomi's Mandarin-first voice assistant across Mi phones, speakers, and home IoT.

Free with Xiaomi hardware

Huawei Celia

Huawei

Huawei's voice assistant for HarmonyOS phones, tablets, and smart-home devices.

Free with Huawei hardware

OPPO Breeno

OPPO

OPPO's voice assistant for ColorOS phones in China.

Free with OPPO hardware

Vivo Jovi

Vivo

Vivo's voice assistant for FuntouchOS / OriginOS phones in China and India.

Free with Vivo hardware

Yandex Alice

Yandex

Yandex's Russian-language voice assistant for Stantsiya speakers, cars, and phones.

Free with Yandex account

Sber Salyut

Sberbank / SberDevices

Sberbank's Russian voice-assistant family (Salyut, Joy, Athena) for SberDevices hardware.

Free with SberDevices hardware

MTS Marvin

MTS

MTS's Russian-language voice assistant for Capsule speakers and the MTS Music app.

Bundled with MTS Capsule speaker (~₽7,000)

VK Marusya

VK's Russian-language voice assistant for Capsule Mini and the VK Music app.

Free with VK account

AliGenie / Tmall Genie

Alibaba

Alibaba's Mandarin voice assistant powering the Tmall Genie smart-speaker line.

From ¥99 device + free service

JD Joy / Dingdong

JD.com

JD.com's Mandarin voice assistant on the Dingdong smart speaker line.

From ¥199 device + free service

Baidu DuerOS / Xiaodu

Baidu

Baidu's Mandarin voice OS powering Xiaodu smart speakers and displays.

From ¥249 device + free service

Meta Ray-Ban Smart Glasses

Meta · EssilorLuxottica

Ray-Ban Meta smart glasses with Meta AI voice — capture photos and ask questions hands-free.

$299 frames + free Meta AI service

Google Live Transcribe

$599 frames + free service

On-device live captioning of in-person conversations — Pixel and many Android phones.

Free with Android device

Even Realities G1

Even Realities

Prescription-friendly smart glasses with monocular captioning HUD and live translation.

Brilliant Labs Frame

Brilliant Labs

Hackable open-source smart glasses with mic + display, runs custom captioning apps.

$349 frames + open SDK

XREAL Air / Air 2 / Beam

XREAL

Display-only AR glasses pairing with phones — captioning apps via the XREAL ecosystem.

From $379 frames

Rokid Max / Glass

Rokid

AR glasses with on-device translation and captioning via the Rokid Station companion.

From $449 frames

INMO Air 2

INMO Technology

Standalone Android-based smart glasses with on-board mic, captioning, and translation.

From $529 frames

TCL RayNeo X2 / Air

TCL

TCL's AR glasses with built-in live translation and captioning HUD.

From $599 frames

ARxVision

ARx Vision

AI-powered glasses for low-vision users — voice description + captioning of environment.

$3,800 device + subscription tier

Olive Smart Ear / Olive Pro

Olive Union

Affordable hearing assistance earbuds with app-tuned amplification.

From $299 device

Eargo

Invisible OTC hearing aids tuned by app, with telecare support.

From $1,650 pair

Phonak Roger

Phonak (Sonova)

Wireless mic system for hearing aids — pairs with Roger receivers in noisy rooms.

From $799 per mic + audiologist fitting

Sennheiser Conversation Clear Plus

Sennheiser · Sonova

Speech-enhancement earbuds — focus on speech in noisy environments.

From $849 pair

Nuheara IQbuds Max

Nuheara

Hearing-assist earbuds with adjustable speech focus and ear-tuned DSP.

From $499 pair

Google Live Transcribe + Pixel Buds

Buds from $229 + free Live Transcribe app

Pixel Buds capturing audio + Live Transcribe rendering captions in the Pixel Buds app.

Mercedes-Benz MBUX Voice Assistant

Mercedes-Benz

"Hey Mercedes" — factory-fitted voice assistant in Mercedes vehicles.

BMW Intelligent Personal Assistant

BMW

"Hey BMW" — voice assistant in BMW iDrive 7 / 8 / 9 vehicles.

Audi MMI Voice

Audi

"Hey Audi" — natural-language voice assistant in Audi MMI infotainment.

Volkswagen We Connect Voice / IDA

Volkswagen

"Hello IDA" — voice assistant across the VW ID. and Golf families.

Toyota "Hey Toyota" Voice Assistant

Toyota

Toyota Audio Multimedia voice assistant — "Hey Toyota" wake phrase in newer models.

Honda Personal Assistant

Honda

"Hey Honda" — voice assistant in Honda e:HEV and e:NS models.

Ford SYNC 4

Ford

Ford's in-vehicle infotainment with natural-language voice control.

Included with vehicle · 8 yrs included data on EVs

GM Google Built-In Voice

General Motors

Google Assistant native in Chevy / Cadillac / GMC vehicles — no phone needed.

Tesla Voice Commands

Tesla

Tesla's in-cabin voice control for navigation, climate, media, and vehicle settings.

Rivian Voice Control

Rivian

Rivian R1T / R1S voice commands — "Hey Rivian" for nav, media, and vehicle features.

Lucid Voice Assistant

Lucid Motors

Lucid Air voice control — native voice in Lucid's Glass Cockpit + Pilot Panel.

NIO Nomi

NIO

NIO's in-car AI assistant with a physical animated head on the dash.

XPENG Xmart OS

XPENG

XPENG's Mandarin voice assistant in Xmart OS — "Hi Xpeng".

Included with vehicle · subscription after trial

Hyundai Bluelink

Hyundai

Hyundai's connected-car voice + telematics suite with hands-free vehicle control.

Kia Connect

Kia

Kia's connected-car platform — voice control + remote app + over-the-air services.

Included with vehicle · subscription after trial

Stellantis Uconnect 5 with AI

Stellantis

Uconnect 5 voice assistant across Jeep, Chrysler, Dodge, Ram, Fiat, Peugeot, Citroën.

Included with vehicle · subscription after trial

Cerence Drive

Cerence

Automotive ASR / TTS / dialogue platform — powers most OEM in-car assistants worldwide.

Enterprise · OEM contract

SoundHound Houndify Automotive

SoundHound AI

Independent voice-AI platform — Cerence competitor used by Mercedes, Hyundai, and Stellantis.

Tiered — free dev / commercial license / OEM

Sonos Voice Control

Sonos

Privacy-first on-device voice control for Sonos speakers — "Hey Sonos".

Free with Sonos speakers

Amazon Echo Show

Alexa smart display with video calling, recipe view, and on-device voice.

From $109 (Echo Show 5)

Amazon Echo Auto

$54.99 device + Alexa free

Plug-in Alexa accessory for cars without built-in Alexa.

Amazon Echo Dot Kids

$59.99 device + Amazon Kids+ $5.99/mo

Echo Dot edition with Amazon Kids+ parental controls and a kid-safe Alexa.

Google Nest Hub

From $99 (Nest Hub) · $229 (Nest Hub Max)

Google's smart display with Assistant / Gemini voice and on-device hot-word.

Roku Voice Remote

Roku

Push-to-talk voice search on Roku TVs and streaming sticks.

Included with Roku hardware

Amazon Fire TV Voice Remote

Included with Fire TV hardware

Alexa Voice Remote for Fire TV — push-to-talk and hands-free Fire TV variants.

Samsung TV Bixby

Samsung

Bixby voice control built into Samsung Smart TVs and soundbars.

Included with TV hardware

LG ThinQ Voice

LG Electronics

Voice control across LG TVs and ThinQ appliances — "Hi LG".

Included with LG hardware

Whirlpool Smart Appliances Voice

Whirlpool Corporation

Voice control for Whirlpool / Maytag connected appliances via Alexa + Google.

Included with smart appliance + hub

GE Appliances SmartHQ Voice

GE Appliances (Haier)

Voice control for GE connected appliances via SmartHQ + Alexa / Google.

Included with smart appliance

Frigidaire Connected Appliances Voice

Electrolux

Voice control for Frigidaire connected appliances via the Frigidaire app + Alexa / Google.

Included with smart appliance

Picovoice Porcupine

Free tier · paid commercial · enterprise

On-device wake-word engine — runs on micro-controllers, mobile, browsers.

Picovoice Cobra

Free tier · paid commercial

On-device voice-activity detector — detect speech vs silence in real time.

Snips (legacy)

Sonos (Snips acquisition)

Acquired-by-Sonos on-device assistant platform — now lives inside Sonos Voice Control.

Discontinued

Sensory TrulyHandsfree

Sensory

Sensory's on-device wake-word + small-vocab ASR — long-standing OEM voice IP.

Enterprise · OEM contract

Meta Quest Live Captions