Hume EVI

by Hume AI Inc.

Empathic voice interface with emotional-tone awareness.

TL;DR

Empathic voice interface with emotional-tone awareness.

Best for developers building voice agents where emotional-tone awareness materially changes the response (mental-health, coaching). Pricing: pay-as-you-go.

Category
Transcription APIs
License
Stars
Last push
Pricing
pay-as-you-go
Platforms
API

What it is

Hume AI's Empathic Voice Interface (EVI) is a voice-AI model that combines speech recognition, language generation, and prosodic-tone analysis into a single model. The product reads the emotional tone of the speaker — frustration, hesitation, enthusiasm — and lets the agent adjust its response. EVI is positioned for use cases where tone matters: mental-health support, coaching, and consumer voice assistants. Hume AI is run by former Google DeepMind researcher Alan Cowen and has raised over $50M from EQT Ventures, Union Square Ventures, and Comcast Ventures.

Best for: Developers building voice agents where emotional-tone awareness materially changes the response (mental-health, coaching).
Watch out for: Emotion-AI scoring is heuristic and contested; not appropriate for high-stakes decisions about people.

Install / use

hume.ai — Tiered pay-as-you-go, contact sales for production tier

Features

Speaker diarizationNo
Word-level timestampsNo
Streaming / real-timeNo
Languages supportedNone
HIPAA eligibleNo

Hume EVI vs Whipscribe

FeatureHume EVIWhipscribe
CategoryTranscription APIsTranscription APIs
Pricingpay-as-you-gofree beta
Speaker diarizationYes
Word timestampsYes
StreamingNo
Languages99
PlatformsAPIWeb, API, MCP

Alternatives to Hume EVI

Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or drop a file in the form below — you'll have a transcript in seconds.