DeepInfra (Whisper)

by DeepInfra

DeepInfra's hosted Whisper endpoint with per-second GPU pricing.

TL;DR

Best for indie devs and startups seeking cheap hosted Whisper with low ops burden. Pricing: per-second compute, at DeepInfra's published rates.

Category: Transcription APIs
Pricing: per-second compute, at DeepInfra's published rates
Platforms: API

What it is

DeepInfra is an inference platform that hosts open-source models, including Whisper-large-v3 and its variants. Endpoints are OpenAI-compatible where applicable, and usage is billed per second of compute. That makes it a cost alternative to OpenAI's hosted Whisper, particularly for batch transcription.

Vendor docs list word-level timestamps as a supported feature. Directory tags: commercial-api, model-host. Last vendor-page check: 2026-05-12.

Evaluate it head-to-head with adjacent vendors in the same subcategory, and verify current language coverage, region availability, and compliance terms in the vendor's public docs at build time.

Best for: Indie devs and startups seeking cheap hosted Whisper with low ops burden.
Watch out for: Smaller vendor; SLA and roadmap are limited compared to hyperscalers.
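
Because billing follows compute time rather than audio length, a rough cost estimate needs two inputs the listing does not provide: how fast the model runs relative to real time, and the rate per compute second. The sketch below shows only the arithmetic. The function name, the `realtime_factor` parameter, and every number in the example call are hypothetical placeholders, not DeepInfra's actual rates or throughput.

```python
def estimate_cost(audio_seconds, realtime_factor, price_per_compute_second):
    """Estimate spend when billing follows compute time, not audio length.

    realtime_factor: seconds of audio processed per second of compute
                     (hypothetical; measure it on your own workload).
    price_per_compute_second: rate in dollars (hypothetical; take the real
                              figure from the vendor's pricing page).
    """
    if realtime_factor <= 0:
        raise ValueError("realtime_factor must be positive")
    compute_seconds = audio_seconds / realtime_factor
    return compute_seconds * price_per_compute_second


# Placeholder inputs only: 10 hours of audio, 20x real time,
# and an invented rate of $0.0005 per compute second.
print(f"${estimate_cost(10 * 3600, 20.0, 0.0005):.2f}")
```

Running a short sample file and reading the billed seconds from the vendor dashboard is a more reliable way to find the real factor than guessing it.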

Install / use
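
The listing describes the endpoints as OpenAI-compatible, so a transcription call should look like an OpenAI-style multipart upload. What follows is a minimal standard-library sketch, not vendor-confirmed usage. The base URL, the `/audio/transcriptions` route, the model identifier, and the `text` response field are all assumptions modeled on the OpenAI API and should be checked against DeepInfra's current docs. The helper names are illustrative.

```python
import json
import os
import urllib.request
import uuid

# Assumed values: confirm both against DeepInfra's current docs before use.
BASE_URL = "https://api.deepinfra.com/v1/openai"
MODEL = "openai/whisper-large-v3"


def build_transcription_request(audio_bytes, filename, api_key, model=MODEL):
    """Build (but do not send) a multipart POST for an OpenAI-style transcription route."""
    boundary = uuid.uuid4().hex
    # Plain form field carrying the model name.
    model_part = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="model"\r\n\r\n'
        f"{model}\r\n"
    ).encode()
    # File field carrying the raw audio bytes.
    file_part = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        f"Content-Type: application/octet-stream\r\n\r\n"
    ).encode() + audio_bytes + b"\r\n"
    closing = f"--{boundary}--\r\n".encode()
    return urllib.request.Request(
        f"{BASE_URL}/audio/transcriptions",
        data=model_part + file_part + closing,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )


def transcribe(path, api_key):
    """Send the request and return the transcript text (assumes a JSON body with a 'text' key)."""
    with open(path, "rb") as f:
        req = build_transcription_request(f.read(), os.path.basename(path), api_key)
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["text"]
```

Keeping request construction separate from sending makes the request shape easy to inspect or unit-test without network access or an API key.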

Features

Speaker diarization: No
Word-level timestamps: Yes
Streaming / real-time: No
Languages supported: 99
HIPAA eligible: No
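
Word-level timestamps are mostly useful once they are grouped into readable caption lines. The sketch below shows one way to do that. It assumes each word arrives as a dict with `word`, `start`, and `end` keys (the shape of OpenAI-style verbose responses), and DeepInfra's actual response format may differ. The function name and the `max_gap` and `max_words` thresholds are illustrative choices.

```python
def group_words(words, max_gap=0.6, max_words=8):
    """Group timed words into caption lines, breaking on long pauses or line length."""
    lines, current = [], []
    for w in words:
        # Start a new line after a pause longer than max_gap seconds,
        # or once the current line already holds max_words words.
        if current and (
            w["start"] - current[-1]["end"] > max_gap or len(current) >= max_words
        ):
            lines.append(current)
            current = []
        current.append(w)
    if current:
        lines.append(current)
    return [
        {
            "start": line[0]["start"],
            "end": line[-1]["end"],
            "text": " ".join(x["word"].strip() for x in line),
        }
        for line in lines
    ]
```

Each returned entry carries the start time of its first word and the end time of its last, which is enough to emit SRT or WebVTT cues.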

DeepInfra (Whisper) vs Whipscribe

Feature               DeepInfra (Whisper)    Whipscribe
Category              Transcription APIs     Transcription APIs
Pricing               per-second compute     free beta
Speaker diarization   No                     Yes
Word timestamps       Yes                    Yes
Streaming             No                     No
Languages             99                     99
Platforms             API                    Web, API, MCP

Alternatives to DeepInfra (Whisper)

Whipscribe is a managed faster-whisper + whisperX service. It suits teams that want transcripts without running infrastructure: submit a URL or upload a file and get a transcript back in seconds.