ONNX Runtime

by Microsoft

Microsoft's cross-platform inference runtime for ONNX-exported Whisper.

TL;DR

Microsoft's cross-platform inference runtime for ONNX-exported Whisper.

Best for production inference of Whisper / Wav2Vec2 / Parakeet across hardware. Pricing: free.

Category
Open source
License
Stars
Last push
Pricing
free
Platforms
Linux, macOS, Windows, Android, iOS, Web

What it is

The standard ONNX runtime — supports CPU, CUDA, DirectML, CoreML, NNAPI. MIT.

Best for: Production inference of Whisper / Wav2Vec2 / Parakeet across hardware.
Watch out for: Whisper-specific decoders require custom export pipelines.

Install / use

pip install onnxruntime

Features

Speaker diarizationNo
Word-level timestampsYes
Streaming / real-timeNo
Languages supported99
HIPAA eligibleNo

ONNX Runtime vs Whipscribe

FeatureONNX RuntimeWhipscribe
CategoryOpen sourceTranscription APIs
Pricingfreefree beta
Speaker diarizationNoYes
Word timestampsYesYes
StreamingNoNo
Languages9999
PlatformsLinux, macOS, Windows, Android, iOS, WebWeb, API, MCP

Alternatives to ONNX Runtime

Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or drop a file in the form below — you'll have a transcript in seconds.