cuDNN Frontend

by NVIDIA

C++ / Python API for cuDNN — speeds up custom ASR kernels.

TL;DR

Best for fused attention + convolution ops in Conformer/Whisper engines. Pricing: free.

Category: Open source
License: MIT
Pricing: free
Platforms: Linux

What it is

A modern C++ and Python wrapper around NVIDIA's cuDNN library, released under the MIT license.

Best for: Fused attention + convolution ops in Conformer/Whisper engines.
Watch out for: Low-level; expect to read examples.

Install / use
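
The Python bindings are, as far as we know, published on PyPI as nvidia-cudnn-frontend and imported as cudnn; treat both names as assumptions and check the project's README for your version. A minimal sketch for checking whether the bindings are usable on a machine, without needing a GPU just to run the check (describe_cudnn_frontend is a hypothetical helper name, and backend_version() is assumed to report the underlying cuDNN version):

```python
import importlib.util


def describe_cudnn_frontend() -> str:
    """Report whether the cuDNN Frontend Python bindings look usable here."""
    # Assumption: after `pip install nvidia-cudnn-frontend` the bindings
    # are importable under the module name `cudnn`.
    if importlib.util.find_spec("cudnn") is None:
        return "not installed"
    try:
        import cudnn

        # Assumption: backend_version() returns the version of the
        # cuDNN library the frontend was able to load.
        return f"installed, cuDNN backend {cudnn.backend_version()}"
    except Exception as exc:
        # Typical causes: no NVIDIA driver, or the cuDNN shared library
        # is missing from the loader path.
        return f"installed but unusable: {exc}"


if __name__ == "__main__":
    print(describe_cudnn_frontend())
```

On a machine without the package this prints "not installed". Building and running actual fused-attention or convolution graphs requires an NVIDIA GPU and a matching cuDNN install, so start from the project's own samples for that.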

Features

Speaker diarization: No
Word-level timestamps: Yes
Streaming / real-time: No
Languages supported: 99
HIPAA eligible: No

cuDNN Frontend vs Whipscribe

Feature              cuDNN Frontend   Whipscribe
Category             Open source      Transcription APIs
Pricing              free             free beta
Speaker diarization  No               Yes
Word timestamps      Yes              Yes
Streaming            No               No
Languages            99               99
Platforms            Linux            Web, API, MCP

Alternatives to cuDNN Frontend

Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or upload a file and you'll have a transcript in seconds.