Name: Whisper 3 Large Turbo
Brand: OpenAI

Whisper 3 Large Turbo is OpenAI's speech to text model, available from 2 providers. A faster, distilled variant of Whisper Large V3 that maintains strong multilingual ASR accuracy with reduced inference latency.

Specifications
Canonical ID	`openai-whisper-3-large-turbo`
Type	Speech to Text
Status	Active
Creator	OpenAI
Providers	Groq IBM watsonx
Input Modalities	Audio
Output Modalities	Text
Parameters	0.81B
HuggingFace Likes	3,012
HuggingFace Downloads (30d)	7,277,395
HuggingFace Downloads (all-time)	83,858,224

Capabilities

Input1/5

Text·

Image·

Audio✓

Video·

PDF·

Output1/5

Text✓

Image·

Audio·

Video·

Embedding·

Capabilities0/13

Reasoning·

Adaptive Reasoning·

Function Calling·

Parallel Function Calling·

Structured Outputs·

Native JSON Schema·

Web Search·

URL Context·

Computer Use·

Code Execution·

File Search·

Prompt Caching·

Assistant Prefill·

Pricing by Provider

Provider	Standard
Provider	Audio In $ / 1M	Audio Out $ / 1M
Groq groq/whisper-large-v3-turbo	$0.000011	N/A
IBM watsonx watsonx/whisper-large-v3-turbo	$0.000100	$0.000100

Cost Calculator

Preset:

Input tokens

Output tokens

Number of calls

Versions

Version	Released	Context	Input / 1M	Output / 1M	Status
Whisper 3 Large Turbo	—	—	—	—	Current
Whisper 3	—	4K	—	—	Available
Whisper 3 Large	—	—	—	—	Available
Whisper 3 Turbo	—	4K	—	—	Available
Whisper 2 Large	—	—	—	—	Available
Whisper	—	—	—	—	Available
Whisper Base	—	—	—	—	Available
Whisper Large	—	—	—	—	Available
Whisper Medium	—	—	—	—	Available
Whisper Small	—	—	—	—	Available
Whisper Tiny	—	—	—	—	Available

Whisper 3 Large Turbo

Capabilities

Pricing by Provider

Cost Calculator

Versions

Model IDs