Voxtral Mini Transcribe Realtime is Mistral AI's speech-to-text model, priced from $0.0060 per 1M input. It is a real-time speech transcription variant of Voxtral Mini, optimized for low-latency audio-to-text conversion.
Specifications
Canonical ID: mistral-voxtral-mini-transcribe-realtime
Type: Speech to Text
Status: Active
Creator: Mistral AI
Providers: Mistral AI
Input Modalities: Audio
Output Modalities: Text

Capabilities

Input (1 of 5 supported)
Supported: Audio
Not supported: Text, Image, Video, PDF

Output (1 of 5 supported)
Supported: Text
Not supported: Image, Audio, Video, Embedding

Capabilities (0 of 13 supported)
Not supported: Reasoning, Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Pricing by Provider

Provider | Model ID | Standard input ($ / 1M) | Batch input ($ / 1M)
Mistral AI | voxtral-mini-transcribe-realtime-latest | $0.0060 | $0.0030
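
The per-1M rates above translate into a cost estimate with simple arithmetic. The following is a minimal sketch, not an official calculator: the function name `estimate_cost` and the `RATES_PER_MILLION` table are hypothetical, and the billing unit is left generic ("input units") because the listing only says "$ / 1M" without naming what is counted.

```python
from decimal import Decimal

# Rates copied from the pricing table, in USD per 1M input units.
# The table name and the "input units" wording are assumptions for illustration.
RATES_PER_MILLION = {
    "standard": Decimal("0.0060"),
    "batch": Decimal("0.0030"),
}


def estimate_cost(input_units: int, mode: str = "standard") -> Decimal:
    """Estimate the USD cost of `input_units` billed at the given pricing mode."""
    if input_units < 0:
        raise ValueError("input_units must be non-negative")
    if mode not in RATES_PER_MILLION:
        raise ValueError(f"unknown pricing mode: {mode!r}")
    # cost = rate per 1M * (units / 1M); Decimal avoids float rounding drift.
    return RATES_PER_MILLION[mode] * Decimal(input_units) / Decimal(1_000_000)


if __name__ == "__main__":
    # Compare both pricing modes for the same hypothetical workload.
    for mode in RATES_PER_MILLION:
        print(mode, estimate_cost(250_000_000, mode))
```

Because the batch rate is half the standard rate, any workload that tolerates batch processing costs half as much under this estimate.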

Versions

Version | Released | Context | Input / 1M | Output / 1M | Status
Voxtral Mini 3B | - | 128K | $0.040 | $0.040 | Available
Voxtral Mini Transcribe Realtime | - | - | $0.006 | - | Current
Voxtral Mini TTS | - | - | $0.000 | $16.00 | Available

Other models

Model | Tier | Released | Context | Input / 1M | Output / 1M
Voxtral Small 24B | Small | - | 128K | $0.100 | $0.300
Voxtral TTS | - | - | - | - | -

Model IDs