GPT-4o-mini-transcribe

GPT-4o-mini-transcribe is a audio (stt/tts) model from OpenAI with a context window of 16K tokens and max output of 2K tokens. Pricing starts at 1.25 per million input tokens and 5.00 per million output tokens.

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Specifications

Model Keygpt-4o-mini-transcribe
ProviderOpenAI
Provider IDopenai
ModeAudio (STT/TTS)
Canonical Namegpt-4o-mini-transcribe
Context Window16K tokens
Max Output2K tokens

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens0.00131.25
Output Tokens0.00505.00

Benchmarks

No benchmark data is available for this model.

All Variants

All available versions, regions, and API endpoints for GPT-4o-mini-transcribe.

Model Key
Provider
Mode
Input Price, $
Output Price, $
Context
Max Output
Vision
Functions
gpt-4o-mini-transcribeOpenAIAudio (STT/TTS)1.255.0016K2Knono
gpt-4o-mini-transcribe-2025-03-20OpenAIAudio (STT/TTS)1.255.0016K2Knono
gpt-4o-mini-transcribe-2025-12-15OpenAIAudio (STT/TTS)1.255.0016K2Knono
gpt-4o-transcribeOpenAIAudio (STT/TTS)2.5010.0016K2Knono