OpenAI logo

GPT-4o Audio


GPT-4o Audio is OpenAI's language model with a 128K context window and up to 16K output tokens, available from 3 providers, starting at $2.50 / 1M input and $10.00 / 1M output. Preview release of GPT-4o with native audio input and output support, enabling real-time voice interaction via the Chat Completions API.
Specifications
Canonical IDopenai-gpt-4o-audio-preview
TypeLanguage
StatusActive
CreatorOpenAIOpenAI
Providers
Context Window128K tokens
Max Output16K tokens
Input ModalitiesAudioText
Output ModalitiesAudioText
Release Date · 9 months ago
Knowledge Cutoff

Capabilities

Input2/5
Text
Image·
Audio
Video·
PDF·
Output2/5
Text
Image·
Audio
Video·
Embedding·
Capabilities4/13
Reasoning·
Adaptive Reasoning·
Function Calling
Parallel Function Calling
Structured Outputs
Native JSON Schema
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Audio In
$ / 1M
Audio Out
$ / 1M
Azure AI Foundry logo
Azure AI Foundry
azure/gpt-4o-audio-preview-2024-12-17
$2.50$10.00$40.00$80.00
OpenAI logo
OpenAI
gpt-4o-audio-preview
$2.50$10.00$40.00$80.00
OpenRouter logo
OpenRouter
openai/gpt-4o-audio-preview
$2.50$10.00$40.00N/A

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
GPT-5.51.1M$5.00$30.00Available
GPT-5.4 Mini1.1M$0.750$4.50Available
GPT-5.4 Nano1.1M$0.200$1.25Available
GPT-5.41.1M$2.50$15.00Available
GPT-5.3 Codex400K$1.75$14.00Available
GPT-5.2 Codex400K$1.75$14.00Available
GPT-5.2410K$1.75$14.00Available
GPT-5.1410K$1.25$10.00Available
GPT-5.1 Codex400K$1.25$10.00Available
GPT-5.1 Codex Mini400K$0.250$2.00Available
GPT-4o Audio128K$2.50$10.00Current

Model IDs