GPT Audio 1.5 is OpenAI's language model with a 128K context window and up to 16K output tokens, available from 2 providers, starting at $2.50 / 1M input and $10.00 / 1M output. A versioned release of OpenAI's GPT Audio model supporting audio input and output for conversational and voice-interface applications.
Specifications
Canonical IDopenai-gpt-audio-1-5
TypeLanguage
StatusActive
CreatorOpenAIOpenAI
Providers
Context Window128K tokens
Max Output16K tokens
Input ModalitiesAudioText
Output ModalitiesAudioText
Knowledge Cutoff

Capabilities

Input2/5
Text
Image·
Audio
Video·
PDF·
Output2/5
Text
Image·
Audio
Video·
Embedding·
Capabilities2/13
Reasoning·
Adaptive Reasoning·
Function Calling
Parallel Function Calling
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Audio In
$ / 1M
Audio Out
$ / 1M
Azure AI Foundry logo
Azure AI Foundry
azure/gpt-audio-1.5-2026-02-23
$2.50$10.00$40.00$80.00
OpenAI logo
OpenAI
gpt-audio-1.5
$2.50$10.00$32.00$64.00

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
GPT Audio 1.5128K$2.50$10.00Current
GPT Audio Mini128K$0.600$2.40Available
GPT Audio128K$2.50$10.00Available
GPT Realtime 2 ImageAvailable
GPT Realtime 2 TextAvailable

Model IDs