GPT Realtime is OpenAI's language model with a 32K context window and up to 4K output tokens, available from 2 providers, starting at $4.00 / 1M input and $16.00 / 1M output. A general-availability realtime LLM capable of responding to audio and text inputs over WebRTC, WebSocket, or SIP connections with low latency.
Specifications
Canonical IDopenai-gpt-realtime
TypeLanguage
StatusActive
CreatorOpenAIOpenAI
Providers
Context Window32K tokens
Max Output4K tokens
Input ModalitiesAudioImageText
Output ModalitiesAudioText
Release Date · 10 months ago
Knowledge Cutoff · 3 years ago

Capabilities

Input3/5
Text
Image
Audio
Video·
PDF·
Output2/5
Text
Image·
Audio
Video·
Embedding·
Capabilities2/13
Reasoning·
Adaptive Reasoning·
Function Calling
Parallel Function Calling
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandard
Input
$ / 1M
Output
$ / 1M
Cache Read
$ / 1M
Audio In
$ / 1M
Audio Out
$ / 1M
Image In
$ / 1M
Azure AI Foundry logo
Azure AI Foundry
azure/gpt-realtime-2025-08-28
$4.00$16.00$4.00$32.00$64.00$5.00
OpenAI logo
OpenAI
gpt-realtime
$4.00$16.00$0.4$32.00$64.00$5.00

Cost Calculator

US Dollar ($)
Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
GPT-5.51.1M$5.00$30.00Available
GPT-5.4 Mini1.1M$0.750$4.50Available
GPT-5.4 Nano1.1M$0.200$1.25Available
GPT-5.41.1M$2.50$15.00Available
GPT-5.3 Codex400K$1.75$14.00Available
GPT-5.2 Codex400K$1.75$14.00Available
GPT-5.2410K$1.75$14.00Available
GPT-5.1410K$1.25$10.00Available
GPT-5.1 Codex400K$1.25$10.00Available
GPT-5.1 Codex Mini400K$0.250$2.00Available
GPT Realtime32K$4.00$16.00Current

Model IDs

azure/gpt-realtime-2025-08-28
gpt-realtime
gpt-realtime-2025-08-28
openai-gpt-realtime