GPT Realtime is OpenAI's language model with a 32K context window and up to 4K output tokens, available from 2 providers, starting at $4.00 per 1M input tokens and $16.00 per 1M output tokens. It is a general-availability realtime model capable of responding to audio and text inputs over WebRTC, WebSocket, or SIP connections with low latency.
Capabilities
Input: 3/5
Output: 2/5
Capabilities: 2/13
Pricing by Provider
Standard tier, USD per 1M tokens:

| Provider | Input $ / 1M | Output $ / 1M | Cache Read $ / 1M | Audio In $ / 1M | Audio Out $ / 1M |
|---|---|---|---|---|---|
| Azure AI Foundry | $4.00 | $16.00 | $4.00 | $32.00 | $64.00 |
| OpenAI | $4.00 | $16.00 | $0.400 | $32.00 | $64.00 |
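
The per-token arithmetic behind these rates can be sketched in a few lines of Python. This is only an illustrative estimate, not either provider's billing logic: the `Rates` class, the `estimate_cost` function, and its parameter names are hypothetical, and the sketch assumes cached input tokens are counted separately from regular input tokens and billed at the cache-read rate.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rates:
    """Standard-tier prices in USD per 1M tokens, copied from the table above."""
    text_in: float
    text_out: float
    cache_read: float
    audio_in: float
    audio_out: float


# Hypothetical lookup table; the numbers are the ones listed per provider.
RATES = {
    "Azure AI Foundry": Rates(4.00, 16.00, 4.00, 32.00, 64.00),
    "OpenAI": Rates(4.00, 16.00, 0.40, 32.00, 64.00),
}


def estimate_cost(provider: str, *, text_in: int = 0, text_out: int = 0,
                  cached_in: int = 0, audio_in: int = 0,
                  audio_out: int = 0) -> float:
    """Return an estimated cost in USD for the given token counts.

    Assumption: cached_in tokens are billed only at the cache-read rate
    and are not also included in text_in.
    """
    r = RATES[provider]
    weighted = (
        text_in * r.text_in
        + text_out * r.text_out
        + cached_in * r.cache_read
        + audio_in * r.audio_in
        + audio_out * r.audio_out
    )
    # Rates are quoted per 1M tokens, so scale the weighted sum down.
    return weighted / 1_000_000


if __name__ == "__main__":
    # Example workload: mostly cached text input with a short text reply.
    usage = dict(text_in=200_000, text_out=50_000, cached_in=800_000)
    for provider in RATES:
        print(f"{provider}: ${estimate_cost(provider, **usage):.2f}")
```

With the listed rates, the two providers differ only in the cache-read price, so any gap in the estimate comes entirely from the `cached_in` term.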
Versions
| Version | Released | Context | Input / 1M | Output / 1M | Status |
|---|---|---|---|---|---|
| GPT-5.4 Mini | | 1.1M | $0.750 | $4.50 | Available |
| GPT-5.4 Nano | | 1.1M | $0.200 | $1.25 | Available |
| GPT-5.4 | | 1.1M | $2.50 | $15.00 | Available |
| GPT-5.4 Pro | | 1.1M | $30.00 | $180.00 | Available |
| GPT-5.4 3.5 | — | 1.1M | — | — | Available |
| GPT-5.4 Pro 3.5 | — | 1.1M | — | — | Available |
| GPT-5.3 Chat | | 128K | $1.75 | $14.00 | Available |
| GPT-5.3 Codex | | 400K | $1.75 | $14.00 | Available |
| GPT-5.3 Codex Spark | — | 128K | — | — | Available |
| GPT-5.3 Instant | — | 128K | — | — | Available |
| GPT Realtime | | 32K | $4.00 | $16.00 | Current |