GPT Realtime 2 Image is OpenAI's image generation model. A real-time multimodal GPT model supporting image input within low-latency streaming interactions.
| Specifications | |
|---|---|
openai-gpt-realtime-2-image | |
| Image Generation | |
| Active | |
| Text | |
| Image | |
Capabilities
Input1/5
Text✓
Image·
Audio·
Video·
PDF·
Output1/5
Text·
Image✓
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·
Pricing by Provider
US Dollar ($)
Per 1M tokens
| Provider | Standard | |
|---|---|---|
| Input $ / 1M | Cache Read $ / 1M | |
| $5.00 | $0.5 | |
Cost Calculator
US Dollar ($)
Preset:
Versions
| Version | Released | Context | Input / 1M | Output / 1M | Status |
|---|---|---|---|---|---|
| GPT Audio 1.5 | — | 128K | $2.50 | $10.00 | Available |
| GPT Audio Mini | 128K | $0.600 | $2.40 | Available | |
| GPT Audio | 128K | $2.50 | $10.00 | Available | |
| GPT Realtime 2 Image | — | — | — | — | Current |
| GPT Realtime 2 Text | — | — | — | — | Available |