GPT Realtime 2 Image is OpenAI's image generation model. A real-time multimodal GPT model supporting image input within low-latency streaming interactions.
Specifications
Canonical IDopenai-gpt-realtime-2-image
TypeImage Generation
StatusActive
CreatorOpenAIOpenAI
Providers
Input ModalitiesText
Output ModalitiesImage

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text·
Image
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Cache Read
$ / 1M
Azure AI Foundry logo
Azure AI Foundry
openai:gptrealtime2image
$5.00$0.500

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
GPT Audio 1.5128K$2.50$10.00Available
GPT Audio Mini128K$0.600$2.40Available
GPT Audio128K$2.50$10.00Available
GPT Realtime 2 ImageCurrent
GPT Realtime 2 TextAvailable

Model IDs