GPT-realtime mini

GPT-realtime mini is a text model from OpenAI with a context window of 128K tokens and a maximum output of 4K tokens. Pricing starts at $0.60 per million input tokens and $2.40 per million output tokens; OpenAI and Azure OpenAI list the same rates.

Capabilities

Vision, Function Calling, Reasoning, JSON Schema, System Messages, Web Search, Prompt Caching, Audio Input, Audio Output

Specifications

Model Key: gpt-realtime-mini-2025-12-15
Provider: OpenAI
Provider ID: openai
Mode: Text
Canonical Name: gpt-realtime-mini
Context Window: 128K tokens
Max Output: 4K tokens

Pricing

Type                 Per 1K Tokens, $   Per 1M Tokens, $
Input Tokens         0.000600           0.600
Output Tokens        0.0024             2.40
Cache Read (Input)   0.000060           0.060
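
As a worked example of the arithmetic behind these rates, the sketch below estimates the cost of a single request. The function name `estimate_cost` and its parameters are hypothetical, and it assumes that cached input tokens are a subset of the input tokens and are billed at the cache-read rate instead of the regular input rate. Only the three rates listed above are used; any other billing dimensions are out of scope.

```python
# Per-1M-token rates in USD, taken from the pricing table above.
INPUT_PER_M = 0.60
OUTPUT_PER_M = 2.40
CACHE_READ_PER_M = 0.06


def estimate_cost(input_tokens: int, output_tokens: int,
                  cached_input_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumption (not stated in the listing): cached_input_tokens is the part
    of input_tokens served from the prompt cache, billed at the cache-read
    rate rather than the regular input rate.
    """
    if min(input_tokens, output_tokens, cached_input_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_input_tokens > input_tokens:
        raise ValueError("cached_input_tokens cannot exceed input_tokens")

    uncached = input_tokens - cached_input_tokens
    total = (uncached * INPUT_PER_M
             + cached_input_tokens * CACHE_READ_PER_M
             + output_tokens * OUTPUT_PER_M)
    return total / 1_000_000
```

Under these assumptions, a request with 100,000 input tokens and 4,000 output tokens comes to roughly $0.0696; if half of the input is read from cache, the estimate drops to roughly $0.0426.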

Benchmarks

No benchmark data is available for this model.

Price Comparison by Provider

Compare prices for GPT-realtime mini across different providers. The same model may be available through multiple providers at different price points.

Provider       Model Key                            Input Price, $   Output Price, $
OpenAI         gpt-realtime-mini                    0.600            2.40
Azure OpenAI   azure/gpt-realtime-mini-2025-10-06   0.600            2.40

All Variants

All available versions, regions, and API endpoints for GPT-realtime mini.

Model Key                            Provider       Mode   Input Price, $   Output Price, $   Context   Max Output   Vision   Functions
azure/gpt-realtime-mini-2025-10-06   Azure OpenAI   Text   0.600            2.40              32K       4K           no       yes
gpt-realtime-mini                    OpenAI         Text   0.600            2.40              128K      4K           no       yes
gpt-realtime-mini-2025-10-06         OpenAI         Text   0.600            2.40              128K      4K           no       yes
gpt-realtime-mini-2025-12-15         OpenAI         Text   0.600            2.40              128K      4K           no       yes
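
Because the variants differ in context window, a request that fits one may not fit another. The sketch below filters the variants listed above by size. The `Variant` class and `variants_that_fit` function are hypothetical names, "K" is assumed to mean 1,000 tokens, and the context window is assumed to cover prompt and output together; none of these assumptions is confirmed by the listing.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Variant:
    model_key: str
    provider: str
    context_tokens: int      # assumes "32K" means 32,000 tokens
    max_output_tokens: int   # assumes "4K" means 4,000 tokens


# Values transcribed from the variants listing above.
VARIANTS = [
    Variant("azure/gpt-realtime-mini-2025-10-06", "Azure OpenAI", 32_000, 4_000),
    Variant("gpt-realtime-mini", "OpenAI", 128_000, 4_000),
    Variant("gpt-realtime-mini-2025-10-06", "OpenAI", 128_000, 4_000),
    Variant("gpt-realtime-mini-2025-12-15", "OpenAI", 128_000, 4_000),
]


def variants_that_fit(prompt_tokens: int, output_tokens: int) -> list[Variant]:
    """Return the variants whose limits accommodate the request.

    Assumption: the context window must hold prompt and output together.
    """
    return [
        v for v in VARIANTS
        if output_tokens <= v.max_output_tokens
        and prompt_tokens + output_tokens <= v.context_tokens
    ]
```

For instance, a 40,000-token prompt with a 2,000-token reply would exclude the 32K Azure OpenAI variant under these assumptions and leave the three OpenAI keys.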