GPT-4o

Azure OpenAITextDeprecated: 2026-02-27

GPT-4o is a text model from Azure OpenAI with a context window of 128K tokens and max output of 16K tokens. Pricing starts at $2.50 per million input tokens and $10.00 per million output tokens (cheapest at Replicate).

Specifications

Model Keyazure/global/gpt-4o-2024-08-06
ProviderAzure OpenAI
LiteLLM Providerazure
ModeText
Canonical Namegpt-4o
Context Window128K tokens
Max Output16K tokens

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens$0.0025$2.50
Output Tokens$0.010$10.00
Cache Read (Input)$0.0013$1.25

Price Comparison by Provider

Compare prices for GPT-4o across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price
Output Price
Azure OpenAIazure/global-standard/gpt-4o-2024-08-06$2.50$10.00
Github Copilotgithub_copilot/gpt-4o-2024-08-06N/AN/A
Gmigmi/openai/gpt-4o$2.50$10.00
OpenRouteropenrouter/openai/gpt-4o$2.50$10.00
Replicatereplicate/openai/gpt-4o$2.50$10.00
Vercel Ai Gatewayvercel_ai_gateway/openai/gpt-4o$2.50$10.00
OpenAIgpt-4o-transcribe-diarize$2.50$10.00
Gradient Aigradient_ai/openai-gpt-4oN/AN/A

Similar Models

Models with similar capabilities and context window size.

Model
Provider
Mode
Input Price
Output Price
Context
Max Output
Vision
Functions
Gemma 3 4B It GGUFLemonadeTextN/AN/A128K8Knoyes
Gemma3 4BLlamagateText$0.030$0.080128K8Kyesyes
GigaChat 2 LiteGigachatTextN/AN/A128K8Knoyes
GigaChat 2 MaxGigachatTextN/AN/A128K8Kyesyes
GigaChat 2 ProGigachatTextN/AN/A128K8Kyesyes
Glm 4.5 FlashZaiTextN/AN/A128K32Knoyes
Llama 3.1 70B Instruct MaasGoogle Vertex AITextN/AN/A128K2Kyesno
Llama 3.1 8B Instruct MaasGoogle Vertex AITextN/AN/A128K2Kyesno
Llama 3.2 90B Vision Instruct MaasGoogle Vertex AITextN/AN/A128K2Kyesno
Qwen3 4B Fp8NovitaText$0.030$0.030128K20Knono