Grok 2 Vision

Grok 2 Vision is a text model from xAI with a context window of 33K tokens and max output of 33K tokens. Pricing starts at 2.00 per million input tokens and 10.00 per million output tokens (cheapest at Vercel AI Gateway).

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Specifications

Model Keyxai/grok-2-vision-latest
ProviderxAI
Provider IDxai
ModeText
Canonical Namegrok-2-vision-1212
Context Window33K tokens
Max Output33K tokens

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens0.00202.00
Output Tokens0.01010.00

Benchmarks

No benchmark data is available for this model.

Price Comparison by Provider

Compare prices for Grok 2 Vision across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price, $
Output Price, $
xAIxai/grok-2-vision2.0010.00
Vercel AI Gatewayvercel_ai_gateway/xai/grok-2-vision2.0010.00
Perplexityperplexity/xai/grok-2-vision-1212N/AN/A

All Variants

All available versions, regions, and API endpoints for Grok 2 Vision.

Model Key
Provider
Mode
Input Price, $
Output Price, $
Context
Max Output
Vision
Functions
perplexity/xai/grok-2-vision-1212PerplexityOtherN/AN/AN/AN/Anono
vercel_ai_gateway/xai/grok-2-visionVercel AI GatewayText2.0010.0033K33Kyesyes
xai/grok-2-visionxAIText2.0010.0033K33Kyesyes
xai/grok-2-vision-1212xAIText2.0010.0033K33Kyesyes
xai/grok-2-vision-latestxAIText2.0010.0033K33Kyesyes