xAI logo

Grok 2 Vision

Deprecated

Grok 2 Vision is xAI logoxAI's language model with a 33K context window, available from 2 providers, starting at $2.00 / 1M input and $10.00 / 1M output. A multimodal variant of Grok 2 that adds image understanding to xAI's second-generation language model for vision-language tasks.
Spec
Canonical IDxai-grok-2-vision
TypeLanguage
StatusDeprecated
CreatorxAIxAI
Providers
Context Window33K tokens
Input ModalitiesImage
Output ModalitiesText
Deprecation Date

Capabilities

Input1/5
Text·
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities2/13
Reasoning·
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Vercel AI Gateway logo
Vercel AI Gateway
xai/grok-2-vision
$2.00$10.00
xAI logo
xAI
grok-2-vision
$2.00$10.00

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Grok 4.20 Multi-Agent2.0M$2.00$6.00Available
Grok 4.20 Multi Agent Beta2.0M$2.00$6.00Available
Grok 4 20 Non-Reasoning2.0MAvailable
Grok 4 20 Reasoning2.0MAvailable
Grok 4 20131K$3.00$15.00Available
Grok 4.1 Fast2.0M$0.200$0.500Available
Grok 4.1 Fast Non-Reasoning2.0M$0.200$0.500Available
Grok 4.1 Fast Reasoning2.0M$0.200$0.500Available
Grok 4 Non-Reasoning2.0M$2.00$6.00Available
Grok 4 Reasoning2.0M$2.00$6.00Available
Grok 2 Vision33K$2.00$10.00Current

Model IDs