Grok Vision Beta

Grok Vision Beta is a model from xAI, listed in Text mode, with an 8K-token context window and a maximum output of 8K tokens. Pricing starts at 5.00 per million input tokens and 15.00 per million output tokens.

Capabilities

Vision
Function Calling
Reasoning
JSON Schema
System Messages
Web Search
Prompt Caching
Audio Input
Audio Output

Specifications

Model Key: xai/grok-vision-beta
Provider: xAI
Provider ID: xai
Mode: Text
Canonical Name: grok-vision-beta
Context Window: 8K tokens
Max Output: 8K tokens

Pricing

Type             Per 1K Tokens    Per 1M Tokens
Input Tokens     0.0050           5.00
Output Tokens    0.015            15.00
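
The per-token rates above can be turned into a per-request estimate. The following is a minimal sketch, not an official calculator: the function name `estimate_cost` and the constant names are hypothetical, and the result is expressed in whatever currency the listed rates use, since the page does not state one. It assumes billing is strictly linear in token counts, with no minimums, discounts, or caching adjustments.

```python
from decimal import Decimal

# Rates copied from the pricing table (per 1M tokens; currency not stated in the source).
INPUT_RATE_PER_M = Decimal("5.00")
OUTPUT_RATE_PER_M = Decimal("15.00")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request, assuming purely linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_RATE_PER_M / TOKENS_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_RATE_PER_M / TOKENS_PER_M
    return input_cost + output_cost
```

For example, `estimate_cost(6000, 1000)` evaluates to 0.045: 0.03 for the input tokens plus 0.015 for the output tokens. `Decimal` is used instead of `float` so that small per-request amounts do not pick up binary rounding error when summed over many requests.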

Benchmarks

No benchmark data is available for this model.