Grok Vision Beta
Grok Vision Beta is a text model from
xAI with a context window of 8K tokens and max output of 8K tokens. Pricing starts at 5.00 per million input tokens and 15.00 per million output tokens.
Capabilities
✓ Vision✓ Function Calling✗ Reasoning✗ JSON Schema✗ System Messages✓ Web Search✗ Prompt Caching✗ Audio Input✗ Audio Output
Specifications
| Model Key | xai/grok-vision-beta |
| Provider | |
| Provider ID | xai |
| Mode | Text |
| Canonical Name | grok-vision-beta |
| Context Window | 8K tokens |
| Max Output | 8K tokens |
Pricing
| Type | Per 1K Tokens | Per 1M Tokens |
|---|---|---|
| Input Tokens | 0.0050 | 5.00 |
| Output Tokens | 0.015 | 15.00 |
Benchmarks
No benchmark data is available for this model.