Grok 4.1 Fast is xAI's language model with a 2.0M context window and up to 30K output tokens, available from 2 providers, starting at $1.25 / 1M input and $2.50 / 1M output. xAI's fast agentic tool-calling LLM with a 2M context window, excelling at real-world use cases like customer support and deep research.
Specifications
Canonical IDxai-grok-4-1-fast
TypeLanguage
StatusDeprecated
CreatorxAIxAI
Providers
Context Window2.0M tokens
Max Output30K tokens
Input ModalitiesAudioImageText
Output ModalitiesText
Reasoning Effortsdefault
Release Date · 6 months ago
Deprecation Date
Benchmarks
Intelligence Index
23.6
#190
Coding Index
19.5
#183
Math Index
34.3
#169
MMLU-Pro
0.7
#172
GPQA
0.6
#230
HLE
0.1
#284
LiveCodeBench
0.4
#172
IFBench
0.4
#265
Time to First Token
0.00s
#200
SciCode
0.3
#228
AIME 2025
0.3
#169
LCR
0.2
#237
TerminalBench Hard
0.1
#168
TAU2
0.6
#143
Output TPS
0.0
#474

Capabilities

Input3/5
Text
Image
Audio
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities6/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs
Native JSON Schema
Web Search
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching
Assistant Prefill·

Pricing by Provider

ProviderStandardBatch
Input
$ / 1M
Output
$ / 1M
Cache Read
$ / 1M
Input
$ / 1M
Output
$ / 1M
Cache Read
$ / 1M
xAI logo
xAI
grok-4-1-fast
$1.25$2.50$0.200$0.625$1.25$0.100
Oracle Cloud (OCI) logo
Oracle Cloud (OCI)
oci/xai.grok-4.1-fast
$5.00$25.00N/A

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Grok 4.31.0M$1.25$2.50Available
Grok 4.202.0M$1.25$2.50Available
Grok 4.20 Multi-Agent2.0M$1.25$2.50Available
Grok 4.20 Multi-Agent Beta2.0M$1.25$2.50Available
Grok 4.20 Non-Reasoning2.0M$1.25$2.50Available
Grok 4.20 Reasoning2.0M$1.25$2.50Available
Grok 4.1 Fast2.0M$1.25$2.50Current
Grok 4 Fast131K$0.200$0.500Deprecated
Grok 4 Fast Non-Reasoning2.0M$0.200$0.500Available
Grok 4256K$1.25$2.50Deprecated
Grok 4.1 Reasoning2.0M$0.200$0.500Available

Model IDs