Grok 4.1 Fast is xAI's language model with a 2.0M context window and up to 30K output tokens, available from 2 providers, starting at $1.25 / 1M input and $2.50 / 1M output. xAI's fast agentic tool-calling LLM with a 2M context window, excelling at real-world use cases like customer support and deep research.
Specifications
Canonical IDxai-grok-4-1-fast
TypeLanguage
StatusDeprecated
CreatorxAIxAI
Providers
Context Window2.0M tokens
Max Output30K tokens
Input ModalitiesAudioImageText
Output ModalitiesText
Reasoning Effortsdefault
Release Date · 6 months ago
Deprecation Date
Benchmarks
Intelligence Index
23.6
#185
Coding Index
19.5
#178
Math Index
34.3
#169
MMLU-Pro
0.7
#172
GPQA
0.6
#225
HLE
0.1
#279
LiveCodeBench
0.4
#172
IFBench
0.4
#260
Time to First Token
0.00s
#199
SciCode
0.3
#223
AIME 2025
0.3
#169
LCR
0.2
#232
TerminalBench Hard
0.1
#163
TAU2
0.6
#137
Output TPS
0.0
#468

Capabilities

Input3/5
Text
Image
Audio
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities6/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs
Native JSON Schema
Web Search
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching
Assistant Prefill·

Pricing by Provider

ProviderStandardBatch
Input
$ / 1M
Output
$ / 1M
Cache Read
$ / 1M
Input
$ / 1M
Output
$ / 1M
Cache Read
$ / 1M
xAI logo
xAI
grok-4-1-fast
$1.25$2.50$0.200$0.625$1.25$0.100
Oracle Cloud (OCI) logo
Oracle Cloud (OCI)
oci/xai.grok-4.1-fast
$5.00$25.00N/A

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Grok 4.31.0M$1.25$2.50Available
Grok 4.202.0M$1.25$2.50Available
Grok 4.20 Multi-Agent2.0M$1.25$2.50Available
Grok 4.20 Multi-Agent Beta2.0M$1.25$2.50Available
Grok 4.20 Non-Reasoning2.0M$1.25$2.50Available
Grok 4.20 Reasoning2.0M$1.25$2.50Available
Grok 4.1 Fast2.0M$1.25$2.50Current
Grok 4 Fast131K$0.200$0.500Deprecated
Grok 4 Fast Non-Reasoning2.0M$0.200$0.500Available
Grok 4256K$1.25$2.50Deprecated
Grok 4.1 Reasoning2.0M$0.200$0.500Available

Model IDs