Grok Imagine Video is xAI's video generation model. It produces video from text prompts or images, supports audio, and balances quality, cost, and latency.
Specifications
Canonical ID: xai-grok-imagine-video
Type: Video Generation
Status: Active
Creator: xAI
Providers
Input Modalities: Text
Output Modalities: Video
Release Date: 4 months ago

Capabilities

Input (1/5 supported)
Supported: Text
Not supported: Image, Audio, Video, PDF

Output (1/5 supported)
Supported: Video
Not supported: Text, Image, Audio, Embedding

Capabilities (0/13 supported)
Not supported: Reasoning, Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Pricing by Provider

Prices in US Dollar ($), per 1M tokens

Provider | Standard
Vercel AI Gateway (xai/grok-imagine-video) |

Versions

Version | Released | Context | Input / 1M | Output / 1M | Status
Grok 4.3 | | 1.0M | $1.25 | $2.50 | Available
Grok 4.20 | | 2.0M | $1.25 | $2.50 | Available
Grok 4.20 Multi-Agent | | 2.0M | $1.25 | $2.50 | Available
Grok 4.20 Multi-Agent Beta | | 2.0M | $1.25 | $2.50 | Available
Grok 4.20 Non-Reasoning | | 2.0M | $1.25 | $2.50 | Available
Grok 4.20 Reasoning | | 2.0M | $1.25 | $2.50 | Available
Grok 4.1 Fast | | 2.0M | $1.25 | $2.50 | Deprecated
Grok 4 Fast | | 131K | $0.200 | $0.500 | Deprecated
Grok 4 Fast Non-Reasoning | | 2.0M | $0.200 | $0.500 | Deprecated
Grok 4 | | 256K | $1.25 | $2.50 | Deprecated
Grok Imagine Video | | | | | Current
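
The per-1M-token prices listed for the text models can be turned into a request cost with simple arithmetic. The sketch below is illustrative only: `PRICES_PER_MILLION` and `estimate_cost` are hypothetical names, only two rows are copied from the listing, and Grok Imagine Video is left out because no per-token price is shown for it.

```python
from decimal import Decimal

# USD per 1M tokens as (input, output), copied from two rows of the
# Versions listing. The dictionary and function names are hypothetical.
PRICES_PER_MILLION = {
    "Grok 4.20": (Decimal("1.25"), Decimal("2.50")),
    "Grok 4 Fast": (Decimal("0.200"), Decimal("0.500")),
}

TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(version: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request for a listed version."""
    input_price, output_price = PRICES_PER_MILLION[version]
    total = Decimal(input_tokens) * input_price + Decimal(output_tokens) * output_price
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    # 200,000 input tokens and 50,000 output tokens on Grok 4.20:
    # (200000 * 1.25 + 50000 * 2.50) / 1,000,000 = 0.375 USD
    print(estimate_cost("Grok 4.20", 200_000, 50_000))
```

Decimal is used instead of float so that prices such as $0.200 are represented exactly.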

Model IDs

grok-imagine-video
xai-grok-imagine-video
xai/grok-imagine-video
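
Since the same model appears under three identifiers, client code may want to map any of them to the canonical ID given in the specifications. A minimal sketch, assuming exactly the three IDs listed above; `to_canonical` is a hypothetical helper, not part of any xAI or Vercel SDK.

```python
# Canonical ID and aliases as listed on this page.
CANONICAL_ID = "xai-grok-imagine-video"
KNOWN_IDS = frozenset(
    {
        "grok-imagine-video",      # bare model name
        "xai-grok-imagine-video",  # canonical ID
        "xai/grok-imagine-video",  # provider/model slug
    }
)


def to_canonical(model_id: str) -> str:
    """Map any listed alias to the canonical ID; reject anything else."""
    cleaned = model_id.strip().lower()
    if cleaned not in KNOWN_IDS:
        raise ValueError(f"unknown model ID: {model_id!r}")
    return CANONICAL_ID


if __name__ == "__main__":
    print(to_canonical("xai/grok-imagine-video"))
```

Rejecting unknown strings, rather than guessing from a prefix, keeps a typo from silently resolving to the wrong model.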