Llama 4 Scout

Llama 4 Scout is a text model available through Vercel AI Gateway, with a context window of 131K tokens and a maximum output of 8K tokens. Pricing starts at 0.10 per million input tokens and 0.30 per million output tokens.

Capabilities

Vision, Function Calling, Reasoning, JSON Schema, System Messages, Web Search, Prompt Caching, Audio Input, Audio Output

Specifications

Model Key: vercel_ai_gateway/meta/llama-4-scout
Provider: Vercel AI Gateway
Provider ID: vercel_ai_gateway
Mode: Text
Canonical Name: llama-scout-4
Context Window: 131K tokens
Max Output: 8K tokens
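
As a rough illustration of how the two limits above interact, the following sketch checks whether a request fits. It assumes that output tokens count against the context window, and it treats "131K" and "8K" as 131,000 and 8,000; the exact limits may differ. The function name `fits_budget` is hypothetical and not part of any gateway API.

```python
# Limits as listed in the specifications. "131K" and "8K" are rounded
# figures, so the exact values used here are assumptions.
CONTEXT_WINDOW = 131_000
MAX_OUTPUT = 8_000


def fits_budget(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if the request stays within both listed limits.

    Assumes prompt and output tokens share the same context window.
    """
    if requested_output_tokens > MAX_OUTPUT:
        return False
    return prompt_tokens + requested_output_tokens <= CONTEXT_WINDOW
```

Under these assumptions, a 100,000-token prompt with the full 8,000-token output fits, while a 125,000-token prompt with the same output does not.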

Pricing

Type            Per 1K Tokens   Per 1M Tokens
Input Tokens    0.000100        0.100
Output Tokens   0.000300        0.300
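
The per-million rates above translate into a per-request cost with simple arithmetic. The sketch below is a minimal estimate that assumes flat pricing at the listed rates (no caching discounts or tiers); the function name `estimate_cost` is hypothetical, and the result is in whatever currency unit the listed prices use.

```python
# Listed rates per 1M tokens (currency unit as given on the page).
INPUT_PRICE_PER_1M = 0.10
OUTPUT_PRICE_PER_1M = 0.30


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request at the listed flat rates."""
    total = input_tokens * INPUT_PRICE_PER_1M + output_tokens * OUTPUT_PRICE_PER_1M
    return total / 1_000_000
```

For example, a request with 50,000 input tokens and 2,000 output tokens comes to roughly 0.0056 under these assumptions.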

Benchmarks

Intelligence Index: 13.5 (#144)
Coding Index: 6.7 (#152)
Math Index: 14.0 (#109)
MMLU-Pro: 0.8 (#76)
GPQA: 0.6 (#106)
HLE: 0.0 (#151)
LiveCodeBench: 0.3 (#109)
AIME: 0.3 (#51)
IFBench: 0.4 (#82)
Time to First Token: 0.46s (#122)
SciCode: 0.2 (#174)
MATH-500: 0.8 (#59)
AIME 2025: 0.1 (#109)
LCR: 0.3 (#91)
TerminalBench Hard: 0.0 (#133)
TAU2: 0.2 (#134)