Llama 4 Maverick 17B 128E Instruct

SambaNovaText

Llama 4 Maverick 17B 128E Instruct is a text model from SambaNova with a context window of 131K tokens and max output of 131K tokens. Pricing starts at $0.63 per million input tokens and $1.80 per million output tokens (cheapest at Lambda Ai).

Specifications

Model Keysambanova/Llama-4-Maverick-17B-128E-Instruct
ProviderSambaNova
LiteLLM Providersambanova
ModeText
Canonical Namellama-4-maverick-17b-128e
Context Window131K tokens
Max Output131K tokens

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens$0.000630$0.630
Output Tokens$0.0018$1.80

Price Comparison by Provider

Compare prices for Llama 4 Maverick 17B 128E Instruct across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price
Output Price
SambaNovasambanova/Llama-4-Maverick-17B-128E-Instruct$0.630$1.80
Lambda Ailambda_ai/llama-4-maverick-17b-128e-instruct-fp8$0.050$0.100
Novitanovita/meta-llama/llama-4-maverick-17b-128e-instruct-fp8$0.270$0.850
Ocioci/meta.llama-4-maverick-17b-128e-instruct-fp8$0.720$0.720

Similar Models

Models with similar capabilities and context window size.

Model
Provider
Mode
Input Price
Output Price
Context
Max Output
Vision
Functions
Gemma 3 27B ItGoogle GeminiTextN/AN/A131K8Kyesyes
GPT-oss-120b-mxfp-GGUFLemonadeTextN/AN/A131K33Knoyes
GPT-oss-20bOpenRouterText$0.020$0.100131K33Knoyes
GPT-oss-20b-mxfp4-GGUFLemonadeTextN/AN/A131K33Knoyes
GPT-oss:120b-cloudOllamaTextN/AN/A131K131Knoyes
GPT-oss:20b-cloudOllamaTextN/AN/A131K131Knoyes
Llama 3.2 3B InstructDeepinfraText$0.020$0.020131K131Knono
Llama3.2 11B Vision InstructLambda AiText$0.015$0.025131K131Kyesyes
Llama3.2 3B InstructLambda AiText$0.015$0.025131K131Knoyes
Meta Llama 3.1 8B Instruct TurboDeepinfraText$0.020$0.030131K131Knono