Llama 4 Maverick 17B 128e Instruct Maas

Llama 4 Maverick 17B 128e Instruct Maas is a text model from Vertex AI (Llama) with a context window of 1.0M tokens and max output of 1.0M tokens. Pricing starts at 0.35 per million input tokens and 1.15 per million output tokens (cheapest at Novita AI).

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Specifications

Model Keyvertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas
ProviderVertex AI (Llama)
Provider IDvertex_ai-llama_models
ModeText
Canonical Namellama-maverick-4-17b-128e
Context Window1.0M tokens
Max Output1.0M tokens

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens0.0003500.350
Output Tokens0.00111.15

Benchmarks

No benchmark data is available for this model.

Price Comparison by Provider

Compare prices for Llama 4 Maverick 17B 128e Instruct Maas across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price, $
Output Price, $
Vertex AI (Llama)vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas0.3501.15
Novita AInovita/meta-llama/llama-4-maverick-17b-128e-instruct-fp80.2700.850

All Variants

All available versions, regions, and API endpoints for Llama 4 Maverick 17B 128e Instruct Maas.

Model Key
Provider
Mode
Input Price, $
Output Price, $
Context
Max Output
Vision
Functions
novita/meta-llama/llama-4-maverick-17b-128e-instruct-fp8Novita AIText0.2700.8501.0M8Kyesno
vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maasVertex AI (Llama)Text0.3501.151.0M1.0Mnoyes