Llama 4 Maverick 17 b 128 e Instruct Maas Pricing & Specs | AI Models

Llama 4 Maverick 17B 128e Instruct Maas is a text model from Vertex AI (Llama) with a context window of 1.0M tokens and max output of 1.0M tokens. Pricing starts at 0.35 per million input tokens and 1.15 per million output tokens (cheapest at Novita AI).

Capabilities

✗ Vision✓ Function Calling✗ Reasoning✗ JSON Schema✗ System Messages✗ Web Search✗ Prompt Caching✗ Audio Input✗ Audio Output

Specifications

Model Key	`vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas`
Provider	Vertex AI (Llama)
Provider ID	vertex_ai-llama_models
Mode	Text
Canonical Name	llama-maverick-4-17b-128e
Context Window	1.0M tokens
Max Output	1.0M tokens

Pricing

Type	Per 1K Tokens	Per 1M Tokens
Input Tokens	0.000350	0.350
Output Tokens	0.0011	1.15

Benchmarks

No benchmark data is available for this model.

Price Comparison by Provider

Compare prices for Llama 4 Maverick 17B 128e Instruct Maas across different providers. The same model may be available through multiple providers at different price points.

Provider	Model Key	Input Price, $	Output Price, $
Vertex AI (Llama)	vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas	0.350	1.15
Novita AI	novita/meta-llama/llama-4-maverick-17b-128e-instruct-fp8	0.270	0.850

All Variants

All available versions, regions, and API endpoints for Llama 4 Maverick 17B 128e Instruct Maas.

Model Key	Provider	Mode	Input Price, $	Output Price, $	Context	Max Output	Vision	Functions
novita/meta-llama/llama-4-maverick-17b-128e-instruct-fp8	Novita AI	Text	0.270	0.850	1.0M	8K	yes	no
vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas	Vertex AI (Llama)	Text	0.350	1.15	1.0M	1.0M	no	yes

← Back to All Models