Llama 4 Maverick 17B 128e Instruct Maas
Llama 4 Maverick 17B 128e Instruct Maas is a text model from Vertex AI (Llama) with a context window of 1.0M tokens and a max output of 1.0M tokens. On Vertex AI, pricing is $0.35 per million input tokens and $1.15 per million output tokens. The cheapest listed provider is Novita AI, at $0.27 per million input tokens and $0.85 per million output tokens.
Capabilities
✗ Vision · ✓ Function Calling · ✗ Reasoning · ✗ JSON Schema · ✗ System Messages · ✗ Web Search · ✗ Prompt Caching · ✗ Audio Input · ✗ Audio Output
Specifications
| Specification | Value |
|---|---|
| Model Key | vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas |
| Provider | Vertex AI (Llama) |
| Provider ID | vertex_ai-llama_models |
| Mode | Text |
| Canonical Name | llama-maverick-4-17b-128e |
| Context Window | 1.0M tokens |
| Max Output | 1.0M tokens |
Pricing
| Type | Per 1K Tokens, $ | Per 1M Tokens, $ |
|---|---|---|
| Input Tokens | 0.000350 | 0.350 |
| Output Tokens | 0.00115 | 1.15 |
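
As a rough illustration of how the per-million rates translate into a per-request cost, here is a minimal sketch. It assumes the Vertex AI rates from the table above and simple linear billing with no minimum charge or discounts. The helper name `estimate_cost` is hypothetical and not part of any provider SDK.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the pricing table above (Vertex AI).
INPUT_PRICE_PER_M = Decimal("0.35")
OUTPUT_PRICE_PER_M = Decimal("1.15")


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes cost scales linearly with token counts.
    """
    million = Decimal(1_000_000)
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / million


# Example: a request with 10,000 input tokens and 2,000 output tokens.
cost = estimate_cost(10_000, 2_000)
print(f"Estimated cost: ${cost}")
```

`Decimal` is used instead of `float` so that small per-token amounts do not pick up binary rounding noise when summed over many requests.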
Benchmarks
No benchmark data is available for this model.
Price Comparison by Provider
Compare prices for Llama 4 Maverick 17B 128e Instruct Maas across different providers. The same model may be available through multiple providers at different price points.
| Provider | Model Key | Input Price, $ | Output Price, $ |
|---|---|---|---|
| Vertex AI (Llama) | vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas | 0.350 | 1.15 |
| Novita AI | novita/meta-llama/llama-4-maverick-17b-128e-instruct-fp8 | 0.270 | 0.850 |
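
To compare providers for a given workload, the listed prices can be applied to the expected token counts. The sketch below assumes the prices are in USD per 1M tokens, as in the table above. The function name `cheapest_provider` is hypothetical, and the comparison looks at price only.

```python
# USD per 1M tokens as (input, output), copied from the comparison table above.
PRICES = {
    "vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas": (0.35, 1.15),
    "novita/meta-llama/llama-4-maverick-17b-128e-instruct-fp8": (0.27, 0.85),
}


def cheapest_provider(input_tokens: int, output_tokens: int) -> tuple[str, float]:
    """Return (model_key, cost_in_usd) for the lowest-cost listed provider.

    Hypothetical helper: considers price only, not features or limits.
    """
    def cost(key: str) -> float:
        input_price, output_price = PRICES[key]
        return (input_tokens * input_price + output_tokens * output_price) / 1_000_000

    best_key = min(PRICES, key=cost)
    return best_key, cost(best_key)


key, usd = cheapest_provider(input_tokens=1_000_000, output_tokens=1_000_000)
print(key, round(usd, 4))
```

Price is not the only difference between the listed variants: they also differ in max output (8K vs 1.0M), vision, and function-calling support, so the cheapest key may not suit every use case.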
All Variants
All available versions, regions, and API endpoints for Llama 4 Maverick 17B 128e Instruct Maas.
| Model Key | Provider | Mode | Input Price, $ | Output Price, $ | Context | Max Output | Vision | Functions |
|---|---|---|---|---|---|---|---|---|
| novita/meta-llama/llama-4-maverick-17b-128e-instruct-fp8 | Novita AI | Text | 0.270 | 0.850 | 1.0M | 8K | yes | no |
| vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas | Vertex AI (Llama) | Text | 0.350 | 1.15 | 1.0M | 1.0M | no | yes |