Llama 4 Maverick 17B 128E Instruct FP8
Meta LlamaText
Llama 4 Maverick 17B 128E Instruct FP8 is a text model from Meta Llama with a context window of 1.0M tokens and max output of 4K tokens.
Specifications
| Model Key | meta_llama/Llama-4-Maverick-17B-128E-Instruct-FP8 |
| Provider | Meta Llama |
| LiteLLM Provider | meta_llama |
| Mode | Text |
| Canonical Name | llama-4-maverick-17b |
| Context Window | 1.0M tokens |
| Max Output | 4K tokens |
Capabilities
✗ Vision✓ Function Calling✗ Reasoning✗ JSON Schema✗ System Messages✗ Web Search✗ Prompt Caching✗ Audio Input✗ Audio Output
Pricing
| Type | Per 1K Tokens | Per 1M Tokens |
|---|---|---|
| Input Tokens | N/A | N/A |
| Output Tokens | N/A | N/A |
Price Comparison by Provider
Compare prices for Llama 4 Maverick 17B 128E Instruct FP8 across different providers. The same model may be available through multiple providers at different price points.
Provider | Model Key | Input Price | Output Price |
|---|---|---|---|
| Watsonx | watsonx/meta-llama/llama-4-maverick-17b | $0.350 | $1.40 |
| Meta Llama | meta_llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | N/A | N/A |
| AWS Bedrock | meta.llama4-maverick-17b-instruct-v1:0 | $0.240 | $0.970 |
Similar Models
Models with similar capabilities and context window size.
Model | Provider | Mode | Input Price | Output Price | Context | Max Output | Vision | Functions |
|---|---|---|---|---|---|---|---|---|
| Gemini 1.5 Flash | Google Vertex AI | Text | $0.075 | $0.300 | 1.0M | 8K | yes | yes |
| Gemini 1.5 Flash Preview-0514 | Google Vertex AI | Text | $0.075 | $0.0047 | 1.0M | 8K | yes | yes |
| Gemini 1.5 Flash-001 | Google Vertex AI | Text | $0.075 | $0.300 | 1.0M | 8K | yes | yes |
| Gemini 1.5 Flash-8b-exp-0827 | Google Gemini | Text | N/A | N/A | 1.0M | 8K | yes | yes |
| Gemini 1.5 Flash-exp-0827 | Google Vertex AI | Text | $0.0047 | $0.0047 | 1.0M | 8K | yes | yes |
| Gemini flash Experimental | Google Vertex AI | Text | N/A | N/A | 1.0M | 8K | no | no |
| Gemini pro Experimental | Google Vertex AI | Text | N/A | N/A | 1.0M | 8K | no | no |
| Qwen Turbo | Dashscope | Text | $0.050 | $0.200 | 1.0M | 8K | no | yes |
| Qwen Turbo | Dashscope | Text | $0.050 | $0.200 | 1.0M | 16K | no | yes |
| Qwen Turbo | Dashscope | Text | $0.050 | $0.200 | 1.0M | 16K | no | yes |