Llama 4 Maverick 17B 128E Instruct FP8

Together AIText

Llama 4 Maverick 17B 128E Instruct FP8 is a text model from Together AI. Pricing starts at $0.27 per million input tokens and $0.85 per million output tokens (cheapest at Deepinfra).

Specifications

Model Keytogether_ai/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
ProviderTogether AI
LiteLLM Providertogether_ai
ModeText
Canonical Namellama-maverick-4-17b-128e
Context WindowN/A tokens
Max OutputN/A

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens$0.000270$0.270
Output Tokens$0.000850$0.850

Price Comparison by Provider

Compare prices for Llama 4 Maverick 17B 128E Instruct FP8 across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price
Output Price
Groqgroq/meta-llama/llama-4-maverick-17b-128e-instruct$0.200$0.600
Deepinfradeepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8$0.150$0.600
Together AItogether_ai/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8$0.270$0.850
Google Vertex AIvertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas$0.350$1.15

Similar Models

Models with similar capabilities and context window size.

Model
Provider
Mode
Input Price
Output Price
Context
Max Output
Vision
Functions
DeepSeek R1 Distill Llama 8BNscaleText$0.025$0.025N/AN/Anono
DeepSeek R1 Distill Qwen 14BNscaleText$0.070$0.070N/AN/Anono
GPT-5 nanoReplicateText$0.050$0.400N/AN/Anoyes
Granite 3.3 8B InstructReplicateText$0.030$0.250N/AN/Anoyes
Llama 3.1 8B InstructNscaleText$0.030$0.030N/AN/Anono
Llama 3.3 70B Instruct Turbo FreeTogether AITextN/AN/AN/AN/Anoyes
Qwen2.5 Coder 32B InstructNscaleText$0.060$0.200N/AN/Anono
Qwen2.5 Coder 3B InstructNscaleText$0.010$0.030N/AN/Anono
Qwen2.5 Coder 7B InstructNscaleText$0.010$0.030N/AN/Anono
Titan Embed Text V2Vercel Ai GatewayText$0.020N/AN/AN/Anono