Llama 4 Maverick 17B 128e Instruct Fp8

Llama 4 Maverick 17B 128e Instruct Fp8 is a text model from Novita AI with a context window of 1.0M tokens and max output of 8K tokens. Pricing starts at 0.27 per million input tokens and 0.85 per million output tokens (cheapest at Novita AI).

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Specifications

Model Keynovita/meta-llama/llama-4-maverick-17b-128e-instruct-fp8
ProviderNovita AI
Provider IDnovita
ModeText
Canonical Namellama-maverick-4-17b-128e
Context Window1.0M tokens
Max Output8K tokens

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens0.0002700.270
Output Tokens0.0008500.850

Benchmarks

No benchmark data is available for this model.

Price Comparison by Provider

Compare prices for Llama 4 Maverick 17B 128e Instruct Fp8 across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price, $
Output Price, $
Vertex AI (Llama)vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas0.3501.15
Novita AInovita/meta-llama/llama-4-maverick-17b-128e-instruct-fp80.2700.850

All Variants

All available versions, regions, and API endpoints for Llama 4 Maverick 17B 128e Instruct Fp8.

Model Key
Provider
Mode
Input Price, $
Output Price, $
Context
Max Output
Vision
Functions
novita/meta-llama/llama-4-maverick-17b-128e-instruct-fp8Novita AIText0.2700.8501.0M8Kyesno
vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maasVertex AI (Llama)Text0.3501.151.0M1.0Mnoyes