Llama 4 17B 128E Instruct is Meta's language model with a 1.0M-token context window and up to 8K output tokens, priced from $0.72 per 1M input tokens and $0.72 per 1M output tokens. It is an instruction-tuned mixture-of-experts (MoE) model with 17B active parameters and 128 experts, optimized for multimodal vision-language tasks.
Specifications
Canonical ID: meta-llama-4-17b-128e-instruct
Type: Language
Status: Active
Creator: Meta
Providers: Oracle Cloud (OCI)
Context Window: 1.0M tokens
Max Output: 8K tokens
Input Modalities: Image, Text
Output Modalities: Text
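
The limits above can be turned into a simple pre-flight check before sending a request. The following is a minimal sketch, not a provider API: the function name `fits_budget` is hypothetical, the exact token limits are assumptions (the page only shows the rounded values "1.0M" and "8K"), and it further assumes that output tokens count against the context window, which may differ by provider.

```python
# Assumed exact limits: the page shows only rounded figures.
CONTEXT_WINDOW_TOKENS = 1_000_000  # assumption: "1.0M" taken literally
MAX_OUTPUT_TOKENS = 8_000          # assumption: "8K" taken literally


def fits_budget(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if a request stays within the assumed limits.

    Assumes prompt and output tokens share one context window.
    """
    if prompt_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # The output cap applies regardless of how small the prompt is.
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        return False
    return prompt_tokens + requested_output_tokens <= CONTEXT_WINDOW_TOKENS
```

For example, under these assumptions a 900,000-token prompt with 8,000 requested output tokens fits, while a 995,000-token prompt with the same output request does not.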

Capabilities

Input (2/5)
Supported: Text, Image
Not supported: Audio, Video, PDF

Output (1/5)
Supported: Text
Not supported: Image, Audio, Video, Embedding

Capabilities (1/13)
Supported: Function Calling
Not supported: Reasoning, Adaptive Reasoning, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Pricing by Provider

Currency: US Dollar ($), per 1M tokens
Provider: Oracle Cloud (OCI)
Provider Model ID: oci/meta.llama-4-maverick-17b-128e-instruct-fp8
Input (Standard): $0.72 / 1M tokens
Output (Standard): $0.72 / 1M tokens
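
As a rough illustration of how the per-1M-token rates translate into a per-request cost, here is a minimal sketch. The function name `estimate_cost` is hypothetical, and the rates are the Oracle Cloud (OCI) figures listed on this page; actual billing may round or meter differently.

```python
from decimal import Decimal

# Listed Oracle Cloud (OCI) standard rates, in USD per 1M tokens.
INPUT_USD_PER_1M = Decimal("0.72")
OUTPUT_USD_PER_1M = Decimal("0.72")
TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request at the listed rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Scale each token count to units of 1M tokens, then apply its rate.
    input_cost = Decimal(input_tokens) / TOKENS_PER_UNIT * INPUT_USD_PER_1M
    output_cost = Decimal(output_tokens) / TOKENS_PER_UNIT * OUTPUT_USD_PER_1M
    return input_cost + output_cost
```

With these rates, a request with 200,000 input tokens and 8,000 output tokens comes to roughly $0.15. `Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding noise.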

Versions

Version                      Context   Input / 1M   Output / 1M   Status
Llama 4 17B 128E Instruct    1.0M      $0.720       $0.720        Current
Llama 3.3 70B Instruct       131K      $0.100       $0.200        Available
Llama 3.2 3B Instruct        131K      $0.015       $0.020        Deprecated
Llama 3.2 1B Instruct        131K      $0.027       $0.080        Deprecated
Llama 3.2 11B                128K      $0.160       $0.160        Available
Llama 3.1 405B Instruct      131K      $0.120       $0.300        Deprecating
Llama 3.1 70B Instruct       131K      $0.120       $0.300        Available
Llama 3.1 8B Instruct        200K      $0.020       $0.030        Available
Llama 3.1 70B                128K      $0.360       $0.360        Available
Llama 3.1 8B                 131K      $0.030       $0.050        Available
Llama 3 70B Instruct         131K      $0.120       $0.300        Deprecated

Model IDs

meta-llama-4-17b-128e-instruct
oci/meta.llama-4-maverick-17b-128e-instruct-fp8