Llama 3.1 405B Instruct is Meta's 405B-parameter instruction-tuned language model, optimized for following complex instructions, with FP8 quantization for efficient large-scale inference. It has a 131K-token context window and up to 16K output tokens. It is available from 11 providers, with prices starting at $0.12 per 1M input tokens and $0.30 per 1M output tokens.
Specifications
Canonical ID: meta-llama-3-1-405b-instruct
Type: Language
Status: Deprecating
Creator: Meta
Providers: 11
Context Window: 131K tokens
Max Output: 16K tokens
Input Modalities: Image, Text
Output Modalities: Text
Parameters: 405B
Release Date: 2 years ago
Deprecation Date: not listed
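The context window and output cap above together bound how many tokens a single request can generate. The following is a minimal sketch of that arithmetic, not a provider API. It assumes that "131K" means 131,072 tokens, that "16K" means 16,384 tokens, and that the prompt and the completion share one context window; the function name `max_completion_tokens` is hypothetical, and individual providers may enforce different limits.

```python
# Assumption: "131K" and "16K" on this page mean 128 * 1024 and 16 * 1024 tokens.
CONTEXT_WINDOW = 131_072
MAX_OUTPUT = 16_384


def max_completion_tokens(prompt_tokens: int,
                          context_window: int = CONTEXT_WINDOW,
                          max_output: int = MAX_OUTPUT) -> int:
    """Return the largest completion size that fits after the prompt.

    Assumes the prompt and the completion share the same context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = context_window - prompt_tokens
    # The completion is capped by both the output limit and the space left.
    return max(0, min(max_output, remaining))
```

Under these assumptions, a short prompt is limited by the output cap, while a prompt close to the context window is limited by the remaining space.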
Benchmarks
Benchmark | Score | Rank
Intelligence Index | 8.5 | #320
Math Index | 3.0 | #249
MMLU-Pro | 0.7 | #181
GPQA | 0.5 | #316
HLE | 0.0 | #381
LiveCodeBench | 0.3 | #204
AIME | 0.2 | #92
IFBench | 0.4 | #244
Time to First Token | 0.65s | #294
SciCode | 0.3 | #232
MATH-500 | 0.7 | #126
AIME 2025 | 0.0 | #249
LCR | 0.2 | #234
TerminalBench Hard | 0.1 | #234
TAU2 | 0.2 | #321
Output TPS | 43.6 | #238

Capabilities

Input (2/5 supported)
Supported: Text, Image
Not supported: Audio, Video, PDF

Output (1/5 supported)
Supported: Text
Not supported: Image, Audio, Video, Embedding

Capabilities (3/13 supported)
Supported: Function Calling, Parallel Function Calling, Structured Outputs
Not supported: Reasoning, Adaptive Reasoning, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Pricing by Provider

Prices in US dollars ($) per 1M tokens. A dash means no batch price is listed.

Provider | Model ID | Standard Input | Standard Output | Batch Input | Batch Output
Amazon Bedrock | meta.llama3-1-405b-instruct-v1:0 | $5.32 | $16.00 | $1.20 | $1.20
Azure AI Foundry | azure_ai/Meta-Llama-3.1-405B-Instruct | $5.33 | $16.00 | - | -
Databricks | databricks/databricks-meta-llama-3-1-405b-instruct | $5.00 | $15.00 | - | -
Fireworks AI | fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct | $3.00 | $3.00 | - | -
Google Vertex AI | vertex_ai/meta/llama-3.1-405b-instruct-maas | $5.00 | $16.00 | - | -
Hyperbolic | hyperbolic/meta-llama/Meta-Llama-3.1-405B-Instruct | $0.12 | $0.30 | - | -
Lambda | lambda_ai/llama3.1-405b-instruct-fp8 | $0.80 | $0.80 | - | -
Nebius | nebius/meta-llama/Meta-Llama-3.1-405B-Instruct | $1.00 | $3.00 | - | -
Oracle Cloud (OCI) | oci/meta.llama-3.1-405b-instruct | $10.68 | $10.68 | - | -
SambaNova | sambanova/Meta-Llama-3.1-405B-Instruct | $5.00 | $10.00 | - | -
Together AI | together_ai/meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo | $3.50 | $3.50 | - | -
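Because the prices above are quoted per 1M tokens, the cost of a request is each token count times its price, divided by one million. The following is a minimal sketch of that calculation using three of the standard (non-batch) prices listed above. The names `PRICES_PER_MILLION` and `estimate_cost` are hypothetical, and actual billing may differ (rounding, minimum charges, or price changes are not modeled).

```python
from decimal import Decimal

# Standard USD prices per 1M tokens as (input, output), copied from the
# provider prices listed above. Only a subset is included for illustration.
PRICES_PER_MILLION = {
    "Amazon Bedrock": (Decimal("5.32"), Decimal("16.00")),
    "Fireworks AI": (Decimal("3.00"), Decimal("3.00")),
    "Hyperbolic": (Decimal("0.12"), Decimal("0.30")),
}

TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request at the listed standard prices."""
    input_price, output_price = PRICES_PER_MILLION[provider]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_PRICE_UNIT
```

For example, `estimate_cost("Amazon Bedrock", 2_000, 500)` evaluates to 0.01864, that is, about 1.9 cents for a 2,000-token prompt with a 500-token reply. `Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding error.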

Versions

Version | Released | Context | Input / 1M | Output / 1M | Status
Llama 3.3 70B Instruct | - | 131K | $0.100 | $0.200 | Available
Llama 3.2 3B Instruct | - | 131K | $0.015 | $0.020 | Deprecated
Llama 3.2 1B Instruct | - | 131K | $0.027 | $0.080 | Deprecated
Llama 3.2 11B | - | 128K | $0.160 | $0.160 | Available
Llama 3.1 405B Instruct | - | 131K | $0.120 | $0.300 | Current
Llama 3.1 70B Instruct | - | 131K | $0.120 | $0.300 | Available
Llama 3.1 8B Instruct | - | 200K | $0.020 | $0.030 | Available
Llama 3.1 70B | - | 128K | $0.360 | $0.360 | Available
Llama 3.1 8B | - | 131K | $0.030 | $0.050 | Available
Llama 3 70B Instruct | - | 131K | $0.120 | $0.300 | Deprecated
Llama 3 8B Instruct | - | 32K | $0.030 | $0.040 | Available

Model IDs

accounts/fireworks/models/llama-v3p1-405b-instruct
azure_ai/Meta-Llama-3.1-405B-Instruct
databricks/databricks-meta-llama-3-1-405b-instruct
fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct
hyperbolic/meta-llama/Meta-Llama-3.1-405B-Instruct
lambda_ai/llama3.1-405b-instruct-fp8
llama-3-1-instruct-405b
meta-llama-3-1-405b-instruct
meta-textgeneration-llama-3-1-405b-instruct-fp8
meta.llama3-1-405b-instruct-v1:0
nebius/meta-llama/Meta-Llama-3.1-405B-Instruct
oci/meta.llama-3.1-405b-instruct
sambanova/Meta-Llama-3.1-405B-Instruct
together_ai/meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
us.meta.llama3-1-405b-instruct-v1:0
vertex_ai/meta/llama-3.1-405b-instruct-maas