Llama3.1 405B Instruct Fp8

Llama3.1 405B Instruct Fp8 is a text model served by Lambda with a 131K-token context window and a maximum output of 131K tokens. On Lambda it is priced at $0.80 per million input tokens and $0.80 per million output tokens; the lowest-priced listed provider for this model is Fireworks AI, at $0.10 per million tokens for both input and output.

Capabilities

Vision · Function Calling · Reasoning · JSON Schema · System Messages · Web Search · Prompt Caching · Audio Input · Audio Output

Specifications

Model Key: lambda_ai/llama3.1-405b-instruct-fp8
Provider: Lambda
Provider ID: lambda_ai
Mode: Text
Canonical Name: llama-3.1-405b
Context Window: 131K tokens
Max Output: 131K tokens

Pricing

Type | Per 1K Tokens | Per 1M Tokens
Input Tokens | $0.000800 | $0.800
Output Tokens | $0.000800 | $0.800
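With per-token prices this small, it is easy to misplace a decimal when budgeting. A minimal sketch (plain Python, with Lambda's listed rates hard-coded as constants) of estimating the cost of one request:

```python
# Lambda's listed rates for lambda_ai/llama3.1-405b-instruct-fp8, USD per 1M tokens.
INPUT_PRICE_PER_M = 0.80
OUTPUT_PRICE_PER_M = 0.80

def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of a single request at the listed rates."""
    return (input_tokens * INPUT_PRICE_PER_M
            + output_tokens * OUTPUT_PRICE_PER_M) / 1_000_000

# e.g. a request with 2,000 prompt tokens and 500 completion tokens:
print(f"${request_cost(2_000, 500):.6f}")  # $0.002000
```

At these symmetric rates, one million tokens in plus one million tokens out comes to $1.60 total.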

Benchmarks

Benchmark | Score | Rank
Intelligence Index | 17.4 | #107
Coding Index | 14.5 | #100
Math Index | 3.0 | #134
MMLU-Pro | 0.7 | #94
GPQA | 0.5 | #129
HLE | 0.0 | #161
LiveCodeBench | 0.3 | #107
AIME | 0.2 | #63
IFBench | 0.4 | #87
Time to First Token | 0.52s | #132
SciCode | 0.3 | #100
MATH-500 | 0.7 | #93
AIME 2025 | 0.0 | #134
LCR | 0.2 | #93
TerminalBench Hard | 0.1 | #87
TAU2 | 0.2 | #126

Price Comparison by Provider

Compare prices for Llama3.1 405B Instruct Fp8 across different providers. The same model may be available through multiple providers at different price points.

Provider | Model Key | Input Price, $/1M | Output Price, $/1M
Vertex AI (Llama) | vertex_ai/meta/llama-3.1-405b-instruct-maas | 5.00 | 16.00
Together AI | together_ai/meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo | 3.50 | 3.50
Snowflake | snowflake/snowflake-llama-3.1-405b | N/A | N/A
SambaNova | sambanova/Meta-Llama-3.1-405B-Instruct | 5.00 | 10.00
Oracle Cloud (OCI) | oci/meta.llama-3.1-405b-instruct | 10.68 | 10.68
Nebius | nebius/meta-llama/Meta-Llama-3.1-405B-Instruct | 1.00 | 3.00
AWS Bedrock | meta.llama3-1-405b-instruct-v1:0 | 5.32 | 16.00
Lambda | lambda_ai/llama3.1-405b-instruct-fp8 | 0.800 | 0.800
Hyperbolic | hyperbolic/meta-llama/Meta-Llama-3.1-405B-Instruct | 0.120 | 0.300
Fireworks AI | fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct-long | 0.100 | 0.100
Databricks | databricks/databricks-meta-llama-3-1-405b-instruct | 5.00 | 15.00
Azure AI | azure_ai/Meta-Llama-3.1-405B-Instruct | 5.33 | 16.00
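Because input and output prices differ per provider, the cheapest choice depends on your traffic mix. A sketch that ranks providers by a blended per-million-token price, assuming a fixed fraction of output tokens (prices copied from the table above; Snowflake is skipped because its prices are listed as N/A):

```python
# USD per 1M tokens, (input, output), copied from the comparison table.
PROVIDERS = {
    "Vertex AI (Llama)": (5.00, 16.00),
    "Together AI": (3.50, 3.50),
    "SambaNova": (5.00, 10.00),
    "Oracle Cloud (OCI)": (10.68, 10.68),
    "Nebius": (1.00, 3.00),
    "AWS Bedrock": (5.32, 16.00),
    "Lambda": (0.80, 0.80),
    "Hyperbolic": (0.12, 0.30),
    "Fireworks AI": (0.10, 0.10),
    "Databricks": (5.00, 15.00),
    "Azure AI": (5.33, 16.00),
}

def blended_price(in_price: float, out_price: float,
                  out_ratio: float = 0.25) -> float:
    """Cost per 1M total tokens when out_ratio of the tokens are output."""
    return (1 - out_ratio) * in_price + out_ratio * out_price

cheapest = min(PROVIDERS, key=lambda p: blended_price(*PROVIDERS[p]))
print(cheapest)  # Fireworks AI
```

Note that raw price is only one axis: as the variants table below shows, the cheapest endpoints often trade away context length or function calling.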

All Variants

All available versions, regions, and API endpoints for Llama3.1 405B Instruct Fp8.

Model Key | Provider | Mode | Input $/1M | Output $/1M | Context | Max Output | Vision | Functions
meta.llama3-1-405b-instruct-v1:0 | AWS Bedrock | Text | 5.32 | 16.00 | 128K | 4K | no | yes
us.meta.llama3-1-405b-instruct-v1:0 | AWS Bedrock | Text | 5.32 | 16.00 | 128K | 4K | no | yes
azure_ai/Meta-Llama-3.1-405B-Instruct | Azure AI | Text | 5.33 | 16.00 | 128K | 2K | no | no
databricks/databricks-meta-llama-3-1-405b-instruct | Databricks | Text | 5.00 | 15.00 | 128K | 128K | no | no
fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct | Fireworks AI | Text | 3.00 | 3.00 | 128K | 16K | no | yes
fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct-long | Fireworks AI | Text | 0.100 | 0.100 | 4K | 4K | no | no
hyperbolic/meta-llama/Meta-Llama-3.1-405B-Instruct | Hyperbolic | Text | 0.120 | 0.300 | 33K | 33K | no | yes
lambda_ai/llama3.1-405b-instruct-fp8 | Lambda | Text | 0.800 | 0.800 | 131K | 131K | no | yes
nebius/meta-llama/Meta-Llama-3.1-405B-Instruct | Nebius | Text | 1.00 | 3.00 | 128K | 128K | no | yes
oci/meta.llama-3.1-405b-instruct | Oracle Cloud (OCI) | Text | 10.68 | 10.68 | 128K | 4K | no | yes
sambanova/Meta-Llama-3.1-405B-Instruct | SambaNova | Text | 5.00 | 10.00 | 16K | 16K | no | yes
snowflake/llama3.1-405b | Snowflake | Text | N/A | N/A | 128K | 8K | no | no
snowflake/snowflake-llama-3.1-405b | Snowflake | Text | N/A | N/A | 8K | 8K | no | no
together_ai/meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo | Together AI | Text | 3.50 | 3.50 | N/A | N/A | no | yes
vertex_ai/meta/llama-3.1-405b-instruct-maas | Vertex AI (Llama) | Text | 5.00 | 16.00 | 128K | 2K | yes | no
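When selecting a variant programmatically, you typically filter on exactly these columns. A sketch over a few representative rows (model keys and values copied from the table; the `usable` helper is hypothetical, not part of any provider SDK):

```python
# A few rows from the variants table:
# (model_key, context_tokens, max_output_tokens, supports_functions)
VARIANTS = [
    ("lambda_ai/llama3.1-405b-instruct-fp8", 131_000, 131_000, True),
    ("fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct-long",
     4_000, 4_000, False),
    ("hyperbolic/meta-llama/Meta-Llama-3.1-405B-Instruct", 33_000, 33_000, True),
    ("azure_ai/Meta-Llama-3.1-405B-Instruct", 128_000, 2_000, False),
]

def usable(min_context: int, need_functions: bool) -> list[str]:
    """Return model keys whose context window and tool support fit the request."""
    return [key for key, ctx, _max_out, fns in VARIANTS
            if ctx >= min_context and (fns or not need_functions)]

print(usable(min_context=100_000, need_functions=True))
# ['lambda_ai/llama3.1-405b-instruct-fp8']
```

Dropping the function-calling requirement also admits the Azure AI variant, since its 128K context clears the threshold even though its max output is only 2K.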