Llama3.1 8B Instruct Pricing & Specs | AI Models

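Llama3.1 8B Instruct is a text model offered through Lambda with a context window of 131K tokens and a maximum output of 131K tokens. Lambda charges $0.025 per million input tokens and $0.040 per million output tokens. Among providers with listed prices, the lowest input price is $0.020 per million tokens (DeepInfra, Novita AI, Nebius) and the lowest output price is $0.022 per million tokens (Weights & Biases).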

Capabilities

✗ Vision
✓ Function Calling
✗ Reasoning
✗ JSON Schema
✓ System Messages
✗ Web Search
✗ Prompt Caching
✗ Audio Input
✗ Audio Output

Specifications

Model Key	`lambda_ai/llama3.1-8b-instruct`
Provider	Lambda
Provider ID	lambda_ai
Mode	Text
Canonical Name	llama-3.1-8b
Context Window	131K tokens
Max Output	131K tokens

Pricing

Type	Per 1K Tokens, $	Per 1M Tokens, $
Input Tokens	0.000025	0.025
Output Tokens	0.000040	0.040
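
The per-token prices above can be turned into a per-request estimate. The following is a minimal sketch, assuming the two per-1M-token prices listed for `lambda_ai/llama3.1-8b-instruct`; the helper name `estimate_cost` and the token counts are hypothetical and chosen only for illustration.

```python
# Per-1M-token prices in dollars, copied from the pricing table above.
INPUT_PRICE_PER_1M = 0.025
OUTPUT_PRICE_PER_1M = 0.040


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the dollar cost of one request (hypothetical helper)."""
    total = input_tokens * INPUT_PRICE_PER_1M + output_tokens * OUTPUT_PRICE_PER_1M
    return total / 1_000_000


# Example: a 10,000-token prompt that produces a 2,000-token reply.
cost = estimate_cost(input_tokens=10_000, output_tokens=2_000)
print(f"${cost:.6f}")  # prints $0.000330
```

Real bills may differ if a provider rounds usage, applies minimum charges, or counts tokens with a different tokenizer, so treat the result as an estimate.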

Benchmarks

Benchmark	Score	Rank
Intelligence Index	11.8	#174
Coding Index	4.9	#157
Math Index	4.3	#130
MMLU-Pro	0.5	#165
GPQA	0.3	#214
HLE	0.1	#100
LiveCodeBench	0.1	#170
AIME	0.1	#93
IFBench	0.3	#145
Time to First Token	0.46s	#120
SciCode	0.1	#185
MATH-500	0.5	#116
AIME 2025	0.0	#130
LCR	0.2	#119
TerminalBench Hard	0.0	#138
TAU2	0.2	#133

Price Comparison by Provider

Compare prices for Llama3.1 8B Instruct across different providers. The same model may be available through multiple providers at different price points.

Provider	Model Key	Input Price, $ per 1M tokens	Output Price, $ per 1M tokens
Weights & Biases	wandb/meta-llama/Llama-3.1-8B-Instruct	0.022	0.022
Vertex AI (Llama)	vertex_ai/meta/llama-3.1-8b-instruct-maas	N/A	N/A
Vercel AI Gateway	vercel_ai_gateway/meta/llama-3.1-8b	0.050	0.080
Together AI	together_ai/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo	0.180	0.180
Snowflake	snowflake/llama3.1-8b	N/A	N/A
SambaNova	sambanova/Meta-Llama-3.1-8B-Instruct	0.100	0.200
Perplexity	perplexity/llama-3.1-8b-instruct	0.200	0.200
OVHcloud	ovhcloud/Llama-3.1-8B-Instruct	0.100	0.100
Ollama	ollama/llama3.1	N/A	N/A
Nscale	nscale/meta-llama/Llama-3.1-8B-Instruct	0.030	0.030
Novita AI	novita/meta-llama/llama-3.1-8b-instruct	0.020	0.050
Nebius	nebius/meta-llama/Meta-Llama-3.1-8B-Instruct	0.020	0.060
AWS Bedrock	meta.llama3-1-8b-instruct-v1:0	0.220	0.220
LlamaGate	llamagate/llama-3.1-8b	0.030	0.050
Lambda	lambda_ai/llama3.1-8b-instruct	0.025	0.040
Hyperbolic	hyperbolic/meta-llama/Meta-Llama-3.1-8B-Instruct	0.120	0.300
Groq	groq/llama-3.1-8b-instant	0.050	0.080
FriendliAI	friendliai/meta-llama-3.1-8b-instruct	0.100	0.100
Fireworks AI	fireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instruct	0.100	0.100
DeepInfra	deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo	0.020	0.030
Databricks	databricks/databricks-meta-llama-3-1-8b-instruct	0.150	0.450
Cerebras	cerebras/llama3.1-8b	0.100	0.100
Azure AI	azure_ai/Meta-Llama-3.1-8B-Instruct	0.300	0.610
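
Because input and output prices differ by provider, the cheapest option depends on the ratio of input to output tokens in a workload. The sketch below ranks a subset of the rows above by total cost for a given token mix and skips providers whose prices are listed as N/A. The `Offer` type and `rank_by_cost` function are hypothetical names introduced for illustration, and the prices are assumed to be dollars per 1M tokens.

```python
from typing import List, NamedTuple, Optional, Tuple


class Offer(NamedTuple):
    provider: str
    input_price: Optional[float]   # $ per 1M input tokens; None when listed as N/A
    output_price: Optional[float]  # $ per 1M output tokens; None when listed as N/A


# A subset of rows copied from the comparison table above.
OFFERS = [
    Offer("Weights & Biases", 0.022, 0.022),
    Offer("Vertex AI (Llama)", None, None),
    Offer("Novita AI", 0.020, 0.050),
    Offer("Nebius", 0.020, 0.060),
    Offer("Lambda", 0.025, 0.040),
    Offer("DeepInfra", 0.020, 0.030),
    Offer("Groq", 0.050, 0.080),
    Offer("Azure AI", 0.300, 0.610),
]


def rank_by_cost(
    offers: List[Offer], input_tokens: int, output_tokens: int
) -> List[Tuple[str, float]]:
    """Return (provider, dollar cost) pairs, cheapest first, ignoring N/A rows."""
    ranked = []
    for offer in offers:
        if offer.input_price is None or offer.output_price is None:
            continue  # no listed price, so no cost can be computed
        cost = (
            input_tokens * offer.input_price + output_tokens * offer.output_price
        ) / 1_000_000
        ranked.append((offer.provider, cost))
    return sorted(ranked, key=lambda pair: pair[1])


# Compare an even token mix with an input-heavy one.
for mix in [(1_000_000, 1_000_000), (10_000_000, 1_000_000)]:
    provider, cost = rank_by_cost(OFFERS, *mix)[0]
    print(mix, provider, round(cost, 4))
```

With these rows, an even mix of input and output tokens favors Weights & Biases, while an input-heavy mix of 10:1 favors DeepInfra, which is why a single "cheapest provider" label can mislead.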

All Variants

All available versions, regions, and API endpoints for Llama3.1 8B Instruct.

Model Key	Provider	Mode	Input Price, $ per 1M tokens	Output Price, $ per 1M tokens	Context	Max Output	Vision	Functions
meta.llama3-1-8b-instruct-v1:0	AWS Bedrock	Text	0.220	0.220	128K	2K	no	yes
us.meta.llama3-1-8b-instruct-v1:0	AWS Bedrock	Text	0.220	0.220	128K	2K	no	yes
azure_ai/Meta-Llama-3.1-8B-Instruct	Azure AI	Text	0.300	0.610	128K	2K	no	no
cerebras/llama3.1-8b	Cerebras	Text	0.100	0.100	128K	128K	no	yes
databricks/databricks-meta-llama-3-1-8b-instruct	Databricks	Text	0.150	0.450	200K	128K	no	no
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct	DeepInfra	Text	0.030	0.050	131K	131K	no	yes
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo	DeepInfra	Text	0.020	0.030	131K	131K	no	yes
fireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instruct	Fireworks AI	Text	0.100	0.100	16K	16K	no	no
friendliai/meta-llama-3.1-8b-instruct	FriendliAI	Text	0.100	0.100	8K	8K	no	yes
groq/llama-3.1-8b-instant	Groq	Text	0.050	0.080	128K	8K	no	yes
hyperbolic/meta-llama/Meta-Llama-3.1-8B-Instruct	Hyperbolic	Text	0.120	0.300	33K	33K	no	yes
lambda_ai/llama3.1-8b-instruct	Lambda	Text	0.025	0.040	131K	131K	no	yes
llamagate/llama-3.1-8b	LlamaGate	Text	0.030	0.050	131K	8K	no	yes
nebius/meta-llama/Meta-Llama-3.1-8B-Instruct	Nebius	Text	0.020	0.060	128K	128K	no	yes
novita/meta-llama/llama-3.1-8b-instruct	Novita AI	Text	0.020	0.050	16K	16K	no	no
nscale/meta-llama/Llama-3.1-8B-Instruct	Nscale	Text	0.030	0.030	N/A	N/A	no	no
ollama/llama3.1	Ollama	Text	N/A	N/A	8K	8K	no	yes
ovhcloud/Llama-3.1-8B-Instruct	OVHcloud	Text	0.100	0.100	131K	131K	no	yes
perplexity/llama-3.1-8b-instruct	Perplexity	Text	0.200	0.200	131K	131K	no	no
sambanova/Meta-Llama-3.1-8B-Instruct	SambaNova	Text	0.100	0.200	16K	16K	no	yes
snowflake/llama3.1-8b	Snowflake	Text	N/A	N/A	128K	8K	no	no
together_ai/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo	Together AI	Text	0.180	0.180	N/A	N/A	no	yes
vercel_ai_gateway/meta/llama-3.1-8b	Vercel AI Gateway	Text	0.050	0.080	131K	131K	no	yes
vertex_ai/meta/llama-3.1-8b-instruct-maas	Vertex AI (Llama)	Text	N/A	N/A	128K	2K	yes	no
wandb/meta-llama/Llama-3.1-8B-Instruct	Weights & Biases	Text	0.022	0.022	128K	128K	no	no
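
Since context window, maximum output, and function-calling support vary widely across variants, it can help to filter the list by requirements before comparing prices. The following is a minimal sketch over a few rows copied from the table above. The helpers `parse_k` and `find_variants` are hypothetical, and treating "131K" as 131,000 tokens is a simplifying assumption (the exact limit may differ by provider).

```python
from typing import List, Optional, Tuple

# (model key, context, max output, functions) copied from the variants table.
VARIANTS: List[Tuple[str, str, str, str]] = [
    ("cerebras/llama3.1-8b", "128K", "128K", "yes"),
    ("fireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instruct", "16K", "16K", "no"),
    ("groq/llama-3.1-8b-instant", "128K", "8K", "yes"),
    ("lambda_ai/llama3.1-8b-instruct", "131K", "131K", "yes"),
    ("nscale/meta-llama/Llama-3.1-8B-Instruct", "N/A", "N/A", "no"),
    ("perplexity/llama-3.1-8b-instruct", "131K", "131K", "no"),
]


def parse_k(value: str) -> Optional[int]:
    """Convert a value such as '131K' to 131000; return None for 'N/A'."""
    if value == "N/A":
        return None
    return int(value.rstrip("K")) * 1000


def find_variants(min_context: int, min_output: int, need_functions: bool) -> List[str]:
    """Return model keys that meet the limits; rows with unknown limits are excluded."""
    matches = []
    for key, context, max_output, functions in VARIANTS:
        ctx, out = parse_k(context), parse_k(max_output)
        if ctx is None or out is None:
            continue  # limits not listed, so the requirement cannot be confirmed
        if ctx < min_context or out < min_output:
            continue
        if need_functions and functions != "yes":
            continue
        matches.append(key)
    return matches


# Variants with at least 100K context, at least 16K output, and function calling.
print(find_variants(min_context=100_000, min_output=16_000, need_functions=True))
```

Excluding rows with N/A limits is a conservative choice: a variant with unlisted limits might still qualify, but that cannot be confirmed from the table alone.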
