Meta Llama 3.1 8B Instruct
Meta Llama 3.1 8B Instruct is a text model served by Nebius with a context window of 128K tokens and a maximum output of 128K tokens. Pricing is $0.02 per million input tokens and $0.06 per million output tokens; the same model is available from other providers at different price points (see the comparison below).
Capabilities
- ✗ Vision
- ✓ Function Calling
- ✗ Reasoning
- ✗ JSON Schema
- ✗ System Messages
- ✗ Web Search
- ✗ Prompt Caching
- ✗ Audio Input
- ✗ Audio Output
Specifications
| Attribute | Value |
|---|---|
| Model Key | nebius/meta-llama/Meta-Llama-3.1-8B-Instruct |
| Provider | Nebius |
| Provider ID | nebius |
| Mode | Text |
| Canonical Name | llama-3.1-8b |
| Context Window | 128K tokens |
| Max Output | 128K tokens |
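The model key above is the identifier passed as the `model` field of a chat request. As a minimal sketch, here is how such a request body could be assembled in the widely used OpenAI-compatible chat-completions format; the payload shape, field names, and any endpoint or auth details are assumptions not specified on this page.

```python
import json

# Model key taken from the Specifications table above.
MODEL_KEY = "nebius/meta-llama/Meta-Llama-3.1-8B-Instruct"

def build_chat_request(prompt: str, max_tokens: int = 256) -> dict:
    """Return an OpenAI-style chat-completions request body for this model key."""
    return {
        "model": MODEL_KEY,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
    }

payload = build_chat_request("Summarize Llama 3.1 in one sentence.")
print(json.dumps(payload, indent=2))
```

The payload would then be POSTed to whatever OpenAI-compatible endpoint serves the model; note that system messages are marked unsupported above, so only `user`/`assistant` roles are used here.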
Pricing
| Type | Per 1K Tokens | Per 1M Tokens |
|---|---|---|
| Input Tokens | $0.000020 | $0.020 |
| Output Tokens | $0.000060 | $0.060 |
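Per-request cost follows directly from the per-1M-token rates above ($0.02 input, $0.06 output for the Nebius-hosted model). A small sketch, with illustrative token counts:

```python
# Rates from the pricing table above, in USD per 1M tokens.
INPUT_PER_M = 0.02
OUTPUT_PER_M = 0.06

def request_cost(input_tokens: int, output_tokens: int) -> float:
    """USD cost of one request at the listed rates."""
    return (input_tokens * INPUT_PER_M + output_tokens * OUTPUT_PER_M) / 1_000_000

# e.g. a 2,000-token prompt with a 500-token completion:
cost = request_cost(2_000, 500)
print(f"${cost:.6f}")  # → $0.000070
```

At these rates, processing a full 1M input tokens and generating 1M output tokens costs $0.02 + $0.06 = $0.08.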
Price Comparison by Provider
Compare prices for Meta Llama 3.1 8B Instruct across different providers. The same model may be available through multiple providers at different price points.
All Variants
All available versions, regions, and API endpoints for Meta Llama 3.1 8B Instruct.
| Model Key | Provider | Mode | Input Price, $/1M | Output Price, $/1M | Context | Max Output | Vision | Functions |
|---|---|---|---|---|---|---|---|---|
| meta.llama3-1-8b-instruct-v1:0 | Amazon Bedrock | Text | 0.220 | 0.220 | 128K | 2K | no | yes |
| us.meta.llama3-1-8b-instruct-v1:0 | Amazon Bedrock | Text | 0.220 | 0.220 | 128K | 2K | no | yes |
| azure_ai/Meta-Llama-3.1-8B-Instruct | Azure AI | Text | 0.300 | 0.610 | 128K | 2K | no | no |
| cerebras/llama3.1-8b | Cerebras | Text | 0.100 | 0.100 | 128K | 128K | no | yes |
| databricks/databricks-meta-llama-3-1-8b-instruct | Databricks | Text | 0.150 | 0.450 | 200K | 128K | no | no |
| deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct | DeepInfra | Text | 0.030 | 0.050 | 131K | 131K | no | yes |
| deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo | DeepInfra | Text | 0.020 | 0.030 | 131K | 131K | no | yes |
| fireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instruct | Fireworks AI | Text | 0.100 | 0.100 | 16K | 16K | no | no |
| friendliai/meta-llama-3.1-8b-instruct | FriendliAI | Text | 0.100 | 0.100 | 8K | 8K | no | yes |
| groq/llama-3.1-8b-instant | Groq | Text | 0.050 | 0.080 | 128K | 8K | no | yes |
| hyperbolic/meta-llama/Meta-Llama-3.1-8B-Instruct | Hyperbolic | Text | 0.120 | 0.300 | 33K | 33K | no | yes |
| lambda_ai/llama3.1-8b-instruct | Lambda | Text | 0.025 | 0.040 | 131K | 131K | no | yes |
| llamagate/llama-3.1-8b | LlamaGate | Text | 0.030 | 0.050 | 131K | 8K | no | yes |
| nebius/meta-llama/Meta-Llama-3.1-8B-Instruct | Nebius | Text | 0.020 | 0.060 | 128K | 128K | no | yes |
| novita/meta-llama/llama-3.1-8b-instruct | Novita | Text | 0.020 | 0.050 | 16K | 16K | no | no |
| nscale/meta-llama/Llama-3.1-8B-Instruct | Nscale | Text | 0.030 | 0.030 | N/A | N/A | no | no |
| ollama/llama3.1 | Ollama | Text | N/A | N/A | 8K | 8K | no | yes |
| ovhcloud/Llama-3.1-8B-Instruct | OVHcloud | Text | 0.100 | 0.100 | 131K | 131K | no | yes |
| perplexity/llama-3.1-8b-instruct | Perplexity | Text | 0.200 | 0.200 | 131K | 131K | no | no |
| sambanova/Meta-Llama-3.1-8B-Instruct | SambaNova | Text | 0.100 | 0.200 | 16K | 16K | no | yes |
| snowflake/llama3.1-8b | Snowflake | Text | N/A | N/A | 128K | 8K | no | no |
| together_ai/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo | Together AI | Text | 0.180 | 0.180 | N/A | N/A | no | yes |
| vercel_ai_gateway/meta/llama-3.1-8b | Vercel AI Gateway | Text | 0.050 | 0.080 | 131K | 131K | no | yes |
| vertex_ai/meta/llama-3.1-8b-instruct-maas | Vertex AI | Text | N/A | N/A | 128K | 2K | yes | no |
| wandb/meta-llama/Llama-3.1-8B-Instruct | W&B | Text | 0.022 | 0.022 | 128K | 128K | no | no |
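When choosing among variants, the cheapest input price and the cheapest output price can come from different providers. A small sketch of that comparison, using a hand-copied subset of the rows above (USD per 1M tokens; rows with N/A pricing omitted):

```python
# (model_key, input_price, output_price) tuples copied from the variants table.
variants = [
    ("deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo", 0.020, 0.030),
    ("nebius/meta-llama/Meta-Llama-3.1-8B-Instruct", 0.020, 0.060),
    ("novita/meta-llama/llama-3.1-8b-instruct", 0.020, 0.050),
    ("wandb/meta-llama/Llama-3.1-8B-Instruct", 0.022, 0.022),
    ("lambda_ai/llama3.1-8b-instruct", 0.025, 0.040),
    ("groq/llama-3.1-8b-instant", 0.050, 0.080),
]

cheapest_input = min(variants, key=lambda v: v[1])
cheapest_output = min(variants, key=lambda v: v[2])

print("cheapest input :", cheapest_input[0], cheapest_input[1])
print("cheapest output:", cheapest_output[0], cheapest_output[2])
```

Note that price alone is not the whole picture: context window, max output, and function-calling support also differ per variant, so a real selection would weigh those columns too.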