Meta logo

Llama 3.1 8B Instruct


Llama 3.1 8B Instruct is Meta's language model with a 200K context window and up to 128K output tokens, available from 19 providers, starting at $0.020 / 1M input and $0.030 / 1M output. Meta's 8B instruction-tuned LLM optimized for fast, cost-effective deployment across multiple cloud regions with strong instruction-following performance.
Specifications
Canonical IDmeta-llama-3-1-8b-instruct
TypeLanguage
StatusActive
CreatorMetaMeta
Providers
Context Window200K tokens
Max Output128K tokens
Input ModalitiesImageText
Output ModalitiesText
Parameters8B
HuggingFace Likes5,731
HuggingFace Downloads (30d)9,306,502
HuggingFace Downloads (all-time)140,394,735
Release Date · 2 years ago
Knowledge Cutoff
Benchmarks
Intelligence Index
11.8
#359
Coding Index
4.9
#339
Math Index
4.3
#241
MMLU-Pro
0.5
#280
GPQA
0.3
#426
HLE
0.1
#265
LiveCodeBench
0.1
#287
AIME
0.1
#130
IFBench
0.3
#328
Time to First Token
0.48s
#232
SciCode
0.1
#371
MATH-500
0.5
#153
AIME 2025
0.0
#241
LCR
0.2
#260
TerminalBench Hard
0.0
#321
TAU2
0.2
#314
Output TPS

Capabilities

Input2/5
Text
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities4/13
Reasoning·
Adaptive Reasoning·
Function Calling
Parallel Function Calling
Structured Outputs
Native JSON Schema
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandardBatch
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
DeepInfra logo
DeepInfra
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
$0.020$0.030
Nebius logo
Nebius
nebius/meta-llama/Meta-Llama-3.1-8B-Instruct
$0.020$0.060
Novita logo
Novita
novita/meta-llama/llama-3.1-8b-instruct
$0.020$0.050
OpenRouter logo
OpenRouter
meta-llama/llama-3.1-8b-instruct
$0.020$0.050
Lambda logo
Lambda
lambda_ai/llama3.1-8b-instruct
$0.025$0.040
Nscale logo
Nscale
nscale/meta-llama/Llama-3.1-8B-Instruct
$0.030$0.030
Groq logo
Groq
groq/llama-3.1-8b-instant
$0.050$0.080
Fireworks AI logo
Fireworks AI
fireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instruct
$0.100$0.100
FriendliAI logo
FriendliAI
friendliai/meta-llama-3.1-8b-instruct
$0.100$0.100
Hugging Face logo
Hugging Face
sambanova:Meta-Llama-3.1-8B-Instruct
$0.100$0.200
OVHcloud logo
OVHcloud
ovhcloud/Llama-3.1-8B-Instruct
$0.100$0.100
SambaNova logo
SambaNova
sambanova/Meta-Llama-3.1-8B-Instruct
$0.100$0.200
Hyperbolic logo
Hyperbolic
hyperbolic/meta-llama/Meta-Llama-3.1-8B-Instruct
$0.120$0.300
Databricks logo
Databricks
databricks/databricks-meta-llama-3-1-8b-instruct
$0.150$0.450
Together AI logo
Together AI
together_ai/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
$0.180$0.180
Perplexity logo
Perplexity
perplexity/llama-3.1-8b-instruct
$0.200$0.200
Amazon Bedrock logo
Amazon Bedrock
meta.llama3-1-8b-instruct-v1:0
$0.220$0.220$0.110$0.110
Azure AI Foundry logo
Azure AI Foundry
azure_ai/Meta-Llama-3.1-8B-Instruct
$0.300$0.610
Other/Wandb
wandb/meta-llama/Llama-3.1-8B-Instruct
$22.00$22.00
Provider-specific pricing that varies by region.
Amazon Bedrock logo
Amazon Bedrock
5 regions
RegionStandardBatch
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
Global
Global$0.220$0.220$0.110$0.110
Cross-region
US (cross-region)/ us$0.220$0.220$0.110$0.110
US
US East (Ohio)/ us-east-2$0.220$0.220$0.110$0.110
US East (Virginia)/ us-east-1$0.220$0.220$0.110$0.110
US West (Oregon)/ us-west-2$0.220$0.220$0.110$0.110

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Llama 3.3 70B Instruct131K$0.100$0.200Available
Llama 3.2 3B Instruct131K$0.015$0.020Deprecated
Llama 3.2 1B Instruct128K$0.027$0.080Deprecated
Llama 3.1 8B Instruct200K$0.020$0.030Current
Llama 3.1 405B Instruct131K$0.120$0.300Deprecating
Llama 3.1 70B Instruct131K$0.100$0.100Available
Llama 3.1 70B128K$0.600$0.600Available
Llama 3.1 8B131K$0.030$0.050Available
Llama 3 70B Instruct131K$0.120$0.300Available
Llama 3 8B Instruct32K$0.030$0.040Available
Llama 3.1 Tulu3 405BAvailable

Model IDs