Llama 3.1 8B Instruct is Meta's language model with a 200K context window and up to 128K output tokens, available from 21 providers, starting at $0.02 / 1M input and $0.03 / 1M output. Meta's 8B instruction-tuned LLM optimized for fast, cost-effective deployment across multiple cloud regions with strong instruction-following performance.
Specifications
Canonical IDmeta-llama-3-1-8b-instruct
TypeLanguage
StatusActive
CreatorMetaMeta
Providers
Context Window200K tokens
Max Output128K tokens
Input ModalitiesImageText
Output ModalitiesText
Parameters8B
HuggingFace Likes5,731
HuggingFace Downloads (30d)9,306,502
HuggingFace Downloads (all-time)140,394,735
Release Date · 2 years ago
Knowledge Cutoff · 2 years ago
Benchmarks
Intelligence Index
6.1
#375
Math Index
4.3
#241
MMLU-Pro
0.5
#280
GPQA
0.3
#444
HLE
0.1
#282
LiveCodeBench
0.1
#287
AIME
0.1
#130
IFBench
0.3
#347
Time to First Token
0.49s
#254
SciCode
0.1
#386
MATH-500
0.5
#153
AIME 2025
0.0
#241
LCR
0.2
#275
TerminalBench Hard
0.0
#337
TAU2
0.2
#331
Output TPS

Capabilities

Input2/5
Text
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities4/13
Reasoning·
Adaptive Reasoning·
Function Calling
Parallel Function Calling
Structured Outputs
Native JSON Schema
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandardBatch
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
Amazon Bedrock logo
Amazon Bedrock
meta.llama3-1-8b-instruct-v1:0
$0.22$0.22$0.11$0.11
Azure AI Foundry logo
Azure AI Foundry
azure_ai/Meta-Llama-3.1-8B-Instruct
$0.3$0.61
Cloudflare Workers AI logo
Cloudflare Workers AI
@cf/meta/llama-3.1-8b-instruct
$0.282$0.827
Databricks logo
Databricks
databricks/databricks-meta-llama-3-1-8b-instruct
$0.15$0.45
DeepInfra logo
DeepInfra
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct
$0.03$0.05
Fireworks AI logo
Fireworks AI
fireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instruct
$0.1$0.1
FriendliAI logo
FriendliAI
friendliai/meta-llama-3.1-8b-instruct
$0.1$0.1
Groq logo
Groq
groq/llama-3.1-8b-instant
$0.05$0.08
Hugging Face logo
Hugging Face
novita:meta-llama/llama-3.1-8b-instruct
$0.02$0.05
Hugging Face logo
Hugging Face
nscale:meta-llama/Llama-3.1-8B-Instruct
$0.06$0.06
Hyperbolic logo
Hyperbolic
hyperbolic/meta-llama/Meta-Llama-3.1-8B-Instruct
$0.12$0.3
Lambda logo
Lambda
lambda_ai/llama3.1-8b-instruct
$0.025$0.04
Nebius logo
Nebius
nebius/meta-llama/Meta-Llama-3.1-8B-Instruct
$0.02$0.06
Novita logo
Novita
novita/meta-llama/llama-3.1-8b-instruct
$0.02$0.05
Nscale logo
Nscale
nscale/meta-llama/Llama-3.1-8B-Instruct
$0.03$0.03
OpenRouter logo
OpenRouter
meta-llama/llama-3.1-8b-instruct
$0.02$0.03
Oracle Cloud (OCI) logo
Oracle Cloud (OCI)
oci/meta.llama-3.1-8b-instruct
$0.72$0.72
OVHcloud logo
OVHcloud
ovhcloud/Llama-3.1-8B-Instruct
$0.1$0.1
Perplexity logo
Perplexity
perplexity/llama-3.1-8b-instruct
$0.2$0.2
SambaNova logo
SambaNova
sambanova/Meta-Llama-3.1-8B-Instruct
$0.1$0.2
Together AI logo
Together AI
together_ai/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
$0.18$0.18
Weights & Biases logo
Weights & Biases
wandb/meta-llama/Llama-3.1-8B-Instruct
$22.00$22.00

Cost Calculator

US Dollar ($)
Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Llama 3.3 70B Instruct131K$0.100$0.200Available
Llama 3.2 3B Instruct131K$0.015$0.020Deprecated
Llama 3.2 1B Instruct131K$0.027$0.080Deprecated
Llama 3.2 11B128K$0.160$0.160Available
Llama 3.1 8B Instruct200K$0.020$0.030Current
Llama 3.1 405B Instruct131K$0.120$0.300Deprecating
Llama 3.1 70B Instruct131K$0.120$0.300Available
Llama 3.1 70B128K$0.360$0.360Available
Llama 3.1 8B131K$0.030$0.050Available
Llama 3 70B Instruct131K$0.120$0.300Deprecated
Llama 3 8B Instruct32K$0.030$0.040Available

Model IDs

@cf/meta/llama-3.1-8b-instruct
accounts/fireworks/models/full-llama-v3p1-8b-instruct-8b-fp8
accounts/fireworks/models/full-llama-v3p1-8b-instruct-8b-fp8-amd
accounts/fireworks/models/llama-v3p1-8b-instruct
azure_ai/Meta-Llama-3.1-8B-Instruct
databricks/databricks-meta-llama-3-1-8b-instruct
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
fireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instruct
friendliai/meta-llama-3.1-8b-instruct
groq/llama-3.1-8b-instant
hyperbolic/meta-llama/Meta-Llama-3.1-8B-Instruct
lambda_ai/llama3.1-8b-instruct
llama-3-1-instruct-8b
meta-llama-3-1-8b-instruct
meta-llama/llama-3.1-8b-instruct
meta-llama/Meta-Llama-3.1-8B-Instruct
meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
meta-textgeneration-llama-3-1-8b-instruct
meta-textgenerationneuron-llama-3-1-8b-instruct
meta.llama3-1-8b-instruct-v1:0
meta.llama3-1-8b-instruct-v1:0:128k
nebius/meta-llama/Meta-Llama-3.1-8B-Instruct
novita/meta-llama/llama-3.1-8b-instruct
nscale/meta-llama/Llama-3.1-8B-Instruct
oci/meta.llama-3.1-8b-instruct
ovhcloud/Llama-3.1-8B-Instruct
perplexity/llama-3.1-8b-instruct
sambanova/Meta-Llama-3.1-8B-Instruct
together_ai/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
us.meta.llama3-1-8b-instruct-v1:0
vertex_ai/meta/llama-3.1-8b-instruct-maas
wandb/meta-llama/Llama-3.1-8B-Instruct