Llama 3.1 8B Instruct is Meta's language model with a 200K context window and up to 128K output tokens, available from 20 providers, starting at $0.020 / 1M input and $0.030 / 1M output. Meta's 8B instruction-tuned LLM optimized for fast, cost-effective deployment across multiple cloud regions with strong instruction-following performance.
Specifications
Canonical IDmeta-llama-3-1-8b-instruct
TypeLanguage
StatusActive
CreatorMetaMeta
Providers
Context Window200K tokens
Max Output128K tokens
Input ModalitiesImageText
Output ModalitiesText
Parameters8B
HuggingFace Likes5,731
HuggingFace Downloads (30d)9,306,502
HuggingFace Downloads (all-time)140,394,735
Release Date · 2 years ago
Knowledge Cutoff
Benchmarks
Intelligence Index
11.8
#368
Coding Index
4.9
#347
Math Index
4.3
#241
MMLU-Pro
0.5
#280
GPQA
0.3
#435
HLE
0.1
#273
LiveCodeBench
0.1
#287
AIME
0.1
#130
IFBench
0.3
#338
Time to First Token
0.49s
#244
SciCode
0.1
#379
MATH-500
0.5
#153
AIME 2025
0.0
#241
LCR
0.2
#268
TerminalBench Hard
0.0
#329
TAU2
0.2
#323
Output TPS

Capabilities

Input2/5
Text
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities4/13
Reasoning·
Adaptive Reasoning·
Function Calling
Parallel Function Calling
Structured Outputs
Native JSON Schema
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Llama 3.3 70B Instruct131K$0.100$0.200Available
Llama 3.2 3B Instruct131K$0.015$0.020Deprecated
Llama 3.2 1B Instruct131K$0.027$0.080Deprecated
Llama 3.1 8B Instruct200K$0.020$0.030Current
Llama 3.1 405B Instruct131K$0.120$0.300Deprecating
Llama 3.1 70B Instruct131K$0.100$0.100Available
Llama 3.1 70B128K$0.360$0.360Available
Llama 3.1 8B131K$0.030$0.050Available
Llama 3 70B Instruct131K$0.120$0.300Available
Llama 3 8B Instruct32K$0.030$0.040Available
Llama 3.1 Tulu3 405BAvailable

Model IDs