Llama 2 70B Chat is Meta's language model with a 4K context window and up to 4K output tokens, available from 7 providers, starting at $0.500 / 1M input and $0.900 / 1M output. A 70B Llama 2 model fine-tuned with RLHF for dialogue, providing high-quality conversational responses at the largest Llama 2 scale.
Specifications
Canonical IDmeta-llama-2-70b-chat
TypeLanguage
StatusActive
CreatorMetaMeta
Providers
Context Window4K tokens
Max Output4K tokens
Input ModalitiesText
Output ModalitiesText
Parameters70B
Benchmarks
Intelligence Index
8.4
#428
MMLU-Pro
0.4
#297
GPQA
0.3
#398
HLE
0.1
#282
LiveCodeBench
0.1
#297
AIME
0.0
#167
Time to First Token
0.00s
#140
MATH-500
0.3
#171
Output TPS
0.0
#409

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Llama 3.3 70B Instruct131K$0.100$0.200Available
Llama 3.2 3B Instruct131K$0.015$0.020Deprecated
Llama 3.2 1B Instruct131K$0.027$0.080Deprecated
Llama 3.1 405B Instruct131K$0.120$0.300Deprecating
Llama 3.1 70B Instruct131K$0.100$0.100Available
Llama 3.1 8B Instruct200K$0.020$0.030Available
Llama 3.1 70B128K$0.600$0.600Available
Llama 3.1 8B131K$0.030$0.050Available
Llama 3 70B Instruct131K$0.120$0.300Available
Llama 3 8B Instruct32K$0.030$0.040Available
Llama 2 70B Chat4K$0.500$0.900Current

Model IDs