Llama 2 7B Chat is Meta's language model with a 4K context window and up to 4K output tokens, available from 4 providers, starting at $0.050 / 1M input and $0.150 / 1M output. A 7B Llama 2 model fine-tuned with RLHF for dialogue use cases, offering an efficient and accessible conversational LLM.
Specifications
Canonical IDmeta-llama-2-7b-chat
TypeLanguage
StatusActive
CreatorMetaMeta
Providers
Context Window4K tokens
Max Output4K tokens
Input ModalitiesText
Output ModalitiesText
Parameters7B
Benchmarks
Intelligence Index
9.7
#400
MMLU-Pro
0.2
#320
GPQA
0.2
#447
HLE
0.1
#231
LiveCodeBench
0.0
#323
AIME
0.0
#168
Time to First Token
0.95s
#324
SciCode
0.0
#447
MATH-500
0.1
#183
Output TPS
119.3
#105

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Llama 3.3 70B Instruct131K$0.100$0.200Available
Llama 3.2 3B Instruct131K$0.015$0.020Deprecated
Llama 3.2 1B Instruct131K$0.027$0.080Deprecated
Llama 3.1 405B Instruct131K$0.120$0.300Deprecating
Llama 3.1 70B Instruct131K$0.100$0.100Available
Llama 3.1 8B Instruct200K$0.020$0.030Available
Llama 3.1 70B128K$0.360$0.360Available
Llama 3.1 8B131K$0.030$0.050Available
Llama 3 70B Instruct131K$0.120$0.300Available
Llama 3 8B Instruct32K$0.030$0.040Available
Llama 2 7B Chat4K$0.050$0.150Current

Model IDs