Llama 2 7B Chat

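Llama 2 7B Chat is a text model available through Replicate, with a context window of 4K tokens and a maximum output of 4K tokens. Pricing on Replicate is $0.05 per million input tokens and $0.25 per million output tokens, the lowest input price among the providers that list one. Ollama and AWS SageMaker list no per-token price (N/A).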

Capabilities

Vision, Function Calling, Reasoning, JSON Schema, System Messages, Web Search, Prompt Caching, Audio Input, Audio Output

Specifications

Model Key: replicate/meta/llama-2-7b-chat
Provider: Replicate
Provider ID: replicate
Mode: Text
Canonical Name: llama-2-7b
Context Window: 4K tokens
Max Output: 4K tokens

Pricing

Type | Per 1K Tokens, $ | Per 1M Tokens, $
Input Tokens | 0.000050 | 0.050
Output Tokens | 0.000250 | 0.250
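
As a worked example of the arithmetic behind these rates, the sketch below estimates the cost of one request at the Replicate prices listed above. The helper name `estimate_cost` and the token counts are hypothetical, chosen only for illustration, and the prices are assumed to be US dollars per 1M tokens.

```python
from decimal import Decimal

# Prices from the table above, assumed to be USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.050")
OUTPUT_PRICE_PER_M = Decimal("0.250")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the dollar cost of one request (hypothetical helper)."""
    input_cost = Decimal(input_tokens) / TOKENS_PER_M * INPUT_PRICE_PER_M
    output_cost = Decimal(output_tokens) / TOKENS_PER_M * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


if __name__ == "__main__":
    # Example: a 3,000-token prompt with a 1,000-token reply.
    print(estimate_cost(3000, 1000))
```

Decimal arithmetic is used here so that very small per-request amounts are not distorted by binary floating-point rounding. For this example the input side contributes $0.00015 and the output side $0.00025, for a total of $0.0004.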

Benchmarks

Benchmark | Score | Rank
Intelligence Index | 9.7 | #195
MMLU-Pro | 0.2 | #196
GPQA | 0.2 | #219
HLE | 0.1 | #83
LiveCodeBench | 0.0 | #195
AIME | 0.0 | #127
Time to First Token | 0.57s | #138
SciCode | 0.0 | #217
MATH-500 | 0.1 | #144

Price Comparison by Provider

Compare prices for Llama 2 7B Chat across different providers. The same model may be available through multiple providers at different price points.

Provider | Model Key | Input Price, $ | Output Price, $
AWS SageMaker | sagemaker/meta-textgeneration-llama-2-7b | N/A | N/A
Replicate | replicate/meta/llama-2-7b | 0.050 | 0.250
Ollama | ollama/llama2:7b | N/A | N/A
Fireworks AI | fireworks_ai/accounts/fireworks/models/llama-v2-7b | 0.200 | 0.200
Cloudflare Workers AI | cloudflare/@cf/meta/llama-2-7b-chat-fp16 | 1.92 | 1.92
Anyscale | anyscale/meta-llama/Llama-2-7b-chat-hf | 0.150 | 0.150
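
Because input and output rates differ between providers, which listing is cheapest depends on the mix of input and output tokens. The following is a minimal sketch of that comparison, assuming the prices above are US dollars per 1M tokens and skipping providers whose price is N/A. The `PRICES` dictionary and the `request_cost` and `cheapest` helpers are hypothetical names introduced for illustration.

```python
from typing import Dict, Optional, Tuple

# (input, output) prices copied from the table above, assumed USD per 1M tokens.
# None marks an N/A entry.
PRICES: Dict[str, Optional[Tuple[float, float]]] = {
    "sagemaker/meta-textgeneration-llama-2-7b": None,
    "replicate/meta/llama-2-7b": (0.050, 0.250),
    "ollama/llama2:7b": None,
    "fireworks_ai/accounts/fireworks/models/llama-v2-7b": (0.200, 0.200),
    "cloudflare/@cf/meta/llama-2-7b-chat-fp16": (1.92, 1.92),
    "anyscale/meta-llama/Llama-2-7b-chat-hf": (0.150, 0.150),
}


def request_cost(price: Tuple[float, float], input_tokens: int, output_tokens: int) -> float:
    """Dollar cost of a workload at one (input, output) price pair."""
    input_price, output_price = price
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def cheapest(input_tokens: int, output_tokens: int) -> Tuple[str, float]:
    """Return (model key, cost) for the lowest-cost priced listing."""
    costs = {
        key: request_cost(price, input_tokens, output_tokens)
        for key, price in PRICES.items()
        if price is not None  # Skip listings with no published price.
    }
    best = min(costs, key=costs.get)
    return best, costs[best]


if __name__ == "__main__":
    print(cheapest(1_000_000, 100_000))  # Input-heavy workload.
    print(cheapest(100_000, 1_000_000))  # Output-heavy workload.
```

With these figures, the Replicate listing is the cheapest for the input-heavy workload and the Anyscale listing for the output-heavy one. The two cost the same when input and output token counts are equal, since 0.05 + 0.25 = 0.15 + 0.15.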

All Variants

All available versions, regions, and API endpoints for Llama 2 7B Chat.

Model Key | Provider | Mode | Input Price, $ | Output Price, $ | Context | Max Output | Vision | Functions
anyscale/meta-llama/Llama-2-7b-chat-hf | Anyscale | Text | 0.150 | 0.150 | 4K | 4K | no | no
sagemaker/meta-textgeneration-llama-2-7b | AWS SageMaker | Text | N/A | N/A | 4K | 4K | no | no
sagemaker/meta-textgeneration-llama-2-7b-f | AWS SageMaker | Text | N/A | N/A | 4K | 4K | no | no
cloudflare/@cf/meta/llama-2-7b-chat-fp16 | Cloudflare Workers AI | Text | 1.92 | 1.92 | 3K | 3K | no | no
cloudflare/@cf/meta/llama-2-7b-chat-int8 | Cloudflare Workers AI | Text | 1.92 | 1.92 | 2K | 2K | no | no
fireworks_ai/accounts/fireworks/models/llama-v2-7b | Fireworks AI | Text | 0.200 | 0.200 | 4K | 4K | no | no
fireworks_ai/accounts/fireworks/models/llama-v2-7b-chat | Fireworks AI | Text | 0.200 | 0.200 | 4K | 4K | no | no
ollama/llama2:7b | Ollama | Text | N/A | N/A | 4K | 4K | no | no
replicate/meta/llama-2-7b | Replicate | Text | 0.050 | 0.250 | 4K | 4K | no | no
replicate/meta/llama-2-7b-chat | Replicate | Text | 0.050 | 0.250 | 4K | 4K | no | no
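
The context window varies by variant (2K, 3K, or 4K), so a request that fits one listing may not fit another. The sketch below filters the variants above by a token budget. It rests on two assumptions that the table does not confirm: that "K" means 1,024 tokens, and that prompt and output tokens share the same context window. The names `VARIANTS`, `parse_k`, and `variants_that_fit` are hypothetical.

```python
from typing import List, Tuple

# (model key, context window label) pairs copied from the table above.
VARIANTS: List[Tuple[str, str]] = [
    ("anyscale/meta-llama/Llama-2-7b-chat-hf", "4K"),
    ("sagemaker/meta-textgeneration-llama-2-7b", "4K"),
    ("sagemaker/meta-textgeneration-llama-2-7b-f", "4K"),
    ("cloudflare/@cf/meta/llama-2-7b-chat-fp16", "3K"),
    ("cloudflare/@cf/meta/llama-2-7b-chat-int8", "2K"),
    ("fireworks_ai/accounts/fireworks/models/llama-v2-7b", "4K"),
    ("fireworks_ai/accounts/fireworks/models/llama-v2-7b-chat", "4K"),
    ("ollama/llama2:7b", "4K"),
    ("replicate/meta/llama-2-7b", "4K"),
    ("replicate/meta/llama-2-7b-chat", "4K"),
]

# Assumption: the table does not say whether "K" means 1,000 or 1,024 tokens.
TOKENS_PER_K = 1024


def parse_k(label: str) -> int:
    """Convert a label such as "4K" into a token count."""
    return int(label.rstrip("K")) * TOKENS_PER_K


def variants_that_fit(prompt_tokens: int, output_tokens: int) -> List[str]:
    """Return model keys whose context window covers prompt plus output.

    Assumption: prompt and output tokens count against one shared window.
    """
    needed = prompt_tokens + output_tokens
    return [key for key, label in VARIANTS if parse_k(label) >= needed]


if __name__ == "__main__":
    for key in variants_that_fit(3000, 500):
        print(key)
```

Under these assumptions, a 3,000-token prompt with a 500-token reply rules out both Cloudflare Workers AI variants and leaves the eight 4K listings.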