Llama 2 13B

Llama 2 13B is a text model from Replicate with a context window of 4K tokens and max output of 4K tokens. Pricing starts at 0.10 per million input tokens and 0.50 per million output tokens (cheapest at Ollama).

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Specifications

Model Keyreplicate/meta/llama-2-13b
ProviderReplicate
Provider IDreplicate
ModeText
Canonical Namellama-2-13b
Context Window4K tokens
Max Output4K tokens

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens0.0001000.100
Output Tokens0.0005000.500

Benchmarks

Intelligence Index8.4#211
MMLU-Pro0.4#180
GPQA0.3#196
HLE0.0#121
LiveCodeBench0.1#179
AIME0.0#117
Time to First Token0.00s#1
SciCode0.1#188
MATH-5000.3#131

Price Comparison by Provider

Compare prices for Llama 2 13B across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price, $
Output Price, $
AWS SageMakersagemaker/meta-textgeneration-llama-2-13bN/AN/A
Replicatereplicate/meta/llama-2-13b0.1000.500
Ollamaollama/llama2:13bN/AN/A
AWS Bedrockmeta.llama2-13b-chat-v10.7501.00
Fireworks AIfireworks_ai/accounts/fireworks/models/llama-v2-13b0.2000.200
Anyscaleanyscale/meta-llama/Llama-2-13b-chat-hf0.2500.250

All Variants

All available versions, regions, and API endpoints for Llama 2 13B.

Model Key
Provider
Mode
Input Price, $
Output Price, $
Context
Max Output
Vision
Functions
anyscale/meta-llama/Llama-2-13b-chat-hfAnyscaleText0.2500.2504K4Knono
meta.llama2-13b-chat-v1AWS BedrockText0.7501.004K4Knono
sagemaker/meta-textgeneration-llama-2-13bAWS SageMakerTextN/AN/A4K4Knono
sagemaker/meta-textgeneration-llama-2-13b-fAWS SageMakerTextN/AN/A4K4Knono
fireworks_ai/accounts/fireworks/models/llama-v2-13bFireworks AIText0.2000.2004K4Knono
fireworks_ai/accounts/fireworks/models/llama-v2-13b-chatFireworks AIText0.2000.2004K4Knono
ollama/llama2:13bOllamaTextN/AN/A4K4Knono
replicate/meta/llama-2-13bReplicateText0.1000.5004K4Knono
replicate/meta/llama-2-13b-chatReplicateText0.1000.5004K4Knono