Llama 2 7B Chat Int8

Cloudflare Workers AIText

Llama 2 7B Chat Int8 is a text model from Cloudflare Workers AI with a context window of 2K tokens and max output of 2K tokens. Pricing starts at $1.92 per million input tokens and $1.92 per million output tokens (cheapest at Ollama).

Specifications

Model Keycloudflare/@cf/meta/llama-2-7b-chat-int8
ProviderCloudflare Workers AI
LiteLLM Providercloudflare
ModeText
Canonical Namellama-2-7b
Context Window2K tokens
Max Output2K tokens

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens$0.0019$1.92
Output Tokens$0.0019$1.92

Price Comparison by Provider

Compare prices for Llama 2 7B Chat Int8 across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price
Output Price
Replicatereplicate/meta/llama-2-7b$0.050$0.250
Cloudflare Workers AIcloudflare/@cf/meta/llama-2-7b-chat-fp16$1.92$1.92
Anyscaleanyscale/meta-llama/Llama-2-7b-chat-hf$0.150$0.150
Fireworks AIfireworks_ai/accounts/fireworks/models/llama-v2-7b$0.200$0.200
Ollamaollama/llama2:7bN/AN/A
AWS SageMakersagemaker/meta-textgeneration-llama-2-7bN/AN/A

Similar Models

Models with similar capabilities and context window size.

Model
Provider
Mode
Input Price
Output Price
Context
Max Output
Vision
Functions
Alibaba Qwen3 32BGradient AiTextN/AN/A2KN/Anono
Anthropic Claude 3.5 HaikuGradient AiText$0.800$4.001KN/Anono
Llama 2 7B Chat Fp16Cloudflare Workers AIText$1.92$1.923K3Knono
Llama V2 70B ChatFireworks AIText$0.900$0.9002K2Knono
Llama3.3 70B InstructGradient AiText$0.650$0.6502KN/Anono
Luminous Base ControlAleph AlphaText$37.50$41.252KN/Anono
Luminous Extended ControlAleph AlphaText$56.25$61.882KN/Anono
Luminous Supreme ControlAleph AlphaText$218.75$240.632KN/Anono
Phi 2 3BFireworks AIText$0.100$0.1002K2Knono
Pythia 12BFireworks AIText$0.200$0.2002K2Knono