Llama V2 7B Chat

Fireworks AIText

Llama V2 7B Chat is a text model from Fireworks AI with a context window of 4K tokens and max output of 4K tokens. Pricing starts at $0.20 per million input tokens and $0.20 per million output tokens (cheapest at Ollama).

Specifications

Model Keyfireworks_ai/accounts/fireworks/models/llama-v2-7b-chat
ProviderFireworks AI
LiteLLM Providerfireworks_ai
ModeText
Canonical Namellama-2-7b
Context Window4K tokens
Max Output4K tokens

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens$0.000200$0.200
Output Tokens$0.000200$0.200

Price Comparison by Provider

Compare prices for Llama V2 7B Chat across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price
Output Price
Replicatereplicate/meta/llama-2-7b$0.050$0.250
Cloudflare Workers AIcloudflare/@cf/meta/llama-2-7b-chat-fp16$1.92$1.92
Anyscaleanyscale/meta-llama/Llama-2-7b-chat-hf$0.150$0.150
Fireworks AIfireworks_ai/accounts/fireworks/models/llama-v2-7b$0.200$0.200
Ollamaollama/llama2:7bN/AN/A
AWS SageMakersagemaker/meta-textgeneration-llama-2-7bN/AN/A

Similar Models

Models with similar capabilities and context window size.

Model
Provider
Mode
Input Price
Output Price
Context
Max Output
Vision
Functions
Flux 1 Dev Controlnet UnionFireworks AIText$0.0010$0.00104K4Knono
Llama2OllamaTextN/AN/A4K4Knono
Llama2:13BOllamaTextN/AN/A4K4Knono
Llama2:70BOllamaTextN/AN/A4K4Knono
Llama2:7BOllamaTextN/AN/A4K4Knono
Meta Textgeneration Llama 2 13B FAWS SageMakerTextN/AN/A4K4Knono
Meta Textgeneration Llama 2 70B B FAWS SageMakerTextN/AN/A4K4Knono
Meta Textgeneration Llama 2 7B FAWS SageMakerTextN/AN/A4K4Knono
Pplx 70B OnlinePerplexityTextN/A$2.804K4Knono
Pplx 7B OnlinePerplexityTextN/A$0.2804K4Knono