Qwen2.5 7B Instruct

DeepinfraText

Qwen2.5 7B Instruct is a text model from Deepinfra with a context window of 33K tokens and max output of 33K tokens. Pricing starts at $0.04 per million input tokens and $0.10 per million output tokens (cheapest at Deepinfra).

Specifications

Model Keydeepinfra/Qwen/Qwen2.5-7B-Instruct
ProviderDeepinfra
LiteLLM Providerdeepinfra
ModeText
Canonical Nameqwen-2.5-7b
Context Window33K tokens
Max Output33K tokens

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens$0.000040$0.040
Output Tokens$0.000100$0.100

Price Comparison by Provider

Compare prices for Qwen2.5 7B Instruct across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price
Output Price
Fireworks AIfireworks_ai/accounts/fireworks/models/qwen-v2p5-7b$0.200$0.200
Deepinfradeepinfra/Qwen/Qwen2.5-7B-Instruct$0.040$0.100
Novitanovita/qwen/qwen2.5-7b-instruct$0.070$0.070
Together AItogether_ai/Qwen/Qwen2.5-7B-Instruct-TurboN/AN/A

Similar Models

Models with similar capabilities and context window size.

Model
Provider
Mode
Input Price
Output Price
Context
Max Output
Vision
Functions
Codegeex4OllamaTextN/AN/A33K8Knono
DeepSeek Coder V2 InstructOllamaTextN/AN/A33K8Knoyes
DeepSeek Coder V2 Lite InstructOllamaTextN/AN/A33K8Knoyes
Internlm2 5 20B ChatOllamaTextN/AN/A33K8Knoyes
Mistral 7B Instruct V0.2OllamaTextN/AN/A33K33Knoyes
Mixtral 8x7B Instruct V0.1OllamaTextN/AN/A33K33Knoyes
Olmo 3 32B ThinkPublicaiTextN/AN/A33K4Knoyes
Olmo 3 7B InstructPublicaiTextN/AN/A33K4Knoyes
Olmo 3 7B ThinkPublicaiTextN/AN/A33K4Knoyes
Qwen SEA LION V4 32B ITPublicaiTextN/AN/A33K4Knoyes