Qwen2.5 7B Instruct Pricing & Specs | AI Models

Qwen2.5 7B Instruct is a text model offered through DeepInfra with a context window of 33K tokens and a maximum output of 33K tokens. Pricing starts at $0.04 per million input tokens and $0.10 per million output tokens (cheapest at DeepInfra).

Capabilities

✗ Vision · ✗ Function Calling · ✗ Reasoning · ✗ JSON Schema · ✗ System Messages · ✗ Web Search · ✗ Prompt Caching · ✗ Audio Input · ✗ Audio Output

Specifications

Model Key	`deepinfra/Qwen/Qwen2.5-7B-Instruct`
Provider	DeepInfra
Provider ID	deepinfra
Mode	Text
Canonical Name	qwen-2.5-7b
Context Window	33K tokens
Max Output	33K tokens

Pricing

Type	Per 1K Tokens, $	Per 1M Tokens, $
Input Tokens	0.000040	0.040
Output Tokens	0.000100	0.100
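
As a worked example of the rates above, the cost of a single request can be estimated from its token counts. This is a minimal sketch, not an official calculator: the function name `estimate_cost` and the token counts are illustrative assumptions, and the rates are assumed to be US dollars per 1M tokens, as in the table.

```python
from decimal import Decimal

# Rates copied from the pricing table (assumed to be USD per 1M tokens).
INPUT_PER_M = Decimal("0.040")
OUTPUT_PER_M = Decimal("0.100")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated cost in dollars for one request."""
    total = input_tokens * INPUT_PER_M + output_tokens * OUTPUT_PER_M
    return total / TOKENS_PER_M


# Hypothetical request: 20,000 input tokens and 2,000 output tokens.
print(estimate_cost(20_000, 2_000))
```

`Decimal` is used instead of `float` so that very small per-request amounts do not pick up binary rounding noise.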

Benchmarks

No benchmark data is available for this model.

Price Comparison by Provider

Compare prices for Qwen2.5 7B Instruct across different providers. The same model may be available through multiple providers at different price points.

Provider	Model Key	Input Price, $ per 1M Tokens	Output Price, $ per 1M Tokens
Together AI	together_ai/Qwen/Qwen2.5-7B-Instruct-Turbo	N/A	N/A
Novita AI	novita/qwen/qwen2.5-7b-instruct	0.070	0.070
Fireworks AI	fireworks_ai/accounts/fireworks/models/qwen-v2p5-7b	0.200	0.200
DeepInfra	deepinfra/Qwen/Qwen2.5-7B-Instruct	0.040	0.100
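
Which provider is cheapest depends on the mix of input and output tokens, because the rates differ per direction. The sketch below compares the listed providers for a given workload. The helper names `cost_by_provider` and `cheapest` are hypothetical, the prices are assumed to be US dollars per 1M tokens, and Together AI is left out because its prices are listed as N/A.

```python
from decimal import Decimal

# (input, output) rates from the comparison table, assumed USD per 1M tokens.
# Together AI is omitted because its prices are listed as N/A.
RATES = {
    "DeepInfra": (Decimal("0.040"), Decimal("0.100")),
    "Novita AI": (Decimal("0.070"), Decimal("0.070")),
    "Fireworks AI": (Decimal("0.200"), Decimal("0.200")),
}


def cost_by_provider(input_tokens: int, output_tokens: int) -> dict[str, Decimal]:
    """Estimated dollar cost of the workload at each provider."""
    million = Decimal(1_000_000)
    return {
        name: (input_tokens * rate_in + output_tokens * rate_out) / million
        for name, (rate_in, rate_out) in RATES.items()
    }


def cheapest(input_tokens: int, output_tokens: int) -> str:
    """Name of the lowest-cost provider (first listed wins on a tie)."""
    costs = cost_by_provider(input_tokens, output_tokens)
    return min(costs, key=costs.get)


# Hypothetical workloads: one input-heavy, one output-heavy.
print(cheapest(3_000_000, 1_000_000))
print(cheapest(1_000_000, 3_000_000))
```

At these rates, DeepInfra costs less when input tokens outnumber output tokens, Novita AI costs less when output tokens outnumber input tokens, and the two are equal when the counts match.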

All Variants

All available versions, regions, and API endpoints for Qwen2.5 7B Instruct.

Model Key	Provider	Mode	Input Price, $ per 1M Tokens	Output Price, $ per 1M Tokens	Context	Max Output	Vision	Functions
deepinfra/Qwen/Qwen2.5-7B-Instruct	DeepInfra	Text	0.040	0.100	33K	33K	no	no
fireworks_ai/accounts/fireworks/models/qwen-v2p5-7b	Fireworks AI	Text	0.200	0.200	131K	131K	no	no
fireworks_ai/accounts/fireworks/models/qwen2p5-7b-instruct	Fireworks AI	Text	0.200	0.200	33K	33K	no	no
novita/qwen/qwen2.5-7b-instruct	Novita AI	Text	0.070	0.070	32K	32K	no	yes
together_ai/Qwen/Qwen2.5-7B-Instruct-Turbo	Together AI	Text	N/A	N/A	N/A	N/A	no	yes
