Qwen2.5 32B Instruct

Qwen2.5 32B Instruct is a text model from Nebius with a context window of 128K tokens and max output of 128K tokens. Pricing starts at 0.06 per million input tokens and 0.20 per million output tokens (cheapest at Nebius).

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Specifications

Model Keynebius/Qwen/Qwen2.5-32B-Instruct
ProviderNebius
Provider IDnebius
ModeText
Canonical Nameqwen-2.5-32b
Context Window128K tokens
Max Output128K tokens

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens0.0000600.060
Output Tokens0.0002000.200

Benchmarks

Intelligence Index13.2#149
MMLU-Pro0.7#111
GPQA0.5#148
HLE0.0#185
LiveCodeBench0.2#133
AIME0.1#81
Time to First Token0.00s#1
SciCode0.2#146
MATH-5000.8#68

Price Comparison by Provider

Compare prices for Qwen2.5 32B Instruct across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price, $
Output Price, $
Nebiusnebius/Qwen/Qwen2.5-32B-Instruct0.0600.200
Fireworks AIfireworks_ai/accounts/fireworks/models/qwen2p5-32b0.9000.900

All Variants

All available versions, regions, and API endpoints for Qwen2.5 32B Instruct.

Model Key
Provider
Mode
Input Price, $
Output Price, $
Context
Max Output
Vision
Functions
fireworks_ai/accounts/fireworks/models/qwen2p5-32bFireworks AIText0.9000.900131K131Knono
fireworks_ai/accounts/fireworks/models/qwen2p5-32b-instructFireworks AIText0.9000.90033K33Knono
nebius/Qwen/Qwen2.5-32B-InstructNebiusText0.0600.200128K128Knoyes