Qwen3 32B

Qwen3 32B is a text model from Nebius with a context window of 33K tokens and max output of 33K tokens. Pricing starts at 0.10 per million input tokens and 0.30 per million output tokens (cheapest at Lambda).

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Specifications

Model Keynebius/Qwen/Qwen3-32B
ProviderNebius
Provider IDnebius
ModeText
Canonical Nameqwen-3-32b
Context Window33K tokens
Max Output33K tokens

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens0.0001000.100
Output Tokens0.0003000.300

Benchmarks

Intelligence Index14.5#136
Math Index19.7#101
MMLU-Pro0.7#97
GPQA0.5#121
HLE0.0#149
LiveCodeBench0.3#115
AIME0.3#47
IFBench0.3#132
Time to First Token0.98s#163
SciCode0.3#115
MATH-5000.9#51
AIME 20250.2#101
LCR0.0#151

Price Comparison by Provider

Compare prices for Qwen3 32B across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price, $
Output Price, $
Vercel AI Gatewayvercel_ai_gateway/alibaba/qwen-3-32b0.1000.300
SambaNovasambanova/Qwen3-32B0.4000.800
AWS Bedrockqwen.qwen3-32b-v1:00.1500.600
OVHcloudovhcloud/Qwen3-32B0.0800.230
Novita AInovita/qwen/qwen3-32b-fp80.1000.450
Nebiusnebius/Qwen/Qwen3-32B0.1000.300
Lambdalambda_ai/qwen3-32b-fp80.0500.100
Groqgroq/qwen/qwen3-32b0.2900.590
Gradient AIgradient_ai/alibaba-qwen3-32bN/AN/A
Fireworks AIfireworks_ai/accounts/fireworks/models/qwen3-32b0.9000.900
DeepInfradeepinfra/Qwen/Qwen3-32B0.1000.280
Cerebrascerebras/qwen-3-32b0.4000.800

All Variants

All available versions, regions, and API endpoints for Qwen3 32B.

Model Key
Provider
Mode
Input Price, $
Output Price, $
Context
Max Output
Vision
Functions
qwen.qwen3-32b-v1:0AWS BedrockText0.1500.600131K16Knoyes
cerebras/qwen-3-32bCerebrasText0.4000.800128K128Knoyes
deepinfra/Qwen/Qwen3-32BDeepInfraText0.1000.28041K41Knoyes
fireworks_ai/accounts/fireworks/models/qwen3-32bFireworks AIText0.9000.900131K131Knono
gradient_ai/alibaba-qwen3-32bGradient AITextN/AN/A2KN/Anono
groq/qwen/qwen3-32bGroqText0.2900.590131K131Knoyes
lambda_ai/qwen3-32b-fp8LambdaText0.0500.100131K131Knoyes
nebius/Qwen/Qwen3-32BNebiusText0.1000.30033K33Knoyes
novita/qwen/qwen3-32b-fp8Novita AIText0.1000.45041K20Knono
ovhcloud/Qwen3-32BOVHcloudText0.0800.23032K32Knoyes
sambanova/Qwen3-32BSambaNovaText0.4000.8008K8Knoyes
vercel_ai_gateway/alibaba/qwen-3-32bVercel AI GatewayText0.1000.30041K16Knoyes