Qwen3 32 B Pricing & Specs | AI Models

Qwen3 32B is a text model from Nebius with a context window of 33K tokens and max output of 33K tokens. Pricing starts at 0.10 per million input tokens and 0.30 per million output tokens (cheapest at Lambda).

Capabilities

✗ Vision✓ Function Calling✗ Reasoning✗ JSON Schema✗ System Messages✗ Web Search✗ Prompt Caching✗ Audio Input✗ Audio Output

Specifications

Model Key	`nebius/Qwen/Qwen3-32B`
Provider	Nebius
Provider ID	nebius
Mode	Text
Canonical Name	qwen-3-32b
Context Window	33K tokens
Max Output	33K tokens

Pricing

Type	Per 1K Tokens	Per 1M Tokens
Input Tokens	0.000100	0.100
Output Tokens	0.000300	0.300

Benchmarks

Intelligence Index	14.5#136
Math Index	19.7#101
MMLU-Pro	0.7#97
GPQA	0.5#121
HLE	0.0#149
LiveCodeBench	0.3#115
AIME	0.3#47
IFBench	0.3#132
Time to First Token	0.98s#163
SciCode	0.3#115
MATH-500	0.9#51
AIME 2025	0.2#101
LCR	0.0#151

Price Comparison by Provider

Compare prices for Qwen3 32B across different providers. The same model may be available through multiple providers at different price points.

Provider	Model Key	Input Price, $	Output Price, $
Vercel AI Gateway	vercel_ai_gateway/alibaba/qwen-3-32b	0.100	0.300
SambaNova	sambanova/Qwen3-32B	0.400	0.800
AWS Bedrock	qwen.qwen3-32b-v1:0	0.150	0.600
OVHcloud	ovhcloud/Qwen3-32B	0.080	0.230
Novita AI	novita/qwen/qwen3-32b-fp8	0.100	0.450
Nebius	nebius/Qwen/Qwen3-32B	0.100	0.300
Lambda	lambda_ai/qwen3-32b-fp8	0.050	0.100
Groq	groq/qwen/qwen3-32b	0.290	0.590
Gradient AI	gradient_ai/alibaba-qwen3-32b	N/A	N/A
Fireworks AI	fireworks_ai/accounts/fireworks/models/qwen3-32b	0.900	0.900
DeepInfra	deepinfra/Qwen/Qwen3-32B	0.100	0.280
Cerebras	cerebras/qwen-3-32b	0.400	0.800

All Variants

All available versions, regions, and API endpoints for Qwen3 32B.

Model Key	Provider	Mode	Input Price, $	Output Price, $	Context	Max Output	Vision	Functions
qwen.qwen3-32b-v1:0	AWS Bedrock	Text	0.150	0.600	131K	16K	no	yes
cerebras/qwen-3-32b	Cerebras	Text	0.400	0.800	128K	128K	no	yes
deepinfra/Qwen/Qwen3-32B	DeepInfra	Text	0.100	0.280	41K	41K	no	yes
fireworks_ai/accounts/fireworks/models/qwen3-32b	Fireworks AI	Text	0.900	0.900	131K	131K	no	no
gradient_ai/alibaba-qwen3-32b	Gradient AI	Text	N/A	N/A	2K	N/A	no	no
groq/qwen/qwen3-32b	Groq	Text	0.290	0.590	131K	131K	no	yes
lambda_ai/qwen3-32b-fp8	Lambda	Text	0.050	0.100	131K	131K	no	yes
nebius/Qwen/Qwen3-32B	Nebius	Text	0.100	0.300	33K	33K	no	yes
novita/qwen/qwen3-32b-fp8	Novita AI	Text	0.100	0.450	41K	20K	no	no
ovhcloud/Qwen3-32B	OVHcloud	Text	0.080	0.230	32K	32K	no	yes
sambanova/Qwen3-32B	SambaNova	Text	0.400	0.800	8K	8K	no	yes
vercel_ai_gateway/alibaba/qwen-3-32b	Vercel AI Gateway	Text	0.100	0.300	41K	16K	no	yes

← Back to All Models