Qwen3 32B FP8


Qwen3 32B FP8 is Alibaba's language model with a 131K context window and up to 20K output tokens, available from 2 providers, starting at $0.050 / 1M input and $0.100 / 1M output. It is an FP8-quantized dense Qwen3 model with 32B parameters that integrates reasoning and non-reasoning modes, matching QwQ-32B inference capability at reduced compute cost.
Spec
Canonical ID: alibaba-qwen3-32b-fp8
Type: Language
Status: Active
Creator: Alibaba
Providers
Context Window: 131K tokens
Max Output: 20K tokens
Input Modalities: Text
Output Modalities: Text
Reasoning Efforts: default
Parameters: 32B

Capabilities

Input (1/5)
Supported: Text
Not supported: Image, Audio, Video, PDF
Output (1/5)
Supported: Text
Not supported: Image, Audio, Video, Embedding
Capabilities (3/13)
Supported: Reasoning, Function Calling, Parallel Function Calling
Not supported: Adaptive Reasoning, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Pricing by Provider

Provider | Model ID | Standard Input ($ / 1M) | Standard Output ($ / 1M)
Lambda | qwen3-32b-fp8 | $0.050 | $0.100
Novita | qwen/qwen3-32b-fp8 | $0.100 | $0.450
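
The per-token prices above can be turned into a per-request estimate with simple arithmetic. The following is a minimal sketch, assuming billing is strictly linear in tokens (no minimum charges, caching discounts, or extra tiers). The names `ProviderPrice`, `request_cost`, and `cheapest` are illustrative and not part of any provider API, and the token counts in the usage example are arbitrary.

```python
from dataclasses import dataclass

TOKENS_PER_MILLION = 1_000_000


@dataclass(frozen=True)
class ProviderPrice:
    """Standard-tier list price for one provider (hypothetical helper type)."""
    provider: str
    model_id: str
    input_per_million: float   # USD per 1M input tokens
    output_per_million: float  # USD per 1M output tokens


# Prices copied from the provider table above.
PRICES = [
    ProviderPrice("Lambda", "qwen3-32b-fp8", 0.050, 0.100),
    ProviderPrice("Novita", "qwen/qwen3-32b-fp8", 0.100, 0.450),
]


def request_cost(price: ProviderPrice, input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request, assuming purely linear billing."""
    total = (input_tokens * price.input_per_million
             + output_tokens * price.output_per_million)
    return total / TOKENS_PER_MILLION


def cheapest(prices, input_tokens: int, output_tokens: int) -> ProviderPrice:
    """Return the provider with the lowest estimated cost for this workload."""
    return min(prices, key=lambda p: request_cost(p, input_tokens, output_tokens))


if __name__ == "__main__":
    # Example workload: 100K input tokens and 20K output tokens (the listed max output).
    for p in PRICES:
        print(f"{p.provider}: ${request_cost(p, 100_000, 20_000):.4f}")
    print("Cheapest:", cheapest(PRICES, 100_000, 20_000).provider)
```

Because output tokens are priced higher than input tokens at both providers, the gap between providers widens for output-heavy workloads, so it may be worth running the estimate with your own token mix.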

Versions

Version | Released | Context | Input / 1M | Output / 1M | Status
DeepSeek R1 0528 Qwen3 8B | | 128K | $0.060 | $0.090 | Available
Qwen3 9.23 Max | | | | | Available
Qwen 7 28 Flash | | 998K | | | Available
Qwen 4 28 Plus | | 129K | | | Available
Qwen 3 32B | | 128K | | | Available
Qwen3.5-Flash | | 1.0M | $0.065 | $0.260 | Available
Qwen3.5 Plus 2026-02-15 | | 1.0M | $0.260 | $1.56 | Available
Qwen 1 25 Plus | | 129K | | | Available
Qwen3.5 Max | | 258K | | | Available
Qwen3.6 Plus | | 1.0M | $0.325 | $1.95 | Available
Qwen3 32B FP8 | | 131K | $0.050 | $0.100 | Current

Model IDs