Alibaba logo

Qwen3 VL 32B Thinking


Qwen3 VL 32B Thinking is Alibaba logoAlibaba's language model with a 131K context window and up to 33K output tokens, starting at $0.160 / 1M input and $0.640 / 1M output. A thinking-optimized 32B dense vision-language model from the Qwen3 VL series, designed for deep multimodal reasoning across images, video, and text.
Spec
Canonical IDalibaba-qwen3-vl-32b-thinking
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Context Window131K tokens
Max Output33K tokens
Input ModalitiesImage
Output ModalitiesText
Reasoning Effortsdefault
Parameters32B

Capabilities

Input1/5
Text·
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities2/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandardBatch
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
Alibaba Qwen logo
Alibaba Qwen
qwen3-vl-32b-thinking
$0.160$0.640$0.080$0.320

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
DeepSeek R1 0528 Qwen3 8B128K$0.060$0.090Available
Qwen3 9.23 MaxAvailable
Qwen 7 28 Flash998KAvailable
Qwen 4 28 Plus129KAvailable
Qwen 3 32B128KAvailable
Qwen3.5-Flash1.0M$0.065$0.260Available
Qwen3.5 Plus 2026-02-151.0M$0.260$1.56Available
Qwen 1 25 Plus129KAvailable
Qwen3.5 Max258KAvailable
Qwen3.6 Plus1.0M$0.325$1.95Available
Qwen3 VL 32B Thinking131K$0.160$0.640Current

Model IDs