Qwen3 VL 32B Instruct is Alibaba's language model with a 262K context window and up to 33K output tokens, available from 3 providers, starting at $0.104 / 1M input and $0.416 / 1M output. A 32B dense vision-language model from the Qwen3 series with significantly enhanced text understanding, visual perception, and multimodal reasoning capabilities.
Specifications
Canonical IDalibaba-qwen3-vl-32b-instruct
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Context Window262K tokens
Max Output33K tokens
Input ModalitiesImageText
Output ModalitiesText
Parameters32B
Release Date · 7 months ago
Benchmarks
Intelligence Index
17.2
#257
Coding Index
15.6
#219
Math Index
68.3
#93
MMLU-Pro
0.8
#118
GPQA
0.7
#211
HLE
0.1
#210
LiveCodeBench
0.5
#134
IFBench
0.4
#232
Time to First Token
1.35s
#381
SciCode
0.3
#223
AIME 2025
0.7
#93
LCR
0.3
#200
TerminalBench Hard
0.1
#204
TAU2
0.3
#231
Output TPS
77.4
#162

Capabilities

Input2/5
Text
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities2/13
Reasoning·
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandardBatch
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
OpenRouter logo
OpenRouter
qwen/qwen3-vl-32b-instruct
$0.104$0.416
Alibaba Qwen logo
Alibaba Qwen
qwen3-vl-32b-instruct
$0.160$0.640$0.080$0.320
Fireworks AI logo
Fireworks AI
fireworks_ai/accounts/fireworks/models/qwen3-vl-32b-instruct
$0.900$0.900

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Qwen3 VL 32B Instruct262K$0.104$0.416Current
Qwen3 VL 8B Instruct256K$0.080$0.200Available

Model IDs