Alibaba logo

Qwen3 VL 32B Instruct


Qwen3 VL 32B Instruct is Alibaba's language model with a 131K context window and up to 33K output tokens, available from 3 providers, starting at $0.104 / 1M input and $0.416 / 1M output. A 32B dense vision-language model from the Qwen3 series with significantly enhanced text understanding, visual perception, and multimodal reasoning capabilities.
Specifications
Canonical IDalibaba-qwen3-vl-32b-instruct
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Context Window131K tokens
Max Output33K tokens
Input ModalitiesImageText
Output ModalitiesText
Parameters32B
Release Date · 7 months ago
Benchmarks
Intelligence Index
17.2
#249
Coding Index
15.6
#212
Math Index
68.3
#93
MMLU-Pro
0.8
#118
GPQA
0.7
#204
HLE
0.1
#204
LiveCodeBench
0.5
#134
IFBench
0.4
#225
Time to First Token
1.12s
#344
SciCode
0.3
#217
AIME 2025
0.7
#93
LCR
0.3
#193
TerminalBench Hard
0.1
#197
TAU2
0.3
#223
Output TPS
63.3
#184

Capabilities

Input2/5
Text
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities2/13
Reasoning·
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandardBatch
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
OpenRouter logo
OpenRouter
qwen/qwen3-vl-32b-instruct
$0.104$0.416
Alibaba Qwen logo
Alibaba Qwen
qwen3-vl-32b-instruct
$0.160$0.640$0.080$0.320
Fireworks AI logo
Fireworks AI
fireworks_ai/accounts/fireworks/models/qwen3-vl-32b-instruct
$0.900$0.900

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Qwen3 VL 32B Instruct131K$0.104$0.416Current
Qwen3 VL 8B Instruct131K$0.080$0.200Available

Model IDs