
Qwen3 VL 235B A22B Instruct


Qwen3 VL 235B A22B Instruct is Alibaba's vision-language model with a 262K-token context window, starting at $0.200 per 1M input tokens and $0.880 per 1M output tokens. It is a large-scale 235B MoE vision-language instruction model with significantly improved visual coding, spatial perception, and multimodal recognition capabilities.
Spec
Canonical ID: alibaba-qwen3-vl-instruct
Type: Language
Status: Active
Creator: Alibaba
Providers
Context Window: 262K tokens
Input Modalities: Image, PDF
Output Modalities: Text
Release Date: 7 months ago

Capabilities

Input (2/5 supported)
Supported: Image, PDF
Not supported: Text, Audio, Video

Output (1/5 supported)
Supported: Text
Not supported: Image, Audio, Video, Embedding

Capabilities (0/13 supported)
Not supported: Reasoning, Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Pricing by Provider

Standard tier, prices in USD per 1M tokens:

Provider | Model ID | Input | Output | Cache Read
Vercel AI Gateway | alibaba/qwen3-vl-instruct | $0.200 | $0.880 | $0.110

Cost Calculator
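
As a rough illustration of the arithmetic behind a cost estimate, the sketch below applies the Vercel AI Gateway prices listed above to a given number of tokens. The function name `estimate_cost` and the `cached_tokens` parameter are hypothetical, and the billing rule (cached input tokens charged at the cache-read rate instead of the input rate) is an assumption, not something this page states.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the Vercel AI Gateway pricing row above.
PRICE_PER_MILLION = {
    "input": Decimal("0.200"),
    "output": Decimal("0.880"),
    "cache_read": Decimal("0.110"),
}
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: cached_tokens is the part of input_tokens served from cache
    and is billed at the cache-read rate rather than the input rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * PRICE_PER_MILLION["input"]
        + cached_tokens * PRICE_PER_MILLION["cache_read"]
        + output_tokens * PRICE_PER_MILLION["output"]
    )
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    # 100K input tokens (half of them cached) and 2K output tokens.
    print(estimate_cost(100_000, 2_000, cached_tokens=50_000))
```

Decimal is used instead of float so that per-token prices of a fraction of a cent do not pick up binary rounding error when summed over many requests.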


Versions

Version | Released | Context | Input / 1M | Output / 1M | Status
DeepSeek R1 0528 Qwen3 8B | - | 128K | $0.060 | $0.090 | Available
Qwen3 9.23 Max | - | - | - | - | Available
Qwen 7 28 Flash | - | 998K | - | - | Available
Qwen 4 28 Plus | - | 129K | - | - | Available
Qwen 3 32B | - | 128K | - | - | Available
Qwen3.5-Flash | - | 1.0M | $0.065 | $0.260 | Available
Qwen3.5 Plus 2026-02-15 | - | 1.0M | $0.260 | $1.56 | Available
Qwen 1 25 Plus | - | 129K | - | - | Available
Qwen3.5 Max | - | 258K | - | - | Available
Qwen3.6 Plus | - | 1.0M | $0.325 | $1.95 | Available
Qwen3 VL 235B A22B Instruct | - | 262K | $0.200 | $0.880 | Current

Model IDs