Alibaba logo

Qwen3 VL Flash


Qwen3 VL Flash is Alibaba logoAlibaba's language model with a 262K context window and up to 33K output tokens, starting at $0.050 / 1M input and $0.400 / 1M output. A fast, lightweight variant of the Qwen3 vision-language series, optimized for efficient multimodal inference with reduced latency.
Spec
Canonical IDalibaba-qwen3-vl-flash
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Context Window262K tokens
Max Output33K tokens
Input ModalitiesText
Output ModalitiesText

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandardBatch
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
Alibaba Qwen logo
Alibaba Qwen
qwen3-vl-flash-us
$0.050$0.400$0.025$0.200

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Qwen3 Coder Flash1.0M$0.195$0.975Available
Qwen3 VL Flash262K$0.050$0.400Current
Qwen3 TTS FlashAvailable

Other models

ModelTierReleasedContextInput / 1MOutput / 1M
EAGLE Qwen 2.5 3B Instruct
Qwen3 Max ThinkingMax262K$0.780$3.90
Qwen3 Next 80B A3B128K$0.150$1.20
Qwen3 VL 235B A22B128K$0.530$2.66
Qwen3 VL 8B Thinking131K$0.117$1.36
Qwen3 VL 235B A22B Instruct131K$0.400$1.60
Qwen3 VL 235B A22B Thinking131K$0.400$4.00
Qwen3 MaxMax262K$0.359$1.43
Qwen3 Max PreviewMax262K$1.20$6.00
Qwen3 Coder PlusPlus1.0M$0.650$3.25

Model IDs