Alibaba logo

Qwen Flash


Qwen Flash is Alibaba logoAlibaba's language model with a 1.0M context window and up to 33K output tokens, starting at $0.050 / 1M input and $0.400 / 1M output. A lightweight, high-speed language model from Alibaba's Qwen series, designed for low-latency inference on simple to moderately complex tasks.
Spec
Canonical IDalibaba-qwen-flash
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Context Window1.0M tokens
Max Output33K tokens
Input ModalitiesText
Output ModalitiesText
Reasoning Effortsdefault

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities2/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandardBatch
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
Alibaba Qwen logo
Alibaba Qwen
qwen-flash
$0.050$0.400$0.025$0.200

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Qwen 7 28 Flash998KAvailable
Qwen3.5-Flash1.0M$0.065$0.260Available
Qwen 3.5 Flash1.0M$0.100$0.400Available
Qwen3 Coder Flash1.0M$0.195$0.975Available
Qwen Flash1.0M$0.050$0.400Current
Qwen Flash1.0M$0.050$0.400Available
Qwen MT Flash16K$0.160$0.490Available
Qwen3 TTS FlashAvailable
Qwen3 VL Flash262K$0.200$1.60Available
Qwen3 VL Flash262K$0.050$0.400Available

Other models

ModelTierReleasedContextInput / 1MOutput / 1M
DeepSeek R1 0528 Qwen3 8B128K$0.060$0.090
Qwen3 9.23 MaxMax
Qwen 4 28 PlusPlus129K
Qwen 3 32B128K
Qwen3.5 Plus 2026-02-15Plus1.0M$0.260$1.56
Qwen 1 25 PlusPlus129K
Qwen3.5 MaxMax258K
Qwen3.6 PlusPlus1.0M$0.325$1.95
Qwen3.6 Plus PreviewPlus1.0M
Qwen 3.5 PlusPlus1.0M$0.115$0.688

Model IDs