Alibaba logo

Qwen3.5 Flash


Qwen3.5 Flash is Alibaba's language model with a 1.0M context window and up to 66K output tokens, available from 3 providers, starting at $0.065 / 1M input and $0.260 / 1M output. A Flash-tier Qwen3.5 vision-language model integrating linear attention and sparse MoE for fast multimodal inference with reasoning and tool-use support.
Specifications
Canonical IDalibaba-qwen3-5-flash
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Context Window1.0M tokens
Max Output66K tokens
Input ModalitiesImagePdfTextVideo
Output ModalitiesText
Reasoning Effortsdefault
Release Date · 3 months ago

Capabilities

Input4/5
Text
Image
Audio·
Video
PDF
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities4/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs
Native JSON Schema
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandardBatch
Input
$ / 1M
Output
$ / 1M
Cache Read
$ / 1M
Input
$ / 1M
Output
$ / 1M
OpenRouter logo
OpenRouter
qwen/qwen3.5-flash-02-23
$0.065$0.260N/A
Alibaba Qwen logo
Alibaba Qwen
qwen3.5-flash
$0.100$0.400N/A$0.050$0.200
Vercel AI Gateway logo
Vercel AI Gateway
alibaba/qwen3.5-flash
$0.100$0.400$0.0010

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Qwen3.5 Flash1.0M$0.065$0.260Current
Qwen3 Coder Flash1.0M$0.195$0.975Available
Qwen3.5 Omni FlashAvailable
Qwen3 TTS FlashAvailable

Other models

ModelTierReleasedContextInput / 1MOutput / 1M
EAGLE Qwen 2.5 3B Instruct
Qwen3.6 27B262K$0.320$3.20
Qwen3.6 35B A3B262K$0.150$1.00
Qwen3.6 Max PreviewMax262K$1.04$6.24
Qwen3.6 PlusPlus1.0M$0.325$1.95
Qwen3 Max ThinkingMax262K$0.780$3.90
Qwen3 Next 80B A3B128K$0.150$1.20
Qwen3 MaxMax262K$0.359$1.43
Qwen3 Max PreviewMax262K$1.20$6.00
Qwen3 Coder PlusPlus1.0M$0.650$3.25

Model IDs