
Qwen3 TTS Flash


Qwen3 TTS Flash is Alibaba's text-to-speech model: a fast, low-latency model from the Qwen3 series, optimized for real-time voice synthesis with efficient inference.
Spec
Canonical ID: alibaba-qwen3-tts-flash
Type: Text to Speech
Status: Active
Creator: Alibaba
Input Modalities: Text
Output Modalities: Audio
Elo Rating: 940 (#232)

Capabilities

Input (1/5)
Supported: Text
Not supported: Image, Audio, Video, PDF

Output (1/5)
Supported: Audio
Not supported: Text, Image, Video, Embedding

Capabilities (0/13)
Not supported: Reasoning, Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Versions

Version            Released  Context  Input / 1M  Output / 1M  Status
Qwen 7 28 Flash    -         998K     -           -            Available
Qwen3.5-Flash      -         1.0M     $0.065      $0.260       Available
Qwen 3.5 Flash     -         1.0M     $0.100      $0.400       Available
Qwen3 Coder Flash  -         1.0M     $0.195      $0.975       Available
Qwen3 TTS Flash    -         -        -           -            Current
Qwen Flash         -         1.0M     $0.050      $0.400       Available
Qwen Flash         -         1.0M     $0.050      $0.400       Available
Qwen MT Flash      -         16K      $0.160      $0.490       Available
Qwen3 VL Flash     -         262K     $0.200      $1.60        Available
Qwen3 VL Flash     -         262K     $0.050      $0.400       Available
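
The per-million prices in the Versions table can be turned into a rough per-request estimate. The sketch below uses the Qwen3.5-Flash row ($0.065 input, $0.260 output) and assumes that "Input / 1M" and "Output / 1M" mean US dollars per one million tokens, which the page does not state explicitly. The function name estimate_cost and the token counts are hypothetical, chosen only for illustration. No price is listed for Qwen3 TTS Flash itself, so it is not used here.

```python
from decimal import Decimal

# Prices copied from the Qwen3.5-Flash row of the Versions table.
# Assumption: both columns are USD per one million tokens.
INPUT_PRICE_PER_1M = Decimal("0.065")
OUTPUT_PRICE_PER_1M = Decimal("0.260")
TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int,
                  input_price: Decimal = INPUT_PRICE_PER_1M,
                  output_price: Decimal = OUTPUT_PRICE_PER_1M) -> Decimal:
    """Return the estimated USD cost of a single request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Scale each token count by its per-million price, then sum.
    total = Decimal(input_tokens) * input_price + Decimal(output_tokens) * output_price
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    # Illustrative request: 200,000 input tokens and 50,000 output tokens.
    print(estimate_cost(200_000, 50_000))
```

Decimal is used instead of float so that small per-token amounts add up without binary rounding error. Swapping in another row's prices only requires passing different input_price and output_price values.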

Other models

Model                      Tier  Released  Context  Input / 1M  Output / 1M
DeepSeek R1 0528 Qwen3 8B  -     -         128K     $0.060      $0.090
Qwen3 9.23 Max             Max   -         -        -           -
Qwen 4 28 Plus             Plus  -         129K     -           -
Qwen 3 32B                 -     -         128K     -           -
Qwen3.5 Plus 2026-02-15    Plus  -         1.0M     $0.260      $1.56
Qwen 1 25 Plus             Plus  -         129K     -           -
Qwen3.5 Max                Max   -         258K     -           -
Qwen3.6 Plus               Plus  -         1.0M     $0.325      $1.95
Qwen3.6 Plus Preview       Plus  -         1.0M     -           -
Qwen 3.5 Plus              Plus  -         1.0M     $0.115      $0.688

Model IDs