Qwen3 Omni Flash Realtime is a language model from Alibaba, priced from $0.52 per 1M input tokens and $4.57 per 1M output tokens. It is a real-time streaming, fast-tier Qwen3 omnimodal model for live multimodal interaction.
Specifications
Canonical ID: alibaba-qwen3-omni-flash-realtime
Type: Language
Status: Active
Creator: Alibaba
Providers
Input Modalities: Text
Output Modalities: Text

Capabilities

Input (1/5)
Supported: Text
Not supported: Image, Audio, Video, PDF

Output (1/5)
Supported: Text
Not supported: Image, Audio, Video, Embedding

Capabilities (0/13)
Not supported: Reasoning, Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Pricing by Provider

Prices in US dollars ($) per 1M tokens.

| Provider | Model ID | Standard input | Standard output | Batch input | Batch output |
| --- | --- | --- | --- | --- | --- |
| Alibaba Qwen | qwen3-omni-flash-realtime | $0.52 | $4.57 | $0.26 | $2.29 |
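
The per-token arithmetic behind the prices listed above can be sketched in a few lines of Python. This is a minimal illustration, not a billing tool: the `Rates` class and the `estimate_cost` helper are hypothetical names introduced here, the rates are copied from the listing above, and the sketch assumes cost scales linearly with token count with no minimums, tiers, or rounding.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rates:
    """Prices in USD per 1M tokens (hypothetical container for illustration)."""
    input_per_million: float
    output_per_million: float


# Rates copied from the pricing listed above for qwen3-omni-flash-realtime.
STANDARD = Rates(input_per_million=0.52, output_per_million=4.57)
BATCH = Rates(input_per_million=0.26, output_per_million=2.29)


def estimate_cost(input_tokens: int, output_tokens: int, rates: Rates) -> float:
    """Estimate the USD cost of a workload, assuming purely linear pricing."""
    total = (input_tokens * rates.input_per_million
             + output_tokens * rates.output_per_million)
    return total / 1_000_000


if __name__ == "__main__":
    # Example workload: 2M input tokens and 500K output tokens.
    for label, rates in (("standard", STANDARD), ("batch", BATCH)):
        print(f"{label}: ${estimate_cost(2_000_000, 500_000, rates):.4f}")
```

For the example workload, the standard rates give 2 × $0.52 + 0.5 × $4.57 = $3.325, and the batch rates give 2 × $0.26 + 0.5 × $2.29 = $1.665.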

Versions

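| Version | Released | Context | Price per 1M (input / output) | Status |
| --- | --- | --- | --- | --- |
| Qwen3 Omni Flash Realtime | | | $0.520 / $4.57 | Current |
| Qwen3.5 Omni Flash | | | $0.400 / $3.00 | Available |
| Qwen Flash Character | | | $0.050 / $0.400 | Available |
| Qwen TTS Flash | | | $0.230 / $1.43 | Available |
| Qwen3 ASR Flash | | | $0.000 | Available |
| Qwen3 ASR Flash Filetrans | | | $0.000 | Available |
| Qwen3 ASR Flash Realtime | | | $0.000 | Available |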

Other Models

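| Model | Tier | Released | Context | Input / 1M | Output / 1M |
| --- | --- | --- | --- | --- | --- |
| EAGLE Qwen 2.5 3B Instruct | | | | | |
| Qwen3.7 Plus | Plus | | 1.0M | $0.320 | $1.28 |
| Qwen3.7 Max | Max | | 1.0M | $1.25 | $3.75 |
| Qwen3.6 27B | | | 262K | $0.150 | $0.500 |
| Qwen3.6 35B A3B | | | 262K | $0.140 | $0.450 |
| Qwen3.6 Max Preview | Max | | 262K | $1.04 | $6.24 |
| Qwen3.6 Plus | Plus | | 1.0M | $0.325 | $1.95 |
| Qwen3 Max Thinking | Max | | 262K | $0.780 | $3.90 |
| Qwen3 Max | Max | | 262K | $0.780 | $3.90 |
| Qwen3 Coder 30B A3B | | | 262K | $0.150 | $0.600 |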

Model IDs

alibaba-qwen3-omni-flash-realtime
qwen3-omni-flash-realtime
qwen3-omni-flash-realtime-2025-09-15
qwen3-omni-flash-realtime-2025-12-01
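
When several IDs refer to one catalog entry, client code often normalizes them before looking up pricing or capabilities. The sketch below shows one way to do that in Python. The `resolve_model_id` helper is a hypothetical name, and treating every ID listed above (including the dated ones) as an alias of the same canonical entry is an assumption made for illustration.

```python
CANONICAL_ID = "alibaba-qwen3-omni-flash-realtime"

# IDs copied from the list above. Treating all of them as aliases of a
# single catalog entry is an assumption, not something the listing states.
KNOWN_IDS = frozenset({
    "alibaba-qwen3-omni-flash-realtime",
    "qwen3-omni-flash-realtime",
    "qwen3-omni-flash-realtime-2025-09-15",
    "qwen3-omni-flash-realtime-2025-12-01",
})


def resolve_model_id(model_id: str) -> str:
    """Map any listed ID to the canonical ID, or raise ValueError if unknown."""
    normalized = model_id.strip().lower()
    if normalized not in KNOWN_IDS:
        raise ValueError(f"unknown model ID: {model_id!r}")
    return CANONICAL_ID


if __name__ == "__main__":
    print(resolve_model_id("qwen3-omni-flash-realtime-2025-12-01"))
```

Rejecting unknown IDs with an exception, rather than passing them through, keeps a typo from silently selecting the wrong pricing row.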