ASR Flash is Alibaba's language model, starting at $0.000035 / 1M input. A fast-tier automatic speech recognition model optimized for low-latency transcription.
Specifications
Canonical IDalibaba-asr-flash
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Input ModalitiesText
Output ModalitiesText

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandardBatch
Input
$ / 1M
Input
$ / 1M
Alibaba Qwen logo
Alibaba Qwen
fun-asr-flash-2026-06-15
$0.000035$0.0000175

Cost Calculator

US Dollar ($)
Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
ASR Flash$0.000Current
ASR Flash 8K Realtime$0.000Available
Flash998K$0.050$0.400Available
Flash US1.0M$0.050$0.400Available
MT Flash16K$0.160$0.490Available

Other Models

ModelTierReleasedContextInput / 1MOutput / 1M
Text 4 Embedding$0.070
Text 3 Embedding$0.070
HappyHorse 1.0
ASR$0.000
ASR MTL$0.000
ASR MTL Realtime$0.000
ASR Realtime$0.000
Coder1.0M$0.300$1.50
Coder PlusPlus131K$0.502$1.00
Coder TurboTurbo131K$0.287$0.861

Model IDs

alibaba-asr-flash
fun-asr-flash-2026-06-15