ASR Flash 8K Realtime is Alibaba's real-time automatic speech recognition (ASR) model, with 8K audio sampling support and fast-tier processing. Pricing starts at $0.000032 per 1M input tokens.
Specifications
Canonical ID: alibaba-asr-flash-8k-realtime
Type: Language
Status: Active
Creator: Alibaba
Providers
Input Modalities: Text
Output Modalities: Text

Capabilities

Input (1/5)
Supported: Text
Not supported: Image, Audio, Video, PDF

Output (1/5)
Supported: Text
Not supported: Image, Audio, Video, Embedding

Capabilities (0/13)
Not supported: Reasoning, Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Pricing by Provider

Prices in US Dollar ($), per 1M tokens.

Provider      Model ID                   Standard input ($ / 1M)  Batch input ($ / 1M)
Alibaba Qwen  fun-asr-flash-8k-realtime  $0.000032                $0.000016
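
As a worked illustration of the rates above, here is a minimal sketch that turns a token count into an input cost at the Standard and Batch rates. The two rates come from the pricing table; the function name `input_cost` and the token volumes used with it are hypothetical and chosen only for the example. `Decimal` is used because the rates are too small to handle comfortably with binary floats.

```python
from decimal import Decimal

# Rates copied from the pricing table, in US dollars per 1M input tokens.
STANDARD_INPUT_PER_1M = Decimal("0.000032")
BATCH_INPUT_PER_1M = Decimal("0.000016")

TOKENS_PER_PRICING_UNIT = Decimal(1_000_000)


def input_cost(tokens: int, rate_per_1m: Decimal) -> Decimal:
    """Return the input cost in US dollars for `tokens` at the given per-1M rate.

    Hypothetical helper for illustration; it is not part of any provider SDK.
    """
    if tokens < 0:
        raise ValueError("tokens must be non-negative")
    return Decimal(tokens) * rate_per_1m / TOKENS_PER_PRICING_UNIT
```

For example, `input_cost(250_000_000, STANDARD_INPUT_PER_1M)` evaluates to 0.008 dollars, and the same volume at `BATCH_INPUT_PER_1M` to 0.004 dollars, since the listed Batch rate is half the Standard rate.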

Versions

Version                Released  Context  Input / 1M  Output / 1M  Status
ASR Flash 8K Realtime  -         -        $0.000      -            Current
ASR Flash              -         -        $0.000      -            Available
Flash                  -         998K     $0.050      $0.400       Available
Flash US               -         1.0M     $0.050      $0.400       Available
MT Flash               -         16K      $0.160      $0.490       Available

Other Models

Model             Tier   Released  Context  Input / 1M  Output / 1M
Text 4 Embedding  -      -         -        $0.070      -
Text 3 Embedding  -      -         -        $0.070      -
HappyHorse 1.0    -      -         -        -           -
ASR               -      -         -        $0.000      -
ASR MTL           -      -         -        $0.000      -
ASR MTL Realtime  -      -         -        $0.000      -
ASR Realtime      -      -         -        $0.000      -
Coder             -      -         1.0M     $0.300      $1.50
Coder Plus        Plus   -         131K     $0.502      $1.00
Coder Turbo       Turbo  -         131K     $0.287      $0.861

Model IDs

alibaba-asr-flash-8k-realtime
fun-asr-flash-8k-realtime
fun-asr-flash-8k-realtime-2026-01-28
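
The last ID in the list above ends in what looks like a date. As a rough sketch, and assuming (the page does not say so) that a trailing `YYYY-MM-DD` marks a dated snapshot of the same model, an ID could be split into its base name and date as follows. The names `ModelId` and `split_model_id` are hypothetical.

```python
import re
from datetime import date
from typing import NamedTuple, Optional


class ModelId(NamedTuple):
    base: str                 # ID with any trailing date removed
    snapshot: Optional[date]  # parsed trailing date, or None if absent


# Assumption: a dated ID is "<base>-YYYY-MM-DD"; anything else is left whole.
_DATED_ID = re.compile(r"^(?P<base>.+)-(?P<y>\d{4})-(?P<m>\d{2})-(?P<d>\d{2})$")


def split_model_id(model_id: str) -> ModelId:
    """Split a model ID into its base name and optional trailing date."""
    match = _DATED_ID.match(model_id)
    if match is None:
        return ModelId(model_id, None)
    # date() raises ValueError if the digits do not form a real calendar date.
    snapshot = date(int(match["y"]), int(match["m"]), int(match["d"]))
    return ModelId(match["base"], snapshot)
```

Under that assumption, `fun-asr-flash-8k-realtime-2026-01-28` splits into the base `fun-asr-flash-8k-realtime` and the date 2026-01-28, while the two undated IDs are returned unchanged with no date.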