ASR MTL Realtime is Alibaba's language model, starting at $0.000047 / 1M input. A real-time multilingual automatic speech recognition model for live multi-language transcription.
Specifications
Canonical IDalibaba-asr-mtl-realtime
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Input ModalitiesText
Output ModalitiesText

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandardBatch
Input
$ / 1M
Input
$ / 1M
Alibaba Qwen logo
Alibaba Qwen
fun-asr-mtl-realtime
$0.000047$0.0000235

Cost Calculator

US Dollar ($)
Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Text 4 Embedding$0.070Available
Text 3 Embedding$0.070Available
HappyHorse 1.0Available
ASR MTL Realtime$0.000Current
ASR$0.000Available
ASR Flash$0.000Available
ASR Flash 8K Realtime$0.000Available
ASR MTL$0.000Available
ASR Realtime$0.000Available
Coder1.0M$0.300$1.50Available
Coder Plus131K$0.502$1.00Available

Model IDs

alibaba-asr-mtl-realtime
fun-asr-mtl-realtime
fun-asr-mtl-realtime-2025-12-10