ASR MTL is Alibaba's language model, starting at $0.000035 / 1M input. A multilingual automatic speech recognition model supporting multiple target languages simultaneously.
Specifications
Canonical IDalibaba-asr-mtl
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Input ModalitiesText
Output ModalitiesText

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandardBatch
Input
$ / 1M
Input
$ / 1M
Alibaba Qwen logo
Alibaba Qwen
fun-asr-mtl
$0.000035$0.0000175

Cost Calculator

US Dollar ($)
Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Text 4 Embedding$0.070Available
Text 3 Embedding$0.070Available
HappyHorse 1.0Available
ASR MTL$0.000Current
ASR$0.000Available
ASR Flash$0.000Available
ASR Flash 8K Realtime$0.000Available
ASR MTL Realtime$0.000Available
ASR Realtime$0.000Available
Coder1.0M$0.300$1.50Available
Coder Plus131K$0.502$1.00Available

Model IDs

alibaba-asr-mtl
fun-asr-mtl
fun-asr-mtl-2025-08-25