Qwen3 Omni 30B A3B Instruct is Alibaba's language model with a 66K context window and up to 16K output tokens, starting at $0.250 / 1M input and $0.970 / 1M output. An end-to-end multilingual omni-modal LLM with 30B total and 3B activated parameters, natively processing text, images, audio, and video inputs.
Specifications
Canonical IDalibaba-qwen3-omni-30b-a3b-instruct
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Context Window66K tokens
Max Output16K tokens
Input ModalitiesAudioImage
Output ModalitiesAudio
Parameters30B
Benchmarks
Intelligence Index
10.7
#375
Coding Index
7.2
#329
Math Index
52.3
#129
MMLU-Pro
0.7
#185
GPQA
0.6
#240
HLE
0.1
#266
LiveCodeBench
0.4
#161
IFBench
0.3
#320
Time to First Token
1.04s
#329
SciCode
0.2
#345
AIME 2025
0.5
#129
LCR
0.0
#342
TerminalBench Hard
0.0
#306
TAU2
0.2
#318
Output TPS
104.6
#106

Capabilities

Input2/5
Text·
Image
Audio
Video·
PDF·
Output1/5
Text·
Image·
Audio
Video·
Embedding·
Capabilities3/13
Reasoning·
Adaptive Reasoning·
Function Calling
Parallel Function Calling
Structured Outputs
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Novita logo
Novita
novita/qwen/qwen3-omni-30b-a3b-instruct
$0.250$0.970

Cost Calculator

Preset:

Model IDs