Qwen3 4B Instruct is Alibaba's instruction-tuned 4B-parameter Qwen3 language model, with a 262K-token context window and up to 33K output tokens. It is available from 2 providers, with pricing starting at $0.010 per 1M input tokens and $0.030 per 1M output tokens. The model offers efficient text generation and reasoning in a small parameter footprint.
Specifications
Canonical ID: alibaba-qwen3-4b-instruct
Type: Language
Status: Active
Creator: Alibaba
Providers: 2
Context Window: 262K tokens
Max Output: 33K tokens
Input Modalities: Text
Output Modalities: Text
Parameters: 4B
Benchmarks
Benchmark | Score | Rank
Intelligence Index | 12.5 | #349
Coding Index | 9.0 | #311
Math Index | 52.3 | #128
MMLU-Pro | 0.6 | #249
GPQA | 0.4 | #370
HLE | 0.0 | #411
LiveCodeBench | 0.2 | #240
AIME | 0.2 | #91
IFBench | 0.3 | #290
Time to First Token | 0.95s | #323
SciCode | 0.2 | #364
MATH-500 | 0.8 | #88
AIME 2025 | 0.5 | #128
LCR | 0.1 | #300
TerminalBench Hard | 0.0 | #251
TAU2 | 0.3 | #246
Output TPS | 0.0 | #275

Capabilities

Input (1/5)
Supported: Text
Not supported: Image, Audio, Video, PDF

Output (1/5)
Supported: Text
Not supported: Image, Audio, Video, Embedding

Capabilities (2/13)
Supported: Function Calling, Structured Outputs
Not supported: Reasoning, Adaptive Reasoning, Parallel Function Calling, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Pricing by Provider
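
As a rough illustration of the starting prices quoted in this listing ($0.010 per 1M input tokens and $0.030 per 1M output tokens), the Python sketch below estimates the cost of a single request. The helper name `estimate_cost` is hypothetical, and the sketch assumes billing is strictly linear in token count; individual providers may charge different rates or apply minimums.

```python
from decimal import Decimal

# Starting prices quoted in the listing (USD per 1M tokens).
INPUT_PRICE_PER_M = Decimal("0.010")
OUTPUT_PRICE_PER_M = Decimal("0.030")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Hypothetical helper: linear per-token cost estimate in USD.

    Assumes the starting prices apply and that billing scales
    linearly with token count (an assumption, not a provider guarantee).
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    return (input_cost + output_cost) / TOKENS_PER_MILLION


if __name__ == "__main__":
    # Example: a 200,000-token prompt with a 30,000-token completion.
    print(f"${estimate_cost(200_000, 30_000)}")
```

`Decimal` is used instead of `float` so that very small per-token amounts are not distorted by binary rounding.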

Versions

Version | Released | Context | Input / 1M | Output / 1M | Status
Voyage 4 Nano | | | | | Available
DeepSeek R1 Distill Qwen3 8B | | 131K | $0.200 | $0.200 | Available
Qwen3 Embedding 0.6B | | 33K | $0.010 | | Available
Qwen3 Embedding 4B | | 41K | $0.020 | | Available
Qwen3 Embedding 8B | | 41K | $0.020 | | Available
Qwen3 14B | | 132K | $0.060 | $0.200 | Available
Qwen3 32B | | 131K | $0.050 | $0.100 | Available
Qwen3 8B | | 131K | $0.035 | $0.138 | Available
Qwen3 4B Instruct | | 262K | $0.010 | $0.030 | Current
KwaiPilot KAT 32B Dev | | 131K | $0.900 | $0.900 | Available
Qwen3 0.6B | | 41K | $0.100 | $0.100 | Available

Model IDs