
Qwen3 4B Instruct


Qwen3 4B Instruct is Alibaba's instruction-tuned 4B-parameter Qwen3 language model, offering efficient text generation and reasoning in a small parameter footprint. It has a 262K-token context window and supports up to 33K output tokens. It is available from 2 providers, with pricing starting at $0.010 per 1M input tokens and $0.030 per 1M output tokens.
Specifications
Canonical ID: alibaba-qwen3-4b-instruct
Type: Language
Status: Active
Creator: Alibaba
Providers: 2 (Hugging Face, Fireworks AI)
Context Window: 262K tokens
Max Output: 33K tokens
Input Modalities: Text
Output Modalities: Text
Parameters: 4B
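The context window and maximum output figures above can be turned into a simple token budget check. The sketch below is illustrative only: it uses the rounded values from this listing (the exact provider limits may differ), and it assumes that prompt and output tokens share the same context window, which the listing does not state. The names `max_prompt_tokens` and `fits` are hypothetical helpers, not part of any provider API.

```python
# Rounded figures taken from the listing; exact limits may differ by provider.
CONTEXT_WINDOW = 262_000  # "262K tokens"
MAX_OUTPUT = 33_000       # "33K tokens"


def max_prompt_tokens(requested_output: int) -> int:
    """Largest prompt that still leaves room for the requested output.

    Assumption: prompt and output tokens count against one shared window.
    """
    if requested_output < 0 or requested_output > MAX_OUTPUT:
        raise ValueError(f"requested_output must be between 0 and {MAX_OUTPUT}")
    return CONTEXT_WINDOW - requested_output


def fits(prompt_tokens: int, requested_output: int) -> bool:
    """Return True if the prompt plus the requested output fits the window."""
    return 0 <= prompt_tokens <= max_prompt_tokens(requested_output)


if __name__ == "__main__":
    print(max_prompt_tokens(MAX_OUTPUT))
    print(fits(250_000, 8_000))
    print(fits(250_000, 33_000))
```

Under these assumptions, reserving the full 33K output leaves roughly 229K tokens for the prompt.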
Benchmarks
Benchmark             Score   Rank
Intelligence Index    12.5    #342
Coding Index          9.0     #305
Math Index            52.3    #128
MMLU-Pro              0.6     #249
GPQA                  0.4     #364
HLE                   0.0     #404
LiveCodeBench         0.2     #240
AIME                  0.2     #91
IFBench               0.3     #283
Time to First Token   0.99s   #318
SciCode               0.2     #358
MATH-500              0.8     #88
AIME 2025             0.5     #128
LCR                   0.1     #294
TerminalBench Hard    0.0     #245
TAU2                  0.3     #239
Output TPS            0.0     #279

Capabilities

Input (1/5)
Supported: Text
Not supported: Image, Audio, Video, PDF

Output (1/5)
Supported: Text
Not supported: Image, Audio, Video, Embedding

Capabilities (2/13)
Supported: Function Calling, Structured Outputs
Not supported: Reasoning, Adaptive Reasoning, Parallel Function Calling, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Pricing by Provider

Provider       Standard input ($ / 1M)   Standard output ($ / 1M)   Model ID
Hugging Face   $0.010                    $0.030                     nscale:Qwen/Qwen3-4B-Instruct-2507
Fireworks AI   $0.200                    $0.200                     fireworks_ai/accounts/fireworks/models/qwen3-4b-instruct-2507
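As a rough illustration of how the per-1M-token prices above translate into a per-request cost, the following sketch multiplies token counts by the listed standard rates. The prices are copied from the provider table; the `request_cost` helper and the example token counts are hypothetical, and actual billing may include rounding, minimums, or other tiers not shown here.

```python
# Standard prices in USD per 1M tokens, copied from the provider table.
# Each value is (input price, output price).
PRICES = {
    "Hugging Face": (0.010, 0.030),
    "Fireworks AI": (0.200, 0.200),
}


def request_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at the listed standard rates."""
    input_price, output_price = PRICES[provider]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 100,000 input tokens and 10,000 output tokens.
    for name in PRICES:
        print(f"{name}: ${request_cost(name, 100_000, 10_000):.6f}")
```

For that hypothetical workload the estimate comes to about $0.0013 on Hugging Face and about $0.022 on Fireworks AI.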

Versions

Version                        Released   Context   Input / 1M   Output / 1M   Status
Voyage 4 Nano                  -          -         -            -             Available
DeepSeek R1 Distill Qwen3 8B   -          131K      $0.200       $0.200        Available
Qwen3 Embedding 0.6B           -          33K       $0.010       $0.000        Available
Qwen3 Embedding 4B             -          41K       $0.020       $0.000        Available
Qwen3 Embedding 8B             -          41K       $0.020       $0.000        Available
Qwen3 14B                      -          131K      $0.060       $0.200        Available
Qwen3 32B                      -          131K      $0.050       $0.100        Available
Qwen3 8B                       -          131K      $0.035       $0.138        Available
Qwen3 4B Instruct              -          262K      $0.010       $0.030        Current
KwaiPilot KAT 32B Dev          -          131K      $0.900       $0.900        Available
Qwen3 0.6B                     -          41K       $0.100       $0.100        Available

Model IDs