Qwen3.5 4B is Alibaba's language model. A compact 4B-parameter model from the Qwen3.5 series, balancing small size with capable text generation for efficient deployment scenarios.
Specifications
Canonical IDalibaba-qwen3-5-4b
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Input ModalitiesText
Output ModalitiesText
Parameters4B
HuggingFace Likes496
HuggingFace Downloads (30d)3,966,264
HuggingFace Downloads (all-time)6,490,015
Benchmarks
Intelligence Index
20.1
#163
Coding Index
22.6
#71
GPQA
0.8
#131
HLE
0.1
#186
IFBench
0.5
#147
Time to First Token
0.63s
#299
SciCode
0.2
#380
LCR
0.6
#118
TerminalBench Hard
0.2
#148
TAU2
0.9
#47
Output TPS
20.0
#253

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Cheapest Instances to Run It

Cloud GPU instances that can host Qwen3.5 4B, ranked by cheapest on-demand price. The model needs about 10 GB of GPU memory at FP16 precision (estimated from its parameter count), so treat the fit as guidance rather than a guarantee.

All clouds
FP16 (full precision)
US Dollar ($)
Instance
Cloud
GPU
VRAM
Price
Cheapest region
Standard_NV4as_v4AzureAMD Radeon Instinct MI2516 GB$0.233/hrwestus2
g5g.xlargeAWST4g16 GB$0.420/hrus-east-1
Standard_NV8as_v4AzureAMD Radeon Instinct MI2516 GB$0.466/hrwestus2
7 more instances can run Qwen3.5 4B
Unlock the full ranked list and FP8 / INT4 quantization with a CloudPrice subscription.

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
EAGLE Qwen 2.5 3B InstructAvailable
Qwen3.7 Plus1.0M$0.320$1.28Available
Qwen3.7 Max1.0M$1.25$3.75Available
Qwen3.6 Max Preview262K$1.04$6.24Available
Qwen3.6 27B262K$0.150$0.500Available
Qwen3.6 35B A3B262K$0.140$0.450Available
Qwen3.6 Plus1.0M$0.325$1.95Available
Qwen3 Max Thinking262K$0.780$3.90Available
Qwen3 Max262K$0.780$3.90Available
Qwen3 Coder 30B A3B262K$0.150$0.600Available
Qwen3.5 4BCurrent

Model IDs

alibaba-qwen3-5-4b
huggingface-vlm-qwen3-5-4b
Qwen/Qwen3.5-4B
qwen3-5-4b
qwen3-5-4b-non-reasoning