Qwen3 4B Thinking is Alibaba's language model, starting at $0.010 / 1M input and $0.030 / 1M output. A thinking-mode variant of the Qwen3 4B model that enables chain-of-thought reasoning within a compact, efficient architecture.
Specifications
Canonical IDalibaba-qwen3-4b-thinking
TypeLanguage
StatusActive
CreatorAlibabaAlibaba
Providers
Input ModalitiesText
Output ModalitiesText
Parameters4B

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Hugging Face logo
Hugging Face
nscale:Qwen/Qwen3-4B-Thinking-2507
$0.010$0.030

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
EAGLE Qwen 2.5 3B InstructAvailable
Qwen3.7 Max1.0M$1.25$3.75Available
Qwen3.6 Max Preview262K$1.04$6.24Available
Qwen3.6 27B262K$0.290$3.20Available
Qwen3.6 35B A3B262K$0.140$1.00Available
Qwen3.6 Plus1.0M$0.325$1.95Available
Qwen3 Max Thinking262K$0.780$3.90Available
Qwen3 Next 80B A3B128K$0.140$1.20Available
Qwen3 Max262K$0.359$1.43Available
Qwen3 Max Preview262K$1.20$6.00Available
Qwen3 4B Thinking$0.010$0.030Current

Model IDs