DeepSeek R1 Distill Qwen 1.5B is a DeepSeek language model with a 131K-token context window. It is available from 3 providers, with pricing starting at $0.090 per 1M input tokens and $0.090 per 1M output tokens. It is a 1.5B-parameter Qwen-based model distilled from DeepSeek R1's reasoning chains, offering chain-of-thought capabilities in an extremely compact form factor.
Specifications
Canonical ID: deepseek-r1-distill-qwen-1-5b
Type: Language
Status: Active
Creator: DeepSeek
Providers: 3
Context Window: 131K tokens
Input Modalities: Text
Output Modalities: Text
Parameters: 1.5B
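The context window figure above bounds how much text a single request can carry. The following is a minimal sketch of a pre-flight budget check, not a provider API. It assumes that "131K" means 131,072 tokens, that input and output share the same window, and that roughly 4 characters make up one token. All three are assumptions, and the helper names `estimate_tokens` and `fits_in_context` are hypothetical.

```python
# Assumption: "131K tokens" is read as 2**17 = 131,072 tokens.
CONTEXT_WINDOW_TOKENS = 131_072

# Assumption: a rough average for English text, not the model's real tokenizer.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Roughly estimate the token count of text, rounding up."""
    return -(-len(text) // CHARS_PER_TOKEN)  # ceiling division


def fits_in_context(prompt: str, max_output_tokens: int,
                    window: int = CONTEXT_WINDOW_TOKENS) -> bool:
    """Check whether the prompt plus the reserved output budget fits the window.

    Assumes input and output tokens share one window.
    """
    if max_output_tokens < 0:
        raise ValueError("max_output_tokens must be non-negative")
    return estimate_tokens(prompt) + max_output_tokens <= window


if __name__ == "__main__":
    prompt = "Solve step by step: what is 17 * 23?"
    print(fits_in_context(prompt, max_output_tokens=8_000))
```

For an exact count, the character heuristic would need to be replaced with the tokenizer that ships with the model.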
Benchmarks
Intelligence Index: 9.1 (#412)
Math Index: 22.0 (#196)
MMLU-Pro: 0.3 (#315)
GPQA: 0.1 (#453)
HLE: 0.0 (#438)
LiveCodeBench: 0.1 (#307)
AIME: 0.2 (#95)
IFBench: 0.1 (#392)
Time to First Token:
SciCode: 0.1 (#414)
MATH-500: 0.7 (#133)
AIME 2025: 0.2 (#196)
LCR: 0.0 (#329)
Output TPS: 0.0 (#322)

Capabilities

Input (1 of 5 supported): Text
Not supported: Image, Audio, Video, PDF

Output (1 of 5 supported): Text
Not supported: Image, Audio, Video, Embedding

Capabilities (0 of 13 supported)
Not supported: Reasoning, Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Pricing by Provider

Versions

Version | Released | Context | Input / 1M | Output / 1M | Status
Dolphin 2.9.2 Qwen2 72B | | 131K | $0.900 | $0.900 | Available
DeepSeek R1 Distill Qwen 32B | | 131K | $0.150 | $0.150 | Available
DeepSeek R1 Distill Qwen 1.5B | | 131K | $0.090 | $0.090 | Current
DeepSeek R1 Distill Qwen 14B | | 131K | $0.070 | $0.070 | Available
Cogito V1 Preview Qwen 14B | | 131K | $0.200 | $0.200 | Available
Cogito V1 Preview Qwen 32B | | 131K | $0.900 | $0.900 | Available
DeepSeek R1 Distill Qwen 7B | | 131K | $0.072 | $0.144 | Available
QwQ 32B | | 131K | $0.150 | $0.200 | Available
Qwen2.5 Coder 32B Instruct | | 131K | $0.050 | $0.100 | Available
Qwen2.5 7B Instruct | | 131K | $0.040 | $0.070 | Available
Qwen2.5 72B Instruct | | 131K | $0.120 | $0.300 | Available
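
The per-1M-token prices in the Versions table above can be turned into a per-request estimate with simple arithmetic: cost = (input tokens × input price + output tokens × output price) / 1,000,000. The sketch below illustrates this for four of the listed models. The function name `estimate_cost` and the example workload of 2,000 input and 8,000 output tokens are assumptions chosen for illustration, and actual provider billing may differ.

```python
from decimal import Decimal

# USD per 1M tokens as (input, output), copied from the Versions table.
PRICES_PER_MILLION = {
    "DeepSeek R1 Distill Qwen 1.5B": (Decimal("0.090"), Decimal("0.090")),
    "DeepSeek R1 Distill Qwen 7B": (Decimal("0.072"), Decimal("0.144")),
    "DeepSeek R1 Distill Qwen 14B": (Decimal("0.070"), Decimal("0.070")),
    "DeepSeek R1 Distill Qwen 32B": (Decimal("0.150"), Decimal("0.150")),
}

MILLION = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_price, output_price = PRICES_PER_MILLION[model]
    return (input_price * input_tokens + output_price * output_tokens) / MILLION


if __name__ == "__main__":
    # Hypothetical workload: a short prompt with a long reasoning-style answer.
    for name in PRICES_PER_MILLION:
        cost = estimate_cost(name, input_tokens=2_000, output_tokens=8_000)
        print(f"{name}: ${cost:.6f}")
```

`Decimal` is used instead of floats so that small per-request amounts add up exactly when summed over many requests. For this workload the 1.5B model comes to $0.0009 per request, while the 14B distill is cheaper at its listed price.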

Model IDs