DeepSeek R1 Distill Qwen 1.5B is a language model from DeepSeek with a 131K-token context window. It is available from 3 providers, with prices starting at $0.090 per 1M input tokens and $0.090 per 1M output tokens. It is a 1.5B-parameter Qwen-based model distilled from DeepSeek R1's reasoning chains, offering chain-of-thought capabilities in an extremely compact form factor.
Specifications
Canonical ID: deepseek-r1-distill-qwen-1-5b
Type: Language
Status: Active
Creator: DeepSeek
Providers: 3
Context Window: 131K tokens
Input Modalities: Text
Output Modalities: Text
Parameters: 1.5B
Benchmarks
Intelligence Index: 9.1 (#411)
Math Index: 22.0 (#196)
MMLU-Pro: 0.3 (#315)
GPQA: 0.1 (#452)
HLE: 0.0 (#437)
LiveCodeBench: 0.1 (#307)
AIME: 0.2 (#95)
IFBench: 0.1 (#391)
Time to First Token: no value listed
SciCode: 0.1 (#413)
MATH-500: 0.7 (#133)
AIME 2025: 0.2 (#196)
LCR: 0.0 (#328)
Output TPS: 0.0 (#323)

Capabilities

Input (1/5): Text. Not supported: Image, Audio, Video, PDF.
Output (1/5): Text. Not supported: Image, Audio, Video, Embedding.
Capabilities (0/13): none. Not supported: Reasoning, Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill.

Versions

Version                        Released  Context  Input / 1M  Output / 1M  Status
Dolphin 2.9.2 Qwen2 72B        -         131K     $0.900      $0.900       Available
DeepSeek R1 Distill Qwen 32B   -         131K     $0.150      $0.150       Available
DeepSeek R1 Distill Qwen 1.5B  -         131K     $0.090      $0.090       Current
DeepSeek R1 Distill Qwen 14B   -         131K     $0.070      $0.070       Available
Cogito V1 Preview Qwen 14B     -         131K     $0.200      $0.200       Available
Cogito V1 Preview Qwen 32B     -         131K     $0.900      $0.900       Available
DeepSeek R1 Distill Qwen 7B    -         131K     $0.072      $0.144       Available
QwQ 32B                        -         131K     $0.150      $0.200       Available
Qwen2.5 Coder 32B Instruct     -         131K     $0.050      $0.100       Available
Qwen2.5 7B Instruct            -         131K     $0.040      $0.070       Available
Qwen2.5 72B Instruct           -         131K     $0.120      $0.300       Available
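
The per-1M-token prices listed above can be turned into a per-request cost estimate with simple arithmetic: tokens times price, divided by one million. The sketch below is illustrative only. The function name `request_cost` and the `PRICES_PER_1M` dictionary are hypothetical and not part of any provider's API, and it assumes input and output tokens are billed linearly at the listed starting prices, which individual providers may not follow.

```python
from decimal import Decimal

# USD per 1M tokens as (input, output), copied from the listing above.
# Only two rows are included here for illustration.
PRICES_PER_1M = {
    "DeepSeek R1 Distill Qwen 1.5B": (Decimal("0.090"), Decimal("0.090")),
    "DeepSeek R1 Distill Qwen 7B": (Decimal("0.072"), Decimal("0.144")),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes linear per-token billing at the listed per-1M prices.
    """
    input_price, output_price = PRICES_PER_1M[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / Decimal(1_000_000)


if __name__ == "__main__":
    # Example: 2,000 prompt tokens and 500 generated tokens.
    for name in PRICES_PER_1M:
        print(name, request_cost(name, 2_000, 500))
```

Decimal is used instead of float so that very small per-request amounts are not distorted by binary rounding.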

Model IDs