DeepSeek R1 Distill Qwen 14B is
DeepSeek's language model with a 131K context window and up to 16K output tokens, available from 4 providers, starting at $0.070 / 1M input and $0.070 / 1M output. A 14-billion-parameter Qwen2-based model distilled from DeepSeek R1, balancing reasoning performance with moderate computational requirements.
15.8#170 |
55.7#74 |
0.7#118 |
0.5#209 |
0.0#228 |
0.4#124 |
0.7#26 |
0.2#256 |
0.00s#46 |
0.2#199 |
0.9#21 |
0.6#74 |
0.1#197 |
0.0#247 |
Capabilities
Input1/5
✓
·
·
·
·
Output1/5
✓
·
·
·
·
Capabilities2/13
✓
·
·
·
✓
·
·
·
·
·
·
·
·
Pricing by Provider
| Provider | Standard | Batch | ||
|---|---|---|---|---|
| Input $ / 1M | Output $ / 1M | Input $ / 1M | Output $ / 1M | |
Nscale | $0.070 | $0.070 | — | — |
Alibaba Qwen | $0.144 | $0.431 | $0.072 | $0.215 |
Novita | $0.150 | $0.150 | — | — |
Fireworks AI | $0.200 | $0.200 | — | — |
Cost Calculator
Preset:
Compares every provider & tier in USD
Versions
| Version | Released | Context | Input / 1M | Output / 1M | Status |
|---|---|---|---|---|---|
| DeepSeek R1 0528 | 164K | $0.200 | $0.600 | Available | |
| DeepSeek R1 528B | 164K | — | — | Available | |
| DeepSeek R1 Distill Qwen 32B | 131K | $0.270 | $0.270 | Available | |
| DeepSeek R1 Distill Llama 70B | 131K | $0.200 | $0.600 | Available | |
| DeepSeek R1 | 164K | $0.280 | $0.400 | Available | |
| DeepSeek R1 Distill Qwen 14B | — | 131K | $0.070 | $0.070 | Current |
| DeepSeek R1 | — | — | — | — | Available |
| DeepSeek R1 0528 Distill Qwen3 8B | — | 131K | $0.200 | $0.200 | Available |
| DeepSeek R1 528 Qwen3 8B | — | 128K | — | — | Available |
| DeepSeek R1 528B Turbo | — | 33K | — | — | Available |
| DeepSeek R1 671B | — | 131K | $0.800 | $0.800 | Available |
HuggingFace
631 likes508,773 downloads/month6,563,366 total downloads