DeepSeek R1 Distill Llama 8B is
DeepSeek's language model with a 131K context window and up to 16K output tokens, available from 2 providers, starting at $0.050 / 1M input and $0.050 / 1M output. A compact 8B Llama-based model distilled from DeepSeek R1, bringing open-source reasoning capabilities to a small, efficient parameter footprint.
12.1#236 |
41.3#93 |
0.5#209 |
0.3#294 |
0.0#247 |
0.2#186 |
0.3#48 |
0.2#270 |
0.00s#44 |
0.1#265 |
0.9#61 |
0.4#93 |
0.0#243 |
0.0#245 |
Capabilities
Input1/5
✓
·
·
·
·
Output1/5
✓
·
·
·
·
Capabilities3/13
✓
·
✓
·
✓
·
·
·
·
·
·
·
·
Pricing by Provider
| Provider | Standard | |
|---|---|---|
| Input $ / 1M | Output $ / 1M | |
Nscale | $0.050 | $0.050 |
Fireworks AI | $0.200 | $0.200 |
Cost Calculator
Preset:
Compares every provider & tier in USD
Versions
| Version | Released | Context | Input / 1M | Output / 1M | Status |
|---|---|---|---|---|---|
| DeepSeek R1 0528 | 164K | $0.200 | $0.600 | Available | |
| DeepSeek R1 528B | 164K | — | — | Available | |
| DeepSeek R1 Distill Qwen 32B | 131K | $0.270 | $0.270 | Available | |
| DeepSeek R1 Distill Llama 70B | 131K | $0.200 | $0.600 | Available | |
| DeepSeek R1 | 164K | $0.280 | $0.400 | Available | |
| DeepSeek R1 Distill Llama 8B | — | 131K | $0.050 | $0.050 | Current |
| DeepSeek R1 | — | — | — | — | Available |
| DeepSeek R1 0528 Distill Qwen3 8B | — | 131K | $0.200 | $0.200 | Available |
| DeepSeek R1 528 Qwen3 8B | — | 128K | — | — | Available |
| DeepSeek R1 528B Turbo | — | 33K | — | — | Available |
| DeepSeek R1 671B | — | 131K | $0.800 | $0.800 | Available |
HuggingFace
855 likes1,959,522 downloads/month17,406,701 total downloads