DeepSeek R1 Distill Llama 8B is a compact 8B-parameter, Llama-based language model distilled from DeepSeek R1, delivering strong reasoning performance in a lightweight architecture. It has a 131K-token context window and is available from 3 providers, with prices starting at $0.025 per 1M input tokens and $0.025 per 1M output tokens.
Capabilities
Input: 1/5
Text ✓
Image ·
Audio ·
Video ·
PDF ·
Output: 1/5
Text ✓
Image ·
Audio ·
Video ·
Embedding ·
Capabilities: 0/13
Reasoning ·
Adaptive Reasoning ·
Function Calling ·
Parallel Function Calling ·
Structured Outputs ·
Native JSON Schema ·
Web Search ·
URL Context ·
Computer Use ·
Code Execution ·
File Search ·
Prompt Caching ·
Assistant Prefill ·
Pricing by Provider
Prices in US dollars ($) per 1M tokens, standard tier.
| Provider | Input $ / 1M | Output $ / 1M |
|---|---|---|
| | $0.2 | $0.2 |
| | $0.05 | $0.05 |
| | $0.025 | $0.025 |
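
As a rough illustration of how the per-1M-token prices in the table above translate into a request cost, here is a minimal Python sketch. The prices are copied from the table; the tier labels (`tier_a`, `tier_b`, `tier_c`) and the names `PriceTier`, `estimate_cost`, and `cheapest` are hypothetical, since the page does not name the providers, and the token counts in the usage example are made up.

```python
from dataclasses import dataclass

TOKENS_PER_MILLION = 1_000_000


@dataclass(frozen=True)
class PriceTier:
    """Per-1M-token prices in US dollars for one provider."""
    label: str          # hypothetical label; provider names are not in the source
    input_per_m: float  # $ per 1M input tokens
    output_per_m: float  # $ per 1M output tokens


# Prices taken from the pricing table; labels are placeholders.
TIERS = [
    PriceTier("tier_a", 0.2, 0.2),
    PriceTier("tier_b", 0.05, 0.05),
    PriceTier("tier_c", 0.025, 0.025),
]


def estimate_cost(tier: PriceTier, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in US dollars for one request."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * tier.input_per_m + output_tokens * tier.output_per_m
    return total / TOKENS_PER_MILLION


def cheapest(tiers: list[PriceTier], input_tokens: int, output_tokens: int) -> PriceTier:
    """Return the tier with the lowest estimated cost for this workload."""
    return min(tiers, key=lambda t: estimate_cost(t, input_tokens, output_tokens))


if __name__ == "__main__":
    # Example workload: 100K input tokens and 20K output tokens.
    for tier in TIERS:
        print(f"{tier.label}: ${estimate_cost(tier, 100_000, 20_000):.4f}")
```

Because input and output prices are equal within each row of the table, the cheapest tier does not depend on the input/output mix here; the comparison helper matters more for models whose input and output prices differ.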
Versions
| Version | Released | Context | Input / 1M | Output / 1M | Status |
|---|---|---|---|---|---|
| DeepSeek R1T2 Chimera | — | 164K | — | — | Available |
| DeepSeek R1 528 | — | 164K | $0.200 | $0.250 | Available |
| DeepSeek R1 Distill Qwen 32B | — | 131K | $0.150 | $0.150 | Available |
| DeepSeek R1 Distill Llama 70B | — | 131K | $0.200 | $0.375 | Deprecated |
| DeepSeek R1 | — | 164K | $0.280 | $0.400 | Available |
| DeepSeek R1 Distill Llama 8B | — | 131K | $0.025 | $0.025 | Current |
| DeepSeek R1 Distill Qwen 14B | — | 131K | $0.070 | $0.070 | Available |
| DeepSeek R1 Distill Qwen 1.5B | — | 131K | $0.090 | $0.090 | Available |
| DeepSeek R1 528 Turbo | — | 33K | $1.00 | $3.00 | Available |
| DeepSeek R1 528B | — | 131K | $0.550 | $2.19 | Available |
| DeepSeek R1 671B | — | 131K | $0.800 | $0.800 | Available |