DeepSeek R1 Distill Llama 70B is a DeepSeek language model with a 131K-token context window and up to 8K output tokens. It is available from 11 providers, with prices starting at $0.2 per 1M input tokens and $0.375 per 1M output tokens. It is a 70B-parameter, Llama-based model distilled from DeepSeek R1's chain-of-thought reasoning, combining Llama's architecture with R1's reasoning capabilities.
Capabilities
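Input (1/5)

| Modality | Supported |
|---|---|
| Text | ✓ |
| Image | · |
| Audio | · |
| Video | · |
| PDF | · |

Output (1/5)

| Modality | Supported |
|---|---|
| Text | ✓ |
| Image | · |
| Audio | · |
| Video | · |
| Embedding | · |

Capabilities (3/13)

| Capability | Supported |
|---|---|
| Reasoning | ✓ |
| Adaptive Reasoning | · |
| Function Calling | ✓ |
| Parallel Function Calling | · |
| Structured Outputs | ✓ |
| Native JSON Schema | · |
| Web Search | · |
| URL Context | · |
| Computer Use | · |
| Code Execution | · |
| File Search | · |
| Prompt Caching | · |
| Assistant Prefill | · |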
Pricing by Provider
Standard pricing in US dollars ($) per 1M tokens.

| Input $ / 1M | Output $ / 1M |
|---|---|
| $0.2 | $0.6 |
| $0.9 | $0.9 |
| $0.99 | $0.99 |
| $0.8 | $0.8 |
| $0.75 | $0.75 |
| $0.25 | $0.75 |
| $0.8 | $0.8 |
| $0.375 | $0.375 |
| $0.8 | $0.8 |
| $0.67 | $0.67 |
| $0.7 | $1.40 |
| $0.75 | $0.99 |
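
The lowest input rate and the lowest output rate in the table come from different rows, so which row is cheapest depends on the token mix of a request. The following is a minimal sketch of that arithmetic, assuming cost is simply tokens times the per-1M rate with no minimums, discounts, or caching. The names `RATES`, `request_cost`, and `cheapest_row` are hypothetical and not part of any provider's API, and rows are identified by position because the table does not list provider names.

```python
# Standard per-1M-token rates (input, output) in USD, copied from the
# pricing table. Rows are identified by position (0-based) because the
# table does not name the providers.
RATES = [
    (0.2, 0.6),
    (0.9, 0.9),
    (0.99, 0.99),
    (0.8, 0.8),
    (0.75, 0.75),
    (0.25, 0.75),
    (0.8, 0.8),
    (0.375, 0.375),
    (0.8, 0.8),
    (0.67, 0.67),
    (0.7, 1.40),
    (0.75, 0.99),
]


def request_cost(input_tokens, output_tokens, input_rate, output_rate):
    """USD cost of one request, given per-1M-token rates.

    Assumption: linear pricing with no minimum charge or discounts.
    """
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


def cheapest_row(input_tokens, output_tokens, rates=RATES):
    """Return (row_index, cost) for the lowest-cost row at this token mix."""
    costs = [request_cost(input_tokens, output_tokens, i, o) for i, o in rates]
    best = min(range(len(costs)), key=costs.__getitem__)
    return best, costs[best]
```

Under these assumptions, an input-heavy request (100,000 input tokens, 1,000 output tokens) is cheapest on the $0.2 / $0.6 row, while an output-heavy request (1,000 input tokens, 8,000 output tokens) is cheapest on the $0.375 / $0.375 row.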
Versions
| Version | Released | Context | Input / 1M | Output / 1M | Status |
|---|---|---|---|---|---|
| DeepSeek R1T2 Chimera | — | 164K | — | — | Available |
| DeepSeek R1 528 | — | 164K | $0.200 | $0.250 | Available |
| DeepSeek R1 Distill Qwen 32B | — | 131K | $0.150 | $0.150 | Available |
| DeepSeek R1 Distill Llama 70B | — | 131K | $0.200 | $0.375 | Current |
| DeepSeek R1 | — | 164K | $0.280 | $0.400 | Available |
| DeepSeek R1 Distill Qwen 14B | — | 131K | $0.070 | $0.070 | Available |
| DeepSeek R1 Distill Llama 8B | — | 131K | $0.025 | $0.025 | Available |
| DeepSeek R1 Distill Qwen 1.5B | — | 131K | $0.090 | $0.090 | Available |
| DeepSeek R1 528 Turbo | — | 33K | $1.00 | $3.00 | Available |
| DeepSeek R1 528B | — | 131K | $0.550 | $2.19 | Available |
| DeepSeek R1 671B | — | 131K | $0.800 | $0.800 | Available |