DeepSeek R1 8B is DeepSeek's language model with a 66K context window and up to 16K output tokens, starting at $0.1 / 1M input and $0.2 / 1M output. It is an 8B-parameter distillation of DeepSeek's R1 reasoning model that provides accessible chain-of-thought capabilities in a smaller footprint.
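As a rough illustration of how the two limits above interact, the sketch below checks whether a prompt leaves room for the requested output. It rests on two assumptions that the page does not confirm: that "66K" and "16K" mean 66,000 and 16,000 tokens exactly, and that output tokens count against the same context window. The function names are hypothetical.

```python
# Limits quoted in the summary above. The exact token counts behind
# "66K" and "16K" are assumptions made for this sketch.
CONTEXT_WINDOW = 66_000
MAX_OUTPUT = 16_000


def max_prompt_tokens(reserved_output: int = MAX_OUTPUT) -> int:
    """Return the largest prompt size that still leaves room for the output.

    Assumes prompt and output share one context window.
    """
    if not 0 <= reserved_output <= MAX_OUTPUT:
        raise ValueError("reserved_output must be between 0 and MAX_OUTPUT")
    return CONTEXT_WINDOW - reserved_output


def fits(prompt_tokens: int, reserved_output: int = MAX_OUTPUT) -> bool:
    """Check whether a prompt of the given size fits alongside the output."""
    return 0 <= prompt_tokens <= max_prompt_tokens(reserved_output)
```

Under these assumptions, reserving the full 16K output budget leaves 50,000 tokens for the prompt. Reserving less output frees more room for input.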
Capabilities
Input (1/5)
Text ✓
Image ·
Audio ·
Video ·
PDF ·
Output (1/5)
Text ✓
Image ·
Audio ·
Video ·
Embedding ·
Capabilities (3/13)
Reasoning ✓
Adaptive Reasoning ·
Function Calling ✓
Parallel Function Calling ·
Structured Outputs ✓
Native JSON Schema ·
Web Search ·
URL Context ·
Computer Use ·
Code Execution ·
File Search ·
Prompt Caching ·
Assistant Prefill ·
Pricing by Provider
US Dollar ($)
Per 1M tokens
| Tier | Input $ / 1M | Output $ / 1M |
|---|---|---|
| Standard | $0.1 | $0.2 |
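The per-token arithmetic behind these prices can be sketched in a few lines. This is a minimal estimate that uses only the standard rates listed above. The function name `estimate_cost` is hypothetical, and the sketch ignores any provider-specific surcharges or discounts, which the page does not describe.

```python
from decimal import Decimal

# Standard rates from the pricing table, in US dollars per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.1")
OUTPUT_PRICE_PER_M = Decimal("0.2")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost in US dollars of one request at the standard rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M
```

For example, a request with 50,000 input tokens and 16,000 output tokens works out to (50,000 × 0.1 + 16,000 × 0.2) / 1,000,000 = $0.0082. `Decimal` is used here so that small per-request amounts do not pick up binary floating-point rounding noise.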
Versions
| Version | Released | Context | Input / 1M | Output / 1M | Status |
|---|---|---|---|---|---|
| DeepSeek R1T2 Chimera | — | 164K | — | — | Available |
| DeepSeek R1 528 | — | 164K | $0.200 | $0.250 | Available |
| DeepSeek R1 Distill Qwen 32B | — | 131K | $0.150 | $0.150 | Available |
| DeepSeek R1 Distill Llama 70B | — | 131K | $0.200 | $0.375 | Deprecated |
| DeepSeek R1 | — | 164K | $0.280 | $0.400 | Available |
| DeepSeek R1 8B | — | 66K | $0.100 | $0.200 | Current |
| DeepSeek R1 Distill Qwen 14B | — | 131K | $0.070 | $0.070 | Available |
| DeepSeek R1 Distill Llama 8B | — | 131K | $0.025 | $0.025 | Available |
| DeepSeek R1 Distill Qwen 1.5B | — | 131K | $0.090 | $0.090 | Available |
| DeepSeek R1 528 Turbo | — | 33K | $1.00 | $3.00 | Available |
| DeepSeek R1 528B | — | 131K | $0.550 | $2.19 | Available |
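One way to use the versions table is to pick the cheapest non-deprecated version whose context window covers a given workload. The sketch below transcribes the rows above by hand, with context in thousands of tokens and prices in US dollars per 1M tokens. The `Version` type and the `cheapest` helper are hypothetical names, and rows without listed prices are skipped rather than guessed.

```python
from typing import NamedTuple, Optional


class Version(NamedTuple):
    name: str
    context_k: int                  # context window, thousands of tokens
    input_per_m: Optional[float]    # $ per 1M input tokens, None if unlisted
    output_per_m: Optional[float]   # $ per 1M output tokens, None if unlisted
    status: str


# Rows transcribed from the versions table above.
VERSIONS = [
    Version("DeepSeek R1T2 Chimera", 164, None, None, "Available"),
    Version("DeepSeek R1 528", 164, 0.200, 0.250, "Available"),
    Version("DeepSeek R1 Distill Qwen 32B", 131, 0.150, 0.150, "Available"),
    Version("DeepSeek R1 Distill Llama 70B", 131, 0.200, 0.375, "Deprecated"),
    Version("DeepSeek R1", 164, 0.280, 0.400, "Available"),
    Version("DeepSeek R1 8B", 66, 0.100, 0.200, "Current"),
    Version("DeepSeek R1 Distill Qwen 14B", 131, 0.070, 0.070, "Available"),
    Version("DeepSeek R1 Distill Llama 8B", 131, 0.025, 0.025, "Available"),
    Version("DeepSeek R1 Distill Qwen 1.5B", 131, 0.090, 0.090, "Available"),
    Version("DeepSeek R1 528 Turbo", 33, 1.00, 3.00, "Available"),
    Version("DeepSeek R1 528B", 131, 0.550, 2.19, "Available"),
]


def cheapest(min_context_k: int, input_tokens: int,
             output_tokens: int) -> Optional[Version]:
    """Return the cheapest priced, non-deprecated version that fits, or None."""
    candidates = [
        v for v in VERSIONS
        if v.status != "Deprecated"
        and v.context_k >= min_context_k
        and v.input_per_m is not None
        and v.output_per_m is not None
    ]
    if not candidates:
        return None
    # Relative cost only: the shared 1M divisor does not affect the ranking.
    return min(
        candidates,
        key=lambda v: input_tokens * v.input_per_m
        + output_tokens * v.output_per_m,
    )
```

Price is the only ranking criterion in this sketch. In practice, model quality and reasoning ability differ widely across these versions, so the cheapest fit is a starting point rather than a recommendation.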