DeepSeek R1 Distill Llama 70B is DeepSeek's language model with a 131K context window and up to 8K output tokens, available from 11 providers, starting at $0.2 / 1M input and $0.375 / 1M output. A 70B Llama-based model distilled from DeepSeek R1's chain-of-thought reasoning, combining Llama's architecture with R1's advanced reasoning capabilities.
Specifications
Canonical IDdeepseek-r1-distill-llama-70b
TypeLanguage
StatusDeprecated
CreatorDeepSeekDeepSeek
Providers
Context Window131K tokens
Max Output8K tokens
Input ModalitiesText
Output ModalitiesText
Reasoning Effortsdefault
Parameters70B
HuggingFace Likes770
HuggingFace Downloads (30d)171,910
HuggingFace Downloads (all-time)4,433,062
Release Date · 1 year ago
Knowledge Cutoff · 2 years ago
Deprecation Date
Benchmarks
Intelligence Index
16.0
#280
Coding Index
11.4
#280
Math Index
53.7
#125
MMLU-Pro
0.8
#112
GPQA
0.4
#371
HLE
0.1
#224
LiveCodeBench
0.3
#230
AIME
0.7
#39
IFBench
0.3
#351
Time to First Token
0.48s
#241
SciCode
0.3
#215
MATH-500
0.9
#44
AIME 2025
0.5
#125
LCR
0.1
#295
TerminalBench Hard
0.0
#318
TAU2
0.2
#290
Output TPS
41.8
#244

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities3/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandard
Input
$ / 1M
Output
$ / 1M
DeepInfra logo
DeepInfra
deepinfra/deepseek-ai/DeepSeek-R1-Distill-Llama-70B
$0.2$0.6
Fireworks AI logo
Fireworks AI
fireworks_ai/accounts/fireworks/models/deepseek-r1-distill-llama-70b
$0.9$0.9
Gradient AI logo
Gradient AI
gradient_ai/deepseek-r1-distill-llama-70b
$0.99$0.99
Hugging Face logo
Hugging Face
novita:deepseek/deepseek-r1-distill-llama-70b
$0.8$0.8
Hugging Face logo
Hugging Face
nscale:deepseek-ai/DeepSeek-R1-Distill-Llama-70B
$0.75$0.75
Nebius logo
Nebius
nebius/deepseek-ai/DeepSeek-R1-Distill-Llama-70B
$0.25$0.75
Novita logo
Novita
novita/deepseek/deepseek-r1-distill-llama-70b
$0.8$0.8
Nscale logo
Nscale
nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-70B
$0.375$0.375
OpenRouter logo
OpenRouter
deepseek/deepseek-r1-distill-llama-70b
$0.8$0.8
OVHcloud logo
OVHcloud
ovhcloud/DeepSeek-R1-Distill-Llama-70B
$0.67$0.67
SambaNova logo
SambaNova
sambanova/DeepSeek-R1-Distill-Llama-70B
$0.7$1.40
Vercel AI Gateway logo
Vercel AI Gateway
vercel_ai_gateway/deepseek/deepseek-r1-distill-llama-70b
$0.75$0.99

Cost Calculator

US Dollar ($)
Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
DeepSeek R1T2 Chimera164KAvailable
DeepSeek R1 528164K$0.200$0.250Available
DeepSeek R1 Distill Qwen 32B131K$0.150$0.150Available
DeepSeek R1 Distill Llama 70B131K$0.200$0.375Current
DeepSeek R1164K$0.280$0.400Available
DeepSeek R1 Distill Qwen 14B131K$0.070$0.070Available
DeepSeek R1 Distill Llama 8B131K$0.025$0.025Available
DeepSeek R1 Distill Qwen 1.5B131K$0.090$0.090Available
DeepSeek R1 528 Turbo33K$1.00$3.00Available
DeepSeek R1 528B131K$0.550$2.19Available
DeepSeek R1 671B131K$0.800$0.800Available

Model IDs

accounts/fireworks/models/deepseek-r1-distill-llama-70b
deepinfra/deepseek-ai/DeepSeek-R1-Distill-Llama-70B
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
deepseek-llm-r1-distill-llama-70b
deepseek-r1-distill-llama-70b
deepseek/deepseek-r1-distill-llama-70b
fireworks_ai/accounts/fireworks/models/deepseek-r1-distill-llama-70b
gradient_ai/deepseek-r1-distill-llama-70b
nebius/deepseek-ai/DeepSeek-R1-Distill-Llama-70B
novita/deepseek/deepseek-r1-distill-llama-70b
nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-70B
ovhcloud/DeepSeek-R1-Distill-Llama-70B
sambanova/DeepSeek-R1-Distill-Llama-70B
vercel_ai_gateway/deepseek/deepseek-r1-distill-llama-70b