DeepSeek R1 Distill Llama 8B

DeepSeek R1 Distill Llama 8B is a text model from Fireworks AI with a context window of 131K tokens and max output of 131K tokens. Pricing starts at 0.20 per million input tokens and 0.20 per million output tokens (cheapest at Nscale).

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Specifications

Model Keyfireworks_ai/accounts/fireworks/models/deepseek-r1-distill-llama-8b
ProviderFireworks AI
Provider IDfireworks_ai
ModeText
Canonical Namedeepseek-r1-distill-llama-8b
Context Window131K tokens
Max Output131K tokens

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens0.0002000.200
Output Tokens0.0002000.200

Benchmarks

Intelligence Index12.1#168
Math Index41.3#58
MMLU-Pro0.5#155
GPQA0.3#201
HLE0.0#161
LiveCodeBench0.2#137
AIME0.3#42
IFBench0.2#172
Time to First Token0.00s#1
SciCode0.1#186
MATH-5000.9#54
AIME 20250.4#58
LCR0.0#151

Price Comparison by Provider

Compare prices for DeepSeek R1 Distill Llama 8B across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price, $
Output Price, $
Nscalenscale/deepseek-ai/DeepSeek-R1-Distill-Llama-8B0.0250.025
Fireworks AIfireworks_ai/accounts/fireworks/models/deepseek-r1-distill-llama-8b0.2000.200

All Variants

All available versions, regions, and API endpoints for DeepSeek R1 Distill Llama 8B.

Model Key
Provider
Mode
Input Price, $
Output Price, $
Context
Max Output
Vision
Functions
fireworks_ai/accounts/fireworks/models/deepseek-r1-distill-llama-8bFireworks AIText0.2000.200131K131Knono
nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-8BNscaleText0.0250.025N/AN/Anono