DeepSeek R1 Distill Qwen3 8B


DeepSeek R1 Distill Qwen3 8B is a DeepSeek language model with a 131K-token context window, priced from $0.200 per 1M input tokens and $0.200 per 1M output tokens. It is an 8B-parameter model produced by distilling the chain-of-thought of DeepSeek R1 0528 into the Qwen3 8B base, and it achieves strong reasoning benchmark performance among open-source models.

Specifications
Canonical ID: deepseek-r1-distill-qwen3-8b
Type: Language
Status: Active
Creator: DeepSeek
Providers: Fireworks AI
Context Window: 131K tokens
Input Modalities: Text
Output Modalities: Text
Parameters: 8B

Capabilities

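Input (1/5 supported)
Text: supported
Image: not supported
Audio: not supported
Video: not supported
PDF: not supported

Output (1/5 supported)
Text: supported
Image: not supported
Audio: not supported
Video: not supported
Embedding: not supported

Capabilities (0/13 supported)
Reasoning: not supported
Adaptive Reasoning: not supported
Function Calling: not supported
Parallel Function Calling: not supported
Structured Outputs: not supported
Native JSON Schema: not supported
Web Search: not supported
URL Context: not supported
Computer Use: not supported
Code Execution: not supported
File Search: not supported
Prompt Caching: not supported
Assistant Prefill: not supported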

Pricing by Provider

Provider | Model ID | Standard Input ($ / 1M) | Standard Output ($ / 1M)
Fireworks AI | fireworks_ai/accounts/fireworks/models/deepseek-r1-0528-distill-qwen3-8b | $0.200 | $0.200
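
The per-token arithmetic behind these prices can be sketched in a few lines of Python. This is a minimal illustration that assumes the Fireworks AI standard-tier prices listed above; the function name `estimate_cost_usd` and the example token counts are hypothetical and not part of any provider API.

```python
# Prices copied from the Fireworks AI row above (USD per 1M tokens).
INPUT_USD_PER_1M = 0.200
OUTPUT_USD_PER_1M = 0.200


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Each side is billed in proportion to its token count.
    total = input_tokens * INPUT_USD_PER_1M + output_tokens * OUTPUT_USD_PER_1M
    return total / 1_000_000


if __name__ == "__main__":
    # Example workload: 100K input tokens and 20K output tokens.
    print(f"${estimate_cost_usd(100_000, 20_000):.4f}")
```

Because input and output are priced identically for this model, the estimate depends only on the total number of tokens in the request.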

Versions

Version | Released | Context | Input / 1M | Output / 1M | Status
Voyage 4 Nano | | | | | Available
DeepSeek R1 Distill Qwen3 8B | | 131K | $0.200 | $0.200 | Current
Qwen3 Embedding 0.6B | | 33K | $0.010 | $0.000 | Available
Qwen3 Embedding 4B | | 41K | $0.020 | $0.000 | Available
Qwen3 Embedding 8B | | 41K | $0.020 | $0.000 | Available
Qwen3 14B | | 131K | $0.060 | $0.200 | Available
Qwen3 32B | | 131K | $0.050 | $0.100 | Available
Qwen3 8B | | 131K | $0.035 | $0.138 | Available
Qwen3 4B Instruct | | 262K | $0.010 | $0.030 | Available
KwaiPilot KAT 32B Dev | | 131K | $0.900 | $0.900 | Available
Qwen3 0.6B | | 41K | $0.100 | $0.100 | Available

Model IDs