DeepSeek R1 0528 Qwen3 8B


DeepSeek R1 0528 Qwen3 8B is a DeepSeek language model with a 128K-token context window and up to 32K output tokens, priced from $0.060 per 1M input tokens and $0.090 per 1M output tokens. It is a reasoning model distilled from DeepSeek R1 0528 into the Qwen3 8B base, combining R1's chain-of-thought optimization with a compact 8B architecture.
Specifications
Canonical ID: deepseek-r1-528b-qwen3-8b
Type: Language
Status: Active
Creator: DeepSeek
Providers: Novita
Context Window: 128K tokens
Max Output: 32K tokens
Input Modalities: Text
Output Modalities: Text
Reasoning Efforts: default
Parameters: 8B
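
The context window and output limit above interact when sizing a request. The following is a minimal sketch, assuming that prompt and completion tokens share the same 128K window (common, but not stated on this page) and that "128K" and "32K" mean 128,000 and 32,000 tokens. The helper name `max_output_budget` is hypothetical.

```python
# Limits taken from the specifications above.
# Assumption: "128K" and "32K" are read as 128,000 and 32,000 tokens.
CONTEXT_WINDOW_TOKENS = 128_000
MAX_OUTPUT_TOKENS = 32_000


def max_output_budget(prompt_tokens: int) -> int:
    """Return how many output tokens can still be requested.

    Assumption: prompt and output tokens share one context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW_TOKENS - prompt_tokens
    # The budget is capped by the output limit and never drops below zero.
    return max(0, min(MAX_OUTPUT_TOKENS, remaining))


if __name__ == "__main__":
    for prompt in (10_000, 120_000, 200_000):
        print(prompt, max_output_budget(prompt))
```

With a short prompt the output limit is the binding constraint; with a prompt close to the window size, the remaining window is.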

Capabilities

Input (1 of 5 supported)
Supported: Text
Not supported: Image, Audio, Video, PDF

Output (1 of 5 supported)
Supported: Text
Not supported: Image, Audio, Video, Embedding

Capabilities (1 of 13 supported)
Supported: Reasoning
Not supported: Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Pricing by Provider

Provider  Model ID                                   Tier      Input ($ / 1M)  Output ($ / 1M)
Novita    novita/deepseek/deepseek-r1-0528-qwen3-8b  Standard  $0.060          $0.090
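
The Novita rates listed above translate into a per-request cost by simple proportion. The following is a minimal sketch, assuming purely linear per-token billing with no caching discounts or minimum charges (the page does not describe the billing rules). The helper name `estimate_cost_usd` is hypothetical.

```python
from decimal import Decimal

# Rates from the pricing listed above (USD per 1M tokens, Novita, standard tier).
INPUT_USD_PER_M = Decimal("0.060")
OUTPUT_USD_PER_M = Decimal("0.090")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request in USD.

    Assumption: billing is linear in token count, with no discounts.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (Decimal(input_tokens) * INPUT_USD_PER_M
             + Decimal(output_tokens) * OUTPUT_USD_PER_M)
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # 100,000 input tokens and 20,000 output tokens.
    print(estimate_cost_usd(100_000, 20_000))
```

`Decimal` is used instead of floats so that small per-token amounts add up without binary rounding error.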

Versions

Version                     Released  Context  Input / 1M  Output / 1M  Status
EAGLE Qwen 2.5 3B Instruct  -         -        -           -            Available
Qwen3.6 Max Preview         -         262K     $1.04       $6.24        Available
Qwen3.6 27B                 -         262K     $0.320      $3.20        Available
Qwen3.6 35B A3B             -         262K     $0.150      $1.00        Available
Qwen3.6 Plus                -         1.0M     $0.325      $1.95        Available
Qwen3 Max Thinking          -         262K     $0.780      $3.90        Available
Qwen3 Next 80B A3B          -         128K     $0.150      $1.20        Available
Qwen3 Max                   -         262K     $0.359      $1.43        Available
Qwen3 Max Preview           -         262K     $1.20       $6.00        Available
Qwen3 Coder Plus            -         1.0M     $0.650      $3.25        Available
DeepSeek R1 0528 Qwen3 8B   -         128K     $0.060      $0.090       Current

Model IDs