DeepSeek V4 Flash is DeepSeek's language model with a 1.0M context window and up to 384K output tokens, available from 4 providers, starting at $0.100 / 1M input and $0.200 / 1M output. An efficiency-optimized Mixture-of-Experts LLM from DeepSeek with 284B total and 13B activated parameters, supporting a 1M-token context window with reasoning and tool-use capabilities.
Specifications
Canonical IDdeepseek-v4-flash
TypeLanguage
StatusActive
CreatorDeepSeekDeepSeek
Providers
Context Window1.0M tokens
Max Output384K tokens
Input ModalitiesImagePdfText
Output ModalitiesText
Reasoning Effortsdefault
Parameters158B
HuggingFace Likes649
HuggingFace Downloads (30d)25,391
HuggingFace Downloads (all-time)25,391
Release Date · 1 month ago
Benchmarks
Intelligence Index
46.5
#33
Coding Index
38.7
#45
GPQA
0.9
#15
HLE
0.3
#19
IFBench
0.8
#7
Time to First Token
0.77s
#300
SciCode
0.4
#36
LCR
0.6
#68
TerminalBench Hard
0.4
#48
TAU2
1.0
#22
Output TPS
103.8
#107

Capabilities

Input3/5
Text
Image
Audio·
Video·
PDF
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities6/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs
Native JSON Schema
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching
Assistant Prefill

Pricing by Provider

ProviderStandardBatch
Input
$ / 1M
Output
$ / 1M
Cache Read
$ / 1M
Input
$ / 1M
Output
$ / 1M
Cache Read
$ / 1M
OpenRouter logo
OpenRouter
deepseek/deepseek-v4-flash
$0.100$0.200$0.020
DeepSeek logo
DeepSeek
deepseek-v4-flash(1)
$0.140$0.280$0.0028$0.070$0.140$0.0014
Hugging Face logo
Hugging Face
novita:deepseek/deepseek-v4-flash
$0.140$0.280N/A
Vercel AI Gateway logo
Vercel AI Gateway
deepseek/deepseek-v4-flash
$0.140$0.280$0.0028

Cost Calculator

Preset:

Other models

ModelTierReleasedContextInput / 1MOutput / 1M
DeepSeek V4 ProPro1.0M$0.435$0.870

Model IDs