DeepSeek V4 Flash is DeepSeek's language model with a 1.0M context window and up to 384K output tokens, available from 11 providers, starting at $0.09 / 1M input and $0.18 / 1M output. An efficiency-optimized Mixture-of-Experts LLM from DeepSeek with 284B total and 13B activated parameters, supporting a 1M-token context window with reasoning and tool-use capabilities.
Capabilities
Input3/5
Text✓
Image✓
Audio·
Video·
PDF✓
Output1/5
Text✓
Image·
Audio·
Video·
Embedding·
Capabilities7/13
Reasoning✓
Adaptive Reasoning·
Function Calling✓
Parallel Function Calling✓
Structured Outputs✓
Native JSON Schema✓
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching✓
Assistant Prefill✓
Pricing by Provider
US Dollar ($)
Per 1M tokens
| Provider | Standard | Batch | ||||
|---|---|---|---|---|---|---|
| Input $ / 1M | Output $ / 1M | Cache Read $ / 1M | Input $ / 1M | Output $ / 1M | Cache Read $ / 1M | |
| $0.2 | $0.4 | N/A | $0.1 | $0.2 | N/A | |
| $0.19 | $0.51 | N/A | — | — | — | |
| $0.14 | $0.28 | $0.0028 | $0.07 | $0.14 | $0.0014 | |
| $0.14 | $0.28 | $0.028 | — | — | — | |
| $0.14 | $0.28 | N/A | — | — | — | |
| $0.25 | $1.75 | N/A | — | — | — | |
| $0.09 | $0.18 | $0.018 | — | — | — | |
| $0.1 | $0.2 | N/A | — | — | — | |
| $0.14 | $0.28 | $0.0028 | — | — | — | |
| $0.14 | $0.28 | N/A | — | — | — | |
| $0.14 | $0.28 | $0.0028 | — | — | — | |
Cost Calculator
US Dollar ($)
Preset:
Cheapest Instances to Run It
Cloud GPU instances that can host DeepSeek V4 Flash, ranked by cheapest on-demand price. The model needs about 379 GB of GPU memory at FP16 precision (estimated from its parameter count), so treat the fit as guidance rather than a guarantee.
All clouds
FP16 (full precision)
US Dollar ($)
Instance | Cloud | GPU | VRAM | Price | Cheapest region | |
|---|---|---|---|---|---|---|
| Standard_NCC40ads_H100_v5 | NVIDIA H100 | 752 GB | $6.98/hr | eastus2 | ||
| g7e.24xlarge | 4× RTX PRO Server 6000 | 384 GB | $16.57/hr | us-east-1 | ||
| p4de.24xlarge | 8× A100 | 640 GB | $27.45/hr | us-east-1 | ||
Versions
| Version | Released | Context | Input / 1M | Output / 1M | Status |
|---|---|---|---|---|---|
| DeepSeek V4 Flash | 1.0M | $0.090 | $0.180 | Current | |
| DeepSeek V4 Flash Thinking | — | 200K | $0.250 | $1.75 | Available |
Other Models
| Model | Tier | Released | Context | Input / 1M | Output / 1M |
|---|---|---|---|---|---|
| DeepSeek V4 Pro | Pro | 1.0M | $0.435 | $0.870 |