DeepSeek 4 Flash is DeepSeek's language model, starting at $0.19 / 1M input and $0.51 / 1M output. A fast-tier DeepSeek LLM optimized for low-latency inference within the DeepSeek 4 generation.
Specifications
Canonical IDdeepseek-4-flash
TypeLanguage
StatusActive
CreatorDeepSeekDeepSeek
Providers
Input ModalitiesText
Output ModalitiesText

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandard
Input
$ / 1M
Output
$ / 1M
Azure AI Foundry logo
Azure AI Foundry
deepseek:v4flash
$0.19$0.51

Cost Calculator

US Dollar ($)
Preset:

Other Models

ModelTierReleasedContextInput / 1MOutput / 1M
DeepSeek 4 Pro$1.74$3.48
DeepSeek LLM 67B Chat (V1)
DeepSeek$0.580$1.68
DeepSeek Janus Pro
DeepSeek Janus Pro 1B
DeepSeek Janus Pro 7B
DeepSeek Llama3.3 70B131K$0.200$0.600

Model IDs

deepseek-4-flash