Name: DeepSeek-OCR 2
Brand: DeepSeek

DeepSeek-OCR 2 is DeepSeek's image-to-text model: an upgraded multimodal document recognition model featuring the DeepEncoder V2 architecture for improved document understanding.

Specifications
Canonical ID	`deepseek-ocr-2`
Type	Image to Text
Status	Active
Creator	DeepSeek
Input Modalities	Text
Output Modalities	Text

Capabilities

Input (1/5)
Text	✓
Image	·
Audio	·
Video	·
PDF	·

Output (1/5)
Text	✓
Image	·
Audio	·
Video	·
Embedding	·

Capabilities (0/13)
Reasoning	·
Adaptive Reasoning	·
Function Calling	·
Parallel Function Calling	·
Structured Outputs	·
Native JSON Schema	·
Web Search	·
URL Context	·
Computer Use	·
Code Execution	·
File Search	·
Prompt Caching	·
Assistant Prefill	·

Versions

Version	Released	Context	Input / 1M	Output / 1M	Status
Mistral OCR 4	—	—	—	—	Available
Mistral OCR 4 Annot	—	—	—	—	Available
DeepSeek-OCR 2	—	—	—	—	Current
Qianfan OCR Fast	2026-04-20	66K	—	—	Deprecated
DeepSeek-OCR	—	—	—	—	Available
Document OCR	—	—	—	—	Available
Mistral OCR	—	—	$2.00	$3.00	Available
OCR	—	—	—	—	Available
Prebuilt Document	—	—	—	—	Available
Prebuilt Layout	—	—	—	—	Available
Prebuilt Read	—	—	—	—	Available

Model IDs

deepseek-ocr-2

deepseek/deepseek-ocr-2

DeepSeek-OCR 2
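
The identifiers above can be mapped onto the canonical ID with a small lookup. The following is a minimal sketch, assuming all three listed identifiers refer to the same model; the names `CANONICAL_ID`, `ALIASES`, and `resolve_model_id` are hypothetical and not part of any documented API.

```python
# Canonical ID as given in the Specifications table.
CANONICAL_ID = "deepseek-ocr-2"

# Identifiers listed under "Model IDs" on this page (assumed to be aliases
# of the same model; this mapping is illustrative, not an official registry).
ALIASES = {
    "deepseek-ocr-2",
    "deepseek/deepseek-ocr-2",
    "DeepSeek-OCR 2",
}


def resolve_model_id(identifier: str) -> str:
    """Return the canonical ID for a listed identifier, else raise KeyError."""
    key = identifier.strip()  # tolerate surrounding whitespace only
    if key in ALIASES:
        return CANONICAL_ID
    raise KeyError(f"unknown model identifier: {identifier!r}")
```

Matching is exact apart from surrounding whitespace, so an identifier that is not listed here is rejected instead of being guessed at.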

Capabilities API	GET /api/v1/models/deepseek-ocr-2
Versions API	GET /api/v1/models?family=ocr
Model IDs API	GET /api/v1/models/deepseek-ocr-2
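
The endpoints above could be called roughly as follows. This is a sketch under stated assumptions: the page does not give a host, authentication scheme, or response schema, so `BASE_URL` is a placeholder, the helper names are hypothetical, and the response is only assumed to be JSON.

```python
import json
import urllib.request
from urllib.parse import quote, urlencode

# Placeholder host: the page lists only paths, not a base URL (assumption).
BASE_URL = "https://example.invalid"


def model_url(model_id: str, base_url: str = BASE_URL) -> str:
    """Build the URL for GET /api/v1/models/{model_id}."""
    return f"{base_url}/api/v1/models/{quote(model_id, safe='')}"


def family_url(family: str, base_url: str = BASE_URL) -> str:
    """Build the URL for GET /api/v1/models?family={family}."""
    return f"{base_url}/api/v1/models?{urlencode({'family': family})}"


def fetch_json(url: str, timeout: float = 10.0):
    """Issue a GET request and decode the body, assuming it is JSON."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

With a real host substituted for `BASE_URL`, `fetch_json(model_url("deepseek-ocr-2"))` would request the model record and `fetch_json(family_url("ocr"))` would request the versions list.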
