DeepSeek logo

DeepSeek OCR


DeepSeek OCR is DeepSeek logoDeepSeek's image to text model with a 8K context window and up to 8K output tokens, available from 2 providers, starting at $0.030 / 1M input and $0.030 / 1M output. A multimodal OCR model from DeepSeek that extracts and interprets text from images with high accuracy across diverse document types.
Spec
Canonical IDdeepseek-ocr
TypeImage to Text
StatusActive
CreatorDeepSeekDeepSeek
Providers
Context Window8K tokens
Max Output8K tokens
Input ModalitiesImagePdfText
Output ModalitiesText

Capabilities

Input3/5
Text
Image
Audio·
Video·
PDF
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities1/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Novita logo
Novita
deepseek/deepseek-ocr
$0.030$0.030
Google Vertex AI logo
Google Vertex AI
deepseek-ocr-maas
$0.300$1.20

Cost Calculator

Preset:
Compares every provider & tier in USD

Model IDs