Document OCR


Document OCR is Upstage's image-to-text model. It is an OCR model that extracts all text content from documents, including scanned pages and complex layouts.
Spec
Canonical ID: upstage-document-ocr
Type: Image to Text
Status: Active
Creator: Upstage
Input Modalities: Image
Output Modalities: Text

Capabilities

Input (1/5 supported): Image
Not supported: Text, Audio, Video, PDF

Output (1/5 supported): Text
Not supported: Image, Audio, Video, Embedding

Capabilities (0/13 supported): none
Not supported: Reasoning, Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Versions

Version       Released  Context  Input / 1M  Output / 1M  Status
Document OCR  -         -        -           -            Current
Google OCR    -         -        -           -            Available
OCR 3         -         -        $0.000      $0.000       Available

Model IDs