Prebuilt Document is Microsoft's image to text model. A prebuilt Azure AI Document Intelligence model for extracting structured data, key-value pairs, and layout information from general documents.
Specifications
Canonical IDmicrosoft-prebuilt-document
TypeImage to Text
StatusActive
CreatorMicrosoftMicrosoft
Providers
Input ModalitiesImage
Output ModalitiesText

Capabilities

Input1/5
Text·
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
DeepSeek-OCR 2Available
Qianfan OCR Fast66KDeprecated
Prebuilt DocumentCurrent
DeepSeek-OCR8K$0.030$0.030Available
DeepSeek-OCRAvailable
Document OCRAvailable
Mistral OCR$2.00$3.00Available
OCRAvailable
Prebuilt LayoutAvailable
Prebuilt ReadAvailable

Model IDs