Prebuilt Document is Microsoft's image to text model. A prebuilt Azure AI Document Intelligence model for extracting structured data, key-value pairs, and layout information from general documents.
Specifications
Canonical IDmicrosoft-prebuilt-document
TypeImage to Text
StatusActive
CreatorMicrosoftMicrosoft
Providers
Input ModalitiesImage
Output ModalitiesText

Capabilities

Input1/5
Text·
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandard
Page In
$ / page
Azure AI Foundry logo
Azure AI Foundry
azure_ai/doc-intelligence/prebuilt-document
$0.010

Cost Calculator

US Dollar ($)
Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Mistral OCR 4Available
Mistral OCR 4 AnnotAvailable
DeepSeek-OCR 2Available
Qianfan OCR Fast66KDeprecated
Prebuilt DocumentCurrent
DeepSeek-OCRAvailable
Document OCRAvailable
Mistral OCR$2.00$3.00Available
OCRAvailable
Prebuilt LayoutAvailable
Prebuilt ReadAvailable

Model IDs

azure_ai/doc-intelligence/prebuilt-document
microsoft-prebuilt-document