Prebuilt Layout

Prebuilt Layout is Microsoft's image-to-text model: a prebuilt Azure AI Document Intelligence model specialized in extracting text, tables, selection marks, and structural layout from documents.
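As a rough illustration of how such a model is typically invoked, the sketch below builds (but does not send) an analyze request for the `prebuilt-layout` model over the Azure AI Document Intelligence REST interface. The endpoint, key, and document URL are placeholders, and the API version string is an assumption that may need adjusting for your resource; none of these values come from this page.

```python
import json
import urllib.request

# Assumption: a generally available REST API version; adjust for your resource.
API_VERSION = "2024-11-30"


def build_layout_request(endpoint: str, key: str, document_url: str) -> urllib.request.Request:
    """Build a POST request asking the prebuilt-layout model to analyze a document URL."""
    url = (
        f"{endpoint.rstrip('/')}/documentintelligence/documentModels/"
        f"prebuilt-layout:analyze?api-version={API_VERSION}"
    )
    # The service accepts a JSON body that points at a publicly reachable document.
    body = json.dumps({"urlSource": document_url}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Ocp-Apim-Subscription-Key": key,
        },
    )


# Hypothetical placeholder values, for illustration only.
request = build_layout_request(
    "https://example-resource.cognitiveservices.azure.com",
    "<your-key>",
    "https://example.com/scan.png",
)
print(request.full_url)
```

Sending the request with `urllib.request.urlopen(request)` starts an asynchronous analysis; the service replies with an `Operation-Location` header that is polled for the extracted text, tables, and selection marks.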
Specifications
Canonical ID: microsoft-prebuilt-layout
Type: Image to Text
Status: Active
Creator: Microsoft
Providers
Input Modalities: Image
Output Modalities: Text

Capabilities

Input (1/5 supported)
Supported: Image
Not supported: Text, Audio, Video, PDF

Output (1/5 supported)
Supported: Text
Not supported: Image, Audio, Video, Embedding

Capabilities (0/13 supported)
Not supported: Reasoning, Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Pricing by Provider

Provider | Standard
Azure AI Foundry (azure_ai/doc-intelligence/prebuilt-layout) | -

Versions

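Version | Released | Context | Input / 1M | Output / 1M | Status
DeepSeek-OCR 2 | - | - | - | - | Available
Qianfan OCR Fast | - | 66K | $0.680 | $2.81 | Available
Prebuilt Layout | - | - | - | - | Current
DeepSeek-OCR | - | 8K | $0.030 | $0.030 | Available
DeepSeek-OCR | - | - | - | - | Available
Document OCR | - | - | - | - | Available
Mistral OCR | - | - | $2.00 | $3.00 | Available
OCR | - | - | - | - | Available
Prebuilt Document | - | - | - | - | Available
Prebuilt Read | - | - | - | - | Available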
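
To make the per-million prices in the table concrete, here is a minimal sketch of the arithmetic. It assumes that "Input / 1M" and "Output / 1M" mean US dollars per one million tokens, which the page does not state explicitly. The function name and the token counts are hypothetical; the two prices are the Qianfan OCR Fast row. No price is listed here for Prebuilt Layout itself.

```python
from decimal import Decimal


def estimate_cost(input_tokens: int, output_tokens: int,
                  input_per_million: str, output_per_million: str) -> Decimal:
    """Estimate a request's cost from per-million-token prices (assumed to be USD)."""
    million = Decimal(1_000_000)
    input_cost = Decimal(input_tokens) * Decimal(input_per_million)
    output_cost = Decimal(output_tokens) * Decimal(output_per_million)
    return (input_cost + output_cost) / million


# Hypothetical workload: 200,000 input tokens and 50,000 output tokens,
# priced with the Qianfan OCR Fast row ($0.680 input, $2.81 output).
cost = estimate_cost(200_000, 50_000, "0.680", "2.81")
print(cost)
```

Prices are passed as strings and handled with `Decimal` so that values such as 0.680 are not distorted by binary floating-point rounding. For the workload above, the estimate comes to 0.2765 dollars.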

Model IDs