DeiT Tiny Distilled Patch 16 224 is Meta's image to text model. A distilled Data-efficient Image Transformer (DeiT) model for image classification, trained with knowledge distillation at 224×224 resolution with 16×16 patches.
Specifications
Canonical IDmeta-deit-distilled
TypeImage to Text
StatusActive
CreatorMetaMeta
Input ModalitiesImage
Output ModalitiesText

Capabilities

Input1/5
Text·
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
DeiT Tiny Distilled Patch 16 224Current
DeiT Base Distilled Patch 16 224Available
DeiT Base Distilled Patch 16 384Available
DeiT Base Patch 16 224Available
DeiT Base Patch 16 384Available
DeiT Small Distilled Patch 16 224Available
DeiT Small Patch 16 224Available
DeiT Tiny Patch 16 224Available

Model IDs