Microsoft logo

ResNet 18


ResNet 18 is Microsoft logoMicrosoft's image to text model. A lightweight 18-layer residual network for image classification, balancing efficiency and accuracy for vision tasks with limited compute.
Spec
Canonical IDmicrosoft-resnet18
TypeImage to Text
StatusActive
CreatorMicrosoftMicrosoft
Input ModalitiesImage
Output ModalitiesText

Capabilities

Input1/5
TextΒ·
Imageβœ“
AudioΒ·
VideoΒ·
PDFΒ·
Output1/5
Textβœ“
ImageΒ·
AudioΒ·
VideoΒ·
EmbeddingΒ·
Capabilities0/13
ReasoningΒ·
Adaptive ReasoningΒ·
Function CallingΒ·
Parallel Function CallingΒ·
Structured OutputsΒ·
Native JSON SchemaΒ·
Web SearchΒ·
URL ContextΒ·
Computer UseΒ·
Code ExecutionΒ·
File SearchΒ·
Prompt CachingΒ·
Assistant PrefillΒ·

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
ResNet V2 101β€”β€”β€”β€”Available
ResNet V2 50β€”β€”β€”β€”Available
ResNet V2 Classificationβ€”β€”β€”β€”Available
ResNet V2 Featurevectorβ€”β€”β€”β€”Available
ResNet V1 101β€”β€”β€”β€”Available
ResNet V1 152β€”β€”β€”β€”Available
ResNet V1 50β€”β€”β€”β€”Available
ResNet V1 Classificationβ€”β€”β€”β€”Available
ResNet 18β€”β€”β€”β€”Current
ResNet 101β€”β€”β€”β€”Available
ResNet 152β€”β€”β€”β€”Available

Model IDs