Name: ResNet V1 101
Brand: Google

ResNet V1 101 is a 101-layer ResNet V1 image feature extractor from Google, listed in this catalog as an image-to-text model. It provides deep residual representations for image classification tasks.

Specifications
Canonical ID	`google-resnet-1-101`
Type	Image to Text
Status	Active
Creator	Google
Input Modalities	Image
Output Modalities	Text

Capabilities

Input (1/5)
Text	·
Image	✓
Audio	·
Video	·
PDF	·

Output (1/5)
Text	✓
Image	·
Audio	·
Video	·
Embedding	·

Capabilities (0/13)
Reasoning	·
Adaptive Reasoning	·
Function Calling	·
Parallel Function Calling	·
Structured Outputs	·
Native JSON Schema	·
Web Search	·
URL Context	·
Computer Use	·
Code Execution	·
File Search	·
Prompt Caching	·
Assistant Prefill	·

Versions

Version	Released	Context	Input / 1M	Output / 1M	Status
ResNet V2 101	—	—	—	—	Available
ResNet V2 50	—	—	—	—	Available
ResNet V2 Classification	—	—	—	—	Available
ResNet V2 Featurevector	—	—	—	—	Available
ResNet V1 101	—	—	—	—	Current
ResNet V1 152	—	—	—	—	Available
ResNet V1 50	—	—	—	—	Available
ResNet V1 Classification	—	—	—	—	Available
ResNet 101	—	—	—	—	Available
ResNet 152	—	—	—	—	Available
ResNet 18	—	—	—	—	Available

Model IDs

google-resnet-1-101

tensorflow-icembedding-imagenet-resnet-v1-101-featurevector-4

ResNet V1 101

Capabilities API	GET /api/v1/models/google-resnet-1-101
Versions API	GET /api/v1/models?family=resnet
Model IDs API	GET /api/v1/models/google-resnet-1-101
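
The endpoints listed above can be queried with any HTTP client. The following is a minimal Python sketch, not an official client: the host in `BASE_URL` is a placeholder (this page lists only paths), the helper names `model_url`, `family_url`, and `fetch_json` are hypothetical, and the assumption that responses are JSON is not confirmed by this page.

```python
import json
from urllib.parse import quote, urlencode, urljoin
from urllib.request import urlopen

# Placeholder host (assumption): the page gives only paths, so substitute
# the real catalog host before sending any request.
BASE_URL = "https://example.com"


def model_url(model_id, base_url=BASE_URL):
    """Build the URL used by the Capabilities and Model IDs endpoints."""
    # Percent-encode the ID so unusual characters cannot alter the path.
    return urljoin(base_url, "/api/v1/models/" + quote(model_id, safe=""))


def family_url(family, base_url=BASE_URL):
    """Build the URL used by the Versions endpoint for one model family."""
    return urljoin(base_url, "/api/v1/models") + "?" + urlencode({"family": family})


def fetch_json(url, timeout=10.0):
    """Send a GET request and decode the body, assuming it is JSON."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    # Only prints the URLs; call fetch_json(...) once BASE_URL is real.
    print(model_url("google-resnet-1-101"))
    print(family_url("resnet"))
```

Building the URLs separately from sending the request keeps the path logic checkable without network access. The response schema is not documented here, so inspect the decoded object before relying on specific fields.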
