ResNet V2 50 is an image classification model from Google, listed in this catalog as image-to-text. It is a 50-layer ResNet V2 network with improved residual connections, intended for TensorFlow-based visual recognition.
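The "improved residual connections" of ResNet V2 are generally understood to mean pre-activation blocks: normalization and ReLU come before each weight layer, and the shortcut is a plain identity with no activation after the addition. The following is a minimal sketch of that ordering, not the model's actual implementation. It assumes dense layers in place of convolutions and a simple per-vector normalization in place of batch normalization, and all function names are hypothetical.

```python
import math


def batch_norm(x, eps=1e-5):
    """Normalize a vector to zero mean and unit variance (stand-in for BatchNorm)."""
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [(v - mean) / math.sqrt(var + eps) for v in x]


def relu(x):
    """Element-wise rectified linear unit."""
    return [max(0.0, v) for v in x]


def dense(x, weights):
    """Multiply vector x by a weight matrix given as a list of output rows."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]


def preact_residual_block(x, w1, w2):
    """Pre-activation residual block: norm -> ReLU -> weights, applied twice."""
    h = dense(relu(batch_norm(x)), w1)
    h = dense(relu(batch_norm(h)), w2)
    # Identity shortcut: the input is added back unchanged, and no
    # activation is applied after the addition.
    return [a + b for a, b in zip(x, h)]


# With all-zero weights the residual branch contributes nothing,
# so the block reduces to the identity mapping.
x = [1.0, -2.0, 3.0]
zero = [[0.0] * 3 for _ in range(3)]
print(preact_residual_block(x, zero, zero))  # [1.0, -2.0, 3.0]
```

Because nothing follows the addition, negative values pass through the shortcut untouched, which is the property that distinguishes this ordering from the original post-activation design.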
Specifications
Canonical ID: google-resnet-2-50b-classification
Type: Image to Text
Status: Active
Creator: Google
Input Modalities: Image
Output Modalities: Text
Parameters: approximately 25.6M (the "50" in the name is the layer count, not a parameter count in billions)
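The text output of a classifier like this is typically a class label rather than free-form text. The sketch below shows one common way raw class scores become labels, assuming the model returns one logit per class. The label list and logit values are hypothetical and are not taken from this model.

```python
import math


def softmax(logits):
    """Convert raw scores to probabilities (shifted by the max for stability)."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_labels(logits, labels, k=3):
    """Return the k most probable (label, probability) pairs, best first."""
    probs = softmax(logits)
    ranked = sorted(zip(labels, probs), key=lambda pair: pair[1], reverse=True)
    return ranked[:k]


# Hypothetical three-class example; a real ImageNet classifier would
# typically emit around a thousand logits.
labels = ["cat", "dog", "ship"]
logits = [2.0, 0.5, -1.0]
for label, prob in top_k_labels(logits, labels, k=2):
    print(f"{label}: {prob:.3f}")
```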

Capabilities

Input (1/5)
Supported: Image
Not supported: Text, Audio, Video, PDF

Output (1/5)
Supported: Text
Not supported: Image, Audio, Video, Embedding

Capabilities (0/13)
Not supported: Reasoning, Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Versions

Version | Released | Context | Input / 1M | Output / 1M | Status
ResNet V2 50 | | | | | Current
ResNet V2 101 | | | | | Available
ResNet V2 Classification | | | | | Available
ResNet V2 Featurevector | | | | | Available
ResNet V1 101 | | | | | Available
ResNet V1 152 | | | | | Available
ResNet V1 50 | | | | | Available
ResNet V1 Classification | | | | | Available
ResNet 101 | | | | | Available
ResNet 152 | | | | | Available
ResNet 18 | | | | | Available

Model IDs