Name: RetinaNet ResNet-101 V1 FPN
Brand: Google

RetinaNet ResNet-101 V1 FPN is Google's image to text model. An object detection model combining RetinaNet with a ResNet-101 V1 backbone and Feature Pyramid Network for multi-scale detection.

Specifications
Canonical ID	`google-retinanet-resnet101v1-fpn`
Type	Image to Text
Status	Active
Creator	Google
Input Modalities	Image
Output Modalities	Text

Capabilities

Input1/5

Text·

Image✓

Audio·

Video·

PDF·

Output1/5

Text✓

Image·

Audio·

Video·

Embedding·

Capabilities0/13

Reasoning·

Adaptive Reasoning·

Function Calling·

Parallel Function Calling·

Structured Outputs·

Native JSON Schema·

Web Search·

URL Context·

Computer Use·

Code Execution·

File Search·

Prompt Caching·

Assistant Prefill·

Versions

Version	Released	Context	Input / 1M	Output / 1M	Status
RetinaNet ResNet-101 V1 FPN	—	—	—	—	Current
RetinaNet	—	—	—	—	Available
RetinaNet ResNet-152 V1 FPN	—	—	—	—	Available
RetinaNet ResNet-50 V1 FPN	—	—	—	—	Available
Retinanet SSD Resnet-101 640x640	—	—	—	—	Available

Model IDs

amazon_sagemaker/tensorflow-od-retinanet-resnet101-v1-fpn-1024x1024-1
google-retinanet-resnet101v1-fpn