
SSD MobileNet V1 FPN


SSD MobileNet V1 FPN is Google's single-shot object detection model, listed here under the image-to-text type. It combines the SSD detection framework with a MobileNet V1 backbone and a Feature Pyramid Network (FPN) for real-time detection at 640×640 input resolution.
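To make the 640×640 figure more concrete, here is a minimal sketch of how an FPN-based single-shot detector tiles the input into detection grids. The pyramid levels (3 through 7) and the six anchors per grid cell are assumptions borrowed from commonly published configurations for this model family, not values stated on this page; the function names are hypothetical.

```python
import math


def fpn_grid_sizes(input_size=640, min_level=3, max_level=7):
    """Return {level: grid side length} for a square input.

    Pyramid level L has stride 2**L, so its feature map is
    input_size / 2**L cells on each side (rounded up).
    ASSUMPTION: levels 3..7 are illustrative, not taken from this page.
    """
    return {
        level: math.ceil(input_size / 2 ** level)
        for level in range(min_level, max_level + 1)
    }


def total_anchors(grids, anchors_per_cell=6):
    """Count anchor boxes across all pyramid levels.

    ASSUMPTION: 6 anchors per cell (e.g. 3 aspect ratios x 2 scales).
    """
    cells = sum(side * side for side in grids.values())
    return cells * anchors_per_cell


grids = fpn_grid_sizes()
anchor_count = total_anchors(grids)
print(grids)         # {3: 80, 4: 40, 5: 20, 6: 10, 7: 5}
print(anchor_count)  # 51150
```

Under these assumptions the detector scores every anchor in one forward pass, which is what "single-shot" refers to; a different level range or anchor count changes the totals but not the idea.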
Specifications
Canonical ID: google-ssd-mobilenet-fpn-1
Type: Image to Text
Status: Active
Creator: Google
Input Modalities: Image
Output Modalities: Text

Capabilities

Input (1/5)
Supported: Image
Not supported: Text, Audio, Video, PDF

Output (1/5)
Supported: Text
Not supported: Image, Audio, Video, Embedding

Capabilities (0/13)
Not supported: Reasoning, Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Versions

Version | Released | Context | Input / 1M | Output / 1M | Status
SSD MobileNet FPNLite 2 | | | | | Available
SSD Mobilenet V2 | | | | | Available
SSD MobileNet V1 FPN | | | | | Current
SSD Mobilenet V1 FPN 640x640 | | | | | Available
SSD ResNet 101 V1 FPN | | | | | Available
SSD ResNet 152 V1 FPN | | | | | Available
SSD ResNet 50 V1 FPN | | | | | Available
SSD | | | | | Available

Model IDs