SSD is NVIDIA's image to text model. NVIDIA's Single Shot MultiBox Detector (SSD) model for real-time object detection in images, providing fast bounding-box predictions across multiple object categories.
Specifications
Canonical IDnvidia-ssd
TypeImage to Text
StatusActive
CreatorNVIDIANVIDIA
Input ModalitiesImage
Output ModalitiesText

Capabilities

Input1/5
Text·
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
SSD MobileNet FPNLite 2Available
SSD Mobilenet V2Available
SSD MobileNet V1 FPNAvailable
SSD Mobilenet V1 FPN 640x640Available
SSD ResNet 101 V1 FPNAvailable
SSD ResNet 152 V1 FPNAvailable
SSD ResNet 50 V1 FPNAvailable
SSDCurrent

Model IDs