Google logo

EfficientDet D3


EfficientDet D3 is Google's image to text model. EfficientDet D3 object detection model providing higher accuracy than D2 through compound scaling of resolution, depth, and width.
Specifications
Canonical IDgoogle-efficientdet-d3
TypeImage to Text
StatusActive
CreatorGoogleGoogle
Input ModalitiesImage
Output ModalitiesText

Capabilities

Input1/5
Text·
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
EfficientDet D3Current
EfficientDet D0Available
EfficientDet D2Available
EfficientDet D5Available
SSD EfficientDet D1Available
SSD EfficientDet D4Available

Model IDs