EfficientNet B3 Feature Vector is a Google vision model, listed in this catalog as image-to-text. It is an EfficientNet B3 network that outputs feature vectors, and it offers greater depth and width than B2, giving richer image representations for transfer learning.
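As a rough illustration of the transfer learning use case mentioned above, the sketch below treats the model as a frozen feature extractor and fits a very small classifier on top of its outputs. Everything here is an assumption made for illustration: the 4-dimensional vectors are made-up placeholders rather than real model outputs (real feature vectors are much longer), the function names are hypothetical, and a nearest-centroid rule stands in for the trained classification head that is more commonly used.

```python
from math import sqrt


def l2_normalize(vec):
    """Scale a vector to unit length; a zero vector is returned unchanged."""
    norm = sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else list(vec)


def fit_centroids(features, labels):
    """Average the normalized feature vectors of each class (hypothetical helper)."""
    sums, counts = {}, {}
    for vec, label in zip(features, labels):
        vec = l2_normalize(vec)
        if label not in sums:
            sums[label] = [0.0] * len(vec)
            counts[label] = 0
        sums[label] = [s + x for s, x in zip(sums[label], vec)]
        counts[label] += 1
    return {
        label: [s / counts[label] for s in total]
        for label, total in sums.items()
    }


def predict(centroids, vec):
    """Return the label whose centroid is closest in squared Euclidean distance."""
    vec = l2_normalize(vec)

    def squared_distance(label):
        return sum((a - b) ** 2 for a, b in zip(centroids[label], vec))

    return min(centroids, key=squared_distance)


# Placeholder vectors standing in for precomputed image features (assumed data).
train_features = [
    [0.9, 0.1, 0.0, 0.2],
    [0.8, 0.2, 0.1, 0.1],
    [0.1, 0.9, 0.7, 0.0],
    [0.0, 0.8, 0.9, 0.1],
]
train_labels = ["class_a", "class_a", "class_b", "class_b"]

centroids = fit_centroids(train_features, train_labels)
prediction = predict(centroids, [0.85, 0.15, 0.05, 0.1])
print(prediction)
```

The backbone itself is never updated in this setup; only the per-class centroids are computed from the extracted vectors, which is why a small labeled set can be enough to get started.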
Specifications
Canonical ID: google-efficientnet-b3
Type: Image to Text
Status: Active
Creator: Google
Input Modalities: Image
Output Modalities: Text

Capabilities

Input: 1/5
Supported: Image
Not supported: Text, Audio, Video, PDF
Output: 1/5
Supported: Text
Not supported: Image, Audio, Video, Embedding
Capabilities: 0/13
Not supported: Reasoning, Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Versions

Version | Released | Context | Input / 1M | Output / 1M | Status
EfficientNet V2 ImageNet-1k B0 | - | - | - | - | Available
EfficientNet V2 ImageNet-1k B1 | - | - | - | - | Available
EfficientNet V2 ImageNet-1k B2 | - | - | - | - | Available
EfficientNet V2 ImageNet-1k B3 | - | - | - | - | Available
EfficientNet V2 ImageNet-1k Large | - | - | - | - | Available
EfficientNet V2 ImageNet-1k Medium | - | - | - | - | Available
EfficientNet V2 ImageNet-21k B0 | - | - | - | - | Available
EfficientNet V2 ImageNet-21k B1 | - | - | - | - | Available
EfficientNet V2 ImageNet-21k B2 | - | - | - | - | Available
EfficientNet V2 ImageNet-21k B3 | - | - | - | - | Available
EfficientNet B3 Feature Vector | - | - | - | - | Current

Model IDs