Name: EfficientNet B5
Brand: Google

EfficientNet B5 is Google's image to text model. An EfficientNet B5 image classification model using compound scaling to achieve high accuracy on visual benchmarks, suited for applications where accuracy outweighs inference speed.

Specifications
Canonical ID	`google-efficientnet-b5-classification`
Type	Image to Text
Status	Active
Creator	Google
Input Modalities	Image
Output Modalities	Text

Capabilities

Input1/5

Text·

Image✓

Audio·

Video·

PDF·

Output1/5

Text✓

Image·

Audio·

Video·

Embedding·

Capabilities0/13

Reasoning·

Adaptive Reasoning·

Function Calling·

Parallel Function Calling·

Structured Outputs·

Native JSON Schema·

Web Search·

URL Context·

Computer Use·

Code Execution·

File Search·

Prompt Caching·

Assistant Prefill·

Versions

Version	Released	Context	Input / 1M	Output / 1M	Status
EfficientNet V2 ImageNet-1k B0	—	—	—	—	Available
EfficientNet V2 ImageNet-1k B1	—	—	—	—	Available
EfficientNet V2 ImageNet-1k B2	—	—	—	—	Available
EfficientNet V2 ImageNet-1k B3	—	—	—	—	Available
EfficientNet V2 ImageNet-1k Large	—	—	—	—	Available
EfficientNet V2 ImageNet-1k Medium	—	—	—	—	Available
EfficientNet V2 ImageNet-21k B0	—	—	—	—	Available
EfficientNet V2 ImageNet-21k B1	—	—	—	—	Available
EfficientNet V2 ImageNet-21k B2	—	—	—	—	Available
EfficientNet V2 ImageNet-21k B3	—	—	—	—	Available
EfficientNet B5	—	—	—	—	Current

EfficientNet B5

Capabilities

Versions

Model IDs