EfficientNet B5 is an image classification model from Google, categorized here as Image to Text. It uses compound scaling to achieve high accuracy on visual benchmarks and suits applications where accuracy outweighs inference speed.
| Model ID | Type | Status | Input | Output |
|---|---|---|---|---|
| google-efficientnet-b5-classification | Image to Text | Active | Image | Text |
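The description mentions compound scaling, which grows network depth, width, and input resolution together from a single coefficient. The sketch below illustrates the idea using the base coefficients reported in the original EfficientNet paper (1.2, 1.1, 1.15). The function names and the `phi` values passed in are illustrative assumptions, not part of this catalog entry, and the exact configuration behind the B5 variant is not stated on this page.

```python
# Minimal sketch of EfficientNet-style compound scaling.
# Base coefficients as reported in the EfficientNet paper; they were chosen so
# that ALPHA * BETA**2 * GAMMA**2 is approximately 2.
ALPHA = 1.2   # depth base
BETA = 1.1    # width base
GAMMA = 1.15  # resolution base


def compound_scale(phi: float) -> dict:
    """Return depth, width, and resolution multipliers for coefficient phi."""
    return {
        "depth": ALPHA ** phi,
        "width": BETA ** phi,
        "resolution": GAMMA ** phi,
    }


def flops_factor(phi: float) -> float:
    """Approximate FLOPs growth: depth * width^2 * resolution^2."""
    s = compound_scale(phi)
    return s["depth"] * s["width"] ** 2 * s["resolution"] ** 2


if __name__ == "__main__":
    # phi values here are hypothetical, chosen only to show the trend.
    for phi in (0, 1, 2, 3):
        s = compound_scale(phi)
        print(
            f"phi={phi}: depth x{s['depth']:.2f}, width x{s['width']:.2f}, "
            f"resolution x{s['resolution']:.2f}, FLOPs x{flops_factor(phi):.2f}"
        )
```

Because the base coefficients multiply to roughly 2 in FLOPs terms, each unit increase in `phi` about doubles compute. This is why larger variants trade inference speed for accuracy.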
Capabilities
Input (1/5)
Text ·
Image ✓
Audio ·
Video ·
PDF ·
Output (1/5)
Text ✓
Image ·
Audio ·
Video ·
Embedding ·
Capabilities (0/13)
Reasoning ·
Adaptive Reasoning ·
Function Calling ·
Parallel Function Calling ·
Structured Outputs ·
Native JSON Schema ·
Web Search ·
URL Context ·
Computer Use ·
Code Execution ·
File Search ·
Prompt Caching ·
Assistant Prefill ·
Versions
| Version | Released | Context | Input / 1M | Output / 1M | Status |
|---|---|---|---|---|---|
| EfficientNet V2 ImageNet-1k B0 | — | — | — | — | Available |
| EfficientNet V2 ImageNet-1k B1 | — | — | — | — | Available |
| EfficientNet V2 ImageNet-1k B2 | — | — | — | — | Available |
| EfficientNet V2 ImageNet-1k B3 | — | — | — | — | Available |
| EfficientNet V2 ImageNet-1k Large | — | — | — | — | Available |
| EfficientNet V2 ImageNet-1k Medium | — | — | — | — | Available |
| EfficientNet V2 ImageNet-21k B0 | — | — | — | — | Available |
| EfficientNet V2 ImageNet-21k B1 | — | — | — | — | Available |
| EfficientNet V2 ImageNet-21k B2 | — | — | — | — | Available |
| EfficientNet V2 ImageNet-21k B3 | — | — | — | — | Available |
| EfficientNet B5 | — | — | — | — | Current |