Google logo

EfficientNet B4


EfficientNet B4 is Google's image to text model. An EfficientNet B4 image classification model with compound-scaled architecture, delivering high accuracy on visual recognition tasks with a larger parameter budget than B3.
Specifications
Canonical IDgoogle-efficientnet-b4-classification
TypeImage to Text
StatusActive
CreatorGoogleGoogle
Input ModalitiesImage
Output ModalitiesText

Capabilities

Input1/5
TextΒ·
Imageβœ“
AudioΒ·
VideoΒ·
PDFΒ·
Output1/5
Textβœ“
ImageΒ·
AudioΒ·
VideoΒ·
EmbeddingΒ·
Capabilities0/13
ReasoningΒ·
Adaptive ReasoningΒ·
Function CallingΒ·
Parallel Function CallingΒ·
Structured OutputsΒ·
Native JSON SchemaΒ·
Web SearchΒ·
URL ContextΒ·
Computer UseΒ·
Code ExecutionΒ·
File SearchΒ·
Prompt CachingΒ·
Assistant PrefillΒ·

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
EfficientNet V2 ImageNet-1k B0β€”β€”β€”β€”Available
EfficientNet V2 ImageNet-1k B1β€”β€”β€”β€”Available
EfficientNet V2 ImageNet-1k B2β€”β€”β€”β€”Available
EfficientNet V2 ImageNet-1k B3β€”β€”β€”β€”Available
EfficientNet V2 ImageNet-1k Largeβ€”β€”β€”β€”Available
EfficientNet V2 ImageNet-1k Mediumβ€”β€”β€”β€”Available
EfficientNet V2 ImageNet-21k B0β€”β€”β€”β€”Available
EfficientNet V2 ImageNet-21k B1β€”β€”β€”β€”Available
EfficientNet V2 ImageNet-21k B2β€”β€”β€”β€”Available
EfficientNet V2 ImageNet-21k B3β€”β€”β€”β€”Available
EfficientNet B4β€”β€”β€”β€”Current

Model IDs