Name: Inception V3
Brand: Google

Inception V3 is Google's convolutional neural network for image classification. It takes an image as input and returns its predicted class labels as text. The architecture features factorized convolutions and auxiliary classifiers.
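To make "factorized convolutions" concrete, the sketch below compares weight counts for a large convolution and its factorized replacement. It is illustrative arithmetic only, not the model's actual layer configuration: the channel width of 64 and the helper `conv_params` are assumptions chosen for the example, and biases are ignored.

```python
def conv_params(kernel_h, kernel_w, in_channels, out_channels):
    """Number of weights in one convolution layer, ignoring biases."""
    return kernel_h * kernel_w * in_channels * out_channels


# Hypothetical channel width, chosen only for illustration.
channels = 64

# One 5x5 convolution versus two stacked 3x3 convolutions,
# which cover the same 5x5 receptive field.
full_5x5 = conv_params(5, 5, channels, channels)
stacked_3x3 = 2 * conv_params(3, 3, channels, channels)

# One 7x7 convolution versus an asymmetric 1x7 followed by 7x1 pair.
full_7x7 = conv_params(7, 7, channels, channels)
asymmetric_7 = conv_params(1, 7, channels, channels) + conv_params(7, 1, channels, channels)

print(f"5x5: {full_5x5} weights, two 3x3: {stacked_3x3} weights")
print(f"7x7: {full_7x7} weights, 1x7 + 7x1: {asymmetric_7} weights")
```

Under these assumptions the factorized forms need fewer weights for the same receptive field, which is the motivation usually given for the technique.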

Specifications
Canonical ID	`google-inception-3-classification`
Type	Image to Text
Status	Active
Creator	Google
Input Modalities	Image
Output Modalities	Text
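
For a classifier, the "Text" output is typically the name of the predicted class. The sketch below shows one common way raw class scores could be turned into ranked text labels using a softmax. The function `top_k_labels`, the three label names, and the score values are all hypothetical and are not taken from this model's real label set or API.

```python
import math


def top_k_labels(logits, labels, k=3):
    """Return the k most probable (label, probability) pairs."""
    # Subtract the maximum score before exponentiating for numerical stability.
    max_logit = max(logits)
    exps = [math.exp(x - max_logit) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Sort labels by probability, highest first.
    ranked = sorted(zip(labels, probs), key=lambda pair: pair[1], reverse=True)
    return ranked[:k]


# Hypothetical labels and scores, for illustration only.
labels = ["tabby cat", "golden retriever", "espresso"]
logits = [2.0, 0.5, -1.0]

predictions = top_k_labels(logits, labels, k=2)
for label, prob in predictions:
    print(f"{label}: {prob:.3f}")
```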

Capabilities

Input (1/5)
Text	·
Image	✓
Audio	·
Video	·
PDF	·

Output (1/5)
Text	✓
Image	·
Audio	·
Video	·
Embedding	·

Capabilities (0/13)
Reasoning	·
Adaptive Reasoning	·
Function Calling	·
Parallel Function Calling	·
Structured Outputs	·
Native JSON Schema	·
Web Search	·
URL Context	·
Computer Use	·
Code Execution	·
File Search	·
Prompt Caching	·
Assistant Prefill	·

Versions

Version	Released	Context	Input / 1M	Output / 1M	Status
Inception V3	—	—	—	—	Current
Inception V3 Feature Vector	—	—	—	—	Available
Inception V2	—	—	—	—	Available
Inception V2 Feature Vector	—	—	—	—	Available
Inception V1	—	—	—	—	Available
Inception V1 Feature Vector	—	—	—	—	Available
