Imagen is Google's image generation model with a 480 context window. Google's text-to-image diffusion model series known for high photorealism and strong text-image alignment.
Specifications
Canonical IDgoogle-imagen
TypeImage Generation
StatusActive
CreatorGoogleGoogle
Providers
Context Window480 tokens
Input ModalitiesText
Output ModalitiesImage

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text·
Image
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Google Vertex AI logo
Google Vertex AI
vertex_ai/imagegeneration@006

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Imagen 4 Fast480Available
Imagen 4 Ultra480Available
Imagen 4480Available
Gen-4 ImageAvailable
Gen-4 Image TurboAvailable
Imagen 4 PreviewAvailable
Gemini 3.1 Flash Image Preview131K$0.250$1.50Available
Gemini 3 Pro Image Preview66K$2.00$12.00Available
Stable Diffusion 3.5 Large77Available
Recraft V3Available
Imagen480Current

Model IDs