Stable Diffusion 3 is Stability AI's image generation model, available from 2 providers. The third major generation of Stability AI's text-to-image model, featuring a multimodal diffusion transformer architecture for improved composition and typography.
Specifications
Canonical IDstability-ai-stable-diffusion-3
TypeImage Generation
StatusActive
CreatorStability AIStability AI
Providers
Input ModalitiesText
Output ModalitiesImage

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text·
Image
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Imagen 4 Fast480Available
Imagen 4 Ultra480Available
Imagen 4480Available
Gen-4 ImageAvailable
Gen-4 Image TurboAvailable
Imagen 4 PreviewAvailable
Gemini 3.1 Flash Image Preview131K$0.250$1.50Available
Gemini 3 Pro Image Preview66K$2.00$12.00Available
Stable Diffusion 3.5 Large77Available
Recraft V3Available
Stable Diffusion 3Current

Model IDs