Stable Diffusion XL 1 is Stability AI's image to text model with a 4K context window and up to 4K output tokens. The official 1.0 release of Stability AI's Stable Diffusion XL foundation model, optimized for high-resolution photorealistic image generation.
stability-ai-stable-diffusion-xl-1 |
| Image to Text |
| Active |
| 4K tokens |
| 4K tokens |
| Image |
| Text |
| Benchmarks | |
|---|---|
| Elo Rating | 874#285 |
Capabilities
Input1/5
Text·
Image✓
Audio·
Video·
PDF·
Output1/5
Text✓
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·
Versions
| Version | Released | Context | Input / 1M | Output / 1M | Status |
|---|---|---|---|---|---|
| Imagen 4 Fast | 480 | — | — | Available | |
| Imagen 4 Ultra | 480 | — | — | Available | |
| Imagen 4 | 480 | — | — | Available | |
| Gen-4 Image | — | — | — | — | Available |
| Gen-4 Image Turbo | — | — | — | — | Available |
| Imagen 4 Preview | — | — | — | — | Available |
| Gemini 3.1 Flash Image Preview | 131K | $0.250 | $1.50 | Available | |
| Gemini 3 Pro Image Preview | 66K | $2.00 | $12.00 | Available | |
| Stable Diffusion 3.5 Large | 77 | — | — | Available | |
| Recraft V3 | — | — | — | Available | |
| Stable Diffusion XL 1 | — | 4K | — | — | Current |