ERNIE Image is Baidu's text-to-image generation model, built on the ERNIE multimodal foundation.
Specifications
Canonical ID: baidu-ernie-image
Type: Image Generation
Status: Active
Creator: Baidu
Input Modalities: Text
Output Modalities: Image
Benchmarks
Elo Rating: 1179 (#64)

Capabilities

Input (1/5): Text. Not supported: Image, Audio, Video, PDF.
Output (1/5): Image. Not supported: Text, Audio, Video, Embedding.
Capabilities (0/13): none. Not supported: Reasoning, Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill.

Versions

Version | Released | Context | Input / 1M | Output / 1M | Status
ERNIE 5 Thinking Preview | - | - | - | - | Available
ERNIE 4.5 21B A3B Thinking | - | 131K | $0.070 | $0.280 | Available
ERNIE 4.5 300B A47B | - | 131K | - | - | Available
ERNIE 4.5 VL 424B A47B | - | 131K | $0.420 | $1.25 | Available
ERNIE 4.5 300B A47B Paddle | - | 123K | $0.280 | $1.10 | Available
ERNIE 4.5 VL 28B A3B Thinking | - | 131K | $0.390 | $0.390 | Available
ERNIE Image | - | - | - | - | Current
ERNIE Image Turbo | - | - | - | - | Available

Model IDs