Baidu logo

ERNIE 4.5 VL 28B A3B


ERNIE 4.5 VL 28B A3B is Baidu logoBaidu's language model with a 30K context window and up to 8K output tokens, available from 3 providers, starting at $0.140 / 1M input and $0.560 / 1M output. A 28B multimodal MoE vision-language model from Baidu with 3B active parameters per token, enabling cross-modal understanding and generation.
Spec
Canonical IDbaidu-ernie-vl-4-5-28b-a3b
TypeLanguage
StatusActive
CreatorBaiduBaidu
Providers
Context Window30K tokens
Max Output8K tokens
Input ModalitiesImageText
Output ModalitiesText
Reasoning Effortsdefault
Parameters28B
HuggingFace Likes101
HuggingFace Downloads (30d)70,108
HuggingFace Downloads (all-time)802,331
Release Date · 8 months ago
Knowledge Cutoff

Capabilities

Input2/5
Text
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities3/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Hugging Face logo
Hugging Face
novita:baidu/ernie-4.5-vl-28b-a3b
$0.140$0.560
Novita logo
Novita
novita/baidu/ernie-4.5-vl-28b-a3b
$0.140$0.560
OpenRouter logo
OpenRouter
baidu/ernie-4.5-vl-28b-a3b
$0.140$0.560

Cost Calculator

Preset:
Compares every provider & tier in USD

Model IDs