
ERNIE 4.5 VL 424B A47B


ERNIE 4.5 VL 424B A47B is Baidu's multimodal vision-language model with a 123K-token context window and up to 16K output tokens. It is available from 3 providers, starting at $0.420 per 1M input tokens and $1.25 per 1M output tokens. It is a large-scale mixture-of-experts (MoE) model with 424B total parameters, of which 47B are activated per token, designed for cross-modal knowledge fusion.
Spec
Canonical ID: baidu-ernie-vl-4-5-424b-a47b
Type: Language
Status: Active
Creator: Baidu
Providers: 3
Context Window: 123K tokens
Max Output: 16K tokens
Input Modalities: Image, Text
Output Modalities: Text
Reasoning Efforts: default
Parameters: 424B
Release Date: 10 months ago
Knowledge Cutoff: not listed
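
The context window and max output limits in the spec can be turned into a quick pre-flight check on a request. The following is a minimal sketch, not provider code: it assumes "123K" and "16K" mean 123,000 and 16,000 tokens, and that prompt and output tokens share the same context window. Neither assumption is confirmed by this page, and `max_output_budget` is a hypothetical helper name.

```python
# Limits taken from the spec above.
# Assumption: "123K" and "16K" are read as 123,000 and 16,000 tokens;
# exact limits may differ by provider.
CONTEXT_WINDOW = 123_000
MAX_OUTPUT = 16_000


def max_output_budget(prompt_tokens: int) -> int:
    """Return the largest output-token budget usable with this prompt.

    Hypothetical helper. Assumes prompt and output tokens share the
    context window, which this page does not state explicitly.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # Cap by the model's max output, and never return a negative budget.
    return max(0, min(MAX_OUTPUT, remaining))


print(max_output_budget(50_000))   # 16000: limited by the max output
print(max_output_budget(115_000))  # 8000: limited by the remaining context
```

Under these assumptions, any prompt longer than 107,000 tokens leaves less than the full 16K output budget.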

Capabilities

Input (2/5)
Supported: Text, Image
Not supported: Audio, Video, PDF

Output (1/5)
Supported: Text
Not supported: Image, Audio, Video, Embedding

Capabilities (1/13)
Supported: Reasoning
Not supported: Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Pricing by Provider

Standard tier, USD per 1M tokens.

Provider | Model ID | Input ($ / 1M) | Output ($ / 1M)
Hugging Face | novita:baidu/ernie-4.5-vl-424b-a47b | $0.420 | $1.25
Novita | novita/baidu/ernie-4.5-vl-424b-a47b | $0.420 | $1.25
OpenRouter | baidu/ernie-4.5-vl-424b-a47b | $0.420 | $1.25

Cost Calculator

Compares every provider & tier in USD
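
As a rough illustration of the arithmetic behind such a comparison, the sketch below estimates the cost of a single request from the listed Standard prices ($0.420 per 1M input tokens, $1.25 per 1M output tokens). The function name `estimate_cost` and the example token counts are hypothetical; actual billing may round or meter tokens differently per provider.

```python
from decimal import Decimal

# Standard-tier prices listed on this page, in USD per 1M tokens.
# All three listed providers show the same rates.
INPUT_PRICE_PER_M = Decimal("0.420")
OUTPUT_PRICE_PER_M = Decimal("1.25")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (Decimal(input_tokens) * INPUT_PRICE_PER_M
             + Decimal(output_tokens) * OUTPUT_PRICE_PER_M)
    return total / TOKENS_PER_M


# Example: 100,000 input tokens and 10,000 output tokens.
cost = estimate_cost(100_000, 10_000)
print(f"${cost:.4f}")  # $0.0545
```

`Decimal` is used instead of floats so that small per-token prices add up without binary rounding error.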

Versions

Version | Released | Context | Input / 1M | Output / 1M | Status
ERNIE 5 Thinking Preview | - | - | - | - | Available
ERNIE 4.5 21B A3B Thinking | - | 131K | $0.070 | $0.280 | Available
ERNIE 4.5 VL 424B A47B | - | 123K | $0.420 | $1.25 | Current
ERNIE 4.5 300B A47B | - | 123K | $0.280 | $1.10 | Available
ERNIE 4.5 300B A47B Paddle | - | 123K | $0.280 | $1.10 | Available
ERNIE 4.5 VL 28B A3B Thinking | - | 131K | $0.390 | $0.390 | Available

Model IDs