LlamaGuard 4 12B is Meta's language model with a 164K context window and up to 16K output tokens, available from 4 providers, starting at $0.180 / 1M input and $0.180 / 1M output. A natively multimodal 12B safety classifier pruned from Llama 4 Scout, trained jointly on text and multiple images for comprehensive content moderation.
Specifications
Canonical IDmeta-llamaguard-4-12b
TypeLanguage
StatusActive
CreatorMetaMeta
Providers
Context Window164K tokens
Max Output16K tokens
Input ModalitiesImageText
Output ModalitiesText
Parameters12B
HuggingFace Likes90
HuggingFace Downloads (30d)79,129
HuggingFace Downloads (all-time)788,036
Release Date · 1 year ago
Knowledge Cutoff

Capabilities

Input2/5
Text
Image
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities1/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
LlamaGuard 4 12B164K$0.180$0.180Current
Llama 4 Maverick1.0M$0.120$0.485Available
Llama 4 Scout10.0M$0.080$0.300Available

Model IDs