Llama Guard 4 12B

Llama Guard 4 12B is a text model from DeepInfra with a context window of 164K tokens and max output of 164K tokens. Pricing starts at 0.18 per million input tokens and 0.18 per million output tokens (cheapest at DeepInfra).

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Specifications

Model Keydeepinfra/meta-llama/Llama-Guard-4-12B
ProviderDeepInfra
Provider IDdeepinfra
ModeText
Canonical Namellama-guard-4-12b
Context Window164K tokens
Max Output164K tokens

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens0.0001800.180
Output Tokens0.0001800.180

Benchmarks

No benchmark data is available for this model.

Price Comparison by Provider

Compare prices for Llama Guard 4 12B across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price, $
Output Price, $
Groqgroq/meta-llama/llama-guard-4-12b0.2000.200
DeepInfradeepinfra/meta-llama/Llama-Guard-4-12B0.1800.180

All Variants

All available versions, regions, and API endpoints for Llama Guard 4 12B.

Model Key
Provider
Mode
Input Price, $
Output Price, $
Context
Max Output
Vision
Functions
deepinfra/meta-llama/Llama-Guard-4-12BDeepInfraText0.1800.180164K164Knono
groq/meta-llama/llama-guard-4-12bGroqText0.2000.2008K8Knono