Llama Guard 3 11B Vision


Llama Guard 3 11B Vision is Meta's language model with a 128K context window, starting at $0.350 / 1M input tokens and $0.350 / 1M output tokens. It is an 11B multimodal content safety classifier from Meta's Llama Guard 3 series, capable of evaluating both text and image inputs for harmful content.
Spec
Canonical ID: meta-llama-guard-3-11b-vision
Type: Language
Status: Active
Creator: Meta
Providers
Context Window: 128K tokens
Input Modalities: Image
Output Modalities: Text
Parameters: 11B

Capabilities

Input (1/5)
Supported: Image
Not supported: Text, Audio, Video, PDF

Output (1/5)
Supported: Text
Not supported: Image, Audio, Video, Embedding

Capabilities (0/13)
Supported: none
Not supported: Reasoning, Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Pricing by Provider

Provider | Model ID | Standard Input ($ / 1M) | Standard Output ($ / 1M)
IBM watsonx | meta-llama/llama-guard-3-11b-vision | $0.350 | $0.350
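
The per-token rates listed for this model can be turned into a rough spend estimate with simple arithmetic. The sketch below assumes billing is strictly linear in token count at the listed Standard rates, with no minimums, rounding rules, or per-image charges; the helper name `estimate_cost_usd` and the example workload are hypothetical and not part of any provider API.

```python
from decimal import Decimal

# Rates taken from the listing (USD per 1M tokens, IBM watsonx, Standard tier).
INPUT_USD_PER_1M = Decimal("0.350")
OUTPUT_USD_PER_1M = Decimal("0.350")
TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Hypothetical helper: linear cost estimate from token counts.

    Assumes cost scales linearly with tokens; real invoices may differ.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_1M / TOKENS_PER_UNIT
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_1M / TOKENS_PER_UNIT
    return input_cost + output_cost


if __name__ == "__main__":
    # Made-up workload: 2M input tokens and 50K output tokens.
    print(estimate_cost_usd(2_000_000, 50_000))
```

Under these assumptions, 2M input tokens and 50K output tokens come to 0.7175 USD. `Decimal` is used instead of floats so that small per-token amounts add up without binary rounding drift.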

Versions

Version | Released | Context | Input / 1M | Output / 1M | Status
Llama Guard 3 11B Vision | | 128K | $0.350 | $0.350 | Current
Llama Guard | | | | | Available

Model IDs