Name: Llama 3.2 11B Instruct Vision
Brand: Meta

Llama 3.2 11B Instruct Vision is Meta's 11B-parameter, instruction-tuned vision-language model from the Llama 3.2 family. It combines image understanding with instruction following for multimodal applications.

Specifications
Canonical ID	`meta-llama-3-2-11b-instruct-vision`
Status	Active
Creator	Meta

Benchmarks
Metric	Value	Rank
Intelligence Index	8.7	#420
Coding Index	4.2	#351
Math Index	1.7	#253
MMLU-Pro	0.5	#284
GPQA	0.2	#447
HLE	0.1	#259
LiveCodeBench	0.1	#293
AIME	0.1	#123
IFBench	0.3	#325
Time to First Token	0.42s	#236
SciCode	0.1	#387
MATH-500	0.5	#154
AIME 2025	0.0	#253
LCR	0.1	#284
TerminalBench Hard	0.0	#328
TAU2	0.1	#331
Output TPS	86.3	#143

Capabilities

Input (0/5)	Text, Image, Audio, Video, PDF
Output (0/5)	Text, Image, Audio, Video, Embedding
Capabilities (0/13)	Reasoning, Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Versions

Version	Released	Context	Input / 1M	Output / 1M	Status
Llama 3.3 70B Instruct	2024-12-06	131K	$0.100	$0.200	Available
Llama 3.2 3B Instruct	2024-09-25	131K	$0.015	$0.020	Deprecated
Llama 3.2 1B Instruct	2024-09-25	131K	$0.027	$0.080	Deprecated
Llama 3.1 405B Instruct	2024-07-23	131K	$0.120	$0.300	Deprecating
Llama 3.1 70B Instruct	2024-07-23	131K	$0.100	$0.100	Available
Llama 3.1 8B Instruct	2024-07-23	200K	$0.020	$0.030	Available
Llama 3.1 70B	2024-07-23	128K	$0.600	$0.600	Available
Llama 3.1 8B	2024-07-23	131K	$0.030	$0.050	Available
Llama 3 70B Instruct	2024-04-18	131K	$0.120	$0.300	Available
Llama 3 8B Instruct	2024-04-18	32K	$0.030	$0.040	Available
Llama 3.2 11B Instruct Vision	—	—	—	—	Current
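
The "Input / 1M" and "Output / 1M" columns are prices in USD per one million tokens, so a per-request cost estimate is a simple weighted sum. The sketch below is illustrative only: the `PRICES` dictionary and the `request_cost` helper are hypothetical names, the two entries are copied from rows of the table above, and the token counts are made up. The row for Llama 3.2 11B Instruct Vision lists no prices, so it is not included.

```python
# Prices in USD per 1M tokens (input, output), copied from the Versions table.
# Only two rows are included here for illustration.
PRICES = {
    "Llama 3.3 70B Instruct": (0.100, 0.200),
    "Llama 3.1 8B Instruct": (0.020, 0.030),
}


def request_cost(version: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from per-1M-token prices."""
    input_price, output_price = PRICES[version]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Hypothetical request: 2,000 prompt tokens and 500 generated tokens.
cost = request_cost("Llama 3.3 70B Instruct", input_tokens=2_000, output_tokens=500)
print(f"${cost:.6f}")
```

Actual billing may differ from this estimate (for example, if a provider rounds token counts or prices image inputs separately), so treat the result as a rough guide.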
