Name: Llama 3.2 11B Vision Instruct
Brand: Meta

Llama 3.2 11B Vision Instruct is

Meta's language model with a 128K context window and up to 8K output tokens, starting at $0.160 / 1M input and $0.160 / 1M output. An 11B instruction-tuned vision-language Llama 3.2 model optimized for visual recognition, image reasoning, and captioning tasks combining text and image inputs.

Spec
Canonical ID	`meta-llama-3-2-11b`
Type	Language
Status	Active
Creator	Meta
Providers	Vercel AI Gateway
Context Window	128K tokens
Max Output	8K tokens
Input Modalities	Image
Output Modalities	Text
Parameters	11B
Release Date	2024-09-25 · 2 years ago

Capabilities

Input1/5

Text·

Image✓

Audio·

Video·

PDF·

Output1/5

Text✓

Image·

Audio·

Video·

Embedding·

Capabilities1/13

Reasoning·

Adaptive Reasoning·

Function Calling✓

Parallel Function Calling·

Structured Outputs·

Native JSON Schema·

Web Search·

URL Context·

Computer Use·

Code Execution·

File Search·

Prompt Caching·

Assistant Prefill·

Pricing by Provider

Provider	Standard
Provider	Input $ / 1M	Output $ / 1M
Vercel AI Gateway meta/llama-3.2-11b	$0.160	$0.160

Cost Calculator

Preset:

Input tokens

Output tokens

Number of calls

Compares every provider & tier in USD

Versions

Version	Released	Context	Input / 1M	Output / 1M	Status
Llama 3.3 70B Instruct	2024-12-06	131K	$0.720	$0.720	Available
Llama 3.3 70B Instruct	2024-12-06	131K	$0.100	$0.300	Available
Llama 3.3	—	—	—	—	Available
Llama 3.3 70B Instruct Turbo	—	131K	$0.130	$0.390	Available
Llama 3.3 70B Versatile	—	128K	$0.590	$0.790	Available
Llama 3.3 8B Instruct	—	128K	—	—	Available
Llama 3.2 11B Vision Instruct	2024-09-25	128K	$0.160	$0.160	Current
Llama 3.2 1B Instruct	2024-09-25	128K	$0.027	$0.080	Deprecated
Llama 3.2 3B Instruct	2024-09-25	131K	$0.015	$0.020	Deprecated
Llama 3.2 90B Vision Instruct	2024-09-25	128K	$0.720	$0.720	Available
Llama 3.2 1B	2024-09-18	131K	$0.100	$0.100	Available

Llama 3.2 11B Vision Instruct

Capabilities

Pricing by Provider

Cost Calculator

Versions

Model IDs