Name: Llama Nemotron Super
Brand: NVIDIA

Llama Nemotron Super is NVIDIA's language model. A compute-efficient Nemotron Super LLM fine-tuned by NVIDIA on a Llama base, targeting high-accuracy multi-agent and agentic system applications.

Specifications
Canonical ID	`nvidia-llama-nemotron-super`
Type	Language
Status	Active
Creator	NVIDIA
Input Modalities	Text
Output Modalities	Text

Capabilities

Input1/5

Text✓

Image·

Audio·

Video·

PDF·

Output1/5

Text✓

Image·

Audio·

Video·

Embedding·

Capabilities0/13

Reasoning·

Adaptive Reasoning·

Function Calling·

Parallel Function Calling·

Structured Outputs·

Native JSON Schema·

Web Search·

URL Context·

Computer Use·

Code Execution·

File Search·

Prompt Caching·

Assistant Prefill·

Versions

Version	Released	Context	Input / 1M	Output / 1M	Status
Llama Nemotron 1.5 Super 49B Reasoning	—	—	—	—	Available
Llama Nemotron 1.5 Super 49B	—	—	—	—	Available
Llama Nemotron Super	—	—	—	—	Current
Llama 3.3 Nemotron Super 49B Reasoning	—	—	—	—	Available
Llama 3.3 Nemotron Super 49B	—	—	—	—	Available

Other models

Model	Tier	Released	Context	Input / 1M	Output / 1M
Llama 3.3 70B Instruct	—	2024-12-06	131K	$0.100	$0.200
Llama 3.2 3B Instruct	—	2024-09-25	131K	$0.015	$0.020
Llama 3.2 1B Instruct	—	2024-09-25	131K	$0.027	$0.080
Llama 3.1 405B Instruct	—	2024-07-23	131K	$0.120	$0.300
Llama 3.1 70B Instruct	—	2024-07-23	131K	$0.100	$0.100
Llama 3.1 8B Instruct	—	2024-07-23	200K	$0.020	$0.030
Llama 3.1 70B	—	2024-07-23	128K	$0.600	$0.600
Llama 3.1 8B	—	2024-07-23	131K	$0.030	$0.050
Llama 3 70B Instruct	—	2024-04-18	131K	$0.120	$0.300
Llama 3 8B Instruct	—	2024-04-18	32K	$0.030	$0.040

Llama Nemotron Super

Capabilities

Versions

Other models

Model IDs