Name: Llama 3.1 Nemotron Nano 4B Reasoning
Brand: NVIDIA

Llama 3.1 Nemotron Nano 4B Reasoning is NVIDIA's language model. A compact 4B-parameter reasoning model from NVIDIA's Nemotron Nano series, derived from Llama 3.1 for efficient on-device inference with reasoning capabilities.

Specifications
Canonical ID	`nvidia-llama-3-1-nemotron-nano-4b-reasoning`
Type	Language
Status	Active
Creator	NVIDIA
Input Modalities	Text
Output Modalities	Text
Parameters	4B

Benchmarks
Intelligence Index	14.4 #310
Math Index	50.0 #132
MMLU-Pro	0.6 #263
GPQA	0.4 #361
HLE	0.1 #273
LiveCodeBench	0.5 #141
AIME	0.7 #31
IFBench	0.3 #354
Time to First Token	0.00s #156
SciCode	0.1 #394
MATH-500	0.9 #37
AIME 2025	0.5 #132
LCR	0.0 #380
TAU2	0.1 #342
Output TPS	0.0 #428

Capabilities

Input1/5

Text✓

Image·

Audio·

Video·

PDF·

Output1/5

Text✓

Image·

Audio·

Video·

Embedding·

Capabilities0/13

Reasoning·

Adaptive Reasoning·

Function Calling·

Parallel Function Calling·

Structured Outputs·

Native JSON Schema·

Web Search·

URL Context·

Computer Use·

Code Execution·

File Search·

Prompt Caching·

Assistant Prefill·

Other models

Model	Tier	Released	Context	Input / 1M	Output / 1M
Llama 3.3 70B Instruct	—	2024-12-06	131K	$0.100	$0.200
Llama 3.2 3B Instruct	—	2024-09-25	131K	$0.015	$0.020
Llama 3.2 1B Instruct	—	2024-09-25	131K	$0.027	$0.080
Llama 3.1 405B Instruct	—	2024-07-23	131K	$0.120	$0.300
Llama 3.1 70B Instruct	—	2024-07-23	131K	$0.100	$0.100
Llama 3.1 8B Instruct	—	2024-07-23	200K	$0.020	$0.030
Llama 3.1 70B	—	2024-07-23	128K	$0.600	$0.600
Llama 3.1 8B	—	2024-07-23	131K	$0.030	$0.050
Llama 3 70B Instruct	—	2024-04-18	131K	$0.120	$0.300
Llama 3 8B Instruct	—	2024-04-18	32K	$0.030	$0.040

Llama 3.1 Nemotron Nano 4B Reasoning

Capabilities

Other models

Model IDs