Name: Llama 3.1 Nemotron 1 Ultra 253B Reasoning
Brand: NVIDIA

Llama 3.1 Nemotron 1 Ultra 253B Reasoning is an AI model from NVIDIA: a 253B-parameter, reasoning-specialized variant of Nemotron Ultra, built on Llama 3.1 for complex multi-step inference and agentic workflows.

Specifications
Canonical ID	`nvidia-llama-3-1-nemotron-1-ultra-253b-reasoning`
Status	Active
Creator	NVIDIA

Benchmarks
Benchmark	Score	Rank
Intelligence Index	9.1	#314
Math Index	63.7	#102
MMLU-Pro	0.8	#61
GPQA	0.7	#181
HLE	0.1	#191
LiveCodeBench	0.6	#94
AIME	0.7	#27
IFBench	0.4	#256
Time to First Token	0.68s	#311
SciCode	0.3	#197
MATH-500	1.0	#34
AIME 2025	0.6	#102
LCR	0.1	#321
TerminalBench Hard	0.0	#311
TAU2	0.1	#360
Output TPS	53.3	#199

Capabilities

Input (0/5): Text, Image, Audio, Video, PDF

Output (0/5): Text, Image, Audio, Video, Embedding

Capabilities (0/13): Reasoning, Adaptive Reasoning, Function Calling, Parallel Function Calling, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Versions

Version	Released	Context	Input / 1M	Output / 1M	Status
Llama 3.1 Nemotron 1 Ultra 253B Reasoning	—	—	—	—	Current
Llama 3.1 Nemotron 1 Ultra 253B	—	128K	$0.600	$1.80	Available
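
To illustrate how the per-1M-token prices in the table above translate into the cost of a single request, here is a minimal sketch in Python. The prices are the ones listed for Llama 3.1 Nemotron 1 Ultra 253B; the function name `estimate_cost_usd` and the token counts in the usage line are hypothetical and chosen only for illustration.

```python
from decimal import Decimal

# Prices in USD per 1M tokens, copied from the Versions table
# (row "Llama 3.1 Nemotron 1 Ultra 253B").
INPUT_PRICE_PER_M = Decimal("0.600")
OUTPUT_PRICE_PER_M = Decimal("1.80")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request from its token counts.

    Hypothetical helper: it assumes simple linear per-token pricing
    with no caching discounts or minimum charges.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (Decimal(input_tokens) * INPUT_PRICE_PER_M
             + Decimal(output_tokens) * OUTPUT_PRICE_PER_M)
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # Example token counts are made up for illustration.
    print(estimate_cost_usd(200_000, 50_000))
```

Decimal arithmetic is used here so that small per-token amounts do not pick up binary floating-point rounding noise. The reasoning variant lists no prices, so the sketch does not apply to it directly.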

Other Models

Model	Tier	Released	Context	Input / 1M	Output / 1M
Llama 3.3 70B Instruct	—	2024-12-06	131K	$0.120	$0.200
Llama 3.2 3B Instruct	—	2024-09-25	131K	$0.015	$0.020
Llama 3.2 1B Instruct	—	2024-09-25	131K	$0.027	$0.080
Llama 3.2 11B	—	2024-09-25	128K	$0.160	$0.160
Llama 3.1 405B Instruct	—	2024-07-23	131K	$0.120	$0.300
Llama 3.1 8B Instruct	—	2024-07-23	200K	$0.020	$0.030
Llama 3.1 70B Instruct	—	2024-07-23	131K	$0.120	$0.300
Llama 3.1 70B	—	2024-07-23	128K	$0.360	$0.360
Llama 3.1 8B	—	2024-07-23	131K	$0.030	$0.050
Llama 3 70B Instruct	—	2024-04-23	131K	$0.120	$0.300

Model IDs

llama-3-1-nemotron-ultra-253b-v1-reasoning

nvidia-llama-3-1-nemotron-1-ultra-253b-reasoning

Llama 3.1 Nemotron 1 Ultra 253B Reasoning

Capabilities API: GET /api/v1/models/nvidia-llama-3-1-nemotron-1-ultra-253b-reasoning

Versions API: GET /api/v1/models?family=llama

Other Models API: GET /api/v1/models/nvidia-llama-3-1-nemotron-1-ultra-253b-reasoning/similar

Model IDs API: GET /api/v1/models/nvidia-llama-3-1-nemotron-1-ultra-253b-reasoning
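
The endpoint paths listed above can be assembled programmatically. The following Python sketch only builds the request URLs and performs no network calls. The host in `BASE_URL` is a placeholder, since the page does not state one, and the helper names `model_url` and `family_url` are hypothetical.

```python
from urllib.parse import quote, urlencode

# Placeholder host: the source lists only paths, not a base URL.
BASE_URL = "https://example.invalid"
MODEL_ID = "nvidia-llama-3-1-nemotron-1-ultra-253b-reasoning"


def model_url(model_id: str, suffix: str = "") -> str:
    """Build the URL for a single model, e.g. with suffix '/similar'."""
    # Percent-encode the ID so unusual characters cannot alter the path.
    return f"{BASE_URL}/api/v1/models/{quote(model_id, safe='')}{suffix}"


def family_url(family: str) -> str:
    """Build the URL that lists models in a family (the Versions endpoint)."""
    return f"{BASE_URL}/api/v1/models?{urlencode({'family': family})}"


if __name__ == "__main__":
    print("GET", model_url(MODEL_ID))              # capabilities / model IDs
    print("GET", family_url("llama"))              # versions
    print("GET", model_url(MODEL_ID, "/similar"))  # other models
```

Response formats and authentication are not described on the page, so they are left out of this sketch.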
