Llama 3.1 Nemotron 1 Ultra 253B Reasoning is an AI model from NVIDIA. A 253B-parameter reasoning-specialized variant of NVIDIA's Nemotron Ultra, built on Llama 3.1 for complex multi-step inference and agentic workflows.
Specifications
Canonical IDnvidia-llama-3-1-nemotron-1-ultra-253b-reasoning
StatusActive
CreatorNVIDIANVIDIA
Benchmarks
Intelligence Index
15.0
#292
Coding Index
13.1
#260
Math Index
63.7
#102
MMLU-Pro
0.8
#61
GPQA
0.7
#160
HLE
0.1
#169
LiveCodeBench
0.6
#94
AIME
0.7
#27
IFBench
0.4
#241
Time to First Token
0.72s
#291
SciCode
0.3
#176
MATH-500
1.0
#34
AIME 2025
0.6
#102
LCR
0.1
#299
TerminalBench Hard
0.0
#299
TAU2
0.1
#345
Output TPS
41.4
#229

Capabilities

Input0/5
Text·
Image·
Audio·
Video·
PDF·
Output0/5
Text·
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Llama 3.1 Nemotron 1 Ultra 253B ReasoningCurrent
Llama 3.1 Nemotron 1 Ultra 253B128K$0.600$1.80Available

Other models

ModelTierReleasedContextInput / 1MOutput / 1M
Llama 3.3 70B Instruct131K$0.100$0.200
Llama 3.2 3B Instruct131K$0.015$0.020
Llama 3.2 1B Instruct131K$0.027$0.080
Llama 3.1 405B Instruct131K$0.120$0.300
Llama 3.1 70B Instruct131K$0.100$0.100
Llama 3.1 8B Instruct200K$0.020$0.030
Llama 3.1 70B128K$0.600$0.600
Llama 3.1 8B131K$0.030$0.050
Llama 3 70B Instruct131K$0.120$0.300
Llama 3 8B Instruct32K$0.030$0.040

Model IDs