NVIDIA logo

NVIDIA Llama 3.1 Nemotron 1 Ultra 253B


NVIDIA Llama 3.1 Nemotron 1 Ultra 253B is NVIDIA logoNVIDIA's language model with a 128K context window, starting at $0.600 / 1M input and $1.80 / 1M output. A 253B-parameter ultra-scale LLM fine-tuned by NVIDIA on Llama 3.1, targeting high-accuracy reasoning and complex instruction-following tasks.
Spec
Canonical IDnvidia-llama-3-1-nemotron-1-ultra-253b
TypeLanguage
StatusActive
CreatorNVIDIANVIDIA
Providers
Context Window128K tokens
Input ModalitiesText
Output ModalitiesText
Parameters253B
Intelligence Index
15.0
#184
Coding Index
13.1
#162
Math Index
63.7
#58
MMLU-Pro
0.8
#37
GPQA
0.7
#86
HLE
0.1
#92
LiveCodeBench
0.6
#60
AIME
0.7
#15
IFBench
0.4
#144
Time to First Token
0.73s
#217
SciCode
0.3
#103
MATH-500
1.0
#20
AIME 2025
0.6
#58
LCR
0.1
#195
TerminalBench Hard
0.0
#191
TAU2
0.1
#232
Output TPS
42.4
#165

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities1/13
Reasoning·
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Nebius logo
Nebius
nvidia/Llama-3.1-Nemotron-Ultra-253B-v1
$0.600$1.80

Cost Calculator

Preset:
Compares every provider & tier in USD

Other models

ModelTierReleasedContextInput / 1MOutput / 1M
Llama 3.3 70B Instruct131K$0.720$0.720
Llama 3.3 70B Instruct131K$0.100$0.300
Llama 3.3
Llama 3.3 70B Instruct Turbo131K$0.130$0.390
Llama 3.3 70B Versatile128K$0.590$0.790
Llama 3.3 8B Instruct128K
Llama 3.2 11B Vision Instruct128K$0.160$0.160
Llama 3.2 1B Instruct128K$0.027$0.080
Llama 3.2 3B Instruct131K$0.015$0.020
Llama 3.2 90B Vision Instruct128K$0.720$0.720

Model IDs