NVIDIA logo

Llama 3.1 Nemotron 1 Ultra 253B


Llama 3.1 Nemotron 1 Ultra 253B is NVIDIA logoNVIDIA's language model with a 128K context window, starting at $0.600 / 1M input and $1.80 / 1M output. A 253B-parameter ultra-scale LLM fine-tuned by NVIDIA on Llama 3.1, optimized for advanced reasoning and high-accuracy agentic tasks.
Spec
Canonical IDnvidia-llama-3-1-nemotron-1-ultra-253b
TypeLanguage
StatusActive
CreatorNVIDIANVIDIA
Providers
Context Window128K tokens
Input ModalitiesText
Output ModalitiesText
Parameters253B

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities1/13
Reasoning·
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Nebius logo
Nebius
nebius/nvidia/Llama-3.1-Nemotron-Ultra-253B-v1
$0.600$1.80

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Llama 3.1 Nemotron 1 Ultra 253B128K$0.600$1.80Current
Llama 3.1 Nemotron 1 Ultra 253B ReasoningAvailable

Other models

ModelTierReleasedContextInput / 1MOutput / 1M
Llama 3.2 11B128K$0.160$0.160
Llama 3.2 11B Instruct128K$0.350$0.350
Llama 3.2 1B Instruct128K$0.027$0.080
Llama 3.2 3B Instruct131K$0.015$0.020
Llama 3.2 90B128K$0.720$0.720
Llama 3.2 90B Instruct128K$2.00$2.00
Llama 3.2 1B131K$0.100$0.100
Llama 3.2 3B131K$0.040$0.080
Llama 3.1 70B128K$0.600$0.600
Llama 3.1 70B Instruct131K$0.100$0.100

Model IDs