Nemotron 3 Ultra 550B A55B is NVIDIA's language model with a 1.0M context window and up to 66K output tokens, available from 3 providers, starting at $0.5 / 1M input and $2.20 / 1M output. A 550B-parameter mixture-of-experts Nemotron model with 55B active parameters, built for frontier-scale reasoning, tool use, and agentic tasks.
Specifications
Canonical IDnvidia-nemotron-3-ultra-550b-a55b
TypeLanguage
StatusActive
CreatorNVIDIANVIDIA
Providers
Context Window1.0M tokens
Max Output66K tokens
Input ModalitiesText
Output ModalitiesText
Reasoning Effortsdefault
Parameters550B
HuggingFace Likes225
HuggingFace Downloads (30d)111,067
HuggingFace Downloads (all-time)111,067
Release Date · 23 days ago
Benchmarks
Intelligence Index
37.8
#42
Coding Index
49.3
#28
GPQA
0.9
#40
HLE
0.3
#39
IFBench
0.8
#2
Time to First Token
1.04s
#345
SciCode
0.4
#98
LCR
0.7
#41
TerminalBench Hard
0.4
#51
TAU2
0.8
#99
Output TPS

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities5/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs
Native JSON Schema
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandard
Input
$ / 1M
Output
$ / 1M
Cache Read
$ / 1M
Hugging Face logo
Hugging Face
together_ai:nvidia/nemotron-3-ultra-550b-a55b
$0.6$3.60N/A
OpenRouter logo
OpenRouter
nvidia/nemotron-3-ultra-550b-a55b
$0.5$2.20$0.1
Vercel AI Gateway logo
Vercel AI Gateway
nvidia/nemotron-3-ultra-550b-a55b
$0.6$2.40$0.12

Cost Calculator

US Dollar ($)
Preset:

Other Models

ModelTierReleasedContextInput / 1MOutput / 1M
Nemotron 4 15B
Nemotron 3.5 Content Safety128K
Nemotron Nano 3 30B A3B Omni Reasoning256K
Nemotron Super 3 120B256K$0.150$0.650
Nemotron Nano 3 30B262K$0.060$0.240
Nemotron Nano 3 30B A3B Omni
Nemotron Nano 3 30B A3B Reasoning
Nemotron Nano 3 4B
Nemotron 3
Nemotron 3 Nano 30B A3BNano

Model IDs

huggingface-reasoning-nvidia-nemotron-3-ultra-550b-a55b-nvfp4
nvidia-nemotron-3-ultra-550b-a55b
nvidia/nemotron-3-ultra-550b-a55b
nvidia/nemotron-3-ultra-550b-a55b:free
nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B
nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16