Nemotron 3 Ultra 550B A55B is NVIDIA's language model with a 1.0M context window and up to 66K output tokens, available from 3 providers, starting at $0.500 / 1M input and $2.50 / 1M output. A 550B-parameter mixture-of-experts Nemotron model with 55B active parameters, built for frontier-scale reasoning, tool use, and agentic tasks.
Specifications
Canonical IDnvidia-nemotron-3-ultra-550b-a55b
TypeLanguage
StatusActive
CreatorNVIDIANVIDIA
Providers
Context Window1.0M tokens
Max Output66K tokens
Input ModalitiesText
Output ModalitiesText
Reasoning Effortsdefault
Parameters550B
Release Date · 1 day ago
Benchmarks
Intelligence Index
47.7
#31
Coding Index
37.6
#54
GPQA
0.9
#35
HLE
0.3
#34
IFBench
0.8
#2
Time to First Token
0.51s
#257
SciCode
0.4
#93
LCR
0.7
#38
TerminalBench Hard
0.4
#48
TAU2
0.8
#96
Output TPS

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities5/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs
Native JSON Schema
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Cache Read
$ / 1M
OpenRouter logo
OpenRouter
nvidia/nemotron-3-ultra-550b-a55b
$0.500$2.50$0.150
Vercel AI Gateway logo
Vercel AI Gateway
nvidia/nemotron-3-ultra-550b-a55b
$0.500$2.50$0.150
Hugging Face logo
Hugging Face
together_ai:nvidia/nemotron-3-ultra-550b-a55b
$0.600$3.60N/A

Cost Calculator

Preset:

Other models

ModelTierReleasedContextInput / 1MOutput / 1M
Nemotron 4 15B
Nemotron 3.5 Content Safety128K
Nemotron Nano 3 30B A3B Omni Reasoning256K
Nemotron Super 3 120B256K$0.150$0.650
Nemotron Nano 3 30B262K$0.060$0.240
Nemotron Nano 3 30B A3B Reasoning
Nemotron Nano 3 30B A3B Omni
Nemotron Nano 3 4B
Nemotron 3
Nemotron 3 Nano 30B A3BNano

Model IDs