Nemotron 3 Ultra 550B A55B is NVIDIA's language model with a 1.0M context window and up to 66K output tokens, available from 3 providers, starting at $0.5 / 1M input and $2.20 / 1M output. A 550B-parameter mixture-of-experts Nemotron model with 55B active parameters, built for frontier-scale reasoning, tool use, and agentic tasks.
Capabilities
Input1/5
Text✓
Image·
Audio·
Video·
PDF·
Output1/5
Text✓
Image·
Audio·
Video·
Embedding·
Capabilities5/13
Reasoning✓
Adaptive Reasoning·
Function Calling✓
Parallel Function Calling·
Structured Outputs✓
Native JSON Schema✓
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching✓
Assistant Prefill·
Pricing by Provider
US Dollar ($)
Per 1M tokens
| Provider | Standard | ||
|---|---|---|---|
| Input $ / 1M | Output $ / 1M | Cache Read $ / 1M | |
| $0.6 | $3.60 | N/A | |
| $0.5 | $2.20 | $0.1 | |
| $0.6 | $2.40 | $0.12 | |
Cost Calculator
US Dollar ($)
Preset:
Other Models
| Model | Tier | Released | Context | Input / 1M | Output / 1M |
|---|---|---|---|---|---|
| Nemotron 4 15B | — | — | — | — | — |
| Nemotron 3.5 Content Safety | — | 128K | — | — | |
| Nemotron Nano 3 30B A3B Omni Reasoning | — | 256K | — | — | |
| Nemotron Super 3 120B | — | 256K | $0.150 | $0.650 | |
| Nemotron Nano 3 30B | — | 262K | $0.060 | $0.240 | |
| Nemotron Nano 3 30B A3B Omni | — | — | — | — | — |
| Nemotron Nano 3 30B A3B Reasoning | — | — | — | — | — |
| Nemotron Nano 3 4B | — | — | — | — | — |
| Nemotron 3 | — | — | — | — | — |
| Nemotron 3 Nano 30B A3B | Nano | — | — | — | — |