Nemotron 3 Ultra 550B A55B is NVIDIA's language model with a 1.0M context window and up to 66K output tokens, available from 3 providers, starting at $0.500 / 1M input and $2.50 / 1M output. A 550B-parameter mixture-of-experts Nemotron model with 55B active parameters, built for frontier-scale reasoning, tool use, and agentic tasks.
Capabilities
Input1/5
Text✓
Image·
Audio·
Video·
PDF·
Output1/5
Text✓
Image·
Audio·
Video·
Embedding·
Capabilities5/13
Reasoning✓
Adaptive Reasoning·
Function Calling✓
Parallel Function Calling·
Structured Outputs✓
Native JSON Schema✓
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching✓
Assistant Prefill·
Pricing by Provider
| Provider | Standard | ||
|---|---|---|---|
| Input $ / 1M | Output $ / 1M | Cache Read $ / 1M | |
OpenRouter | $0.500 | $2.50 | $0.150 |
Vercel AI Gateway | $0.500 | $2.50 | $0.150 |
Hugging Face | $0.600 | $3.60 | N/A |
Cost Calculator
Preset:
Other models
| Model | Tier | Released | Context | Input / 1M | Output / 1M |
|---|---|---|---|---|---|
| Nemotron 4 15B | — | — | — | — | — |
| Nemotron 3.5 Content Safety | — | 128K | — | — | |
| Nemotron Nano 3 30B A3B Omni Reasoning | — | 256K | — | — | |
| Nemotron Super 3 120B | — | 256K | $0.150 | $0.650 | |
| Nemotron Nano 3 30B | — | 262K | $0.060 | $0.240 | |
| Nemotron Nano 3 30B A3B Reasoning | — | — | — | — | — |
| Nemotron Nano 3 30B A3B Omni | — | — | — | — | — |
| Nemotron Nano 3 4B | — | — | — | — | — |
| Nemotron 3 | — | — | — | — | — |
| Nemotron 3 Nano 30B A3B | Nano | — | — | — | — |