Nemotron 3 Ultra is NVIDIA's language model. A frontier-scale Nemotron large language model from NVIDIA designed for strong agentic, reasoning, and conversational capabilities at extreme parameter counts.
Capabilities
Input1/5
Text✓
Image·
Audio·
Video·
PDF·
Output1/5
Text✓
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·
Other Models
| Model | Tier | Released | Context | Input / 1M | Output / 1M |
|---|---|---|---|---|---|
| Nemotron Super 3 120B A12B | — | 1.0M | $0.090 | $0.450 | |
| Nemotron Nano 3 30B A3B | — | 262K | $0.050 | $0.200 | |
| Nemotron Nano 2 12B | — | 131K | $0.200 | $0.200 | |
| Nemotron Nano 2 9B | — | 131K | $0.040 | $0.160 |