Nemotron 3 Ultra is NVIDIA's language model. A frontier-scale Nemotron large language model from NVIDIA designed for strong agentic, reasoning, and conversational capabilities at extreme parameter counts.
Specifications
Canonical IDnvidia-nemotron-3-ultra
TypeLanguage
StatusActive
CreatorNVIDIANVIDIA
Providers
Input ModalitiesText
Output ModalitiesText

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

US Dollar ($)
Per 1M tokens
ProviderStandard
Input
$ / 1M
Output
$ / 1M
Hugging Face logo
Hugging Face
fireworks_ai:accounts/fireworks/models/nemotron-3-ultra-nvfp4
$0.6$2.40

Cost Calculator

US Dollar ($)
Preset:

Other Models

ModelTierReleasedContextInput / 1MOutput / 1M
Nemotron Super 3 120B A12B1.0M$0.085$0.400
Nemotron Nano 3 30B A3B262K$0.050$0.200
Nemotron Nano 2 12B131K$0.200$0.200
Nemotron Nano 2 9B131K$0.040$0.160

Model IDs

accounts/fireworks/models/nemotron-3-ultra-bf16
accounts/fireworks/models/nemotron-3-ultra-nvfp4
nvidia-nemotron-3-ultra