Name: Nemotron Super 3 120B
Brand: NVIDIA

Nemotron Super 3 120B is NVIDIA's language model with a 256K context window and up to 33K output tokens, starting at $0.15 / 1M input and $0.65 / 1M output. A 120B-parameter third-generation Nemotron Super model from NVIDIA, engineered for highest compute efficiency and accuracy in multi-agent applications.

Specifications
Canonical ID	`nvidia-nemotron-super-3-120b`
Type	Language
Status	Active
Creator	NVIDIA
Providers	Amazon Bedrock
Context Window	256K tokens
Max Output	33K tokens
Input Modalities	Text
Output Modalities	Text
Reasoning Efforts	default
Parameters	120B
Release Date	2026-03-18 · 4 months ago

Capabilities

Input1/5

Text✓

Image·

Audio·

Video·

PDF·

Output1/5

Text✓

Image·

Audio·

Video·

Embedding·

Capabilities2/13

Reasoning✓

Adaptive Reasoning·

Function Calling✓

Parallel Function Calling·

Structured Outputs·

Native JSON Schema·

Web Search·

URL Context·

Computer Use·

Code Execution·

File Search·

Prompt Caching·

Assistant Prefill·

Pricing by Provider

US Dollar ($)

Per 1M tokens

Provider	Standard		Batch		Flex		Priority
Provider	Input $ / 1M	Output $ / 1M	Input $ / 1M	Output $ / 1M	Input $ / 1M	Output $ / 1M	Input $ / 1M	Output $ / 1M
Amazon Bedrock `nvidia.nemotron-super-3-120b`	$0.15	$0.65	$0.075	$0.325	$0.075	$0.325	$0.263	$1.14

Cost Calculator

US Dollar ($)

Preset:

Input tokens

Output tokens

Number of calls

Cheapest Instances to Run It

Cloud GPU instances that can host Nemotron Super 3 120B, ranked by cheapest on-demand price. The model needs about 288 GB of GPU memory at FP16 precision (estimated from its parameter count), so treat the fit as guidance rather than a guarantee.

All clouds

FP16 (full precision)

US Dollar ($)

Instance	Cloud	GPU	VRAM	Price	Cheapest region
Standard_NC96ads_A100_v4	Azure	4× NVIDIA A100	320 GB	$14.69/hr	westus2
g7e.24xlarge	AWS	4× RTX PRO Server 6000	384 GB	$16.57/hr	us-east-1
p4d.24xlarge	AWS	8× A100	320 GB	$21.96/hr	us-west-2
7 more instances can run Nemotron Super 3 120B Unlock the full ranked list and FP8 / INT4 quantization with a CloudPrice subscription.

Versions

Version	Released	Context	Input / 1M	Output / 1M	Status
Nemotron 4 15B	—	—	—	—	Available
Nemotron 3 Ultra 550B A55B	2026-06-04	1.0M	$0.600	$2.40	Available
Nemotron 3.5 Content Safety	2026-06-04	128K	—	—	Available
Nemotron Nano 3 30B A3B Omni Reasoning	2026-04-28	256K	—	—	Available
Nemotron Super 3 120B	2026-03-18	256K	$0.150	$0.650	Current
Nemotron Nano 3 30B	2025-12-23	262K	$0.060	$0.240	Available
Nemotron Nano 3 30B A3B Omni	—	—	—	—	Available
Nemotron Nano 3 30B A3B Reasoning	—	—	—	—	Available
Nemotron Nano 3 4B	—	—	—	—	Available
Nemotron 3	—	—	—	—	Available
Nemotron 3 Nano 30B A3B	—	—	—	—	Available

Model IDs

nvidia-nemotron-super-3-120b

nvidia.nemotron-super-3-120b

Nemotron Super 3 120B

CapabilitiesAPIGET/api/v1/models/nvidia-nemotron-super-3-120b

Pricing by ProviderAPIGET/api/v1/models/nvidia-nemotron-super-3-120b/pricing

Cost CalculatorAPIGET/api/v1/models/nvidia-nemotron-super-3-120b/pricing/calculate?input_tokens=1000000&output_tokens=500000

Cheapest Instances to Run ItAPIGET/api/v1/models/nvidia-nemotron-super-3-120b/instances

VersionsAPIGET/api/v1/models?family=nemotron

Model IDsAPIGET/api/v1/models/nvidia-nemotron-super-3-120b

Capabilities

Pricing by Provider

Cost Calculator

Cheapest Instances to Run It

Versions

Model IDs