Name: Nemotron 120B A12B
Brand: NVIDIA

Nemotron 120B A12B is NVIDIA's language model, starting at $0.300 / 1M input and $0.750 / 1M output. A 120B-parameter hybrid Mixture-of-Experts LLM from NVIDIA with 12B active parameters, designed for compute-efficient reasoning and agentic workloads.

Specifications
Canonical ID	`nvidia-nemotron-120b-a12b`
Type	Language
Status	Active
Creator	NVIDIA
Providers	other/baseten
Input Modalities	Text
Output Modalities	Text
Parameters	120B

Capabilities

Input1/5

Text✓

Image·

Audio·

Video·

PDF·

Output1/5

Text✓

Image·

Audio·

Video·

Embedding·

Capabilities0/13

Reasoning·

Adaptive Reasoning·

Function Calling·

Parallel Function Calling·

Structured Outputs·

Native JSON Schema·

Web Search·

URL Context·

Computer Use·

Code Execution·

File Search·

Prompt Caching·

Assistant Prefill·

Pricing by Provider

Provider	Standard
Provider	Input $ / 1M	Output $ / 1M
Other/Baseten baseten/nvidia/Nemotron-120B-A12B	$0.300	$0.750

Cost Calculator

Preset:

Input tokens

Output tokens

Number of calls

Versions

Version	Released	Context	Input / 1M	Output / 1M	Status
Nemotron 4 15B	—	—	—	—	Available
Nemotron Nano 3 30B A3B Omni Reasoning	2026-04-28	256K	—	—	Available
Nemotron Super 3 120B	2026-03-18	256K	$0.150	$0.650	Available
Nemotron Nano 3 30B	2025-12-23	262K	$0.060	$0.240	Available
Nemotron Nano 3 30B A3B Reasoning	—	—	—	—	Available
Nemotron Nano 3 30B A3B Omni	—	—	—	—	Available
Nemotron Nano 3 4B	—	—	—	—	Available
Nemotron 3	—	—	—	—	Available
Nemotron 3 Nano 30B A3B	—	—	—	—	Available
Nemotron 3 Super	—	—	—	—	Available
Nemotron 120B A12B	—	—	$0.300	$0.750	Current

Nemotron 120B A12B

Capabilities

Pricing by Provider

Cost Calculator

Versions

Model IDs