Nemotron 120B A12B is a language model from NVIDIA, priced from $0.300 per 1M input tokens and $0.750 per 1M output tokens. It is a 120B-parameter hybrid Mixture-of-Experts LLM with 12B active parameters, designed for compute-efficient reasoning and agentic workloads.
Specifications
Canonical ID: nvidia-nemotron-120b-a12b
Type: Language
Status: Active
Creator: NVIDIA
Providers
Input Modalities: Text
Output Modalities: Text
Parameters: 120B

Capabilities

Input (1 of 5 supported)
Text: supported
Image: not supported
Audio: not supported
Video: not supported
PDF: not supported
Output (1 of 5 supported)
Text: supported
Image: not supported
Audio: not supported
Video: not supported
Embedding: not supported
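Capabilities (0 of 13 supported)
Reasoning: not supported
Adaptive Reasoning: not supported
Function Calling: not supported
Parallel Function Calling: not supported
Structured Outputs: not supported
Native JSON Schema: not supported
Web Search: not supported
URL Context: not supported
Computer Use: not supported
Code Execution: not supported
File Search: not supported
Prompt Caching: not supported
Assistant Prefill: not supported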

Pricing by Provider

Provider | Model ID | Standard Input ($ / 1M) | Standard Output ($ / 1M)
Other/Baseten | baseten/nvidia/Nemotron-120B-A12B | $0.300 | $0.750
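
The listed rates translate into a per-request cost with simple arithmetic. The sketch below is illustrative only: it assumes "$ / 1M" means US dollars per one million tokens, uses the Baseten rates shown above, and ignores any provider-specific billing rules (minimums, rounding, discounts). The function name `estimate_cost` and the token counts in the example are hypothetical.

```python
from decimal import Decimal

# Rates taken from the pricing table above (USD per 1M tokens, Baseten listing).
# Assumption: "$ / 1M" refers to one million tokens.
INPUT_PRICE_PER_M = Decimal("0.300")
OUTPUT_PRICE_PER_M = Decimal("0.750")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of a request at the listed per-token rates.

    Hypothetical helper for illustration; not part of any provider SDK.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # Hypothetical workload: 200,000 input tokens and 50,000 output tokens.
    print(estimate_cost(200_000, 50_000))
```

Decimal is used instead of float so that small per-token amounts do not pick up binary rounding error when summed across many requests.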

Versions

Version | Released | Context | Input / 1M | Output / 1M | Status
Nemotron 4 15B |  |  |  |  | Available
Nemotron Nano 3 30B A3B Omni Reasoning |  | 256K |  |  | Available
Nemotron Super 3 120B |  | 256K | $0.150 | $0.650 | Available
Nemotron Nano 3 30B |  | 262K | $0.060 | $0.240 | Available
Nemotron Nano 3 30B A3B Reasoning |  |  |  |  | Available
Nemotron Nano 3 30B A3B Omni |  |  |  |  | Available
Nemotron Nano 3 4B |  |  |  |  | Available
Nemotron 3 |  |  |  |  | Available
Nemotron 3 Nano 30B A3B |  |  |  |  | Available
Nemotron 3 Super |  |  |  |  | Available
Nemotron 120B A12B |  |  | $0.300 | $0.750 | Current

Model IDs