Nemotron 120B A12B


Nemotron 120B A12B is NVIDIA's language model, starting at $0.300 / 1M input tokens and $0.750 / 1M output tokens. It is a 120B-parameter hybrid Mixture-of-Experts LLM with 12B active parameters, designed for compute-efficient reasoning and agentic workloads.
Spec
Canonical ID: nvidia-nemotron-120b-a12b
Type: Language
Status: Active
Creator: NVIDIA
Providers: Baseten
Input Modalities: Text
Output Modalities: Text
Parameters: 120B

Capabilities

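Input (1 of 5 supported)
Text: supported
Image: not supported
Audio: not supported
Video: not supported
PDF: not supported

Output (1 of 5 supported)
Text: supported
Image: not supported
Audio: not supported
Video: not supported
Embedding: not supported

Capabilities (0 of 13 supported)
Reasoning: not supported
Adaptive Reasoning: not supported
Function Calling: not supported
Parallel Function Calling: not supported
Structured Outputs: not supported
Native JSON Schema: not supported
Web Search: not supported
URL Context: not supported
Computer Use: not supported
Code Execution: not supported
File Search: not supported
Prompt Caching: not supported
Assistant Prefill: not supported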

Pricing by Provider

Provider | Model ID | Standard input ($ / 1M tokens) | Standard output ($ / 1M tokens)
Other/Baseten | baseten/nvidia/Nemotron-120B-A12B | $0.300 | $0.750
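
The per-token prices above translate into a request cost with simple arithmetic. The following is a minimal sketch, assuming the Baseten standard-tier prices listed in this section; the helper name `estimate_cost` and the example token counts are illustrative and not part of any provider API.

```python
from decimal import Decimal

# Prices from the pricing table above (USD per 1M tokens, standard tier).
INPUT_PRICE_PER_M = Decimal("0.300")
OUTPUT_PRICE_PER_M = Decimal("0.750")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of a workload (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


# Made-up workload: 2M input tokens and 500K output tokens.
cost = estimate_cost(2_000_000, 500_000)
print(f"${cost:.4f}")  # prints $0.9750
```

Decimal is used instead of float so that small per-token prices do not pick up binary rounding error when summed over many requests.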

Versions

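Version | Released | Context | Input / 1M | Output / 1M | Status
Nemotron 4 15B | - | - | - | - | Available
Nemotron Super 3 120B | - | 256K | $0.150 | $0.650 | Available
Nemotron Nano 3 30B | - | 262K | $0.060 | $0.240 | Available
Nemotron 3 | - | - | - | - | Available
Nemotron 3 Nano 30B A3B | - | - | - | - | Available
Nemotron 3 Super | - | - | - | - | Available
Nemotron 3 Super 120B | - | - | - | - | Available
Nemotron 3 Super 120B A12B | - | - | - | - | Available
Nemotron Nano 3 30B A3B Reasoning | - | - | - | - | Available
Nemotron Nano 3 4B | - | - | - | - | Available
Nemotron 120B A12B | - | - | $0.300 | $0.750 | Current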

Model IDs