NVIDIA Nemotron Nano 9B V2

NVIDIA Nemotron Nano 9B V2 is a text model from DeepInfra with a context window of 131K tokens and max output of 131K tokens. Pricing starts at 0.04 per million input tokens and 0.16 per million output tokens (cheapest at DeepInfra).

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Specifications

Model Keydeepinfra/nvidia/NVIDIA-Nemotron-Nano-9B-v2
ProviderDeepInfra
Provider IDdeepinfra
ModeText
Canonical Namenemotron-nano-9b-2
Context Window131K tokens
Max Output131K tokens

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens0.0000400.040
Output Tokens0.0001600.160

Benchmarks

Intelligence Index13.2#149
Coding Index7.5#146
Math Index62.3#42
MMLU-Pro0.7#90
GPQA0.6#117
HLE0.0#174
LiveCodeBench0.7#32
IFBench0.3#151
Time to First Token0.58s#140
SciCode0.2#155
AIME 20250.6#42
LCR0.2#98
TerminalBench Hard0.0#138
TAU20.2#111

Price Comparison by Provider

Compare prices for NVIDIA Nemotron Nano 9B V2 across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price, $
Output Price, $
AWS Bedrocknvidia.nemotron-nano-9b-v20.0600.230
Fireworks AIfireworks_ai/accounts/fireworks/models/nvidia-nemotron-nano-9b-v20.2000.200
DeepInfradeepinfra/nvidia/NVIDIA-Nemotron-Nano-9B-v20.0400.160

All Variants

All available versions, regions, and API endpoints for NVIDIA Nemotron Nano 9B V2.

Model Key
Provider
Mode
Input Price, $
Output Price, $
Context
Max Output
Vision
Functions
nvidia.nemotron-nano-9b-v2AWS BedrockText0.0600.230128K8Knono
deepinfra/nvidia/NVIDIA-Nemotron-Nano-9B-v2DeepInfraText0.0400.160131K131Knoyes
fireworks_ai/accounts/fireworks/models/nvidia-nemotron-nano-9b-v2Fireworks AIText0.2000.200131K131Knono