NVIDIA Nemotron Nano 9 B V2 Pricing & Specs | AI Models

NVIDIA Nemotron Nano 9B V2 is a text model from DeepInfra with a context window of 131K tokens and max output of 131K tokens. Pricing starts at 0.04 per million input tokens and 0.16 per million output tokens (cheapest at DeepInfra).

Capabilities

✗ Vision✓ Function Calling✗ Reasoning✗ JSON Schema✗ System Messages✗ Web Search✗ Prompt Caching✗ Audio Input✗ Audio Output

Specifications

Model Key	`deepinfra/nvidia/NVIDIA-Nemotron-Nano-9B-v2`
Provider	DeepInfra
Provider ID	deepinfra
Mode	Text
Canonical Name	nemotron-nano-9b-2
Context Window	131K tokens
Max Output	131K tokens

Pricing

Type	Per 1K Tokens	Per 1M Tokens
Input Tokens	0.000040	0.040
Output Tokens	0.000160	0.160

Benchmarks

Intelligence Index	13.2#149
Coding Index	7.5#146
Math Index	62.3#42
MMLU-Pro	0.7#90
GPQA	0.6#117
HLE	0.0#174
LiveCodeBench	0.7#32
IFBench	0.3#151
Time to First Token	0.58s#140
SciCode	0.2#155
AIME 2025	0.6#42
LCR	0.2#98
TerminalBench Hard	0.0#138
TAU2	0.2#111

Price Comparison by Provider

Compare prices for NVIDIA Nemotron Nano 9B V2 across different providers. The same model may be available through multiple providers at different price points.

Provider	Model Key	Input Price, $	Output Price, $
AWS Bedrock	nvidia.nemotron-nano-9b-v2	0.060	0.230
Fireworks AI	fireworks_ai/accounts/fireworks/models/nvidia-nemotron-nano-9b-v2	0.200	0.200
DeepInfra	deepinfra/nvidia/NVIDIA-Nemotron-Nano-9B-v2	0.040	0.160

All Variants

All available versions, regions, and API endpoints for NVIDIA Nemotron Nano 9B V2.

Model Key	Provider	Mode	Input Price, $	Output Price, $	Context	Max Output	Vision	Functions
nvidia.nemotron-nano-9b-v2	AWS Bedrock	Text	0.060	0.230	128K	8K	no	no
deepinfra/nvidia/NVIDIA-Nemotron-Nano-9B-v2	DeepInfra	Text	0.040	0.160	131K	131K	no	yes
fireworks_ai/accounts/fireworks/models/nvidia-nemotron-nano-9b-v2	Fireworks AI	Text	0.200	0.200	131K	131K	no	no

← Back to All Models