NVIDIA logo

Nemotron Nano 2 12B VL


Nemotron Nano 2 12B VL is NVIDIA logoNVIDIA's language model with a 131K context window and up to 4K output tokens, available from 3 providers, starting at $0.100 / 1M input and $0.100 / 1M output. A 12B-parameter multimodal model from NVIDIA's Nemotron Nano 2 series using a hybrid Transformer-Mamba architecture for video and document understanding.
Spec
Canonical IDnvidia-nemotron-nano-2-12b-vl
TypeLanguage
StatusActive
CreatorNVIDIANVIDIA
Providers
Context Window131K tokens
Max Output4K tokens
Input ModalitiesImageTextVideo
Output ModalitiesText
Reasoning Effortsdefault
Parameters12B
Release Date · 6 months ago
Intelligence Index
10.1
#269
Coding Index
5.9
#231
Math Index
26.7
#132
MMLU-Pro
0.6
#178
GPQA
0.4
#224
HLE
0.0
#224
LiveCodeBench
0.3
#135
IFBench
0.3
#241
Time to First Token
0.66s
#212
SciCode
0.2
#242
AIME 2025
0.3
#132
LCR
0.2
#165
TerminalBench Hard
0.0
#261
TAU2
0.2
#194
Output TPS

Capabilities

Input3/5
Text
Image
Audio·
Video
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities2/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Fireworks AI logo
Fireworks AI
accounts/fireworks/models/nemotron-nano-v2-12b-vl
$0.100$0.100
OpenRouter logo
OpenRouter
nvidia/nemotron-nano-12b-v2-vl
$0.200$0.600
Vercel AI Gateway logo
Vercel AI Gateway
nvidia/nemotron-nano-12b-v2-vl
$0.200$0.600

Cost Calculator

Preset:
Compares every provider & tier in USD

Model IDs