Llama 3.3 Nemotron Super 49B is NVIDIA's language model. A 49B-parameter compute-efficient LLM fine-tuned by NVIDIA on Llama 3.3, targeting multi-agent and agentic system workloads with the Nemotron Super architecture.
Specifications
Canonical IDnvidia-llama-3-3-nemotron-super-49b
TypeLanguage
StatusActive
CreatorNVIDIANVIDIA
Input ModalitiesText
Output ModalitiesText
Parameters49B
Benchmarks
Intelligence Index
14.3
#312
Coding Index
7.6
#322
Math Index
7.7
#229
MMLU-Pro
0.7
#205
GPQA
0.5
#301
HLE
0.0
#429
LiveCodeBench
0.3
#220
AIME
0.2
#94
IFBench
0.4
#228
Time to First Token
0.00s
#158
SciCode
0.2
#316
MATH-500
0.8
#102
AIME 2025
0.1
#229
LCR
0.1
#287
TerminalBench Hard
0.0
#373
Output TPS
0.0
#429

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Llama Nemotron 1.5 Super 49B ReasoningAvailable
Llama Nemotron 1.5 Super 49BAvailable
Llama 3.3 Nemotron Super 49BCurrent
Llama 3.3 Nemotron Super 49B ReasoningAvailable

Other models

ModelTierReleasedContextInput / 1MOutput / 1M
Llama 3.3 70B Instruct131K$0.100$0.200
Llama 3.2 3B Instruct131K$0.015$0.020
Llama 3.2 1B Instruct131K$0.027$0.080
Llama 3.1 405B Instruct131K$0.120$0.300
Llama 3.1 70B Instruct131K$0.100$0.100
Llama 3.1 8B Instruct200K$0.020$0.030
Llama 3.1 70B128K$0.600$0.600
Llama 3.1 8B131K$0.030$0.050
Llama 3 70B Instruct131K$0.120$0.300
Llama 3 8B Instruct32K$0.030$0.040

Model IDs