NVIDIA logo

Llama 3.3 Nemotron Super 49B V1.5


Llama 3.3 Nemotron Super 49B V1.5 is NVIDIA logoNVIDIA's language model with a 131K context window, available from 2 providers, starting at $0.100 / 1M input and $0.400 / 1M output. A 49B-parameter reasoning and chat LLM derived from Llama 3.3 70B, post-trained by NVIDIA for agentic workflows including RAG and tool use with a 128K context window.
Spec
Canonical IDnvidia-llama-3-3-nemotron-1-5-super-49b
TypeLanguage
StatusActive
CreatorNVIDIANVIDIA
Providers
Context Window131K tokens
Input ModalitiesText
Output ModalitiesText
Reasoning Effortsdefault
Parameters49B
Release Date · 6 months ago

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities2/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
DeepInfra logo
DeepInfra
nvidia/Llama-3.3-Nemotron-Super-49B-v1.5
$0.100$0.400
OpenRouter logo
OpenRouter
nvidia/llama-3.3-nemotron-super-49b-v1.5
$0.100$0.400

Cost Calculator

Preset:
Compares every provider & tier in USD

Other models

ModelTierReleasedContextInput / 1MOutput / 1M
Llama 3.3 70B Instruct131K$0.720$0.720
Llama 3.3 70B Instruct131K$0.100$0.300
Llama 3.3
Llama 3.3 70B Instruct Turbo131K$0.130$0.390
Llama 3.3 70B Versatile128K$0.590$0.790
Llama 3.3 8B Instruct128K
Llama 3.2 11B Vision Instruct128K$0.160$0.160
Llama 3.2 1B Instruct128K$0.027$0.080
Llama 3.2 3B Instruct131K$0.015$0.020
Llama 3.2 90B Vision Instruct128K$0.720$0.720

Model IDs