Llama 3.3 Nemotron Super 49B V1.5 is
NVIDIA's language model with a 131K context window, available from 2 providers, starting at $0.100 / 1M input and $0.400 / 1M output. A 49B-parameter reasoning and chat LLM derived from Llama 3.3 70B, post-trained by NVIDIA for agentic workflows including RAG and tool use with a 128K context window.
Capabilities
Input1/5
✓
·
·
·
·
Output1/5
✓
·
·
·
·
Capabilities2/13
✓
·
✓
·
·
·
·
·
·
·
·
·
·
Pricing by Provider
| Provider | Standard | |
|---|---|---|
| Input $ / 1M | Output $ / 1M | |
DeepInfra | $0.100 | $0.400 |
OpenRouter | $0.100 | $0.400 |
Cost Calculator
Preset:
Compares every provider & tier in USD
Other models
| Model | Tier | Released | Context | Input / 1M | Output / 1M |
|---|---|---|---|---|---|
| Llama 3.3 70B Instruct | — | 131K | $0.720 | $0.720 | |
| Llama 3.3 70B Instruct | — | 131K | $0.100 | $0.300 | |
| Llama 3.3 | — | — | — | — | — |
| Llama 3.3 70B Instruct Turbo | — | — | 131K | $0.130 | $0.390 |
| Llama 3.3 70B Versatile | — | — | 128K | $0.590 | $0.790 |
| Llama 3.3 8B Instruct | — | — | 128K | — | — |
| Llama 3.2 11B Vision Instruct | — | 128K | $0.160 | $0.160 | |
| Llama 3.2 1B Instruct | — | 128K | $0.027 | $0.080 | |
| Llama 3.2 3B Instruct | — | 131K | $0.015 | $0.020 | |
| Llama 3.2 90B Vision Instruct | — | 128K | $0.720 | $0.720 |