Llama 3.1 Nemotron Ultra 253B V1
Llama 3.1 Nemotron Ultra 253B V1 is a text model from Nebius with a context window of 128K tokens and max output of 128K tokens. Pricing starts at 0.60 per million input tokens and 1.80 per million output tokens.
Capabilities
✗ Vision✓ Function Calling✗ Reasoning✗ JSON Schema✗ System Messages✗ Web Search✗ Prompt Caching✗ Audio Input✗ Audio Output
Specifications
| Model Key | nebius/nvidia/Llama-3.1-Nemotron-Ultra-253B-v1 |
| Provider | Nebius |
| Provider ID | nebius |
| Mode | Text |
| Canonical Name | llama-nemotron-ultra-3.1-253b-1 |
| Context Window | 128K tokens |
| Max Output | 128K tokens |
Pricing
| Type | Per 1K Tokens | Per 1M Tokens |
|---|---|---|
| Input Tokens | 0.000600 | 0.600 |
| Output Tokens | 0.0018 | 1.80 |