Llama3.1 Nemotron 70B Instruct


Llama3.1 Nemotron 70B Instruct is NVIDIA's language model with a 131K-token context window, priced from $0.120 per 1M input tokens and $0.300 per 1M output tokens. It is a 70B instruction-tuned LLM built on the fused Llama3.1 base and fine-tuned by NVIDIA's Nemotron process to improve helpfulness and alignment.
Spec
Canonical ID: nvidia-llama3-1-nemotron-70b-instruct
Type: Language
Status: Active
Creator: NVIDIA
Providers: Lambda
Context Window: 131K tokens
Input Modalities: Text
Output Modalities: Text
Parameters: 70B

Capabilities

Input (1 of 5 supported)
Supported: Text
Not supported: Image, Audio, Video, PDF

Output (1 of 5 supported)
Supported: Text
Not supported: Image, Audio, Video, Embedding

Capabilities (2 of 13 supported)
Supported: Function Calling, Parallel Function Calling
Not supported: Reasoning, Adaptive Reasoning, Structured Outputs, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Pricing by Provider

Provider   Tier       Model ID                                       Input ($ / 1M)   Output ($ / 1M)
Lambda     Standard   lambda_ai/llama3.1-nemotron-70b-instruct-fp8   $0.120           $0.300
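
The listed rates translate into a per-request cost with simple arithmetic. The sketch below is illustrative only: it hard-codes the Lambda prices shown above, and the function name `estimate_cost` and the constant names are hypothetical, not part of any provider API. It uses `Decimal` so the result does not pick up floating-point rounding noise.

```python
from decimal import Decimal

# Prices copied from the Lambda row above, in USD per 1M tokens.
# These constants are assumptions for illustration; update them if pricing changes.
INPUT_USD_PER_M = Decimal("0.120")
OUTPUT_USD_PER_M = Decimal("0.300")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of a request from its input and output token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / TOKENS_PER_M
```

For instance, a request with 100,000 input tokens and 20,000 output tokens comes to $0.012 + $0.006 = $0.018 at these rates.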

Versions

Version                          Released   Context   Input / 1M   Output / 1M   Status
Llama 3.2 11B                    -          128K      $0.160       $0.160        Available
Llama 3.2 1B Instruct            -          128K      $0.027       $0.080        Deprecated
Llama 3.2 3B Instruct            -          131K      $0.015       $0.020        Deprecated
Llama 3.2 90B                    -          128K      $0.720       $0.720        Available
Llama 3.2 1B                     -          131K      $0.100       $0.100        Available
Llama 3.2 3B                     -          131K      $0.040       $0.080        Available
Llama 3.1 70B                    -          128K      $0.600       $0.600        Available
Llama 3.1 70B Instruct           -          131K      $0.100       $0.100        Available
Llama 3.1 8B                     -          131K      $0.030       $0.050        Available
Llama 3.1 8B Instruct            -          200K      $0.020       $0.030        Available
Llama3.1 Nemotron 70B Instruct   -          131K      $0.120       $0.300        Current

Model IDs