Meta Llama 3.1 405B Instruct
Meta Llama 3.1 405B Instruct is a text model from Nebius with a context window of 128K tokens and max output of 128K tokens. Pricing starts at 1.00 per million input tokens and 3.00 per million output tokens (cheapest at Fireworks AI).
Capabilities
✗ Vision✓ Function Calling✗ Reasoning✗ JSON Schema✗ System Messages✗ Web Search✗ Prompt Caching✗ Audio Input✗ Audio Output
Specifications
| Model Key | nebius/meta-llama/Meta-Llama-3.1-405B-Instruct |
| Provider | Nebius |
| Provider ID | nebius |
| Mode | Text |
| Canonical Name | llama-3.1-405b |
| Context Window | 128K tokens |
| Max Output | 128K tokens |
Pricing
| Type | Per 1K Tokens | Per 1M Tokens |
|---|---|---|
| Input Tokens | 0.0010 | 1.00 |
| Output Tokens | 0.0030 | 3.00 |
Price Comparison by Provider
Compare prices for Meta Llama 3.1 405B Instruct across different providers. The same model may be available through multiple providers at different price points.
Provider | Model Key | Input Price, $ | Output Price, $ |
|---|---|---|---|
| vertex_ai/meta/llama-3.1-405b-instruct-maas | 5.00 | 16.00 | |
| together_ai/meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo | 3.50 | 3.50 | |
| snowflake/snowflake-llama-3.1-405b | N/A | N/A | |
| sambanova/Meta-Llama-3.1-405B-Instruct | 5.00 | 10.00 | |
| oci/meta.llama-3.1-405b-instruct | 10.68 | 10.68 | |
| Nebius | nebius/meta-llama/Meta-Llama-3.1-405B-Instruct | 1.00 | 3.00 |
| meta.llama3-1-405b-instruct-v1:0 | 5.32 | 16.00 | |
| lambda_ai/llama3.1-405b-instruct-fp8 | 0.800 | 0.800 | |
| hyperbolic/meta-llama/Meta-Llama-3.1-405B-Instruct | 0.120 | 0.300 | |
| fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct-long | 0.100 | 0.100 | |
| databricks/databricks-meta-llama-3-1-405b-instruct | 5.00 | 15.00 | |
| azure_ai/Meta-Llama-3.1-405B-Instruct | 5.33 | 16.00 |
All Variants
All available versions, regions, and API endpoints for Meta Llama 3.1 405B Instruct.