Llama 3.1 8B Instruct
NscaleText
Llama 3.1 8B Instruct is a text model from Nscale. Pricing starts at $0.03 per million input tokens and $0.03 per million output tokens (cheapest at Google Vertex AI).
Specifications
| Model Key | nscale/meta-llama/Llama-3.1-8B-Instruct |
| Provider | Nscale |
| LiteLLM Provider | nscale |
| Mode | Text |
| Canonical Name | llama-3.1-8b |
| Context Window | N/A tokens |
| Max Output | N/A |
Capabilities
✗ Vision✗ Function Calling✗ Reasoning✗ JSON Schema✗ System Messages✗ Web Search✗ Prompt Caching✗ Audio Input✗ Audio Output
Pricing
| Type | Per 1K Tokens | Per 1M Tokens |
|---|---|---|
| Input Tokens | $0.000030 | $0.030 |
| Output Tokens | $0.000030 | $0.030 |
Price Comparison by Provider
Compare prices for Llama 3.1 8B Instruct across different providers. The same model may be available through multiple providers at different price points.
Provider | Model Key | Input Price | Output Price |
|---|---|---|---|
| Databricks | databricks/databricks-meta-llama-3-1-8b-instruct | $0.150 | $0.450 |
| Llamagate | llamagate/llama-3.1-8b | $0.030 | $0.050 |
| Vercel Ai Gateway | vercel_ai_gateway/meta/llama-3.1-8b | $0.050 | $0.080 |
| Groq | groq/llama-3.1-8b-instant | $0.050 | $0.080 |
| Novita | novita/meta-llama/llama-3.1-8b-instruct | $0.020 | $0.050 |
| Nscale | nscale/meta-llama/Llama-3.1-8B-Instruct | $0.030 | $0.030 |
| Ovhcloud | ovhcloud/Llama-3.1-8B-Instruct | $0.100 | $0.100 |
| Perplexity | perplexity/llama-3.1-8b-instruct | $0.200 | $0.200 |
| Wandb | wandb/meta-llama/Llama-3.1-8B-Instruct | $0.022 | $0.022 |
| Google Vertex AI | vertex_ai/meta/llama-3.1-8b-instruct-maas | N/A | N/A |
| Fireworks AI | fireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instruct | $0.100 | $0.100 |
| AWS Bedrock | meta.llama3-1-8b-instruct-v1:0 | $0.220 | $0.220 |
| Cerebras | cerebras/llama3.1-8b | $0.100 | $0.100 |
| Snowflake | snowflake/llama3.1-8b | N/A | N/A |
| Lambda Ai | lambda_ai/llama3.1-8b-instruct | $0.025 | $0.040 |
| Azure AI | azure_ai/Meta-Llama-3.1-8B-Instruct | $0.300 | $0.610 |
| Deepinfra | deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct | $0.030 | $0.050 |
| Friendliai | friendliai/meta-llama-3.1-8b-instruct | $0.100 | $0.100 |
| Hyperbolic | hyperbolic/meta-llama/Meta-Llama-3.1-8B-Instruct | $0.120 | $0.300 |
| SambaNova | sambanova/Meta-Llama-3.1-8B-Instruct | $0.100 | $0.200 |
| Together AI | together_ai/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo | $0.180 | $0.180 |
Similar Models
Models with similar capabilities and context window size.
Model | Provider | Mode | Input Price | Output Price | Context | Max Output | Vision | Functions |
|---|---|---|---|---|---|---|---|---|
| DeepSeek R1 Distill Llama 8B | Nscale | Text | $0.025 | $0.025 | N/A | N/A | no | no |
| DeepSeek R1 Distill Qwen 14B | Nscale | Text | $0.070 | $0.070 | N/A | N/A | no | no |
| GPT-5 nano | Replicate | Text | $0.050 | $0.400 | N/A | N/A | no | yes |
| Granite 3.3 8B Instruct | Replicate | Text | $0.030 | $0.250 | N/A | N/A | no | yes |
| Llama 3.3 70B Instruct Turbo Free | Together AI | Text | N/A | N/A | N/A | N/A | no | yes |
| Llama 4 Scout 17B 16E Instruct | Nscale | Text | $0.090 | $0.290 | N/A | N/A | no | no |
| Qwen2.5 Coder 32B Instruct | Nscale | Text | $0.060 | $0.200 | N/A | N/A | no | no |
| Qwen2.5 Coder 3B Instruct | Nscale | Text | $0.010 | $0.030 | N/A | N/A | no | no |
| Qwen2.5 Coder 7B Instruct | Nscale | Text | $0.010 | $0.030 | N/A | N/A | no | no |
| Titan Embed Text V2 | Vercel Ai Gateway | Text | $0.020 | N/A | N/A | N/A | no | no |