Llama 3.3 70B
CerebrasText
Llama 3.3 70B is a text model from Cerebras with a context window of 128K tokens and max output of 128K tokens. Pricing starts at $0.85 per million input tokens and $1.20 per million output tokens (cheapest at Together AI).
Specifications
| Model Key | cerebras/llama-3.3-70b |
| Provider | Cerebras |
| LiteLLM Provider | cerebras |
| Mode | Text |
| Canonical Name | llama-3.3-70b |
| Context Window | 128K tokens |
| Max Output | 128K tokens |
Capabilities
✗ Vision✓ Function Calling✗ Reasoning✗ JSON Schema✗ System Messages✗ Web Search✗ Prompt Caching✗ Audio Input✗ Audio Output
Pricing
| Type | Per 1K Tokens | Per 1M Tokens |
|---|---|---|
| Input Tokens | $0.000850 | $0.850 |
| Output Tokens | $0.0012 | $1.20 |
Price Comparison by Provider
Compare prices for Llama 3.3 70B across different providers. The same model may be available through multiple providers at different price points.
Provider | Model Key | Input Price | Output Price |
|---|---|---|---|
| Databricks | databricks/databricks-meta-llama-3-3-70b-instruct | $0.500 | $1.50 |
| Watsonx | watsonx/meta-llama/llama-3-3-70b-instruct | $0.710 | $0.710 |
| Cerebras | cerebras/llama-3.3-70b | $0.850 | $1.20 |
| Vercel Ai Gateway | vercel_ai_gateway/meta/llama-3.3-70b | $0.720 | $0.720 |
| Azure AI | azure_ai/Llama-3.3-70B-Instruct | $0.710 | $0.710 |
| Deepinfra | deepinfra/meta-llama/Llama-3.3-70B-Instruct | $0.230 | $0.400 |
| Hyperbolic | hyperbolic/meta-llama/Llama-3.3-70B-Instruct | $0.120 | $0.300 |
| Meta Llama | meta_llama/Llama-3.3-70B-Instruct | N/A | N/A |
| Novita | novita/meta-llama/llama-3.3-70b-instruct | $0.135 | $0.400 |
| Nscale | nscale/meta-llama/Llama-3.3-70B-Instruct | $0.200 | $0.200 |
| Oci | oci/meta.llama-3.3-70b-instruct | $0.720 | $0.720 |
| Wandb | wandb/meta-llama/Llama-3.3-70B-Instruct | $0.071 | $0.071 |
| Together AI | together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo-Free | N/A | N/A |
| Groq | groq/llama-3.3-70b-versatile | $0.590 | $0.790 |
| Fireworks AI | fireworks_ai/accounts/fireworks/models/llama-v3p3-70b-instruct | $0.900 | $0.900 |
| Gradient Ai | gradient_ai/llama3.3-70b-instruct | $0.650 | $0.650 |
| Lambda Ai | lambda_ai/llama3.3-70b-instruct-fp8 | $0.120 | $0.300 |
| Ovhcloud | ovhcloud/Meta-Llama-3_3-70B-Instruct | $0.670 | $0.670 |
| SambaNova | sambanova/Meta-Llama-3.3-70B-Instruct | $0.600 | $1.20 |
| Snowflake | snowflake/snowflake-llama-3.3-70b | N/A | N/A |
| AWS Bedrock | us.meta.llama3-3-70b-instruct-v1:0 | $0.720 | $0.720 |
Similar Models
Models with similar capabilities and context window size.
Model | Provider | Mode | Input Price | Output Price | Context | Max Output | Vision | Functions |
|---|---|---|---|---|---|---|---|---|
| Gemma 3 4B It GGUF | Lemonade | Text | N/A | N/A | 128K | 8K | no | yes |
| Gemma3 4B | Llamagate | Text | $0.030 | $0.080 | 128K | 8K | yes | yes |
| GigaChat 2 Lite | Gigachat | Text | N/A | N/A | 128K | 8K | no | yes |
| GigaChat 2 Max | Gigachat | Text | N/A | N/A | 128K | 8K | yes | yes |
| GigaChat 2 Pro | Gigachat | Text | N/A | N/A | 128K | 8K | yes | yes |
| Glm 4.5 Flash | Zai | Text | N/A | N/A | 128K | 32K | no | yes |
| Llama 3.1 70B Instruct Maas | Google Vertex AI | Text | N/A | N/A | 128K | 2K | yes | no |
| Llama 3.1 8B Instruct Maas | Google Vertex AI | Text | N/A | N/A | 128K | 2K | yes | no |
| Llama 3.2 90B Vision Instruct Maas | Google Vertex AI | Text | N/A | N/A | 128K | 2K | yes | no |
| Qwen3 4B Fp8 | Novita | Text | $0.030 | $0.030 | 128K | 20K | no | no |