Llama 3.3 8B Instruct
Meta LlamaText
Llama 3.3 8B Instruct is a text model from Meta Llama with a context window of 128K tokens and max output of 4K tokens.
Specifications
| Model Key | meta_llama/Llama-3.3-8B-Instruct |
| Provider | Meta Llama |
| LiteLLM Provider | meta_llama |
| Mode | Text |
| Canonical Name | llama-3.3-8b |
| Context Window | 128K tokens |
| Max Output | 4K tokens |
Capabilities
✗ Vision✓ Function Calling✗ Reasoning✗ JSON Schema✗ System Messages✗ Web Search✗ Prompt Caching✗ Audio Input✗ Audio Output
Pricing
| Type | Per 1K Tokens | Per 1M Tokens |
|---|---|---|
| Input Tokens | N/A | N/A |
| Output Tokens | N/A | N/A |
Similar Models
Models with similar capabilities and context window size.
Model | Provider | Mode | Input Price | Output Price | Context | Max Output | Vision | Functions |
|---|---|---|---|---|---|---|---|---|
| Gemma 3 4B It GGUF | Lemonade | Text | N/A | N/A | 128K | 8K | no | yes |
| Gemma3 4B | Llamagate | Text | $0.030 | $0.080 | 128K | 8K | yes | yes |
| GigaChat 2 Lite | Gigachat | Text | N/A | N/A | 128K | 8K | no | yes |
| GigaChat 2 Max | Gigachat | Text | N/A | N/A | 128K | 8K | yes | yes |
| GigaChat 2 Pro | Gigachat | Text | N/A | N/A | 128K | 8K | yes | yes |
| Glm 4.5 Flash | Zai | Text | N/A | N/A | 128K | 32K | no | yes |
| Llama 3.1 70B Instruct Maas | Google Vertex AI | Text | N/A | N/A | 128K | 2K | yes | no |
| Llama 3.1 8B Instruct Maas | Google Vertex AI | Text | N/A | N/A | 128K | 2K | yes | no |
| Llama 3.2 90B Vision Instruct Maas | Google Vertex AI | Text | N/A | N/A | 128K | 2K | yes | no |
| Qwen3 4B Fp8 | Novita | Text | $0.030 | $0.030 | 128K | 20K | no | no |