Hermes 3 Llama 3.1 405B
DeepinfraText
Hermes 3 Llama 3.1 405B is a text model from Deepinfra with a context window of 131K tokens and max output of 131K tokens. Pricing starts at $1.00 per million input tokens and $1.00 per million output tokens.
Specifications
| Model Key | deepinfra/NousResearch/Hermes-3-Llama-3.1-405B |
| Provider | Deepinfra |
| LiteLLM Provider | deepinfra |
| Mode | Text |
| Canonical Name | hermes-3-llama-3.1-405b |
| Context Window | 131K tokens |
| Max Output | 131K tokens |
Capabilities
✗ Vision✗ Function Calling✗ Reasoning✗ JSON Schema✗ System Messages✗ Web Search✗ Prompt Caching✗ Audio Input✗ Audio Output
Pricing
| Type | Per 1K Tokens | Per 1M Tokens |
|---|---|---|
| Input Tokens | $0.0010 | $1.00 |
| Output Tokens | $0.0010 | $1.00 |
Similar Models
Models with similar capabilities and context window size.
Model | Provider | Mode | Input Price | Output Price | Context | Max Output | Vision | Functions |
|---|---|---|---|---|---|---|---|---|
| Gemma 3 27B It | Google Gemini | Text | N/A | N/A | 131K | 8K | yes | yes |
| GPT-oss-120b-mxfp-GGUF | Lemonade | Text | N/A | N/A | 131K | 33K | no | yes |
| GPT-oss-20b | OpenRouter | Text | $0.020 | $0.100 | 131K | 33K | no | yes |
| GPT-oss-20b-mxfp4-GGUF | Lemonade | Text | N/A | N/A | 131K | 33K | no | yes |
| GPT-oss:120b-cloud | Ollama | Text | N/A | N/A | 131K | 131K | no | yes |
| GPT-oss:20b-cloud | Ollama | Text | N/A | N/A | 131K | 131K | no | yes |
| Llama 3.2 3B Instruct | Deepinfra | Text | $0.020 | $0.020 | 131K | 131K | no | no |
| Llama3.2 11B Vision Instruct | Lambda Ai | Text | $0.015 | $0.025 | 131K | 131K | yes | yes |
| Llama3.2 3B Instruct | Lambda Ai | Text | $0.015 | $0.025 | 131K | 131K | no | yes |
| Mistral Nemo Instruct 2407 | Deepinfra | Text | $0.020 | $0.040 | 131K | 131K | no | no |