Hermes 3 Llama 3.1 405B

Hermes 3 Llama 3.1 405B is a text model offered through Nebius with a context window of 128K tokens and a maximum output of 128K tokens. Nebius pricing is $1.00 per million input tokens and $3.00 per million output tokens. The cheapest listed provider is Lambda, at $0.80 per million tokens for both input and output.

Capabilities

Vision, Function Calling, Reasoning, JSON Schema, System Messages, Web Search, Prompt Caching, Audio Input, Audio Output

Specifications

Model Key: nebius/NousResearch/Hermes-3-Llama-3.1-405B
Provider: Nebius
Provider ID: nebius
Mode: Text
Canonical Name: hermes-3-llama-3.1-405b
Context Window: 128K tokens
Max Output: 128K tokens

Pricing

Type            Per 1K Tokens, $   Per 1M Tokens, $
Input Tokens    0.0010             1.00
Output Tokens   0.0030             3.00
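
The per-token prices above translate into a per-request cost by simple arithmetic. The following is a minimal sketch, assuming the Nebius prices from the table are in dollars; the helper names `estimate_cost` and `per_1k` are hypothetical and not part of any provider SDK.

```python
from decimal import Decimal

# Prices per 1M tokens, copied from the Pricing table (Nebius listing).
INPUT_PRICE_PER_M = Decimal("1.00")
OUTPUT_PRICE_PER_M = Decimal("3.00")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request (hypothetical helper)."""
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


def per_1k(price_per_m: Decimal) -> Decimal:
    """Convert a per-1M-token price to a per-1K-token price."""
    return price_per_m / Decimal(1000)


if __name__ == "__main__":
    # Example workload: 10,000 prompt tokens and 2,000 completion tokens.
    print(estimate_cost(10_000, 2_000))
```

Under these prices, a request with 10,000 input tokens and 2,000 output tokens comes to about $0.016. `Decimal` is used instead of `float` so that small per-token amounts do not pick up binary rounding noise.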

Benchmarks

Intelligence Index: 17.6 (#106)
Coding Index: 18.1 (#84)
Math Index: 15.3 (#106)
MMLU-Pro: 0.7 (#96)
GPQA: 0.5 (#120)
HLE: 0.0 (#159)
LiveCodeBench: 0.5 (#52)
IFBench: 0.3 (#110)
Time to First Token: 0.73s (#145)
SciCode: 0.3 (#72)
AIME 2025: 0.2 (#106)
LCR: 0.2 (#105)
TerminalBench Hard: 0.1 (#77)
TAU2: 0.3 (#94)

Price Comparison by Provider

Compare prices for Hermes 3 Llama 3.1 405B across different providers. The same model may be available through multiple providers at different price points.

Provider    Model Key                                        Input Price, $   Output Price, $
Nebius      nebius/NousResearch/Hermes-3-Llama-3.1-405B      1.00             3.00
Lambda      lambda_ai/hermes3-405b                           0.800            0.800
DeepInfra   deepinfra/NousResearch/Hermes-3-Llama-3.1-405B   1.00             1.00
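
Because the output price differs between providers far more than the input price does, the gap between providers depends on how output-heavy a workload is. The sketch below compares the three listed providers for a given token mix. It assumes the listed prices are per 1M tokens, as in the Pricing section; `request_cost` and `cheapest_provider` are hypothetical helper names.

```python
from decimal import Decimal

# (input price, output price) per 1M tokens, from the comparison table.
PRICES = {
    "Nebius": (Decimal("1.00"), Decimal("3.00")),
    "Lambda": (Decimal("0.800"), Decimal("0.800")),
    "DeepInfra": (Decimal("1.00"), Decimal("1.00")),
}
TOKENS_PER_M = Decimal(1_000_000)


def request_cost(provider: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Cost of a workload at one provider (hypothetical helper)."""
    input_price, output_price = PRICES[provider]
    total = Decimal(input_tokens) * input_price + Decimal(output_tokens) * output_price
    return total / TOKENS_PER_M


def cheapest_provider(input_tokens: int, output_tokens: int) -> str:
    """Name of the provider with the lowest cost for this token mix."""
    return min(PRICES, key=lambda p: request_cost(p, input_tokens, output_tokens))


if __name__ == "__main__":
    # Example workload: 1M input tokens and 1M output tokens.
    for name in PRICES:
        print(name, request_cost(name, 1_000_000, 1_000_000))
    print("cheapest:", cheapest_provider(1_000_000, 1_000_000))
```

With the prices listed here, Lambda is cheapest for any token mix, since both of its prices are the lowest in the table. The comparison only covers price and says nothing about latency, rate limits, or availability.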

All Variants

All available versions, regions, and API endpoints for Hermes 3 Llama 3.1 405B.

Model Key                                        Provider    Mode   Input Price, $   Output Price, $   Context   Max Output   Vision   Functions
deepinfra/NousResearch/Hermes-3-Llama-3.1-405B   DeepInfra   Text   1.00             1.00              131K      131K         no       yes
lambda_ai/hermes3-405b                           Lambda      Text   0.800            0.800             131K      131K         no       yes
nebius/NousResearch/Hermes-3-Llama-3.1-405B      Nebius      Text   1.00             3.00              128K      128K         no       yes