Meta Llama 3.1 8B Instruct

Meta Llama 3.1 8B Instruct is a text model from Nebius with a context window of 128K tokens and max output of 128K tokens. Pricing starts at 0.02 per million input tokens and 0.06 per million output tokens (cheapest at Vertex AI (Llama)).

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Specifications

Model Keynebius/meta-llama/Meta-Llama-3.1-8B-Instruct
ProviderNebius
Provider IDnebius
ModeText
Canonical Namellama-3.1-8b
Context Window128K tokens
Max Output128K tokens

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens0.0000200.020
Output Tokens0.0000600.060

Benchmarks

Intelligence Index11.8#174
Coding Index4.9#157
Math Index4.3#130
MMLU-Pro0.5#165
GPQA0.3#213
HLE0.1#99
LiveCodeBench0.1#170
AIME0.1#93
IFBench0.3#145
Time to First Token0.46s#116
SciCode0.1#185
MATH-5000.5#116
AIME 20250.0#130
LCR0.2#119
TerminalBench Hard0.0#138
TAU20.2#133

Price Comparison by Provider

Compare prices for Meta Llama 3.1 8B Instruct across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price, $
Output Price, $
Weights & Biaseswandb/meta-llama/Llama-3.1-8B-Instruct0.0220.022
Vertex AI (Llama)vertex_ai/meta/llama-3.1-8b-instruct-maasN/AN/A
Vercel AI Gatewayvercel_ai_gateway/meta/llama-3.1-8b0.0500.080
Together AItogether_ai/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo0.1800.180
Snowflakesnowflake/llama3.1-8bN/AN/A
SambaNovasambanova/Meta-Llama-3.1-8B-Instruct0.1000.200
Perplexityperplexity/llama-3.1-8b-instruct0.2000.200
OVHcloudovhcloud/Llama-3.1-8B-Instruct0.1000.100
Ollamaollama/llama3.1N/AN/A
Nscalenscale/meta-llama/Llama-3.1-8B-Instruct0.0300.030
Novita AInovita/meta-llama/llama-3.1-8b-instruct0.0200.050
Nebiusnebius/meta-llama/Meta-Llama-3.1-8B-Instruct0.0200.060
AWS Bedrockmeta.llama3-1-8b-instruct-v1:00.2200.220
LlamaGatellamagate/llama-3.1-8b0.0300.050
Lambdalambda_ai/llama3.1-8b-instruct0.0250.040
Hyperbolichyperbolic/meta-llama/Meta-Llama-3.1-8B-Instruct0.1200.300
Groqgroq/llama-3.1-8b-instant0.0500.080
FriendliAIfriendliai/meta-llama-3.1-8b-instruct0.1000.100
Fireworks AIfireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instruct0.1000.100
DeepInfradeepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo0.0200.030
Databricksdatabricks/databricks-meta-llama-3-1-8b-instruct0.1500.450
Cerebrascerebras/llama3.1-8b0.1000.100
Azure AIazure_ai/Meta-Llama-3.1-8B-Instruct0.3000.610

All Variants

All available versions, regions, and API endpoints for Meta Llama 3.1 8B Instruct.

Model Key
Provider
Mode
Input Price, $
Output Price, $
Context
Max Output
Vision
Functions
meta.llama3-1-8b-instruct-v1:0AWS BedrockText0.2200.220128K2Knoyes
us.meta.llama3-1-8b-instruct-v1:0AWS BedrockText0.2200.220128K2Knoyes
azure_ai/Meta-Llama-3.1-8B-InstructAzure AIText0.3000.610128K2Knono
cerebras/llama3.1-8bCerebrasText0.1000.100128K128Knoyes
databricks/databricks-meta-llama-3-1-8b-instructDatabricksText0.1500.450200K128Knono
deepinfra/meta-llama/Meta-Llama-3.1-8B-InstructDeepInfraText0.0300.050131K131Knoyes
deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-TurboDeepInfraText0.0200.030131K131Knoyes
fireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instructFireworks AIText0.1000.10016K16Knono
friendliai/meta-llama-3.1-8b-instructFriendliAIText0.1000.1008K8Knoyes
groq/llama-3.1-8b-instantGroqText0.0500.080128K8Knoyes
hyperbolic/meta-llama/Meta-Llama-3.1-8B-InstructHyperbolicText0.1200.30033K33Knoyes
lambda_ai/llama3.1-8b-instructLambdaText0.0250.040131K131Knoyes
llamagate/llama-3.1-8bLlamaGateText0.0300.050131K8Knoyes
nebius/meta-llama/Meta-Llama-3.1-8B-InstructNebiusText0.0200.060128K128Knoyes
novita/meta-llama/llama-3.1-8b-instructNovita AIText0.0200.05016K16Knono
nscale/meta-llama/Llama-3.1-8B-InstructNscaleText0.0300.030N/AN/Anono
ollama/llama3.1OllamaTextN/AN/A8K8Knoyes
ovhcloud/Llama-3.1-8B-InstructOVHcloudText0.1000.100131K131Knoyes
perplexity/llama-3.1-8b-instructPerplexityText0.2000.200131K131Knono
sambanova/Meta-Llama-3.1-8B-InstructSambaNovaText0.1000.20016K16Knoyes
snowflake/llama3.1-8bSnowflakeTextN/AN/A128K8Knono
together_ai/meta-llama/Meta-Llama-3.1-8B-Instruct-TurboTogether AIText0.1800.180N/AN/Anoyes
vercel_ai_gateway/meta/llama-3.1-8bVercel AI GatewayText0.0500.080131K131Knoyes
vertex_ai/meta/llama-3.1-8b-instruct-maasVertex AI (Llama)TextN/AN/A128K2Kyesno
wandb/meta-llama/Llama-3.1-8B-InstructWeights & BiasesText0.0220.022128K128Knono