Meta Llama 3.1 70B Instruct

Meta Llama 3.1 70B Instruct is a text model from Nebius with a context window of 128K tokens and max output of 128K tokens. Pricing starts at 0.13 per million input tokens and 0.40 per million output tokens (cheapest at Vertex AI (Llama)).

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Specifications

Model Keynebius/meta-llama/Meta-Llama-3.1-70B-Instruct
ProviderNebius
Provider IDnebius
ModeText
Canonical Namellama-3.1-70b
Context Window128K tokens
Max Output128K tokens

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens0.0001300.130
Output Tokens0.0004000.400

Benchmarks

Intelligence Index12.5#161
Coding Index10.9#126
Math Index4.0#131
MMLU-Pro0.7#126
GPQA0.4#167
HLE0.0#126
LiveCodeBench0.2#140
AIME0.2#69
IFBench0.3#114
Time to First Token0.47s#117
SciCode0.3#122
MATH-5000.6#102
AIME 20250.0#131
LCR0.1#140
TerminalBench Hard0.0#121
TAU20.2#135

Price Comparison by Provider

Compare prices for Meta Llama 3.1 70B Instruct across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price, $
Output Price, $
Vertex AI (Llama)vertex_ai/meta/llama-3.1-70b-instruct-maasN/AN/A
Vercel AI Gatewayvercel_ai_gateway/meta/llama-3.1-70b0.7200.720
Together AItogether_ai/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo0.8800.880
Snowflakesnowflake/llama3.1-70bN/AN/A
Perplexityperplexity/llama-3.1-70b-instruct1.001.00
OVHcloudovhcloud/Meta-Llama-3_1-70B-Instruct0.6700.670
Nebiusnebius/meta-llama/Meta-Llama-3.1-70B-Instruct0.1300.400
AWS Bedrockmeta.llama3-1-70b-instruct-v1:00.9900.990
Lambdalambda_ai/llama3.1-70b-instruct-fp80.1200.300
Hyperbolichyperbolic/meta-llama/Meta-Llama-3.1-70B-Instruct0.1200.300
FriendliAIfriendliai/meta-llama-3.1-70b-instruct0.6000.600
Fireworks AIfireworks_ai/accounts/fireworks/models/llama-v3p1-70b-instruct-1b0.1000.100
DeepInfradeepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo0.1000.280
Cerebrascerebras/llama3.1-70b0.6000.600
Azure AIazure_ai/Meta-Llama-3.1-70B-Instruct2.683.54

All Variants

All available versions, regions, and API endpoints for Meta Llama 3.1 70B Instruct.

Model Key
Provider
Mode
Input Price, $
Output Price, $
Context
Max Output
Vision
Functions
meta.llama3-1-70b-instruct-v1:0AWS BedrockText0.9900.990128K2Knoyes
us.meta.llama3-1-70b-instruct-v1:0AWS BedrockText0.9900.990128K2Knoyes
azure_ai/Meta-Llama-3.1-70B-InstructAzure AIText2.683.54128K2Knono
cerebras/llama3.1-70bCerebrasText0.6000.600128K128Knoyes
deepinfra/meta-llama/Meta-Llama-3.1-70B-InstructDeepInfraText0.4000.400131K131Knoyes
deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct-TurboDeepInfraText0.1000.280131K131Knoyes
fireworks_ai/accounts/fireworks/models/llama-v3p1-70b-instructFireworks AIText0.9000.900131K131Knono
fireworks_ai/accounts/fireworks/models/llama-v3p1-70b-instruct-1bFireworks AIText0.1000.1004K4Knono
friendliai/meta-llama-3.1-70b-instructFriendliAIText0.6000.6008K8Knoyes
hyperbolic/meta-llama/Meta-Llama-3.1-70B-InstructHyperbolicText0.1200.30033K33Knoyes
lambda_ai/llama3.1-70b-instruct-fp8LambdaText0.1200.300131K131Knoyes
nebius/meta-llama/Meta-Llama-3.1-70B-InstructNebiusText0.1300.400128K128Knoyes
ovhcloud/Meta-Llama-3_1-70B-InstructOVHcloudText0.6700.670131K131Knono
perplexity/llama-3.1-70b-instructPerplexityText1.001.00131K131Knono
snowflake/llama3.1-70bSnowflakeTextN/AN/A128K8Knono
together_ai/meta-llama/Meta-Llama-3.1-70B-Instruct-TurboTogether AIText0.8800.880N/AN/Anoyes
vercel_ai_gateway/meta/llama-3.1-70bVercel AI GatewayText0.7200.720128K8Knono
vertex_ai/meta/llama-3.1-70b-instruct-maasVertex AI (Llama)TextN/AN/A128K2Kyesno