Llama 3.3 70B Instruct

Llama 3.3 70B Instruct is a text model from Nebius with a context window of 128K tokens and max output of 128K tokens. Pricing starts at 0.13 per million input tokens and 0.40 per million output tokens (cheapest at Together AI).

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Specifications

Model Keynebius/meta-llama/Llama-3.3-70B-Instruct
ProviderNebius
Provider IDnebius
ModeText
Canonical Namellama-3.3-70b
Context Window128K tokens
Max Output128K tokens

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens0.0001300.130
Output Tokens0.0004000.400

Benchmarks

Intelligence Index14.5#136
Coding Index10.7#130
Math Index7.7#120
MMLU-Pro0.7#102
GPQA0.5#136
HLE0.0#172
LiveCodeBench0.3#115
AIME0.3#48
IFBench0.5#47
Time to First Token0.53s#127
SciCode0.3#128
MATH-5000.8#74
AIME 20250.1#120
LCR0.1#121
TerminalBench Hard0.0#121
TAU20.3#94

Price Comparison by Provider

Compare prices for Llama 3.3 70B Instruct across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price, $
Output Price, $
IBM watsonxwatsonx/meta-llama/llama-3-3-70b-instruct0.7100.710
Weights & Biaseswandb/meta-llama/Llama-3.3-70B-Instruct0.0710.071
Vercel AI Gatewayvercel_ai_gateway/meta/llama-3.3-70b0.7200.720
AWS Bedrockus.meta.llama3-3-70b-instruct-v1:00.7200.720
Together AItogether_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo-FreeN/AN/A
Snowflakesnowflake/snowflake-llama-3.3-70bN/AN/A
SambaNovasambanova/Meta-Llama-3.3-70B-Instruct0.6001.20
OVHcloudovhcloud/Meta-Llama-3_3-70B-Instruct0.6700.670
Oracle Cloud (OCI)oci/meta.llama-3.3-70b-instruct0.7200.720
Nscalenscale/meta-llama/Llama-3.3-70B-Instruct0.2000.200
Novita AInovita/meta-llama/llama-3.3-70b-instruct0.1350.400
Nebiusnebius/meta-llama/Llama-3.3-70B-Instruct0.1300.400
Meta Llamameta_llama/Llama-3.3-70B-InstructN/AN/A
Lambdalambda_ai/llama3.3-70b-instruct-fp80.1200.300
Hyperbolichyperbolic/meta-llama/Llama-3.3-70B-Instruct0.1200.300
Groqgroq/llama-3.3-70b-versatile0.5900.790
Gradient AIgradient_ai/llama3.3-70b-instruct0.6500.650
Fireworks AIfireworks_ai/accounts/fireworks/models/llama-v3p3-70b-instruct0.9000.900
DeepInfradeepinfra/meta-llama/Llama-3.3-70B-Instruct-Turbo0.1300.390
Databricksdatabricks/databricks-meta-llama-3-3-70b-instruct0.5001.50
Cerebrascerebras/llama-3.3-70b0.8501.20
Azure AIazure_ai/Llama-3.3-70B-Instruct0.7100.710

All Variants

All available versions, regions, and API endpoints for Llama 3.3 70B Instruct.

Model Key
Provider
Mode
Input Price, $
Output Price, $
Context
Max Output
Vision
Functions
meta.llama3-3-70b-instruct-v1:0AWS BedrockText0.7200.720128K4Knoyes
us.meta.llama3-3-70b-instruct-v1:0AWS BedrockText0.7200.720128K4Knoyes
azure_ai/Llama-3.3-70B-InstructAzure AIText0.7100.710128K2Knoyes
cerebras/llama-3.3-70bCerebrasText0.8501.20128K128Knoyes
databricks/databricks-meta-llama-3-3-70b-instructDatabricksText0.5001.50128K128Knono
deepinfra/meta-llama/Llama-3.3-70B-InstructDeepInfraText0.2300.400131K131Knoyes
deepinfra/meta-llama/Llama-3.3-70B-Instruct-TurboDeepInfraText0.1300.390131K131Knoyes
fireworks_ai/accounts/fireworks/models/llama-v3p3-70b-instructFireworks AIText0.9000.900131K131Knono
gradient_ai/llama3.3-70b-instructGradient AIText0.6500.6502KN/Anono
groq/llama-3.3-70b-versatileGroqText0.5900.790128K33Knoyes
hyperbolic/meta-llama/Llama-3.3-70B-InstructHyperbolicText0.1200.300131K131Knoyes
watsonx/meta-llama/llama-3-3-70b-instructIBM watsonxText0.7100.710128K128Knoyes
lambda_ai/deepseek-llama3.3-70bLambdaText0.2000.600131K131Knoyes
lambda_ai/llama3.3-70b-instruct-fp8LambdaText0.1200.300131K131Knoyes
meta_llama/Llama-3.3-70B-InstructMeta LlamaTextN/AN/A128K4Knoyes
nebius/meta-llama/Llama-3.3-70B-InstructNebiusText0.1300.400128K128Knoyes
novita/meta-llama/llama-3.3-70b-instructNovita AIText0.1350.400131K120Knoyes
nscale/meta-llama/Llama-3.3-70B-InstructNscaleText0.2000.200N/AN/Anono
oci/meta.llama-3.3-70b-instructOracle Cloud (OCI)Text0.7200.720128K4Knoyes
ovhcloud/Meta-Llama-3_3-70B-InstructOVHcloudText0.6700.670131K131Knoyes
sambanova/Meta-Llama-3.3-70B-InstructSambaNovaText0.6001.20131K131Knoyes
snowflake/llama3.3-70bSnowflakeTextN/AN/A128K8Knono
snowflake/snowflake-llama-3.3-70bSnowflakeTextN/AN/A8K8Knono
together_ai/meta-llama/Llama-3.3-70B-Instruct-TurboTogether AIText0.8800.880N/AN/Anoyes
together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo-FreeTogether AITextN/AN/AN/AN/Anoyes
vercel_ai_gateway/meta/llama-3.3-70bVercel AI GatewayText0.7200.720128K8Knoyes
wandb/meta-llama/Llama-3.3-70B-InstructWeights & BiasesText0.0710.071128K128Knono