Llama 3.3 70B is Meta's language model with a 128K context window and up to 8K output tokens, available from 6 providers, starting at $0.710 / 1M input and $0.710 / 1M output. Meta's 70B instruction-tuned LLM from the Llama 3.3 series, designed for high-performance conversational AI, content creation, and enterprise applications.
Specifications
Canonical IDmeta-llama-3-3-70b
TypeLanguage
StatusActive
CreatorMetaMeta
Providers
Context Window128K tokens
Max Output8K tokens
Input ModalitiesText
Output ModalitiesText
Parameters70B
Release Date · 1 year ago

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities1/13
Reasoning·
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandardBatch
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
Azure AI Foundry logo
Azure AI Foundry
meta:llama3370b
$0.710$0.710
Google Gemini logo
Google Gemini
llama3-3-70b-maas
$0.720$0.720
Google Vertex AI logo
Google Vertex AI
llama3-3-70b-maas
$0.720$0.720
Snowflake logo
Snowflake
snowflake-llama-3.3-70b
$0.720$0.720$0.360$0.360
Vercel AI Gateway logo
Vercel AI Gateway
meta/llama-3.3-70b
$0.720$0.720
Cerebras logo
Cerebras
cerebras/llama-3.3-70b
$0.850$1.20
View Azure AI Foundry

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Llama 3.3 70B128K$0.710$0.710Current
Llama 3.3 70B Instruct131K$0.100$0.200Available
Llama 3.2 3B Instruct131K$0.015$0.020Deprecated
Llama 3.2 1B Instruct131K$0.027$0.080Deprecated
Llama 3.1 405B Instruct131K$0.120$0.300Deprecating
Llama 3.1 70B Instruct131K$0.100$0.100Available
Llama 3.1 8B Instruct200K$0.020$0.030Available
Llama 3.1 70B128K$0.600$0.600Available
Llama 3.1 8B131K$0.030$0.050Available
Llama 3 70B Instruct131K$0.120$0.300Available
Llama 3 8B Instruct32K$0.030$0.040Available

Model IDs