Llama3.1 70B

CerebrasText

Llama3.1 70B is a text model from Cerebras with a context window of 128K tokens and max output of 128K tokens. Pricing starts at $0.60 per million input tokens and $0.60 per million output tokens (cheapest at Google Vertex AI).

Specifications

Model Keycerebras/llama3.1-70b
ProviderCerebras
LiteLLM Providercerebras
ModeText
Canonical Namellama-3.1-70b
Context Window128K tokens
Max Output128K tokens

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens$0.000600$0.600
Output Tokens$0.000600$0.600

Price Comparison by Provider

Compare prices for Llama3.1 70B across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price
Output Price
Vercel Ai Gatewayvercel_ai_gateway/meta/llama-3.1-70b$0.720$0.720
Perplexityperplexity/llama-3.1-70b-instruct$1.00$1.00
Google Vertex AIvertex_ai/meta/llama-3.1-70b-instruct-maasN/AN/A
Fireworks AIfireworks_ai/accounts/fireworks/models/llama-v3p1-70b-instruct$0.900$0.900
Cerebrascerebras/llama3.1-70b$0.600$0.600
Snowflakesnowflake/llama3.1-70bN/AN/A
Lambda Ailambda_ai/llama3.1-70b-instruct-fp8$0.120$0.300
Ovhcloudovhcloud/Meta-Llama-3_1-70B-Instruct$0.670$0.670
Azure AIazure_ai/Meta-Llama-3.1-70B-Instruct$2.68$3.54
Deepinfradeepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct$0.400$0.400
Friendliaifriendliai/meta-llama-3.1-70b-instruct$0.600$0.600
Hyperbolichyperbolic/meta-llama/Meta-Llama-3.1-70B-Instruct$0.120$0.300
Together AItogether_ai/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo$0.880$0.880
AWS Bedrockus.meta.llama3-1-70b-instruct-v1:0$0.990$0.990

Similar Models

Models with similar capabilities and context window size.

Model
Provider
Mode
Input Price
Output Price
Context
Max Output
Vision
Functions
Gemma 3 4B It GGUFLemonadeTextN/AN/A128K8Knoyes
Gemma3 4BLlamagateText$0.030$0.080128K8Kyesyes
GigaChat 2 LiteGigachatTextN/AN/A128K8Knoyes
GigaChat 2 MaxGigachatTextN/AN/A128K8Kyesyes
GigaChat 2 ProGigachatTextN/AN/A128K8Kyesyes
Glm 4.5 FlashZaiTextN/AN/A128K32Knoyes
Llama 3.1 70B Instruct MaasGoogle Vertex AITextN/AN/A128K2Kyesno
Llama 3.1 8B Instruct MaasGoogle Vertex AITextN/AN/A128K2Kyesno
Llama 3.2 90B Vision Instruct MaasGoogle Vertex AITextN/AN/A128K2Kyesno
Qwen3 4B Fp8NovitaText$0.030$0.030128K20Knono