Llama3.1 70B Instruct Fp8

Lambda AiText

Llama3.1 70B Instruct Fp8 is a text model from Lambda Ai with a context window of 131K tokens and max output of 131K tokens. Pricing starts at $0.12 per million input tokens and $0.30 per million output tokens (cheapest at Google Vertex AI).

Specifications

Model Keylambda_ai/llama3.1-70b-instruct-fp8
ProviderLambda Ai
LiteLLM Providerlambda_ai
ModeText
Canonical Namellama-3.1-70b
Context Window131K tokens
Max Output131K tokens

Capabilities

Vision Function Calling Reasoning JSON Schema System Messages Web Search Prompt Caching Audio Input Audio Output

Pricing

TypePer 1K TokensPer 1M Tokens
Input Tokens$0.000120$0.120
Output Tokens$0.000300$0.300

Price Comparison by Provider

Compare prices for Llama3.1 70B Instruct Fp8 across different providers. The same model may be available through multiple providers at different price points.

Provider
Model Key
Input Price
Output Price
Vercel Ai Gatewayvercel_ai_gateway/meta/llama-3.1-70b$0.720$0.720
Perplexityperplexity/llama-3.1-70b-instruct$1.00$1.00
Google Vertex AIvertex_ai/meta/llama-3.1-70b-instruct-maasN/AN/A
Fireworks AIfireworks_ai/accounts/fireworks/models/llama-v3p1-70b-instruct$0.900$0.900
Cerebrascerebras/llama3.1-70b$0.600$0.600
Snowflakesnowflake/llama3.1-70bN/AN/A
Lambda Ailambda_ai/llama3.1-70b-instruct-fp8$0.120$0.300
Ovhcloudovhcloud/Meta-Llama-3_1-70B-Instruct$0.670$0.670
Azure AIazure_ai/Meta-Llama-3.1-70B-Instruct$2.68$3.54
Deepinfradeepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct$0.400$0.400
Friendliaifriendliai/meta-llama-3.1-70b-instruct$0.600$0.600
Hyperbolichyperbolic/meta-llama/Meta-Llama-3.1-70B-Instruct$0.120$0.300
Together AItogether_ai/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo$0.880$0.880
AWS Bedrockus.meta.llama3-1-70b-instruct-v1:0$0.990$0.990

Similar Models

Models with similar capabilities and context window size.

Model
Provider
Mode
Input Price
Output Price
Context
Max Output
Vision
Functions
Gemma 3 27B ItGoogle GeminiTextN/AN/A131K8Kyesyes
GPT-oss-120b-mxfp-GGUFLemonadeTextN/AN/A131K33Knoyes
GPT-oss-20bOpenRouterText$0.020$0.100131K33Knoyes
GPT-oss-20b-mxfp4-GGUFLemonadeTextN/AN/A131K33Knoyes
GPT-oss:120b-cloudOllamaTextN/AN/A131K131Knoyes
GPT-oss:20b-cloudOllamaTextN/AN/A131K131Knoyes
Llama3.2 11B Vision InstructLambda AiText$0.015$0.025131K131Kyesyes
Llama3.2 3B InstructLambda AiText$0.015$0.025131K131Knoyes
Meta Llama 3.1 8B Instruct TurboDeepinfraText$0.020$0.030131K131Knono
Mistral Nemo Instruct 2407DeepinfraText$0.020$0.040131K131Knono