Llama 3.1 8B is Meta's compact 8B-parameter pre-trained language model in the Llama 3.1 series, with a 131K-token context window and up to 16K output tokens. It is available from 4 providers, with prices starting at $0.03 per 1M input tokens and $0.05 per 1M output tokens, and it offers efficient on-device or low-cost inference with tool-use support.
Specifications
Canonical ID: meta-llama-3-1-8b
Type: Language
Status: Active
Creator: Meta
Providers: Cerebras, LlamaGate, Snowflake, Vercel AI Gateway
Context Window: 131K tokens
Max Output: 16K tokens
Input Modalities: Text
Output Modalities: Text
Parameters: 8B
Release Date: 2 years ago

Capabilities

Input (1/5)
Supported: Text
Not supported: Image, Audio, Video, PDF

Output (1/5)
Supported: Text
Not supported: Image, Audio, Video, Embedding

Capabilities (2/13)
Supported: Function Calling, Structured Outputs
Not supported: Reasoning, Adaptive Reasoning, Parallel Function Calling, Native JSON Schema, Web Search, URL Context, Computer Use, Code Execution, File Search, Prompt Caching, Assistant Prefill

Pricing by Provider

Prices in US dollars ($) per 1M tokens.

| Provider | Model ID | Standard Input | Standard Output | Batch Input | Batch Output |
| --- | --- | --- | --- | --- | --- |
| Cerebras | cerebras/llama3.1-8b | $0.1 | $0.1 | | |
| LlamaGate | llamagate/llama-3.1-8b | $0.03 | $0.05 | | |
| Snowflake | llama3.1-8b | $0.11 | $0.11 | $0.055 | $0.055 |
| Vercel AI Gateway | meta/llama-3.1-8b | $0.22 | $0.22 | | |

Cost Calculator
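
The per-token arithmetic behind a cost estimate can be sketched in a few lines of Python. This is a minimal illustration, not the page's own calculator: the prices are copied from the provider listing above, while the dictionary layout and the `estimate_cost` and `cheapest_provider` helpers are hypothetical names introduced here.

```python
# Prices in USD per 1M tokens as (input, output), copied from the provider listing.
# The dictionary layout is an assumption made for this sketch.
PRICES_USD_PER_1M = {
    "Cerebras": {"standard": (0.10, 0.10)},
    "LlamaGate": {"standard": (0.03, 0.05)},
    "Snowflake": {"standard": (0.11, 0.11), "batch": (0.055, 0.055)},
    "Vercel AI Gateway": {"standard": (0.22, 0.22)},
}


def estimate_cost(provider, input_tokens, output_tokens, tier="standard"):
    """Return the estimated USD cost of one request (hypothetical helper)."""
    tiers = PRICES_USD_PER_1M[provider]
    if tier not in tiers:
        raise ValueError(f"{provider} lists no {tier} price")
    input_price, output_price = tiers[tier]
    # Prices are quoted per 1M tokens, so scale the token counts down accordingly.
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def cheapest_provider(input_tokens, output_tokens):
    """Return the provider with the lowest standard-tier cost for this workload."""
    return min(
        PRICES_USD_PER_1M,
        key=lambda provider: estimate_cost(provider, input_tokens, output_tokens),
    )


if __name__ == "__main__":
    # Example workload: 100K input tokens and 20K output tokens.
    for name in PRICES_USD_PER_1M:
        print(f"{name}: ${estimate_cost(name, 100_000, 20_000):.4f}")
    print("Cheapest:", cheapest_provider(100_000, 20_000))
```

Only Snowflake lists batch prices, so requesting the `batch` tier for any other provider raises a `ValueError` instead of silently falling back to the standard price.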


Versions

| Version | Released | Context | Input / 1M | Output / 1M | Status |
| --- | --- | --- | --- | --- | --- |
| Llama 3.3 70B Instruct | | 131K | $0.100 | $0.200 | Available |
| Llama 3.2 3B Instruct | | 131K | $0.015 | $0.020 | Deprecated |
| Llama 3.2 1B Instruct | | 131K | $0.027 | $0.080 | Deprecated |
| Llama 3.2 11B | | 128K | $0.160 | $0.160 | Available |
| Llama 3.1 8B | | 131K | $0.030 | $0.050 | Current |
| Llama 3.1 405B Instruct | | 131K | $0.120 | $0.300 | Deprecating |
| Llama 3.1 70B Instruct | | 131K | $0.120 | $0.300 | Available |
| Llama 3.1 8B Instruct | | 200K | $0.020 | $0.030 | Available |
| Llama 3.1 70B | | 128K | $0.360 | $0.360 | Available |
| Llama 3 70B Instruct | | 131K | $0.120 | $0.300 | Deprecated |
| Llama 3 8B Instruct | | 32K | $0.030 | $0.040 | Available |

Model IDs

cerebras/llama3.1-8b
llamagate/llama-3.1-8b
meta-llama-3-1-8b
meta-textgeneration-llama-3-1-8b
meta-textgenerationneuron-llama-3-1-8b
meta/llama-3.1-8b
snowflake/llama3.1-8b
vercel_ai_gateway/meta/llama-3.1-8b