Llama 3 8B is Meta's language model with a 8K context window and up to 8K output tokens, available from 4 providers, starting at $0.050 / 1M input and $0.080 / 1M output. Meta's compact 8B pre-trained LLM from the Llama 3 generation, suitable for efficient on-device and low-cost cloud inference.
Specifications
Canonical IDmeta-llama-3-8b
TypeLanguage
StatusActive
CreatorMetaMeta
Providers
Context Window8K tokens
Max Output8K tokens
Input ModalitiesText
Output ModalitiesText
Parameters8B

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities0/13
Reasoning·
Adaptive Reasoning·
Function Calling·
Parallel Function Calling·
Structured Outputs·
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandardBatch
Input
$ / 1M
Output
$ / 1M
Input
$ / 1M
Output
$ / 1M
Replicate logo
Replicate
replicate/meta/llama-3-8b
$0.050$0.250
Vercel AI Gateway logo
Vercel AI Gateway
vercel_ai_gateway/meta/llama-3-8b
$0.050$0.080
Fireworks AI logo
Fireworks AI
fireworks_ai/accounts/fireworks/models/llama-v3-8b
$0.200$0.200
Snowflake logo
Snowflake
llama3-8b
$0.380$0.380$0.190$0.190

Cost Calculator

Preset:

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Llama 3.3 70B Instruct131K$0.100$0.200Available
Llama 3.2 3B Instruct131K$0.015$0.020Deprecated
Llama 3.2 1B Instruct131K$0.027$0.080Deprecated
Llama 3.1 405B Instruct131K$0.120$0.300Deprecating
Llama 3.1 70B Instruct131K$0.100$0.100Available
Llama 3.1 8B Instruct200K$0.020$0.030Available
Llama 3.1 70B128K$0.600$0.600Available
Llama 3.1 8B131K$0.030$0.050Available
Llama 3 70B Instruct131K$0.120$0.300Available
Llama 3 8B Instruct32K$0.030$0.040Available
Llama 3 8B8K$0.050$0.080Current

Model IDs