Llama 3 8B

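Llama 3 8B is a text model from Meta, listed here through Vercel AI Gateway, with a context window of 8K tokens and a maximum output of 8K tokens. Through Vercel AI Gateway it costs $0.05 per million input tokens and $0.08 per million output tokens. Among the providers listed below, the lowest listed input price is $0.03 per million tokens (DeepInfra) and the lowest listed output price is $0.04 per million tokens (Novita AI); Ollama has no listed price.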

Capabilities

Vision, Function Calling, Reasoning, JSON Schema, System Messages, Web Search, Prompt Caching, Audio Input, Audio Output

Specifications

Model Key: vercel_ai_gateway/meta/llama-3-8b
Provider: Vercel AI Gateway
Provider ID: vercel_ai_gateway
Mode: Text
Canonical Name: llama-3-8b
Context Window: 8K tokens
Max Output: 8K tokens

Pricing

| Type | Per 1K Tokens, $ | Per 1M Tokens, $ |
|---|---|---|
| Input Tokens | 0.000050 | 0.050 |
| Output Tokens | 0.000080 | 0.080 |
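
As a rough illustration of how the per-token prices translate into a per-request cost, here is a minimal sketch. It assumes the prices are in US dollars per 1M tokens as listed above; the helper name `estimate_cost` is hypothetical and not part of any provider SDK.

```python
from decimal import Decimal

# Prices in $ per 1M tokens, copied from the pricing table above.
INPUT_PRICE_PER_M = Decimal("0.050")
OUTPUT_PRICE_PER_M = Decimal("0.080")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost in dollars of one request (hypothetical helper)."""
    total = (Decimal(input_tokens) * INPUT_PRICE_PER_M
             + Decimal(output_tokens) * OUTPUT_PRICE_PER_M)
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # Example: a 6,000-token prompt with a 1,000-token completion.
    print(estimate_cost(6_000, 1_000))
```

At these rates, a request with 6,000 input tokens and 1,000 output tokens comes to about $0.00038. `Decimal` is used instead of `float` so that very small per-token amounts do not pick up binary rounding noise.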

Benchmarks

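| Benchmark | Score | Rank |
|---|---|---|
| Intelligence Index | 6.4 | #232 |
| Coding Index | 4.0 | #161 |
| MMLU-Pro | 0.4 | #182 |
| GPQA | 0.3 | #206 |
| HLE | 0.1 | #100 |
| LiveCodeBench | 0.1 | #181 |
| AIME | 0.0 | #127 |
| IFBench | 0.2 | #159 |
| Time to First Token | 0.37 s | #95 |
| SciCode | 0.1 | #186 |
| MATH-500 | 0.5 | #118 |
| LCR | 0.0 | #151 |
| TerminalBench Hard | 0.0 | #148 |
| TAU2 | 0.0 | #154 |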

Price Comparison by Provider

Compare prices for Llama 3 8B across different providers. The same model may be available through multiple providers at different price points.

| Provider | Model Key | Input Price, $ | Output Price, $ |
|---|---|---|---|
| Vertex AI (Llama) | vertex_ai/meta/llama3-8b-instruct-maas | N/A | N/A |
| Vercel AI Gateway | vercel_ai_gateway/meta/llama-3-8b | 0.050 | 0.080 |
| Snowflake | snowflake/llama3-8b | N/A | N/A |
| Replicate | replicate/meta/llama-3-8b | 0.050 | 0.250 |
| Ollama | ollama/llama3 | N/A | N/A |
| Novita AI | novita/meta-llama/llama-3-8b-instruct | 0.040 | 0.040 |
| Gradient AI | gradient_ai/llama3-8b-instruct | 0.200 | 0.200 |
| Fireworks AI | fireworks_ai/accounts/fireworks/models/llama-v3-8b | 0.200 | 0.200 |
| DeepInfra | deepinfra/meta-llama/Meta-Llama-3-8B-Instruct | 0.030 | 0.060 |
| AWS Bedrock | bedrock/us-east-1/meta.llama3-8b-instruct-v1:0 | 0.300 | 0.600 |
| Anyscale | anyscale/meta-llama/Meta-Llama-3-8B-Instruct | 0.150 | 0.150 |
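
Which provider is cheapest depends on the ratio of input to output tokens, since no single provider has both the lowest input price and the lowest output price. The sketch below ranks the priced providers for a given workload. It assumes the listed prices are dollars per 1M tokens (consistent with the Vercel AI Gateway row matching the pricing table), and `rank_by_cost` is a hypothetical helper, not a provider API.

```python
from typing import Optional

# (provider, input $ per 1M tokens, output $ per 1M tokens).
# None stands for "N/A" in the comparison table above.
PRICES: list[tuple[str, Optional[float], Optional[float]]] = [
    ("Vertex AI (Llama)", None, None),
    ("Vercel AI Gateway", 0.050, 0.080),
    ("Snowflake", None, None),
    ("Replicate", 0.050, 0.250),
    ("Ollama", None, None),
    ("Novita AI", 0.040, 0.040),
    ("Gradient AI", 0.200, 0.200),
    ("Fireworks AI", 0.200, 0.200),
    ("DeepInfra", 0.030, 0.060),
    ("AWS Bedrock", 0.300, 0.600),
    ("Anyscale", 0.150, 0.150),
]


def rank_by_cost(input_tokens: int, output_tokens: int) -> list[tuple[str, float]]:
    """Return (provider, cost in $) pairs, cheapest first.

    Providers without a listed price are skipped rather than treated as free.
    """
    costs = []
    for provider, in_price, out_price in PRICES:
        if in_price is None or out_price is None:
            continue
        cost = (input_tokens * in_price + output_tokens * out_price) / 1_000_000
        costs.append((provider, cost))
    return sorted(costs, key=lambda pair: pair[1])


if __name__ == "__main__":
    for provider, cost in rank_by_cost(10_000_000, 1_000_000):
        print(f"{provider}: ${cost:.2f}")
```

With the listed numbers, an even workload of 1M input and 1M output tokens puts Novita AI first, while an input-heavy workload of 10M input and 1M output tokens puts DeepInfra first.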

All Variants

All available versions, regions, and API endpoints for Llama 3 8B.

| Model Key | Provider | Mode | Input Price, $ | Output Price, $ | Context | Max Output | Vision | Functions |
|---|---|---|---|---|---|---|---|---|
| anyscale/meta-llama/Meta-Llama-3-8B-Instruct | Anyscale | Text | 0.150 | 0.150 | 8K | 8K | no | no |
| bedrock/ap-south-1/meta.llama3-8b-instruct-v1:0 | AWS Bedrock | Text | 0.360 | 0.720 | 8K | 8K | no | no |
| bedrock/ca-central-1/meta.llama3-8b-instruct-v1:0 | AWS Bedrock | Text | 0.350 | 0.690 | 8K | 8K | no | no |
| bedrock/eu-west-1/meta.llama3-8b-instruct-v1:0 | AWS Bedrock | Text | 0.320 | 0.650 | 8K | 8K | no | no |
| bedrock/eu-west-2/meta.llama3-8b-instruct-v1:0 | AWS Bedrock | Text | 0.390 | 0.780 | 8K | 8K | no | no |
| bedrock/sa-east-1/meta.llama3-8b-instruct-v1:0 | AWS Bedrock | Text | 0.500 | 1.01 | 8K | 8K | no | no |
| bedrock/us-east-1/meta.llama3-8b-instruct-v1:0 | AWS Bedrock | Text | 0.300 | 0.600 | 8K | 8K | no | no |
| bedrock/us-gov-east-1/meta.llama3-8b-instruct-v1:0 | AWS Bedrock | Text | 0.300 | 2.65 | 8K | 2K | no | no |
| bedrock/us-gov-west-1/meta.llama3-8b-instruct-v1:0 | AWS Bedrock | Text | 0.300 | 2.65 | 8K | 2K | no | no |
| bedrock/us-west-1/meta.llama3-8b-instruct-v1:0 | AWS Bedrock | Text | 0.300 | 0.600 | 8K | 8K | no | no |
| meta.llama3-8b-instruct-v1:0 | AWS Bedrock | Text | 0.300 | 0.600 | 8K | 8K | no | no |
| deepinfra/meta-llama/Meta-Llama-3-8B-Instruct | DeepInfra | Text | 0.030 | 0.060 | 8K | 8K | no | yes |
| fireworks_ai/accounts/fireworks/models/llama-v3-8b | Fireworks AI | Text | 0.200 | 0.200 | 8K | 8K | no | no |
| fireworks_ai/accounts/fireworks/models/llama-v3-8b-instruct-hf | Fireworks AI | Text | 0.200 | 0.200 | 8K | 8K | no | no |
| gradient_ai/llama3-8b-instruct | Gradient AI | Text | 0.200 | 0.200 | 512 | N/A | no | no |
| novita/meta-llama/llama-3-8b-instruct | Novita AI | Text | 0.040 | 0.040 | 8K | 8K | no | no |
| ollama/llama3 | Ollama | Text | N/A | N/A | 8K | 8K | no | no |
| ollama/llama3:8b | Ollama | Text | N/A | N/A | 8K | 8K | no | no |
| replicate/meta/llama-3-8b | Replicate | Text | 0.050 | 0.250 | 8K | 8K | no | no |
| replicate/meta/llama-3-8b-instruct | Replicate | Text | 0.050 | 0.250 | 8K | 8K | no | no |
| snowflake/llama3-8b | Snowflake | Text | N/A | N/A | 8K | 8K | no | no |
| vercel_ai_gateway/meta/llama-3-8b | Vercel AI Gateway | Text | 0.050 | 0.080 | 8K | 8K | no | no |
| vertex_ai/meta/llama3-8b-instruct-maas | Vertex AI (Llama) | Text | N/A | N/A | 32K | 32K | no | no |
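
Because the variants differ in context size and function-calling support, it can help to filter them by requirement before comparing prices. The following is a minimal sketch over a handful of rows copied from the listing above. The `Variant` class, `parse_tokens`, and `find_variants` are hypothetical names, and the conversion of "8K" to 8192 tokens assumes that K means 1024, which the listing does not state.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Variant:
    key: str
    provider: str
    input_price: Optional[float]   # listed input price in $, None for "N/A"
    output_price: Optional[float]  # listed output price in $, None for "N/A"
    context: str                   # as listed, e.g. "8K" or "512"
    functions: bool                # "yes"/"no" in the Functions column


def parse_tokens(label: str) -> int:
    """Convert a label such as '8K' or '512' to a token count.

    Assumption: 'K' means 1024 tokens, so '8K' becomes 8192.
    """
    label = label.strip().upper()
    if label.endswith("K"):
        return int(label[:-1]) * 1024
    return int(label)


# A subset of the rows listed above, not the full table.
VARIANTS = [
    Variant("bedrock/us-east-1/meta.llama3-8b-instruct-v1:0", "AWS Bedrock", 0.300, 0.600, "8K", False),
    Variant("deepinfra/meta-llama/Meta-Llama-3-8B-Instruct", "DeepInfra", 0.030, 0.060, "8K", True),
    Variant("gradient_ai/llama3-8b-instruct", "Gradient AI", 0.200, 0.200, "512", False),
    Variant("novita/meta-llama/llama-3-8b-instruct", "Novita AI", 0.040, 0.040, "8K", False),
    Variant("ollama/llama3", "Ollama", None, None, "8K", False),
    Variant("vertex_ai/meta/llama3-8b-instruct-maas", "Vertex AI (Llama)", None, None, "32K", False),
]


def find_variants(min_context: int = 0, needs_functions: bool = False) -> list[Variant]:
    """Return variants meeting a minimum context size and, optionally, function support."""
    return [
        v for v in VARIANTS
        if parse_tokens(v.context) >= min_context
        and (v.functions or not needs_functions)
    ]


if __name__ == "__main__":
    for v in find_variants(needs_functions=True):
        print(v.key)
```

In this subset, requiring function calling leaves only the DeepInfra variant, and requiring more than 8K of context leaves only the Vertex AI variant.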