Meta logo

Llama 3.1 8B Instruct Turbo


Llama 3.1 8B Instruct Turbo is Meta logoMeta's language model with a 131K context window, available from 2 providers, starting at $0.020 / 1M input and $0.030 / 1M output. A throughput-optimized 8B Llama 3.1 Instruct model using FP8 quantization for significantly faster inference in high-volume production environments.
Spec
Canonical IDmeta-llama-3-1-8b-instruct-turbo
TypeLanguage
StatusActive
CreatorMetaMeta
Providers
Context Window131K tokens
Input ModalitiesText
Output ModalitiesText
Parameters8B

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities3/13
Reasoning·
Adaptive Reasoning·
Function Calling
Parallel Function Calling
Structured Outputs
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
DeepInfra logo
DeepInfra
meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
$0.020$0.030
Together AI logo
Together AI
meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
$0.180$0.180

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Llama 3.3 70B Instruct131K$0.720$0.720Available
Llama 3.3 70B Instruct131K$0.100$0.300Available
Llama 3.3Available
Llama 3.3 70B Instruct Turbo131K$0.130$0.390Available
Llama 3.3 70B Versatile128K$0.590$0.790Available
Llama 3.3 8B Instruct128KAvailable
Llama 3.2 11B Vision Instruct128K$0.160$0.160Available
Llama 3.2 1B Instruct128K$0.027$0.080Deprecated
Llama 3.2 3B Instruct131K$0.015$0.020Deprecated
Llama 3.2 90B Vision Instruct128K$0.720$0.720Available
Llama 3.1 8B Instruct Turbo131K$0.020$0.030Current

Model IDs