Meta logo

Llama 3.3 70B Instruct Turbo


Llama 3.3 70B Instruct Turbo is Meta logoMeta's language model with a 131K context window, available from 3 providers, starting at $0.130 / 1M input and $0.390 / 1M output. FP8-quantized turbo variant of Llama 3.3 70B Instruct, delivering significantly faster inference speeds with minimal accuracy trade-off.
Spec
Canonical IDmeta-llama-3-3-70b-instruct-turbo
TypeLanguage
StatusActive
CreatorMetaMeta
Providers
Context Window131K tokens
Input ModalitiesText
Output ModalitiesText
Parameters70B

Capabilities

Input1/5
Text
Image·
Audio·
Video·
PDF·
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities3/13
Reasoning·
Adaptive Reasoning·
Function Calling
Parallel Function Calling
Structured Outputs
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching·
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
DeepInfra logo
DeepInfra
deepinfra/meta-llama/Llama-3.3-70B-Instruct-Turbo
$0.130$0.390
Hugging Face logo
Hugging Face
together_ai:meta-llama/Llama-3.3-70B-Instruct-Turbo
$0.880$0.880
Together AI logo
Together AI
together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo
$0.880$0.880

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
Llama 3.2 11B128K$0.160$0.160Available
Llama 3.2 11B Instruct128K$0.350$0.350Deprecated
Llama 3.2 1B Instruct128K$0.027$0.080Deprecated
Llama 3.2 3B Instruct131K$0.015$0.020Deprecated
Llama 3.2 90B128K$0.720$0.720Available
Llama 3.2 90B Instruct128K$2.00$2.00Deprecated
Llama 3.2 1B131K$0.100$0.100Available
Llama 3.2 3B131K$0.040$0.080Available
Llama 3.1 405B Instruct131K$0.120$0.300Deprecating
Llama 3.1 70B128K$0.600$0.600Available
Llama 3.3 70B Instruct Turbo131K$0.130$0.390Current

Model IDs