Name: Llama 3.3 70B Instruct FP8 Fast
Brand: Meta

Llama 3.3 70B Instruct FP8 Fast is Meta's language model with a 24K context window, starting at $0.293 / 1M input and $2.25 / 1M output. Llama 3.3 70B instruction-tuned model quantized to FP8 precision and further optimized for throughput-focused fast inference deployments.

Specifications
Canonical ID	`meta-llama-3-3-70b-instruct-fp8-fast`
Type	Language
Status	Active
Creator	Meta
Providers	Cloudflare Workers AI
Context Window	24K tokens
Input Modalities	Text
Output Modalities	Text

Capabilities

Input1/5

Text✓

Image·

Audio·

Video·

PDF·

Output1/5

Text✓

Image·

Audio·

Video·

Embedding·

Capabilities1/13

Reasoning·

Adaptive Reasoning·

Function Calling✓

Parallel Function Calling·

Structured Outputs·

Native JSON Schema·

Web Search·

URL Context·

Computer Use·

Code Execution·

File Search·

Prompt Caching·

Assistant Prefill·

Pricing by Provider

US Dollar ($)

Per 1M tokens

Provider	Standard
Provider	Input $ / 1M	Output $ / 1M
Cloudflare Workers AI `@cf/meta/llama-3.3-70b-instruct-fp8-fast`	$0.293	$2.25

Cost Calculator

US Dollar ($)

Preset:

Input tokens

Output tokens

Number of calls

Versions

Version	Released	Context	Input / 1M	Output / 1M	Status
Llama 3.3 70B Instruct	2024-12-06	131K	$0.120	$0.200	Deprecated
Llama 3.2 3B Instruct	2024-09-25	131K	$0.015	$0.020	Deprecated
Llama 3.2 1B Instruct	2024-09-25	128K	$0.027	$0.080	Deprecated
Llama 3.2 11B	2024-09-25	128K	$0.160	$0.160	Available
Llama 3.1 405B Instruct	2024-07-23	131K	$0.120	$0.300	Deprecated
Llama 3.1 8B Instruct	2024-07-23	200K	$0.020	$0.030	Deprecated
Llama 3.1 70B Instruct	2024-07-23	131K	$0.120	$0.300	Available
Llama 3.1 70B	2024-07-23	128K	$0.360	$0.360	Available
Llama 3.1 8B	2024-07-23	131K	$0.030	$0.050	Available
Llama 3 70B Instruct	2024-04-23	131K	$0.120	$0.300	Deprecated
Llama 3.3 70B Instruct FP8 Fast	—	24K	$0.293	$2.25	Current

Model IDs

@cf/meta/llama-3.3-70b-instruct-fp8-fast

cloudflare/@cf/meta/llama-3.3-70b-instruct-fp8-fast

meta-llama-3-3-70b-instruct-fp8-fast

Llama 3.3 70B Instruct FP8 Fast

CapabilitiesAPIGET/api/v1/models/meta-llama-3-3-70b-instruct-fp8-fast

Pricing by ProviderAPIGET/api/v1/models/meta-llama-3-3-70b-instruct-fp8-fast/pricing

Cost CalculatorAPIGET/api/v1/models/meta-llama-3-3-70b-instruct-fp8-fast/pricing/calculate?input_tokens=1000000&output_tokens=500000

VersionsAPIGET/api/v1/models?family=llama

Model IDsAPIGET/api/v1/models/meta-llama-3-3-70b-instruct-fp8-fast

Capabilities

Pricing by Provider

Cost Calculator

Versions

Model IDs