Groq AI models — pricing & benchmarks

Official siteDocsPricingAPI

The Groq LPU delivers inference with the speed and cost developers need. Inference platform · OpenAI-compatible API · Fast Inference · Low Latency · Lpu · Open Weight

Intelligence vs Price

Best value among Groq models on this chart: GPT OSS 120B · GPT OSS 20B · Llama 3.1 8B Instruct. Prices use each model's lowest available Groq price across regions. Hover any dot for full pricing, or click a creator in the legend to isolate.

Groq models

14 models in Global, 11 with pricing

All Model Types

All Creators

US Dollar ($)

Per 1M tokens

Model	Creator	Input Price, $	Output Price, $	Context	Max Output	Inference Providers	Intelligence	Coding
GPT OSS 120B	OpenAI	0.15	0.6	131K	131K	compare (24)	23.8#1	30.4#1
GPT OSS 20B	OpenAI	0.075	0.3	131K	131K	compare (18)	14.9#2	20.7#2
Llama 3.3 70B Instruct	Meta	0.59	0.79	131K	128K	compare (21)	9.4#3	11.9#3
Llama 3.1 8B Instruct	Meta	0.05	0.08	200K	128K	compare (21)	7.6#4	5.4#4
Gemma 7B IT	Google	0.05	0.08	8K	8K	compare (3)	N/A	N/A
GPT OSS 20B Safeguard	OpenAI	0.075	0.3	131K	66K	compare (5)	N/A	N/A
Kimi K2 Instruct	Moonshot AI (Kimi)	1.00	3.00	262K	33K	compare (9)	N/A	N/A
Llama 4 17B Maverick Instruct	Meta	0.2	0.6	1.0M	16K	compare (9)	N/A	N/A
Llama 4 17B Scout Instruct	Meta	0.11	0.34	10.0M	16K	compare (12)	N/A	N/A
LlamaGuard 4 12B	Meta	0.2	0.2	1.0M	16K	compare (4)	N/A	N/A
PlayAI TTS	PlayAI	N/A	N/A	10K	10K	compare (1)	N/A	N/A
Qwen3 32B	Alibaba	0.29	0.59	131K	41K	compare (15)	N/A	N/A
Whisper 3 Large	OpenAI	N/A	N/A	N/A	N/A	compare (1)	N/A	N/A
Whisper 3 Large Turbo	OpenAI	N/A	N/A	N/A	N/A	compare (3)	N/A	N/A

Groq

Intelligence vs Price

Groq modelsAPIGET/api/v1/providers/groq/models

Groq models