The Groq LPU delivers inference with the speed and cost developers need. Inference platform · OpenAI-compatible API · Fast Inference · Low Latency · Lpu · Open Weight

Intelligence vs Price

Best value among Groq models on this chart: GPT OSS 120B · GPT OSS 20B · Llama 3.1 8B Instruct. Hover any dot for full pricing, or click a creator in the legend to isolate.

Groq models

14 models, 11 with pricing
Input/1M
to
Output/1M
to
Model
Creator
Input Price, $
Output Price, $
Context
Max Output
Inference Providers
Intelligence
Coding
GPT OSS 120BOpenAI logoOpenAI0.0390.180131K131Kcompare (20)33.3#128.6#1
GPT OSS 20BOpenAI logoOpenAI0.0300.140131K131Kcompare (15)24.5#218.5#2
Llama 3.3 70B InstructMeta logoMeta0.1000.200131K120Kcompare (20)14.5#310.7#3
Llama 3.1 8B InstructMeta logoMeta0.0200.030200K128Kcompare (20)11.8#44.9#4
Gemma 7B ITGoogle logoGoogle0.0500.0808K8Kcompare (3)N/AN/A
GPT OSS 20B SafeguardOpenAI logoOpenAI0.0700.200131K66Kcompare (5)N/AN/A
Kimi K2 InstructMoonshot AI (Kimi) logoMoonshot AI (Kimi)0.5002.00262K33Kcompare (9)N/AN/A
Llama 4 17B Maverick InstructMeta logoMeta0.0500.1001.0M16Kcompare (9)N/AN/A
Llama 4 17B Scout InstructMeta logoMeta0.0500.10010.0M16Kcompare (11)N/AN/A
LlamaGuard 4 12BMeta logoMeta0.1800.180164K16Kcompare (4)N/AN/A
PlayAI TTSPlayAI logoPlayAIN/AN/A10K10Kcompare (1)N/AN/A
Qwen3 32BAlibaba logoAlibaba0.0500.100131K41Kcompare (15)N/AN/A
Whisper 3 LargeOpenAI logoOpenAIN/AN/AN/AN/Acompare (1)N/AN/A
Whisper 3 Large TurboOpenAI logoOpenAIN/AN/AN/AN/Acompare (2)N/AN/A