The Groq LPU delivers inference with the speed and cost developers need.

Inference platform · OpenAI-compatible API · Fast inference · Low latency · LPU · Open weight

Intelligence vs Price

Best value among Groq models on this chart: GPT OSS 120B, GPT OSS 20B, and Llama 3.1 8B Instruct.

[Chart: language models plotted by Intelligence against Blended Price ($), with a log-scale option for the x-axis.]

Groq models

14 models, 11 with pricing
Prices are in US dollars per 1M tokens. Intelligence and Coding scores are shown with their rank among Groq models in parentheses.

| Model | Creator | Input Price ($/1M tokens) | Output Price ($/1M tokens) | Context | Max Output | Inference Providers | Intelligence | Coding |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| GPT OSS 120B | OpenAI | 0.039 | 0.18 | 131K | 131K | 21 | 33.3 (#1) | 28.6 (#1) |
| GPT OSS 20B | OpenAI | 0.029 | 0.14 | 131K | 131K | 16 | 24.5 (#2) | 18.5 (#2) |
| Llama 3.3 70B Instruct | Meta | 0.1 | 0.2 | 131K | 120K | 20 | 14.5 (#3) | 10.7 (#3) |
| Llama 3.1 8B Instruct | Meta | 0.02 | 0.03 | 200K | 128K | 21 | 11.8 (#4) | 4.9 (#4) |
| Gemma 7B IT | Google | 0.05 | 0.08 | 8K | 8K | 3 | N/A | N/A |
| GPT OSS 20B Safeguard | OpenAI | 0.07 | 0.2 | 131K | 66K | 5 | N/A | N/A |
| Kimi K2 Instruct | Moonshot AI (Kimi) | 0.5 | 2.00 | 262K | 33K | 9 | N/A | N/A |
| Llama 4 17B Maverick Instruct | Meta | 0.05 | 0.1 | 1.0M | 16K | 9 | N/A | N/A |
| Llama 4 17B Scout Instruct | Meta | 0.05 | 0.1 | 10.0M | 16K | 12 | N/A | N/A |
| LlamaGuard 4 12B | Meta | 0.18 | 0.18 | 164K | 16K | 4 | N/A | N/A |
| PlayAI TTS | PlayAI | N/A | N/A | 10K | 10K | 1 | N/A | N/A |
| Qwen3 32B | Alibaba | 0.05 | 0.1 | 131K | 41K | 15 | N/A | N/A |
| Whisper 3 Large | OpenAI | N/A | N/A | N/A | N/A | 1 | N/A | N/A |
| Whisper 3 Large Turbo | OpenAI | N/A | N/A | N/A | N/A | 3 | N/A | N/A |
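
Because prices are quoted per 1M tokens, the cost of a single request is the input and output token counts weighted by their respective prices, divided by one million. The sketch below illustrates that arithmetic for a few of the listed models. The `PRICES_PER_1M` dictionary and the `estimate_cost` helper are hypothetical names introduced for this example, not part of any Groq API, and the prices are copied from the listing above and may be out of date.

```python
# Prices in USD per 1M tokens as (input, output), copied from the listing above.
# Assumption: these figures are illustrative and may not reflect current pricing.
PRICES_PER_1M = {
    "GPT OSS 120B": (0.039, 0.18),
    "GPT OSS 20B": (0.029, 0.14),
    "Llama 3.3 70B Instruct": (0.1, 0.2),
    "Llama 3.1 8B Instruct": (0.02, 0.03),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    input_price, output_price = PRICES_PER_1M[model]
    # Each price applies per 1,000,000 tokens, so scale the weighted sum down.
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    cost = estimate_cost("GPT OSS 120B", input_tokens=2_000, output_tokens=500)
    print(f"${cost:.6f}")  # prints "$0.000168"
```

For example, a request to GPT OSS 120B with 2,000 input tokens and 500 output tokens costs about (2,000 × 0.039 + 500 × 0.18) / 1,000,000 = $0.000168 at the listed prices.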