DeepInfra offers cost-effective, scalable, easy-to-deploy, production-ready machine-learning models and infrastructure for deep-learning models.
Inference platform · OpenAI-compatible API · Low Cost · Open Source · Serverless

Intelligence vs Price

Best value among DeepInfra models on this chart: GPT OSS 120B, GPT OSS 20B, and Llama 3.1 8B Instruct (plus 1 more on the dashed frontier).

[Chart: Language Models. Intelligence plotted against Blended Price ($), with an optional log-scale X axis.]
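The "dashed frontier" on the chart can be read as a Pareto frontier: the models for which no cheaper model scores higher. The sketch below illustrates that idea on a few rows from the table. It is an illustration under stated assumptions, not the chart's actual method: the page does not define "Blended Price", so the 3:1 input-to-output weighting used here is an assumption, and the names `Model`, `blended_price`, and `value_frontier` are hypothetical.

```python
from typing import NamedTuple


class Model(NamedTuple):
    name: str
    input_price: float   # $ per 1M input tokens
    output_price: float  # $ per 1M output tokens
    intelligence: float  # Intelligence score from the table


def blended_price(m: Model, input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output price.

    ASSUMPTION: the 3:1 weighting is illustrative; the page does not
    state how its blended price is computed.
    """
    total = input_weight + output_weight
    return (input_weight * m.input_price + output_weight * m.output_price) / total


def value_frontier(models: list[Model]) -> list[Model]:
    """Return models not dominated by a cheaper, higher-scoring model."""
    # Cheapest first; on a price tie, the higher score comes first.
    ranked = sorted(models, key=lambda m: (blended_price(m), -m.intelligence))
    frontier = []
    best_score = float("-inf")
    for m in ranked:
        # A model joins the frontier only if it beats every cheaper model.
        if m.intelligence > best_score:
            frontier.append(m)
            best_score = m.intelligence
    return frontier


# A small sample of rows copied from the pricing table.
SAMPLE = [
    Model("GPT OSS 120B", 0.039, 0.18, 33.3),
    Model("Claude Sonnet 4", 3.00, 15.00, 33.0),
    Model("Gemini 2.5 Pro", 1.25, 10.00, 29.5),
    Model("GPT OSS 20B", 0.029, 0.14, 24.5),
    Model("Llama 3.1 8B Instruct", 0.02, 0.03, 11.8),
    Model("Llama 3.2 3B Instruct", 0.015, 0.02, 9.7),
]

for m in value_frontier(SAMPLE):
    print(f"{m.name}: ${blended_price(m):.4f} blended, intelligence {m.intelligence}")
```

On this sample, the frontier contains the four cheapest models, because each one scores higher than every model priced below it. A different weighting or a different set of rows could change the result.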

DeepInfra models

64 models, 64 with pricing
Prices are in US dollars ($) per 1M tokens, listed separately for input and output.
| Model | Creator | Input Price ($/1M tokens) | Output Price ($/1M tokens) | Context | Max Output | Inference Providers | Intelligence (rank) | Coding (rank) |
|---|---|---|---|---|---|---|---|---|
| GPT OSS 120B | OpenAI | 0.039 | 0.18 | 131K | 131K | 21 | 33.3 (#1) | 28.6 (#4) |
| Claude Sonnet 4 | Anthropic | 3.00 | 15.00 | 1.0M | 64K | 10 | 33.0 (#3) | 30.6 (#3) |
| Claude Opus 4 | Anthropic | 5.00 | 25.00 | 410K | 32K | 9 | 33.0 (#2) | N/A |
| Claude Sonnet 3.7 | Anthropic | 3.00 | 15.00 | 200K | 128K | 10 | 30.8 (#4) | 26.7 (#6) |
| Gemini 2.5 Pro | Google | 1.25 | 10.00 | 1.0M | 66K | 7 | 29.5 (#5) | 32.0 (#1) |
| DeepSeek V3.1 Terminus | DeepSeek | 0.27 | 0.95 | 164K | 66K | 6 | 28.5 (#6) | 31.9 (#2) |
| DeepSeek V3.1 | DeepSeek | 0.27 | 1.00 | 164K | 66K | 13 | 28.1 (#7) | 28.4 (#5) |
| GLM-4.5 | Zhipu AI | 0.4 | 1.60 | 131K | 98K | 8 | 26.4 (#8) | 26.3 (#7) |
| Qwen3 Coder 480B A35B Instruct | Alibaba | 0.22 | 1.30 | 262K | 66K | 8 | 24.8 (#9) | 24.6 (#8) |
| GPT OSS 20B | OpenAI | 0.029 | 0.14 | 131K | 131K | 16 | 24.5 (#10) | 18.5 (#10) |
| DeepSeek V3 324 | DeepSeek | 0.2 | 0.4 | 164K | 16K | 13 | 22.3 (#11) | 22.0 (#9) |
| Qwen3 Next 80B A3B Instruct | Alibaba | 0.09 | 0.9 | 262K | 66K | 10 | 20.1 (#12) | 15.3 (#14) |
| QwQ 32B | Alibaba | 0.15 | 0.2 | 131K | 16K | 9 | 19.7 (#13) | N/A |
| DeepSeek R1 | DeepSeek | 0.28 | 0.4 | 164K | 66K | 14 | 18.8 (#14) | 15.9 (#13) |
| Gemini 2.0 Flash | Google | 0.1 | 0.4 | 1.0M | 8K | 5 | 18.5 (#15) | 13.6 (#17) |
| Gemini 2.5 Flash | Google | 0.15 | 0.6 | 1.0M | 66K | 9 | 17.8 (#16) | 17.8 (#11) |
| DeepSeek R1 Distill Qwen 32B | DeepSeek | 0.15 | 0.15 | 131K | 33K | 8 | 17.2 (#17) | N/A |
| Qwen3 235B A22B Instruct | Alibaba | 0.09 | 0.58 | 262K | 33K | 10 | 17.0 (#18) | 14.0 (#16) |
| DeepSeek V3 | DeepSeek | 0.2 | 0.2 | 164K | 82K | 11 | 16.5 (#19) | 16.4 (#12) |
| DeepSeek R1 Distill Llama 70B | DeepSeek | 0.2 | 0.375 | 131K | 8K | 11 | 16.0 (#20) | 11.4 (#19) |
| Qwen2.5 72B Instruct | Alibaba | 0.12 | 0.3 | 131K | 16K | 8 | 15.6 (#21) | 11.9 (#18) |
| Qwen3 30B A3B | Alibaba | 0.051 | 0.29 | 131K | 20K | 7 | 15.0 (#22) | 14.2 (#15) |
| Llama 3.3 70B Instruct | Meta | 0.1 | 0.2 | 131K | 120K | 20 | 14.5 (#23) | 10.7 (#23) |
| Llama 3.1 Nemotron 70B Instruct | NVIDIA | 0.6 | 0.6 | 131K | 16K | 2 | 13.4 (#24) | 10.8 (#22) |
| Nemotron Nano 2 9B | NVIDIA | 0.04 | 0.16 | 131K | 8K | 4 | 13.2 (#25) | 7.5 (#24) |
| Llama 3.1 70B Instruct | Meta | 0.12 | 0.3 | 131K | 16K | 13 | 12.5 (#26) | 10.9 (#21) |
| Llama 3.1 8B Instruct | Meta | 0.02 | 0.03 | 200K | 128K | 21 | 11.8 (#27) | 4.9 (#25) |
| Hermes 3 Llama 3.1 70B | Nous Research | 0.12 | 0.3 | 131K | 16K | 3 | 10.6 (#28) | N/A |
| Phi-4 | Microsoft | 0.065 | 0.14 | 16K | 16K | 3 | 10.4 (#29) | 11.2 (#20) |
| Llama 3.2 3B Instruct | Meta | 0.015 | 0.02 | 131K | 80K | 10 | 9.7 (#30) | N/A |
| Mixtral 8x7B Instruct | Mistral AI | 0.07 | 0.15 | 33K | 16K | 9 | 7.7 (#31) | N/A |
| Llama 3 8B Instruct | Meta | 0.03 | 0.04 | 32K | 8K | 9 | 6.4 (#32) | 4.0 (#26) |
| DeepSeek R1 528 | DeepSeek | 0.2 | 0.25 | 164K | 33K | 12 | N/A | N/A |
| DeepSeek R1 528 Turbo | DeepSeek | 1.00 | 3.00 | 33K | N/A | 1 | N/A | N/A |
| DeepSeek R1 Turbo | DeepSeek | 0.7 | 2.50 | 64K | 16K | 3 | N/A | N/A |
| Gemma 3 12B Instruct | Google | 0.05 | 0.1 | 131K | 16K | 6 | N/A | N/A |
| Gemma 3 27B Instruct | Google | 0.06 | 0.16 | 131K | 16K | 6 | N/A | N/A |
| Gemma 3 4B Instruct | Google | 0.04 | 0.08 | 131K | 16K | 3 | N/A | N/A |
| Hermes 3 Llama 3.1 405B | Nous Research | 1.00 | 1.00 | 131K | 16K | 3 | N/A | N/A |
| Kimi K2 Instruct | Moonshot AI (Kimi) | 0.5 | 2.00 | 262K | 33K | 9 | N/A | N/A |
| L3 Lunaris 1.8B Turbo | Sao10K | 0.04 | 0.05 | 8K | 8K | 1 | N/A | N/A |
| L3.1 70B Euryale V2.2 | Sao10K | 0.65 | 0.75 | 131K | 8K | 2 | N/A | N/A |
| L3.3 70B Euryale V2.3 | Sao10K | 0.65 | 0.75 | 131K | N/A | 1 | N/A | N/A |
| Llama 3.2 11B Vision Instruct | Meta | 0.015 | 0.025 | 131K | 16K | 8 | N/A | N/A |
| Llama 3.3 70B Instruct Turbo | Meta | 0.13 | 0.39 | 131K | N/A | 3 | N/A | N/A |
| Llama 3.3 Nemotron 1.5 Super 49B | NVIDIA | 0.1 | 0.4 | 131K | 16K | 2 | N/A | N/A |
| Llama 4 17B Maverick Instruct | Meta | 0.05 | 0.1 | 1.0M | 16K | 9 | N/A | N/A |
| Llama 4 17B Scout Instruct | Meta | 0.05 | 0.1 | 10.0M | 16K | 12 | N/A | N/A |
| LlamaGuard 3 8B | Meta | 0.02 | 0.03 | 131K | 16K | 5 | N/A | N/A |
| LlamaGuard 4 12B | Meta | 0.18 | 0.18 | 164K | 16K | 4 | N/A | N/A |
| Mistral Small 24B Instruct | Mistral AI | 0.05 | 0.08 | 33K | 16K | 3 | N/A | N/A |
| Mistral Small 3.2 24B Instruct | Mistral AI | 0.075 | 0.2 | 128K | 16K | 3 | N/A | N/A |
| MythoMax L2 13B | Gryphe | 0.06 | 0.06 | 4K | 4K | 4 | N/A | N/A |
| Nemo Instruct (24.07) | Mistral AI | 0.02 | 0.04 | 131K | 512 | 5 | N/A | N/A |
| OLMoCR 7B | Allen AI | 0.27 | 1.50 | 16K | 16K | 1 | N/A | N/A |
| Qwen2.5 7B Instruct | Alibaba | 0.04 | 0.07 | 131K | 33K | 5 | N/A | N/A |
| Qwen2.5 VL 32B Instruct | Alibaba | 0.2 | 0.6 | 131K | 8K | 3 | N/A | N/A |
| Qwen3 14B | Alibaba | 0.06 | 0.2 | 132K | 41K | 7 | N/A | N/A |
| Qwen3 235B A22B | Alibaba | 0.09 | 0.1 | 262K | 131K | 9 | N/A | N/A |
| Qwen3 235B A22B Thinking | Alibaba | 0.1 | 0.1 | 262K | 33K | 9 | N/A | N/A |
| Qwen3 32B | Alibaba | 0.05 | 0.1 | 131K | 41K | 15 | N/A | N/A |
| Qwen3 Coder 480B A35B Instruct Turbo | Alibaba | 0.29 | 1.20 | 262K | N/A | 1 | N/A | N/A |
| Qwen3 Next 80B A3B Thinking | Alibaba | 0.0975 | 0.78 | 262K | 66K | 10 | N/A | N/A |
| WizardLM 2 8x22B | Microsoft | 0.48 | 0.48 | 66K | 8K | 4 | N/A | N/A |
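Because prices are quoted per 1M tokens, the cost of a workload is the token count times the listed price, divided by one million, summed over input and output. A minimal sketch of that arithmetic follows. The function name `workload_cost` and the token volumes are hypothetical; the prices are the GPT OSS 120B and Claude Sonnet 4 figures listed above.

```python
def workload_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float,
    output_price_per_m: float,
) -> float:
    """Cost in dollars, given prices quoted in $ per 1M tokens."""
    return (input_tokens * input_price_per_m + output_tokens * output_price_per_m) / 1_000_000


# Hypothetical workload: 2M input tokens and 500K output tokens.
INPUT_TOKENS = 2_000_000
OUTPUT_TOKENS = 500_000

# (input price, output price) in $ per 1M tokens, as listed in the table.
PRICES = {
    "GPT OSS 120B": (0.039, 0.18),
    "Claude Sonnet 4": (3.00, 15.00),
}

for name, (in_price, out_price) in PRICES.items():
    cost = workload_cost(INPUT_TOKENS, OUTPUT_TOKENS, in_price, out_price)
    print(f"{name}: ${cost:.3f}")
```

For this workload the sketch gives about $0.168 for GPT OSS 120B and $13.50 for Claude Sonnet 4, which shows how much the output price dominates once output prices are several times the input price.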