DeepInfra offers cost-effective, scalable, easy-to-deploy, and production-ready machine-learning models and infrastructures for deep-learning models. Inference platform · OpenAI-compatible API · Low Cost · Open Source · Serverless

Intelligence vs Price

Best value among Deep Infra models on this chart: GPT OSS 120B · GPT OSS 20B · Llama 3.1 8B Instruct (and 1 more on the dashed frontier). Hover any dot for full pricing, or click a creator in the legend to isolate.

Deep Infra models

64 models, 64 with pricing
Input/1M
to
Output/1M
to
Model
Creator
Input Price, $
Output Price, $
Context
Max Output
Inference Providers
Intelligence
Coding
GPT OSS 120BOpenAI logoOpenAI0.0390.180131K131Kcompare (20)33.3#128.6#4
Claude Sonnet 4Anthropic logoAnthropic3.0015.001.0M64Kcompare (10)33.0#330.6#3
Claude Opus 4Anthropic logoAnthropic15.0075.00410K32Kcompare (8)33.0#2N/A
Claude Sonnet 3.7Anthropic logoAnthropic3.0015.00200K128Kcompare (9)30.8#426.7#6
Gemini 2.5 ProGoogle logoGoogle1.2510.001.0M66Kcompare (7)29.5#532.0#1
DeepSeek V3.1 TerminusDeepSeek logoDeepSeek0.2700.950164K66Kcompare (6)28.5#631.9#2
DeepSeek V3.1DeepSeek logoDeepSeek0.1350.500164K66Kcompare (14)28.1#728.4#5
GLM-4.5Zhipu AI logoZhipu AI0.4001.60131K98Kcompare (8)26.4#826.3#7
Qwen3 Coder 480B A35B InstructAlibaba logoAlibaba0.2201.30262K66Kcompare (8)24.8#924.6#8
GPT OSS 20BOpenAI logoOpenAI0.0300.140131K131Kcompare (15)24.5#1018.5#10
DeepSeek V3 324DeepSeek logoDeepSeek0.2000.400164K16Kcompare (13)22.3#1122.0#9
Qwen3 Next 80B A3B InstructAlibaba logoAlibaba0.0900.900262K66Kcompare (10)20.1#1215.3#14
QwQ 32BAlibaba logoAlibaba0.1500.200131K16Kcompare (8)19.7#13N/A
DeepSeek R1DeepSeek logoDeepSeek0.2800.400164K66Kcompare (14)18.8#1415.9#13
Gemini 2.0 FlashGoogle logoGoogle0.1000.4001.0M8Kcompare (5)18.5#1513.6#17
Gemini 2.5 FlashGoogle logoGoogle0.1500.6001.0M66Kcompare (9)17.8#1617.8#11
DeepSeek R1 Distill Qwen 32BDeepSeek logoDeepSeek0.1500.150131K33Kcompare (7)17.2#17N/A
Qwen3 235B A22B InstructAlibaba logoAlibaba0.0900.580262K33Kcompare (10)17.0#1814.0#16
DeepSeek V3DeepSeek logoDeepSeek0.2000.200400K128Kcompare (12)16.5#1916.4#12
DeepSeek R1 Distill Llama 70BDeepSeek logoDeepSeek0.2000.375131K16Kcompare (11)16.0#2011.4#19
Qwen2.5 72B InstructAlibaba logoAlibaba0.1200.300131K16Kcompare (8)15.6#2111.9#18
Qwen3 30B A3BAlibaba logoAlibaba0.0800.290131K20Kcompare (7)15.0#2214.2#15
Llama 3.3 70B InstructMeta logoMeta0.1000.200131K120Kcompare (20)14.5#2310.7#23
Llama 3.1 Nemotron 70B InstructNVIDIA logoNVIDIA0.6000.600131K16Kcompare (2)13.4#2410.8#22
Nemotron Nano 2 9BNVIDIA logoNVIDIA0.0400.160131K16Kcompare (5)13.2#257.5#24
Llama 3.1 70B InstructMeta logoMeta0.1000.100131K16Kcompare (13)12.5#2610.9#21
Llama 3.1 8B InstructMeta logoMeta0.0200.030200K128Kcompare (20)11.8#274.9#25
Hermes 3 Llama 3.1 70BNous Research logoNous Research0.1200.300131K16Kcompare (3)10.6#28N/A
Phi-4Microsoft logoMicrosoft0.0650.14016K16Kcompare (3)10.4#2911.2#20
Llama 3.2 3B InstructMeta logoMeta0.0150.020131K80Kcompare (9)9.7#30N/A
Mixtral 8x7B InstructMistral AI logoMistral AI0.0700.15033K16Kcompare (9)7.7#31N/A
Llama 3 8B InstructMeta logoMeta0.0300.04032K8Kcompare (9)6.4#324.0#26
DeepSeek R1 528DeepSeek logoDeepSeek0.2000.250164K33Kcompare (12)N/AN/A
DeepSeek R1 528 TurboDeepSeek logoDeepSeek1.003.0033KN/Acompare (1)N/AN/A
DeepSeek R1 TurboDeepSeek logoDeepSeek0.7002.5064K16Kcompare (3)N/AN/A
Gemma 3 12B InstructGoogle logoGoogle0.0400.100131K16Kcompare (5)N/AN/A
Gemma 3 27B InstructGoogle logoGoogle0.0600.160131K16Kcompare (6)N/AN/A
Gemma 3 4B InstructGoogle logoGoogle0.0400.080131K16Kcompare (3)N/AN/A
Hermes 3 Llama 3.1 405BNous Research logoNous Research1.001.00131K16Kcompare (3)N/AN/A
Kimi K2 InstructMoonshot AI (Kimi) logoMoonshot AI (Kimi)0.5002.00262K33Kcompare (9)N/AN/A
L3 Lunaris 1.8B TurboSao10K0.0400.0508K8Kcompare (1)N/AN/A
L3.1 70B Euryale V2.2Sao10K0.6500.750131K8Kcompare (2)N/AN/A
L3.3 70B Euryale V2.3Sao10K0.6500.750131KN/Acompare (1)N/AN/A
Llama 3.2 11B Vision InstructMeta logoMeta0.0150.025131K16Kcompare (7)N/AN/A
Llama 3.3 70B Instruct TurboMeta logoMeta0.1300.390131KN/Acompare (3)N/AN/A
Llama 3.3 Nemotron 1.5 Super 49BNVIDIA logoNVIDIA0.1000.400131K16Kcompare (2)N/AN/A
Llama 4 17B Maverick InstructMeta logoMeta0.0500.1001.0M16Kcompare (9)N/AN/A
Llama 4 17B Scout InstructMeta logoMeta0.0500.10010.0M16Kcompare (11)N/AN/A
LlamaGuard 3 8BMeta logoMeta0.0200.030131K16Kcompare (5)N/AN/A
LlamaGuard 4 12BMeta logoMeta0.1800.180164K16Kcompare (4)N/AN/A
Mistral Small 24B InstructMistral AI logoMistral AI0.0500.08033K16Kcompare (3)N/AN/A
Mistral Small 3.2 24B InstructMistral AI logoMistral AI0.0750.200128K16Kcompare (3)N/AN/A
MythoMax L2 13BGryphe0.0600.0604K4Kcompare (4)N/AN/A
Nemo Instruct (24.07)Mistral AI logoMistral AI0.0200.040131K512compare (5)N/AN/A
OLMoCR 7BAllen AI logoAllen AI0.2701.5016K16Kcompare (1)N/AN/A
Qwen2.5 7B InstructAlibaba logoAlibaba0.0400.070131K33Kcompare (5)N/AN/A
Qwen2.5 VL 32B InstructAlibaba logoAlibaba0.2000.600131K8Kcompare (3)N/AN/A
Qwen3 14BAlibaba logoAlibaba0.0600.200132K41Kcompare (7)N/AN/A
Qwen3 235B A22BAlibaba logoAlibaba0.0710.100262K131Kcompare (9)N/AN/A
Qwen3 235B A22B ThinkingAlibaba logoAlibaba0.1490.880262K33Kcompare (9)N/AN/A
Qwen3 32BAlibaba logoAlibaba0.0500.100131K41Kcompare (15)N/AN/A
Qwen3 Coder 480B A35B Instruct TurboAlibaba logoAlibaba0.2901.20262KN/Acompare (1)N/AN/A
Qwen3 Next 80B A3B ThinkingAlibaba logoAlibaba0.0980.780262K66Kcompare (10)N/AN/A
WizardLM 2 8x22BMicrosoft logoMicrosoft0.4800.48066K8Kcompare (4)N/AN/A