DeepInfra offers cost-effective, scalable, easy-to-deploy, production-ready infrastructure for machine-learning and deep-learning models.
Inference platform · OpenAI-compatible API · Low Cost · Open Source · Serverless

Intelligence vs Price

Best value among DeepInfra models on this chart: Claude Sonnet 4 · GPT OSS 120B · GPT OSS 20B (and 2 more on the dashed frontier).

[Chart: Language Models. Axes: Intelligence and Blended Price ($); log-scale x-axis.]

DeepInfra models

64 models, 64 with pricing
Prices are in US dollars ($) per 1M tokens.
| Model | Creator | Input Price ($/1M tokens) | Output Price ($/1M tokens) | Context | Max Output | Inference Providers | Intelligence (rank) | Coding (rank) |
|---|---|---|---|---|---|---|---|---|
| Claude Sonnet 4 | Anthropic | 3.00 | 15.00 | 1.0M | 64K | 10 | 25.5 (#2) | N/A |
| Claude Opus 4 | Anthropic | 5.00 | 25.00 | 410K | 32K | 9 | 25.5 (#1) | N/A |
| GPT OSS 120B | OpenAI | 0.03 | 0.15 | 131K | 131K | 23 | 23.8 (#3) | 30.4 (#2) |
| Claude Sonnet 3.7 | Anthropic | 3.00 | 15.00 | 200K | 128K | 10 | 23.5 (#4) | N/A |
| Gemini 2.5 Pro | Google | 1.25 | 10.00 | 1.0M | 66K | 7 | 22.3 (#5) | 33.3 (#1) |
| DeepSeek V3.1 Terminus | DeepSeek | 0.27 | 0.95 | 164K | 66K | 6 | 21.4 (#6) | N/A |
| DeepSeek V3.1 | DeepSeek | 0.27 | 1.00 | 164K | 33K | 14 | 21.0 (#7) | N/A |
| GLM-4.5 | Zhipu AI | 0.4 | 1.60 | 131K | 98K | 7 | 19.5 (#8) | N/A |
| DeepSeek R1 | DeepSeek | 0.28 | 0.4 | 164K | 66K | 14 | 18.5 (#9) | 24.6 (#3) |
| Qwen3 Coder 480B A35B Instruct | Alibaba | 0.22 | 1.30 | 262K | 66K | 8 | 18.0 (#10) | N/A |
| DeepSeek V3 324 | DeepSeek | 0.2 | 0.4 | 164K | 16K | 13 | 15.4 (#11) | 21.2 (#5) |
| GPT OSS 20B | OpenAI | 0.0145 | 0.07 | 131K | 131K | 18 | 14.9 (#12) | 20.7 (#6) |
| DeepSeek V3 | DeepSeek | 0.2 | 0.2 | 164K | 82K | 12 | 14.2 (#13) | 23.0 (#4) |
| Qwen3 Next 80B A3B Instruct | Alibaba | 0.09 | 0.9 | 262K | 66K | 10 | 13.7 (#14) | N/A |
| QwQ 32B | Alibaba | 0.15 | 0.2 | 131K | 16K | 7 | 13.4 (#15) | N/A |
| Gemini 2.0 Flash | Google | 0.1 | 0.4 | 1.0M | 8K | 5 | 12.3 (#16) | N/A |
| Gemini 2.5 Flash | Google | 0.15 | 0.6 | 1.0M | 66K | 9 | 11.7 (#17) | N/A |
| DeepSeek R1 Distill Qwen 32B | DeepSeek | 0.15 | 0.15 | 131K | 32K | 6 | 11.0 (#18) | N/A |
| Qwen3 235B A22B Instruct | Alibaba | 0.09 | 0.58 | 262K | 16K | 11 | 10.9 (#19) | N/A |
| DeepSeek R1 Distill Llama 70B | DeepSeek | 0.2 | 0.375 | 131K | 8K | 11 | 9.9 (#20) | N/A |
| Qwen2.5 72B Instruct | Alibaba | 0.12 | 0.3 | 131K | 16K | 7 | 9.6 (#21) | N/A |
| Qwen3 30B A3B | Alibaba | 0.051 | 0.2 | 131K | 20K | 8 | 9.1 (#22) | N/A |
| Llama 3.3 70B Instruct | Meta | 0.1 | 0.2 | 131K | 120K | 21 | 8.6 (#23) | 11.9 (#7) |
| Llama 3.1 Nemotron 70B Instruct | NVIDIA | 0.6 | 0.6 | 131K | 16K | 2 | 7.6 (#25) | N/A |
| Llama 3.1 8B Instruct | Meta | 0.02 | 0.03 | 200K | 128K | 21 | 7.6 (#24) | 5.4 (#8) |
| Nemotron Nano 2 9B | NVIDIA | 0.04 | 0.16 | 131K | 8K | 4 | 7.4 (#26) | N/A |
| Llama 3.1 70B Instruct | Meta | 0.12 | 0.3 | 131K | 16K | 13 | 6.8 (#27) | N/A |
| Hermes 3 Llama 3.1 70B | Nous Research | 0.12 | 0.3 | 131K | 16K | 3 | 5.1 (#28) | N/A |
| Phi-4 | Microsoft | 0.07 | 0.14 | 16K | 16K | 3 | 4.9 (#29) | N/A |
| Llama 3.2 3B Instruct | Meta | 0.015 | 0.02 | 131K | 80K | 10 | 4.2 (#30) | N/A |
| Mixtral 8x7B Instruct | Mistral AI | 0.07 | 0.15 | 33K | 16K | 9 | 2.4 (#31) | N/A |
| Llama 3 8B Instruct | Meta | 0.03 | 0.04 | 32K | 8K | 9 | 1.2 (#32) | N/A |
| DeepSeek R1 528 | DeepSeek | 0.2 | 0.25 | 164K | 33K | 13 | N/A | N/A |
| DeepSeek R1 528 Turbo | DeepSeek | 1.00 | 3.00 | 33K | N/A | 1 | N/A | N/A |
| DeepSeek R1 Turbo | DeepSeek | 0.7 | 2.50 | 64K | 16K | 3 | N/A | N/A |
| Gemma 3 12B Instruct | Google | 0.05 | 0.1 | 131K | 16K | 6 | N/A | N/A |
| Gemma 3 27B Instruct | Google | 0.06 | 0.16 | 131K | 16K | 7 | N/A | N/A |
| Gemma 3 4B Instruct | Google | 0.04 | 0.08 | 131K | 16K | 3 | N/A | N/A |
| Hermes 3 Llama 3.1 405B | Nous Research | 1.00 | 1.00 | 131K | 16K | 3 | N/A | N/A |
| Kimi K2 Instruct | Moonshot AI (Kimi) | 0.5 | 2.00 | 262K | 33K | 9 | N/A | N/A |
| L3 Lunaris 1.8B Turbo | Sao10K | 0.04 | 0.05 | 8K | 8K | 1 | N/A | N/A |
| L3.1 70B Euryale V2.2 | Sao10K | 0.65 | 0.75 | 131K | 8K | 2 | N/A | N/A |
| L3.3 70B Euryale V2.3 | Sao10K | 0.65 | 0.75 | 131K | N/A | 1 | N/A | N/A |
| Llama 3.2 11B Vision Instruct | Meta | 0.015 | 0.025 | 131K | 16K | 8 | N/A | N/A |
| Llama 3.3 70B Instruct Turbo | Meta | 0.13 | 0.39 | 131K | N/A | 3 | N/A | N/A |
| Llama 3.3 Nemotron 1.5 Super 49B | NVIDIA | 0.1 | 0.4 | 131K | 16K | 2 | N/A | N/A |
| Llama 4 17B Maverick Instruct | Meta | 0.05 | 0.1 | 1.0M | 16K | 9 | N/A | N/A |
| Llama 4 17B Scout Instruct | Meta | 0.05 | 0.1 | 10.0M | 16K | 12 | N/A | N/A |
| LlamaGuard 3 8B | Meta | 0.02 | 0.03 | 131K | 16K | 5 | N/A | N/A |
| LlamaGuard 4 12B | Meta | 0.18 | 0.18 | 164K | 16K | 4 | N/A | N/A |
| Mistral Small 24B Instruct | Mistral AI | 0.05 | 0.08 | 33K | 16K | 3 | N/A | N/A |
| Mistral Small 3.2 24B Instruct | Mistral AI | 0.075 | 0.2 | 128K | 33K | 4 | N/A | N/A |
| MythoMax L2 13B | Gryphe | 0.06 | 0.06 | 4K | 4K | 4 | N/A | N/A |
| Nemo Instruct (24.07) | Mistral AI | 0.02 | 0.04 | 131K | 512 | 5 | N/A | N/A |
| OLMoCR 7B | Allen AI | 0.27 | 1.50 | 16K | 16K | 1 | N/A | N/A |
| Qwen2.5 7B Instruct | Alibaba | 0.04 | 0.07 | 131K | 33K | 4 | N/A | N/A |
| Qwen2.5 VL 32B Instruct | Alibaba | 0.2 | 0.6 | 128K | 8K | 2 | N/A | N/A |
| Qwen3 14B | Alibaba | 0.06 | 0.2 | 132K | 41K | 7 | N/A | N/A |
| Qwen3 235B A22B | Alibaba | 0.09 | 0.1 | 262K | 131K | 9 | N/A | N/A |
| Qwen3 235B A22B Thinking | Alibaba | 0.149 | 0.88 | 262K | 33K | 9 | N/A | N/A |
| Qwen3 32B | Alibaba | 0.05 | 0.1 | 131K | 41K | 15 | N/A | N/A |
| Qwen3 Coder 480B A35B Instruct Turbo | Alibaba | 0.29 | 1.20 | 262K | N/A | 1 | N/A | N/A |
| Qwen3 Next 80B A3B Thinking | Alibaba | 0.0975 | 0.78 | 262K | 66K | 9 | N/A | N/A |
| WizardLM 2 8x22B | Microsoft | 0.48 | 0.48 | 66K | 8K | 4 | N/A | N/A |
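
The listing gives input and output prices separately, per 1M tokens, while the chart plots a single "Blended Price". The page does not state how that blend is computed. As a rough sketch, the Python below estimates the cost of one request from the listed prices and computes a weighted blend. The 3:1 input-to-output weighting, the `Pricing` class, and the function names are assumptions made for illustration, not something the page defines; the three price pairs are copied from the listing above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Prices in US dollars per 1M tokens, as listed on the page."""
    input_per_million: float
    output_per_million: float


# A few rows copied from the listing; extend as needed.
PRICES = {
    "Claude Sonnet 4": Pricing(3.00, 15.00),
    "GPT OSS 120B": Pricing(0.03, 0.15),
    "Llama 3.1 8B Instruct": Pricing(0.02, 0.03),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Dollar cost of a single request with the given token counts."""
    p = PRICES[model]
    return (
        input_tokens * p.input_per_million
        + output_tokens * p.output_per_million
    ) / 1_000_000


def blended_price(model: str, input_weight: float = 3.0,
                  output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    The default 3:1 weighting is an assumption, not taken from the page.
    """
    p = PRICES[model]
    total = input_weight + output_weight
    return (
        input_weight * p.input_per_million
        + output_weight * p.output_per_million
    ) / total


if __name__ == "__main__":
    for name in PRICES:
        cost = request_cost(name, input_tokens=2_000, output_tokens=500)
        print(f"{name}: ${cost:.6f} per request, "
              f"${blended_price(name):.4f} blended per 1M tokens")
```

Changing `input_weight` and `output_weight` to match a real workload (for example, output-heavy generation) can reorder which models look cheapest, so a blended figure is best read alongside the separate input and output prices.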