Compare AI model pricing and benchmarks across providers including OpenAI, Anthropic, Google, AWS Bedrock, Azure, Mistral, and more. Filter by model capabilities like vision, function calling, and reasoning support. Find the most cost-effective model for your use case. Currently tracking 1,870 models across 102 providers.
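
As a rough illustration of what "most cost-effective" can mean, the sketch below estimates the bill for a workload from per-1M-token prices and picks the cheapest model that clears a minimum Intelligence score. The four sample rows are copied from the table on this page. The `Row` type, the function names, the workload size, and the score threshold are assumptions made for illustration, not part of the site.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Row:
    model: str
    provider: str
    input_per_1m: float   # USD per 1M input tokens
    output_per_1m: float  # USD per 1M output tokens
    intelligence: float   # Intelligence score as listed in the table


# Sample rows copied from the table on this page.
ROWS = [
    Row("GPT-5.4", "OpenAI", 2.50, 15.00, 57.2),
    Row("Gemini 3.1 Pro Preview", "Google Gemini", 2.00, 12.00, 57.2),
    Row("Kimi K2.5", "Together AI", 0.500, 2.80, 46.8),
    Row("DeepSeek V3.2", "DeepSeek", 0.280, 0.400, 32.1),
]


def workload_cost(row, input_tokens, output_tokens):
    """Cost in USD for the given numbers of input and output tokens."""
    total = input_tokens * row.input_per_1m + output_tokens * row.output_per_1m
    return total / 1_000_000


def cheapest(rows, input_tokens, output_tokens, min_intelligence):
    """Cheapest row scoring at least min_intelligence, or None if none qualifies."""
    eligible = [r for r in rows if r.intelligence >= min_intelligence]
    if not eligible:
        return None
    return min(eligible, key=lambda r: workload_cost(r, input_tokens, output_tokens))


if __name__ == "__main__":
    # Hypothetical workload: 50M input tokens and 10M output tokens.
    best = cheapest(ROWS, 50_000_000, 10_000_000, min_intelligence=45.0)
    print(best.model, best.provider, workload_cost(best, 50_000_000, 10_000_000))
```

Because output tokens are usually priced several times higher than input tokens, the ranking can change with the input/output mix, so it is worth running the estimate with numbers close to the real workload.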

Pricing data comes from LiteLLM, which is maintained by the open-source community, and benchmark data comes from Artificial Analysis. Last updated on March 21, 2026 at 12:00 AM UTC.
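
LiteLLM's price map stores costs per token, while this page lists prices per 1M tokens. The sketch below shows one way such an entry could be turned into a row and filtered by capability. It assumes entries use keys such as `input_cost_per_token`, `output_cost_per_token`, `max_input_tokens`, `max_output_tokens`, `litellm_provider`, and `supports_*` flags. The sample entries, the model names, and the helper functions are hypothetical and hand-written for illustration; this is not how the site itself is known to process the data.

```python
# Hand-written sample shaped like entries in a LiteLLM-style price map.
# Model names and all values are hypothetical.
SAMPLE_PRICE_MAP = {
    "example-provider/example-model": {
        "litellm_provider": "example-provider",
        "input_cost_per_token": 2e-06,
        "output_cost_per_token": 1.2e-05,
        "max_input_tokens": 1_048_576,
        "max_output_tokens": 65_536,
        "supports_vision": True,
        "supports_function_calling": True,
    },
    "example-provider/no-price-model": {
        "litellm_provider": "example-provider",
        "max_input_tokens": 128_000,
        "max_output_tokens": 16_384,
    },
}


def per_million(cost_per_token):
    """Convert a per-token price to USD per 1M tokens; a missing price stays None."""
    if cost_per_token is None:
        return None
    return round(cost_per_token * 1_000_000, 6)


def to_row(name, entry):
    """Build one table-like row from a price-map entry."""
    return {
        "model": name,
        "provider": entry.get("litellm_provider"),
        "input_per_1m": per_million(entry.get("input_cost_per_token")),
        "output_per_1m": per_million(entry.get("output_cost_per_token")),
        "context": entry.get("max_input_tokens"),
        "max_output": entry.get("max_output_tokens"),
    }


def with_capability(price_map, flag):
    """Names of models whose entry sets the given supports_* flag to True."""
    return [name for name, entry in price_map.items() if entry.get(flag) is True]


if __name__ == "__main__":
    for name, entry in SAMPLE_PRICE_MAP.items():
        print(to_row(name, entry))
    print(with_capability(SAMPLE_PRICE_MAP, "supports_vision"))
```

Entries without a price are kept with `None` rather than dropped, which corresponds to the N/A cells in the table.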

| Model | Provider | Input Price, $ per 1M tokens | Output Price, $ per 1M tokens | Context | Max Output | Intelligence (rank) | Coding (rank) |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Gemini 3.1 Pro Preview | Google Vertex AI | 2.00 | 12.00 | 1.0M | 66K | 57.2 (#1) | 55.5 (#2) |
| Gemini 3.1 Pro Preview | OpenRouter | 2.00 | 12.00 | 1.0M | 66K | 57.2 (#1) | 55.5 (#2) |
| GPT-5.4 | OpenAI | 2.50 | 15.00 | 1.1M | 128K | 57.2 (#1) | 57.3 (#1) |
| Gemini 3.1 Pro Preview | Google Gemini | 2.00 | 12.00 | 1.0M | 66K | 57.2 (#1) | 55.5 (#2) |
| GPT-5.4 | Azure OpenAI | 2.50 | 15.00 | 1.1M | 128K | 57.2 (#1) | 57.3 (#1) |
| GPT-5.3-chat | OpenAI | 1.75 | 14.00 | 128K | 16K | 54.0 (#3) | 53.1 (#3) |
| GPT-5.2 | OpenRouter | 1.75 | 14.00 | 272K | 128K | 51.3 (#4) | 48.7 (#5) |
| GPT-5.2-chat | OpenAI | 1.75 | 14.00 | 128K | 16K | 51.3 (#4) | 48.7 (#5) |
| GPT-5.2 | GMI Cloud | 1.75 | 14.00 | 410K | 32K | 51.3 (#4) | 48.7 (#5) |
| GPT-5.2 | GitHub Copilot | N/A | N/A | 128K | 64K | 51.3 (#4) | 48.7 (#5) |
| GPT-5.2 | Azure OpenAI | 1.75 | 14.00 | 272K | 128K | 51.3 (#4) | 48.7 (#5) |
| Glm 5 | Z AI (Zhipu) | 1.00 | 3.20 | 200K | 128K | 49.8 (#5) | 44.2 (#11) |
| Glm 5 Maas | Vertex AI (Z AI) | 1.00 | 3.20 | 200K | 128K | 49.8 (#5) | 44.2 (#11) |
| Glm 5 | OpenRouter | 0.800 | 2.56 | 203K | 128K | 49.8 (#5) | 44.2 (#11) |
| GPT-5.2-codex | OpenRouter | 1.75 | 14.00 | 272K | 128K | 49.0 (#6) | 43.0 (#13) |
| Grok 4.20 Beta 0309 Non Reasoning | xAI | 2.00 | 6.00 | 2.0M | 2.0M | 48.5 (#7) | 42.2 (#15) |
| Gemini 3 Pro Preview | Google Vertex AI | 2.00 | 12.00 | 1.0M | 66K | 48.4 (#8) | 46.5 (#8) |
| Gemini 3 Pro | Replicate | 2.00 | 12.00 | N/A | N/A | 48.4 (#8) | 46.5 (#8) |
| Gemini 3 Pro Preview | OpenRouter | 2.00 | 12.00 | 1.0M | 66K | 48.4 (#8) | 46.5 (#8) |
| Gemini 3 Pro Preview | GMI Cloud | 2.00 | 12.00 | 1.0M | 66K | 48.4 (#8) | 46.5 (#8) |
| Gemini 3 Pro Preview | GitHub Copilot | N/A | N/A | 128K | 64K | 48.4 (#8) | 46.5 (#8) |
| Gemini 3 Pro Preview | Google Gemini | 2.00 | 12.00 | 1.0M | 66K | 48.4 (#8) | 46.5 (#8) |
| GPT-5.4 mini | Azure OpenAI | 0.750 | 4.50 | 1.1M | 128K | 48.1 (#9) | 51.5 (#4) |
| GPT-5.1-chat | OpenAI | 1.25 | 10.00 | 128K | 16K | 47.7 (#10) | 44.7 (#10) |
| GPT-5.1 | GMI Cloud | 1.25 | 10.00 | 410K | 32K | 47.7 (#10) | 44.7 (#10) |
| GPT-5.1 | GitHub Copilot | N/A | N/A | 128K | 64K | 47.7 (#10) | 44.7 (#10) |
| Databricks GPT 5 1 | Databricks | 1.25 | 10.00 | 272K | 128K | 47.7 (#10) | 44.7 (#10) |
| GPT-5.1 | Azure OpenAI | 1.38 | 11.00 | 272K | 128K | 47.7 (#10) | 44.7 (#10) |
| Kimi K2.5 | Together AI | 0.500 | 2.80 | 256K | 256K | 46.8 (#11) | 39.5 (#18) |
| Kimi K2.5 | OpenRouter | 0.600 | 3.00 | 262K | 262K | 46.8 (#11) | 39.5 (#18) |
| Moonshotai.kimi K2.5 | AWS Bedrock | 0.600 | 3.00 | 262K | 262K | 46.8 (#11) | 39.5 (#18) |
| Kimi K2.5 | Moonshot AI (Kimi) | 0.600 | 3.00 | 262K | 262K | 46.8 (#11) | 39.5 (#18) |
| Kimi K2p5 | Fireworks AI | 0.600 | 3.00 | 262K | 262K | 46.8 (#11) | 39.5 (#18) |
| Claude Opus 4.6 | Anthropic (Vertex AI) | 5.00 | 25.00 | 1.0M | 128K | 46.5 (#12) | 47.6 (#6) |
| Claude Opus 4.6 | Vercel AI Gateway | 5.00 | 25.00 | 200K | 64K | 46.5 (#12) | 47.6 (#6) |
| Claude Opus 4.6 | OpenRouter | 5.00 | 25.00 | 1.0M | 128K | 46.5 (#12) | 47.6 (#6) |
| Claude Opus 4.6 | GitHub Copilot | N/A | N/A | 128K | 16K | 46.5 (#12) | 47.6 (#6) |
| Claude Opus 4.6 | Anthropic | 5.00 | 25.00 | 1.0M | 128K | 46.5 (#12) | 47.6 (#6) |
| Claude Opus 4.6 | Azure AI | 5.00 | 25.00 | 200K | 128K | 46.5 (#12) | 47.6 (#6) |
| Claude Opus 4.6 | AWS Bedrock | 5.00 | 25.00 | 1.0M | 128K | 46.5 (#12) | 47.6 (#6) |
| Qwen3.5 397B A17B | Together AI | 0.600 | 3.60 | 262K | N/A | 45.0 (#13) | 41.3 (#16) |
| Qwen3.5 397B A17b | OpenRouter | 0.600 | 3.60 | 262K | 66K | 45.0 (#13) | 41.3 (#16) |
| GPT-5 | Replicate | 1.25 | 10.00 | N/A | N/A | 44.6 (#14) | 36.0 (#29) |
| GPT-5-codex | OpenRouter | 1.25 | 10.00 | 272K | 128K | 44.6 (#14) | 38.9 (#20) |
| GPT-5 | OpenRouter | 1.25 | 10.00 | 272K | 128K | 44.6 (#14) | 36.0 (#29) |
| GPT-5-chat | OpenAI | 1.25 | 10.00 | 128K | 16K | 44.6 (#14) | 36.0 (#29) |
| GPT-5 | GMI Cloud | 1.25 | 10.00 | 410K | 32K | 44.6 (#14) | 36.0 (#29) |
| GPT-5 | GitHub Copilot | N/A | N/A | 128K | 128K | 44.6 (#14) | 36.0 (#29) |
| Databricks GPT 5 | Databricks | 1.25 | 10.00 | 272K | 128K | 44.6 (#14) | 36.0 (#29) |
| GPT-5-chat | Azure OpenAI | 1.25 | 10.00 | 128K | 16K | 44.6 (#14) | 36.0 (#29) |
| Claude Sonnet 4.6 | Anthropic (Vertex AI) | 3.00 | 15.00 | 200K | 64K | 44.4 (#16) | 46.4 (#9) |
| Claude Sonnet 4.6 | OpenRouter | 3.00 | 15.00 | 1.0M | 128K | 44.4 (#16) | 46.4 (#9) |
| Claude Sonnet 4.6 | Anthropic | 3.00 | 15.00 | 200K | 64K | 44.4 (#16) | 46.4 (#9) |
| GPT-5.4 nano | Azure OpenAI | 0.200 | 1.25 | 1.1M | 128K | 44.4 (#16) | 43.9 (#12) |
| Claude Sonnet 4.6 | Azure AI | 3.00 | 15.00 | 200K | 64K | 44.4 (#16) | 46.4 (#9) |
| Claude Sonnet 4.6 | AWS Bedrock | 3.00 | 15.00 | 200K | 64K | 44.4 (#16) | 46.4 (#9) |
| Claude Opus 4.5 | Anthropic (Vertex AI) | 5.00 | 25.00 | 200K | 64K | 43.1 (#18) | 42.9 (#14) |
| Claude Opus 4.5 | Vercel AI Gateway | 5.00 | 25.00 | 200K | 64K | 43.1 (#18) | 42.9 (#14) |
| Claude Opus 4.5 | OpenRouter | 5.00 | 25.00 | 200K | 32K | 43.1 (#18) | 42.9 (#14) |
| Claude Opus 4.5 | GMI Cloud | 5.00 | 25.00 | 410K | 32K | 43.1 (#18) | 42.9 (#14) |
| Claude Opus 4.5 | GitHub Copilot | N/A | N/A | 128K | 16K | 43.1 (#18) | 42.9 (#14) |
| Databricks Claude Opus 4 5 | Databricks | 5.00 | 25.00 | 200K | 64K | 43.1 (#18) | 42.9 (#14) |
| Claude Opus 4.5 | Anthropic | 5.00 | 25.00 | 200K | 64K | 43.1 (#18) | 42.9 (#14) |
| Claude Opus 4.5 | Azure AI | 5.00 | 25.00 | 200K | 64K | 43.1 (#18) | 42.9 (#14) |
| Claude Opus 4.5 | AWS Bedrock | 5.00 | 25.00 | 200K | 64K | 43.1 (#18) | 42.9 (#14) |
| Glm 4.7 | Z AI (Zhipu) | 0.600 | 2.20 | 200K | 128K | 42.1 (#20) | 36.3 (#28) |
| Zai.glm 4.7 | AWS Bedrock | 0.600 | 2.20 | 200K | 128K | 42.1 (#20) | 36.3 (#28) |
| Glm 4.7 Maas | Vertex AI (Z AI) | 0.600 | 2.20 | 200K | 128K | 42.1 (#20) | 36.3 (#28) |
| GLM 4.7 | Together AI | 0.450 | 2.00 | 200K | 200K | 42.1 (#20) | 36.3 (#28) |
| Glm 4.7 | OpenRouter | 0.400 | 1.50 | 203K | 64K | 42.1 (#20) | 36.3 (#28) |
| Qwen3.5 27B | OpenRouter | 0.300 | 2.40 | 262K | 66K | 42.1 (#20) | 34.9 (#31) |
| Glm 4.7 | Novita AI | 0.600 | 2.20 | 205K | 131K | 42.1 (#20) | 36.3 (#28) |
| GLM 4.7 FP8 | GMI Cloud | 0.400 | 2.00 | 203K | 16K | 42.1 (#20) | 36.3 (#28) |
| Glm 4 7 251222 | Volcengine (ByteDance) | N/A | N/A | 205K | 131K | 42.1 (#20) | 36.3 (#28) |
| Glm 4p7 | Fireworks AI | 0.600 | 2.20 | 203K | 203K | 42.1 (#20) | 36.3 (#28) |
| Minimax M2.5 | OpenRouter | 0.300 | 1.10 | 197K | 66K | 41.9 (#22) | 37.4 (#24) |
| MiniMax M2.5 | MiniMax | 0.300 | 1.20 | 1.0M | 8K | 41.9 (#22) | 37.4 (#24) |
| DeepSeek Reasoner | DeepSeek | 0.280 | 0.420 | 131K | 66K | 41.7 (#23) | 36.7 (#25) |
| Qwen3.5 122B A10b | OpenRouter | 0.400 | 2.00 | 262K | 66K | 41.6 (#24) | 34.7 (#32) |
| Grok 4 | xAI | 3.00 | 15.00 | 256K | 256K | 41.5 (#25) | 40.5 (#17) |
| Grok 4 | Vercel AI Gateway | 3.00 | 15.00 | 256K | 256K | 41.5 (#25) | 40.5 (#17) |
| Grok 4 | Replicate | 7.20 | 36.00 | N/A | N/A | 41.5 (#25) | 40.5 (#17) |
| Grok 4 | OpenRouter | 3.00 | 15.00 | 256K | 256K | 41.5 (#25) | 40.5 (#17) |
| Xai.grok 4 | Oracle Cloud (OCI) | 3.00 | 15.00 | 128K | 128K | 41.5 (#25) | 40.5 (#17) |
| Grok 4 | Azure AI | 3.00 | 15.00 | 131K | 131K | 41.5 (#25) | 40.5 (#17) |
| GPT-5 mini | Replicate | 0.250 | 2.00 | N/A | N/A | 41.2 (#26) | 35.3 (#30) |
| GPT-5 mini | OpenRouter | 0.250 | 2.00 | 272K | 128K | 41.2 (#26) | 35.3 (#30) |
| GPT-5 mini | OpenAI | 0.250 | 2.00 | 272K | 128K | 41.2 (#26) | 35.3 (#30) |
| GPT-5 mini | GitHub Copilot | N/A | N/A | 128K | 64K | 41.2 (#26) | 35.3 (#30) |
| Databricks GPT 5 Mini | Databricks | 0.250 | 2.00 | 272K | 128K | 41.2 (#26) | 35.3 (#30) |
| GPT-5 mini | Azure OpenAI | 0.250 | 2.00 | 272K | 128K | 41.2 (#26) | 35.3 (#30) |
| Glm 5 Code | Z AI (Zhipu) | 1.20 | 5.00 | 200K | 128K | 40.6 (#28) | 39.0 (#19) |
| Qwen3 235B A22B Thinking 2507 | Weights & Biases | 0.010 | 0.010 | 262K | 262K | 39.9 (#29) | 30.5 (#38) |
| Qwen3 235B A22B Thinking 2507 | Together AI | 0.650 | 3.00 | 256K | N/A | 39.9 (#29) | 30.5 (#38) |
| Qwen3 235B A22b Thinking 2507 | OpenRouter | 0.110 | 0.600 | 262K | 262K | 39.9 (#29) | 30.5 (#38) |
| Qwen3 235B A22B Thinking 2507 | DeepInfra | 0.300 | 2.90 | 262K | 262K | 39.9 (#29) | 30.5 (#38) |
| o3 | Vercel AI Gateway | 2.00 | 8.00 | 200K | 100K | 38.4 (#31) | 38.4 (#21) |
| o3 | OpenAI | 2.00 | 8.00 | 200K | 100K | 38.4 (#31) | 38.4 (#21) |
| Openai O3 | Gradient AI | 2.00 | 8.00 | 100K | N/A | 38.4 (#31) | 38.4 (#21) |
| o3 | Azure OpenAI | 2.00 | 8.00 | 200K | 100K | 38.4 (#31) | 38.4 (#21) |
| Claude Sonnet 4.5 | Anthropic (Vertex AI) | 3.00 | 15.00 | 200K | 64K | 37.1 (#32) | 33.5 (#35) |
| Claude Sonnet 4.5 | Vercel AI Gateway | 3.00 | 15.00 | 1.0M | 64K | 37.1 (#32) | 33.5 (#35) |
| Claude 4.5 Sonnet | Replicate | 3.00 | 15.00 | N/A | N/A | 37.1 (#32) | 33.5 (#35) |
| Qwen3.5 35B A3b | OpenRouter | 0.250 | 2.00 | 262K | 66K | 37.1 (#32) | 30.3 (#39) |
| Claude Sonnet 4.5 | OpenRouter | 3.00 | 15.00 | 1.0M | 1.0M | 37.1 (#32) | 33.5 (#35) |
| Claude Sonnet 4.5 | GMI Cloud | 3.00 | 15.00 | 410K | 32K | 37.1 (#32) | 33.5 (#35) |
| Claude Sonnet 4.5 | GitHub Copilot | N/A | N/A | 128K | 16K | 37.1 (#32) | 33.5 (#35) |
| Databricks Claude Sonnet 4 5 | Databricks | 3.00 | 15.00 | 200K | 64K | 37.1 (#32) | 33.5 (#35) |
| Claude Sonnet 4.5 | AWS Bedrock | 3.00 | 15.00 | 200K | 64K | 37.1 (#32) | 33.5 (#35) |
| Claude Sonnet 4.5 | Anthropic | 3.00 | 15.00 | 200K | 64K | 37.1 (#32) | 33.5 (#35) |
| Claude Sonnet 4.5 | Azure AI | 3.00 | 15.00 | 200K | 64K | 37.1 (#32) | 33.5 (#35) |
| Minimax M2 Maas | Vertex AI (MiniMax) | 0.300 | 1.20 | 197K | 197K | 36.1 (#34) | 29.2 (#44) |
| Minimax M2 | OpenRouter | 0.255 | 1.02 | 205K | 205K | 36.1 (#34) | 29.2 (#44) |
| Minimax M2 | Novita AI | 0.300 | 1.20 | 205K | 131K | 36.1 (#34) | 29.2 (#44) |
| MiniMax M2 | MiniMax | 0.300 | 1.20 | 200K | 8K | 36.1 (#34) | 29.2 (#44) |
| Minimax.minimax M2 | AWS Bedrock | 0.300 | 1.20 | 128K | 8K | 36.1 (#34) | 29.2 (#44) |
| Minimax M2 | Fireworks AI | 0.300 | 1.20 | 4K | 4K | 36.1 (#34) | 29.2 (#44) |
| Claude Opus 4.1 | Anthropic (Vertex AI) | 15.00 | 75.00 | 200K | 32K | 36.0 (#35) | N/A |
| Claude Opus 4.1 | Vercel AI Gateway | 15.00 | 75.00 | 200K | 32K | 36.0 (#35) | N/A |
| Claude Opus 4.1 | OpenRouter | 15.00 | 75.00 | 200K | 32K | 36.0 (#35) | N/A |
| Kat Coder Pro | Novita AI | 0.300 | 1.20 | 256K | 128K | 36.0 (#35) | 18.3 (#82) |
| Claude Opus 41 | GitHub Copilot | N/A | N/A | 80K | 16K | 36.0 (#35) | N/A |
| Databricks Claude Opus 4 1 | Databricks | 15.00 | 75.00 | 200K | 32K | 36.0 (#35) | N/A |
| Claude Opus 4.1 | Anthropic | 15.00 | 75.00 | 200K | 32K | 36.0 (#35) | N/A |
| Claude Opus 4.1 | Azure AI | 15.00 | 75.00 | 200K | 32K | 36.0 (#35) | N/A |
| Claude Opus 4.1 | AWS Bedrock | 15.00 | 75.00 | 200K | 32K | 36.0 (#35) | N/A |
| Gemini 3 Flash Preview | Google Vertex AI | 0.500 | 3.00 | 1.0M | 66K | 35.0 (#37) | 37.8 (#23) |
| Gemini 3 Flash Preview | OpenRouter | 0.500 | 3.00 | 1.0M | 66K | 35.0 (#37) | 37.8 (#23) |
| Gemini 3 Flash Preview | GMI Cloud | 0.500 | 3.00 | 1.0M | 66K | 35.0 (#37) | 37.8 (#23) |
| Gemini 3 Flash Preview | Google Gemini | 0.500 | 3.00 | 1.0M | 66K | 35.0 (#37) | 37.8 (#23) |
| Gemini 3.1 Flash-Lite Preview | Google Gemini | 0.250 | 1.50 | 1.0M | 66K | 33.5 (#38) | 30.1 (#42) |
| Gemini 3.1 Flash-Lite Preview | Google Vertex AI | 0.250 | 1.50 | 1.0M | 66K | 33.5 (#38) | 30.1 (#42) |
| GPT-oss-120b | IBM watsonx | 0.150 | 0.600 | 8K | 8K | 33.3 (#39) | 28.6 (#45) |
| GPT-oss-120b | Weights & Biases | 0.015 | 0.060 | 131K | 131K | 33.3 (#39) | 28.6 (#45) |
| GPT-oss-120b-maas | Vertex AI (OpenAI) | 0.150 | 0.600 | 131K | 33K | 33.3 (#39) | 28.6 (#45) |
| GPT-oss-120b | Together AI | 0.150 | 0.600 | 128K | N/A | 33.3 (#39) | 28.6 (#45) |
| GPT-oss-120b | SambaNova | 3.00 | 4.50 | 131K | 131K | 33.3 (#39) | 28.6 (#45) |
| GPT-oss-120b | Replicate | 0.180 | 0.720 | N/A | N/A | 33.3 (#39) | 28.6 (#45) |
| GPT-oss-120b | OVHcloud | 0.080 | 0.400 | 131K | 131K | 33.3 (#39) | 28.6 (#45) |
| GPT-oss-120b | OpenRouter | 0.180 | 0.800 | 131K | 33K | 33.3 (#39) | 28.6 (#45) |
| GPT-oss:120b-cloud | Ollama | N/A | N/A | 131K | 131K | 33.3 (#39) | 28.6 (#45) |
| GPT-oss-120b | Novita AI | 0.050 | 0.250 | 131K | 33K | 33.3 (#39) | 28.6 (#45) |
| GPT-oss-120b-mxfp-GGUF | Lemonade (AMD) | N/A | N/A | 131K | 33K | 33.3 (#39) | 28.6 (#45) |
| GPT-oss-120b | Groq | 0.150 | 0.600 | 131K | 33K | 33.3 (#39) | 28.6 (#45) |
| GPT-oss-120b | Fireworks AI | 0.150 | 0.600 | 131K | 131K | 33.3 (#39) | 28.6 (#45) |
| GPT-oss-120b | DeepInfra | 0.050 | 0.450 | 131K | 131K | 33.3 (#39) | 28.6 (#45) |
| Databricks GPT OSS 120B | Databricks | 0.150 | 0.600 | 131K | 131K | 33.3 (#39) | 28.6 (#45) |
| GPT-oss-120b | Cerebras | 0.350 | 0.750 | 131K | 33K | 33.3 (#39) | 28.6 (#45) |
| GPT-oss-120b | AWS Bedrock | 0.150 | 0.600 | 131K | 33K | 33.3 (#39) | 28.6 (#45) |
| GPT-oss-120b | Azure AI | 0.150 | 0.600 | 131K | 131K | 33.3 (#39) | 28.6 (#45) |
| o4 mini | Vercel AI Gateway | 1.10 | 4.40 | 200K | 100K | 33.1 (#40) | 25.6 (#53) |
| o4 mini | Replicate | 1.00 | 4.00 | N/A | N/A | 33.1 (#40) | 25.6 (#53) |
| o4 mini | OpenAI | 1.10 | 4.40 | 200K | 100K | 33.1 (#40) | 25.6 (#53) |
| o4 mini | Azure OpenAI | 1.10 | 4.40 | 200K | 100K | 33.1 (#40) | 25.6 (#53) |
| Claude Sonnet 4 | Anthropic (Vertex AI) | 3.00 | 15.00 | 1.0M | 64K | 33.0 (#41) | 30.6 (#37) |
| Claude Opus 4 | Anthropic (Vertex AI) | 15.00 | 75.00 | 200K | 32K | 33.0 (#41) | N/A |
| Claude 4 Sonnet | Vercel AI Gateway | 3.00 | 15.00 | 200K | 64K | 33.0 (#41) | 30.6 (#37) |
| Claude 4 Opus | Vercel AI Gateway | 15.00 | 75.00 | 200K | 32K | 33.0 (#41) | N/A |
| Claude 4 Sonnet | Replicate | 3.00 | 15.00 | N/A | N/A | 33.0 (#41) | 30.6 (#37) |
| Claude Sonnet 4 | OpenRouter | 3.00 | 15.00 | 1.0M | 64K | 33.0 (#41) | 30.6 (#37) |
| Claude Opus 4 | OpenRouter | 15.00 | 75.00 | 200K | 32K | 33.0 (#41) | N/A |
| Claude 4 Sonnet | Heroku (Salesforce) | N/A | N/A | 8K | N/A | 33.0 (#41) | 30.6 (#37) |
| Claude Sonnet 4 | GMI Cloud | 3.00 | 15.00 | 410K | 32K | 33.0 (#41) | 30.6 (#37) |
| Claude Opus 4 | GMI Cloud | 15.00 | 75.00 | 410K | 32K | 33.0 (#41) | N/A |
| Claude Sonnet 4 | GitHub Copilot | N/A | N/A | 128K | 16K | 33.0 (#41) | 30.6 (#37) |
| Claude 4 Sonnet | DeepInfra | 3.30 | 16.50 | 200K | 200K | 33.0 (#41) | 30.6 (#37) |
| Claude 4 Opus | DeepInfra | 16.50 | 82.50 | 200K | 200K | 33.0 (#41) | N/A |
| Databricks Claude Sonnet 4 | Databricks | 3.00 | 15.00 | 200K | 64K | 33.0 (#41) | 30.6 (#37) |
| Databricks Claude Opus 4 | Databricks | 15.00 | 75.00 | 200K | 32K | 33.0 (#41) | N/A |
| Claude 4 Sonnet | Anthropic | 3.00 | 15.00 | 1.0M | 64K | 33.0 (#41) | 30.6 (#37) |
| Claude 4 Opus | Anthropic | 15.00 | 75.00 | 200K | 32K | 33.0 (#41) | N/A |
| Claude Sonnet 4.20250514 | AWS Bedrock | 3.00 | 15.00 | 1.0M | 64K | 33.0 (#41) | 30.6 (#37) |
| Claude Opus 4.20250514 | AWS Bedrock | 15.00 | 75.00 | 200K | 32K | 33.0 (#41) | N/A |
| Grok 3 Mini | xAI | 0.300 | 0.500 | 131K | 131K | 32.1 (#43) | 25.2 (#55) |
| DeepSeek V3.2 Maas | Vertex AI (DeepSeek) | 0.560 | 1.68 | 164K | 33K | 32.1 (#43) | 34.6 (#33) |
| DeepSeek V3.2 | OpenRouter | 0.280 | 0.400 | 164K | 164K | 32.1 (#43) | 34.6 (#33) |
| DeepSeek V3.2 | Novita AI | 0.269 | 0.400 | 164K | 66K | 32.1 (#43) | 34.6 (#33) |
| DeepSeek V3.2 | GMI Cloud | 0.280 | 0.400 | 164K | 16K | 32.1 (#43) | 34.6 (#33) |
| DeepSeek V3p2 | Fireworks AI | 0.560 | 1.68 | 164K | 164K | 32.1 (#43) | 34.6 (#33) |
| DeepSeek V3.2 | DeepSeek | 0.280 | 0.400 | 164K | 164K | 32.1 (#43) | 34.6 (#33) |
| DeepSeek V3 2 251201 | Volcengine (ByteDance) | N/A | N/A | 98K | 33K | 32.1 (#43) | 34.6 (#33) |
| V3.2 | AWS Bedrock | 0.740 | 2.22 | 164K | 164K | 32.1 (#43) | 34.6 (#33) |
| DeepSeek V3.2 | Azure AI | 0.580 | 1.68 | 164K | 164K | 32.1 (#43) | 34.6 (#33) |
| Qwen3 Max | Novita AI | 2.11 | 8.45 | 262K | 66K | 31.4 (#45) | 26.4 (#48) |
| Qwen3 Max | DashScope (Alibaba) | N/A | N/A | 258K | 66K | 31.4 (#45) | 26.4 (#48) |
| Claude Haiku 4.5 | Anthropic (Vertex AI) | 1.00 | 5.00 | 200K | 8K | 31.1 (#46) | 29.6 (#43) |
| Claude Haiku 4.5 | Vercel AI Gateway | 1.00 | 5.00 | 200K | 64K | 31.1 (#46) | 29.6 (#43) |
| Claude 4.5 Haiku | Replicate | 1.00 | 5.00 | N/A | N/A | 31.1 (#46) | 29.6 (#43) |
| Claude Haiku 4.5 | OpenRouter | 1.00 | 5.00 | 200K | 200K | 31.1 (#46) | 29.6 (#43) |
| Claude Haiku 4.5 | GitHub Copilot | N/A | N/A | 128K | 16K | 31.1 (#46) | 29.6 (#43) |
| Databricks Claude Haiku 4 5 | Databricks | 1.00 | 5.00 | 200K | 64K | 31.1 (#46) | 29.6 (#43) |
| Claude Haiku 4.5 | Anthropic | 1.00 | 5.00 | 200K | 64K | 31.1 (#46) | 29.6 (#43) |
| Claude Haiku 4.5 | Azure AI | 1.00 | 5.00 | 200K | 64K | 31.1 (#46) | 29.6 (#43) |
| Claude Haiku 4.5 | AWS Bedrock | 1.00 | 5.00 | 200K | 64K | 31.1 (#46) | 29.6 (#43) |
| Kimi K2 Instruct 0905 | Together AI | 1.00 | 3.00 | 262K | N/A | 30.9 (#47) | 25.9 (#50) |
| Kimi K2 0905 | Novita AI | 0.600 | 2.50 | 262K | 262K | 30.9 (#47) | 25.9 (#50) |
| Kimi K2 0905 Preview | Moonshot AI (Kimi) | 0.600 | 2.50 | 262K | 262K | 30.9 (#47) | 25.9 (#50) |
| Kimi K2 Instruct 0905 | Groq | 1.00 | 3.00 | 262K | 16K | 30.9 (#47) | 25.9 (#50) |
| Kimi K2 Instruct 0905 | Fireworks AI | 0.600 | 2.50 | 262K | 33K | 30.9 (#47) | 25.9 (#50) |
| Kimi K2 Instruct 0905 | DeepInfra | 0.500 | 2.00 | 262K | 262K | 30.9 (#47) | 25.9 (#50) |
| Claude 3 7 Sonnet | Anthropic (Vertex AI) | 3.00 | 15.00 | 200K | 8K | 30.8 (#48) | 26.7 (#47) |
| o1 | Vercel AI Gateway | 15.00 | 60.00 | 200K | 100K | 30.8 (#48) | 20.5 (#71) |
| Claude 3 7 Sonnet | Vercel AI Gateway | 3.00 | 15.00 | 200K | 64K | 30.8 (#48) | 26.7 (#47) |
| o1 | Replicate | 15.00 | 60.00 | N/A | N/A | 30.8 (#48) | 20.5 (#71) |
| Claude 3.7 Sonnet | Replicate | 3.00 | 15.00 | N/A | N/A | 30.8 (#48) | 26.7 (#47) |
| o1 | OpenRouter | 15.00 | 60.00 | 200K | 100K | 30.8 (#48) | 20.5 (#71) |
| Claude 3.7 Sonnet | OpenRouter | 3.00 | 15.00 | 200K | 128K | 30.8 (#48) | 26.7 (#47) |
| o1 | OpenAI | 15.00 | 60.00 | 200K | 100K | 30.8 (#48) | 20.5 (#71) |
| Claude 3 7 Sonnet | Heroku (Salesforce) | N/A | N/A | 8K | N/A | 30.8 (#48) | 26.7 (#47) |
| Anthropic Claude 3.7 Sonnet | Gradient AI | 3.00 | 15.00 | 1K | N/A | 30.8 (#48) | 26.7 (#47) |
| Eu.anthropic.claude 3 7 Sonnet 20250219 V1 | AWS Bedrock | 3.00 | 15.00 | 200K | 8K | 30.8 (#48) | 26.7 (#47) |
| Claude 3 7 Sonnet | DeepInfra | 3.30 | 16.50 | 200K | 200K | 30.8 (#48) | 26.7 (#47) |
| Databricks Claude 3 7 Sonnet | Databricks | 3.00 | 15.00 | 200K | 128K | 30.8 (#48) | 26.7 (#47) |
| Claude 3 7 Sonnet | Anthropic | N/A | N/A | 200K | 64K | 30.8 (#48) | 26.7 (#47) |
| o1 | Azure OpenAI | 15.00 | 60.00 | 200K | 100K | 30.8 (#48) | 20.5 (#71) |
| Mimo V2 Flash | OpenRouter | 0.090 | 0.290 | 262K | 16K | 30.4 (#50) | 25.8 (#52) |
| Mimo V2 Flash | Novita AI | 0.100 | 0.300 | 262K | 32K | 30.4 (#50) | 25.8 (#52) |
| Gemini 2.5 Pro | Vercel AI Gateway | 2.50 | 10.00 | 1.0M | 66K | 30.3 (#51) | 46.7 (#7) |
| Gemini 2.5 Pro | OpenRouter | 1.25 | 10.00 | 1.0M | 8K | 30.3 (#51) | 46.7 (#7) |
| Gemini 2.5 Pro | GitHub Copilot | N/A | N/A | 128K | 64K | 30.3 (#51) | 46.7 (#7) |
| Gemini pro | Google Gemini | 1.25 | 10.00 | 1.0M | 66K | 30.3 (#51) | 46.7 (#7) |
| Gemini 2.5 Pro | Google Vertex AI | 1.25 | 10.00 | 1.0M | 66K | 30.3 (#51) | 46.7 (#7) |
| Gemini 2.5 Pro | DeepInfra | 1.25 | 10.00 | 1.0M | 1.0M | 30.3 (#51) | 46.7 (#7) |
| Databricks Gemini 2 5 Pro | Databricks | 1.25 | 10.00 | 1.0M | 66K | 30.3 (#51) | 46.7 (#7) |
| Glm 4.6 | Z AI (Zhipu) | 0.600 | 2.20 | 200K | 128K | 30.2 (#52) | 30.2 (#40) |
| Glm 4.6 | Vercel AI Gateway | 0.450 | 1.80 | 200K | 200K | 30.2 (#52) | 30.2 (#40) |
| GLM 4.6 | Together AI | 0.600 | 2.20 | 200K | 200K | 30.2 (#52) | 30.2 (#40) |
| Glm 4.6 | OpenRouter | 0.400 | 1.75 | 203K | 131K | 30.2 (#52) | 30.2 (#40) |
| Glm 4.6 | Novita AI | 0.550 | 2.20 | 205K | 131K | 30.2 (#52) | 30.2 (#40) |
| Glm 4p6 | Fireworks AI | 0.550 | 2.19 | 203K | 203K | 30.2 (#52) | 30.2 (#40) |
| Zai.glm 4.7 Flash | AWS Bedrock | 0.070 | 0.400 | 200K | 128K | 30.1 (#53) | 25.9 (#50) |
| Glm 4.7 Flash | OpenRouter | 0.070 | 0.400 | 200K | 32K | 30.1 (#53) | 25.9 (#50) |
| Qwen3 235B A22b Thinking 2507 | Novita AI | 0.300 | 3.00 | 131K | 33K | 29.5 (#54) | 23.2 (#62) |
| Qwen3 235B A22b Thinking 2507 | Fireworks AI | 0.220 | 0.880 | 262K | 262K | 29.5 (#54) | 23.2 (#62) |
| DeepSeek V3.2 Speciale | Azure AI | 0.580 | 1.68 | 164K | 164K | 29.4 (#55) | 37.9 (#22) |
| Grok Code Fast 1 | xAI | 0.200 | 1.50 | 256K | 256K | 28.7 (#56) | 23.7 (#59) |
| Grok Code Fast 1 | Azure AI | 0.200 | 1.50 | 131K | 131K | 28.7 (#56) | 23.7 (#59) |
| DeepSeek V3.1 Terminus | Novita AI | 0.270 | 1.00 | 131K | 33K | 28.5 (#57) | 31.9 (#36) |
| DeepSeek V3p1 Terminus | Fireworks AI | 0.560 | 1.68 | 128K | 8K | 28.5 (#57) | 31.9 (#36) |
| DeepSeek V3.1 Terminus | DeepInfra | 0.270 | 1.00 | 164K | 164K | 28.5 (#57) | 31.9 (#36) |
| Qwen3 Coder Next | AWS Bedrock | 0.600 | 1.44 | 262K | 8K | 28.3 (#58) | 22.9 (#63) |
| DeepSeek V3.1 | Weights & Biases | 0.055 | 0.165 | 128K | 128K | 28.1 (#59) | 28.4 (#46) |
| DeepSeek V3.1 Maas | Vertex AI (DeepSeek) | 1.35 | 5.40 | 164K | 33K | 28.1 (#59) | 28.4 (#46) |
| DeepSeek V3.1 | Together AI | 0.600 | 1.70 | 128K | N/A | 28.1 (#59) | 28.4 (#46) |
| DeepSeek V3.1 | SambaNova | 3.00 | 4.50 | 33K | 33K | 28.1 (#59) | 28.4 (#46) |
| DeepSeek V3.1 | Replicate | 0.672 | 2.02 | 164K | 164K | 28.1 (#59) | 28.4 (#46) |
| DeepSeek V3.1:671B Cloud | Ollama | N/A | N/A | 164K | 164K | 28.1 (#59) | 28.4 (#46) |
| DeepSeek V3.1 | Novita AI | 0.270 | 1.00 | 131K | 33K | 28.1 (#59) | 28.4 (#46) |
| DeepSeek V3p1 | Fireworks AI | 0.560 | 1.68 | 128K | 8K | 28.1 (#59) | 28.4 (#46) |
| DeepSeek V3.1 | DeepInfra | 0.270 | 1.00 | 164K | 164K | 28.1 (#59) | 28.4 (#46) |
| DeepSeek R1 | Vercel AI Gateway | 0.550 | 2.19 | 128K | 8K | 27.1 (#60) | 24.0 (#57) |
| Us.deepseek.r1 V1 | AWS Bedrock | 1.35 | 5.40 | 128K | 4K | 27.1 (#60) | 24.0 (#57) |
| DeepSeek R1 | Together AI | 3.00 | 7.00 | 128K | 20K | 27.1 (#60) | 24.0 (#57) |
| DeepSeek R1 | Snowflake | N/A | N/A | 33K | 8K | 27.1 (#60) | 24.0 (#57) |
| DeepSeek R1 | SambaNova | 5.00 | 7.00 | 33K | 33K | 27.1 (#60) | 24.0 (#57) |
| DeepSeek R1 | Replicate | 3.75 | 10.00 | 66K | 8K | 27.1 (#60) | 24.0 (#57) |
| DeepSeek R1 | OpenRouter | 0.550 | 2.19 | 65K | 8K | 27.1 (#60) | 24.0 (#57) |
| DeepSeek R1 | Nebius | 0.800 | 2.40 | 128K | 128K | 27.1 (#60) | 24.0 (#57) |
| Magistral Medium 1 2 2509 | Mistral AI | 2.00 | 5.00 | 40K | 40K | 27.1 (#60) | 21.7 (#69) |
| DeepSeek R1 | Hyperbolic | 0.400 | 0.400 | 33K | 33K | 27.1 (#60) | 24.0 (#57) |
| DeepSeek R1 | Fireworks AI | 3.00 | 8.00 | 128K | 20K | 27.1 (#60) | 24.0 (#57) |
| DeepSeek R1 | DeepSeek | 0.550 | 2.19 | 66K | 8K | 27.1 (#60) | 24.0 (#57) |
| DeepSeek R1 | DeepInfra | 0.700 | 2.40 | 164K | 164K | 27.1 (#60) | 24.0 (#57) |
| DeepSeek R1 | Azure AI | 1.35 | 5.40 | 128K | 8K | 27.1 (#60) | 24.0 (#57) |
| GPT-5 nano | Replicate | 0.050 | 0.400 | N/A | N/A | 26.8 (#62) | 20.3 (#73) |
| GPT-5 nano | OpenRouter | 0.050 | 0.400 | 272K | 128K | 26.8 (#62) | 20.3 (#73) |
| GPT-5 nano | OpenAI | 0.050 | 0.400 | 272K | 128K | 26.8 (#62) | 20.3 (#73) |
| Databricks GPT 5 Nano | Databricks | 0.050 | 0.400 | 272K | 128K | 26.8 (#62) | 20.3 (#73) |
| GPT-4.1 nano | Azure OpenAI | 0.100 | 0.400 | 1.0M | 33K | 26.8 (#62) | 20.3 (#73) |
| Qwen3 Next 80B A3b Thinking Maas | Vertex AI (Qwen) | 0.150 | 1.20 | 262K | 262K | 26.7 (#63) | 19.5 (#75) |
| Qwen3 Next 80B A3B Thinking | Together AI | 0.150 | 1.50 | 262K | N/A | 26.7 (#63) | 19.5 (#75) |
| Qwen3 Next 80B A3b Thinking | Novita AI | 0.150 | 1.50 | 131K | 33K | 26.7 (#63) | 19.5 (#75) |
| Qwen3 Next 80B A3b Thinking | Fireworks AI | 0.900 | 0.900 | 4K | 4K | 26.7 (#63) | 19.5 (#75) |
| Qwen3 Next 80B A3B Thinking | DeepInfra | 0.140 | 1.40 | 262K | 262K | 26.7 (#63) | 19.5 (#75) |
| Qwen3 Next 80B A3b Thinking | DashScope (Alibaba) | 0.150 | 1.20 | 262K | 66K | 26.7 (#63) | 19.5 (#75) |
| Glm 4.5 | Z AI (Zhipu) | 0.600 | 2.20 | 128K | 32K | 26.4 (#64) | 26.3 (#49) |
| GLM 4.5 | Weights & Biases | 0.055 | 0.200 | 131K | 131K | 26.4 (#64) | 26.3 (#49) |
| Glm 4.5 | Vercel AI Gateway | 0.600 | 2.20 | 131K | 131K | 26.4 (#64) | 26.3 (#49) |
| Glm 4.5 | Novita AI | 0.600 | 2.20 | 131K | 98K | 26.4 (#64) | 26.3 (#49) |
| Glm 4p5 | Fireworks AI | 0.550 | 2.19 | 128K | 96K | 26.4 (#64) | 26.3 (#49) |
| GLM 4.5 | DeepInfra | 0.400 | 1.60 | 131K | 131K | 26.4 (#64) | 26.3 (#49) |
| Kimi K2 Instruct | Weights & Biases | 0.600 | 2.50 | 128K | 128K | 26.3 (#65) | 22.1 (#65) |
| GPT-4.1 | Vercel AI Gateway | 2.00 | 8.00 | 1.0M | 33K | 26.3 (#65) | 21.8 (#68) |
| Kimi K2 | Vercel AI Gateway | 0.550 | 2.20 | 131K | 16K | 26.3 (#65) | 22.1 (#65) |
| Kimi K2 Instruct | Together AI | 1.00 | 3.00 | N/A | N/A | 26.3 (#65) | 22.1 (#65) |
| GPT-4.1 | Replicate | 2.00 | 8.00 | N/A | N/A | 26.3 (#65) | 21.8 (#68) |
| GPT-4.1 | OpenRouter | 2.00 | 8.00 | 1.0M | 33K | 26.3 (#65) | 21.8 (#68) |
| Kimi K2 Instruct | Novita AI | 0.570 | 2.30 | 131K | 131K | 26.3 (#65) | 22.1 (#65) |
| Kimi K2 Instruct | Hyperbolic | 2.00 | 2.00 | 131K | 131K | 26.3 (#65) | 22.1 (#65) |
| GPT-4.1 | GitHub Copilot | N/A | N/A | 128K | 16K | 26.3 (#65) | 21.8 (#68) |
| Ft:gpt 4 0613 | OpenAI | 30.00 | 60.00 | 8K | 4K | 26.3 (#65) | 21.8 (#68) |
| Kimi K2 Instruct | Fireworks AI | 0.600 | 2.50 | 131K | 16K | 26.3 (#65) | 22.1 (#65) |
| Kimi K2 Instruct | DeepInfra | 0.500 | 2.00 | 131K | 131K | 26.3 (#65) | 22.1 (#65) |
| GPT-4.1 | Azure OpenAI | 2.20 | 8.80 | 1.0M | 33K | 26.3 (#65) | 21.8 (#68) |
| o3 mini | Vercel AI Gateway | 1.10 | 4.40 | 200K | 100K | 25.9 (#67) | 17.9 (#85) |
| o3 mini | OpenRouter | 1.10 | 4.40 | 128K | 66K | 25.9 (#67) | 17.9 (#85) |
| o3 mini | OpenAI | 1.10 | 4.40 | 200K | 100K | 25.9 (#67) | 17.9 (#85) |
| Openai O3 Mini | Gradient AI | 1.10 | 4.40 | 100K | N/A | 25.9 (#67) | 17.9 (#85) |
| o3 mini | Azure OpenAI | 1.10 | 4.40 | 200K | 100K | 25.9 (#67) | 17.9 (#85) |
| Grok 3 | xAI | 3.00 | 15.00 | 131K | 131K | 25.2 (#69) | 19.8 (#74) |
| Grok 3 | Vercel AI Gateway | 3.00 | 15.00 | 131K | 131K | 25.2 (#69) | 19.8 (#74) |
| Xai.grok 3 | Oracle Cloud (OCI) | 3.00 | 15.00 | 131K | 131K | 25.2 (#69) | 19.8 (#74) |
| Grok 3 | Azure AI | 3.00 | 15.00 | 131K | 131K | 25.2 (#69) | 19.8 (#74) |
| Qwen3 235B A22B Instruct 2507 | Weights & Biases | 0.010 | 0.010 | 262K | 262K | 25.0 (#70) | 22.1 (#65) |
| Qwen3 235B A22b Instruct 2507 Maas | Vertex AI (Qwen) | 0.250 | 1.00 | 262K | 16K | 25.0 (#70) | 22.1 (#65) |
| Qwen3 235B A22B Instruct 2507 Tput | Together AI | 0.200 | 6.00 | 262K | N/A | 25.0 (#70) | 22.1 (#65) |
| Qwen3 235B A22b 2507 V1 | AWS Bedrock | 0.220 | 0.880 | 262K | 131K | 25.0 (#70) | 22.1 (#65) |
| Qwen3 235B A22b 2507 | OpenRouter | 0.071 | 0.100 | 262K | 262K | 25.0 (#70) | 22.1 (#65) |
| Qwen3 235B A22b Instruct 2507 | Novita AI | 0.090 | 0.580 | 131K | 16K | 25.0 (#70) | 22.1 (#65) |
| Qwen3 235B A22b Instruct 2507 | Fireworks AI | 0.220 | 0.880 | 262K | 262K | 25.0 (#70) | 22.1 (#65) |
| Qwen3 235B A22B Instruct 2507 | DeepInfra | 0.090 | 0.600 | 262K | 262K | 25.0 (#70) | 22.1 (#65) |
| Qwen3 Coder 480B A35B Instruct | Weights & Biases | 0.100 | 0.150 | 262K | 262K | 24.8 (#71) | 24.6 (#56) |
| Qwen3 Coder 480B A35b Instruct Maas | Vertex AI (Qwen) | 1.00 | 4.00 | 262K | 33K | 24.8 (#71) | 24.6 (#56) |
| Qwen3 Coder 480B A35B Instruct FP8 | Together AI | 2.00 | 2.00 | 256K | N/A | 24.8 (#71) | 24.6 (#56) |
| Qwen3 Coder 480B A35b V1 | AWS Bedrock | 0.220 | 1.80 | 262K | 66K | 24.8 (#71) | 24.6 (#56) |
| Qwen3 Coder:480B Cloud | Ollama | N/A | N/A | 262K | 262K | 24.8 (#71) | 24.6 (#56) |
| Qwen3 Coder 480B A35b Instruct | Novita AI | 0.300 | 1.30 | 262K | 66K | 24.8 (#71) | 24.6 (#56) |
| Qwen3 Coder 480B A35b Instruct | Fireworks AI | 0.450 | 1.80 | 262K | 262K | 24.8 (#71) | 24.6 (#56) |
| Qwen3 Coder 480B A35B Instruct | DeepInfra | 0.400 | 1.60 | 262K | 262K | 24.8 (#71) | 24.6 (#56) |
| GPT-oss-20b | Weights & Biases | 0.0050 | 0.020 | 131K | 131K | 24.5 (#72) | 18.5 (#80) |
| GPT-oss-20b-maas | Vertex AI (OpenAI) | 0.075 | 0.300 | 131K | 33K | 24.5 (#72) | 18.5 (#80) |
| GPT-oss-20b | Together AI | 0.050 | 0.200 | 128K | N/A | 24.5 (#72) | 18.5 (#80) |
| GPT-oss-20b | Replicate | 0.090 | 0.360 | N/A | N/A | 24.5 (#72) | 18.5 (#80) |
| GPT-oss-20b | OVHcloud | 0.040 | 0.150 | 131K | 131K | 24.5 (#72) | 18.5 (#80) |
| GPT-oss-20b | OpenRouter | 0.020 | 0.100 | 131K | 33K | 24.5 (#72) | 18.5 (#80) |
| GPT-oss:20b-cloud | Ollama | N/A | N/A | 131K | 131K | 24.5 (#72) | 18.5 (#80) |
| GPT-oss-20b | Novita AI | 0.040 | 0.150 | 131K | 33K | 24.5 (#72) | 18.5 (#80) |
| GPT-oss-20b-mxfp4-GGUF | Lemonade (AMD) | N/A | N/A | 131K | 33K | 24.5 (#72) | 18.5 (#80) |
| GPT-oss-20b | Groq | 0.075 | 0.300 | 131K | 33K | 24.5 (#72) | 18.5 (#80) |
| GPT-oss-20b | Fireworks AI | 0.050 | 0.200 | 131K | 131K | 24.5 (#72) | 18.5 (#80) |
| GPT-oss-20b | DeepInfra | 0.040 | 0.150 | 131K | 131K | 24.5 (#72) | 18.5 (#80) |
| Databricks GPT OSS 20B | Databricks | 0.070 | 0.300 | 131K | 131K | 24.5 (#72) | 18.5 (#80) |
| GPT-oss-20b | AWS Bedrock | 0.075 | 0.300 | 131K | 33K | 24.5 (#72) | 18.5 (#80) |
| Kimi K2 Thinking Maas | Vertex AI (Moonshot) | 0.600 | 2.50 | 256K | 256K | 24.1 (#73) | 15.5 (#96) |
| Kimi K2 Thinking | Novita AI | 0.600 | 2.50 | 262K | 262K | 24.1 (#73) | 15.5 (#96) |
| Kimi K2 Thinking | Moonshot AI (Kimi) | 0.600 | 2.50 | 262K | 262K | 24.1 (#73) | 15.5 (#96) |
| Kimi K2 Thinking 251104 | Volcengine (ByteDance) | N/A | N/A | 229K | 33K | 24.1 (#73) | 15.5 (#96) |
| Kimi K2 Thinking | GMI Cloud | 0.800 | 1.20 | 262K | 16K | 24.1 (#73) | 15.5 (#96) |
| Kimi K2 Thinking | Fireworks AI | 0.600 | 2.50 | 262K | 262K | 24.1 (#73) | 15.5 (#96) |
| Moonshotai.kimi K2 Thinking | AWS Bedrock | 0.730 | 3.03 | 262K | 262K | 24.1 (#73) | 15.5 (#96) |
| o1-preview | OpenAI | N/A | N/A | 128K | 33K | 23.7 (#74) | 34.0 (#34) |
| o1-preview | Azure OpenAI | 15.00 | 60.00 | 128K | 33K | 23.7 (#74) | 34.0 (#34) |
| Grok 4 1 Fast Non Reasoning | xAI | 0.200 | 0.500 | 2.0M | 2.0M | 23.6 (#75) | 19.5 (#75) |
| Grok 4 1 Fast Non Reasoning | Azure AI | 0.200 | 0.500 | 131K | 131K | 23.6 (#75) | 19.5 (#75) |
| Glm 4.5 Air | Z AI (Zhipu) | 0.200 | 1.10 | 128K | 32K | 23.2 (#76) | 23.8 (#58) |
| Glm 4.5 Air | Vercel AI Gateway | 0.200 | 1.10 | 128K | 96K | 23.2 (#76) | 23.8 (#58) |
| GLM 4.5 Air FP8 | Together AI | 0.200 | 1.10 | 128K | N/A | 23.2 (#76) | 23.8 (#58) |
| Glm 4.5 Air | Novita AI | 0.130 | 0.850 | 131K | 98K | 23.2 (#76) | 23.8 (#58) |
| Grok 4 Fast Non Reasoning | xAI | 0.200 | 0.500 | 2.0M | 2.0M | 23.1 (#77) | 19.0 (#79) |
| Nova 2 Pro Preview 20251202 V1 | AWS Bedrock | 2.19 | 17.50 | 1.0M | 64K | 23.1 (#77) | 20.5 (#71) |
| GPT-4.1 mini | Vercel AI Gateway | 0.400 | 1.60 | 1.0M | 33K | 22.9 (#79) | 18.5 (#80) |
| GPT-4.1 mini | Replicate | 0.400 | 1.60 | N/A | N/A | 22.9 (#79) | 18.5 (#80) |
| GPT-4.1 mini | OpenRouter | 0.400 | 1.60 | 1.0M | 33K | 22.9 (#79) | 18.5 (#80) |
| GPT-4.1 mini | OpenAI | 0.400 | 1.60 | 1.0M | 33K | 22.9 (#79) | 18.5 (#80) |
| GPT-4.1 mini | Azure OpenAI | 0.440 | 1.76 | 1.0M | 33K | 22.9 (#79) | 18.5 (#80) |
| Mistral Large 3 | Mistral AI | 0.500 | 1.50 | 262K | 262K | 22.8 (#80) | 22.7 (#64) |
| Mistral Large 3 675B Instruct | AWS Bedrock | 0.500 | 1.50 | 128K | 8K | 22.8 (#80) | 22.7 (#64) |
| Mistral Large 3 Fp8 | Fireworks AI | 1.20 | 1.20 | 256K | 256K | 22.8 (#80) | 22.7 (#64) |
| Mistral Large 3 | Azure AI | 0.500 | 1.50 | 256K | 8K | 22.8 (#80) | 22.7 (#64) |
| Qwen3 30B A3b Thinking 2507 | Fireworks AI | 0.900 | 0.900 | 262K | 262K | 22.4 (#81) | 14.7 (#99) |
| DeepSeek V3 0324 | Weights & Biases | 0.114 | 0.275 | 161K | 161K | 22.3 (#82) | 22.0 (#67) |
| DeepSeek V3 0324 | Novita AI | 0.270 | 1.12 | 164K | 164K | 22.3 (#82) | 22.0 (#67) |
| DeepSeek V3 0324 | Lambda | 0.200 | 0.600 | 131K | 131K | 22.3 (#82) | 22.0 (#67) |
| DeepSeek V3 0324 | Hyperbolic | 0.400 | 0.400 | 33K | 33K | 22.3 (#82) | 22.0 (#67) |
| DeepSeek V3 0324 | GMI Cloud | 0.280 | 0.880 | 164K | 16K | 22.3 (#82) | 22.0 (#67) |
| DeepSeek V3 0324 | Fireworks AI | 0.900 | 0.900 | 164K | 164K | 22.3 (#82) | 22.0 (#67) |
| DeepSeek V3 0324 | DeepInfra | 0.250 | 0.880 | 164K | 164K | 22.3 (#82) | 22.0 (#67) |
| Devstral | Mistral AI | 0.400 | 2.00 | 256K | 256K | 22.0 (#83) | 23.7 (#59) |
| Mistral Medium 3 1 2508 | Mistral AI | 0.400 | 2.00 | 131K | 131K | 21.3 (#84) | 18.3 (#82) |
| Minimax M1 80K | Novita AI | 0.550 | 2.20 | 1.0M | 40K | 20.9 (#85) | 14.1 (#104) |
| Minimax M1 80K | Fireworks AI | 0.100 | 0.100 | 4K | 4K | 20.9 (#85) | 14.1 (#104) |
| Qwen3 Vl 235B A22b | AWS Bedrock | 0.530 | 2.66 | 128K | 8K | 20.8 (#86) | 16.5 (#89) |
| Qwen3 Vl 235B A22b Instruct | Novita AI | 0.300 | 1.50 | 131K | 33K | 20.8 (#86) | 16.5 (#89) |
| Qwen3 Vl 235B A22b Instruct | Fireworks AI | 0.220 | 0.880 | 262K | 262K | 20.8 (#86) | 16.5 (#89) |
| Qwen3 Vl 235B A22b Instruct | DashScope (Alibaba) | 0.400 | 1.60 | 131K | 33K | 20.8 (#86) | 16.5 (#89) |
| Gemini 2.5 Flash | Vercel AI Gateway | 0.300 | 2.50 | 1.0M | 66K | 20.6 (#87) | 17.8 (#86) |
| Gemini 2.5 Flash | Replicate | 2.50 | 2.50 | N/A | N/A | 20.6 (#87) | 17.8 (#86) |
| Gemini 2.5 Flash | OpenRouter | 0.300 | 2.50 | 1.0M | 8K | 20.6 (#87) | 17.8 (#86) |
| Gemini 2.5 Flash-native-audio | Google Gemini | 0.300 | 2.50 | 1.0M | 8K | 20.6 (#87) | 17.8 (#86) |
| Gemini 2.5 Flash | Google Vertex AI | 0.300 | 2.50 | 1.0M | 66K | 20.6 (#87) | 17.8 (#86) |
| Gemini 2.5 Flash | DeepInfra | 0.300 | 2.50 | 1.0M | 1.0M | 20.6 (#87) | 17.8 (#86) |
| Databricks Gemini 2 5 Flash | Databricks | 0.300 | 2.50 | 1.0M | 66K | 20.6 (#87) | 17.8 (#86) |
| o1 mini | Replicate | 1.10 | 4.40 | N/A | N/A | 20.4 (#88) | N/A |
| o1 mini | OpenAI | N/A | N/A | 128K | 66K | 20.4 (#88) | N/A |
| o1 mini | Azure OpenAI | 1.21 | 4.84 | 128K | 66K | 20.4 (#88) | N/A |
| Qwen3 Next 80B A3b Instruct Maas | Vertex AI (Qwen) | 0.150 | 1.20 | 262K | 262K | 20.1 (#89) | 15.3 (#97) |
| Qwen3 Next 80B A3B Instruct | Together AI | 0.150 | 1.50 | 262K | N/A | 20.1 (#89) | 15.3 (#97) |
| Qwen3 Next 80B A3b | AWS Bedrock | 0.150 | 1.20 | 128K | 8K | 20.1 (#89) | 15.3 (#97) |
| Qwen3 Next 80B A3b Instruct | Novita AI | 0.150 | 1.50 | 131K | 33K | 20.1 (#89) | 15.3 (#97) |
| Qwen3 Next 80B A3b Instruct | Fireworks AI | 0.900 | 0.900 | 4K | 4K | 20.1 (#89) | 15.3 (#97) |
| Qwen3 Next 80B A3B Instruct | DeepInfra | 0.140 | 1.40 | 262K | 262K | 20.1 (#89) | 15.3 (#97) |
| Qwen3 Next 80B A3b Instruct | DashScope (Alibaba) | 0.150 | 1.20 | 262K | 66K | 20.1 (#89) | 15.3 (#97) |
| Qwen3 Coder 30B A3b V1 | AWS Bedrock | 0.150 | 0.600 | 262K | 131K | 20.0 (#90) | 19.4 (#78) |
| Qwen3 Coder 30B A3b Instruct | Novita AI | 0.070 | 0.270 | 160K | 33K | 20.0 (#90) | 19.4 (#78) |
| Qwen3 Coder 30B A3B Instruct GGUF | Lemonade (AMD) | N/A | N/A | 262K | 33K | 20.0 (#90) | 19.4 (#78) |
| GPT-4.5 Preview | OpenAI | N/A | N/A | 128K | 16K | 20.0 (#90) | N/A |
| Qwen3 Coder 30B A3b Instruct | Fireworks AI | 0.150 | 0.600 | 262K | 262K | 20.0 (#90) | 19.4 (#78) |
| QwQ 32B | SambaNova | 0.500 | 1.00 | 16K | 16K | 19.7 (#92) | N/A |
| QwQ 32B | Nscale | 0.180 | 0.200 | N/A | N/A | 19.7 (#92) | N/A |
| QwQ 32B | Nebius | 0.150 | 0.450 | 33K | 33K | 19.7 (#92) | N/A |
| QwQ 32B | Hyperbolic | 0.200 | 0.200 | 131K | 131K | 19.7 (#92) | N/A |
| Qwen Qwq 32B Preview | Fireworks AI | 0.900 | 0.900 | 33K | 33K | 19.7 (#92) | N/A |
| QwQ 32B | DeepInfra | 0.150 | 0.400 | 131K | 131K | 19.7 (#92) | N/A |
| Devstral Small 2507 | Mistral AI | 0.100 | 0.300 | 128K | 128K | 19.5 (#93) | 20.7 (#70) |
| Us.amazon.nova Premier V1 | AWS Bedrock | 2.50 | 12.50 | 1.0M | 10K | 19.0 (#94) | 13.8 (#108) |
| Nova Premier V1 | Amazon Nova | 2.50 | 12.50 | 1.0M | 10K | 19.0 (#94) | 13.8 (#108) |
| Mistral Medium 3 | Vertex AI (Mistral) | 0.400 | 2.00 | 128K | 8K | 18.8 (#95) | 13.6 (#110) |
| Magistral Medium | Vercel AI Gateway | 2.00 | 5.00 | 128K | 64K | 18.8 (#95) | 16.0 (#91) |
| Magistral Medium | Mistral AI | 2.00 | 5.00 | 40K | 40K | 18.8 (#95) | 16.0 (#91) |
| Claude 3 5 Haiku | Anthropic (Vertex AI) | 1.00 | 5.00 | 200K | 8K | 18.7 (#97) | 10.7 (#130) |
| Claude 3.5 Haiku | Vercel AI Gateway | 0.800 | 4.00 | 200K | 8K | 18.7 (#97) | 10.7 (#130) |
| Claude 3.5 Haiku | Replicate | 1.00 | 5.00 | N/A | N/A | 18.7 (#97) | 10.7 (#130) |
| Devstral Medium | Mistral AI | 0.400 | 2.00 | 256K | 256K | 18.7 (#97) | 15.9 (#92) |
| Claude 3 5 Haiku | Heroku (Salesforce) | N/A | N/A | 4K | N/A | 18.7 (#97) | 10.7 (#130) |
| Anthropic Claude 3.5 Haiku | Gradient AI | 0.800 | 4.00 | 1K | N/A | 18.7 (#97) | 10.7 (#130) |
| Eu.anthropic.claude 3 5 Haiku 20241022 V1 | AWS Bedrock | 0.250 | 1.25 | 200K | 8K | 18.7 (#97) | 10.7 (#130) |
| Claude 3 5 Haiku | Anthropic | N/A | N/A | 200K | 8K | 18.7 (#97) | 10.7 (#130) |
| Gemini 2.0 Flash | Vercel AI Gateway | 0.150 | 0.600 | 1.0M | 8K | 18.5 (#99) | 13.6 (#110) |
| Gemini 2.0 Flash-001 | OpenRouter | 0.100 | 0.400 | 1.0M | 8K | 18.5 (#99) | 13.6 (#110) |
| Gemini 2.0 Flash-exp | Google Gemini | N/A | N/A | 1.0M | 8K | 18.5 (#99) | 13.6 (#110) |
| Gemini 2.0 Flash-exp | Google Vertex AI | N/A | N/A | 1.0M | 8K | 18.5 (#99) | 13.6 (#110) |
| Gemini 2.0 Flash-001 | DeepInfra | 0.100 | 0.400 | 1.0M | 1.0M | 18.5 (#99) | 13.6 (#110) |
| Llama 4 Maverick | Vercel AI Gateway | 0.200 | 0.600 | 131K | 8K | 18.4 (#100) | 15.6 (#94) |
| Magistral Small 1 2 2509 | Mistral AI | 0.500 | 1.50 | 40K | 40K | 18.2 (#101) | 14.8 (#98) |
| Magistral Small 2509 | AWS Bedrock | 0.500 | 1.50 | 128K | 8K | 18.2 (#101) | 14.8 (#98) |
| Gemini 2.0 Pro-exp-02-05 | Google Gemini | N/A | N/A | 2.1M | 8K | 18.1 (#102) | 25.5 (#54) |
| Gemini 2.0 Pro-exp-02-05 | Google Vertex AI | N/A | N/A | 2.1M | 8K | 18.1 (#102) | 25.5 (#54) |
| Claude 3 Opus | Anthropic (Vertex AI) | 15.00 | 75.00 | 200K | 4K | 18.0 (#161) | 19.5 (#75) |
| Claude 3 Opus | Vercel AI Gateway | 15.00 | 75.00 | 200K | 4K | 18.0 (#161) | 19.5 (#75) |
| Devstral Small 2505 | Mistral AI | 0.100 | 0.300 | 128K | 128K | 18.0 (#103) | 12.2 (#117) |
| Anthropic Claude 3 Opus | Gradient AI | 15.00 | 75.00 | 1K | N/A | 18.0 (#161) | 19.5 (#75) |
| Devstral Small 2505 | Fireworks AI | 0.900 | 0.900 | 131K | 131K | 18.0 (#103) | 12.2 (#117) |
| Claude 3 Opus 20240229 V1 | AWS Bedrock | 15.00 | 75.00 | 200K | 4K | 18.0 (#161) | 19.5 (#75) |
| Nova 2 Lite V1 | AWS Bedrock | 0.300 | 2.50 | 1.0M | 64K | 18.0 (#103) | 12.5 (#116) |
| Sonar Reasoning | Vercel AI Gateway | 1.00 | 5.00 | 127K | 8K | 17.9 (#105) | N/A |
| Sonar Reasoning | Perplexity | 1.00 | 5.00 | 128K | N/A | 17.9 (#105) | N/A |
| Hermes 3 Llama 3.1 405B | Nebius | 1.00 | 3.00 | 128K | 128K | 17.6 (#106) | 18.1 (#84) |
| Hermes3 405B | Lambda | 0.800 | 0.800 | 131K | 131K | 17.6 (#106) | 18.1 (#84) |
| Hermes 3 Llama 3.1 405B | DeepInfra | 1.00 | 1.00 | 131K | 131K | 17.6 (#106) | 18.1 (#84) |
| Llama 3.1 405B Instruct Maas | Vertex AI (Llama) | 5.00 | 16.00 | 128K | 2K | 17.4 (#107) | 14.5 (#100) |
| Meta Llama 3.1 405B Instruct Turbo | Together AI | 3.50 | 3.50 | N/A | N/A | 17.4 (#107) | 14.5 (#100) |
| Llama3.1 405B | Snowflake | N/A | N/A | 128K | 8K | 17.4 (#107) | 14.5 (#100) |
| Meta Llama 3.1 405B Instruct | SambaNova | 5.00 | 10.00 | 16K | 16K | 17.4 (#107) | 14.5 (#100) |
| Llama 3.1 405B Instruct | Oracle Cloud (OCI) | 10.68 | 10.68 | 128K | 4K | 17.4 (#107) | 14.5 (#100) |
| Meta Llama 3.1 405B Instruct | Nebius | 1.00 | 3.00 | 128K | 128K | 17.4 (#107) | 14.5 (#100) |
| Llama3 1 405B Instruct V1 | AWS Bedrock | 5.32 | 16.00 | 128K | 4K | 17.4 (#107) | 14.5 (#100) |
| Llama3.1 405B Instruct Fp8 | Lambda | 0.800 | 0.800 | 131K | 131K | 17.4 (#107) | 14.5 (#100) |
| Meta Llama 3.1 405B Instruct | Hyperbolic | 0.120 | 0.300 | 33K | 33K | 17.4 (#107) | 14.5 (#100) |
| Llama V3p1 405B Instruct | Fireworks AI | 3.00 | 3.00 | 128K | 16K | 17.4 (#107) | 14.5 (#100) |
| Databricks Meta Llama 3 1 405B Instruct | Databricks | 5.00 | 15.00 | 128K | 128K | 17.4 (#107) | 14.5 (#100) |
| Meta Llama 3.1 405B Instruct | Azure AI | 5.33 | 16.00 | 128K | 2K | 17.4 (#107) | 14.5 (#100) |
| GPT-4o | Vercel AI Gateway | 2.50 | 10.00 | 128K | 16K | 17.3 (#108) | 16.7 (#88) |
| GPT-4o | Replicate | 2.50 | 10.00 | N/A | N/A | 17.3 (#108) | 16.7 (#88) |
| GPT-4o | OpenRouter | 2.50 | 10.00 | 128K | 4K | 17.3 (#108) | 16.7 (#88) |
| Openai GPT 4o | Gradient AI | N/A | N/A | 16K | N/A | 17.3 (#108) | 16.7 (#88) |
| GPT-4o | GMI Cloud | 2.50 | 10.00 | 131K | 16K | 17.3 (#108) | 16.7 (#88) |
| GPT-4-o Preview | GitHub Copilot | N/A | N/A | 64K | 4K | 17.3 (#108) | 16.7 (#88) |
| Chatgpt 4o | OpenAI | 5.00 | 15.00 | 128K | 4K | 17.3 (#108) | 16.7 (#88) |
| GPT-4o | Azure OpenAI | 2.50 | 10.00 | 128K | 16K | 17.3 (#108) | 16.7 (#88) |
| DeepSeek R1 Distill Qwen 32B | Nscale | 0.150 | 0.150 | N/A | N/A | 17.2 (#109) | N/A |
| DeepSeek R1 Distill Qwen 32B | Novita AI | 0.300 | 0.300 | 64K | 32K | 17.2 (#109) | N/A |
| Qwen3 Vl 32B Instruct | Fireworks AI | 0.900 | 0.900 | 4K | 4K | 17.2 (#109) | 15.6 (#94) |
| DeepSeek R1 Distill Qwen 32B | Fireworks AI | 0.900 | 0.900 | 131K | 131K | 17.2 (#109) | N/A |
| DeepSeek R1 Distill Qwen 32B | DeepInfra | 0.270 | 0.270 | 131K | 131K | 17.2 (#109) | N/A |
| Qwen3 Vl 32B Instruct | DashScope (Alibaba) | 0.160 | 0.640 | 131K | 33K | 17.2 (#109) | 15.6 (#94) |
| Glm 4.6v | Novita AI | 0.300 | 0.900 | 131K | 33K | 17.1 (#111) | 11.1 (#123) |
| Qwen 3 235B | Vercel AI Gateway | 0.200 | 0.600 | 41K | 16K | 17.0 (#112) | 14.0 (#105) |
| Qwen3 235B A22B Fp8 Tput | Together AI | 0.200 | 0.600 | 40K | N/A | 17.0 (#112) | 14.0 (#105) |
| Qwen3 235B A22b Instruct 2507 | Replicate | 0.264 | 1.06 | N/A | N/A | 17.0 (#112) | 14.0 (#105) |
| Qwen3 235B A22b Fp8 | Novita AI | 0.200 | 0.800 | 41K | 20K | 17.0 (#112) | 14.0 (#105) |
| Qwen3 235B A22B | Nebius | 0.200 | 0.600 | 262K | 262K | 17.0 (#112) | 14.0 (#105) |
| Qwen3 235B A22B | Hyperbolic | 2.00 | 2.00 | 131K | 131K | 17.0 (#112) | 14.0 (#105) |
| Qwen3 VL 235B A22B Instruct FP8 | GMI Cloud | 0.300 | 1.40 | 262K | 16K | 17.0 (#112) | 14.0 (#105) |
| Qwen3 235B A22b | Fireworks AI | 0.220 | 0.880 | 131K | 131K | 17.0 (#112) | 14.0 (#105) |
| Qwen3 235B A22B | DeepInfra | 0.180 | 0.540 | 41K | 41K | 17.0 (#112) | 14.0 (#105) |
| Magistral Small | Vercel AI Gateway | 0.500 | 1.50 | 128K | 64K | 16.8 (#113) | 11.1 (#123) |
| Magistral Small | Mistral AI | 0.500 | 1.50 | 40K | 40K | 16.8 (#113) | 11.1 (#123) |
| Qwen3 Vl 8B Instruct | Novita AI | 0.080 | 0.500 | 131K | 33K | 16.7 (#114) | 9.8 (#137) |
| DeepSeek V3 | Vercel AI Gateway | 0.900 | 0.900 | 128K | 8K | 16.5 (#115) | 16.4 (#90) |
| DeepSeek V3 | Together AI | 1.25 | 1.25 | 66K | 8K | 16.5 (#115) | 16.4 (#90) |
| DeepSeek V3 0324 | SambaNova | 3.00 | 4.50 | 33K | 33K | 16.5 (#115) | 16.4 (#90) |
| DeepSeek V3 | Replicate | 1.45 | 1.45 | 66K | 8K | 16.5 (#115) | 16.4 (#90) |
| DeepSeek V3 Turbo | Novita AI | 0.400 | 1.30 | 64K | 16K | 16.5 (#115) | 16.4 (#90) |
| DeepSeek V3 | Nebius | 0.500 | 1.50 | 128K | 128K | 16.5 (#115) | 16.4 (#90) |
| DeepSeek V3 | Hyperbolic | 0.200 | 0.200 | 33K | 33K | 16.5 (#115) | 16.4 (#90) |
| DeepSeek V3 | Fireworks AI | 0.900 | 0.900 | 128K | 8K | 16.5 (#115) | 16.4 (#90) |
| DeepSeek V3 | DeepSeek | 0.270 | 1.10 | 66K | 8K | 16.5 (#115) | 16.4 (#90) |
| V3 V1 | AWS Bedrock | 0.580 | 1.68 | 164K | 82K | 16.5 (#115) | 16.4 (#90) |
| DeepSeek V3 | DeepInfra | 0.380 | 0.890 | 164K | 164K | 16.5 (#115) | 16.4 (#90) |
| DeepSeek V3 | Azure AI | 1.14 | 4.56 | 128K | 8K | 16.5 (#115) | 16.4 (#90) |
| DeepSeek R1 0528 Distill Qwen3 8B | Fireworks AI | 0.200 | 0.200 | 131K | 131K | 16.4 (#116) | 7.8 (#141) |
| Qwen Max | DashScope (Alibaba) | 1.60 | 6.40 | 31K | 8K | 16.3 (#117) | N/A |
| Qwen3 Vl 30B A3b Instruct | Novita AI | 0.200 | 0.700 | 131K | 33K | 16.1 (#118) | 14.3 (#102) |
| Qwen3 Vl 30B A3b Instruct | Fireworks AI | 0.150 | 0.600 | 262K | 262K | 16.1 (#118) | 14.3 (#102) |
| DeepSeek R1 Distill Llama 70B | Vercel AI Gateway | 0.750 | 0.990 | 131K | 131K | 16.0 (#119) | 11.4 (#120) |
| DeepSeek R1 Distill Llama 70B | SambaNova | 0.700 | 1.40 | 131K | 131K | 16.0 (#119) | 11.4 (#120) |
| DeepSeek R1 Distill Llama 70B | OVHcloud | 0.670 | 0.670 | 131K | 131K | 16.0 (#119) | 11.4 (#120) |
| Ministral 14B 2512 | OpenRouter | 0.200 | 0.200 | 262K | 262K | 16.0 (#119) | 10.9 (#126) |
| DeepSeek R1 Distill Llama 70B | Nscale | 0.375 | 0.375 | N/A | N/A | 16.0 (#119) | 11.4 (#120) |
| DeepSeek R1 Distill Llama 70B | Novita AI | 0.800 | 0.800 | 8K | 8K | 16.0 (#119) | 11.4 (#120) |
| DeepSeek R1 Distill Llama 70B | Nebius | 0.250 | 0.750 | 128K | 128K | 16.0 (#119) | 11.4 (#120) |
| Ministral 3 14B Instruct | AWS Bedrock | 0.200 | 0.200 | 128K | 8K | 16.0 (#119) | 10.9 (#126) |
| DeepSeek R1 Distill Llama 70B | Gradient AI | 0.990 | 0.990 | 8K | N/A | 16.0 (#119) | 11.4 (#120) |
| Gemini 1.5 Pro | Google Gemini | N/A | N/A | 1.0M | 8K | 16.0 (#119) | 23.6 (#61) |
| Gemini 1.5 Pro | Google Vertex AI | N/A | N/A | 2.1M | 8K | 16.0 (#119) | 23.6 (#61) |
| DeepSeek R1 Distill Llama 70B | Fireworks AI | 0.900 | 0.900 | 131K | 131K | 16.0 (#119) | 11.4 (#120) |
| DeepSeek R1 Distill Llama 70B | DeepInfra | 0.200 | 0.600 | 131K | 131K | 16.0 (#119) | 11.4 (#120) |
| Claude 3 5 Sonnet | Anthropic (Vertex AI) | 3.00 | 15.00 | 200K | 8K | 15.9 (#122) | 30.2 (#40) |
| Claude 3 5 Sonnet | Vercel AI Gateway | 3.00 | 15.00 | 200K | 8K | 15.9 (#122) | 30.2 (#40) |
| Claude 3 5 Sonnet | Snowflake | N/A | N/A | 18K | 8K | 15.9 (#122) | 30.2 (#40) |
| Claude 3.5 Sonnet | Replicate | 3.75 | 18.75 | N/A | N/A | 15.9 (#122) | 30.2 (#40) |
| Claude 3.5 Sonnet | OpenRouter | 3.00 | 15.00 | 200K | 8K | 15.9 (#122) | 30.2 (#40) |
Claude 3 5 SonnetHeroku (Salesforce)N/AN/Acompare8KN/A15.9#12230.2#40
Anthropic Claude 3.5 SonnetGradient AI3.0015.00compare1KN/A15.9#12230.2#40
Claude 3 5 SonnetAnthropicN/AN/Acompare200K8K15.9#12230.2#40
Claude 3 5 Sonnet 20240620 V1AWS Bedrock3.0015.00compare1.0M4K15.9#12230.2#40
DeepSeek R1 Distill Qwen 14BNscale0.0700.070compareN/AN/A15.8#123N/A
DeepSeek R1 Distill Qwen 14BNovita AI0.1500.150compare33K16K15.8#123N/A
DeepSeek R1 Distill Qwen 14BFireworks AI0.2000.200compare131K131K15.8#123N/A
Qwen2.5 72B Instruct TurboTogether AIN/AN/AcompareN/AN/A15.6#12411.9#119
Qwen 2.5 72B InstructNovita AI0.3800.400compare32K8K15.6#12411.9#119
Qwen2.5 72B InstructNebius0.1300.400compare128K128K15.6#12411.9#119
Qwen2.5 72B InstructHyperbolic0.1200.300compare131K131K15.6#12411.9#119
Qwen2p5 72BFireworks AI0.9000.900compare131K131K15.6#12411.9#119
Qwen2.5 72B InstructDeepInfra0.1200.390compare33K33K15.6#12411.9#119
SonarVercel AI Gateway1.001.00compare127K8K15.5#125N/A
SonarPerplexity1.001.00compare128KN/A15.5#125N/A
Sonar Reasoning ProVercel AI Gateway2.008.00compare127K8K15.2#126N/A
Devstral SmallVercel AI Gateway0.0700.280compare128K128K15.2#12612.1#118
Sonar Reasoning ProPerplexity2.008.00compare128KN/A15.2#126N/A
Devstral SmallMistral AI0.1000.300compare256K256K15.2#12612.1#118
Mistral Large2SnowflakeN/AN/Acompare128K8K15.1#12813.8#108
Mistral Small 3.2 24B Instruct 2506OVHcloud0.0900.280compare128K128K15.1#12813.3#112
Mistral Small 3.2 24B InstructOpenRouter0.1000.300compare32KN/A15.1#12813.3#112
Mistral Small 3 2 2506Mistral AI0.0600.180compare131K131K15.1#12813.3#112
Mistral Small 3.2 24B Instruct 2506DeepInfra0.0750.200compare128K128K15.1#12813.3#112
ERNIE 4.5 300B A47b PaddleNovita AI0.2801.10compare123K12K15.0#13014.5#100
Llama 3.1 Nemotron Ultra 253B V1Nebius0.6001.80compare128K128K15.0#13013.1#114
Qwen3 30B A3b Instruct 2507Fireworks AI0.5000.500compare262K262K15.0#13014.2#103
ERNIE 4p5 300B A47b PtFireworks AI0.1000.100compare4K4K15.0#13014.5#100
Ministral 8B 2512OpenRouter0.1500.150compare262K262K14.8#13310.0#135
Ministral 3 8B InstructAWS Bedrock0.1500.150compare128K8K14.8#13310.0#135
Gemini 2.0 Flash-LiteVercel AI Gateway0.0750.300compare1.0M8K14.7#134N/A
Gemini 2.0 Flash-LiteGoogle Gemini0.0750.300compare1.0M8K14.7#134N/A
Gemini 2.0 Flash-LiteGoogle Vertex AI0.0750.300compare1.0M8K14.7#134N/A
Llama 3.3 Nemotron Super 49B V1.5DeepInfra0.1000.400compare131K131K14.6#13510.5#133
Mistral Small 3 1 24B Instruct 2503IBM watsonx0.1000.300compare32K32K14.5#13613.9#107
Llama 3 3 70B InstructIBM watsonx0.7100.710compare128K128K14.5#13610.7#130
Llama 3.3 70B InstructWeights & Biases0.0710.071compare128K128K14.5#13610.7#130
Llama 3.3 70BVercel AI Gateway0.7200.720compare128K8K14.5#13610.7#130
Qwen 3 32BVercel AI Gateway0.1000.300compare41K16K14.5#136N/A
Llama 3.3 70B Instruct TurboTogether AI0.8800.880compareN/AN/A14.5#13610.7#130
Llama3.3 70BSnowflakeN/AN/Acompare128K8K14.5#13610.7#130
Qwen3 32BSambaNova0.4000.800compare8K8K14.5#136N/A
Meta Llama 3.3 70B InstructSambaNova0.6001.20compare131K131K14.5#13610.7#130
Qwen3 32B V1AWS Bedrock0.1500.600compare131K16K14.5#136N/A
Qwen3 32BOVHcloud0.0800.230compare32K32K14.5#136N/A
Meta Llama 3 3 70B InstructOVHcloud0.6700.670compare131K131K14.5#13610.7#130
Mistral Small 3.1 24B InstructOpenRouter0.1000.300compare32KN/A14.5#13613.9#107
Llama 3.3 70B InstructOracle Cloud (OCI)0.7200.720compare128K4K14.5#13610.7#130
Llama 3.3 70B InstructNscale0.2000.200compareN/AN/A14.5#13610.7#130
Qwen3 32B Fp8Novita AI0.1000.450compare41K20K14.5#136N/A
Llama 3.3 70B InstructNovita AI0.1350.400compare131K120K14.5#13610.7#130
Qwen3 32BNebius0.1000.300compare33K33K14.5#136N/A
Llama 3.3 70B InstructNebius0.1300.400compare128K128K14.5#13610.7#130
Llama3 3 70B Instruct V1AWS Bedrock0.7200.720compare128K4K14.5#13610.7#130
Llama 3.3 70B InstructMeta LlamaN/AN/Acompare128K4K14.5#13610.7#130
Qwen3 32B Fp8Lambda0.0500.100compare131K131K14.5#136N/A
DeepSeek Llama3.3 70BLambda0.2000.600compare131K131K14.5#13610.7#130
Llama 3.3 70B InstructHyperbolic0.1200.300compare131K131K14.5#13610.7#130
Qwen3 32BGroq0.2900.590compare131K131K14.5#136N/A
Llama 3.3 70B VersatileGroq0.5900.790compare128K33K14.5#13610.7#130
Llama3.3 70B InstructGradient AI0.6500.650compare2KN/A14.5#13610.7#130
Alibaba Qwen3 32BGradient AIN/AN/Acompare2KN/A14.5#136N/A
Qwen3 32BFireworks AI0.9000.900compare131K131K14.5#136N/A
Llama V3p3 70B InstructFireworks AI0.9000.900compare131K131K14.5#13610.7#130
Qwen3 32BDeepInfra0.1000.280compare41K41K14.5#136N/A
Llama 3.3 70B InstructDeepInfra0.2300.400compare131K131K14.5#13610.7#130
Databricks Meta Llama 3 3 70B InstructDatabricks0.5001.50compare128K128K14.5#13610.7#130
Qwen 3 32BCerebras0.4000.800compare128K128K14.5#136N/A
Llama 3.3 70BCerebras0.8501.20compare128K128K14.5#13610.7#130
Llama 3.3 70B InstructAzure AI0.7100.710compare128K2K14.5#13610.7#130
Llama 3.3 Nemotron Super 49B V1Nebius0.1000.400compare131K131K14.3#1397.6#144
Qwen3 Vl 8BLlamaGate0.1500.550compare33K8K14.3#1397.3#149
Qwen3 Vl 8B InstructFireworks AI0.2000.200compare4K4K14.3#1397.3#149
Pixtral Large 2411Mistral AI2.006.00compare128K128K14.0#141N/A
Grok 2 1212xAI2.0010.00compare131K131K13.9#142N/A
Gemini 1.5 FlashGoogle GeminiN/AN/Acompare1.0M8K13.8#143N/A
Gemini 1.5 FlashGoogle Vertex AIN/AN/Acompare1.0M8K13.8#143N/A
Llama 4 ScoutVercel AI Gateway0.1000.300compare131K8K13.5#1446.7#152
Command AVercel AI Gateway2.5010.00compare256K8K13.5#1449.9#136
Nova ProVercel AI Gateway0.8003.20compare300K8K13.5#14411.0#125
Nova Pro V1AWS Bedrock0.8003.20compare300K10K13.5#14411.0#125
Nova Pro V1Amazon Nova0.8003.20compare300K10K13.5#14411.0#125
Llama3.1 Nemotron 70B Instruct Fp8Lambda0.1200.300compare131K131K13.4#14710.8#128
Llama V3p1 Nemotron 70B InstructFireworks AI0.9000.900compare131K131K13.4#14710.8#128
Llama 3.1 Nemotron 70B InstructDeepInfra0.6000.600compare131K131K13.4#14710.8#128
Grok BetaxAI5.0015.00compare131K131K13.3#148N/A
Nvidia.nemotron Nano 9B V2AWS Bedrock0.0600.230compare128K8K13.2#1497.5#146
Nvidia.nemotron Nano 3 30BAWS Bedrock0.0600.240compare262K8K13.2#14915.8#93
Qwen2.5 32B InstructNebius0.0600.200compare128K128K13.2#149N/A
Qwen2p5 32BFireworks AI0.9000.900compare131K131K13.2#149N/A
Nvidia Nemotron Nano 9B V2Fireworks AI0.2000.200compare131K131K13.2#1497.5#146
NVIDIA Nemotron Nano 9B V2DeepInfra0.0400.160compare131K131K13.2#1497.5#146
Mistral LargeVertex AI (Mistral)2.006.00compare128K8K13.0#152N/A
GPT-4.1 nanoVercel AI Gateway0.1000.400compare1.0M33K13.0#15211.2#121
GPT-4.1 nanoReplicate0.1000.400compareN/AN/A13.0#15211.2#121
GPT-4.1 nanoOpenRouter0.1000.400compare1.0M33K13.0#15211.2#121
Mistral Large Instruct 2407OllamaN/AN/Acompare66K8K13.0#152N/A
Mistral Large 2407Mistral AI3.009.00compare128K128K13.0#152N/A
Mistral Large 2407 V1AWS Bedrock3.009.00compare128K8K13.0#152N/A
GPT-4.1 nanoOpenAI0.1000.400compare1.0M33K13.0#15211.2#121
GPT-4.1 nanoAzure OpenAI0.1100.440compare1.0M33K13.0#15211.2#121
Mistral Large 2407Azure AI2.006.00compare128K4K13.0#152N/A
Qwen2.5 Coder 32B InstructOVHcloud0.8700.870compare32K32K12.9#154N/A
Qwen 2.5 Coder 32B InstructOpenRouter0.1800.180compare34K34K12.9#154N/A
Qwen2.5 Coder 32B InstructNscale0.0600.200compareN/AN/A12.9#154N/A
Qwen3 4B Instruct 2507 GGUFLemonade (AMD)N/AN/Acompare262K33K12.9#1549.1#139
Qwen25 Coder 32B InstructLambda0.0500.100compare131K131K12.9#154N/A
Qwen2.5 Coder 32B InstructHyperbolic0.1200.300compare33K33K12.9#154N/A
Qwen3 4B Instruct 2507Fireworks AI0.2000.200compare262K262K12.9#1549.1#139
Qwen2p5 Coder 32BFireworks AI0.9000.900compare33K33K12.9#154N/A
GPT-4 TurboVercel AI Gateway10.0030.00compare128K4K12.8#15613.1#114
GPT-4OpenRouter30.0060.00compare8KN/A12.8#15613.1#114
GPT-4-32kOpenAIN/AN/Acompare33K4K12.8#15613.1#114
Glm 4.5vZ AI (Zhipu)0.6001.80compare128K32K12.7#15710.8#128
Nova LiteVercel AI Gateway0.0600.240compare300K8K12.7#1575.1#156
Glm 4.5vNovita AI0.6001.80compare66K16K12.7#15710.8#128
Gemini flash-liteGoogle Gemini0.1000.400compare1.0M66K12.7#1577.4#148
Gemini 2.5 Flash-LiteGoogle Vertex AI0.1000.400compare1.0M66K12.7#1577.4#148
Glm 4p5vFireworks AI1.201.20compare131K131K12.7#15710.8#128
Nova Lite V1AWS Bedrock0.0600.240compare300K10K12.7#1575.1#156
Nova Lite V1Amazon Nova0.0600.240compare300K10K12.7#1575.1#156
GPT-4o miniVercel AI Gateway0.1500.600compare128K16K12.6#160N/A
GPT-4o miniReplicate0.1500.600compareN/AN/A12.6#160N/A
Openai GPT 4o MiniGradient AIN/AN/Acompare16KN/A12.6#160N/A
GPT-4o miniOpenAI0.1500.600compare128K16K12.6#160N/A
GPT-4o miniGMI Cloud0.1500.600compare131K16K12.6#160N/A
GPT-4o miniGitHub CopilotN/AN/Acompare64K4K12.6#160N/A
GPT-4o miniAzure OpenAI0.1500.600compare128K16K12.6#160N/A
Llama 3.1 70B Instruct MaasVertex AI (Llama)N/AN/Acompare128K2K12.5#16110.9#126
Llama 3.1 70BVercel AI Gateway0.7200.720compare128K8K12.5#16110.9#126
Qwen 3 30BVercel AI Gateway0.1000.300compare41K16K12.5#16113.3#112
Meta Llama 3.1 70B Instruct TurboTogether AI0.8800.880compareN/AN/A12.5#16110.9#126
Llama3.1 70BSnowflakeN/AN/Acompare128K8K12.5#16110.9#126
Llama 3.1 70B InstructPerplexity1.001.00compare131K131K12.5#16110.9#126
Meta Llama 3 1 70B InstructOVHcloud0.6700.670compare131K131K12.5#16110.9#126
Qwen3 4B Fp8Novita AI0.0300.030compare128K20K12.5#161N/A
Qwen3 30B A3b Fp8Novita AI0.0900.450compare41K20K12.5#16113.3#112
Qwen3 4BNebius0.0800.240compare33K33K12.5#161N/A
Qwen3 30B A3BNebius0.1000.300compare33K33K12.5#16113.3#112
Meta Llama 3.1 70B InstructNebius0.1300.400compare128K128K12.5#16110.9#126
Llama3 1 70B Instruct V1AWS Bedrock0.9900.990compare128K2K12.5#16110.9#126
Llama3.1 70B Instruct Fp8Lambda0.1200.300compare131K131K12.5#16110.9#126
Meta Llama 3.1 70B InstructHyperbolic0.1200.300compare33K33K12.5#16110.9#126
Meta Llama 3.1 70B InstructFriendliAI0.6000.600compare8K8K12.5#16110.9#126
Qwen3 4BFireworks AI0.2000.200compare41K41K12.5#161N/A
Qwen3 30B A3bFireworks AI0.1500.600compare131K131K12.5#16113.3#112
Llama V3p1 70B InstructFireworks AI0.9000.900compare131K131K12.5#16110.9#126
DeepSeek V2p5Fireworks AI1.201.20compare33K33K12.5#161N/A
Qwen3 30B A3BDeepInfra0.0800.290compare41K41K12.5#16113.3#112
Meta Llama 3.1 70B InstructDeepInfra0.4000.400compare131K131K12.5#16110.9#126
Qwen3 30B A3bDashScope (Alibaba)N/AN/Acompare129K16K12.5#16113.3#112
Claude 3 OpusAnthropicN/AN/Acompare200K4K12.5#16119.5#75
Llama3.1 70BCerebras0.6000.600compare128K128K12.5#16110.9#126
Meta Llama 3.1 70B InstructAzure AI2.683.54compare128K2K12.5#16110.9#126
Claude 3 HaikuAnthropic (Vertex AI)0.2501.25compare200K4K12.3#1666.7#152
Claude 3 HaikuVercel AI Gateway0.2501.25compare200K4K12.3#1666.7#152
Claude 3 HaikuOpenRouter0.2501.25compare200KN/A12.3#1666.7#152
Gemini 2.0 Flash-thinking-expGoogle GeminiN/AN/Acompare1.0M66K12.3#166N/A
Gemini 2.0 Flash-thinking-expGoogle Vertex AIN/AN/Acompare1.0M8K12.3#166N/A
Claude 3 HaikuAnthropic0.2501.25compare200K4K12.3#1666.7#152
Claude 3 Haiku 20240307 V1AWS Bedrock0.2501.25compare200K4K12.3#1666.7#152
Mistral Saba 24BVercel AI Gateway0.7900.790compare33K33K12.1#168N/A
Olmo 3 32B ThinkPublic AIN/AN/Acompare33K4K12.1#16810.5#133
DeepSeek R1 Distill Llama 8BNscale0.0250.025compareN/AN/A12.1#168N/A
DeepSeek R1 Distill Llama 8BFireworks AI0.2000.200compare131K131K12.1#168N/A
Reka FlashSnowflakeN/AN/Acompare100K8K12.0#171N/A
Qwen TurboDashScope (Alibaba)0.0500.200compare1.0M16K12.0#171N/A
Llama 3 2 90B Vision InstructIBM watsonx2.002.00compare128K128K11.9#173N/A
Llama 3.2 90B Vision Instruct MaasVertex AI (Llama)N/AN/Acompare128K2K11.9#173N/A
Llama 3.2 90BVercel AI Gateway0.7200.720compare128K8K11.9#173N/A
Llama 3.2 90B Vision InstructOracle Cloud (OCI)2.002.00compare128K4K11.9#173N/A
Llama3 2 90B Instruct V1AWS Bedrock2.002.00compare128K4K11.9#173N/A
Llama V3p2 90B Vision InstructFireworks AI0.9000.900compare16K16K11.9#173N/A
Llama 3.2 90B Vision InstructAzure AI2.042.04compare128K2K11.9#173N/A
Llama 3.1 8B InstructWeights & Biases0.0220.022compare128K128K11.8#1744.9#157
Llama 3.1 8B Instruct MaasVertex AI (Llama)N/AN/Acompare128K2K11.8#1744.9#157
Llama 3.1 8BVercel AI Gateway0.0500.080compare131K131K11.8#1744.9#157
Meta Llama 3.1 8B Instruct TurboTogether AI0.1800.180compareN/AN/A11.8#1744.9#157
Llama3.1 8BSnowflakeN/AN/Acompare128K8K11.8#1744.9#157
Meta Llama 3.1 8B InstructSambaNova0.1000.200compare16K16K11.8#1744.9#157
Llama 3.1 8B InstructPerplexity0.2000.200compare131K131K11.8#1744.9#157
Llama 3.1 8B InstructOVHcloud0.1000.100compare131K131K11.8#1744.9#157
Llama3.1OllamaN/AN/Acompare8K8K11.8#1744.9#157
Llama 3.1 8B InstructNscale0.0300.030compareN/AN/A11.8#1744.9#157
Llama 3.1 8B InstructNovita AI0.0200.050compare16K16K11.8#1744.9#157
Meta Llama 3.1 8B InstructNebius0.0200.060compare128K128K11.8#1744.9#157
Llama3 1 8B Instruct V1AWS Bedrock0.2200.220compare128K2K11.8#1744.9#157
Llama 3.1 8BLlamaGate0.0300.050compare131K8K11.8#1744.9#157
Llama3.1 8B InstructLambda0.0250.040compare131K131K11.8#1744.9#157
Meta Llama 3.1 8B InstructHyperbolic0.1200.300compare33K33K11.8#1744.9#157
Llama 3.1 8B InstantGroq0.0500.080compare128K8K11.8#1744.9#157
Meta Llama 3.1 8B InstructFriendliAI0.1000.100compare8K8K11.8#1744.9#157
Llama V3p1 8B InstructFireworks AI0.1000.100compare16K16K11.8#1744.9#157
Meta Llama 3.1 8B InstructDeepInfra0.0300.050compare131K131K11.8#1744.9#157
Databricks Meta Llama 3 1 8B InstructDatabricks0.1500.450compare200K128K11.8#1744.9#157
Llama3.1 8BCerebras0.1000.100compare128K128K11.8#1744.9#157
Meta Llama 3.1 8B InstructAzure AI0.3000.610compare128K2K11.8#1744.9#157
Ministral 3B 2512OpenRouter0.1000.100compare131K131K11.2#1754.8#158
Ministral 3 3B InstructAWS Bedrock0.1000.100compare128K8K11.2#1754.8#158
Jamba Large 1.7AI21 Labs2.008.00compare256K256K10.9#1767.8#141
Granite 4 H SmallIBM watsonx0.0600.250compare20K20K10.8#1778.5#140
Jamba 1.5 LargeVertex AI (AI21)2.008.00compare256K256K10.7#178N/A
Jamba 1.5 LargeSnowflakeN/AN/Acompare256K8K10.7#178N/A
Jamba 1.5 LargeAI21 Labs2.008.00compare256K256K10.7#178N/A
Jamba 1 5 Large V1AWS Bedrock2.008.00compare256K256K10.7#178N/A
DeepSeek Coder V2 BaseOllamaN/AN/Acompare8K8K10.6#179N/A
Qwen3 8B Fp8Novita AI0.0350.138compare128K20K10.6#1797.1#150
Qwen3 8BLlamaGate0.0400.140compare33K8K10.6#1797.1#150
Hermes3 70BLambda0.1200.300compare131K131K10.6#179N/A
Jamba Large 1.6AI21 Labs2.008.00compare256K256K10.6#179N/A
Hermes 3 Llama 3.1 70BHyperbolic0.1200.300compare33K33K10.6#179N/A
Qwen3 8BFireworks AI0.2000.200compare41K41K10.6#1797.1#150
DeepSeek Coder V2 InstructFireworks AI1.201.20compare66K66K10.6#179N/A
Hermes 3 Llama 3.1 70BDeepInfra0.3000.300compare131K131K10.6#179N/A
Phi 4DeepInfra0.0700.140compare16K16K10.4#18311.2#121
Phi 4Azure AI0.1250.500compare16K16K10.4#18311.2#121
Claude 3 SonnetAnthropic (Vertex AI)3.0015.00compare200K4K10.3#184N/A
Nova MicroVercel AI Gateway0.0350.140compare128K8K10.3#1844.1#160
Gemma 3 27B ItNovita AI0.1190.200compare98K16K10.3#1849.6#138
Gemma 3 27B ItNebius0.0600.200compare128K128K10.3#1849.6#138
Gemma 3 27B ItAWS Bedrock0.2300.380compare128K8K10.3#1849.6#138
Gemma 3 27B ItGoogle GeminiN/AN/Acompare131K8K10.3#1849.6#138
Gemma 3 27B ItFireworks AI0.9000.900compare131K131K10.3#1849.6#138
Gemma 3 27B ItDeepInfra0.0900.160compare131K131K10.3#1849.6#138
Claude 3 Sonnet 20240229 V1AWS Bedrock3.0015.00compare200K4K10.3#184N/A
Nova Micro V1AWS Bedrock0.0350.140compare128K10K10.3#1844.1#160
Nova Micro V1Amazon Nova0.0350.140compare128K10K10.3#1844.1#160
Mistral SmallVercel AI Gateway0.1000.300compare32K4K10.2#187N/A
Mistral SmallMistral AI0.0600.180compare131K131K10.2#187N/A
Mistral SmallAzure AI1.003.00compare32K8K10.2#187N/A
Nvidia.nemotron Nano 12B V2AWS Bedrock0.2000.600compare128K8K10.1#1885.9#155
Gemini 1.0 UltraGoogle Vertex AIN/AN/Acompare8K2K10.1#18817.6#87
Phi 3 Mini 128K InstructFireworks AI0.1000.100compare131K131K10.1#1883.0#166
Nemotron Nano V2 12B VlFireworks AI0.1000.100compare4K4K10.1#1885.9#155
Phi 3 Mini 128K InstructAzure AI0.1300.520compare128K4K10.1#1883.0#166
Qwen2.5 Coder 7B InstructNscale0.0100.030compareN/AN/A10.0#191N/A
Qwen2.5 Coder 7BNebius0.0100.030compare33K33K10.0#191N/A
Qwen2.5 Coder 7BLlamaGate0.0600.120compare33K8K10.0#191N/A
Qwen2p5 Coder 7BFireworks AI0.2000.200compare33K33K10.0#191N/A
Phi 4 Multimodal InstructAzure AI0.0800.320compare131K4K10.0#191N/A
Mistral LargeIBM watsonx3.0010.00compare131K16K9.9#193N/A
Mistral Large@latestVertex AI (Mistral)2.006.00compare128K8K9.9#193N/A
Mistral LargeVercel AI Gateway2.006.00compare32K4K9.9#193N/A
Mistral LargeSnowflakeN/AN/Acompare32K8K9.9#193N/A
Mistral LargeOpenRouter8.0024.00compare32KN/A9.9#193N/A
Mistral LargeMistral AI0.5001.50compare262K262K9.9#193N/A
Mistral LargeAzure OpenAI8.0024.00compare32KN/A9.9#193N/A
Mistral LargeAzure AI2.006.00compare128K4K9.9#193N/A
Mixtral 8x22B InstructVercel AI Gateway1.201.20compare66K2K9.8#194N/A
Mixtral 8x22B InstructOpenRouter0.6500.650compare66KN/A9.8#194N/A
Open Mixtral 8x22BMistral AI2.006.00compare65K8K9.8#194N/A
Mixtral 8x22BFireworks AI1.201.20compare66K66K9.8#194N/A
Llama 3 2 3B InstructIBM watsonx0.1500.150compare128K128K9.7#195N/A
Llama 3.2 3BVercel AI Gateway0.1500.150compare128K8K9.7#195N/A
Llama 3.2 3B Instruct TurboTogether AIN/AN/AcompareN/AN/A9.7#195N/A
Llama3.2 3BSnowflakeN/AN/Acompare128K8K9.7#195N/A
Meta Llama 3.2 3B InstructSambaNova0.0800.160compare4K4K9.7#195N/A
Meta Textgeneration Llama 2 7BAWS SageMakerN/AN/Acompare4K4K9.7#195N/A
Llama 2 7BReplicate0.0500.250compare4K4K9.7#195N/A
Llama2:7BOllamaN/AN/Acompare4K4K9.7#195N/A
Llama 3.2 3B InstructNovita AI0.0300.050compare33K32K9.7#195N/A
Llama3 2 3B Instruct V1AWS Bedrock0.1500.150compare128K4K9.7#195N/A
Llama 3.2 3BLlamaGate0.0400.080compare131K8K9.7#195N/A
Llama3.2 3B InstructLambda0.0150.025compare131K131K9.7#195N/A
Llama 3.2 3B InstructHyperbolic0.1200.300compare33K33K9.7#195N/A
Llama V3p2 3BFireworks AI0.1000.100compare131K131K9.7#195N/A
Llama V2 7BFireworks AI0.2000.200compare4K4K9.7#195N/A
Llama 3.2 3B InstructDeepInfra0.0200.020compare131K131K9.7#195N/A
Llama 2 7B Chat Fp16Cloudflare Workers AI1.921.92compare3K3K9.7#195N/A
Llama 2 7B Chat HfAnyscale0.1500.150compare4K4K9.7#195N/A
Olmo 3 7B ThinkPublic AIN/AN/Acompare33K4K9.4#1977.6#144
Claude V2AWS Bedrock8.0024.00compare100K8K9.3#19814.0#105
DeepSeek R1 Distill Qwen 1.5BNscale0.0900.090compareN/AN/A9.1#199N/A
DeepSeek R1 Distill Qwen 1p5bFireworks AI0.1000.100compare131K131K9.1#199N/A
GPT-3.5 TurboVercel AI Gateway0.5001.50compare16K4K9.0#20010.7#130
GPT-3.5 TurboOpenRouter1.502.00compare4KN/A9.0#20010.7#130
Mistral MediumMistral AI0.4002.00compare131K131K9.0#200N/A
Mistral Small 2402 V1AWS Bedrock1.003.00compare32K8K9.0#200N/A
GPT-3.5 TurboGitHub CopilotN/AN/Acompare16K4K9.0#20010.7#130
Ft:gpt 3.5 TurboOpenAI3.006.00compare16K4K9.0#20010.7#130
GPT-3.5-turbo-instruct-0914Microsoft Azure1.502.00compare4KN/A9.0#20010.7#130
GPT-3.5 TurboAzure OpenAI0.5001.50compare4K4K9.0#20010.7#130
Llama3 70B Instruct MaasVertex AI (Llama)N/AN/Acompare32K32K8.9#2036.8#151
Llama 3 70BVercel AI Gateway0.5900.790compare8K8K8.9#2036.8#151
Llama3 70BSnowflakeN/AN/Acompare8K8K8.9#2036.8#151
Llama 3 70BReplicate0.6502.75compare8K8K8.9#2036.8#151
Llama 3 70B InstructOpenRouter0.5900.790compare8KN/A8.9#2036.8#151
Llama3:70BOllamaN/AN/Acompare8K8K8.9#2036.8#151
Llama 3 70B InstructNovita AI0.5100.740compare8K8K8.9#2036.8#151
Meta Llama 3 70B InstructHyperbolic0.1200.300compare131K131K8.9#2036.8#151
Llama V3 70B InstructFireworks AI0.9000.900compare8K8K8.9#2036.8#151
Databricks Meta Llama 3 70B InstructDatabricks1.003.00compare128K128K8.9#2036.8#151
Llama3 70B Instruct V1AWS Bedrock2.653.50compare8K8K8.9#2036.8#151
Meta Llama 3 70B InstructAzure AI1.100.370compare8K2K8.9#2036.8#151
Meta Llama 3 70B InstructAnyscale1.001.00compare8K8K8.9#2036.8#151
Snowflake ArcticSnowflakeN/AN/Acompare4K8K8.8#204N/A
Gemma 3 12B ItNovita AI0.0500.100compare131K8K8.8#2046.3#154
Lfm 40BLambda0.1000.200compare131K131K8.8#204N/A
Gemma 3 12B ItAWS Bedrock0.0900.290compare128K8K8.8#2046.3#154
Qwen2 72B InstructFireworks AI0.9000.900compare33K33K8.8#204N/A
Gemma 3 12B ItDeepInfra0.0500.100compare131K131K8.8#2046.3#154
Databricks Gemma 3 12BDatabricks0.1500.500compare128K32K8.8#2046.3#154
Llama 3 2 11B Vision InstructIBM watsonx0.3500.350compare128K128K8.7#2084.3#159
Llama 3.2 11BVercel AI Gateway0.1600.160compare128K8K8.7#2084.3#159
Llama3 2 11B Instruct V1AWS Bedrock0.3500.350compare128K4K8.7#2084.3#159
Llama3.2 11B Vision InstructLambda0.0150.025compare131K131K8.7#2084.3#159
Llama V3p2 11B Vision InstructFireworks AI0.2000.200compare16K16K8.7#2084.3#159
Llama 3.2 11B Vision InstructDeepInfra0.0490.049compare131K131K8.7#2084.3#159
Llama 3.2 11B Vision InstructAzure AI0.3700.370compare128K2K8.7#2084.3#159
DeepSeek Coder V2 Lite BaseOllamaN/AN/Acompare8K8K8.5#209N/A
Gemini proGoogle GeminiN/AN/Acompare33K8K8.5#209N/A
Gemini 1.0 ProGoogle Vertex AIN/AN/Acompare33K8K8.5#209N/A
DeepSeek Coder V2 Lite BaseFireworks AI0.5000.500compare164K164K8.5#209N/A
Phi 4 Mini InstructWeights & Biases0.00800.035compare128K128K8.4#2113.6#162
Llama2 70B ChatSnowflakeN/AN/Acompare4K8K8.4#211N/A
Sarvam MSarvam AIN/AN/Acompare8K32K8.4#2117.5#146
Meta Textgeneration Llama 2 70BAWS SageMakerN/AN/Acompare4K4K8.4#211N/A
Meta Textgeneration Llama 2 13BAWS SageMakerN/AN/Acompare4K4K8.4#211N/A
Llama 2 70BReplicate0.6502.75compare4K4K8.4#211N/A
Llama 2 13BReplicate0.1000.500compare4K4K8.4#211N/A
Llama 2 70B ChatPerplexity0.7002.80compare4K4K8.4#211N/A
Llama2:70BOllamaN/AN/Acompare4K4K8.4#211N/A
Llama2:13BOllamaN/AN/Acompare4K4K8.4#211N/A
Llama2 70B Chat V1AWS Bedrock1.952.56compare4K4K8.4#211N/A
Llama2 13B Chat V1AWS Bedrock0.7501.00compare4K4K8.4#211N/A
Llama V2 70BFireworks AI0.1000.100compare4K4K8.4#211N/A
Llama V2 13BFireworks AI0.2000.200compare4K4K8.4#211N/A
Databricks Llama 2 70B ChatDatabricks0.5001.50compare4K4K8.4#211N/A
Phi 4 Mini InstructAzure AI0.0750.300compare131K4K8.4#2113.6#162
Llama 2 70B Chat HfAnyscale1.001.00compare4K4K8.4#211N/A
Llama 2 13B Chat HfAnyscale0.2500.250compare4K4K8.4#211N/A
Command R+Vercel AI Gateway2.5010.00compare128K4K8.3#215N/A
Command PlusOracle Cloud (OCI)1.561.56compare128K4K8.3#215N/A
Openchat 3p5 0106 7BFireworks AI0.2000.200compare8K8K8.3#215N/A
Dbrx InstructFireworks AI1.201.20compare33K33K8.3#215N/A
Command R+Cohere2.5010.00compare128K4K8.3#215N/A
Command R Plus V1AWS Bedrock3.0015.00compare128K4K8.3#215N/A
Command R+Azure OpenAI3.0015.00compare128K4K8.3#215N/A
Olmo 3 7B InstructPublic AIN/AN/Acompare33K4K8.2#2183.4#163
Jamba Mini 1.7AI21 Labs0.2000.400compare256K256K8.1#2193.1#165
Jamba 1.5 MiniVertex AI (AI21)0.2000.400compare256K256K8.0#220N/A
Jamba 1.5 MiniSnowflakeN/AN/Acompare256K8K8.0#220N/A
Jamba 1.5 MiniAI21 Labs0.2000.400compare256K256K8.0#220N/A
Qwen3 1p7b Fp8 DraftFireworks AI0.1000.100compare262K262K8.0#2201.4#169
Jamba 1 5 Mini V1AWS Bedrock0.2000.400compare256K256K8.0#220N/A
Jamba Mini 1.6AI21 Labs0.2000.400compare256K256K7.9#222N/A
Mixtral 8x7BSnowflakeN/AN/Acompare32K8K7.7#223N/A
Mixtral 8x7B InstructPerplexity0.0700.280compare4K4K7.7#223N/A
Open Mixtral 8x7BMistral AI0.7000.700compare32K8K7.7#223N/A
Mixtral 8x7BFireworks AI0.5000.500compare33K33K7.7#223N/A
Hermes3 8BLambda0.0250.040compare131K131K7.6#224N/A
Command RVercel AI Gateway0.1500.600compare128K4K7.4#225N/A
Qwen 3 14BVercel AI Gateway0.0800.240compare41K16K7.4#225N/A
Mistral 7BSnowflakeN/AN/Acompare32K8K7.4#225N/A
Mistral 7B InstructPerplexity0.0700.280compare4K4K7.4#225N/A
Mistral 7B InstructOpenRouter0.1300.130compare8KN/A7.4#225N/A
MistralOllamaN/AN/Acompare8K8K7.4#225N/A
Qwen3 14BNebius0.0800.240compare33K33K7.4#225N/A
Open Mistral 7BMistral AI0.2500.250compare32K8K7.4#225N/A
Qwen3 14BFireworks AI0.2000.200compare41K41K7.4#225N/A
Mistral 7BFireworks AI0.2000.200compare33K33K7.4#225N/A
Qwen3 14BDeepInfra0.0600.240compare41K41K7.4#225N/A
Command RCohere0.1500.600compare128K4K7.4#225N/A
Command R V1AWS Bedrock0.5001.50compare128K4K7.4#225N/A
Claude Instant V1AWS Bedrock0.8002.40compare100K8K7.4#2257.8#141
Granite 3 3 8B InstructIBM watsonx0.2000.200compare8K8K7.0#2293.4#163
Granite 3.3 8B InstructReplicate0.0300.250compareN/AN/A7.0#2293.4#163
Qwen3 1p7bFireworks AI0.1000.100compare131K131K6.8#2302.3#168
Llama3 8B Instruct MaasVertex AI (Llama)N/AN/Acompare32K32K6.4#2314.0#161
Llama 3 8BVercel AI Gateway0.0500.080compare8K8K6.4#2314.0#161
Llama3 8BSnowflakeN/AN/Acompare8K8K6.4#2314.0#161
Llama 3 8BReplicate0.0500.250compare8K8K6.4#2314.0#161
Llama3OllamaN/AN/Acompare8K8K6.4#2314.0#161
Llama 3 8B InstructNovita AI0.0400.040compare8K8K6.4#2314.0#161
Llama3 8B InstructGradient AI0.2000.200compare512N/A6.4#2314.0#161
Llama V3 8BFireworks AI0.2000.200compare8K8K6.4#2314.0#161
Meta Llama 3 8B InstructDeepInfra0.0300.060compare8K8K6.4#2314.0#161
Llama3 8B Instruct V1AWS Bedrock0.3000.600compare8K8K6.4#2314.0#161
Meta Llama 3 8B InstructAnyscale0.1500.150compare8K8K6.4#2314.0#161
Llama 3 2 1B InstructIBM watsonx0.1000.100compare128K128K6.3#2320.6#171
Llama 3.2 1BVercel AI Gateway0.1000.100compare128K8K6.3#2320.6#171
Llama3.2 1BSnowflakeN/AN/Acompare128K8K6.3#2320.6#171
Meta Llama 3.2 1B InstructSambaNova0.0400.080compare16K16K6.3#2320.6#171
Llama3 2 1B Instruct V1AWS Bedrock0.1000.100compare128K4K6.3#2320.6#171
Gemma3 4BLlamaGate0.0300.080compare128K8K6.3#2322.9#167
Gemma 3 4B It GGUFLemonade (AMD)N/AN/Acompare128K8K6.3#2322.9#167
Gemma 3 4B ItAWS Bedrock0.0400.080compare128K8K6.3#2322.9#167
Llama V3p2 1BFireworks AI0.1000.100compare131K131K6.3#2320.6#171
Gemma 3 4B ItDeepInfra0.0400.080compare131K131K6.3#2322.9#167
Qwen3 0p6bFireworks AI0.1000.100compare41K41K5.7#2341.4#169
Glm 4.5 XZ AI (Zhipu)2.208.90compare128K32KN/AN/A
Glm 4.5 FlashZ AI (Zhipu)N/AN/Acompare128K32KN/AN/A
Glm 4.5 AirxZ AI (Zhipu)1.104.50compare128K32KN/AN/A
Glm 4 32B 0414 128KZ AI (Zhipu)0.1000.100compare128K32KN/AN/A
Grok Vision BetaxAI5.0015.00compare8K8KN/AN/A
Grok Code Fast 1 0825xAI0.2001.50compare256K256KN/AN/A
Grok Code FastxAI0.2001.50compare256K256KN/AN/A
Grok 4 0709xAI3.0015.00compare256K256KN/AN/A
Grok 3 Mini FastxAI0.6004.00compare131K131KN/AN/A
Grok 3 FastxAI5.0025.00compare131K131KN/AN/A
Grok 2 VisionxAI2.0010.00compare33K33KN/AN/A
Grok 2xAI2.0010.00compare131K131KN/AN/A
Allam 1 13B InstructIBM watsonx1.801.80compare8K8KN/AN/A
Pixtral 12B 2409IBM watsonx0.3500.350compare128K128KN/AN/A
Mistral Small 2503IBM watsonx0.1000.300compare32K32KN/AN/A
Mistral Medium 2505IBM watsonx3.0010.00compare128K128KN/AN/A
Llama Guard 3 11B VisionIBM watsonx0.3500.350compare128K128KN/AN/A
Llama 4 Maverick 17BIBM watsonx0.3501.40compare128K128KN/AN/A
Granite Vision 3 2 2BIBM watsonx0.1000.100compare8K8KN/AN/A
Granite Ttm 512 96 R2IBM watsonx0.3800.380compare512512N/AN/A
Granite Ttm 1536 96 R2IBM watsonx0.3800.380compare512512N/AN/A
Granite Ttm 1024 96 R2IBM watsonx0.3800.380compare512512N/AN/A
Granite Guardian 3 3 8BIBM watsonx0.2000.200compare8K8KN/AN/A
Granite Guardian 3 2 2BIBM watsonx0.1000.100compare8K8KN/AN/A
Granite 13B Chat V2IBM watsonx0.6000.600compare8K8KN/AN/A
Flan T5 Xl 3BIBM watsonx0.6000.600compare8K8KN/AN/A
JAIS 13B ChatIBM watsonx500.000.0020compare8K8KN/AN/A
Mt0 Xxl 13BIBM watsonx500.000.0020compare8K8KN/AN/A
Llama 4 Scout 17B 16E InstructWeights & Biases0.0170.066compare64K64KN/AN/A
DeepSeek R1 0528Weights & Biases0.1350.540compare161K161KN/AN/A
Doubao Seed 2 0 Pro 260215Volcengine (ByteDance)N/AN/Acompare256K128KN/AN/A
Doubao Seed 2 0 Mini 260215Volcengine (ByteDance)N/AN/Acompare256K128KN/AN/A
Doubao Seed 2 0 Lite 260215Volcengine (ByteDance)N/AN/Acompare256K128KN/AN/A
Doubao Seed 2 0 Code Preview 260215Volcengine (ByteDance)N/AN/Acompare256K128KN/AN/A
Mistral Small 2503Vertex AI (Mistral)1.003.00compare128K128KN/AN/A
Mistral Nemo@latestVertex AI (Mistral)0.1500.150compare128K128KN/AN/A
Mistral NemoVertex AI (Mistral)3.003.00compare128K128KN/AN/A
Mistral Large 2411Vertex AI (Mistral)2.006.00compare128K8KN/AN/A
Llama3 405B Instruct MaasVertex AI (Llama)N/AN/Acompare32K32KN/AN/A
Llama 4 Scout 17B 16e Instruct MaasVertex AI (Llama)0.2500.700compare10.0M10.0MN/AN/A
Llama 4 Scout 17B 128e Instruct MaasVertex AI (Llama)0.2500.700compare10.0M10.0MN/AN/A
Llama 4 Maverick 17B 16e Instruct MaasVertex AI (Llama)0.3501.15compare1.0M1.0MN/AN/A
Llama 4 Maverick 17B 128e Instruct MaasVertex AI (Llama)0.3501.15compare1.0M1.0MN/AN/A
Jamba 1.5Vertex AI (AI21)0.2000.400compare256K256KN/AN/A
DeepSeek R1 0528 MaasVertex AI (DeepSeek)1.355.40compare65K8KN/AN/A
Codestral @latestVertex AI (Mistral)0.2000.600compare128K128KN/AN/A
CodestralVertex AI (Mistral)0.2000.600compare128K128KN/AN/A
Codestral 2501Vertex AI (Mistral)0.2000.600compare128K128KN/AN/A
Codestral 2Vertex AI (Mistral)0.3000.900compare128K128KN/AN/A
Grok 3 Mini FastVercel AI Gateway0.6004.00compare131K131KN/AN/A
Grok 3 MiniVercel AI Gateway0.3000.500compare131K131KN/AN/A
Grok 3 FastVercel AI Gateway5.0025.00compare131K131KN/AN/A
Grok 2 VisionVercel AI Gateway2.0010.00compare33K33KN/AN/A
Grok 2Vercel AI Gateway2.0010.00compare131K4KN/AN/A
V0 1.5 MdVercel AI Gateway3.0015.00compare128K33KN/AN/A
V0 1.0 MdVercel AI Gateway3.0015.00compare128K32KN/AN/A
Morph V3 LargeVercel AI Gateway0.9001.90compare33K16KN/AN/A
Morph V3 FastVercel AI Gateway0.8001.20compare33K16KN/AN/A
Pixtral LargeVercel AI Gateway2.006.00compare128K4KN/AN/A
Pixtral 12BVercel AI Gateway0.1500.150compare128K4KN/AN/A
Mistral EmbedVercel AI Gateway0.100N/AcompareN/AN/AN/AN/A
Ministral 8BVercel AI Gateway0.1000.100compare128K4KN/AN/A
Ministral 3BVercel AI Gateway0.0400.040compare128K4KN/AN/A
Codestral EmbedVercel AI Gateway0.150N/AcompareN/AN/AN/AN/A
CodestralVercel AI Gateway0.3000.900compare256K4KN/AN/A
Mercury Coder SmallVercel AI Gateway0.2501.00compare32K16KN/AN/A
Gemma 2 9BVercel AI Gateway0.2000.200compare8K8KN/AN/A
Embed V4.0Vercel AI Gateway0.120N/AcompareN/AN/AN/AN/A
Titan Embed Text V2Vercel AI Gateway0.020N/AcompareN/AN/AN/AN/A
Qwen3 CoderVercel AI Gateway0.4001.60compare262K67KN/AN/A
V0 1.5 Mdv0 (Vercel)3.0015.00compare128K128KN/AN/A
V0 1.5 Lgv0 (Vercel)15.0075.00compare512K512KN/AN/A
V0 1.0 Mdv0 (Vercel)3.0015.00compare128K128KN/AN/A
Us.writer.palmyra X5 V1AWS Bedrock0.6006.00compare1.0M8KN/AN/A
Us.writer.palmyra X4 V1AWS Bedrock2.5010.00compare128K8KN/AN/A
Together Ai Up To 4BTogether AI0.1000.100compareN/AN/AN/AN/A
Together Ai 81.1B 110BTogether AI1.801.80compareN/AN/AN/AN/A
Together Ai 8.1B 21BTogether AI0.3000.300compare1KN/AN/AN/A
Together Ai 41.1B 80BTogether AI0.9000.900compareN/AN/AN/AN/A
Together Ai 4.1B 8BTogether AI0.2000.200compareN/AN/AN/AN/A
Together Ai 21.1B 41BTogether AI0.8000.800compareN/AN/AN/AN/A
CodeLlama 34B InstructTogether AIN/AN/AcompareN/AN/AN/AN/A
Qwen2.5 7B Instruct TurboTogether AIN/AN/AcompareN/AN/AN/AN/A
Mixtral 8x7B Instruct V0.1Together AI0.6000.600compareN/AN/AN/AN/A
Mistral Small 24B Instruct 2501Together AIN/AN/AcompareN/AN/AN/AN/A
Mistral 7B Instruct V0.1Together AIN/AN/AcompareN/AN/AN/AN/A
Llama 4 Scout 17B 16E InstructTogether AI0.1800.590compareN/AN/AN/AN/A
Llama 4 Maverick 17B 128E Instruct FP8Together AI0.2700.850compareN/AN/AN/AN/A
DeepSeek R1 0528 TputTogether AI0.5502.19compare128KN/AN/AN/A
Text UnicornGoogle Vertex AI10.0028.00compare8K1KN/AN/A
Text Bison32kGoogle Vertex AIN/AN/Acompare8K1KN/AN/A
Text BisonGoogle Vertex AIN/AN/Acompare8K1KN/AN/A
Text BisonGoogle Vertex AIN/AN/Acompare8K2KN/AN/A
Reka CoreSnowflakeN/AN/Acompare32K8KN/AN/A
Jamba InstructSnowflakeN/AN/Acompare256K8KN/AN/A
Gemma 7BSnowflakeN/AN/Acompare8K8KN/AN/A
Qwen2 Audio 7B InstructSambaNova0.500100.00compare4K4KN/AN/A
Meta Llama Guard 3 8BSambaNova0.3000.300compare16K16KN/AN/A
Llama 4 Scout 17B 16E InstructSambaNova0.4000.700compare8K8KN/AN/A
Llama 4 Maverick 17B 128E InstructSambaNova0.6301.80compare131K131KN/AN/A
Mixtral 8x7B Instruct V0.1Replicate0.3001.00compare4K4KN/AN/A
Mistral 7B V0.1Replicate0.0500.250compare4K4KN/AN/A
Mistral 7B Instruct V0.2Replicate0.0500.250compare4K4KN/AN/A
Apertus 8B InstructPublic AIN/AN/Acompare8K4KN/AN/A
Apertus 70B InstructPublic AIN/AN/Acompare8K4KN/AN/A
Salamandra 7B Instruct Tools 16KPublic AIN/AN/Acompare16K4KN/AN/A
ALIA 40B Instruct Q8 0Public AIN/AN/Acompare8K4KN/AN/A
Qwen SEA LION V4 32B ITPublic AIN/AN/Acompare33K4KN/AN/A
Gemma SEA LION V4 27B ITPublic AIN/AN/Acompare8K4KN/AN/A
Sonar Small ChatPerplexity0.0700.280compare16K16KN/AN/A
Sonar Medium ChatPerplexity0.6001.80compare16K16KN/AN/A
Sonar Deep ResearchPerplexity2.008.00compare128KN/AN/AN/A
Pplx 7B ChatPerplexity0.0700.280compare8K8KN/AN/A
Pplx 70B ChatPerplexity0.7002.80compare4K4KN/AN/A
Llama 3.1 Sonar Small 128K ChatPerplexityN/AN/Acompare131K131KN/AN/A
Llama 3.1 Sonar Large 128K ChatPerplexityN/AN/Acompare131K131KN/AN/A
Llama 3.1 Sonar Huge 128K OnlinePerplexityN/AN/Acompare127K127KN/AN/A
Codellama 70B InstructPerplexity0.7002.80compare16K16KN/AN/A
Codellama 34B InstructPerplexity0.3501.40compare16K16KN/AN/A
Text Bison 001Google PaLM0.1250.125compare8K1KN/AN/A
Text BisonGoogle PaLM0.1250.125compare8K1KN/AN/A
Chat Bison 001Google PaLM0.1250.125compare8K4KN/AN/A
Chat BisonGoogle PaLM0.1250.125compare8K4KN/AN/A
Qwen2.5 VL 72B InstructOVHcloud0.9100.910compare32K32KN/AN/A
Mixtral 8x7B Instruct V0.1OVHcloud0.6300.630compare32K32KN/AN/A
Mistral Nemo Instruct 2407OVHcloud0.1300.130compare118K118KN/AN/A
Mistral 7B Instruct V0.3OVHcloud0.1000.100compare127K127KN/AN/A
Mamba Codestral 7B V0.1OVHcloud0.1900.190compare256K256KN/AN/A
Llava V1.6 Mistral 7B HfOVHcloud0.2900.290compare32K32KN/AN/A
Remm Slerp L2 13BOpenRouter1.881.88compare6KN/AN/AN/A
RouterOpenRouter0.8503.40compare131K131KN/AN/A
Qwen3.5 Plus 02 15OpenRouter0.4002.40compare1.0M66KN/AN/A
Qwen3.5 Flash 02 23OpenRouter0.1000.400compare1.0M66KN/AN/A
Qwen3 Coder PlusOpenRouter1.005.00compare998K66KN/AN/A
Qwen3 CoderOpenRouter0.2200.950compare262K262KN/AN/A
Qwen Vl PlusOpenRouter0.2100.630compare8K2KN/AN/A
FreeOpenRouterN/AN/Acompare200KN/AN/AN/A
BodybuilderOpenRouterN/AN/Acompare128KN/AN/AN/A
AutoOpenRouterN/AN/Acompare2.0MN/AN/AN/A
GPT-5.2-proOpenRouter21.00168.00compare272K128KN/AN/A
GPT-5.1-codex-maxOpenRouter1.2510.00compare400K128KN/AN/A
Mistral Large 2512OpenRouter0.5001.50compare262K262KN/AN/A
Devstral 2512OpenRouter0.1500.600compare262K66KN/AN/A
Minimax M2.1OpenRouter0.2701.20compare204K64KN/AN/A
WeaverOpenRouter5.635.63compare8KN/AN/AN/A
Mythomax L2 13BOpenRouter1.881.88compare8KN/AN/AN/A
DeepSeek R1 0528OpenRouter0.5002.15compare65K8KN/AN/A
DeepSeek Chat V3.1OpenRouter0.2000.800compare164K164KN/AN/A
DeepSeek Chat V3 0324OpenRouter0.1400.280compare66K8KN/AN/A
DeepSeek ChatOpenRouter0.1400.280compare66K8KN/AN/A
Ui Tars 1.5 7BOpenRouter0.1000.200compare131K2KN/AN/A
ContainerOpenAIN/AN/AcompareN/AN/AN/AN/A
GPT-oss-safeguard-20bAWS Bedrock0.0700.200compare128K8KN/AN/A
GPT-oss-safeguard-120bAWS Bedrock0.1500.600compare128K8KN/AN/A
VicunaOllamaN/AN/Acompare2K2KN/AN/A
Orca MiniOllamaN/AN/Acompare4K4KN/AN/A
Mixtral 8x7B Instruct V0.1OllamaN/AN/Acompare33K33KN/AN/A
Mixtral 8x22B Instruct V0.1OllamaN/AN/Acompare66K66KN/AN/A
Mistral 7B Instruct V0.2OllamaN/AN/Acompare33K33KN/AN/A
Mistral 7B Instruct V0.1OllamaN/AN/Acompare8K8KN/AN/A
Llama2OllamaN/AN/Acompare4K4KN/AN/A
Internlm2 5 20B ChatOllamaN/AN/Acompare33K8KN/AN/A
CodellamaOllamaN/AN/Acompare4K4KN/AN/A
CodegemmaOllamaN/AN/Acompare8K8KN/AN/A
Codegeex4OllamaN/AN/Acompare33K8KN/AN/A
Xai.grok 3 Mini FastOracle Cloud (OCI)0.6004.00compare131K131KN/AN/A
Xai.grok 3 MiniOracle Cloud (OCI)0.3000.500compare131K131KN/AN/A
Xai.grok 3 FastOracle Cloud (OCI)5.0025.00compare131K131KN/AN/A
Llama 4 Scout 17B 16e InstructOracle Cloud (OCI)0.7200.720compare192K4KN/AN/A
Llama 4 Maverick 17B 128e Instruct Fp8Oracle Cloud (OCI)0.7200.720compare512K4KN/AN/A
CommandOracle Cloud (OCI)1.561.56compare128K4KN/AN/A
Command AOracle Cloud (OCI)1.561.56compare256K4KN/AN/A
Qwen2.5 Coder 3B InstructNscale0.0100.030compareN/AN/AN/AN/A
Mixtral 8x22B Instruct V0.1Nscale0.6000.600compareN/AN/AN/AN/A
Llama 4 Scout 17B 16E InstructNscale0.0900.290compareN/AN/AN/AN/A
DeepSeek R1 Distill Qwen 7BNscale0.2000.200compareN/AN/AN/AN/A
Autoglm Phone 9B MultilingualNovita AI0.0350.138compare66K66KN/AN/A
R1v4 LiteNovita AI0.2000.600compare262K66KN/AN/A
L31 70B Euryale V2.2Novita AI1.481.48compare8K8KN/AN/A
L3 8B Stheno V3.2Novita AI0.0500.050compare8K32KN/AN/A
L3 8B LunarisNovita AI0.0500.050compare8K8KN/AN/A
L3 70B Euryale V2.1Novita AI1.481.48compare8K8KN/AN/A
Qwen2.5 Vl 72B InstructNovita AI0.8000.800compare33K33KN/AN/A
Qwen2.5 7B InstructNovita AI0.0700.070compare32K32KN/AN/A
Qwen Mt PlusNovita AI0.2500.750compare16K8KN/AN/A
Paddleocr VlNovita AI0.0200.020compare16K16KN/AN/A
Hermes 2 Pro Llama 3 8BNovita AI0.1400.140compare8K8KN/AN/A
Mistral NemoNovita AI0.0400.170compare60K16KN/AN/A
Minimax M2.1Novita AI0.3001.20compare205K131KN/AN/A
Wizardlm 2 8x22BNovita AI0.6200.620compare66K8KN/AN/A
Llama 4 Scout 17B 16e InstructNovita AI0.1800.590compare131K131KN/AN/A
Llama 4 Maverick 17B 128e Instruct Fp8Novita AI0.2700.850compare1.0M8KN/AN/A
Mythomax L2 13BNovita AI0.0900.090compare4K3KN/AN/A
DeepSeek R1 TurboNovita AI0.7002.50compare64K16KN/AN/A
DeepSeek R1 0528Novita AI0.7002.50compare164K33KN/AN/A
DeepSeek Prover V2 671BNovita AI0.7002.50compare160K160KN/AN/A
DeepSeek OCRNovita AI0.0300.030compare8K8KN/AN/A
ERNIE 4.5 Vl 424B A47bNovita AI0.4201.25compare123K16KN/AN/A
ERNIE 4.5 Vl 28B A3bNovita AI0.1400.560compare30K8KN/AN/A
ERNIE 4.5 21B A3bNovita AI0.0700.280compare120K8KN/AN/A
Baichuan M2 32BNovita AI0.0700.070compare131K131KN/AN/A
Qwen2.5 VL 72B InstructNebius0.1300.400compare131K131KN/AN/A
Qwen2 VL 7B InstructNebius0.0200.060compare131K131KN/AN/A
Qwen2 VL 72B InstructNebius0.1300.400compare131K131KN/AN/A
Mistral Nemo Instruct 2407Nebius0.0400.120compare128K128KN/AN/A
Llama Guard 3 8BNebius0.0200.060compare128K128KN/AN/A
DeepSeek V3 0324Nebius0.5001.50compare128K128KN/AN/A
DeepSeek R1 0528Nebius0.8002.40compare164K164KN/AN/A
Morph V3 LargeMorph0.9001.90compare16K16KN/AN/A
Morph V3 FastMorph0.8001.20compare16K16KN/AN/A
Moonshot V1 AutoMoonshot AI (Kimi)2.005.00compare131K131KN/AN/A
Moonshot V1 8K Vision PreviewMoonshot AI (Kimi)0.2002.00compare8K8KN/AN/A
Moonshot V1 8K 0430Moonshot AI (Kimi)0.2002.00compare8K8KN/AN/A
Moonshot V1 8KMoonshot AI (Kimi)0.2002.00compare8K8KN/AN/A
Moonshot V1 32K Vision PreviewMoonshot AI (Kimi)1.003.00compare33K33KN/AN/A
Moonshot V1 32K 0430Moonshot AI (Kimi)1.003.00compare33K33KN/AN/A
Moonshot V1 32KMoonshot AI (Kimi)1.003.00compare33K33KN/AN/A
Moonshot V1 128K Vision PreviewMoonshot AI (Kimi)2.005.00compare131K131KN/AN/A
Moonshot V1 128K 0430Moonshot AI (Kimi)2.005.00compare131K131KN/AN/A
Moonshot V1 128KMoonshot AI (Kimi)2.005.00compare131K131KN/AN/A
Kimi Thinking PreviewMoonshot AI (Kimi)0.6002.50compare131K131KN/AN/A
Kimi Latest 8KMoonshot AI (Kimi)0.2002.00compare8K8KN/AN/A
Kimi Latest 32KMoonshot AI (Kimi)1.003.00compare33K33KN/AN/A
Kimi Latest 128KMoonshot AI (Kimi)2.005.00compare131K131KN/AN/A
KimiMoonshot AI (Kimi)2.005.00compare131K131KN/AN/A
Kimi K2 Turbo PreviewMoonshot AI (Kimi)1.158.00compare262K262KN/AN/A
Kimi K2 Thinking TurboMoonshot AI (Kimi)1.158.00compare262K262KN/AN/A
Kimi K2 0711 PreviewMoonshot AI (Kimi)0.6002.50compare131K131KN/AN/A
Pixtral LargeMistral AI2.006.00compare128K128KN/AN/A
Pixtral 12B 2409Mistral AI0.1500.150compare128K128KN/AN/A
Open Mistral Nemo 2407Mistral AI0.3000.300compare128K128KN/AN/A
Open Mistral NemoMistral AI0.3000.300compare128K128KN/AN/A
Mistral TinyMistral AI0.2500.250compare32K8KN/AN/A
Mistral Medium 2505Mistral AI0.4002.00compare131K8KN/AN/A
Mistral Medium 2312Mistral AI2.708.10compare32K8KN/AN/A
Mistral Large 2512Mistral AI0.5001.50compare262K262KN/AN/A
Mistral Large 2411Mistral AI2.006.00compare128K128KN/AN/A
Mistral Large 2402Mistral AI4.0012.00compare32K8KN/AN/A
Ministral 3 8B 2512Mistral AI0.1500.150compare262K262KN/AN/A
Ministral 3 3B 2512Mistral AI0.1000.100compare131K131KN/AN/A
Ministral 3 14B 2512Mistral AI0.2000.200compare262K262KN/AN/A
Magistral Small 2506Mistral AI0.5001.50compare40K40KN/AN/A
Magistral Medium 2506Mistral AI2.005.00compare40K40KN/AN/A
Labs Devstral Small 2512Mistral AI0.1000.300compare256K256KN/AN/A
Devstral Medium 2507Mistral AI0.4002.00compare128K128KN/AN/A
Devstral 2512Mistral AI0.4002.00compare256K256KN/AN/A
Codestral MambaMistral AI0.2500.250compare256K256KN/AN/A
CodestralMistral AI1.003.00compare32K8KN/AN/A
Codestral 2508Mistral AI0.3000.900compare256K256KN/AN/A
Codestral 2405Mistral AI1.003.00compare32K8KN/AN/A
Voxtral Small 24B 2507AWS Bedrock0.1000.300compare128K8KN/AN/A
Voxtral Mini 3B 2507AWS Bedrock0.0400.040compare128K8KN/AN/A
Devstral 2 123BAWS Bedrock0.4002.00compare256K8KN/AN/A
MiniMax M2.5 LightningMiniMax0.3002.40compare1.0M8KN/AN/A
MiniMax M2.1 LightningMiniMax0.3002.40compare1.0M8KN/AN/A
MiniMax M2.1MiniMax0.3001.20compare1.0M8KN/AN/A
Llama4 Scout 17B Instruct V1AWS Bedrock0.1700.660compare128K4KN/AN/A
Llama4 Maverick 17B Instruct V1AWS Bedrock0.2400.970compare128K4KN/AN/A
Llama 4 Scout 17B 16E Instruct FP8Meta LlamaN/AN/Acompare10.0M4KN/AN/A
Llama 4 Maverick 17B 128E Instruct FP8Meta LlamaN/AN/Acompare1.0M4KN/AN/A
Llama 3.3 8B InstructMeta LlamaN/AN/Acompare128K4KN/AN/A
Medlm MediumGoogle Vertex AIN/AN/Acompare33K8KN/AN/A
Medlm LargeGoogle Vertex AIN/AN/Acompare8K1KN/AN/A
Luminous Supreme ControlAleph Alpha218.75240.63compare2KN/AN/AN/A
Luminous SupremeAleph Alpha175.00192.50compare2KN/AN/AN/A
Luminous Extended ControlAleph Alpha56.2561.88compare2KN/AN/AN/A
Luminous ExtendedAleph Alpha45.0049.50compare2KN/AN/AN/A
Luminous Base ControlAleph Alpha37.5041.25compare2KN/AN/AN/A
Luminous BaseAleph Alpha30.0033.00compare2KN/AN/AN/A
Openthinker 7BLlamaGate0.0800.150compare33K8KN/AN/A
Mistral 7B V0.3LlamaGate0.1000.150compare33K8KN/AN/A
Llava 7BLlamaGate0.1000.200compare4K2KN/AN/A
Dolphin3 8BLlamaGate0.0800.150compare128K8KN/AN/A
DeepSeek R1 8BLlamaGate0.1000.200compare66K16KN/AN/A
DeepSeek R1 7B QwenLlamaGate0.0800.150compare131K16KN/AN/A
DeepSeek Coder 6.7BLlamaGate0.0600.120compare16K4KN/AN/A
Codellama 7BLlamaGate0.0600.120compare16K4KN/AN/A
Llama 4 Scout 17B 16e InstructLambda0.0500.100compare16K8KN/AN/A
Llama 4 Maverick 17B 128e Instruct Fp8Lambda0.0500.100compare131K8KN/AN/A
Lfm 7BLambda0.0250.040compare131K131KN/AN/A
DeepSeek R1 671BLambda0.8000.800compare131K131KN/AN/A
DeepSeek R1 0528Lambda0.2000.600compare131K131KN/AN/A
Jamba 1.5AI21 Labs0.2000.400compare256K256KN/AN/A
J2 UltraAI21 Labs15.0015.00compare8K8KN/AN/A
J2 MidAI21 Labs10.0010.00compare8K8KN/AN/A
J2 LightAI21 Labs3.003.00compare8K8KN/AN/A
DeepSeek R1 0528Hyperbolic0.2500.250compare131K131KN/AN/A
GPT-oss-safeguard-20bGroq0.0750.300compare131K66KN/AN/A
Llama Guard 4 12BGroq0.2000.200compare8K8KN/AN/A
Llama 4 Scout 17B 16e InstructGroq0.1100.340compare131K8KN/AN/A
Llama 4 Maverick 17B 128e InstructGroq0.2000.600compare131K8KN/AN/A
Gemma 7B ItGroq0.0500.080compare8K8KN/AN/A
Mistral Nemo Instruct 2407Gradient AI0.3000.300compare512N/AN/AN/A
GPT-realtime miniOpenAI0.6002.40compare128K4KN/AN/A
GPT-realtime-1.5OpenAI4.0016.00compare32K4KN/AN/A
GPT-realtimeOpenAI4.0016.00compare32K4KN/AN/A
GPT-audio miniOpenAI0.6002.40compare128K16KN/AN/A
GPT-audio-1.5OpenAI2.5010.00compare128K16KN/AN/A
GPT-audioOpenAI2.5010.00compare128K16KN/AN/A
GPT-5-search-apiOpenAI1.2510.00compare272K128KN/AN/A
GPT-4o-realtime PreviewOpenAI5.0020.00compare128K4KN/AN/A
GPT-4o-mini-search PreviewOpenAI0.1500.600compare128K16KN/AN/A
GPT-4o-mini-realtime PreviewOpenAI0.6002.40compare128K4KN/AN/A
GPT-4o-mini-audio PreviewOpenAI0.1500.600compare128K16KN/AN/A
GPT-4o-audio PreviewOpenAI2.5010.00compare128K16KN/AN/A
GPT-4-32k-0613OpenAIN/AN/Acompare33K4KN/AN/A
GPT-4-32k-0314OpenAIN/AN/Acompare33K4KN/AN/A
GPT-4-1106 PreviewOpenAI10.0030.00compare128K4KN/AN/A
GPT-4OpenAI30.0060.00compare8K4KN/AN/A
MiniMax M2.1GMI Cloud0.3001.20compare197K16KN/AN/A
GPT-4GitHub CopilotN/AN/Acompare33K4KN/AN/A
GigaChat 2 ProGigaChat (Sber)N/AN/Acompare128K8KN/AN/A
GigaChat 2 MaxGigaChat (Sber)N/AN/Acompare128K8KN/AN/A
GigaChat 2 LiteGigaChat (Sber)N/AN/Acompare128K8KN/AN/A
Learnlm 1.5 Pro ExperimentalGoogle GeminiN/AN/Acompare33K8KN/AN/A
Gemini robotics-er-1.5 PreviewGoogle Gemini0.3002.50compare1.0M66KN/AN/A
Gemini gemma-2-9b-itGoogle Gemini0.3501.05compare8K8KN/AN/A
Gemini gemma-2-27b-itGoogle Gemini0.3501.05compare8K8KN/AN/A
Gemini Experimental 1114Google GeminiN/AN/Acompare1.0M8KN/AN/A
Gemini robotics-er-1.5 PreviewGoogle Vertex AI0.3002.50compare1.0M66KN/AN/A
Gemini Experimental 1206Google Gemini0.3002.50compare1.0M66KN/AN/A
Ft:davinci 002OpenAI12.0012.00compare16K4KN/AN/A
Ft:babbage 002OpenAI1.601.60compare16K4KN/AN/A
Zephyr 7B BetaFireworks AI0.2000.200compare33K33KN/AN/A
Yi LargeFireworks AI3.003.00compare33K33KN/AN/A
Yi 6BFireworks AI0.2000.200compare4K4KN/AN/A
Yi 34B 200K CapybaraFireworks AI0.9000.900compare200K200KN/AN/A
Yi 34BFireworks AI0.9000.900compare4K4KN/AN/A
Toppy M 7BFireworks AI0.2000.200compare33K33KN/AN/A
Starcoder2 7BFireworks AI0.2000.200compare16K16KN/AN/A
Starcoder2 3BFireworks AI0.1000.100compare16K16KN/AN/A
Starcoder2 15BFireworks AI0.2000.200compare16K16KN/AN/A
Starcoder 7BFireworks AI0.2000.200compare8K8KN/AN/A
Starcoder 16BFireworks AI0.2000.200compare8K8KN/AN/A
Stablecode 3BFireworks AI0.1000.100compare4K4KN/AN/A
Snorkel Mistral 7B Pairrm DpoFireworks AI0.2000.200compare33K33KN/AN/A
Rolm OCRFireworks AI0.2000.200compare128K128KN/AN/A
Qwen3 1p7b Fp8 Draft 40960Fireworks AI0.1000.100compare41K41KN/AN/A
Qwen3 1p7b Fp8 Draft 131072Fireworks AI0.1000.100compare131K131KN/AN/A
Qwen2p5 Vl 7B InstructFireworks AI0.2000.200compare128K128KN/AN/A
Qwen2p5 Vl 72B InstructFireworks AI0.9000.900compare128K128KN/AN/A
Qwen2p5 Vl 3B InstructFireworks AI0.2000.200compare128K128KN/AN/A
Qwen2p5 Vl 32B InstructFireworks AI0.9000.900compare128K128KN/AN/A
Qwen2p5 Math 72B InstructFireworks AI0.9000.900compare4K4KN/AN/A
Qwen2p5 Coder 3BFireworks AI0.1000.100compare33K33KN/AN/A
Qwen2p5 Coder 1p5bFireworks AI0.1000.100compare33K33KN/AN/A
Qwen2p5 Coder 14BFireworks AI0.2000.200compare33K33KN/AN/A
Qwen2p5 Coder 0p5bFireworks AI0.1000.100compare33K33KN/AN/A
Qwen2p5 1p5b InstructFireworks AI0.1000.100compare33K33KN/AN/A
Qwen2p5 0p5b InstructFireworks AI0.1000.100compare33K33KN/AN/A
Qwen2 Vl 7B InstructFireworks AI0.2000.200compare33K33KN/AN/A
Qwen2 Vl 72B InstructFireworks AI0.9000.900compare33K33KN/AN/A
Qwen2 Vl 2B InstructFireworks AI0.1000.100compare33K33KN/AN/A
Qwen2 7B InstructFireworks AI0.2000.200compare33K33KN/AN/A
Qwen1p5 72B ChatFireworks AI0.9000.900compare33K33KN/AN/A
Qwen V2p5 7BFireworks AI0.2000.200compare131K131KN/AN/A
Qwen V2p5 14B InstructFireworks AI0.2000.200compare33K33KN/AN/A
Pythia 12BFireworks AI0.2000.200compare2K2KN/AN/A
Phind Code Llama 34B V2Fireworks AI0.9000.900compare16K16KN/AN/A
Phind Code Llama 34B Python V1Fireworks AI0.9000.900compare16K16KN/AN/A
Phi 3 Vision 128K InstructFireworks AI0.2000.200compare32K32KN/AN/A
Phi 2 3BFireworks AI0.1000.100compare2K2KN/AN/A
Openorca 7BFireworks AI0.2000.200compare33K33KN/AN/A
Openhermes 2p5 Mistral 7BFireworks AI0.2000.200compare33K33KN/AN/A
Openhermes 2 Mistral 7BFireworks AI0.2000.200compare33K33KN/AN/A
Nous Hermes Llama2 7BFireworks AI0.2000.200compare4K4KN/AN/A
Nous Hermes Llama2 70BFireworks AI0.9000.900compare4K4KN/AN/A
Nous Hermes Llama2 13BFireworks AI0.2000.200compare4K4KN/AN/A
Nous Hermes 2 Yi 34BFireworks AI0.9000.900compare4K4KN/AN/A
Nous Hermes 2 Mixtral 8x7B DpoFireworks AI0.5000.500compare33K33KN/AN/A
Nous Capybara 7B V1p9Fireworks AI0.2000.200compare33K33KN/AN/A
Mythomax L2 13BFireworks AI0.2000.200compare4K4KN/AN/A
Mixtral 8x22B Instruct HfFireworks AI1.201.20compare66K66KN/AN/A
Mistral Small 24B Instruct 2501Fireworks AI0.9000.900compare33K33KN/AN/A
Mistral Nemo Base 2407Fireworks AI0.2000.200compare128K128KN/AN/A
Mistral 7B Instruct V0p2Fireworks AI0.2000.200compare33K33KN/AN/A
Ministral 3 8B Instruct 2512Fireworks AI0.2000.200compare256K256KN/AN/A
Ministral 3 3B Instruct 2512Fireworks AI0.1000.100compare256K256KN/AN/A
Ministral 3 14B Instruct 2512Fireworks AI0.2000.200compare256K256KN/AN/A
Minimax M2p1Fireworks AI0.3001.20compare205K205KN/AN/A
Llava Yi 34BFireworks AI0.9000.900compare4K4KN/AN/A
Llamaguard 7BFireworks AI0.2000.200compare4K4KN/AN/A
Llama4 Scout Instruct BasicFireworks AI0.1500.600compare131K131KN/AN/A
Llama4 Maverick Instruct BasicFireworks AI0.2200.880compare131K131KN/AN/A
Llama Guard 3 8BFireworks AI0.2000.200compare131K131KN/AN/A
Llama Guard 3 1BFireworks AI0.1000.100compare131K131KN/AN/A
Llama Guard 2 8BFireworks AI0.2000.200compare8K8KN/AN/A
Kat Dev 72B ExpFireworks AI0.9000.900compare131K131KN/AN/A
Kat Dev 32BFireworks AI0.9000.900compare131K131KN/AN/A
Kat CoderFireworks AI0.9000.900compare262K262KN/AN/A
Internvl3 8BFireworks AI0.2000.200compare16K16KN/AN/A
Internvl3 78BFireworks AI0.9000.900compare16K16KN/AN/A
Internvl3 38BFireworks AI0.9000.900compare16K16KN/AN/A
Hermes 2 Pro Mistral 7BFireworks AI0.2000.200compare33K33KN/AN/A
GPT-oss-safeguard-20bFireworks AI0.5000.500compare131K131KN/AN/A
GPT-oss-safeguard-120bFireworks AI1.201.20compare131K131KN/AN/A
Glm 4p5 AirFireworks AI0.2200.880compare128K96KN/AN/A
Gemma2 9B ItFireworks AI0.2000.200compare8K8KN/AN/A
Gemma 7BFireworks AI0.2000.200compare8K8KN/AN/A
Gemma 2B ItFireworks AI0.1000.100compare8K8KN/AN/A
Flux 1 SchnellFireworks AI0.1000.100compare4K4KN/AN/A
Flux 1 Dev Controlnet UnionFireworks AI0.00100.0010compare4K4KN/AN/A
Flux 1 DevFireworks AI0.1000.100compare4K4KN/AN/A
Firesearch OCR V6Fireworks AI0.2000.200compare8K8KN/AN/A
Firellava 13BFireworks AI0.2000.200compare4K4KN/AN/A
Firefunction V2Fireworks AI0.9000.900compare8K8KN/AN/A
Firefunction V1Fireworks AI0.5000.500compare33K33KN/AN/A
Fare 20BFireworks AI0.9000.900compare131K131KN/AN/A
ERNIE 4p5 21B A3b PtFireworks AI0.1000.100compare4K4KN/AN/A
Dolphin 2p6 Mixtral 8x7BFireworks AI0.5000.500compare33K33KN/AN/A
Dolphin 2 9 2 Qwen2 72BFireworks AI0.9000.900compare131K131KN/AN/A
Dobby Unhinged Llama 3 3 70B NewFireworks AI0.9000.900compare131K131KN/AN/A
Dobby Mini Unhinged Plus Llama 3 1 8BFireworks AI0.2000.200compare131K131KN/AN/A
DeepSeek V2 Lite ChatFireworks AI0.5000.500compare164K164KN/AN/A
DeepSeek R1 Distill Qwen 7BFireworks AI0.2000.200compare131K131KN/AN/A
DeepSeek R1 BasicFireworks AI0.5502.19compare128K20KN/AN/A
DeepSeek R1 0528Fireworks AI3.008.00compare160K160KN/AN/A
DeepSeek Prover V2Fireworks AI1.201.20compare164K164KN/AN/A
DeepSeek Coder 7B Base V1p5Fireworks AI0.2000.200compare4K4KN/AN/A
DeepSeek Coder 7B BaseFireworks AI0.2000.200compare4K4KN/AN/A
DeepSeek Coder 33B InstructFireworks AI0.9000.900compare16K16KN/AN/A
DeepSeek Coder 1B BaseFireworks AI0.1000.100compare16K16KN/AN/A
Cogito V1 Preview Qwen 32BFireworks AI0.9000.900compare131K131KN/AN/A
Cogito V1 Preview Qwen 14BFireworks AI0.2000.200compare131K131KN/AN/A
Cogito V1 Preview Llama 8BFireworks AI0.2000.200compare131K131KN/AN/A
Cogito V1 Preview Llama 70BFireworks AI0.9000.900compare131K131KN/AN/A
Cogito V1 Preview Llama 3BFireworks AI0.1000.100compare131K131KN/AN/A
Cogito 671B V2 P1Fireworks AI1.201.20compare164K164KN/AN/A
Codegemma 7BFireworks AI0.2000.200compare8K8KN/AN/A
Codegemma 2BFireworks AI0.1000.100compare8K8KN/AN/A
Code Qwen 1p5 7BFireworks AI0.2000.200compare66K66KN/AN/A
Code Llama 7BFireworks AI0.2000.200compare16K16KN/AN/A
Code Llama 70BFireworks AI0.9000.900compare4K4KN/AN/A
Code Llama 34BFireworks AI0.9000.900compare16K16KN/AN/A
Code Llama 13BFireworks AI0.2000.200compare16K16KN/AN/A
Chronos Hermes 13B V2Fireworks AI0.2000.200compare4K4KN/AN/A
Qwerky QwQ 32BFeatherless AIN/AN/Acompare33K4KN/AN/A
Qwerky 72BFeatherless AIN/AN/Acompare33K4KN/AN/A
Eu.twelvelabs.pegasus 1 2 V1AWS BedrockN/A7.50compareN/AN/AN/AN/A
Eu.mistral.pixtral Large 2502 V1AWS Bedrock2.006.00compare128K4KN/AN/A
DolphinNLP Cloud0.5000.500compare16K16KN/AN/A
DeepSeek CoderDeepSeek0.1400.280compare128K4KN/AN/A
DeepSeek ChatDeepSeek0.2800.420compare131K8KN/AN/A
L3.3 70B Euryale V2.3DeepInfra0.6500.750compare131K131KN/AN/A
L3.1 70B Euryale V2.2DeepInfra0.6500.750compare131K131KN/AN/A
L3 8B Lunaris V1 TurboDeepInfra0.0400.050compare8K8KN/AN/A
Qwen3 Coder 480B A35B Instruct TurboDeepInfra0.2901.20compare262K262KN/AN/A
Qwen2.5 VL 32B InstructDeepInfra0.2000.600compare128K128KN/AN/A
Qwen2.5 7B InstructDeepInfra0.0400.100compare33K33KN/AN/A
Mixtral 8x7B Instruct V0.1DeepInfra0.4000.400compare33K33KN/AN/A
Mistral Small 24B Instruct 2501DeepInfra0.0500.080compare33K33KN/AN/A
Mistral Nemo Instruct 2407DeepInfra0.0200.040compare131K131KN/AN/A
WizardLM 2 8x22BDeepInfra0.4800.480compare66K66KN/AN/A
Llama Guard 4 12BDeepInfra0.1800.180compare164K164KN/AN/A
Llama Guard 3 8BDeepInfra0.0550.055compare131K131KN/AN/A
Llama 4 Scout 17B 16E InstructDeepInfra0.0800.300compare328K328KN/AN/A
Llama 4 Maverick 17B 128E Instruct FP8DeepInfra0.1500.600compare1.0M1.0MN/AN/A
MythoMax L2 13BDeepInfra0.0800.090compare4K4KN/AN/A
DeepSeek R1 TurboDeepInfra1.003.00compare41K41KN/AN/A
DeepSeek R1 0528 TurboDeepInfra1.003.00compare33K33KN/AN/A
DeepSeek R1 0528DeepInfra0.5002.15compare164K164KN/AN/A
OlmOCR 7B 0725 FP8DeepInfra0.2701.50compare16K16KN/AN/A
Databricks Mpt 7B InstructDatabricks0.500N/Acompare8K8KN/AN/A
Databricks Mpt 30B InstructDatabricks1.001.00compare8K8KN/AN/A
Databricks Mixtral 8x7B InstructDatabricks0.5001.00compare4K4KN/AN/A
Databricks Llama 4 MaverickDatabricks0.5001.50compare128K128KN/AN/A
Databricks Claude Sonnet 4 1Databricks3.0015.00compare200K64KN/AN/A
Qwq PlusDashScope (Alibaba)0.8002.40compare98K8KN/AN/A
Qwen3 Vl PlusDashScope (Alibaba)N/AN/Acompare260K33KN/AN/A
Qwen3 Coder PlusDashScope (Alibaba)N/AN/Acompare998K66KN/AN/A
Qwen3 Coder FlashDashScope (Alibaba)N/AN/Acompare998K66KN/AN/A
Qwen TurboDashScope (Alibaba)0.0500.200compare1.0M16KN/AN/A
Qwen TurboDashScope (Alibaba)0.0500.200compare1.0M8KN/AN/A
Qwen PlusDashScope (Alibaba)N/AN/Acompare998K33KN/AN/A
Qwen PlusDashScope (Alibaba)0.4001.20compare129K16KN/AN/A
Qwen PlusDashScope (Alibaba)0.4001.20compare129K8KN/AN/A
Qwen FlashDashScope (Alibaba)N/AN/Acompare998K33KN/AN/A
Qwen CoderDashScope (Alibaba)0.3001.50compare1.0M16KN/AN/A
Command R7bCohere0.1500.037compare128K4KN/AN/A
Command NightlyCohere1.002.00compare4K4KN/AN/A
Command LightCohere0.3000.600compare4K4KN/AN/A
Command ACohere2.5010.00compare256K8KN/AN/A
CommandCohere1.002.00compare4K4KN/AN/A
CodestralMistral CodestralN/AN/Acompare32K8KN/AN/A
Codestral 2405Mistral CodestralN/AN/Acompare32K8KN/AN/A
Codechat BisonVertex AI (Code Chat)N/AN/Acompare6K1KN/AN/A
Codechat Bison 32KVertex AI (Code Chat)N/AN/Acompare32K8KN/AN/A
Code GeckoVertex AI (Code Text)N/AN/Acompare2K64N/AN/A
Code Bison32kVertex AI (Code Text)N/AN/Acompare6K1KN/AN/A
Code BisonVertex AI (Code Text)N/AN/Acompare6K1KN/AN/A
Code Bison 32KVertex AI (Code Text)N/AN/Acompare6K1KN/AN/A
Codellama 7B Instruct AwqCloudflare Workers AI1.921.92compare4K4KN/AN/A
Mistral 7B Instruct V0.1Cloudflare Workers AI1.921.92compare8K8KN/AN/A
ChatdolphinNLP Cloud0.5000.500compare16K16KN/AN/A
Chat BisonVertex AI (Chat)N/AN/Acompare8K4KN/AN/A
Chat Bison 32KVertex AI (Chat)N/AN/Acompare32K8KN/AN/A
Zai Glm 4.7Cerebras2.252.75compare128K128KN/AN/A
Zai Glm 4.6Cerebras2.252.75compare128K128KN/AN/A
Mixtral 8x7B Instruct V0AWS Bedrock0.4500.700compare32K8KN/AN/A
Mistral Large 2402 V1AWS Bedrock8.0024.00compare32K8KN/AN/A
Mistral 7B Instruct V0AWS Bedrock0.1500.200compare32K8KN/AN/A
Qwen3 Coder NextAWS Bedrock0.6001.44compare262K8KN/AN/A
Moonshotai.kimi K2.5AWS Bedrock0.7203.60compare262K262KN/AN/A
Minimax.minimax M2.1AWS Bedrock0.3601.44compare196K8KN/AN/A
Command Text V14AWS BedrockN/AN/Acompare4K4KN/AN/A
Command Light Text V14AWS BedrockN/AN/Acompare4K4KN/AN/A
Babbage 002OpenAI0.4000.400compare16K4KN/AN/A
Mistral Large 2402Azure OpenAI8.0024.00compare32KN/AN/AN/A
GPT-realtime miniAzure OpenAI0.6002.40compare32K4KN/AN/A
GPT-realtimeAzure OpenAI4.0016.00compare32K4KN/AN/A
GPT-realtime-1.5Azure OpenAI4.0016.00compare32K4KN/AN/A
GPT-audio miniAzure OpenAI0.6002.40compare128K16KN/AN/A
GPT-audioAzure OpenAI2.5010.00compare128K16KN/AN/A
GPT-audio-1.5Azure OpenAI2.5010.00compare128K16KN/AN/A
GPT-5.3-chatAzure OpenAI1.7514.00compare128K16KN/AN/A
GPT-4o-realtime PreviewAzure OpenAI5.0020.00compare128K4KN/AN/A
GPT-4o-mini-realtime PreviewAzure OpenAI0.6002.40compare128K4KN/AN/A
GPT-4.1 miniAzure OpenAI0.4001.60compare1.0M33KN/AN/A
ContainerAzure OpenAIN/AN/AcompareN/AN/AN/AN/A
Computer Use PreviewAzure OpenAI3.0012.00compare8K1KN/AN/A
Phi 4 ReasoningAzure AI0.1250.500compare33K4KN/AN/A
Phi 4 Mini ReasoningAzure AI0.0800.320compare131K4KN/AN/A
Phi 3.5 Vision InstructAzure AI0.1300.520compare128K4KN/AN/A
Phi 3.5 MoE InstructAzure AI0.1600.640compare128K4KN/AN/A
Phi 3.5 Mini InstructAzure AI0.1300.520compare128K4KN/AN/A
Phi 3 Small 128K InstructAzure AI0.1500.600compare128K4KN/AN/A
Phi 3 Medium 128K InstructAzure AI0.1700.680compare128K4KN/AN/A
Model RouterAzure AI0.140N/AcompareN/AN/AN/AN/A
Mistral Small 2503Azure AI0.1000.300compare128K128KN/AN/A
Mistral NemoAzure AI0.1500.150compare131K4KN/AN/A
Mistral Medium 2505Azure AI0.4002.00compare131K8KN/AN/A
Ministral 3BAzure AI0.0400.040compare128K4KN/AN/A
MAI DS R1Azure AI1.355.40compare128K8KN/AN/A
Llama 4 Scout 17B 16E InstructAzure AI0.2000.780compare10.0M16KN/AN/A
Llama 4 Maverick 17B 128E Instruct FP8Azure AI1.410.350compare1.0M16KN/AN/A
Kimi K2.5Azure AI0.6003.00compare262K262KN/AN/A
Jamba InstructAzure AI0.5000.700compare70K4KN/AN/A
JAIS 30B ChatAzure AI0.00320.0097compare8K8KN/AN/A
Grok 4 Fast Non ReasoningAzure AI0.2000.500compare131K131KN/AN/A
Grok 3 MiniAzure AI0.2501.27compare131K131KN/AN/A
Mixtral 8x7B Instruct V0.1Anyscale0.1500.150compare16K16KN/AN/A
Mixtral 8x22B Instruct V0.1Anyscale0.9000.900compare66K66KN/AN/A
Mistral 7B Instruct V0.1Anyscale0.1500.150compare16K16KN/AN/A
Zephyr 7B BetaAnyscale0.1500.150compare16K16KN/AN/A
Gemma 7B ItAnyscale0.1500.150compare8K8KN/AN/A
CodeLlama 70B Instruct HfAnyscale1.001.00compare4K4KN/AN/A
CodeLlama 34B Instruct HfAnyscale1.001.00compare4K4KN/AN/A
Claude V1AWS Bedrock8.0024.00compare100K8KN/AN/A
Titan Text Premier V1AWS Bedrock0.5001.50compare42K32KN/AN/A
Titan Text Lite V1AWS Bedrock0.3000.400compare42K4KN/AN/A
Titan Text Express V1AWS Bedrock1.301.70compare42K8KN/AN/A
Jamba Instruct V1AWS Bedrock0.5000.700compare70K4KN/AN/A
J2 Ultra V1AWS Bedrock18.8018.80compare8K8KN/AN/A
J2 Mid V1AWS Bedrock12.5012.50compare8K8KN/AN/A