Compare AI model pricing and benchmarks across providers including OpenAI, Anthropic, Google, AWS Bedrock, Azure, Mistral, and more. Filter by model capabilities like vision, function calling, and reasoning support. Find the most cost-effective model for your use case. Currently tracking 1,771 models across 99 providers.

The data is based on LiteLLM, maintained by the open-source community, and benchmark data from Artificial Analysis. The latest update occurred on February 27, 2026 at 12:00 AM UTC

Input/1M
to
Output/1M
to
Model
Provider
Input Price, $
Output Price, $
Price Compare
Context
Max Output
Intelligence
Coding
Gemini 3 Pro PreviewGoogle Vertex AI2.0012.00compare1.0M66K49.846.5
Gemini 3 ProReplicate2.0012.00compareN/AN/A49.846.5
Gemini 3 Pro PreviewOpenRouter2.0012.00compare1.0M66K49.846.5
Gemini 3 Pro PreviewGMI Cloud2.0012.00compare1.0M66K49.846.5
Gemini 3 Pro PreviewGitHub CopilotN/AN/Acompare128K64K49.846.5
Gemini 3 Pro PreviewGoogle Gemini2.0012.00compare1.0M66K49.846.5
GPT-5.2OpenRouter1.7514.00compare272K128K49.448.7
GPT-5.2-chatOpenAI1.7514.00compare128K16K49.448.7
GPT-5.2GMI Cloud1.7514.00compare410K32K49.448.7
GPT-5.2GitHub CopilotN/AN/Acompare128K64K49.448.7
GPT-5.2Azure OpenAI1.7514.00compare272K128K49.448.7
GPT-5.1-chatOpenAI1.2510.00compare128K16K47.744.7
GPT-5.1GMI Cloud1.2510.00compare410K32K47.744.7
GPT-5.1GitHub CopilotN/AN/Acompare128K64K47.744.7
Databricks GPT 5 1Databricks1.2510.00compare272K128K47.744.7
GPT-5.1Azure OpenAI1.3811.00compare272K128K47.744.7
Kimi K2.5Together AI0.5002.80compare256K256K46.839.5
Kimi K2.5OpenRouter0.6003.00compare262K262K46.839.5
Moonshotai.kimi K2.5AWS Bedrock0.6003.00compare262K262K46.839.5
Kimi K2.5Moonshot AI (Kimi)0.6003.00compare262K262K46.839.5
Kimi K2p5Fireworks AI0.6003.00compare262K262K46.839.5
GPT-5Replicate1.2510.00compareN/AN/A44.636.0
GPT-5-codexOpenRouter1.2510.00compare272K128K44.638.9
GPT-5OpenRouter1.2510.00compare272K128K44.636.0
GPT-5-chatOpenAI1.2510.00compare128K16K44.636.0
GPT-5GMI Cloud1.2510.00compare410K32K44.636.0
GPT-5GitHub CopilotN/AN/Acompare128K128K44.636.0
Databricks GPT 5Databricks1.2510.00compare272K128K44.636.0
GPT-5-chatAzure OpenAI1.2510.00compare128K16K44.636.0
Glm 4.7Z AI (Zhipu)0.6002.20compare200K128K42.136.3
Zai.glm 4.7AWS Bedrock0.6002.20compare200K128K42.136.3
Glm 4.7 MaasVertex AI (Z AI)0.6002.20compare200K128K42.136.3
GLM 4.7Together AI0.4502.00compare200K200K42.136.3
Glm 4.7OpenRouter0.4001.50compare203K64K42.136.3
Glm 4.7Novita AI0.6002.20compare205K131K42.136.3
GLM 4.7 FP8GMI Cloud0.4002.00compare203K16K42.136.3
Glm 4 7 251222Volcengine (ByteDance)N/AN/Acompare205K131K42.136.3
Glm 4p7Fireworks AI0.6002.20compare203K203K42.136.3
Minimax M2.5OpenRouter0.3001.10compare197K66K41.937.4
MiniMax M2.5MiniMax0.3001.20compare1.0M8K41.937.4
DeepSeek ReasonerDeepSeek0.2800.420compare131K66K41.736.7
GPT-5 miniReplicate0.2502.00compareN/AN/A41.235.3
GPT-5 miniOpenRouter0.2502.00compare272K128K41.235.3
GPT-5 miniOpenAI0.2502.00compare272K128K41.235.3
GPT-5 miniGitHub CopilotN/AN/Acompare128K64K41.235.3
Databricks GPT 5 MiniDatabricks0.2502.00compare272K128K41.235.3
GPT-5 miniAzure OpenAI0.2502.00compare272K128K41.235.3
Grok 4xAI3.0015.00compare256K256K40.740.5
Grok 4Vercel AI Gateway3.0015.00compare256K256K40.740.5
Grok 4Replicate7.2036.00compareN/AN/A40.740.5
Grok 4OpenRouter3.0015.00compare256K256K40.740.5
Xai.grok 4Oracle Cloud (OCI)3.0015.00compare128K128K40.740.5
Grok 4Azure AI3.0015.00compare131K131K40.740.5
o3Vercel AI Gateway2.008.00compare200K100K38.438.4
o3OpenAI2.008.00compare200K100K38.438.4
Openai O3Gradient AI2.008.00compare100KN/A38.438.4
o3Azure OpenAI2.008.00compare200K100K38.438.4
Claude Sonnet 4.5Anthropic (Vertex AI)3.0015.00compare200K64K37.133.5
Claude Sonnet 4.5Vercel AI Gateway3.0015.00compare1.0M64K37.133.5
Claude 4.5 SonnetReplicate3.0015.00compareN/AN/A37.133.5
Claude Sonnet 4.5OpenRouter3.0015.00compare1.0M1.0M37.133.5
Claude Sonnet 4.5GMI Cloud3.0015.00compare410K32K37.133.5
Claude Sonnet 4.5GitHub CopilotN/AN/Acompare128K16K37.133.5
Databricks Claude Sonnet 4 5Databricks3.0015.00compare200K64K37.133.5
Claude Sonnet 4.5AWS Bedrock3.0015.00compare200K64K37.133.5
Claude Sonnet 4.5Anthropic3.0015.00compare200K64K37.133.5
Claude Sonnet 4.5Azure AI3.0015.00compare200K64K37.133.5
Minimax M2 MaasVertex AI (MiniMax)0.3001.20compare197K197K36.129.2
Minimax M2OpenRouter0.2551.02compare205K205K36.129.2
Minimax M2Novita AI0.3001.20compare205K131K36.129.2
MiniMax M2MiniMax0.3001.20compare200K8K36.129.2
Minimax.minimax M2AWS Bedrock0.3001.20compare128K8K36.129.2
Minimax M2Fireworks AI0.3001.20compare4K4K36.129.2
Kat Coder ProNovita AI0.3001.20compare256K128K36.018.3
Claude Opus 4.5Anthropic (Vertex AI)5.0025.00compare200K64K35.342.9
Claude Opus 4.5Vercel AI Gateway5.0025.00compare200K64K35.342.9
Claude Opus 4.5OpenRouter5.0025.00compare200K32K35.342.9
Claude Opus 4.5GMI Cloud5.0025.00compare410K32K35.342.9
Claude Opus 4.5GitHub CopilotN/AN/Acompare128K16K35.342.9
Databricks Claude Opus 4 5Databricks5.0025.00compare200K64K35.342.9
Claude Opus 4.5Anthropic5.0025.00compare200K64K35.342.9
Claude Opus 4.5Azure AI5.0025.00compare200K64K35.342.9
Claude Opus 4.5AWS Bedrock5.0025.00compare200K64K35.342.9
Gemini 3 Flash PreviewGoogle Vertex AI0.5003.00compare1.0M66K35.037.8
Gemini 3 Flash PreviewOpenRouter0.5003.00compare1.0M66K35.037.8
Gemini 3 Flash PreviewGMI Cloud0.5003.00compare1.0M66K35.037.8
Gemini 3 Flash PreviewGoogle Gemini0.5003.00compare1.0M66K35.037.8
DeepSeek V3.2 SpecialeAzure AI0.5801.68compare164K164K34.137.9
GPT-oss-120bIBM watsonx0.1500.600compare8K8K33.328.6
GPT-oss-120bWeights & Biases0.0150.060compare131K131K33.328.6
GPT-oss-120b-maasVertex AI (OpenAI)0.1500.600compare131K33K33.328.6
GPT-oss-120bTogether AI0.1500.600compare128KN/A33.328.6
GPT-oss-120bSambaNova3.004.50compare131K131K33.328.6
GPT-oss-120bReplicate0.1800.720compareN/AN/A33.328.6
GPT-oss-120bOVHcloud0.0800.400compare131K131K33.328.6
GPT-oss-120bOpenRouter0.1800.800compare131K33K33.328.6
GPT-oss-120b-1AWS Bedrock0.1500.600compare128K128K33.328.6
GPT-oss:120b-cloudOllamaN/AN/Acompare131K131K33.328.6
GPT-oss-120bNovita AI0.0500.250compare131K33K33.328.6
GPT-oss-120b-mxfp-GGUFLemonade (AMD)N/AN/Acompare131K33K33.328.6
GPT-oss-120bGroq0.1500.600compare131K33K33.328.6
GPT-oss-120bFireworks AI0.1500.600compare131K131K33.328.6
GPT-oss-120bDeepInfra0.0500.450compare131K131K33.328.6
Databricks GPT OSS 120BDatabricks0.1500.600compare131K131K33.328.6
GPT-oss-120bCerebras0.3500.750compare131K33K33.328.6
GPT-oss-120bAzure AI0.1500.600compare131K131K33.328.6
o4 miniVercel AI Gateway1.104.40compare200K100K33.125.6
o4 miniReplicate1.004.00compareN/AN/A33.125.6
o4 miniOpenAI1.104.40compare200K100K33.125.6
o4 miniAzure OpenAI1.104.40compare200K100K33.125.6
Claude Sonnet 4Anthropic (Vertex AI)3.0015.00compare1.0M64K33.030.6
Claude 4 SonnetVercel AI Gateway3.0015.00compare200K64K33.030.6
Claude 4 SonnetReplicate3.0015.00compareN/AN/A33.030.6
Claude Sonnet 4OpenRouter3.0015.00compare1.0M64K33.030.6
Claude 4 SonnetHeroku (Salesforce)N/AN/Acompare8KN/A33.030.6
Claude Sonnet 4GMI Cloud3.0015.00compare410K32K33.030.6
Claude Sonnet 4GitHub CopilotN/AN/Acompare128K16K33.030.6
Claude 4 SonnetDeepInfra3.3016.50compare200K200K33.030.6
Databricks Claude Sonnet 4Databricks3.0015.00compare200K64K33.030.6
Claude 4 SonnetAnthropic3.0015.00compare1.0M64K33.030.6
Claude Sonnet 4.20250514AWS Bedrock3.0015.00compare1.0M64K33.030.6
Grok 3 MinixAI0.3000.500compare131K131K32.125.2
DeepSeek V3.2 MaasVertex AI (DeepSeek)0.5601.68compare164K33K32.134.6
DeepSeek V3.2OpenRouter0.2800.400compare164K164K32.134.6
DeepSeek V3.2Novita AI0.2690.400compare164K66K32.134.6
DeepSeek V3.2GMI Cloud0.2800.400compare164K16K32.134.6
DeepSeek V3p2Fireworks AI0.5601.68compare164K164K32.134.6
DeepSeek V3.2DeepSeek0.2800.400compare164K164K32.134.6
DeepSeek V3 2 251201Volcengine (ByteDance)N/AN/Acompare98K33K32.134.6
V3.2AWS Bedrock0.7402.22compare164K164K32.134.6
DeepSeek V3.2Azure AI0.5801.68compare164K164K32.134.6
Qwen3 MaxNovita AI2.118.45compare262K66K31.426.4
Qwen3 MaxDashScope (Alibaba)N/AN/Acompare258K66K31.426.4
Kimi K2 Instruct 0905Together AI1.003.00compare262KN/A30.925.9
Kimi K2 0905Novita AI0.6002.50compare262K262K30.925.9
Kimi K2 0905 PreviewMoonshot AI (Kimi)0.6002.50compare262K262K30.925.9
Kimi K2 Instruct 0905Groq1.003.00compare262K16K30.925.9
Kimi K2 Instruct 0905Fireworks AI0.6002.50compare262K33K30.925.9
Kimi K2 Instruct 0905DeepInfra0.5002.00compare262K262K30.925.9
Claude 3 7 SonnetAnthropic (Vertex AI)3.0015.00compare200K8K30.826.7
o1Vercel AI Gateway15.0060.00compare200K100K30.820.5
Claude 3 7 SonnetVercel AI Gateway3.0015.00compare200K64K30.826.7
o1Replicate15.0060.00compareN/AN/A30.820.5
Claude 3.7 SonnetReplicate3.0015.00compareN/AN/A30.826.7
o1OpenRouter15.0060.00compare200K100K30.820.5
Claude 3.7 SonnetOpenRouter3.0015.00compare200K128K30.826.7
o1OpenAI15.0060.00compare200K100K30.820.5
Claude 3 7 SonnetHeroku (Salesforce)N/AN/Acompare8KN/A30.826.7
Anthropic Claude 3.7 SonnetGradient AI3.0015.00compare1KN/A30.826.7
Eu.anthropic.claude 3 7 Sonnet 20250219 V1AWS Bedrock3.0015.00compare200K8K30.826.7
Claude 3 7 SonnetDeepInfra3.3016.50compare200K200K30.826.7
Databricks Claude 3 7 SonnetDatabricks3.0015.00compare200K128K30.826.7
Claude 3 7 SonnetAnthropic3.0015.00compare200K64K30.826.7
o1Azure OpenAI15.0060.00compare200K100K30.820.5
Mimo V2 FlashOpenRouter0.0900.290compare262K16K30.425.8
Mimo V2 FlashNovita AI0.1000.300compare262K32K30.425.8
Gemini 2.5 ProVercel AI Gateway2.5010.00compare1.0M66K30.346.7
Gemini 2.5 ProOpenRouter1.2510.00compare1.0M8K30.346.7
Gemini 2.5 ProGitHub CopilotN/AN/Acompare128K64K30.346.7
Gemini proGoogle Gemini1.2510.00compare1.0M66K30.346.7
Gemini 2.5 ProGoogle Vertex AI1.2510.00compare1.0M66K30.346.7
Gemini 2.5 ProDeepInfra1.2510.00compare1.0M1.0M30.346.7
Databricks Gemini 2 5 ProDatabricks1.2510.00compare1.0M66K30.346.7
Glm 4.6Z AI (Zhipu)0.6002.20compare200K128K30.230.2
Glm 4.6Vercel AI Gateway0.4501.80compare200K200K30.230.2
GLM 4.6Together AI0.6002.20compare200K200K30.230.2
Glm 4.6OpenRouter0.4001.75compare203K131K30.230.2
Glm 4.6Novita AI0.5502.20compare205K131K30.230.2
Glm 4p6Fireworks AI0.5502.19compare203K203K30.230.2
Glm 4.7 FlashOpenRouter0.0700.400compare200K32K30.125.9
Qwen3 235B A22b Thinking 2507Novita AI0.3003.00compare131K33K29.523.2
Qwen3 235B A22b Thinking 2507Fireworks AI0.2200.880compare262K262K29.523.2
Grok Code Fast 1xAI0.2001.50compare256K256K28.723.7
Grok Code Fast 1Azure AI0.2001.50compare131K131K28.723.7
DeepSeek V3.1 TerminusNovita AI0.2701.00compare131K33K28.531.9
DeepSeek V3p1 TerminusFireworks AI0.5601.68compare128K8K28.531.9
DeepSeek V3.1 TerminusDeepInfra0.2701.00compare164K164K28.531.9
Qwen3 Coder NextAWS Bedrock0.6001.44compare262K8K28.322.9
DeepSeek V3.1Weights & Biases0.0550.165compare128K128K28.128.4
DeepSeek V3.1 MaasVertex AI (DeepSeek)1.355.40compare164K33K28.128.4
DeepSeek V3.1Together AI0.6001.70compare128KN/A28.128.4
DeepSeek V3.1SambaNova3.004.50compare33K33K28.128.4
DeepSeek V3.1Replicate0.6722.02compare164K164K28.128.4
DeepSeek V3.1:671B CloudOllamaN/AN/Acompare164K164K28.128.4
DeepSeek V3.1Novita AI0.2701.00compare131K33K28.128.4
DeepSeek V3p1Fireworks AI0.5601.68compare128K8K28.128.4
DeepSeek V3.1DeepInfra0.2701.00compare164K164K28.128.4
DeepSeek R1Vercel AI Gateway0.5502.19compare128K8K27.124.0
Us.deepseek.r1 V1AWS Bedrock1.355.40compare128K4K27.124.0
DeepSeek R1Together AI3.007.00compare128K20K27.124.0
DeepSeek R1SnowflakeN/AN/Acompare33K8K27.124.0
DeepSeek R1SambaNova5.007.00compare33K33K27.124.0
DeepSeek R1Replicate3.7510.00compare66K8K27.124.0
DeepSeek R1OpenRouter0.5502.19compare65K8K27.124.0
Magistral Medium 2509Mistral AI2.005.00compare40K40K27.121.7
DeepSeek R1Hyperbolic0.4000.400compare33K33K27.124.0
DeepSeek R1Fireworks AI3.008.00compare128K20K27.124.0
DeepSeek R1DeepSeek0.5502.19compare66K8K27.124.0
DeepSeek R1DeepInfra0.7002.40compare164K164K27.124.0
DeepSeek R1Azure AI1.355.40compare128K8K27.124.0
GPT-5 nanoReplicate0.0500.400compareN/AN/A26.820.3
GPT-5 nanoOpenRouter0.0500.400compare272K128K26.820.3
GPT-5 nanoOpenAI0.0500.400compare272K128K26.820.3
Databricks GPT 5 NanoDatabricks0.0500.400compare272K128K26.820.3
GPT-4.1 nanoAzure OpenAI0.1000.400compare1.0M33K26.820.3
Qwen3 Next 80B A3b Thinking MaasVertex AI (Qwen)0.1501.20compare262K262K26.719.5
Qwen3 Next 80B A3B ThinkingTogether AI0.1501.50compare262KN/A26.719.5
Qwen3 Next 80B A3b ThinkingNovita AI0.1501.50compare131K33K26.719.5
Qwen3 Next 80B A3b ThinkingFireworks AI0.9000.900compare4K4K26.719.5
Qwen3 Next 80B A3B ThinkingDeepInfra0.1401.40compare262K262K26.719.5
Glm 4.5Z AI (Zhipu)0.6002.20compare128K32K26.426.3
GLM 4.5Weights & Biases0.0550.200compare131K131K26.426.3
Glm 4.5Vercel AI Gateway0.6002.20compare131K131K26.426.3
Glm 4.5Novita AI0.6002.20compare131K98K26.426.3
Glm 4p5Fireworks AI0.5502.19compare128K96K26.426.3
GLM 4.5DeepInfra0.4001.60compare131K131K26.426.3
Glm 4.5 AirZ AI (Zhipu)0.2001.10compare128K32K26.323.8
Kimi K2 InstructWeights & Biases0.6002.50compare128K128K26.322.1
Glm 4.5 AirVercel AI Gateway0.2001.10compare128K96K26.323.8
GPT-4.1Vercel AI Gateway2.008.00compare1.0M33K26.321.8
Kimi K2Vercel AI Gateway0.5502.20compare131K16K26.322.1
GLM 4.5 Air FP8Together AI0.2001.10compare128KN/A26.323.8
Kimi K2 InstructTogether AI1.003.00compareN/AN/A26.322.1
GPT-4.1Replicate2.008.00compareN/AN/A26.321.8
GPT-4.1OpenRouter2.008.00compare1.0M33K26.321.8
Glm 4.5 AirNovita AI0.1300.850compare131K98K26.323.8
Kimi K2 InstructNovita AI0.5702.30compare131K131K26.322.1
Kimi K2 InstructHyperbolic2.002.00compare131K131K26.322.1
GPT-4.1GitHub CopilotN/AN/Acompare128K16K26.321.8
Ft:gpt 4 0613OpenAI30.0060.00compare8K4K26.321.8
Kimi K2 InstructFireworks AI0.6002.50compare131K16K26.322.1
Kimi K2 InstructDeepInfra0.5002.00compare131K131K26.322.1
GPT-4.1Azure OpenAI2.208.80compare1.0M33K26.321.8
o3 miniVercel AI Gateway1.104.40compare200K100K25.917.9
o3 miniOpenRouter1.104.40compare128K66K25.917.9
o3 miniOpenAI1.104.40compare200K100K25.917.9
Openai O3 MiniGradient AI1.104.40compare100KN/A25.917.9
o3 miniAzure OpenAI1.104.40compare200K100K25.917.9
Grok 3xAI3.0015.00compare131K131K25.219.8
Grok 3Vercel AI Gateway3.0015.00compare131K131K25.219.8
Xai.grok 3Oracle Cloud (OCI)3.0015.00compare131K131K25.219.8
Grok 3Azure AI3.0015.00compare131K131K25.219.8
Qwen3 235B A22B Instruct 2507Weights & Biases0.0100.010compare262K262K25.022.1
Qwen3 235B A22b Instruct 2507 MaasVertex AI (Qwen)0.2501.00compare262K16K25.022.1
Qwen3 235B A22B Instruct 2507 TputTogether AI0.2006.00compare262KN/A25.022.1
Qwen3 235B A22b 2507 V1AWS Bedrock0.2200.880compare262K131K25.022.1
Qwen3 235B A22b 2507OpenRouter0.0710.100compare262K262K25.022.1
Qwen3 235B A22b Instruct 2507Novita AI0.0900.580compare131K16K25.022.1
Qwen3 235B A22b Instruct 2507Fireworks AI0.2200.880compare262K262K25.022.1
Qwen3 235B A22B Instruct 2507DeepInfra0.0900.600compare262K262K25.022.1
Qwen3 Coder 480B A35B InstructWeights & Biases0.1000.150compare262K262K24.824.6
Qwen3 Coder 480B A35b Instruct MaasVertex AI (Qwen)1.004.00compare262K33K24.824.6
Qwen3 Coder 480B A35B Instruct FP8Together AI2.002.00compare256KN/A24.824.6
Qwen3 Coder 480B A35b V1AWS Bedrock0.2201.80compare262K66K24.824.6
Qwen3 Coder:480B CloudOllamaN/AN/Acompare262K262K24.824.6
Qwen3 Coder 480B A35b InstructNovita AI0.3001.30compare262K66K24.824.6
Qwen3 Coder 480B A35b InstructFireworks AI0.4501.80compare262K262K24.824.6
Qwen3 Coder 480B A35B InstructDeepInfra0.4001.60compare262K262K24.824.6
GPT-oss-20bWeights & Biases0.00500.020compare131K131K24.518.5
GPT-oss-20b-maasVertex AI (OpenAI)0.0750.300compare131K33K24.518.5
GPT-oss-20bTogether AI0.0500.200compare128KN/A24.518.5
GPT-oss-20bReplicate0.0900.360compareN/AN/A24.518.5
GPT-oss-20bOVHcloud0.0400.150compare131K131K24.518.5
GPT-oss-20bOpenRouter0.0200.100compare131K33K24.518.5
GPT-oss-20b-1AWS Bedrock0.0700.300compare128K128K24.518.5
GPT-oss:20b-cloudOllamaN/AN/Acompare131K131K24.518.5
GPT-oss-20bNovita AI0.0400.150compare131K33K24.518.5
GPT-oss-20b-mxfp4-GGUFLemonade (AMD)N/AN/Acompare131K33K24.518.5
GPT-oss-20bGroq0.0750.300compare131K33K24.518.5
GPT-oss-20bFireworks AI0.0500.200compare131K131K24.518.5
GPT-oss-20bDeepInfra0.0400.150compare131K131K24.518.5
Databricks GPT OSS 20BDatabricks0.0700.300compare131K131K24.518.5
Kimi K2 Thinking MaasVertex AI (Moonshot)0.6002.50compare256K256K24.115.5
Kimi K2 ThinkingNovita AI0.6002.50compare262K262K24.115.5
Kimi K2 ThinkingMoonshot AI (Kimi)0.6002.50compare262K262K24.115.5
Kimi K2 Thinking 251104Volcengine (ByteDance)N/AN/Acompare229K33K24.115.5
Kimi K2 ThinkingGMI Cloud0.8001.20compare262K16K24.115.5
Kimi K2 ThinkingFireworks AI0.6002.50compare262K262K24.115.5
Moonshotai.kimi K2 ThinkingAWS Bedrock0.7303.03compare262K262K24.115.5
Qwen3 Next 80B A3b Instruct MaasVertex AI (Qwen)0.1501.20compare262K262K23.715.3
Qwen3 Next 80B A3B InstructTogether AI0.1501.50compare262KN/A23.715.3
Qwen3 Next 80B A3bAWS Bedrock0.1501.20compare128K8K23.715.3
o1-previewOpenAI15.0060.00compare128K33K23.734.0
Qwen3 Next 80B A3b InstructNovita AI0.1501.50compare131K33K23.715.3
Qwen3 Next 80B A3b InstructFireworks AI0.9000.900compare4K4K23.715.3
Qwen3 Next 80B A3B InstructDeepInfra0.1401.40compare262K262K23.715.3
o1-previewAzure OpenAI15.0060.00compare128K33K23.734.0
Claude Opus 4.1Anthropic (Vertex AI)15.0075.00compare200K32K23.6N/A
Claude Opus 4.1Vercel AI Gateway15.0075.00compare200K32K23.6N/A
Claude Opus 4.1OpenRouter15.0075.00compare200K32K23.6N/A
Claude Opus 41GitHub CopilotN/AN/Acompare80K16K23.6N/A
Databricks Claude Opus 4 1Databricks15.0075.00compare200K32K23.6N/A
Claude Opus 4.1Anthropic15.0075.00compare200K32K23.6N/A
Claude Opus 4.1Azure AI15.0075.00compare200K32K23.6N/A
Claude Opus 4.1AWS Bedrock15.0075.00compare200K32K23.6N/A
Qwen3 Vl 235B A22bAWS Bedrock0.5302.66compare128K8K23.316.5
Qwen3 Vl 235B A22b InstructNovita AI0.3001.50compare131K33K23.316.5
Qwen3 Vl 235B A22b InstructFireworks AI0.2200.880compare262K262K23.316.5
Grok 4 Fast Non ReasoningxAI0.2000.500compare2.0M2.0M23.119.0
Mistral Large 3Mistral AI0.5001.50compare256K8K22.822.7
Mistral Large 3 675B InstructAWS Bedrock0.5001.50compare128K8K22.822.7
Mistral Large 3 Fp8Fireworks AI1.201.20compare256K256K22.822.7
Mistral Large 3Azure AI0.5001.50compare256K8K22.822.7
Magistral Small 2509AWS Bedrock0.5001.50compare128K8K22.514.8
DeepSeek V3 0324Weights & Biases0.1140.275compare161K161K22.322.0
DeepSeek V3 0324Novita AI0.2701.12compare164K164K22.322.0
DeepSeek V3 0324Lambda0.2000.600compare131K131K22.322.0
DeepSeek V3 0324Hyperbolic0.4000.400compare33K33K22.322.0
DeepSeek V3 0324GMI Cloud0.2800.880compare164K16K22.322.0
DeepSeek V3 0324Fireworks AI0.9000.900compare164K164K22.322.0
DeepSeek V3 0324DeepInfra0.2500.880compare164K164K22.322.0
Claude Opus 4Anthropic (Vertex AI)15.0075.00compare200K32K22.2N/A
GPT-4.1 miniVercel AI Gateway0.4001.60compare1.0M33K22.218.5
Claude 4 OpusVercel AI Gateway15.0075.00compare200K32K22.2N/A
GPT-4.1 miniReplicate0.4001.60compareN/AN/A22.218.5
GPT-4.1 miniOpenRouter0.4001.60compare1.0M33K22.218.5
Claude Opus 4OpenRouter15.0075.00compare200K32K22.2N/A
GPT-4.1 miniOpenAI0.4001.60compare1.0M33K22.218.5
Claude Opus 4GMI Cloud15.0075.00compare410K32K22.2N/A
Claude 4 OpusDeepInfra16.5082.50compare200K200K22.2N/A
Databricks Claude Opus 4Databricks15.0075.00compare200K32K22.2N/A
Claude 4 OpusAnthropic15.0075.00compare200K32K22.2N/A
GPT-4.1 miniAzure OpenAI0.4401.76compare1.0M33K22.218.5
Claude Opus 4.20250514AWS Bedrock15.0075.00compare200K32K22.2N/A
Nova 2 Pro Preview 20251202 V1AWS Bedrock2.1917.50compare1.0M64K21.920.5
Claude Haiku 4.5Anthropic (Vertex AI)1.005.00compare200K8K21.829.6
Claude Haiku 4.5Vercel AI Gateway1.005.00compare200K64K21.829.6
Claude 4.5 HaikuReplicate1.005.00compareN/AN/A21.829.6
Claude Haiku 4.5OpenRouter1.005.00compare200K200K21.829.6
Claude Haiku 4.5GitHub CopilotN/AN/Acompare128K16K21.829.6
Databricks Claude Haiku 4 5Databricks1.005.00compare200K64K21.829.6
Claude Haiku 4.5Anthropic1.005.00compare200K64K21.829.6
Claude Haiku 4.5Azure AI1.005.00compare200K64K21.829.6
Claude Haiku 4.5AWS Bedrock1.005.00compare200K64K21.829.6
Qwen3 Vl 32B InstructFireworks AI0.9000.900compare4K4K21.415.6
Gemini 2.5 FlashVercel AI Gateway0.3002.50compare1.0M66K21.117.8
Gemini 2.5 FlashReplicate2.502.50compareN/AN/A21.117.8
Gemini 2.5 FlashOpenRouter0.3002.50compare1.0M8K21.117.8
Gemini 2.5 Flash-native-audioGoogle Gemini0.3002.50compare1.0M8K21.117.8
Gemini 2.5 FlashGoogle Vertex AI0.3002.50compare1.0M66K21.117.8
Gemini 2.5 FlashDeepInfra0.3002.50compare1.0M1.0M21.117.8
Databricks Gemini 2 5 FlashDatabricks0.3002.50compare1.0M66K21.117.8
Minimax M1 80KNovita AI0.5502.20compare1.0M40K20.914.1
Minimax M1 80KFireworks AI0.1000.100compare4K4K20.914.1
o1 miniReplicate1.104.40compareN/AN/A20.4N/A
o1 miniOpenAI1.104.40compare128K66K20.4N/A
o1 miniAzure OpenAI1.214.84compare128K66K20.4N/A
Qwen3 Vl 30B A3b InstructNovita AI0.2000.700compare131K33K20.014.3
GPT-4.5 PreviewOpenAI75.00150.00compare128K16K20.0N/A
Qwen3 Vl 30B A3b InstructFireworks AI0.1500.600compare262K262K20.014.3
Grok 4 1 Fast Non ReasoningxAI0.2000.500compare2.0M2.0M19.919.5
QwQ 32BSambaNova0.5001.00compare16K16K19.7N/A
QwQ 32BNscale0.1800.200compareN/AN/A19.7N/A
QwQ 32BHyperbolic0.2000.200compare131K131K19.7N/A
Qwen Qwq 32B PreviewFireworks AI0.9000.900compare33K33K19.7N/A
QwQ 32BDeepInfra0.1500.400compare131K131K19.7N/A
Qwen3 30B A3b Instruct 2507Fireworks AI0.5000.500compare262K262K19.314.2
Qwen3 30B A3b Thinking 2507Fireworks AI0.9000.900compare262K262K19.111.0
DevstralMistral AI0.4002.00compare256K256K19.023.7
Olmo 3 32B ThinkPublic AIN/AN/Acompare33K4K18.910.5
Nvidia.nemotron Nano 9B V2AWS Bedrock0.0600.230compare128K8K18.87.5
Nvidia Nemotron Nano 9B V2Fireworks AI0.2000.200compare131K131K18.87.5
NVIDIA Nemotron Nano 9B V2DeepInfra0.0400.160compare131K131K18.87.5
Nova 2 Lite V1AWS Bedrock0.3002.50compare1.0M64K18.612.5
Llama 4 MaverickVercel AI Gateway0.2000.600compare131K8K18.415.6
Sonar ReasoningVercel AI Gateway1.005.00compare127K8K17.9N/A
Sonar ReasoningPerplexity1.005.00compare128KN/A17.9N/A
Mistral Medium 3Vertex AI (Mistral)0.4002.00compare128K8K17.613.6
Gemini 2.0 FlashVercel AI Gateway0.1500.600compare1.0M8K17.613.6
Gemini 2.0 Flash-001OpenRouter0.1000.400compare1.0M8K17.613.6
Gemini 2.0 Flash-expGoogle GeminiN/AN/Acompare1.0M8K17.613.6
Gemini 2.0 Flash-expGoogle Vertex AI0.1500.600compare1.0M8K17.613.6
Gemini 2.0 Flash-001DeepInfra0.1000.400compare1.0M1.0M17.613.6
Qwen3 Coder 30B A3b V1AWS Bedrock0.1500.600compare262K131K17.519.4
Qwen3 Coder 30B A3b InstructNovita AI0.0700.270compare160K33K17.519.4
Qwen3 Coder 30B A3B Instruct GGUFLemonade (AMD)N/AN/Acompare262K33K17.519.4
Qwen3 Coder 30B A3b InstructFireworks AI0.1500.600compare262K262K17.519.4
Magistral MediumVercel AI Gateway2.005.00compare128K64K17.416.0
Magistral MediumMistral AI2.005.00compare40K40K17.416.0
ERNIE 4.5 300B A47b PaddleNovita AI0.2801.10compare123K12K17.314.5
ERNIE 4p5 300B A47b PtFireworks AI0.1000.100compare4K4K17.314.5
DeepSeek R1 Distill Qwen 32BNscale0.1500.150compareN/AN/A17.2N/A
DeepSeek R1 Distill Qwen 32BNovita AI0.3000.300compare64K32K17.2N/A
DeepSeek R1 Distill Qwen 32BFireworks AI0.9000.900compare131K131K17.2N/A
DeepSeek R1 Distill Qwen 32BDeepInfra0.2700.270compare131K131K17.2N/A
DeepSeek V3Vercel AI Gateway0.9000.900compare128K8K17.116.4
DeepSeek V3Together AI1.251.25compare66K8K17.116.4
DeepSeek V3 0324SambaNova3.004.50compare33K33K17.116.4
DeepSeek V3Replicate1.451.45compare66K8K17.116.4
DeepSeek V3 TurboNovita AI0.4001.30compare64K16K17.116.4
Hermes3 405BLambda0.8000.800compare131K131K17.118.1
DeepSeek V3Hyperbolic0.2000.200compare33K33K17.116.4
DeepSeek V3Fireworks AI0.9000.900compare128K8K17.116.4
DeepSeek V3DeepSeek0.2701.10compare66K8K17.116.4
V3 V1AWS Bedrock0.5801.68compare164K82K17.116.4
Hermes 3 Llama 3.1 405BDeepInfra1.001.00compare131K131K17.118.1
DeepSeek V3DeepInfra0.3800.890compare164K164K17.116.4
DeepSeek V3Azure AI1.144.56compare128K8K17.116.4
Us.amazon.nova Premier V1AWS Bedrock2.5012.50compare1.0M10K17.013.8
Nova Premier V1Amazon Nova2.5012.50compare1.0M10K17.013.8
Magistral SmallVercel AI Gateway0.5001.50compare128K64K16.811.1
Olmo 3 7B ThinkPublic AIN/AN/Acompare33K4K16.87.6
Magistral SmallMistral AI0.5001.50compare40K40K16.811.1
Qwen3 Vl 8B InstructNovita AI0.0800.500compare131K33K16.79.8
DeepSeek R1 0528 Distill Qwen3 8BFireworks AI0.2000.200compare131K131K16.47.8
Qwen MaxDashScope (Alibaba)1.606.40compare31K8K16.3N/A
Ministral 14B 2512OpenRouter0.2000.200compare262K262K16.210.9
Ministral 3 14B InstructAWS Bedrock0.2000.200compare128K8K16.210.9
Glm 4.6vNovita AI0.3000.900compare131K33K16.111.1
Qwen3 4B Instruct 2507 GGUFLemonade (AMD)N/AN/Acompare262K33K16.19.1
Qwen3 4B Instruct 2507Fireworks AI0.2000.200compare262K262K16.19.1
DeepSeek R1 Distill Llama 70BVercel AI Gateway0.7500.990compare131K131K16.011.4
Qwen 3 235BVercel AI Gateway0.2000.600compare41K16K16.014.0
Qwen3 235B A22B Fp8 TputTogether AI0.2000.600compare40KN/A16.014.0
DeepSeek R1 Distill Llama 70BSambaNova0.7001.40compare131K131K16.011.4
Qwen3 235B A22b Instruct 2507Replicate0.2641.06compareN/AN/A16.014.0
DeepSeek R1 Distill Llama 70BOVHcloud0.6700.670compare131K131K16.011.4
DeepSeek R1 Distill Llama 70BNscale0.3750.375compareN/AN/A16.011.4
Qwen3 235B A22b Fp8Novita AI0.2000.800compare41K20K16.014.0
DeepSeek R1 Distill Llama 70BNovita AI0.8000.800compare8K8K16.011.4
Qwen3 235B A22BHyperbolic2.002.00compare131K131K16.014.0
DeepSeek R1 Distill Llama 70BGradient AI0.9900.990compare8KN/A16.011.4
Qwen3 VL 235B A22B Instruct FP8GMI Cloud0.3001.40compare262K16K16.014.0
Gemini 1.5 ProGoogle Gemini3.501.05compare1.0M8K16.023.6
Gemini flash-liteGoogle Gemini0.1000.400compare1.0M66K16.07.4
Gemini 2.5 Flash-LiteGoogle Vertex AI0.1000.400compare1.0M66K16.07.4
Gemini 1.5 ProGoogle Vertex AI1.255.00compare2.1M8K16.023.6
Qwen3 235B A22bFireworks AI0.2200.880compare131K131K16.014.0
DeepSeek R1 Distill Llama 70BFireworks AI0.9000.900compare131K131K16.011.4
Qwen3 235B A22BDeepInfra0.1800.540compare41K41K16.014.0
DeepSeek R1 Distill Llama 70BDeepInfra0.2000.600compare131K131K16.011.4
Claude 3 5 SonnetAnthropic (Vertex AI)3.0015.00compare200K8K15.930.2
Claude 3 5 SonnetVercel AI Gateway3.0015.00compare200K8K15.930.2
Claude 3 5 SonnetSnowflakeN/AN/Acompare18K8K15.930.2
Claude 3.5 SonnetReplicate3.7518.75compareN/AN/A15.930.2
Claude 3.5 SonnetOpenRouter3.0015.00compare200K8K15.930.2
Claude 3 5 SonnetHeroku (Salesforce)N/AN/Acompare8KN/A15.930.2
Anthropic Claude 3.5 SonnetGradient AI3.0015.00compare1KN/A15.930.2
Claude 3 5 SonnetAnthropic3.0015.00compare200K8K15.930.2
Claude 3 5 Sonnet 20240620 V1AWS Bedrock3.0015.00compare1.0M4K15.930.2
DeepSeek R1 Distill Qwen 14BNscale0.0700.070compareN/AN/A15.8N/A
DeepSeek R1 Distill Qwen 14BNovita AI0.1500.150compare33K16K15.8N/A
DeepSeek R1 Distill Qwen 14BFireworks AI0.2000.200compare131K131K15.8N/A
Qwen2.5 72B Instruct TurboTogether AIN/AN/AcompareN/AN/A15.611.9
Qwen 2.5 72B InstructNovita AI0.3800.400compare32K8K15.611.9
Qwen2.5 72B InstructHyperbolic0.1200.300compare131K131K15.611.9
Qwen2p5 72BFireworks AI0.9000.900compare131K131K15.611.9
Qwen2.5 72B InstructDeepInfra0.1200.390compare33K33K15.611.9
SonarVercel AI Gateway1.001.00compare127K8K15.5N/A
SonarPerplexity1.001.00compare128KN/A15.5N/A
Ministral 8B 2512OpenRouter0.1500.150compare262K262K15.310.0
Ministral 3 8B InstructAWS Bedrock0.1500.150compare128K8K15.310.0
Llama 3.1 405B Instruct MaasVertex AI (Llama)5.0016.00compare128K2K15.214.5
Sonar Reasoning ProVercel AI Gateway2.008.00compare127K8K15.2N/A
Meta Llama 3.1 405B Instruct TurboTogether AI3.503.50compareN/AN/A15.214.5
Llama3.1 405BSnowflakeN/AN/Acompare128K8K15.214.5
Meta Llama 3.1 405B InstructSambaNova5.0010.00compare16K16K15.214.5
Sonar Reasoning ProPerplexity2.008.00compare128KN/A15.2N/A
Llama 3.1 405B InstructOracle Cloud (OCI)10.6810.68compare128K4K15.214.5
Llama3 1 405B Instruct V1AWS Bedrock5.3216.00compare128K4K15.214.5
Llama3.1 405B Instruct Fp8Lambda0.8000.800compare131K131K15.214.5
Meta Llama 3.1 405B InstructHyperbolic0.1200.300compare33K33K15.214.5
Llama V3p1 405B InstructFireworks AI3.003.00compare128K16K15.214.5
Databricks Meta Llama 3 1 405B InstructDatabricks5.0015.00compare128K128K15.214.5
Meta Llama 3.1 405B InstructAzure AI5.3316.00compare128K2K15.214.5
Mistral Small 3.2 24B Instruct 2506OVHcloud0.0900.280compare128K128K15.113.3
Mistral Small 3.2 24B InstructOpenRouter0.1000.300compare32KN/A15.113.3
Devstral MediumMistral AI0.4002.00compare256K256K15.115.9
Mistral Small 3.2 24B Instruct 2506DeepInfra0.0750.200compare128K128K15.113.3
GPT-4.1 nanoVercel AI Gateway0.1000.400compare1.0M33K14.911.2
GPT-4.1 nanoReplicate0.1000.400compareN/AN/A14.911.2
GPT-4.1 nanoOpenRouter0.1000.400compare1.0M33K14.911.2
GPT-4.1 nanoOpenAI0.1000.400compare1.0M33K14.911.2
GPT-4.1 nanoAzure OpenAI0.1100.440compare1.0M33K14.911.2
GPT-4oVercel AI Gateway2.5010.00compare128K16K14.816.7
Devstral SmallVercel AI Gateway0.0700.280compare128K128K14.812.1
GPT-4oReplicate2.5010.00compareN/AN/A14.816.7
GPT-4oOpenRouter2.5010.00compare128K4K14.816.7
Devstral SmallMistral AI0.1000.300compare256K256K14.812.1
Qwen3 Vl 8BLlamaGate0.1500.550compare33K8K14.87.3
Openai GPT 4oGradient AIN/AN/Acompare16KN/A14.816.7
GPT-4oGMI Cloud2.5010.00compare131K16K14.816.7
GPT-4-o PreviewGitHub CopilotN/AN/Acompare64K4K14.816.7
Qwen3 Vl 8B InstructFireworks AI0.2000.200compare4K4K14.87.3
Chatgpt 4oOpenAI5.0015.00compare128K4K14.816.7
GPT-4oAzure OpenAI2.5010.00compare128K16K14.816.7
Gemini 2.0 Flash-LiteVercel AI Gateway0.0750.300compare1.0M8K14.7N/A
Command AVercel AI Gateway2.5010.00compare256K8K14.79.9
Mistral Large2SnowflakeN/AN/Acompare128K8K14.713.8
Gemini 2.0 Flash-LiteGoogle Gemini0.0750.300compare1.0M8K14.7N/A
Gemini 2.0 Flash-LiteGoogle Vertex AI0.0750.300compare1.0M8K14.7N/A
Qwen 3 30BVercel AI Gateway0.1000.300compare41K16K14.613.3
Qwen3 30B A3b Fp8Novita AI0.0900.450compare41K20K14.613.3
Qwen3 30B A3bFireworks AI0.1500.600compare131K131K14.613.3
Qwen3 30B A3BDeepInfra0.0800.290compare41K41K14.613.3
Llama 3.3 Nemotron Super 49B V1.5DeepInfra0.1000.400compare131K131K14.610.5
Qwen3 30B A3bDashScope (Alibaba)N/AN/Acompare129K16K14.613.3
Llama 3 3 70B InstructIBM watsonx0.7100.710compare128K128K14.510.7
Llama 3.3 70B InstructWeights & Biases0.0710.071compare128K128K14.510.7
Llama 3.3 70BVercel AI Gateway0.7200.720compare128K8K14.510.7
Qwen 3 32BVercel AI Gateway0.1000.300compare41K16K14.5N/A
Llama 3.3 70B Instruct TurboTogether AI0.8800.880compareN/AN/A14.510.7
Llama3.3 70BSnowflakeN/AN/Acompare128K8K14.510.7
Qwen3 32BSambaNova0.4000.800compare8K8K14.5N/A
Meta Llama 3.3 70B InstructSambaNova0.6001.20compare131K131K14.510.7
Qwen3 32B V1AWS Bedrock0.1500.600compare131K16K14.5N/A
Qwen3 32BOVHcloud0.0800.230compare32K32K14.5N/A
Meta Llama 3 3 70B InstructOVHcloud0.6700.670compare131K131K14.510.7
Llama 3.3 70B InstructOracle Cloud (OCI)0.7200.720compare128K4K14.510.7
Llama 3.3 70B InstructNscale0.2000.200compareN/AN/A14.510.7
Qwen3 32B Fp8Novita AI0.1000.450compare41K20K14.5N/A
Llama 3.3 70B InstructNovita AI0.1350.400compare131K120K14.510.7
Llama3 3 70B Instruct V1AWS Bedrock0.7200.720compare128K4K14.510.7
Llama 3.3 70B InstructMeta LlamaN/AN/Acompare128K4K14.510.7
Qwen3 32B Fp8Lambda0.0500.100compare131K131K14.5N/A
DeepSeek Llama3.3 70BLambda0.2000.600compare131K131K14.510.7
Llama 3.3 70B InstructHyperbolic0.1200.300compare131K131K14.510.7
Qwen3 32BGroq0.2900.590compare131K131K14.5N/A
Llama 3.3 70B VersatileGroq0.5900.790compare128K33K14.510.7
Llama3.3 70B InstructGradient AI0.6500.650compare2KN/A14.510.7
Alibaba Qwen3 32BGradient AIN/AN/Acompare2KN/A14.5N/A
Qwen3 32BFireworks AI0.9000.900compare131K131K14.5N/A
Llama V3p3 70B InstructFireworks AI0.9000.900compare131K131K14.510.7
Qwen3 32BDeepInfra0.1000.280compare41K41K14.5N/A
Llama 3.3 70B InstructDeepInfra0.2300.400compare131K131K14.510.7
Databricks Meta Llama 3 3 70B InstructDatabricks0.5001.50compare128K128K14.510.7
Qwen 3 32BCerebras0.4000.800compare128K128K14.5N/A
Llama 3.3 70BCerebras0.8501.20compare128K128K14.510.7
Llama 3.3 70B InstructAzure AI0.7100.710compare128K2K14.510.7
Glm 4.5vZ AI (Zhipu)0.6001.80compare128K32K14.410.8
Glm 4.5vNovita AI0.6001.80compare66K16K14.410.8
Glm 4p5vFireworks AI1.201.20compare131K131K14.410.8
Nvidia.nemotron Nano 3 30BAWS Bedrock0.0600.240compare262K8K14.215.8
Nvidia.nemotron Nano 12B V2AWS Bedrock0.2000.600compare128K8K14.25.9
Nemotron Nano V2 12B VlFireworks AI0.1000.100compare4K4K14.25.9
Mistral Small 3 1 24B Instruct 2503IBM watsonx0.1000.300compare32K32K14.013.9
Nova ProVercel AI Gateway0.8003.20compare300K8K14.011.0
Mistral Small 3.1 24B InstructOpenRouter0.1000.300compare32KN/A14.013.9
Pixtral Large 2411Mistral AI2.006.00compare128K128K14.0N/A
Nova Pro V1AWS Bedrock0.8003.20compare300K10K14.011.0
Nova Pro V1Amazon Nova0.8003.20compare300K10K14.011.0
Grok 2 1212xAI2.0010.00compare131K131K13.9N/A
Gemini 1.5 FlashGoogle Gemini0.0750.300compare1.0M8K13.8N/A
Gemini 1.5 FlashGoogle Vertex AI0.0750.300compare1.0M8K13.8N/A
Llama 4 ScoutVercel AI Gateway0.1000.300compare131K8K13.56.7
Llama3.1 Nemotron 70B Instruct Fp8Lambda0.1200.300compare131K131K13.510.8
Llama V3p1 Nemotron 70B InstructFireworks AI0.9000.900compare131K131K13.510.8
Llama 3.1 Nemotron 70B InstructDeepInfra0.6000.600compare131K131K13.510.8
Grok BetaxAI5.0015.00compare131K131K13.3N/A
Granite 4 H SmallIBM watsonx0.0600.250compare20K20K13.28.5
Qwen3 8B Fp8Novita AI0.0350.138compare128K20K13.27.1
Qwen3 8BLlamaGate0.0400.140compare33K8K13.27.1
Qwen3 8BFireworks AI0.2000.200compare41K41K13.27.1
Qwen2p5 32BFireworks AI0.9000.900compare131K131K13.2N/A
Phi 4DeepInfra0.0700.140compare16K16K13.211.2
Phi 4Azure AI0.1250.500compare16K16K13.211.2
Llama 3.1 70B Instruct MaasVertex AI (Llama)N/AN/Acompare128K2K13.110.9
Llama 3.1 70BVercel AI Gateway0.7200.720compare128K8K13.110.9
Meta Llama 3.1 70B Instruct TurboTogether AI0.8800.880compareN/AN/A13.110.9
Llama3.1 70BSnowflakeN/AN/Acompare128K8K13.110.9
Llama 3.1 70B InstructPerplexity1.001.00compare131K131K13.110.9
Meta Llama 3 1 70B InstructOVHcloud0.6700.670compare131K131K13.110.9
Llama3 1 70B Instruct V1AWS Bedrock0.9900.990compare128K2K13.110.9
Llama3.1 70B Instruct Fp8Lambda0.1200.300compare131K131K13.110.9
Meta Llama 3.1 70B InstructHyperbolic0.1200.300compare33K33K13.110.9
Meta Llama 3.1 70B InstructFriendliAI0.6000.600compare8K8K13.110.9
Qwen3 1p7b Fp8 DraftFireworks AI0.1000.100compare262K262K13.11.4
Llama V3p1 70B InstructFireworks AI0.9000.900compare131K131K13.110.9
Meta Llama 3.1 70B InstructDeepInfra0.4000.400compare131K131K13.110.9
Llama3.1 70BCerebras0.6000.600compare128K128K13.110.9
Meta Llama 3.1 70B InstructAzure AI2.683.54compare128K2K13.110.9
Mistral LargeVertex AI (Mistral)2.006.00compare128K8K13.0N/A
Olmo 3 7B InstructPublic AIN/AN/Acompare33K4K13.03.4
Mistral Large Instruct 2407OllamaN/AN/Acompare66K8K13.0N/A
Mistral Large 2407Mistral AI3.009.00compare128K128K13.0N/A
Mistral Large 2407 V1AWS Bedrock3.009.00compare128K8K13.0N/A
Mistral Large 2407Azure AI2.006.00compare128K4K13.0N/A
Qwen2.5 Coder 32B InstructOVHcloud0.8700.870compare32K32K12.9N/A
Qwen 2.5 Coder 32B InstructOpenRouter0.1800.180compare34K34K12.9N/A
Ministral 3B 2512OpenRouter0.1000.100compare131K131K12.94.8
Qwen2.5 Coder 32B InstructNscale0.0600.200compareN/AN/A12.9N/A
Ministral 3 3B InstructAWS Bedrock0.1000.100compare128K8K12.94.8
Qwen25 Coder 32B InstructLambda0.0500.100compare131K131K12.9N/A
Qwen2.5 Coder 32B InstructHyperbolic0.1200.300compare33K33K12.9N/A
Qwen2p5 Coder 32BFireworks AI0.9000.900compare33K33K12.9N/A
GPT-4 TurboVercel AI Gateway10.0030.00compare128K4K12.813.1
Nova LiteVercel AI Gateway0.0600.240compare300K8K12.85.1
GPT-4OpenRouter30.0060.00compare8KN/A12.813.1
GPT-4-32kOpenAI60.00120.00compare33K4K12.813.1
Nova Lite V1AWS Bedrock0.0600.240compare300K10K12.85.1
Nova Lite V1Amazon Nova0.0600.240compare300K10K12.85.1
GPT-4o miniVercel AI Gateway0.1500.600compare128K16K12.6N/A
GPT-4o miniReplicate0.1500.600compareN/AN/A12.6N/A
Openai GPT 4o MiniGradient AIN/AN/Acompare16KN/A12.6N/A
GPT-4o miniOpenAI0.1500.600compare128K16K12.6N/A
GPT-4o miniGMI Cloud0.1500.600compare131K16K12.6N/A
GPT-4o miniGitHub CopilotN/AN/Acompare64K4K12.6N/A
GPT-4o miniAzure OpenAI0.1500.600compare128K16K12.6N/A
Claude 3 OpusAnthropic (Vertex AI)15.0075.00compare200K4K12.519.5
Claude 3 OpusVercel AI Gateway15.0075.00compare200K4K12.519.5
Qwen3 4B Fp8Novita AI0.0300.030compare128K20K12.5N/A
Jamba Large 1.7AI21 Labs2.008.00compare256K256K12.57.8
Anthropic Claude 3 OpusGradient AI15.0075.00compare1KN/A12.519.5
Qwen3 4BFireworks AI0.2000.200compare41K41K12.5N/A
DeepSeek V2p5Fireworks AI1.201.20compare33K33K12.5N/A
Claude 3 OpusAnthropic15.0075.00compare200K4K12.519.5
Claude 3 Opus 20240229 V1AWS Bedrock15.0075.00compare200K4K12.519.5
Gemma 3 12B ItNovita AI0.0500.100compare131K8K12.46.3
Gemma 3 12B ItAWS Bedrock0.0900.290compare128K8K12.46.3
Gemma 3 12B ItDeepInfra0.0500.100compare131K131K12.46.3
Databricks Gemma 3 12BDatabricks0.1500.500compare128K32K12.46.3
Claude 3 5 HaikuAnthropic (Vertex AI)1.005.00compare200K8K12.310.7
Claude 3.5 HaikuVercel AI Gateway0.8004.00compare200K8K12.310.7
Claude 3.5 HaikuReplicate1.005.00compareN/AN/A12.310.7
Claude 3 5 HaikuHeroku (Salesforce)N/AN/Acompare4KN/A12.310.7
Anthropic Claude 3.5 HaikuGradient AI0.8004.00compare1KN/A12.310.7
Gemini 2.0 Flash-thinking-expGoogle GeminiN/AN/Acompare1.0M66K12.3N/A
Gemini 2.0 Flash-thinking-expGoogle Vertex AIN/AN/Acompare1.0M8K12.3N/A
Eu.anthropic.claude 3 5 Haiku 20241022 V1AWS Bedrock0.2501.25compare200K8K12.310.7
Claude 3 5 HaikuAnthropic1.005.00compare200K8K12.310.7
Mistral Saba 24BVercel AI Gateway0.7900.790compare33K33K12.1N/A
DeepSeek R1 Distill Llama 8BNscale0.0250.025compareN/AN/A12.1N/A
Devstral Small 2505Mistral AI0.1000.300compare128K128K12.112.2
Devstral Small 2505Fireworks AI0.9000.900compare131K131K12.112.2
DeepSeek R1 Distill Llama 8BFireworks AI0.2000.200compare131K131K12.1N/A
Reka FlashSnowflakeN/AN/Acompare100K8K12.0N/A
Qwen TurboDashScope (Alibaba)0.0500.200compare1.0M16K12.0N/A
Llama 3 2 90B Vision InstructIBM watsonx2.002.00compare128K128K11.9N/A
Llama 3.2 90B Vision Instruct MaasVertex AI (Llama)N/AN/Acompare128K2K11.9N/A
Llama 3.2 90BVercel AI Gateway0.7200.720compare128K8K11.9N/A
Llama 3.2 90B Vision InstructOracle Cloud (OCI)2.002.00compare128K4K11.9N/A
Llama3 2 90B Instruct V1AWS Bedrock2.002.00compare128K4K11.9N/A
Llama V3p2 90B Vision InstructFireworks AI0.9000.900compare16K16K11.9N/A
Llama 3.2 90B Vision InstructAzure AI2.042.04compare128K2K11.9N/A
Nova MicroVercel AI Gateway0.0350.140compare128K8K11.64.1
Nova Micro V1AWS Bedrock0.0350.140compare128K10K11.64.1
Nova Micro V1Amazon Nova0.0350.140compare128K10K11.64.1
Llama 3.1 8B InstructWeights & Biases0.0220.022compare128K128K11.34.9
Llama 3.1 8B Instruct MaasVertex AI (Llama)N/AN/Acompare128K2K11.34.9
Llama 3.1 8BVercel AI Gateway0.0500.080compare131K131K11.34.9
Meta Llama 3.1 8B Instruct TurboTogether AI0.1800.180compareN/AN/A11.34.9
Llama3.1 8BSnowflakeN/AN/Acompare128K8K11.34.9
Meta Llama 3.1 8B InstructSambaNova0.1000.200compare16K16K11.34.9
Llama 3.1 8B InstructPerplexity0.2000.200compare131K131K11.34.9
Llama 3.1 8B InstructOVHcloud0.1000.100compare131K131K11.34.9
Llama3.1OllamaN/AN/Acompare8K8K11.34.9
Llama 3.1 8B InstructNscale0.0300.030compareN/AN/A11.34.9
Llama 3.1 8B InstructNovita AI0.0200.050compare16K16K11.34.9
Llama3 1 8B Instruct V1AWS Bedrock0.2200.220compare128K2K11.34.9
Llama 3.1 8BLlamaGate0.0300.050compare131K8K11.34.9
Llama3.1 8B InstructLambda0.0250.040compare131K131K11.34.9
Meta Llama 3.1 8B InstructHyperbolic0.1200.300compare33K33K11.34.9
Llama 3.1 8B InstantGroq0.0500.080compare128K8K11.34.9
Meta Llama 3.1 8B InstructFriendliAI0.1000.100compare8K8K11.34.9
Llama V3p1 8B InstructFireworks AI0.1000.100compare16K16K11.34.9
Meta Llama 3.1 8B InstructDeepInfra0.0300.050compare131K131K11.34.9
Databricks Meta Llama 3 1 8B InstructDatabricks0.1500.450compare200K128K11.34.9
Llama3.1 8BCerebras0.1000.100compare128K128K11.34.9
Meta Llama 3.1 8B InstructAzure AI0.3000.610compare128K2K11.34.9
Llama 3 2 11B Vision InstructIBM watsonx0.3500.350compare128K128K10.94.3
Phi 4 Mini InstructWeights & Biases0.00800.035compare128K128K10.93.6
Llama 3.2 11BVercel AI Gateway0.1600.160compare128K8K10.94.3
Llama3 2 11B Instruct V1AWS Bedrock0.3500.350compare128K4K10.94.3
Llama3.2 11B Vision InstructLambda0.0150.025compare131K131K10.94.3
Llama V3p2 11B Vision InstructFireworks AI0.2000.200compare16K16K10.94.3
Llama 3.2 11B Vision InstructDeepInfra0.0490.049compare131K131K10.94.3
Phi 4 Mini InstructAzure AI0.0750.300compare131K4K10.93.6
Llama 3.2 11B Vision InstructAzure AI0.3700.370compare128K2K10.94.3
Granite 3 3 8B InstructIBM watsonx0.2000.200compare8K8K10.83.4
Granite 3.3 8B InstructReplicate0.0300.250compareN/AN/A10.83.4
Jamba 1.5 LargeVertex AI (AI21)2.008.00compare256K256K10.7N/A
Jamba 1.5 LargeSnowflakeN/AN/Acompare256K8K10.7N/A
Gemma3 4BLlamaGate0.0300.080compare128K8K10.72.9
Gemma 3 4B It GGUFLemonade (AMD)N/AN/Acompare128K8K10.72.9
Jamba Mini 1.7AI21 Labs0.2000.400compare256K256K10.73.1
Jamba 1.5 LargeAI21 Labs2.008.00compare256K256K10.7N/A
Gemma 3 4B ItAWS Bedrock0.0400.080compare128K8K10.72.9
Gemma 3 4B ItDeepInfra0.0400.080compare131K131K10.72.9
Jamba 1 5 Large V1AWS Bedrock2.008.00compare256K256K10.7N/A
DeepSeek Coder V2 BaseOllamaN/AN/Acompare8K8K10.6N/A
Hermes3 70BLambda0.1200.300compare131K131K10.6N/A
Jamba Large 1.6AI21 Labs2.008.00compare256K256K10.6N/A
Hermes 3 Llama 3.1 70BHyperbolic0.1200.300compare33K33K10.6N/A
Qwen3 1p7bFireworks AI0.1000.100compare131K131K10.62.3
DeepSeek Coder V2 InstructFireworks AI1.201.20compare66K66K10.6N/A
Hermes 3 Llama 3.1 70BDeepInfra0.3000.300compare131K131K10.6N/A
Claude 3 SonnetAnthropic (Vertex AI)3.0015.00compare200K4K10.3N/A
Gemma 3 27B ItNovita AI0.1190.200compare98K16K10.39.6
Gemma 3 27B ItAWS Bedrock0.2300.380compare128K8K10.39.6
Gemma 3 27B ItGoogle GeminiN/AN/Acompare131K8K10.39.6
Gemma 3 27B ItFireworks AI0.9000.900compare131K131K10.39.6
Gemma 3 27B ItDeepInfra0.0900.160compare131K131K10.39.6
Claude 3 Sonnet 20240229 V1AWS Bedrock3.0015.00compare200K4K10.3N/A
Llama3 70B Instruct MaasVertex AI (Llama)N/AN/Acompare32K32K10.26.8
Mistral SmallVercel AI Gateway0.1000.300compare32K4K10.2N/A
Llama 3 70BVercel AI Gateway0.5900.790compare8K8K10.26.8
Llama3 70BSnowflakeN/AN/Acompare8K8K10.26.8
Llama 3 70BReplicate0.6502.75compare8K8K10.26.8
Llama 3 70B InstructOpenRouter0.5900.790compare8KN/A10.26.8
Llama3:70BOllamaN/AN/Acompare8K8K10.26.8
Llama 3 70B InstructNovita AI0.5100.740compare8K8K10.26.8
Mistral SmallMistral AI0.1000.300compare32K8K10.2N/A
Meta Llama 3 70B InstructHyperbolic0.1200.300compare131K131K10.26.8
Llama V3 70B InstructFireworks AI0.9000.900compare8K8K10.26.8
Databricks Meta Llama 3 70B InstructDatabricks1.003.00compare128K128K10.26.8
Llama3 70B Instruct V1AWS Bedrock2.653.50compare8K8K10.26.8
Mistral SmallAzure AI1.003.00compare32K8K10.2N/A
Meta Llama 3 70B InstructAzure AI1.100.370compare8K2K10.26.8
Meta Llama 3 70B InstructAnyscale1.001.00compare8K8K10.26.8
Gemini 1.0 UltraGoogle Vertex AI0.5001.50compare8K2K10.117.6
Phi 3 Mini 128K InstructFireworks AI0.1000.100compare131K131K10.13.0
Phi 3 Mini 128K InstructAzure AI0.1300.520compare128K4K10.13.0
Qwen2.5 Coder 7B InstructNscale0.0100.030compareN/AN/A10.0N/A
Qwen2.5 Coder 7BLlamaGate0.0600.120compare33K8K10.0N/A
Qwen2p5 Coder 7BFireworks AI0.2000.200compare33K33K10.0N/A
Phi 4 Multimodal InstructAzure AI0.0800.320compare131K4K10.0N/A
Mistral LargeIBM watsonx3.0010.00compare131K16K9.9N/A
Mistral Large@latestVertex AI (Mistral)2.006.00compare128K8K9.9N/A
Mistral LargeVercel AI Gateway2.006.00compare32K4K9.9N/A
Mistral LargeSnowflakeN/AN/Acompare32K8K9.9N/A
Mistral LargeOpenRouter8.0024.00compare32KN/A9.9N/A
Mistral LargeMistral AI2.006.00compare128K128K9.9N/A
Mistral LargeAzure OpenAI8.0024.00compare32KN/A9.9N/A
Mistral LargeAzure AI2.006.00compare128K4K9.9N/A
Mixtral 8x22B InstructVercel AI Gateway1.201.20compare66K2K9.8N/A
Mixtral 8x22B InstructOpenRouter0.6500.650compare66KN/A9.8N/A
Open Mixtral 8x22BMistral AI2.006.00compare65K8K9.8N/A
Mixtral 8x22BFireworks AI1.201.20compare66K66K9.8N/A
Llama 3 2 3B InstructIBM watsonx0.1500.150compare128K128K9.7N/A
Llama 3.2 3BVercel AI Gateway0.1500.150compare128K8K9.7N/A
Llama 3.2 3B Instruct TurboTogether AIN/AN/AcompareN/AN/A9.7N/A
Llama3.2 3BSnowflakeN/AN/Acompare128K8K9.7N/A
Meta Llama 3.2 3B InstructSambaNova0.0800.160compare4K4K9.7N/A
Meta Textgeneration Llama 2 7BAWS SageMakerN/AN/Acompare4K4K9.7N/A
Llama 2 7BReplicate0.0500.250compare4K4K9.7N/A
Llama2:7BOllamaN/AN/Acompare4K4K9.7N/A
Llama 3.2 3B InstructNovita AI0.0300.050compare33K32K9.7N/A
Llama3 2 3B Instruct V1AWS Bedrock0.1500.150compare128K4K9.7N/A
Llama 3.2 3BLlamaGate0.0400.080compare131K8K9.7N/A
Llama3.2 3B InstructLambda0.0150.025compare131K131K9.7N/A
Llama 3.2 3B InstructHyperbolic0.1200.300compare33K33K9.7N/A
Qwen3 0p6bFireworks AI0.1000.100compare41K41K9.71.4
Llama V3p2 3BFireworks AI0.1000.100compare131K131K9.7N/A
Llama V2 7BFireworks AI0.2000.200compare4K4K9.7N/A
Llama 3.2 3B InstructDeepInfra0.0200.020compare131K131K9.7N/A
Llama 2 7B Chat Fp16Cloudflare Workers AI1.921.92compare3K3K9.7N/A
Llama 2 7B Chat HfAnyscale0.1500.150compare4K4K9.7N/A
Claude 3 HaikuAnthropic (Vertex AI)0.2501.25compare200K4K9.36.7
Claude 3 HaikuVercel AI Gateway0.2501.25compare200K4K9.36.7
Claude 3 HaikuOpenRouter0.2501.25compare200KN/A9.36.7
Claude 3 HaikuAnthropic0.2501.25compare200K4K9.36.7
Claude V2AWS Bedrock8.0024.00compare100K8K9.314.0
Claude 3 Haiku 20240307 V1AWS Bedrock0.2501.25compare200K4K9.36.7
Llama 3 2 1B InstructIBM watsonx0.1000.100compare128K128K9.10.6
Llama 3.2 1BVercel AI Gateway0.1000.100compare128K8K9.10.6
Llama3.2 1BSnowflakeN/AN/Acompare128K8K9.10.6
Meta Llama 3.2 1B InstructSambaNova0.0400.080compare16K16K9.10.6
DeepSeek R1 Distill Qwen 1.5BNscale0.0900.090compareN/AN/A9.1N/A
Llama3 2 1B Instruct V1AWS Bedrock0.1000.100compare128K4K9.10.6
Llama V3p2 1BFireworks AI0.1000.100compare131K131K9.10.6
DeepSeek R1 Distill Qwen 1p5bFireworks AI0.1000.100compare131K131K9.1N/A
GPT-3.5 TurboVercel AI Gateway0.5001.50compare16K4K9.010.7
GPT-3.5 TurboOpenRouter1.502.00compare4KN/A9.010.7
Mistral MediumMistral AI0.4002.00compare131K8K9.0N/A
Mistral Small 2402 V1AWS Bedrock1.003.00compare32K8K9.0N/A
GPT-3.5 TurboGitHub CopilotN/AN/Acompare16K4K9.010.7
Ft:gpt 3.5 TurboOpenAI3.006.00compare16K4K9.010.7
GPT-3.5-turbo-instruct-0914Microsoft Azure1.502.00compare4KN/A9.010.7
GPT-3.5 TurboAzure OpenAI0.5001.50compare4K4K9.010.7
Snowflake ArcticSnowflakeN/AN/Acompare4K8K8.8N/A
Lfm 40BLambda0.1000.200compare131K131K8.8N/A
Qwen2 72B InstructFireworks AI0.9000.900compare33K33K8.8N/A
Llama3 8B Instruct MaasVertex AI (Llama)N/AN/Acompare32K32K8.74.0
Llama 3 8BVercel AI Gateway0.0500.080compare8K8K8.74.0
Llama3 8BSnowflakeN/AN/Acompare8K8K8.74.0
Llama 3 8BReplicate0.0500.250compare8K8K8.74.0
Llama3OllamaN/AN/Acompare8K8K8.74.0
Llama 3 8B InstructNovita AI0.0400.040compare8K8K8.74.0
Llama3 8B InstructGradient AI0.2000.200compare512N/A8.74.0
Llama V3 8BFireworks AI0.2000.200compare8K8K8.74.0
Meta Llama 3 8B InstructDeepInfra0.0300.060compare8K8K8.74.0
Llama3 8B Instruct V1AWS Bedrock0.3000.600compare8K8K8.74.0
Meta Llama 3 8B InstructAnyscale0.1500.150compare8K8K8.74.0
DeepSeek Coder V2 Lite BaseOllamaN/AN/Acompare8K8K8.5N/A
Gemini proGoogle Gemini0.3501.05compare33K8K8.5N/A
Gemini 1.0 ProGoogle Vertex AI0.5001.50compare33K8K8.5N/A
DeepSeek Coder V2 Lite BaseFireworks AI0.5000.500compare164K164K8.5N/A
Llama2 70B ChatSnowflakeN/AN/Acompare4K8K8.4N/A
Meta Textgeneration Llama 2 70BAWS SageMakerN/AN/Acompare4K4K8.4N/A
Meta Textgeneration Llama 2 13BAWS SageMakerN/AN/Acompare4K4K8.4N/A
Llama 2 70BReplicate0.6502.75compare4K4K8.4N/A
Llama 2 13BReplicate0.1000.500compare4K4K8.4N/A
Llama 2 70B ChatPerplexity0.7002.80compare4K4K8.4N/A
Llama2:70BOllamaN/AN/Acompare4K4K8.4N/A
Llama2:13BOllamaN/AN/Acompare4K4K8.4N/A
Llama2 70B Chat V1AWS Bedrock1.952.56compare4K4K8.4N/A
Llama2 13B Chat V1AWS Bedrock0.7501.00compare4K4K8.4N/A
Llama V2 70BFireworks AI0.1000.100compare4K4K8.4N/A
Llama V2 13BFireworks AI0.2000.200compare4K4K8.4N/A
Databricks Llama 2 70B ChatDatabricks0.5001.50compare4K4K8.4N/A
Llama 2 70B Chat HfAnyscale1.001.00compare4K4K8.4N/A
Llama 2 13B Chat HfAnyscale0.2500.250compare4K4K8.4N/A
Command R+Vercel AI Gateway2.5010.00compare128K4K8.3N/A
Command PlusOracle Cloud (OCI)1.561.56compare128K4K8.3N/A
Openchat 3p5 0106 7BFireworks AI0.2000.200compare8K8K8.3N/A
Dbrx InstructFireworks AI1.201.20compare33K33K8.3N/A
Command R+Cohere2.5010.00compare128K4K8.3N/A
Command R Plus V1AWS Bedrock3.0015.00compare128K4K8.3N/A
Command R+Azure OpenAI3.0015.00compare128K4K8.3N/A
Jamba 1.5 MiniVertex AI (AI21)0.2000.400compare256K256K8.0N/A
Jamba 1.5 MiniSnowflakeN/AN/Acompare256K8K8.0N/A
Jamba 1.5 MiniAI21 Labs0.2000.400compare256K256K8.0N/A
Jamba 1 5 Mini V1AWS Bedrock0.2000.400compare256K256K8.0N/A
Jamba Mini 1.6AI21 Labs0.2000.400compare256K256K7.9N/A
Mixtral 8x7BSnowflakeN/AN/Acompare32K8K7.7N/A
Mixtral 8x7B InstructPerplexity0.0700.280compare4K4K7.7N/A
Open Mixtral 8x7BMistral AI0.7000.700compare32K8K7.7N/A
Mixtral 8x7BFireworks AI0.5000.500compare33K33K7.7N/A
Command RVercel AI Gateway0.1500.600compare128K4K7.4N/A
Qwen 3 14BVercel AI Gateway0.0800.240compare41K16K7.4N/A
Mistral 7BSnowflakeN/AN/Acompare32K8K7.4N/A
Mistral 7B InstructPerplexity0.0700.280compare4K4K7.4N/A
Mistral 7B InstructOpenRouter0.1300.130compare8KN/A7.4N/A
MistralOllamaN/AN/Acompare8K8K7.4N/A
Open Mistral 7BMistral AI0.2500.250compare32K8K7.4N/A
Qwen3 14BFireworks AI0.2000.200compare41K41K7.4N/A
Mistral 7BFireworks AI0.2000.200compare33K33K7.4N/A
Qwen3 14BDeepInfra0.0600.240compare41K41K7.4N/A
Command RCohere0.1500.600compare128K4K7.4N/A
Command R V1AWS Bedrock0.5001.50compare128K4K7.4N/A
Claude Instant V1AWS Bedrock0.8002.40compare100K8K7.47.8
Glm 4.5 XZ AI (Zhipu)2.208.90compare128K32KN/AN/A
Glm 4.5 FlashZ AI (Zhipu)N/AN/Acompare128K32KN/AN/A
Glm 4.5 AirxZ AI (Zhipu)1.104.50compare128K32KN/AN/A
Glm 4 32B 0414 128KZ AI (Zhipu)0.1000.100compare128K32KN/AN/A
Grok Vision BetaxAI5.0015.00compare8K8KN/AN/A
Grok Code Fast 1 0825xAI0.2001.50compare256K256KN/AN/A
Grok Code FastxAI0.2001.50compare256K256KN/AN/A
Grok 4 0709xAI3.0015.00compare256K256KN/AN/A
Grok 3 Mini FastxAI0.6004.00compare131K131KN/AN/A
Grok 3 FastxAI5.0025.00compare131K131KN/AN/A
Grok 2 VisionxAI2.0010.00compare33K33KN/AN/A
Grok 2xAI2.0010.00compare131K131KN/AN/A
Allam 1 13B InstructIBM watsonx1.801.80compare8K8KN/AN/A
Pixtral 12B 2409IBM watsonx0.3500.350compare128K128KN/AN/A
Mistral Small 2503IBM watsonx0.1000.300compare32K32KN/AN/A
Mistral Medium 2505IBM watsonx3.0010.00compare128K128KN/AN/A
Llama Guard 3 11B VisionIBM watsonx0.3500.350compare128K128KN/AN/A
Llama 4 Maverick 17BIBM watsonx0.3501.40compare128K128KN/AN/A
Granite Vision 3 2 2BIBM watsonx0.1000.100compare8K8KN/AN/A
Granite Ttm 512 96 R2IBM watsonx0.3800.380compare512512N/AN/A
Granite Ttm 1536 96 R2IBM watsonx0.3800.380compare512512N/AN/A
Granite Ttm 1024 96 R2IBM watsonx0.3800.380compare512512N/AN/A
Granite Guardian 3 3 8BIBM watsonx0.2000.200compare8K8KN/AN/A
Granite Guardian 3 2 2BIBM watsonx0.1000.100compare8K8KN/AN/A
Granite 13B Chat V2IBM watsonx0.6000.600compare8K8KN/AN/A
Flan T5 Xl 3BIBM watsonx0.6000.600compare8K8KN/AN/A
JAIS 13B ChatIBM watsonx500.000.0020compare8K8KN/AN/A
Mt0 Xxl 13BIBM watsonx500.000.0020compare8K8KN/AN/A
Qwen3 235B A22B Thinking 2507Weights & Biases0.0100.010compare262K262KN/AN/A
Llama 4 Scout 17B 16E InstructWeights & Biases0.0170.066compare64K64KN/AN/A
DeepSeek R1 0528Weights & Biases0.1350.540compare161K161KN/AN/A
Glm 5 MaasVertex AI (Z AI)1.003.20compare200K128KN/A44.2
Mistral Small 2503Vertex AI (Mistral)1.003.00compare128K128KN/AN/A
Mistral Nemo@latestVertex AI (Mistral)0.1500.150compare128K128KN/AN/A
Mistral NemoVertex AI (Mistral)3.003.00compare128K128KN/AN/A
Mistral Large 2411Vertex AI (Mistral)2.006.00compare128K8KN/AN/A
Llama3 405B Instruct MaasVertex AI (Llama)N/AN/Acompare32K32KN/AN/A
Llama 4 Scout 17B 16e Instruct MaasVertex AI (Llama)0.2500.700compare10.0M10.0MN/AN/A
Llama 4 Scout 17B 128e Instruct MaasVertex AI (Llama)0.2500.700compare10.0M10.0MN/AN/A
Llama 4 Maverick 17B 16e Instruct MaasVertex AI (Llama)0.3501.15compare1.0M1.0MN/AN/A
Llama 4 Maverick 17B 128e Instruct MaasVertex AI (Llama)0.3501.15compare1.0M1.0MN/AN/A
Jamba 1.5Vertex AI (AI21)0.2000.400compare256K256KN/AN/A
Gemini 3.1 Pro PreviewGoogle Vertex AI2.0012.00compare1.0M66KN/A55.5
DeepSeek R1 0528 MaasVertex AI (DeepSeek)1.355.40compare65K8KN/AN/A
Codestral @latestVertex AI (Mistral)0.2000.600compare128K128KN/AN/A
CodestralVertex AI (Mistral)0.2000.600compare128K128KN/AN/A
Codestral 2501Vertex AI (Mistral)0.2000.600compare128K128KN/AN/A
Codestral 2Vertex AI (Mistral)0.3000.900compare128K128KN/AN/A
Claude Sonnet 4.6Anthropic (Vertex AI)3.0015.00compare200K64KN/A46.4
Claude Opus 4.6Anthropic (Vertex AI)5.0025.00compare1.0M128KN/A47.6
Grok 3 Mini FastVercel AI Gateway0.6004.00compare131K131KN/AN/A
Grok 3 MiniVercel AI Gateway0.3000.500compare131K131KN/AN/A
Grok 3 FastVercel AI Gateway5.0025.00compare131K131KN/AN/A
Grok 2 VisionVercel AI Gateway2.0010.00compare33K33KN/AN/A
Grok 2Vercel AI Gateway2.0010.00compare131K4KN/AN/A
V0 1.5 MdVercel AI Gateway3.0015.00compare128K33KN/AN/A
V0 1.0 MdVercel AI Gateway3.0015.00compare128K32KN/AN/A
Morph V3 LargeVercel AI Gateway0.9001.90compare33K16KN/AN/A
Morph V3 FastVercel AI Gateway0.8001.20compare33K16KN/AN/A
Pixtral LargeVercel AI Gateway2.006.00compare128K4KN/AN/A
Pixtral 12BVercel AI Gateway0.1500.150compare128K4KN/AN/A
Mistral EmbedVercel AI Gateway0.100N/AcompareN/AN/AN/AN/A
Ministral 8BVercel AI Gateway0.1000.100compare128K4KN/AN/A
Ministral 3BVercel AI Gateway0.0400.040compare128K4KN/AN/A
Codestral EmbedVercel AI Gateway0.150N/AcompareN/AN/AN/AN/A
CodestralVercel AI Gateway0.3000.900compare256K4KN/AN/A
Mercury Coder SmallVercel AI Gateway0.2501.00compare32K16KN/AN/A
Gemma 2 9BVercel AI Gateway0.2000.200compare8K8KN/AN/A
Embed V4.0Vercel AI Gateway0.120N/AcompareN/AN/AN/AN/A
Claude Opus 4.6Vercel AI Gateway5.0025.00compare200K64KN/A47.6
Titan Embed Text V2Vercel AI Gateway0.020N/AcompareN/AN/AN/AN/A
Qwen3 CoderVercel AI Gateway0.4001.60compare262K67KN/AN/A
V0 1.5 Mdv0 (Vercel)3.0015.00compare128K128KN/AN/A
V0 1.5 Lgv0 (Vercel)15.0075.00compare512K512KN/AN/A
V0 1.0 Mdv0 (Vercel)3.0015.00compare128K128KN/AN/A
Us.writer.palmyra X5 V1AWS Bedrock0.6006.00compare1.0M8KN/AN/A
Us.writer.palmyra X4 V1AWS Bedrock2.5010.00compare128K8KN/AN/A
Together Ai Up To 4BTogether AI0.1000.100compareN/AN/AN/AN/A
Together Ai 81.1B 110BTogether AI1.801.80compareN/AN/AN/AN/A
Together Ai 8.1B 21BTogether AI0.3000.300compare1KN/AN/AN/A
Together Ai 41.1B 80BTogether AI0.9000.900compareN/AN/AN/AN/A
Together Ai 4.1B 8BTogether AI0.2000.200compareN/AN/AN/AN/A
Together Ai 21.1B 41BTogether AI0.8000.800compareN/AN/AN/AN/A
CodeLlama 34B InstructTogether AIN/AN/AcompareN/AN/AN/AN/A
Qwen3 235B A22B Thinking 2507Together AI0.6503.00compare256KN/AN/AN/A
Qwen2.5 7B Instruct TurboTogether AIN/AN/AcompareN/AN/AN/AN/A
Mixtral 8x7B Instruct V0.1Together AI0.6000.600compareN/AN/AN/AN/A
Mistral Small 24B Instruct 2501Together AIN/AN/AcompareN/AN/AN/AN/A
Mistral 7B Instruct V0.1Together AIN/AN/AcompareN/AN/AN/AN/A
Llama 4 Scout 17B 16E InstructTogether AI0.1800.590compareN/AN/AN/AN/A
Llama 4 Maverick 17B 128E Instruct FP8Together AI0.2700.850compareN/AN/AN/AN/A
DeepSeek R1 0528 TputTogether AI0.5502.19compare128KN/AN/AN/A
Text UnicornGoogle Vertex AI10.0028.00compare8K1KN/AN/A
Text UnicornGoogle Vertex AI10.0028.00compare8K1KN/AN/A
Text Bison32kGoogle Vertex AI0.1250.125compare8K1KN/AN/A
Text Bison32kGoogle Vertex AI0.1250.125compare8K1KN/AN/A
Text BisonGoogle Vertex AIN/AN/Acompare8K1KN/AN/A
Text BisonGoogle Vertex AIN/AN/Acompare8K1KN/AN/A
Text BisonGoogle Vertex AIN/AN/Acompare8K2KN/AN/A
Reka CoreSnowflakeN/AN/Acompare32K8KN/AN/A
Jamba InstructSnowflakeN/AN/Acompare256K8KN/AN/A
Gemma 7BSnowflakeN/AN/Acompare8K8KN/AN/A
Sarvam MSarvam AIN/AN/Acompare8K32KN/AN/A
Qwen2 Audio 7B InstructSambaNova0.500100.00compare4K4KN/AN/A
Meta Llama Guard 3 8BSambaNova0.3000.300compare16K16KN/AN/A
Llama 4 Scout 17B 16E InstructSambaNova0.4000.700compare8K8KN/AN/A
Llama 4 Maverick 17B 128E InstructSambaNova0.6301.80compare131K131KN/AN/A
Mixtral 8x7B Instruct V0.1Replicate0.3001.00compare4K4KN/AN/A
Mistral 7B V0.1Replicate0.0500.250compare4K4KN/AN/A
Mistral 7B Instruct V0.2Replicate0.0500.250compare4K4KN/AN/A
Apertus 8B InstructPublic AIN/AN/Acompare8K4KN/AN/A
Apertus 70B InstructPublic AIN/AN/Acompare8K4KN/AN/A
Salamandra 7B Instruct Tools 16KPublic AIN/AN/Acompare16K4KN/AN/A
ALIA 40B Instruct Q8 0Public AIN/AN/Acompare8K4KN/AN/A
Qwen SEA LION V4 32B ITPublic AIN/AN/Acompare33K4KN/AN/A
Gemma SEA LION V4 27B ITPublic AIN/AN/Acompare8K4KN/AN/A
Sonar Small ChatPerplexity0.0700.280compare16K16KN/AN/A
Sonar Medium ChatPerplexity0.6001.80compare16K16KN/AN/A
Sonar Deep ResearchPerplexity2.008.00compare128KN/AN/AN/A
Pplx 7B ChatPerplexity0.0700.280compare8K8KN/AN/A
Pplx 70B ChatPerplexity0.7002.80compare4K4KN/AN/A
Llama 3.1 Sonar Small 128K ChatPerplexity0.2000.200compare131K131KN/AN/A
Llama 3.1 Sonar Large 128K ChatPerplexity1.001.00compare131K131KN/AN/A
Llama 3.1 Sonar Huge 128K OnlinePerplexity5.005.00compare127K127KN/AN/A
Codellama 70B InstructPerplexity0.7002.80compare16K16KN/AN/A
Codellama 34B InstructPerplexity0.3501.40compare16K16KN/AN/A
Text Bison 001Google PaLM0.1250.125compare8K1KN/AN/A
Text BisonGoogle PaLM0.1250.125compare8K1KN/AN/A
Chat Bison 001Google PaLM0.1250.125compare8K4KN/AN/A
Chat BisonGoogle PaLM0.1250.125compare8K4KN/AN/A
Qwen2.5 VL 72B InstructOVHcloud0.9100.910compare32K32KN/AN/A
Mixtral 8x7B Instruct V0.1OVHcloud0.6300.630compare32K32KN/AN/A
Mistral Nemo Instruct 2407OVHcloud0.1300.130compare118K118KN/AN/A
Mistral 7B Instruct V0.3OVHcloud0.1000.100compare127K127KN/AN/A
Mamba Codestral 7B V0.1OVHcloud0.1900.190compare256K256KN/AN/A
Llava V1.6 Mistral 7B HfOVHcloud0.2900.290compare32K32KN/AN/A
Remm Slerp L2 13BOpenRouter1.881.88compare6KN/AN/AN/A
RouterOpenRouter0.8503.40compare131K131KN/AN/A
Qwen3 CoderOpenRouter0.2200.950compare262K262KN/AN/A
Qwen3 235B A22b Thinking 2507OpenRouter0.1100.600compare262K262KN/AN/A
Qwen Vl PlusOpenRouter0.2100.630compare8K2KN/AN/A
GPT-5.2-proOpenRouter21.00168.00compare272K128KN/AN/A
GPT-5.2-codexOpenRouter1.7514.00compare272K128KN/A43.0
Mistral Large 2512OpenRouter0.5001.50compare262K262KN/AN/A
Devstral 2512OpenRouter0.1500.600compare262K66KN/AN/A
Minimax M2.1OpenRouter0.2701.20compare204K64KN/AN/A
WeaverOpenRouter5.635.63compare8KN/AN/AN/A
Mythomax L2 13BOpenRouter1.881.88compare8KN/AN/AN/A
DeepSeek R1 0528OpenRouter0.5002.15compare65K8KN/AN/A
DeepSeek Chat V3.1OpenRouter0.2000.800compare164K164KN/AN/A
DeepSeek Chat V3 0324OpenRouter0.1400.280compare66K8KN/AN/A
DeepSeek ChatOpenRouter0.1400.280compare66K8KN/AN/A
Ui Tars 1.5 7BOpenRouter0.1000.200compare131K2KN/AN/A
ContainerOpenAIN/AN/AcompareN/AN/AN/AN/A
GPT-oss-safeguard-20bAWS Bedrock0.0700.200compare128K8KN/AN/A
GPT-oss-safeguard-120bAWS Bedrock0.1500.600compare128K8KN/AN/A
VicunaOllamaN/AN/Acompare2K2KN/AN/A
Orca MiniOllamaN/AN/Acompare4K4KN/AN/A
Mixtral 8x7B Instruct V0.1OllamaN/AN/Acompare33K33KN/AN/A
Mixtral 8x22B Instruct V0.1OllamaN/AN/Acompare66K66KN/AN/A
Mistral 7B Instruct V0.2OllamaN/AN/Acompare33K33KN/AN/A
Mistral 7B Instruct V0.1OllamaN/AN/Acompare8K8KN/AN/A
Llama2OllamaN/AN/Acompare4K4KN/AN/A
Internlm2 5 20B ChatOllamaN/AN/Acompare33K8KN/AN/A
CodellamaOllamaN/AN/Acompare4K4KN/AN/A
CodegemmaOllamaN/AN/Acompare8K8KN/AN/A
Codegeex4OllamaN/AN/Acompare33K8KN/AN/A
Xai.grok 3 Mini FastOracle Cloud (OCI)0.6004.00compare131K131KN/AN/A
Xai.grok 3 MiniOracle Cloud (OCI)0.3000.500compare131K131KN/AN/A
Xai.grok 3 FastOracle Cloud (OCI)5.0025.00compare131K131KN/AN/A
Llama 4 Scout 17B 16e InstructOracle Cloud (OCI)0.7200.720compare192K4KN/AN/A
Llama 4 Maverick 17B 128e Instruct Fp8Oracle Cloud (OCI)0.7200.720compare512K4KN/AN/A
CommandOracle Cloud (OCI)1.561.56compare128K4KN/AN/A
Command AOracle Cloud (OCI)1.561.56compare256K4KN/AN/A
Qwen2.5 Coder 3B InstructNscale0.0100.030compareN/AN/AN/AN/A
Mixtral 8x22B Instruct V0.1Nscale0.6000.600compareN/AN/AN/AN/A
Llama 4 Scout 17B 16E InstructNscale0.0900.290compareN/AN/AN/AN/A
DeepSeek R1 Distill Qwen 7BNscale0.2000.200compareN/AN/AN/AN/A
Autoglm Phone 9B MultilingualNovita AI0.0350.138compare66K66KN/AN/A
R1v4 LiteNovita AI0.2000.600compare262K66KN/AN/A
L31 70B Euryale V2.2Novita AI1.481.48compare8K8KN/AN/A
L3 8B Stheno V3.2Novita AI0.0500.050compare8K32KN/AN/A
L3 8B LunarisNovita AI0.0500.050compare8K8KN/AN/A
L3 70B Euryale V2.1Novita AI1.481.48compare8K8KN/AN/A
Qwen2.5 Vl 72B InstructNovita AI0.8000.800compare33K33KN/AN/A
Qwen2.5 7B InstructNovita AI0.0700.070compare32K32KN/AN/A
Qwen Mt PlusNovita AI0.2500.750compare16K8KN/AN/A
Paddleocr VlNovita AI0.0200.020compare16K16KN/AN/A
Hermes 2 Pro Llama 3 8BNovita AI0.1400.140compare8K8KN/AN/A
Mistral NemoNovita AI0.0400.170compare60K16KN/AN/A
Minimax M2.1Novita AI0.3001.20compare205K131KN/AN/A
Wizardlm 2 8x22BNovita AI0.6200.620compare66K8KN/AN/A
Llama 4 Scout 17B 16e InstructNovita AI0.1800.590compare131K131KN/AN/A
Llama 4 Maverick 17B 128e Instruct Fp8Novita AI0.2700.850compare1.0M8KN/AN/A
Mythomax L2 13BNovita AI0.0900.090compare4K3KN/AN/A
DeepSeek R1 TurboNovita AI0.7002.50compare64K16KN/AN/A
DeepSeek R1 0528Novita AI0.7002.50compare164K33KN/AN/A
DeepSeek Prover V2 671BNovita AI0.7002.50compare160K160KN/AN/A
DeepSeek OCRNovita AI0.0300.030compare8K8KN/AN/A
ERNIE 4.5 Vl 424B A47bNovita AI0.4201.25compare123K16KN/AN/A
ERNIE 4.5 Vl 28B A3bNovita AI0.1400.560compare30K8KN/AN/A
ERNIE 4.5 21B A3bNovita AI0.0700.280compare120K8KN/AN/A
Baichuan M2 32BNovita AI0.0700.070compare131K131KN/AN/A
Morph V3 LargeMorph0.9001.90compare16K16KN/AN/A
Morph V3 FastMorph0.8001.20compare16K16KN/AN/A
Moonshot V1 AutoMoonshot AI (Kimi)2.005.00compare131K131KN/AN/A
Moonshot V1 8K Vision PreviewMoonshot AI (Kimi)0.2002.00compare8K8KN/AN/A
Moonshot V1 8K 0430Moonshot AI (Kimi)0.2002.00compare8K8KN/AN/A
Moonshot V1 8KMoonshot AI (Kimi)0.2002.00compare8K8KN/AN/A
Moonshot V1 32K Vision PreviewMoonshot AI (Kimi)1.003.00compare33K33KN/AN/A
Moonshot V1 32K 0430Moonshot AI (Kimi)1.003.00compare33K33KN/AN/A
Moonshot V1 32KMoonshot AI (Kimi)1.003.00compare33K33KN/AN/A
Moonshot V1 128K Vision PreviewMoonshot AI (Kimi)2.005.00compare131K131KN/AN/A
Moonshot V1 128K 0430Moonshot AI (Kimi)2.005.00compare131K131KN/AN/A
Moonshot V1 128KMoonshot AI (Kimi)2.005.00compare131K131KN/AN/A
Kimi Thinking PreviewMoonshot AI (Kimi)0.6002.50compare131K131KN/AN/A
Kimi Latest 8KMoonshot AI (Kimi)0.2002.00compare8K8KN/AN/A
Kimi Latest 32KMoonshot AI (Kimi)1.003.00compare33K33KN/AN/A
Kimi Latest 128KMoonshot AI (Kimi)2.005.00compare131K131KN/AN/A
KimiMoonshot AI (Kimi)2.005.00compare131K131KN/AN/A
Kimi K2 Turbo PreviewMoonshot AI (Kimi)1.158.00compare262K262KN/AN/A
Kimi K2 Thinking TurboMoonshot AI (Kimi)1.158.00compare262K262KN/AN/A
Kimi K2 0711 PreviewMoonshot AI (Kimi)0.6002.50compare131K131KN/AN/A
Pixtral LargeMistral AI2.006.00compare128K128KN/AN/A
Pixtral 12B 2409Mistral AI0.1500.150compare128K128KN/AN/A
Open Mistral Nemo 2407Mistral AI0.3000.300compare128K128KN/AN/A
Open Mistral NemoMistral AI0.3000.300compare128K128KN/AN/A
Mistral TinyMistral AI0.2500.250compare32K8KN/AN/A
Mistral Medium 2505Mistral AI0.4002.00compare131K8KN/AN/A
Mistral Medium 2312Mistral AI2.708.10compare32K8KN/AN/A
Mistral Large 2411Mistral AI2.006.00compare128K128KN/AN/A
Mistral Large 2402Mistral AI4.0012.00compare32K8KN/AN/A
Magistral Small 2506Mistral AI0.5001.50compare40K40KN/AN/A
Magistral Medium 2506Mistral AI2.005.00compare40K40KN/AN/A
Labs Devstral Small 2512Mistral AI0.1000.300compare256K256KN/AN/A
Devstral Small 2507Mistral AI0.1000.300compare128K128KN/AN/A
Devstral Medium 2507Mistral AI0.4002.00compare128K128KN/AN/A
Devstral 2512Mistral AI0.4002.00compare256K256KN/AN/A
Codestral MambaMistral AI0.2500.250compare256K256KN/AN/A
CodestralMistral AI1.003.00compare32K8KN/AN/A
Codestral 2508Mistral AI0.3000.900compare256K256KN/AN/A
Codestral 2405Mistral AI1.003.00compare32K8KN/AN/A
Voxtral Small 24B 2507AWS Bedrock0.1000.300compare128K8KN/AN/A
Voxtral Mini 3B 2507AWS Bedrock0.0400.040compare128K8KN/AN/A
MiniMax M2.5 LightningMiniMax0.3002.40compare1.0M8KN/AN/A
MiniMax M2.1 LightningMiniMax0.3002.40compare1.0M8KN/AN/A
MiniMax M2.1MiniMax0.3001.20compare1.0M8KN/AN/A
Llama4 Scout 17B Instruct V1AWS Bedrock0.1700.660compare128K4KN/AN/A
Llama4 Maverick 17B Instruct V1AWS Bedrock0.2400.970compare128K4KN/AN/A
Llama 4 Scout 17B 16E Instruct FP8Meta LlamaN/AN/Acompare10.0M4KN/AN/A
Llama 4 Maverick 17B 128E Instruct FP8Meta LlamaN/AN/Acompare1.0M4KN/AN/A
Llama 3.3 8B InstructMeta LlamaN/AN/Acompare128K4KN/AN/A
Medlm MediumGoogle Vertex AIN/AN/Acompare33K8KN/AN/A
Medlm LargeGoogle Vertex AIN/AN/Acompare8K1KN/AN/A
Luminous Supreme ControlAleph Alpha218.75240.63compare2KN/AN/AN/A
Luminous SupremeAleph Alpha175.00192.50compare2KN/AN/AN/A
Luminous Extended ControlAleph Alpha56.2561.88compare2KN/AN/AN/A
Luminous ExtendedAleph Alpha45.0049.50compare2KN/AN/AN/A
Luminous Base ControlAleph Alpha37.5041.25compare2KN/AN/AN/A
Luminous BaseAleph Alpha30.0033.00compare2KN/AN/AN/A
Openthinker 7BLlamaGate0.0800.150compare33K8KN/AN/A
Mistral 7B V0.3LlamaGate0.1000.150compare33K8KN/AN/A
Llava 7BLlamaGate0.1000.200compare4K2KN/AN/A
Dolphin3 8BLlamaGate0.0800.150compare128K8KN/AN/A
DeepSeek R1 8BLlamaGate0.1000.200compare66K16KN/AN/A
DeepSeek R1 7B QwenLlamaGate0.0800.150compare131K16KN/AN/A
DeepSeek Coder 6.7BLlamaGate0.0600.120compare16K4KN/AN/A
Codellama 7BLlamaGate0.0600.120compare16K4KN/AN/A
Llama 4 Scout 17B 16e InstructLambda0.0500.100compare16K8KN/AN/A
Llama 4 Maverick 17B 128e Instruct Fp8Lambda0.0500.100compare131K8KN/AN/A
Lfm 7BLambda0.0250.040compare131K131KN/AN/A
Hermes3 8BLambda0.0250.040compare131K131KN/AN/A
DeepSeek R1 671BLambda0.8000.800compare131K131KN/AN/A
DeepSeek R1 0528Lambda0.2000.600compare131K131KN/AN/A
Jamba 1.5AI21 Labs0.2000.400compare256K256KN/AN/A
J2 UltraAI21 Labs15.0015.00compare8K8KN/AN/A
J2 MidAI21 Labs10.0010.00compare8K8KN/AN/A
J2 LightAI21 Labs3.003.00compare8K8KN/AN/A
DeepSeek R1 0528Hyperbolic0.2500.250compare131K131KN/AN/A
GPT-oss-safeguard-20bGroq0.0750.300compare131K66KN/AN/A
Llama Guard 4 12BGroq0.2000.200compare8K8KN/AN/A
Llama 4 Scout 17B 16e InstructGroq0.1100.340compare131K8KN/AN/A
Llama 4 Maverick 17B 128e InstructGroq0.2000.600compare131K8KN/AN/A
Gemma 7B ItGroq0.0500.080compare8K8KN/AN/A
Mistral Nemo Instruct 2407Gradient AI0.3000.300compare512N/AN/AN/A
GPT-realtime miniOpenAI0.6002.40compare128K4KN/AN/A
GPT-realtimeOpenAI4.0016.00compare32K4KN/AN/A
GPT-audio miniOpenAI0.6002.40compare128K16KN/AN/A
GPT-audioOpenAI2.5010.00compare128K16KN/AN/A
GPT-5-search-apiOpenAI1.2510.00compare272K128KN/AN/A
GPT-4o-realtime PreviewOpenAI5.0020.00compare128K4KN/AN/A
GPT-4o-mini-search PreviewOpenAI0.1500.600compare128K16KN/AN/A
GPT-4o-mini-realtime PreviewOpenAI0.6002.40compare128K4KN/AN/A
GPT-4o-mini-audio PreviewOpenAI0.1500.600compare128K16KN/AN/A
GPT-4o-audio PreviewOpenAI2.5010.00compare128K16KN/AN/A
GPT-4-32k-0613OpenAI60.00120.00compare33K4KN/AN/A
GPT-4-32k-0314OpenAI60.00120.00compare33K4KN/AN/A
GPT-4-1106 PreviewOpenAI10.0030.00compare128K4KN/AN/A
GPT-4OpenAI30.0060.00compare8K4KN/AN/A
MiniMax M2.1GMI Cloud0.3001.20compare197K16KN/AN/A
GPT-4GitHub CopilotN/AN/Acompare33K4KN/AN/A
Claude Opus 4.6GitHub CopilotN/AN/Acompare128K16KN/A47.6
GigaChat 2 ProGigaChat (Sber)N/AN/Acompare128K8KN/AN/A
GigaChat 2 MaxGigaChat (Sber)N/AN/Acompare128K8KN/AN/A
GigaChat 2 LiteGigaChat (Sber)N/AN/Acompare128K8KN/AN/A
Learnlm 1.5 Pro ExperimentalGoogle GeminiN/AN/Acompare33K8KN/AN/A
Gemini robotics-er-1.5 PreviewGoogle Gemini0.3002.50compare1.0M66KN/AN/A
Gemini gemma-2-9b-itGoogle Gemini0.3501.05compare8K8KN/AN/A
Gemini gemma-2-27b-itGoogle Gemini0.3501.05compare8K8KN/AN/A
Gemini Experimental 1114Google GeminiN/AN/Acompare1.0M8KN/AN/A
Gemini 3.1 Pro PreviewGoogle Gemini2.0012.00compare1.0M66KN/A55.5
Gemini 2.0 Pro-exp-02-05Google GeminiN/AN/Acompare2.1M8KN/AN/A
Gemini robotics-er-1.5 PreviewGoogle Vertex AI0.3002.50compare1.0M66KN/AN/A
Gemini Experimental 1206Google Gemini0.3002.50compare1.0M66KN/AN/A
Gemini 2.0 Pro-exp-02-05Google Vertex AI1.2510.00compare2.1M8KN/AN/A
Ft:davinci 002OpenAI12.0012.00compare16K4KN/AN/A
Ft:babbage 002OpenAI1.601.60compare16K4KN/AN/A
Zephyr 7B BetaFireworks AI0.2000.200compare33K33KN/AN/A
Yi LargeFireworks AI3.003.00compare33K33KN/AN/A
Yi 6BFireworks AI0.2000.200compare4K4KN/AN/A
Yi 34B 200K CapybaraFireworks AI0.9000.900compare200K200KN/AN/A
Yi 34BFireworks AI0.9000.900compare4K4KN/AN/A
Toppy M 7BFireworks AI0.2000.200compare33K33KN/AN/A
Starcoder2 7BFireworks AI0.2000.200compare16K16KN/AN/A
Starcoder2 3BFireworks AI0.1000.100compare16K16KN/AN/A
Starcoder2 15BFireworks AI0.2000.200compare16K16KN/AN/A
Starcoder 7BFireworks AI0.2000.200compare8K8KN/AN/A
Starcoder 16BFireworks AI0.2000.200compare8K8KN/AN/A
Stablecode 3BFireworks AI0.1000.100compare4K4KN/AN/A
Snorkel Mistral 7B Pairrm DpoFireworks AI0.2000.200compare33K33KN/AN/A
Rolm OCRFireworks AI0.2000.200compare128K128KN/AN/A
Qwen3 1p7b Fp8 Draft 40960Fireworks AI0.1000.100compare41K41KN/AN/A
Qwen3 1p7b Fp8 Draft 131072Fireworks AI0.1000.100compare131K131KN/AN/A
Qwen2p5 Vl 7B InstructFireworks AI0.2000.200compare128K128KN/AN/A
Qwen2p5 Vl 72B InstructFireworks AI0.9000.900compare128K128KN/AN/A
Qwen2p5 Vl 3B InstructFireworks AI0.2000.200compare128K128KN/AN/A
Qwen2p5 Vl 32B InstructFireworks AI0.9000.900compare128K128KN/AN/A
Qwen2p5 Math 72B InstructFireworks AI0.9000.900compare4K4KN/AN/A
Qwen2p5 Coder 3BFireworks AI0.1000.100compare33K33KN/AN/A
Qwen2p5 Coder 1p5bFireworks AI0.1000.100compare33K33KN/AN/A
Qwen2p5 Coder 14BFireworks AI0.2000.200compare33K33KN/AN/A
Qwen2p5 Coder 0p5bFireworks AI0.1000.100compare33K33KN/AN/A
Qwen2p5 1p5b InstructFireworks AI0.1000.100compare33K33KN/AN/A
Qwen2p5 0p5b InstructFireworks AI0.1000.100compare33K33KN/AN/A
Qwen2 Vl 7B InstructFireworks AI0.2000.200compare33K33KN/AN/A
Qwen2 Vl 72B InstructFireworks AI0.9000.900compare33K33KN/AN/A
Qwen2 Vl 2B InstructFireworks AI0.1000.100compare33K33KN/AN/A
Qwen2 7B InstructFireworks AI0.2000.200compare33K33KN/AN/A
Qwen1p5 72B ChatFireworks AI0.9000.900compare33K33KN/AN/A
Qwen V2p5 7BFireworks AI0.2000.200compare131K131KN/AN/A
Qwen V2p5 14B InstructFireworks AI0.2000.200compare33K33KN/AN/A
Pythia 12BFireworks AI0.2000.200compare2K2KN/AN/A
Phind Code Llama 34B V2Fireworks AI0.9000.900compare16K16KN/AN/A
Phind Code Llama 34B Python V1Fireworks AI0.9000.900compare16K16KN/AN/A
Phi 3 Vision 128K InstructFireworks AI0.2000.200compare32K32KN/AN/A
Phi 2 3BFireworks AI0.1000.100compare2K2KN/AN/A
Openorca 7BFireworks AI0.2000.200compare33K33KN/AN/A
Openhermes 2p5 Mistral 7BFireworks AI0.2000.200compare33K33KN/AN/A
Openhermes 2 Mistral 7BFireworks AI0.2000.200compare33K33KN/AN/A
Nous Hermes Llama2 7BFireworks AI0.2000.200compare4K4KN/AN/A
Nous Hermes Llama2 70BFireworks AI0.9000.900compare4K4KN/AN/A
Nous Hermes Llama2 13BFireworks AI0.2000.200compare4K4KN/AN/A
Nous Hermes 2 Yi 34BFireworks AI0.9000.900compare4K4KN/AN/A
Nous Hermes 2 Mixtral 8x7B DpoFireworks AI0.5000.500compare33K33KN/AN/A
Nous Capybara 7B V1p9Fireworks AI0.2000.200compare33K33KN/AN/A
Mythomax L2 13BFireworks AI0.2000.200compare4K4KN/AN/A
Mixtral 8x22B Instruct HfFireworks AI1.201.20compare66K66KN/AN/A
Mistral Small 24B Instruct 2501Fireworks AI0.9000.900compare33K33KN/AN/A
Mistral Nemo Base 2407Fireworks AI0.2000.200compare128K128KN/AN/A
Mistral 7B Instruct V0p2Fireworks AI0.2000.200compare33K33KN/AN/A
Ministral 3 8B Instruct 2512Fireworks AI0.2000.200compare256K256KN/AN/A
Ministral 3 3B Instruct 2512Fireworks AI0.1000.100compare256K256KN/AN/A
Ministral 3 14B Instruct 2512Fireworks AI0.2000.200compare256K256KN/AN/A
Minimax M2p1Fireworks AI0.3001.20compare205K205KN/AN/A
Llava Yi 34BFireworks AI0.9000.900compare4K4KN/AN/A
Llamaguard 7BFireworks AI0.2000.200compare4K4KN/AN/A
Llama4 Scout Instruct BasicFireworks AI0.1500.600compare131K131KN/AN/A
Llama4 Maverick Instruct BasicFireworks AI0.2200.880compare131K131KN/AN/A
Llama Guard 3 8BFireworks AI0.2000.200compare131K131KN/AN/A
Llama Guard 3 1BFireworks AI0.1000.100compare131K131KN/AN/A
Llama Guard 2 8BFireworks AI0.2000.200compare8K8KN/AN/A
Kat Dev 72B ExpFireworks AI0.9000.900compare131K131KN/AN/A
Kat Dev 32BFireworks AI0.9000.900compare131K131KN/AN/A
Kat CoderFireworks AI0.9000.900compare262K262KN/AN/A
Internvl3 8BFireworks AI0.2000.200compare16K16KN/AN/A
Internvl3 78BFireworks AI0.9000.900compare16K16KN/AN/A
Internvl3 38BFireworks AI0.9000.900compare16K16KN/AN/A
Hermes 2 Pro Mistral 7BFireworks AI0.2000.200compare33K33KN/AN/A
GPT-oss-safeguard-20bFireworks AI0.5000.500compare131K131KN/AN/A
GPT-oss-safeguard-120bFireworks AI1.201.20compare131K131KN/AN/A
Glm 4p5 AirFireworks AI0.2200.880compare128K96KN/AN/A
Gemma2 9B ItFireworks AI0.2000.200compare8K8KN/AN/A
Gemma 7BFireworks AI0.2000.200compare8K8KN/AN/A
Gemma 2B ItFireworks AI0.1000.100compare8K8KN/AN/A
Flux 1 SchnellFireworks AI0.1000.100compare4K4KN/AN/A
Flux 1 Dev Controlnet UnionFireworks AI0.00100.0010compare4K4KN/AN/A
Flux 1 DevFireworks AI0.1000.100compare4K4KN/AN/A
Firesearch OCR V6Fireworks AI0.2000.200compare8K8KN/AN/A
Firellava 13BFireworks AI0.2000.200compare4K4KN/AN/A
Firefunction V2Fireworks AI0.9000.900compare8K8KN/AN/A
Firefunction V1Fireworks AI0.5000.500compare33K33KN/AN/A
Fare 20BFireworks AI0.9000.900compare131K131KN/AN/A
ERNIE 4p5 21B A3b PtFireworks AI0.1000.100compare4K4KN/AN/A
Dolphin 2p6 Mixtral 8x7BFireworks AI0.5000.500compare33K33KN/AN/A
Dolphin 2 9 2 Qwen2 72BFireworks AI0.9000.900compare131K131KN/AN/A
Dobby Unhinged Llama 3 3 70B NewFireworks AI0.9000.900compare131K131KN/AN/A
Dobby Mini Unhinged Plus Llama 3 1 8BFireworks AI0.2000.200compare131K131KN/AN/A
DeepSeek V2 Lite ChatFireworks AI0.5000.500compare164K164KN/AN/A
DeepSeek R1 Distill Qwen 7BFireworks AI0.2000.200compare131K131KN/AN/A
DeepSeek R1 BasicFireworks AI0.5502.19compare128K20KN/AN/A
DeepSeek R1 0528Fireworks AI3.008.00compare160K160KN/AN/A
DeepSeek Prover V2Fireworks AI1.201.20compare164K164KN/AN/A
DeepSeek Coder 7B Base V1p5Fireworks AI0.2000.200compare4K4KN/AN/A
DeepSeek Coder 7B BaseFireworks AI0.2000.200compare4K4KN/AN/A
DeepSeek Coder 33B InstructFireworks AI0.9000.900compare16K16KN/AN/A
DeepSeek Coder 1B BaseFireworks AI0.1000.100compare16K16KN/AN/A
Cogito V1 Preview Qwen 32BFireworks AI0.9000.900compare131K131KN/AN/A
Cogito V1 Preview Qwen 14BFireworks AI0.2000.200compare131K131KN/AN/A
Cogito V1 Preview Llama 8BFireworks AI0.2000.200compare131K131KN/AN/A
Cogito V1 Preview Llama 70BFireworks AI0.9000.900compare131K131KN/AN/A
Cogito V1 Preview Llama 3BFireworks AI0.1000.100compare131K131KN/AN/A
Cogito 671B V2 P1Fireworks AI1.201.20compare164K164KN/AN/A
Codegemma 7BFireworks AI0.2000.200compare8K8KN/AN/A
Codegemma 2BFireworks AI0.1000.100compare8K8KN/AN/A
Code Qwen 1p5 7BFireworks AI0.2000.200compare66K66KN/AN/A
Code Llama 7BFireworks AI0.2000.200compare16K16KN/AN/A
Code Llama 70BFireworks AI0.9000.900compare4K4KN/AN/A
Code Llama 34BFireworks AI0.9000.900compare16K16KN/AN/A
Code Llama 13BFireworks AI0.2000.200compare16K16KN/AN/A
Chronos Hermes 13B V2Fireworks AI0.2000.200compare4K4KN/AN/A
Qwerky QwQ 32BFeatherless AIN/AN/Acompare33K4KN/AN/A
Qwerky 72BFeatherless AIN/AN/Acompare33K4KN/AN/A
Eu.twelvelabs.pegasus 1 2 V1AWS BedrockN/A7.50compareN/AN/AN/AN/A
Eu.mistral.pixtral Large 2502 V1AWS Bedrock2.006.00compare128K4KN/AN/A
DolphinNLP Cloud0.5000.500compare16K16KN/AN/A
DeepSeek CoderDeepSeek0.1400.280compare128K4KN/AN/A
DeepSeek ChatDeepSeek0.2800.420compare131K8KN/AN/A
L3.3 70B Euryale V2.3DeepInfra0.6500.750compare131K131KN/AN/A
L3.1 70B Euryale V2.2DeepInfra0.6500.750compare131K131KN/AN/A
L3 8B Lunaris V1 TurboDeepInfra0.0400.050compare8K8KN/AN/A
Qwen3 Coder 480B A35B Instruct TurboDeepInfra0.2901.20compare262K262KN/AN/A
Qwen3 235B A22B Thinking 2507DeepInfra0.3002.90compare262K262KN/AN/A
Qwen2.5 VL 32B InstructDeepInfra0.2000.600compare128K128KN/AN/A
Qwen2.5 7B InstructDeepInfra0.0400.100compare33K33KN/AN/A
Mixtral 8x7B Instruct V0.1DeepInfra0.4000.400compare33K33KN/AN/A
Mistral Small 24B Instruct 2501DeepInfra0.0500.080compare33K33KN/AN/A
Mistral Nemo Instruct 2407DeepInfra0.0200.040compare131K131KN/AN/A
WizardLM 2 8x22BDeepInfra0.4800.480compare66K66KN/AN/A
Llama Guard 4 12BDeepInfra0.1800.180compare164K164KN/AN/A
Llama Guard 3 8BDeepInfra0.0550.055compare131K131KN/AN/A
Llama 4 Scout 17B 16E InstructDeepInfra0.0800.300compare328K328KN/AN/A
Llama 4 Maverick 17B 128E Instruct FP8DeepInfra0.1500.600compare1.0M1.0MN/AN/A
MythoMax L2 13BDeepInfra0.0800.090compare4K4KN/AN/A
DeepSeek R1 TurboDeepInfra1.003.00compare41K41KN/AN/A
DeepSeek R1 0528 TurboDeepInfra1.003.00compare33K33KN/AN/A
DeepSeek R1 0528DeepInfra0.5002.15compare164K164KN/AN/A
OlmOCR 7B 0725 FP8DeepInfra0.2701.50compare16K16KN/AN/A
Databricks Mpt 7B InstructDatabricks0.500N/Acompare8K8KN/AN/A
Databricks Mpt 30B InstructDatabricks1.001.00compare8K8KN/AN/A
Databricks Mixtral 8x7B InstructDatabricks0.5001.00compare4K4KN/AN/A
Databricks Llama 4 MaverickDatabricks0.5001.50compare128K128KN/AN/A
Databricks Claude Sonnet 4 1Databricks3.0015.00compare200K64KN/AN/A
Qwq PlusDashScope (Alibaba)0.8002.40compare98K8KN/AN/A
Qwen3 Coder PlusDashScope (Alibaba)N/AN/Acompare998K66KN/AN/A
Qwen3 Coder PlusDashScope (Alibaba)N/AN/Acompare998K66KN/AN/A
Qwen3 Coder FlashDashScope (Alibaba)N/AN/Acompare998K66KN/AN/A
Qwen3 Coder FlashDashScope (Alibaba)N/AN/Acompare998K66KN/AN/A
Qwen TurboDashScope (Alibaba)0.0500.200compare1.0M16KN/AN/A
Qwen TurboDashScope (Alibaba)0.0500.200compare1.0M8KN/AN/A
Qwen PlusDashScope (Alibaba)N/AN/Acompare998K33KN/AN/A
Qwen PlusDashScope (Alibaba)N/AN/Acompare998K33KN/AN/A
Qwen PlusDashScope (Alibaba)N/AN/Acompare998K33KN/AN/A
Qwen PlusDashScope (Alibaba)0.4001.20compare129K16KN/AN/A
Qwen PlusDashScope (Alibaba)0.4001.20compare129K16KN/AN/A
Qwen PlusDashScope (Alibaba)0.4001.20compare129K8KN/AN/A
Qwen FlashDashScope (Alibaba)N/AN/Acompare998K33KN/AN/A
Qwen FlashDashScope (Alibaba)N/AN/Acompare998K33KN/AN/A
Qwen CoderDashScope (Alibaba)0.3001.50compare1.0M16KN/AN/A
Command R7bCohere0.1500.037compare128K4KN/AN/A
Command NightlyCohere1.002.00compare4K4KN/AN/A
Command LightCohere0.3000.600compare4K4KN/AN/A
Command ACohere2.5010.00compare256K8KN/AN/A
CommandCohere1.002.00compare4K4KN/AN/A
CodestralMistral CodestralN/AN/Acompare32K8KN/AN/A
Codestral 2405Mistral CodestralN/AN/Acompare32K8KN/AN/A
Codechat BisonVertex AI (Code Chat)0.1250.125compare6K1KN/AN/A
Codechat BisonVertex AI (Code Chat)0.1250.125compare6K1KN/AN/A
Codechat Bison 32KVertex AI (Code Chat)0.1250.125compare32K8KN/AN/A
Codechat Bison 32KVertex AI (Code Chat)0.1250.125compare32K8KN/AN/A
Codechat BisonVertex AI (Code Chat)0.1250.125compare6K1KN/AN/A
Code GeckoVertex AI (Code Text)0.1250.125compare2K64N/AN/A
Code GeckoVertex AI (Code Text)0.1250.125compare2K64N/AN/A
Code GeckoVertex AI (Code Text)0.1250.125compare2K64N/AN/A
Code Bison32kVertex AI (Code Text)0.1250.125compare6K1KN/AN/A
Code BisonVertex AI (Code Text)0.1250.125compare6K1KN/AN/A
Code BisonVertex AI (Code Text)0.1250.125compare6K1KN/AN/A
Code Bison 32KVertex AI (Code Text)0.1250.125compare6K1KN/AN/A
Code BisonVertex AI (Code Text)0.1250.125compare6K1KN/AN/A
Codellama 7B Instruct AwqCloudflare Workers AI1.921.92compare4K4KN/AN/A
Mistral 7B Instruct V0.1Cloudflare Workers AI1.921.92compare8K8KN/AN/A
Claude Sonnet 4.6Anthropic3.0015.00compare200K64KN/A46.4
Claude Opus 4.6Anthropic5.0025.00compare1.0M128KN/A47.6
ChatdolphinNLP Cloud0.5000.500compare16K16KN/AN/A
Chat BisonVertex AI (Chat)0.1250.125compare8K4KN/AN/A
Chat BisonVertex AI (Chat)0.1250.125compare8K4KN/AN/A
Chat Bison 32KVertex AI (Chat)0.1250.125compare32K8KN/AN/A
Chat Bison 32KVertex AI (Chat)0.1250.125compare32K8KN/AN/A
Chat BisonVertex AI (Chat)0.1250.125compare8K4KN/AN/A
Zai Glm 4.7Cerebras2.252.75compare128K128KN/AN/A
Zai Glm 4.6Cerebras2.252.75compare128K128KN/AN/A
Mixtral 8x7B Instruct V0AWS Bedrock0.4500.700compare32K8KN/AN/A
Mistral Large 2402 V1AWS Bedrock8.0024.00compare32K8KN/AN/A
Mistral 7B Instruct V0AWS Bedrock0.1500.200compare32K8KN/AN/A
Qwen3 Coder NextAWS Bedrock0.6001.44compare262K8KN/AN/A
Moonshotai.kimi K2.5AWS Bedrock0.7203.60compare262K262KN/AN/A
Minimax.minimax M2.1AWS Bedrock0.3601.44compare196K8KN/AN/A
Command Text V14AWS BedrockN/AN/Acompare4K4KN/AN/A
Command Light Text V14AWS BedrockN/AN/Acompare4K4KN/AN/A
Babbage 002OpenAI0.4000.400compare16K4KN/AN/A
Mistral Large 2402Azure OpenAI8.0024.00compare32KN/AN/AN/A
GPT-realtime miniAzure OpenAI0.6002.40compare32K4KN/AN/A
GPT-realtimeAzure OpenAI4.0016.00compare32K4KN/AN/A
GPT-realtime-1.5Azure OpenAI4.0016.00compare32K4KN/AN/A
GPT-audio miniAzure OpenAI0.6002.40compare128K16KN/AN/A
GPT-audioAzure OpenAI2.5010.00compare128K16KN/AN/A
GPT-audio-1.5Azure OpenAI2.5010.00compare128K16KN/AN/A
GPT-4o-realtime PreviewAzure OpenAI5.0020.00compare128K4KN/AN/A
GPT-4o-mini-realtime PreviewAzure OpenAI0.6002.40compare128K4KN/AN/A
GPT-4.1 miniAzure OpenAI0.4001.60compare1.0M33KN/AN/A
ContainerAzure OpenAIN/AN/AcompareN/AN/AN/AN/A
Computer Use PreviewAzure OpenAI3.0012.00compare8K1KN/AN/A
Phi 4 ReasoningAzure AI0.1250.500compare33K4KN/AN/A
Phi 4 Mini ReasoningAzure AI0.0800.320compare131K4KN/AN/A
Phi 3.5 Vision InstructAzure AI0.1300.520compare128K4KN/AN/A
Phi 3.5 MoE InstructAzure AI0.1600.640compare128K4KN/AN/A
Phi 3.5 Mini InstructAzure AI0.1300.520compare128K4KN/AN/A
Phi 3 Small 128K InstructAzure AI0.1500.600compare128K4KN/AN/A
Phi 3 Medium 128K InstructAzure AI0.1700.680compare128K4KN/AN/A
Model RouterAzure AI0.140N/AcompareN/AN/AN/AN/A
Mistral Small 2503Azure AI0.1000.300compare128K128KN/AN/A
Mistral NemoAzure AI0.1500.150compare131K4KN/AN/A
Mistral Medium 2505Azure AI0.4002.00compare131K8KN/AN/A
Ministral 3BAzure AI0.0400.040compare128K4KN/AN/A
MAI DS R1Azure AI1.355.40compare128K8KN/AN/A
Llama 4 Scout 17B 16E InstructAzure AI0.2000.780compare10.0M16KN/AN/A
Llama 4 Maverick 17B 128E Instruct FP8Azure AI1.410.350compare1.0M16KN/AN/A
Kimi K2.5Azure AI0.6003.00compare262K262KN/AN/A
Jamba InstructAzure AI0.5000.700compare70K4KN/AN/A
JAIS 30B ChatAzure AI0.00320.0097compare8K8KN/AN/A
Grok 4 Fast Non ReasoningAzure AI0.2000.500compare131K131KN/AN/A
Grok 3 MiniAzure AI0.2501.27compare131K131KN/AN/A
Claude Sonnet 4.6Azure AI3.0015.00compare200K64KN/A46.4
Claude Opus 4.6Azure AI5.0025.00compare200K128KN/A47.6
Mixtral 8x7B Instruct V0.1Anyscale0.1500.150compare16K16KN/AN/A
Mixtral 8x22B Instruct V0.1Anyscale0.9000.900compare66K66KN/AN/A
Mistral 7B Instruct V0.1Anyscale0.1500.150compare16K16KN/AN/A
Zephyr 7B BetaAnyscale0.1500.150compare16K16KN/AN/A
Gemma 7B ItAnyscale0.1500.150compare8K8KN/AN/A
CodeLlama 70B Instruct HfAnyscale1.001.00compare4K4KN/AN/A
CodeLlama 34B Instruct HfAnyscale1.001.00compare4K4KN/AN/A
Claude V1AWS Bedrock8.0024.00compare100K8KN/AN/A
Claude Sonnet 4.6AWS Bedrock3.0015.00compare200K64KN/A46.4
Claude Opus 4.6AWS Bedrock5.0025.00compare1.0M128KN/A47.6
Titan Text Premier V1AWS Bedrock0.5001.50compare42K32KN/AN/A
Titan Text Lite V1AWS Bedrock0.3000.400compare42K4KN/AN/A
Titan Text Express V1AWS Bedrock1.301.70compare42K8KN/AN/A
Jamba Instruct V1AWS Bedrock0.5000.700compare70K4KN/AN/A
J2 Ultra V1AWS Bedrock18.8018.80compare8K8KN/AN/A
J2 Mid V1AWS Bedrock12.5012.50compare8K8KN/AN/A