AI Models Comparison
Compare AI model pricing and benchmarks across providers including OpenAI, Anthropic, Google, AWS Bedrock, Azure, Mistral, and more. Filter by model capabilities like vision, function calling, and reasoning support. Find the most cost-effective model for your use case. Currently tracking 1,872 models across 102 providers.
Pricing data comes from LiteLLM, which is maintained by the open-source community, and benchmark data comes from Artificial Analysis. Last updated on March 25, 2026 at 12:00 AM UTC.
| Model | Input Price, $ | Output Price, $ | Context | Max Output | Intelligence (rank) | Coding (rank) |
|---|---|---|---|---|---|---|
| Qwen3 235B A22B Thinking 2507 | 0.010 | 0.010 | 262K | 262K | 39.9 (#29) | 30.5 (#38) |
| GPT-oss-120b | 0.015 | 0.060 | 131K | 131K | 33.3 (#39) | 28.6 (#45) |
| DeepSeek V3.1 | 0.055 | 0.165 | 128K | 128K | 28.1 (#59) | 28.4 (#46) |
| GLM 4.5 | 0.055 | 0.200 | 131K | 131K | 26.4 (#64) | 26.3 (#49) |
| Kimi K2 Instruct | 0.600 | 2.50 | 128K | 128K | 26.3 (#65) | 22.1 (#65) |
| Qwen3 235B A22B Instruct 2507 | 0.010 | 0.010 | 262K | 262K | 25.0 (#70) | 22.1 (#65) |
| Qwen3 Coder 480B A35B Instruct | 0.100 | 0.150 | 262K | 262K | 24.8 (#71) | 24.6 (#56) |
| GPT-oss-20b | 0.0050 | 0.020 | 131K | 131K | 24.5 (#72) | 18.5 (#80) |
| DeepSeek V3 0324 | 0.114 | 0.275 | 161K | 161K | 22.3 (#82) | 22.0 (#67) |
| Llama 3.3 70B Instruct | 0.071 | 0.071 | 128K | 128K | 14.5 (#136) | 10.7 (#130) |
| Llama 3.1 8B Instruct | 0.022 | 0.022 | 128K | 128K | 11.8 (#174) | 4.9 (#157) |
| Phi 4 Mini Instruct | 0.0080 | 0.035 | 128K | 128K | 8.4 (#211) | 3.6 (#162) |
| Llama 4 Scout 17B 16E Instruct | 0.017 | 0.066 | 64K | 64K | N/A | N/A |
| DeepSeek R1 0528 | 0.135 | 0.540 | 161K | 161K | N/A | N/A |
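
One way to read "most cost-effective" from the table above is to set a minimum Intelligence score and then rank the remaining models by a blended price. The Python sketch below illustrates that idea with a few rows copied from the table. The `Model` class, the function names, and the 75% input share are illustrative assumptions, not part of the site or of LiteLLM; the blended price is expressed in whatever unit the table's price columns use.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Model:
    name: str
    input_price: float             # "Input Price, $" column
    output_price: float            # "Output Price, $" column
    intelligence: Optional[float]  # None where the table shows N/A


# A few rows copied from the table above.
MODELS = [
    Model("Qwen3 235B A22B Thinking 2507", 0.010, 0.010, 39.9),
    Model("GPT-oss-120b", 0.015, 0.060, 33.3),
    Model("DeepSeek V3.1", 0.055, 0.165, 28.1),
    Model("Kimi K2 Instruct", 0.600, 2.50, 26.3),
    Model("DeepSeek R1 0528", 0.135, 0.540, None),
]


def blended_price(model: Model, input_share: float = 0.75) -> float:
    """Weighted average of input and output price.

    input_share is an assumed fraction of tokens that are input tokens;
    adjust it to match the workload.
    """
    return input_share * model.input_price + (1 - input_share) * model.output_price


def cheapest_meeting(models: List[Model], min_intelligence: float,
                     input_share: float = 0.75) -> List[Model]:
    """Models scoring at least min_intelligence, cheapest blended price first.

    Models without a benchmark score (N/A) are excluded.
    """
    eligible = [
        m for m in models
        if m.intelligence is not None and m.intelligence >= min_intelligence
    ]
    return sorted(eligible, key=lambda m: blended_price(m, input_share))


if __name__ == "__main__":
    for m in cheapest_meeting(MODELS, min_intelligence=28.0):
        print(f"{m.name}: blended price {blended_price(m):.4f}")
```

Changing `input_share` shifts the ranking toward models with cheap input or cheap output, so it is worth setting it from the actual prompt and completion lengths of the workload.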