Question 1

What is the cheapest AI model for chat?

Accepted Answer

The cheapest AI chat models are typically smaller models like GPT-3.5-turbo, Claude Instant, or open-source models on platforms like Together AI and Groq. Prices can be as low as $0.10-0.25 per million tokens. However, cheaper models may have reduced capabilities compared to flagship models like GPT-4 or Claude 3 Opus. Use CloudPrice to compare current prices across all providers.

Question 2

How much does GPT-4 cost?

Accepted Answer

GPT-4 pricing varies by version. GPT-4 Turbo costs approximately $10 per million input tokens and $30 per million output tokens. GPT-4o is more affordable at around $5/$15 per million tokens. OpenAI also offers GPT-4o-mini at significantly lower prices. Pricing may change, so check CloudPrice for current rates.

Question 3

What is the difference between input and output tokens?

Accepted Answer

Input tokens are the text you send to the AI model (your prompts, context, and conversation history). Output tokens are what the model generates in response. Most AI providers charge differently for input and output tokens, with output tokens typically being 2-4x more expensive because they require more computation to generate.

Question 4

Which AI model has the largest context window?

Accepted Answer

Google's Gemini 1.5 Pro currently offers one of the largest context windows at 1 million tokens. Anthropic's Claude 3 models support 200K tokens, while OpenAI's GPT-4 Turbo supports 128K tokens. Larger context windows allow processing more text in a single request but may cost more.

Question 5

What does 'supports vision' mean for AI models?

Accepted Answer

AI models with vision support can process and understand images in addition to text. You can send images as part of your prompt and the model can describe, analyze, or answer questions about them. Examples include GPT-4 Vision, Claude 3, and Gemini Pro Vision. Vision capabilities typically cost extra per image processed.

Model	Provider	Input Price, $	Output Price, $	Price Compare	Context	Max Output	Intelligence	Coding
Qwen3 235B A22B Thinking 2507	Weights & Biases	0.010	0.010	compare	262K	262K	39.9#29	30.5#38
GPT-oss-120b	Weights & Biases	0.015	0.060	compare	131K	131K	33.3#39	28.6#45
DeepSeek V3.1	Weights & Biases	0.055	0.165	compare	128K	128K	28.1#59	28.4#46
GLM 4.5	Weights & Biases	0.055	0.200	compare	131K	131K	26.4#64	26.3#49
Kimi K2 Instruct	Weights & Biases	0.600	2.50	compare	128K	128K	26.3#65	22.1#65
Qwen3 235B A22B Instruct 2507	Weights & Biases	0.010	0.010	compare	262K	262K	25.0#70	22.1#65
Qwen3 Coder 480B A35B Instruct	Weights & Biases	0.100	0.150	compare	262K	262K	24.8#71	24.6#56
GPT-oss-20b	Weights & Biases	0.0050	0.020	compare	131K	131K	24.5#72	18.5#80
DeepSeek V3 0324	Weights & Biases	0.114	0.275	compare	161K	161K	22.3#82	22.0#67
Llama 3.3 70B Instruct	Weights & Biases	0.071	0.071	compare	128K	128K	14.5#136	10.7#130
Llama 3.1 8B Instruct	Weights & Biases	0.022	0.022	compare	128K	128K	11.8#174	4.9#157
Phi 4 Mini Instruct	Weights & Biases	0.0080	0.035	compare	128K	128K	8.4#211	3.6#162
Llama 4 Scout 17B 16E Instruct	Weights & Biases	0.017	0.066	compare	64K	64K	N/A	N/A
DeepSeek R1 0528	Weights & Biases	0.135	0.540	compare	161K	161K	N/A	N/A

AI Models Comparison