Singapore-based GPU cloud offering serverless inference for a broad catalog of open-weight and frontier models. Hosts Qwen, MiniMax, DeepSeek, Llama, and others with OpenAI-compatible endpoints. Focuses on Asia-Pacific availability and competitive pricing. Inference platform · OpenAI-compatible API · Asia Pacific · Gpu Cloud · Low Cost · Open Source · Serverless

Intelligence vs Price

Best value among GMI Cloud models on this chart: GPT-5.2 · GPT-5.1 · Kimi K2 Thinking (and 3 more on the dashed frontier). Hover any dot for full pricing, or click a creator in the legend to isolate.

GMI Cloud models

17 models, 17 with pricing
Input/1M
to
Output/1M
to
Model
Creator
Input Price, $
Output Price, $
Context
Max Output
Inference Providers
Intelligence
Coding
GPT-5.2OpenAI logoOpenAI1.7514.00410K128Kcompare (6)51.3#148.7#1
GPT-5.1OpenAI logoOpenAI1.2510.00410K128Kcompare (6)47.7#244.7#2
GPT-5OpenAI logoOpenAI1.2510.00410K128Kcompare (9)44.6#336.0#4
Claude Opus 4.5Anthropic logoAnthropic5.0025.00410K64Kcompare (9)43.1#442.9#3
Kimi K2 ThinkingMoonshot AI (Kimi) logoMoonshot AI (Kimi)0.5741.20262K33Kcompare (14)40.9#534.8#5
MiniMax M2.1MiniMax logoMiniMax0.2900.9501.0M197Kcompare (8)39.4#632.8#8
Claude Sonnet 4.5Anthropic logoAnthropic3.0015.001.0M64Kcompare (10)37.1#733.5#7
Claude Sonnet 4Anthropic logoAnthropic3.0015.001.0M64Kcompare (10)33.0#930.6#9
Claude Opus 4Anthropic logoAnthropic15.0075.00410K32Kcompare (8)33.0#8N/A
DeepSeek V3.2DeepSeek logoDeepSeek0.2520.378164K66Kcompare (12)32.1#1034.6#6
DeepSeek V3 324DeepSeek logoDeepSeek0.2000.400164K16Kcompare (13)22.3#1122.0#10
Qwen3 VL 235B A22B InstructAlibaba logoAlibaba0.2000.880262K129Kcompare (7)20.8#1216.5#12
GPT-4oOpenAI logoOpenAI2.5010.00131K16Kcompare (6)14.5#1316.6#11
GPT-4o miniOpenAI logoOpenAI0.1500.600131K16Kcompare (6)12.6#14N/A
Gemini 3 Flash PreviewGoogle logoGoogle0.5003.001.0M66Kcompare (4)N/AN/A
Gemini 3 Pro PreviewGoogle logoGoogle2.0012.001.0M66Kcompare (5)N/AN/A
GLM-4.7 FP8Zhipu AI logoZhipu AI0.4002.00203K16Kcompare (1)N/AN/A