AI model providers
Every AI model provider tracked by CloudPrice - frontier labs, inference platforms and self-hosted open-weights shortcuts. Click into any provider to see their full catalog with live pricing, benchmarks and capabilities.
OpenRouter
The unified interface for LLMs. Find the best models & prices for your prompts
Vercel AI Gateway
Deploy AI apps in seconds with Vercel's AI SDK and Frontend Cloud. Built-in adapters, streaming UI helpers, and zero-config deployments.
Fireworks AI
Use state-of-the-art, open-source LLMs and image models at blazing fast speed, or fine-tune and deploy your own at no additional cost with Fireworks AI!
Microsoft Azure AI Foundry
Microsoft Foundry
Google Vertex AI
Gemini Enterprise Agent Platform (formerly Vertex AI) is a comprehensive platform for developers to build, scale, govern and optimize agents.
Google Gemini
Build with Gemini 2.0 Flash, 2.5 Pro, and Gemma using the Gemini API and Google AI Studio.
Amazon Bedrock
Amazon Bedrock: The platform for building generative AI applications and agents at production scale
Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Alibaba Cloud / Qwen
Supercharge Your AI Journey Effortlessly With Industry-Leading GenAI Models
OpenAI
Creator of GPT-4o, o3, and the GPT model family. Offers text, vision, audio, image generation, speech, and embedding models via a REST API. Pioneered the modern LLM API interface now widely adopted as the de-facto standard.
Novita AI
Novita AI provides 200+ Model APIs, custom deployment, GPU Instances, and Serverless GPUs. Scale AI, optimize performance, and innovate with ease and efficiency.
Deep Infra
DeepInfra offers cost-effective, scalable, easy-to-deploy, and production-ready machine-learning models and infrastructures for deep-learning models.
Snowflake Cortex AI
Discover Snowflake Arctic, a breakthrough LLM built for enterprise AI. Enterprise intelligence. Breakthrough efficiency. Truly open.
Oracle Cloud Infrastructure (OCI)
Transform your business with generative AI, and unlock a new era of productivity with task automation and end-to-end AI solutions for enterprise customers.
Replicate
Run open-source machine learning models with a cloud API
xAI
Elon Musk's AI company and creators of the Grok model family. Grok models offer large context windows, real-time knowledge via X (Twitter) integration, vision, and multimodal output. Grok-4 is their frontier reasoning model.
Deepgram
Power enterprise voice solutions with Deepgram’s Speech-to-Text, Text-to-Speech, and Voice Agent APIs. Real-time, accurate, and built for scale.
Mistral AI
The most powerful AI platform for enterprises. Customize, fine-tune, and deploy AI assistants, autonomous agents, and multimodal AI with open models.
Nebius
Discover the most efficient way to build, tune and run your AI models and applications on top-notch NVIDIA® GPUs.
IBM watsonx
IBM watsonx is a portfolio of AI products that accelerates the impact of generative AI in core workflows to drive productivity.
Databricks
Databricks offers a unified platform for data, analytics and AI. Build better AI with a data-centric approach. Simplify ETL, data warehousing, governance and AI on the Data Intelligence Platform.
Together AI
Build what's next on the AI Native Cloud. Full-stack AI platform for inference, fine-tuning, and GPU clusters — powered by cutting-edge research.
Perplexity
AI company best known for its search assistant. Also offers the Sonar model family via API: Sonar (fast, grounded), Sonar Pro (more capable), and Sonar Reasoning (chain-of-thought). All models include real-time web search grounding by default.
Cohere
Cohere builds powerful models and AI solutions enabling enterprises to automate processes, empower employees, and turn fragmented data into actionable insights.
Stability AI
Stability AI is the enterprise-ready creative partner for teams and creators, delivering professional-grade generative AI tools and solutions for content production at scale.
Lambda Labs
Cloud GPUs, on-demand clusters, private cloud, and hardware for AI training and inference. Run B200 and H100, deploy fast, and scale cost effectively.
Voyage AI
Voyage AI provides cutting-edge embedding models and rerankers for search and retrieval
Moonshot AI / Kimi
Chinese AI company creator of the Kimi model family. Kimi k1.5 and k2 are strong reasoning models with long context. Kimi VL handles vision tasks. Known for efficient long-document processing. Direct API available globally via Moonshot's platform.
GMI Cloud
Singapore-based GPU cloud offering serverless inference for a broad catalog of open-weight and frontier models. Hosts Qwen, MiniMax, DeepSeek, Llama, and others with OpenAI-compatible endpoints. Focuses on Asia-Pacific availability and competitive pricing.
Hyperbolic
Access open-source inference and compute at a fraction of the cost. Build with us.
SambaNova
Discover SambaNova - the complete AI platform delivering the fastest AI inference, fine-tuning, and scalable solutions for agentic AI easily integrated into existing data center infrastructures.
Groq
The Groq LPU delivers inference with the speed and cost developers need.
Nscale
Nscale full-stack AI cloud platform and services are designed for scale, resilience, and speed.
OVHcloud
Discover the generative AI APIs offered by OVHcloud. High-performing, easy to integrate, and secure, for application power.
Anthropic
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
AIML API
One API for 400+ AI models: GPT-5.5, Claude Opus 4.7, Gemini 3.5, DeepSeek v4, Seedance. Save up to 80% vs OpenAI & Anthropic. No token limits. Free playground.
AI21 Labs
AI21 builds Foundation Models and AI Systems for the enterprise. Power your most critical enterprise workflows with accurate, reliable, and scalable AI.
Gradient AI
Superagent provides free AI business research tools and an AI research assistant to analyze, summarize, and generate insights from web and text sources.
Z.AI
Meet Z.ai, your free AI-powered assistant. Build websites, create slides, analyze data, and get instant answers. Fast, smart, and reliable, powered by GLM-5.
MiniMax
Building AGI with our mission Intelligence with Everyone. Global leader in multi-modal models and AI-native products with over 200 million users.
Black Forest Labs
Black Forest Labs is the AI company behind FLUX, the state-of-the-art image generation model. Try FLUX.2, FLUX Kontext, and more via our API.
Fal AI
Easiest & most cost-effective way to use Gen AI. fal.ai is how devs integrate dozens of generative media models. FLUX, Kling, Hailuo +1000 more
Cerebras
Cerebras is the go-to platform for fast and effortless AI training. Learn more at cerebras.ai.
Runway
We are building foundational General World Models that will be capable of simulating all possible worlds and experiences. The next frontier of intelligence will come from models that can understand, perceive, generate and act in the world.
DeepSeek
DeepSeek, unravel the mystery of AGI with curiosity. Answer the essential question with long-termism.
Amazon Nova
Amazon Nova is a family of foundation models and services that delivers frontier intelligence and industry-leading price performance.
ElevenLabs
Create lifelike speech with our AI voice generator and voice agents platform. Access 5,000+ voices in 70+ languages with secure APIs and SDKs.
Cloudflare
Run machine learning models, powered by serverless GPUs, on Cloudflare's global network.
FriendliAI
FriendliAI is The Frontier AI Inference Cloud. Built by the researchers who invented the continuous batching technique that is now industry standard, FriendliAI provides AI engineers with a highly optimized engine that constantly evolves to efficiently run state-of-the-art open-weight and custom models at production scale. By maximizing GPU utilization, FriendliAI delivers speeds up to 3x faster than vLLM, and 50% to 90% cost savings relative to closed model APIs. FriendliAI empowers engineers to deploy frontier AI with uncompromising speed, model ownership, and enterprise-grade reliability.
Recraft
Recraft is a top-ranked text-to-image model and design platform for photorealism, vector generation, custom styles, mockups, and more
NLP Cloud
API platform for deploying NLP and LLM models including text generation, summarisation, sentiment analysis, and named entity recognition. Hosts open-source models and provides fine-tuning capabilities. Simple REST API with pay-as-you-go pricing.