Every notable AI you can actually use, in one place. 165 models, tools, and providers across chat, coding, images, video, voice, and the apps you run yourself. Use the filters to see only what is free, only what runs on your own computer, or only what is cloud-based.
Snapshot of 7 June 2026. Prices and model versions change often, and each name links to its official page. About 134 have a free option and about 77 can run on your own hardware.
| Tool | Maker | Category | What it is | Cost | Where it runs | Hardware |
|---|---|---|---|---|---|---|
| AI21 (Jamba) | AI21 Labs | Chat & text | Long-context Mamba-Transformer models with open weights available. | Paid | Cloud or local | Server GPUs to self-host |
| Amazon Nova | Amazon | Chat & text | AWS foundation models for text, image, video, and speech. | Paid | Cloud | Any device |
| ChatGPT | OpenAI | Chat & text | The best-known assistant; web search, images, video, deep research, and agents. | Free + paid | Cloud | Any device |
| Claude | Anthropic | Chat & text | Strong writing, careful reasoning, and document analysis; includes Claude Code. | Free + paid | Cloud | Any device |
| Cohere (Command) | Cohere | Chat & text | Enterprise text models for RAG, agents, and translation. | Paid | Cloud or local | Server GPUs to self-host |
| DeepSeek | DeepSeek | Chat & text | Capable free Chinese-lab assistant with reasoning and web search. | Free | Cloud | Any device |
| Doubao | ByteDance | Chat & text | China's most-used chatbot, with voice and video understanding. | Free + paid | Cloud | Any device |
| Ernie Bot | Baidu | Chat & text | Baidu's omni-modal assistant; China-focused, login required. | Free + paid | Cloud | Any device |
| Gemini | Google | Chat & text | Google's assistant with strong multimodal and long-context capabilities, tied to Workspace. | Free + paid | Cloud | Any device |
| Grok | xAI | Chat & text | Conversational assistant with real-time access to posts on X. | Free + paid | Cloud | Any device |
| HuggingChat | Hugging Face | Chat & text | Former open-source chat front end over Hugging Face models. | Discontinued | Cloud | Any device |
| Kimi | Moonshot AI | Chat & text | Long-context assistant with long-horizon coding and agent features. | Free + paid | Cloud | Any device |
| Meta AI | Meta | Chat & text | Free assistant built into WhatsApp, Instagram, and Messenger; runs Llama. | Free | Cloud | Any device |
| Microsoft Copilot | Microsoft | Chat & text | Microsoft assistant running OpenAI models, built into Windows and Office. | Free + paid | Cloud | Any device |
| Mistral Le Chat | Mistral AI | Chat & text | European assistant with agentic features and a data-residency focus. | Free + paid | Cloud | Any device |
| Perplexity | Perplexity AI | Chat & text | Answer engine: web search with cited answers across several models. | Free + paid | Cloud | Any device |
| Pi | Inflection AI | Chat & text | Companion-style assistant with voice; active development has stalled. | Free | Cloud | Any device |
| Poe | Quora | Chat & text | Aggregator app: chat across many third-party models on one plan. | Free + paid | Cloud | Any device |
| Qwen Chat | Alibaba | Chat & text | Free Alibaba assistant with chat, search, image, and video. | Free | Cloud | Any device |
| Reka | Reka AI | Chat & text | Multimodal models; the company is refocusing on physical-world AI. | Paid | Cloud or local | Server to edge |
| You.com | You.com | Chat & text | Answer engine with a model picker and a deep-research mode. | Free + paid | Cloud | Any device |
| Z.ai (GLM) | Zhipu AI | Chat & text | Chinese lab whose GLM-5 is a leading open-weight model; chat and coding plans. | Free + paid | Cloud or local | Server GPUs to self-host |
| Aider | Community | Coding & app building | Terminal AI pair programmer; edits as diffs and auto-commits each change. | Free | Cloud or local | Any device |
| Amazon Q Developer | Amazon | Coding & app building | AWS coding assistant for the IDE and CLI with completions and agents. | Free + paid | Cloud or local | Any device |
| Amp | Sourcegraph | Coding & app building | Sourcegraph's frontier coding agent across CLI, web, and mobile. | Free + paid | Cloud or local | Any device |
| Augment Code | Augment | Coding & app building | Context-engine assistant and agent tuned for large production codebases. | Paid | Cloud or local | Any device |
| Bolt.new | StackBlitz | Coding & app building | In-browser builder that runs a full Node.js environment for backend and login. | Free + paid | Cloud | Any device |
| Claude Code | Anthropic | Coding & app building | Terminal-first agent that reads and edits files, runs commands, and commits. | Free + paid | Cloud or local | Any device |
| Cline | Cline | Coding & app building | Open-source VS Code and CLI agent; bring your own keys across providers. | Free | Cloud or local | Any device |
| Continue.dev | Continue | Coding & app building | Open-source IDE assistant and platform for building custom agents. | Free | Cloud or local | Any device |
| Cursor | Anysphere | Coding & app building | AI code editor (VS Code fork) built around an agent with an autonomy slider. | Free + paid | Cloud or local | Any device |
| Devin | Cognition | Coding & app building | Autonomous AI software engineer that plans and executes whole tasks remotely. | Paid | Cloud | Any device |
| Gemini CLI | Google | Coding & app building | Open-source terminal agent with built-in Google Search grounding. | Free + paid | Cloud or local | Any device |
| GitHub Copilot | GitHub (Microsoft) | Coding & app building | Autocomplete, chat, and agent mode inside major editors; started the category. | Free + paid | Cloud or local | Any device |
| Google Antigravity | Google | Coding & app building | Google's agent-first development platform built around Gemini. | Free + paid | Cloud or local | Any device |
| JetBrains AI / Junie | JetBrains | Coding & app building | AI assistant plus the Junie agent across JetBrains IDEs. | Free + paid | Cloud or local | Any device |
| Kiro | Amazon | Coding & app building | AWS spec-driven agentic IDE that writes a spec and plan before coding. | Free + paid | Cloud or local | Any device |
| Lovable | Lovable | Coding & app building | Build apps and websites by chatting with AI; full-stack with hosting and login. | Free + paid | Cloud | Any device |
| OpenAI Codex | OpenAI | Coding & app building | Coding agent in the terminal, IDE, web, and iOS; local edits plus cloud tasks. | Free + paid | Cloud or local | Any device |
| OpenCode | Community | Coding & app building | Open-source terminal coding agent supporting many providers, including local. | Free | Cloud or local | Any device |
| Qodo | Qodo | Coding & app building | AI code-quality platform: PR review, test generation, and an agentic CLI. | Free + paid | Cloud or local | Any device |
| Replit | Replit | Coding & app building | Chat-to-app builder with hosting, database, and an agent; a non-coder entry point. | Free + paid | Cloud | Any device |
| Sourcegraph Cody | Sourcegraph | Coding & app building | Enterprise code assistant with deep codebase context. | Paid | Cloud or local | Any device |
| Tabnine | Tabnine | Coding & app building | Privacy-focused assistant deployable fully private or air-gapped. | Paid | Cloud or local | Any device |
| Trae | ByteDance | Coding & app building | Free VS Code-based AI IDE with a project-scaffolding builder mode. | Free + paid | Cloud or local | Any device |
| v0 | Vercel | Coding & app building | Turns prompts into front-end interfaces and full-stack apps; deploys to Vercel. | Free + paid | Cloud | Any device |
| Warp | Warp | Coding & app building | AI-native terminal and agentic development environment. | Free + paid | Cloud or local | Any device |
| Windsurf | Cognition | Coding & app building | AI editor with an agentic Cascade workflow; now owned by Cognition. | Free + paid | Cloud or local | Any device |
| Zed | Zed Industries | Coding & app building | Fast native editor with built-in AI, edit prediction, and external agents. | Free + paid | Cloud or local | Any device |
| Adobe Firefly | Adobe | Image generation | Trained on licensed data for commercial safety; Creative Cloud integration. | Free + paid | Cloud | Any device |
| Canva (Magic Media) | Canva | Image generation | Image generation built into the Canva design editor. | Free + paid | Cloud | Any device |
| FLUX.2 [dev] | Black Forest Labs | Image generation | Open 32B model for generation and single or multi-reference editing. | Open-weight | Cloud or local | High-end GPU (RTX 4090/5090) |
| FLUX.2 [klein] | Black Forest Labs | Image generation | Fastest FLUX, sub-second generation on consumer hardware. | Open-weight | Cloud or local | Gaming GPU ~13GB |
| FLUX.2 [pro] | Black Forest Labs | Image generation | Top-tier photorealism and prompt adherence; the hosted commercial tier. | Paid | Cloud | Any device |
| GPT Image 2 | OpenAI | Image generation | Reasoning-based generation with near-perfect text rendering and 4K output. | Free + paid | Cloud | Any device |
| Grok Imagine | xAI | Image generation | Image and video generation inside Grok and X. | Free + paid | Cloud | Any device |
| HunyuanImage 3.0 | Tencent | Image generation | Largest open text-to-image model; not runnable on a single consumer GPU. | Open-weight | Local | Data-center GPUs (~3x80GB) |
| Ideogram 4 | Ideogram | Image generation | Best-in-class in-image text and typography; now open-weight. | Open-weight | Cloud or local | High-end GPU |
| Imagen 4 | Google | Image generation | Strong photorealism and text rendering, via the Gemini API. | Free + paid | Cloud | Any device |
| Krea | Krea AI | Image generation | Aggregator of 150+ models plus a real-time canvas. | Free + paid | Cloud | Any device |
| Leonardo.Ai | Leonardo (Canva) | Image generation | Game and asset-oriented platform with its own models. | Free + paid | Cloud | Any device |
| Magnific (Freepik) | Freepik | Image generation | Aggregator running its own Mystic model plus 30+ third-party models. | Free + paid | Cloud | Any device |
| Midjourney | Midjourney | Image generation | Strongest aesthetic quality and atmosphere; Discord and web app. | Paid | Cloud | Any device |
| Nano Banana (Gemini Image) | Google | Image generation | Conversational image generation and editing in Gemini; Pro adds high-res. | Free + paid | Cloud | Any device |
| Playground | Playground | Image generation | Design-focused editor routing to several top models. | Free + paid | Cloud | Any device |
| Qwen-Image | Alibaba | Image generation | Open model with excellent complex text rendering, including Chinese. | Open-weight | Cloud or local | High-end GPU (20B) |
| Recraft | Recraft | Image generation | Raster plus native vector and SVG output, with brand style sets. | Free + paid | Cloud | Any device |
| Reve | Reve AI | Image generation | Strong prompt adherence and typography from a Palo Alto startup. | Free + paid | Cloud | Any device |
| Seedream | ByteDance | Image generation | High-resolution (up to 4K) generation and editing. | Paid | Cloud | Any device |
| Stable Diffusion 3.5 | Stability AI | Image generation | The largest open LoRA and fine-tune ecosystem; runs locally. | Open-weight | Cloud or local | Gaming GPU 8GB+ |
| Z-Image | Alibaba | Image generation | Efficient 6B open model with top open photorealism; runs on consumer cards. | Open-weight | Cloud or local | Gaming GPU ~16GB |
| Adobe Firefly Video | Adobe | Video generation | Generate, translate, and lip-sync video; positioned as commercially safe. | Free + paid | Cloud | Any device |
| CogVideoX | Zhipu / THUDM | Video generation | Early, widely adopted open video family with modest hardware needs. | Open-weight | Local | From ~5GB VRAM |
| Google Veo | Google | Video generation | Google's flagship video model with native synchronized audio. | Free + paid | Cloud | Any device |
| Grok Imagine (Video) | xAI | Video generation | xAI's native video-plus-audio model inside Grok. | Free + paid | Cloud | Any device |
| Hailuo | MiniMax | Video generation | Video generator known for strong prompt adherence and complex motion. | Free + paid | Cloud | Any device |
| Hedra | Hedra | Video generation | Animates a single photo into a talking character driven by audio. | Free + paid | Cloud | Any device |
| HeyGen | HeyGen | Video generation | AI avatar video for marketing and dubbing across 175+ languages. | Free + paid | Cloud | Any device |
| Higgsfield | Higgsfield AI | Video generation | Creative platform running many models with camera-motion presets. | Free + paid | Cloud | Any device |
| HunyuanVideo 1.5 | Tencent | Video generation | Lightweight open video model; note the territorial license limits. | Open-weight | Cloud or local | ~14GB VRAM and up |
| Kaiber | Kaiber AI | Video generation | Multi-model creative video studio with a timeline editor. | Free + paid | Cloud | Any device |
| Kling | Kuaishou | Video generation | High-quality Chinese video model with strong motion; 4K on higher tiers. | Free + paid | Cloud | Any device |
| LTX-2 | Lightricks | Video generation | Open audio-plus-video model with native 4K and synchronized audio. | Open-weight | Cloud or local | Consumer GPUs |
| Luma Dream Machine | Luma AI | Video generation | Text and image-to-video with an agentic creative workflow. | Free + paid | Cloud | Any device |
| Mochi 1 | Genmo | Video generation | Open diffusion model with strong motion and prompt adherence; 480p preview. | Open-weight | Cloud or local | ~22GB+ VRAM |
| OpenAI Sora | OpenAI | Video generation | Formerly OpenAI's flagship video model; the product has been retired. | Discontinued | Cloud | Any device |
| Pika | Pika Labs | Video generation | Consumer video with editing effects; up to 1080p. | Free + paid | Cloud | Any device |
| Runway | Runway | Video generation | Flagship Western text and image-to-video platform with editing tools. | Free + paid | Cloud | Any device |
| Seedance | ByteDance | Video generation | ByteDance video model with native audio and real-human video support. | Paid | Cloud | Any device |
| Synthesia | Synthesia | Video generation | Enterprise AI avatar platform for training and corporate video at scale. | Paid | Cloud | Any device |
| Vidu | Shengshu | Video generation | Chinese video model with audio sync and multi-shot generation. | Free + paid | Cloud | Any device |
| Wan 2.2 | Alibaba | Video generation | Open Mixture-of-Experts video family; text, image, and a fast 5B hybrid. | Open-weight | Cloud or local | 5B on RTX 4090; A14B ~80GB |
| AssemblyAI | AssemblyAI | Voice, audio & music | Transcription API with speaker labels and audio-intelligence add-ons. | Free + paid | Cloud | Any device |
| Azure AI Speech | Microsoft | Voice, audio & music | Neural text-to-speech, custom voice, and speech-to-text on Azure. | Free + paid | Cloud | Any device |
| Cartesia (Sonic) | Cartesia | Voice, audio & music | Low-latency real-time TTS plus speech-to-text for voice apps. | Free + paid | Cloud | Any device |
| Chatterbox | Resemble AI | Voice, audio & music | Open zero-shot voice cloning TTS with a low-latency Turbo variant. | Open-weight | Cloud or local | GPU recommended |
| Deepgram (Nova-3) | Deepgram | Voice, audio & music | Fast, accurate transcription API, plus Aura TTS and voice agents. | Free + paid | Cloud | Any device |
| ElevenLabs | ElevenLabs | Voice, audio & music | High-quality text-to-speech, voice cloning, dubbing, and sound effects. | Free + paid | Cloud | Any device |
| ElevenLabs Music | ElevenLabs | Voice, audio & music | Text-to-music generation inside the ElevenLabs platform. | Free + paid | Cloud | Any device |
| ElevenLabs Scribe | ElevenLabs | Voice, audio & music | High-accuracy transcription API with a realtime variant. | Free + paid | Cloud | Any device |
| Google Cloud STT | Google | Voice, audio & music | Streaming and batch transcription across 45+ languages. | Free + paid | Cloud | Any device |
| Google Cloud TTS | Google | Voice, audio & music | Enterprise text-to-speech with WaveNet, Neural2, and Chirp 3 HD voices. | Free + paid | Cloud | Any device |
| Google Lyria | Google | Voice, audio & music | Enterprise music-generation model; 48kHz stereo with SynthID watermarking. | Paid | Cloud | Any device |
| Hume (Octave) | Hume AI | Voice, audio & music | Emotionally expressive text-to-speech with prompt-controlled tone. | Free + paid | Cloud | Any device |
| Kokoro | hexgrad (community) | Voice, audio & music | Small, fast open text-to-speech model that runs almost anywhere. | Open-weight | Cloud or local | CPU or modest GPU (82M) |
| Mubert | Mubert | Voice, audio & music | Royalty-free generative music for creators, plus an API. | Free + paid | Cloud | Any device |
| Murf AI | Murf | Voice, audio & music | Studio-style voiceover with 200+ voices, translation, and dubbing. | Free + paid | Cloud | Any device |
| MusicGen | Meta | Voice, audio & music | Open text-to-music model (AudioCraft); weights are non-commercial. | Open-weight | Cloud or local | GPU recommended |
| NVIDIA Canary | NVIDIA | Voice, audio & music | Open transcription and translation across 25 European languages. | Open-weight | Cloud or local | NVIDIA GPU |
| NVIDIA Parakeet | NVIDIA | Voice, audio & music | High-throughput open English transcription model. | Open-weight | Cloud or local | NVIDIA GPU |
| OpenAI Realtime voice | OpenAI | Voice, audio & music | End-to-end speech-in, speech-out model for low-latency voice agents. | Paid | Cloud | Any device |
| OpenAI TTS | OpenAI | Voice, audio & music | Developer text-to-speech API with steerable tone. | Paid | Cloud | Any device |
| Otter.ai | Otter.ai | Voice, audio & music | Meeting transcription with live notes, summaries, and speaker labels. | Free + paid | Cloud | Any device |
| PlayHT / PlayAI | PlayAI | Voice, audio & music | Text-to-speech and instant voice cloning for creators and agents. | Free + paid | Cloud | Any device |
| Resemble AI | Resemble AI | Voice, audio & music | Voice cloning, TTS, voice changer, plus deepfake detection. | Paid | Cloud | Any device |
| Riffusion | Riffusion | Voice, audio & music | Text-to-music app producing royalty-free full songs. | Free + paid | Cloud | Any device |
| Speechify | Speechify | Voice, audio & music | Consumer read-aloud text-to-speech across apps, plus an API. | Free + paid | Cloud or local | Any device |
| Stable Audio | Stability AI | Voice, audio & music | Text-to-audio for music and sound effects, enterprise-oriented. | Paid | Cloud | Any device |
| Stable Audio Open | Stability AI | Voice, audio & music | Open text-to-audio model for short samples and sound effects. | Open-weight | Cloud or local | GPU recommended |
| Suno | Suno | Voice, audio & music | Consumer text-to-song generator with vocals and instrumentation. | Free + paid | Cloud | Any device |
| Udio | Udio | Voice, audio & music | Text-to-music generator producing full songs with vocals. | Free + paid | Cloud | Any device |
| WellSaid Labs | WellSaid | Voice, audio & music | Enterprise text-to-speech for narration and corporate voiceover. | Paid | Cloud | Any device |
| Whisper | OpenAI | Voice, audio & music | Widely used open transcription model; multilingual and robust. | Open-weight | Cloud or local | GPU for large; CPU for small |
| AnythingLLM | Mintplex Labs | Run on your computer | Runtime app: all-in-one desktop app for chat with docs, RAG, and agents. | Free | Cloud or local | Your PC or Mac |
| Cohere Command (open) | Cohere | Run on your computer | Open model: enterprise RAG and agent models; open weights are non-commercial. | Open-weight | Cloud or local | 2x H100; R7B ~8-12GB |
| DeepSeek (open) | DeepSeek | Run on your computer | Open model: small distilled versions run locally; the flagships need a data center. | Open-weight | Cloud or local | Distills 1.5-8B local; flagship data-center |
| EuroLLM | EU consortium | Run on your computer | Open model: European multilingual model across all 24 EU official languages. | Open-weight | Cloud or local | 22B ~16-24GB |
| Falcon | TII (UAE) | Run on your computer | Open model: UAE family including hybrid-attention and Mamba variants. | Open-weight | Cloud or local | 1-10B laptop to 10GB; 180B server |
| Gemma | Google | Run on your computer | Open model: Google's open family (separate from Gemini), from mobile to server. | Open-weight | Cloud or local | 2-4B phone/laptop; 27B ~24GB |
| GLM | Zhipu AI | Run on your computer | Open model: Chinese open family, widely served, from local to server scale. | Open-weight | Cloud or local | Smaller ~8-24GB; GLM-5 server |
| gpt-oss | OpenAI | Run on your computer | Open model: OpenAI's first open LLMs since GPT-2; MoE with 128K context. | Open-weight | Cloud or local | 20b ~16GB; 120b ~80GB GPU |
| GPT4All | Nomic AI | Run on your computer | Runtime app: one-click desktop app; chat with your own files offline. | Free | Local | Your PC or Mac (CPU ok) |
| Granite | IBM | Run on your computer | Open model: enterprise small and mid models with transparent training. | Open-weight | Cloud or local | 3B laptop; 8B ~8GB; 30B ~24GB |
| Jan | Menlo Research | Run on your computer | Runtime app: open-source ChatGPT-style desktop app for local or cloud models. | Free | Cloud or local | Your PC or Mac |
| Kimi K2 | Moonshot AI | Run on your computer | Open model: large MoE strong on agentic coding; not laptop-runnable. | Open-weight | Cloud or local | ~1T MoE, data-center only |
| KoboldCpp | Community | Run on your computer | Runtime: single-file llama.cpp fork with a UI; popular for writing and roleplay. | Free | Local | Your PC or Mac |
| Llama | Meta | Run on your computer | Open model: the most widely deployed open family; Llama 4 adds MoE and multimodal. | Open-weight | Cloud or local | 8B ~8GB GPU / 16GB Mac; 70B ~48GB+ |
| llama.cpp | ggml-org | Run on your computer | Runtime: the C/C++ engine behind most local runners; CLI and a server. Developer tool. | Free | Local | Your PC or Mac (CPU or GPU) |
| Llamafile | Mozilla.ai | Run on your computer | Runtime: packs a model into one executable file you just run. | Free | Local | Your PC or Mac (no install) |
| LM Studio | LM Studio | Run on your computer | Runtime app: fully graphical desktop app that searches and runs open models. | Free | Local | Your PC or Mac |
| LocalAI | Community | Run on your computer | Runtime: self-hosted drop-in replacement for the OpenAI and Anthropic APIs. | Free | Local | Your PC or server (CPU ok) |
| Mistral / Mixtral | Mistral AI | Run on your computer | Open model: French lab with small dense and large MoE open releases. | Open-weight | Cloud or local | 7B ~6GB; large MoE on a server |
| MLC LLM | MLC AI | Run on your computer | Runtime: compiles models to run across GPUs, phones, and the browser. | Free | Local | PC, Mac, phone, or browser |
| MLX | Apple | Run on your computer | Runtime: Apple's framework with the fastest token generation on Apple Silicon. | Free | Local | Apple Silicon Mac |
| Msty | Msty | Run on your computer | Runtime app: polished desktop app with multi-model compare and knowledge stacks. | Free + paid | Cloud or local | Your PC or Mac |
| Nemotron | NVIDIA | Run on your computer | Open model: NVIDIA family tuned for agentic AI; Nano runs locally. | Open-weight | Cloud or local | Nano ~16-24GB; Ultra data-center |
| Ollama | Ollama | Run on your computer | Runtime app: popular local runner with a desktop chat app; no terminal needed. | Free | Local | Your PC or Mac |
| OLMo 3 | Allen Institute (Ai2) | Run on your computer | Open model: truly open research models with all code, data, and checkpoints. | Open-weight | Cloud or local | 7B ~8GB; 32B ~24GB |
| Open WebUI | Open WebUI | Run on your computer | Runtime: self-hosted web interface for local and cloud models; multi-user. | Free | Cloud or local | Your server or PC |
| Phi | Microsoft | Run on your computer | Open model: small reasoning-strong models built data-quality first. | Open-weight | Cloud or local | 14B ~8-10GB; mini fits 8GB |
| Qwen | Alibaba | Run on your computer | Open model: very active family, dense plus MoE, with strong coding variants. | Open-weight | Cloud or local | 8B ~8GB; 32B ~24GB; 235B server |
| SmolLM3 | Hugging Face | Run on your computer | Open model: small, fully open model with weights, data, and configs released. | Open-weight | Cloud or local | 3B on a laptop / 4-6GB |
| text-generation-webui | oobabooga | Run on your computer | Runtime: Gradio web UI for local models with many backends and an API. | Free | Local | Your PC or Mac |
| vLLM | vLLM project | Run on your computer | Runtime: high-throughput serving engine; OpenAI-compatible. Production tool. | Free | Cloud or local | Server-class GPU(s) |
| Amazon Bedrock | AWS | Developer access & hubs | Enterprise catalog of many models with AWS security and compliance. | Paid | Cloud | Any device |
| Azure AI Foundry | Microsoft | Developer access & hubs | Microsoft enterprise model catalog with Azure governance. | Paid | Cloud | Any device |
| DeepInfra | DeepInfra | Developer access & hubs | Serverless inference marketed on low per-token cost for open models. | Paid | Cloud | Any device |
| Fal | Fal | Developer access & hubs | Fast inference platform centered on generative media. | Paid | Cloud | Any device |
| Fireworks AI | Fireworks AI | Developer access & hubs | Managed inference for open and some closed models, with fine-tuning. | Paid | Cloud | Any device |
| Google Vertex AI | Google Cloud | Developer access & hubs | GCP model platform; Gemini plus open and third-party models. | Paid | Cloud | Any device |
| Groq | Groq | Developer access & hubs | Custom LPU hardware for very fast token generation on open models. | Paid | Cloud | Any device |
| Hugging Face Inference | Hugging Face | Developer access & hubs | Unified API on the Hub routing to many partner inference providers. | Free + paid | Cloud | Any device |
| OpenRouter | OpenRouter | Developer access & hubs | One API routing to 400+ models with fallback and billing in one place. | Paid | Cloud | Any device |
| Replicate | Replicate | Developer access & hubs | Run thousands of community and proprietary models via API. | Paid | Cloud | Any device |
| Together AI | Together AI | Developer access & hubs | Managed inference and fine-tuning for a large open-model catalog. | Paid | Cloud | Any device |
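
If you copy the table into a file, the free, local, and cloud filters described in the introduction can be approximated in a few lines. This is a minimal Python sketch, not how the page itself implements its filters. It parses rows in the same pipe-delimited layout as the table above and applies three predicates. The function names are hypothetical, and the choice of which Cost values count as "a free option" (here Free, Free + paid, and Open-weight) is an assumption, so adjust it if you read the column differently.

```python
from typing import Callable, Dict, List

COLUMNS = ["Tool", "Maker", "Category", "What it is", "Cost", "Where it runs", "Hardware"]

# Header plus four rows copied from the table above, used as sample input.
SAMPLE = """\
| Tool | Maker | Category | What it is | Cost | Where it runs | Hardware |
|---|---|---|---|---|---|---|
| Claude | Anthropic | Chat & text | Strong writing, careful reasoning, and document analysis; includes Claude Code. | Free + paid | Cloud | Any device |
| Midjourney | Midjourney | Image generation | Strongest aesthetic quality and atmosphere; Discord and web app. | Paid | Cloud | Any device |
| LM Studio | LM Studio | Run on your computer | Runtime app: fully graphical desktop app that searches and runs open models. | Free | Local | Your PC or Mac |
| Whisper | OpenAI | Voice, audio & music | Widely used open transcription model; multilingual and robust. | Open-weight | Cloud or local | GPU for large; CPU for small |
"""

# Assumption: these Cost values are treated as "has a free option".
FREE_COSTS = {"Free", "Free + paid", "Open-weight"}
LOCAL_PLACES = {"Local", "Cloud or local"}

Row = Dict[str, str]


def parse_rows(markdown: str) -> List[Row]:
    """Turn pipe-delimited table lines into dicts keyed by column name."""
    rows: List[Row] = []
    for line in markdown.splitlines():
        line = line.strip()
        if not line.startswith("|"):
            continue
        cells = [cell.strip() for cell in line.strip("|").split("|")]
        is_header = cells == COLUMNS
        is_separator = set(cells[0]) <= set("-: ")
        if len(cells) != len(COLUMNS) or is_header or is_separator:
            continue  # skip header, separator, and malformed rows
        rows.append(dict(zip(COLUMNS, cells)))
    return rows


def has_free_option(row: Row) -> bool:
    return row["Cost"] in FREE_COSTS


def runs_locally(row: Row) -> bool:
    return row["Where it runs"] in LOCAL_PLACES


def cloud_only(row: Row) -> bool:
    return row["Where it runs"] == "Cloud"


def names(rows: List[Row], keep: Callable[[Row], bool]) -> List[str]:
    """Return the Tool names of the rows that pass the filter."""
    return [row["Tool"] for row in rows if keep(row)]


rows = parse_rows(SAMPLE)
print(names(rows, has_free_option))  # ['Claude', 'LM Studio', 'Whisper']
print(names(rows, runs_locally))     # ['LM Studio', 'Whisper']
print(names(rows, cloud_only))       # ['Claude', 'Midjourney']
```

Rows that do not have exactly seven cells are skipped rather than guessed at, so a damaged row drops out of the results instead of being filtered on the wrong column.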