The AI Ecosystem Map: Labs, Models, Apps, Tools & Frameworks

The AI Ecosystem Map

Why This Is So Confusing

The same words get reused across completely different layers of the stack. "Claude" is a company, a model family, a chat app, a coding CLI, a desktop app, a VS Code extension, and a web product. "Copilot" means GitHub Copilot (an IDE extension), Microsoft Copilot (a chat app), and GitHub Copilot coding agent (an autonomous PR writer) — all from different teams.

This page gives every confusing term a precise home in a six-layer map. Once you know which layer a thing lives in, the naming makes sense.

Layer 1: AI Labs          — who trains the models
Layer 2: Foundation Models — the actual AI weights
Layer 3: Consumer Apps    — chat UIs for everyday use
Layer 4: Developer Tools  — coding IDEs, extensions, CLIs
Layer 5: Frameworks       — SDKs for building AI systems
Layer 6: Inference Infra  — where the model actually runs

Layer 1 — AI Labs (Who Trains the Models)

These are the companies that spend billions training foundation models. A lab is not an app — it is the upstream source for the weights that power everything else.

Lab HQ Known for Flagship model (Apr 2026)
OpenAI US (Microsoft-backed) ChatGPT, GPT family, Codex, DALL-E GPT-5.5
Anthropic US (Amazon/Google-backed) Claude family, Claude Code Claude Opus 4.7
Google / Google DeepMind US Gemini family, Gemma (open-weight) Gemini 3.1 Pro
Meta AI US (open-weight) Llama family Llama 4 Scout
xAI US (Elon Musk) Grok family Grok 4.20
Mistral AI France Mistral Large, open-weight small models Mistral Large 3
Moonshot AI China Kimi family (open-weight MoE) Kimi K2.5
MiniMax China MiniMax M-series MiniMax M2.5
Zhipu AI / Z.ai China GLM family (open-weight) GLM-5
Alibaba China Qwen family Qwen 3.5
DeepSeek China (Lianxin-backed) DeepSeek-V series (open-weight) DeepSeek V4
Perplexity AI US Sonar (search-grounded; built on Llama) Sonar Pro
Cohere Canada Command family (enterprise focus) Command R+

Key point: Labs and models are not the same thing. Anthropic is the lab. Claude is the model. Claude.ai is the app. All three words get used interchangeably in conversation, which is where the confusion starts.


Layer 2 — Foundation Models (The AI Brain)

A foundation model is a set of trained weights — not an app, not a service, not a chatbot. The same weights can power a chat interface, a coding CLI, a VS Code extension, and an API endpoint simultaneously.

Core insight: GPT-5.4 is a model. ChatGPT is an app that uses GPT-5.4. Cursor is another app that also uses GPT-5.4. GitHub Copilot can use it too. The same underlying model, accessed through completely different surfaces.

OpenAI GPT-5.x Family (as of April 23, 2026 — GPT-5.5 released today)

Model What it is Context API price (input)
GPT-5.5 Latest flagship; best coding + agentic; rolling out to ChatGPT and Codex 1M tokens API coming soon
GPT-5.4 Current API flagship; powers most production apps 1M tokens $2.50/M tokens
GPT-5.4 Pro Higher-quality variant of GPT-5.4 1M tokens Premium
GPT-5.4 mini Near-flagship performance, lower cost + latency 400K tokens $0.75/M tokens
GPT-5.4 nano Cheapest GPT-5-class model for simple tasks 400K tokens $0.20/M tokens
GPT-5.3-Codex Dedicated agentic coding variant; powers the Codex product Codex-only
Cursor Composer 2 Cursor's own RL-trained coding model (not OpenAI) Cursor-only

Anthropic Claude 4.x Family

Model What it is Context Best use
Claude Opus 4.7 Latest GA (April 15, 2026); step-change agentic coding 1M tokens Complex multi-file coding, long-horizon tasks
Claude Sonnet 4.6 Speed/cost balance; most popular for production 1M tokens (beta) Everyday coding, high-volume production
Claude Haiku 4.5 Fast, cheap, high-volume tasks Simple completions, rapid responses

Google Gemini 3.x Family

Model What it is Context Notes
Gemini 3.1 Pro Multimodal flagship; leads Artificial Analysis Intelligence Index 1M tokens Best breadth across benchmarks
Gemini 3 Flash Default model in the Gemini app; fast balance 1M tokens Consumer app default
Gemini 3.1 Flash-Lite Cheapest large-context option commercially available 1M tokens $0.25/M input
Gemma 4 (31B) Open-weight; Apache 2.0; runs on-device 256K tokens #3 on Arena AI leaderboard

Other Notable Models (April 2026)

Model Lab What makes it notable
Kimi K2.5 Moonshot AI 1T parameter MoE, open-weight (modified MIT); 256K context; agentic swarm (up to 100 sub-agents); $0.60/M input
MiniMax M2.5 MiniMax 80.2% on SWE-bench Verified — matches best closed models; $0.30/M input
Grok 4.20 xAI Runs 4 parallel agents internally; real-time X (Twitter) data access
Llama 4 Scout Meta 10M token context window; open-weight; Apache 2.0
DeepSeek V4 DeepSeek ~$0.28/M input; ~90% of GPT-5.4 quality; built on Huawei Ascend (no NVIDIA)
GLM-5 Zhipu / Z.ai 77.8% SWE-bench; top open-source Chatbot Arena Elo; $3/month subscription tier
Qwen 3.5 (9B) Alibaba 9B model matching 120B+ models on reasoning; Apache 2.0

Layer 3 — Consumer AI Apps (Chat for Everyday Use)

These are the products a non-technical person opens to talk to an AI. No coding, no API keys, no setup.

App Company Primary model Where you find it
ChatGPT OpenAI GPT-5.x family Web, iOS, Android, macOS/Windows desktop
Claude.ai Anthropic Claude Opus/Sonnet Web, iOS, Android, macOS desktop
Gemini Google Gemini 3.x Web, iOS, Android; baked into Google Workspace
Microsoft Copilot Microsoft GPT-5.x via Azure Web, Windows, iOS, Android
Grok xAI Grok 4.x Web, X (Twitter) app
Perplexity Perplexity AI Sonar (Llama-based) Web, iOS, Android
Kimi Moonshot AI Kimi K2.5 Web, iOS, Android

These are all chatbots. They differ in which model they use, what live data they can access (web, email, calendar), and what extra features they layer on top (image generation, voice mode, memory, etc.).

Consumer App Subscription Plans (April 2026)

Consumer AI apps use flat-rate subscription pricing — you pay a fixed monthly fee for access to the app and its features, not per message sent. This is fundamentally different from API pricing (see Layer 6), where you pay per token consumed and get raw model access.

OpenAI / ChatGPT

Plan Price Models included Key features
Free $0/mo GPT-5.3 Instant (limited; ads in US) Basic chat, web search, limited image gen
Go $8/mo GPT-5.3 + more volume, still has ads More messages; missing advanced features
Plus $20/mo GPT-5.3 + GPT-5.4 Thinking Deep Research (10/mo), Sora, Codex, Agent Mode
Pro $100 $100/mo Everything in Plus 5× higher limits than Plus
Pro $200 $200/mo GPT-5.4 Pro 20× Plus limits, 250 Deep Research runs/mo, double context window
Business $25/user/mo Unlimited GPT-5.4 + Thinking 60+ integrations (Slack, Drive, GitHub), SOC 2, SAML SSO, team workspace
Enterprise Custom All models Privately hosted AI, SCIM, audit logs, dedicated support

Anthropic / Claude.ai

Plan Price Models included Key features
Free $0/mo Sonnet + Haiku Basic chat; web, iOS, Android, desktop
Pro $20/mo ($17 annual) Opus + Sonnet + Haiku Claude Code, Research, cross-conversation memory, ~5× more usage than Free
Max 5× $100/mo All models 5× Pro usage, priority access, early features
Max 20× $200/mo All models 20× Pro usage, highest output limits, parallel agent workflows
Team (Standard) $25/seat/mo All Max features Admin controls, SSO, no training on data by default
Team (Premium) $100/seat/mo All models + Claude Code 5× Standard usage; for developer-heavy teams
Enterprise Custom All models 500K context window, HIPAA-ready, SCIM, audit logs, compliance API

Google / Gemini

Plan Price Models included Key features
Free $0/mo Gemini Flash (lighter) Basic chat; limited features
Google AI Pro $19.99/mo Gemini 2.5 Pro + Gemini 3 (US) Deep Research, Veo video gen, Gemini in Gmail/Docs/Sheets, 1,000 AI credits/mo
Google AI Ultra ~$41.67/mo ($124.99/3 months) Gemini 3.1 Pro, all models Highest limits, 25,000 AI credits/mo, $100/mo Google Cloud credits, YouTube Premium

Subscription ≠ API access. None of these consumer plans give you programmatic API access. If you want to call a model from your own code or product, you need a separate API account with pay-per-token billing — see Layer 6.


Layer 4 — Developer / Coding Tools

This is where the naming chaos peaks. The same capability (an AI that reads and edits your code) comes packaged as four completely different form factors:

                    ┌─────────────────────────────────┐
                    │      Foundation Model            │
                    │  (GPT-5.5, Claude Opus 4.7, ...) │
                    └──┬──────┬──────────┬────────┬───┘
                       │      │          │        │
                  ┌────▼──┐ ┌─▼──────┐ ┌▼──────┐ ┌▼──────────┐
                  │AI-Native│ │IDE     │ │Terminal│ │Web/Desktop│
                  │IDE     │ │Extension│ │CLI    │ │Coding App │
                  │(new    │ │(add-on │ │(no GUI)│ │(GUI agent)│
                  │editor) │ │to your │ │       │ │           │
                  │        │ │editor) │ │       │ │           │
                  └────────┘ └────────┘ └───────┘ └───────────┘

4a — AI-Native IDEs (replaces your editor)

You install this instead of VS Code. It's a complete editor built from the ground up around AI.

Tool Built on Models supported Key differentiator
Cursor VS Code fork Claude, GPT-5.x, Gemini, Composer 2 Best autocomplete (Supermaven); Composer multi-file editing; 1M+ users
Windsurf Proprietary Claude, GPT, Gemini Flows agentic mode; beginner-friendly
Zed Rust (native) Multi-model Fastest keystroke latency; strong local-model story
Google Antigravity Proprietary Gemini-first Public preview; Manager view for agent oversight
Trae Proprietary (ByteDance) Multi-model Growing fast; Asia-focused
Kiro Proprietary (Amazon) Amazon Nova AWS-native; early access

4b — IDE Extensions (adds AI to your existing editor)

You keep VS Code (or JetBrains, or Xcode) and install a plugin. Zero disruption to your current workflow.

Extension Company Works in Models
GitHub Copilot GitHub / Microsoft VS Code, JetBrains, Xcode, Neovim, Eclipse, Vim GPT-5.x, Claude, Gemini (your choice)
Claude Code (extension) Anthropic VS Code, JetBrains Claude Sonnet / Opus
Codex (extension) OpenAI VS Code, Cursor, Windsurf GPT-5.5 / GPT-5.4
Gemini Code Assist Google VS Code, JetBrains Gemini 3.1 Pro
Amazon Q Developer AWS VS Code, JetBrains Amazon Nova
Continue Open-source VS Code, JetBrains BYOK — any model you configure

4c — Terminal / CLI Agents (lives in your terminal, no GUI)

You type a command in your terminal. The agent reads and writes your actual local files, runs shell commands, commits to git — all without any visual editor. Maximum control, steepest learning curve.

Tool Company License Key trait
Claude Code CLI Anthropic Proprietary claude command; 80.8% SWE-bench Verified; 1M context; MCP support; CLAUDE.md memory files; the most capable terminal agent
Codex CLI OpenAI Open-source (MIT) codex command; BYOK with any ChatGPT-compatible model; sandboxed execution
Aider Open-source Apache 2.0 Git-native workflow; strong on refactoring; BYOK
OpenCode Open-source MIT Multi-model BYOK; free (you pay for API only)

4d — Web / Desktop Coding Agent Apps (GUI wrappers for agent workflows)

Dedicated apps for coding tasks — not general-purpose chat. They give you a visual interface to the same agent capability you'd run in the terminal.

App Company Surface What it is
Claude Code Desktop Anthropic macOS app (Windows preview) Visual diffs; Cowork background agents; Routines (scheduled automations); parallel sessions
claude.ai/code Anthropic Browser Cloud-sandboxed Claude Code; GitHub repo cloning and integration
Codex (chatgpt.com/codex) OpenAI Browser Web-based Codex coding agent; cloud task execution with approval workflow

"Claude Code" is one brand name for four different surfaces: CLI, VS Code extension, Desktop App, and web. They share the same Claude model and billing account but have very different UX and capability levels. The CLI is the most powerful (multi-agent, unlimited sessions, full MCP, full filesystem). The web version is the most restricted (cloud sandbox only).


Layer 5 — Agentic Frameworks (SDKs for Building AI Systems)

These are developer libraries for engineering teams building AI-powered applications. They are not end-user tools.

If you are a developer writing Python or TypeScript to build the next Claude Code, Cursor, or an internal AI workflow — this is your layer.

Framework Who Language Best for When to pick it
LangChain / LangGraph LangChain Inc. Python, TypeScript Complex stateful workflows; production systems Needs state machines, checkpointing, human-in-the-loop (48K+ stars)
CrewAI CrewAI Inc. Python Role-based multi-agent teams; rapid prototyping Need a working multi-agent prototype fast
AutoGen (v0.4+) Microsoft Python, .NET Conversational multi-agent debate / consensus Agents need to negotiate or debate; enterprise .NET shops
PydanticAI Pydantic team Python Type-safe production agents; FastAPI integration New project where type safety and DI matter
LlamaIndex LlamaIndex Inc. Python RAG + agentic workflows over large document sets Document-heavy knowledge retrieval
Semantic Kernel Microsoft C#, Python, Java Enterprise .NET / Azure integration Microsoft stack; existing Azure identity
Google ADK Google Python A2A-compatible agents on Google Cloud Building agents that use A2A protocol
OpenAI Agents SDK OpenAI Python, TypeScript Simple tool-calling agents using OpenAI models OpenAI-first shop; want minimal abstraction

You almost never need a framework if you are an end-user. Frameworks are for the team building the product you will eventually use. If you are deciding between Cursor and GitHub Copilot, this layer is not relevant to you.


Layer 6 — Inference Providers (Where the Model Actually Runs)

Inference providers are usually invisible to end users. They are the cloud services that actually host the model weights and serve the responses. A single lab may run its own inference (Anthropic API) or license to third-party providers (Mistral on Azure, Claude on Bedrock).

Provider What it offers
OpenAI API GPT-5.x models via REST; used by ChatGPT, Codex, and thousands of third-party apps
Anthropic API Claude models; used by Claude.ai, Claude Code, and third-party apps
Google AI Studio Gemini models for developers; free tier; easy API access
Google Vertex AI Gemini + third-party models for enterprise; compliance, private networking
AWS Bedrock Multi-provider (Claude, Llama, Mistral, Gemma) via AWS; enterprise compliance and IAM
Azure OpenAI OpenAI models served through Microsoft's cloud; enterprise SLAs, data residency
OpenRouter Aggregates 150+ models from all providers via a single API and unified billing
Groq Ultra-fast inference (Llama, Mistral, Gemma) via custom LPU silicon; lowest latency
Fireworks AI Fast open-model hosting; fine-tuning support
Together AI Open-weight model hosting with fine-tuning and LoRA
Ollama Run open-weight models (Llama, Mistral, Gemma, Qwen, etc.) entirely on your own hardware; no API key, no cloud, no cost per token

API Pricing: On-Demand Pay-Per-Token (April 2026)

APIs charge per million tokens (MTok) — input (your prompt) and output (the model's response) are billed separately. There is no subscription; you pay only for what you use. This makes APIs ideal for variable workloads, automation, and building products.

OpenAI API

Model Input (per 1M tokens) Output (per 1M tokens) Notes
GPT-5.4 $2.50 $10.00 Current API flagship; 1M context
GPT-5.4 mini $0.75 ~$3.00 Near-flagship performance at lower cost
GPT-5.4 nano $0.20 ~$0.80 High-volume, simple tasks
GPT-5.5 TBA TBA API availability announced soon

Anthropic API

Model Input (per 1M tokens) Output (per 1M tokens) Notes
Claude Opus 4.7 $5.00 $25.00 Latest flagship; 1M context
Claude Sonnet 4.6 $3.00 $15.00 Most popular in production
Claude Haiku 4.5 $1.00 $5.00 Fast, high-volume tasks
(Prompt caching) 70–90% off On repeated/cached context

Google Gemini API (via AI Studio or Vertex AI)

Model Input (per 1M tokens) Output (per 1M tokens) Notes
Gemini 3.1 Pro $2.00 / $4.00† $12.00 / $18.00† Most capable; context-tiered pricing
Gemini 2.5 Pro $1.25 / $2.50† $10.00 / $15.00† Strong coding + reasoning
Gemini 3 Flash $0.50 $3.00 Fast, balanced
Gemini 2.5 Flash-Lite $0.10 $0.40 Cheapest large-context option

†Lower price for prompts ≤200K tokens; higher for prompts >200K tokens.

Free API tiers: Google AI Studio offers 25 Gemini 2.5 Pro requests/day and 1,500 Flash requests/day free — generous for development and testing. Anthropic gives new accounts small free credits. OpenAI has no free tier but has very cheap nano/mini models.

Subscription vs. API breakeven: At heavy daily use (e.g., 8+ hours of Claude Code agentic work), the Claude Max 20× subscription at $200/month is typically cheaper than equivalent API token spend. For lighter, variable, or bursty usage, pay-as-you-go API is usually more cost-effective. Model your actual usage pattern before committing to either.


Common Confusions, Resolved

"Anthropic vs. Claude"

  • Anthropic = the company (lab). Trains the models.
  • Claude = the model family (Opus, Sonnet, Haiku). The weights.
  • Claude.ai = the consumer chat app (Layer 3).
  • Claude Code = the developer coding tool (Layer 4), which itself ships in four forms.

"ChatGPT vs. GPT-5.4 vs. GPT-5.5"

  • GPT-5.4 / GPT-5.5 = models (Layer 2). The AI brain.
  • ChatGPT = the consumer app (Layer 3) that uses GPT-5.x.
  • Cursor, GitHub Copilot, and your custom app can also use the same GPT-5.4 model via the API.

"ChatGPT (the app) vs. OpenAI API — same model, very different harness"

This is one of the most practically important distinctions to understand.

Same model, completely different experience. ChatGPT and the OpenAI API both use GPT-5.x weights — but ChatGPT wraps the model in a thick application layer ("harness") that adds features, enforces limits, and abstracts away control. The API gives you the raw model.

What you get ChatGPT (consumer app) OpenAI API (developer access)
Pricing model Flat-rate subscription (Free → $200/mo) Pay-per-token; no monthly minimum
Model selection Auto-routing router picks model for you; you can nudge it You explicitly specify the model (gpt-5.4, gpt-5.4-mini, etc.)
Context window Tiered by plan (~320 pages on Plus; ~680 on Pro $200) Full per-model spec (1M tokens on GPT-5.4)
Memory Built-in cross-conversation memory None by default — you manage your own context
Web browsing Built-in (Bing-powered) Tool you add yourself
Image generation Built-in (DALL-E) Separate Images API call
Code execution Built-in sandbox Tool you add yourself
Voice mode Built-in Separate Realtime API
System prompt Managed by OpenAI (you can't see it) You write it; full control
Tools / function calling Predefined by OpenAI You define your own tools
Data privacy Training opt-out varies by plan; conversation storage on OpenAI servers No training on API data by default; you control retention
Best for Individuals and teams doing knowledge work Developers building products or automating workflows

Key insight: A developer paying $20/month for ChatGPT Plus and a developer paying $20 in API tokens are not getting the same thing. The Plus subscriber gets a polished product with baked-in tools and routing but less control. The API caller gets raw model access with full control but must build the tooling layer themselves. For many developer use cases, the API is actually cheaper per effective output — teams that switch from ChatGPT to the API often cut costs 30–50% by routing simpler tasks to cheaper models explicitly.

ChatGPT's auto-router: Since early 2026, ChatGPT uses an automatic routing layer that picks between GPT-5.3 Instant and GPT-5.4 Thinking based on query complexity. You can override this by manually selecting a model, but the default is opaque. The API has no such routing — what you ask for is exactly what runs.

"Claude Code CLI vs. Claude Code Desktop vs. Claude Code extension"

All three are branded "Claude Code." All use Claude models. They are very different:

CLI Desktop App VS Code Extension
Where it runs Your terminal macOS app (Win preview) VS Code sidebar
File access Full filesystem Full local filesystem Current workspace
Multiple sessions Unlimited Yes (new parallel design) One per workspace
MCP support Full Partial Yes
Best for Power users, automation, overnight runs Visual workflow, Cowork agents Daily coding inside VS Code

"Codex vs. ChatGPT"

Both are OpenAI products. ChatGPT is a general-purpose assistant. Codex is specifically for autonomous coding tasks. GPT-5.3-Codex and GPT-5.5 are the models inside Codex. "Codex" by itself means the product/app (browser + CLI + extension).

"Cursor vs. GitHub Copilot"

  • Cursor = a whole new IDE. You switch away from VS Code (though it feels identical). AI is woven into every keystroke.
  • GitHub Copilot = a plugin. You stay in VS Code (or JetBrains, or Xcode). Lower disruption, lower cost ($10/month vs. $20/month).
  • Both can use the same underlying GPT-5.x or Claude models.

"LangChain vs. Claude Code"

  • LangChain / LangGraph = a developer framework (Layer 5) for building AI apps. Engineering tool.
  • Claude Code = an end-user developer tool (Layer 4). A product you use.
  • A software team might use LangChain to build something like Claude Code for their company.

"IDE vs. IDE Extension vs. CLI"

  • IDE = the whole editor. Cursor and Zed are IDEs. You open them instead of VS Code.
  • IDE Extension = a plugin installed inside your existing editor. GitHub Copilot is an extension.
  • CLI = runs in a terminal window. No visual editor at all. Claude Code CLI is a CLI.

"Gemini vs. Google"

  • Google / Google DeepMind = the lab (Layer 1).
  • Gemini = the model family (Layer 2): Gemini 3.1 Pro, Gemini 3 Flash, etc.
  • Gemini (the app) = the consumer chat product (Layer 3), like ChatGPT but from Google.
  • Gemini Code Assist = the IDE extension (Layer 4).
  • Google Antigravity = a new AI-native IDE (also Layer 4).
  • Same word, four different layers.

How the Layers Connect

┌─────────────────────────────────────────────────────────────────┐
│  Layer 6: Inference Providers                                   │
│  (OpenAI API · Anthropic API · Bedrock · Vertex · OpenRouter)  │
└──────────────────────────────┬──────────────────────────────────┘
                               │ serve weights to
┌──────────────────────────────▼──────────────────────────────────┐
│  Layer 2: Foundation Models                                     │
│  (GPT-5.5 · Claude Opus 4.7 · Gemini 3.1 Pro · Kimi K2.5 ...)  │
└────────────┬──────────────────────────────────────┬────────────┘
             │ power                                 │ power
┌────────────▼───────────────┐   ┌──────────────────▼───────────┐
│  Layer 3: Consumer Apps    │   │  Layer 4: Developer Tools    │
│  (ChatGPT · Claude.ai ·    │   │  (Cursor · Copilot ·         │
│   Gemini · Grok ...)       │   │   Claude Code · Codex ...)   │
└────────────────────────────┘   └──────────────────────────────┘

                                                  │ built with
                                 ┌────────────────┴─────────────┐
                                 │  Layer 5: Frameworks         │
                                 │  (LangGraph · CrewAI ·       │
                                 │   AutoGen · PydanticAI ...)  │
                                 └──────────────────────────────┘
     ┌─────────────────────────────────────────────────────────┐
     │  Layer 1: AI Labs (train and update the models)         │
     │  (OpenAI · Anthropic · Google · Meta · xAI · Mistral …) │
     └─────────────────────────────────────────────────────────┘

Quick Reference: What Am I Looking At?

You hear... It lives in... It is a...
OpenAI, Anthropic, Google DeepMind Layer 1 A company that trains AI models
GPT-5.5, Claude Opus 4.7, Gemini 3.1 Pro Layer 2 A set of trained model weights
ChatGPT, Claude.ai, Gemini app Layer 3 A consumer chat application
Cursor, Windsurf, Zed Layer 4a An AI-native IDE (replaces VS Code)
GitHub Copilot, Gemini Code Assist Layer 4b An IDE extension (plugin for your existing editor)
Claude Code CLI, Codex CLI, Aider Layer 4c A terminal agent (no GUI)
Claude Code Desktop, chatgpt.com/codex Layer 4d A GUI coding agent app
LangChain, CrewAI, AutoGen, PydanticAI Layer 5 A developer framework for building AI apps
Bedrock, Vertex AI, Ollama, OpenRouter Layer 6 Inference infrastructure (where the model runs)

See Also

PRIVATE PREVIEW

Request early access

Amprealize is invite-only during the preview. Share a little context and we’ll reach out.