Skip to main content

MCP Gateway Positioning

A common question when evaluating STOA: "How does this relate to AI provider billing? Aren't tokens already managed by Claude/OpenAI?"

This page clarifies exactly what STOA MCP Gateway manages and its relationship to AI providers.

The Two Layers

LayerWho ManagesWhat's MeasuredBilling Model
AI ProviderAnthropic, OpenAI, etc.Tokens consumedPay per token
MCP Gateway (STOA)Your organizationTool invocationsPay per request
Key Insight

These are different things. STOA doesn't re-bill tokens — STOA bills tool invocations and provides governance.

What STOA MCP Gateway Does

1. Governance & Policy Enforcement

┌─────────────────────────────────────────────────────────┐
│ AI Agent (Claude) │
│ │ │
│ calls tool │
│ ▼ │
│ ┌─────────────────────────────────────────────────┐ │
│ │ STOA MCP Gateway │ │
│ │ ┌─────────┐ ┌─────────┐ ┌─────────────────┐ │ │
│ │ │ OPA │ │ Audit │ │ Rate Limiting │ │ │
│ │ │ Policies│ │ Trail │ │ per tenant │ │ │
│ │ └─────────┘ └─────────┘ └─────────────────┘ │ │
│ └─────────────────────────────────────────────────┘ │
│ │ │
│ forwards to │
│ ▼ │
│ Backend Service │
└─────────────────────────────────────────────────────────┘
  • OPA Policies: Fine-grained authorization per tool, per user, per tenant
  • Audit Trail: Complete log of who called what tool, when, with what parameters
  • Rate Limiting: Control usage per team, per application, per user

2. Multi-Tenancy & Isolation

  • Tenant Isolation: Each team/department sees only their authorized tools
  • Quotas: Set limits per tenant (e.g., "Marketing team: 10,000 calls/month")
  • Dashboards: Usage analytics per team, cost allocation

3. Unified Catalog

┌─────────────────────────────────────────┐
│ STOA Developer Portal │
├─────────────────────────────────────────┤
│ REST APIs │ MCP Tools │
│ ──────────── │ ────────── │
│ • Payment API │ • create_invoice │
│ • User API │ • search_orders │
│ • Product API │ • generate_report │
└─────────────────────────────────────────┘
Same portal, same subscription,
same governance

4. Developer Experience

  • Subscribe: One-click subscription to tools
  • Test: Try tools directly from the portal
  • Monitor: Real-time usage, latency, error rates
  • API Keys: Secure key management with 2FA

What STOA Does NOT Do

Not in Scope

These are explicitly not part of STOA's value proposition:

❌ We Don'tWhy
Re-bill Claude/OpenAI tokensThat's the AI provider's job
Intercept LLM responsesPrivacy concern, adds latency
Create a "Claude API wrapper"No value-add, just complexity
Count tokens in responsesAlready done by the provider

The Value Equation

Without STOA MCP Gateway:

  • AI agents call tools directly
  • No visibility into usage
  • No governance
  • No multi-tenant isolation
  • Each team manages their own tool access

With STOA MCP Gateway:

  • Centralized tool catalog
  • Policy-as-Code governance
  • Complete audit trail
  • Usage analytics & cost allocation
  • Self-service developer portal

Summary

┌──────────────────────────────────────────────────────────┐
│ │
│ AI Provider (Claude, OpenAI, etc.) │
│ └── Manages: Tokens, reasoning, generation │
│ └── Bills: Per token consumed │
│ │
│ STOA MCP Gateway │
│ └── Manages: Tool access, governance, multi-tenancy │
│ └── Bills: Per tool invocation (optional) │
│ │
│ Your Backend Services │
│ └── Execute: Business logic, data access │
│ └── Own: Your data, your APIs │
│ │
└──────────────────────────────────────────────────────────┘

STOA is the governance layer between AI agents and your enterprise tools.