Skip to main content

Build vs Buy API Gateway: True Cost Analysis for SaaS

· 13 min read
STOA Team
The STOA Platform Team

Corrections & Updates (2026-04-16): An earlier version of this article included three-year TCO tables with specific Euro totals and a managed-SaaS per-call rate. Those figures were illustrative but presented with a precision the underlying inputs did not support. This version replaces the fabricated tables with a qualitative framework and points to primary sources (salary databases, vendor pricing pages) so readers can plug in their own inputs. The qualitative conclusion — "for most mid-stage SaaS, OSS self-hosted beats custom build and matches or beats managed SaaS beyond moderate scale" — is unchanged. See commit history for the diff.

"We can build that ourselves in a sprint." We have all said it. Sometimes it is true. For most infrastructure decisions, it is not — especially for API gateways, where the scope of what "that" means expands significantly once you are in production.

This is the final installment of the SaaS Playbook series. We have covered multi-tenancy, rate limiting, audit logging, scaling, and production readiness. Now we tackle the meta-question: should you have built all of this yourself?

SaaS Production Checklist: 20 Gates Before Go-Live

· 12 min read
STOA Team
The STOA Platform Team

You have built it. You have tested it. Your team says it is ready. Before you open the doors, run through this checklist. Every item here represents a failure mode that real SaaS companies have experienced in production. Not theoretical risks — actual incidents that cost companies customers, regulatory scrutiny, or engineer weekends.

This is Part 5 (final) of the SaaS Playbook series. It assumes you have implemented the foundations covered in Parts 1, 2, 3, and 4.

Scaling Multi-Tenant APIs from 50 to 5000 Tenants

· 11 min read
STOA Team
The STOA Platform Team

Scaling a multi-tenant SaaS API is not the same as scaling a single-tenant one. At 50 tenants, your API gateway does a small amount of per-tenant work on each request — resolving a policy, checking a rate limit, validating a token. At 5000 tenants, that same work multiplied across thousands of concurrent connections creates challenges that do not show up in early load tests.

This is Part 4 of the SaaS Playbook series. We assume you have already implemented the foundations: multi-tenancy, rate limiting, and audit logging. Now you need to scale them.

SaaS Audit Logging: GDPR, SOC 2, and Per-Tenant Isolation

· 13 min read
STOA Team
The STOA Platform Team

Every SaaS product eventually faces a compliance question. An enterprise customer asks for a SOC 2 Type II report. A European customer requests a GDPR audit log. A financial services customer needs proof that no one accessed their data without authorization. How you answer these questions — and whether you can answer them at all — depends entirely on decisions you made when building your logging infrastructure.

This is Part 3 of the SaaS Playbook series. Part 1 covered multi-tenancy fundamentals. Part 2 covered rate limiting strategies. Here we tackle audit logging and compliance.

SaaS Rate Limiting: Per-Tenant Strategies That Scale

· 11 min read
STOA Team
The STOA Platform Team

Rate limiting is the difference between a SaaS product that scales gracefully and one that falls over every time a customer runs a batch job. But standard rate limiting — one global bucket, one set of limits — does not work for multi-tenant SaaS. You need per-tenant, per-tier, per-endpoint rate limiting that can enforce different quotas for different customers without letting anyone degrade the experience for others.

This is Part 2 of the SaaS Playbook series. Part 1 covered multi-tenancy fundamentals and tenant isolation models. Here we go deep on rate limiting strategies.

SMB API Gateway Guide 2026: Find the Right Fit

· 13 min read
STOA Team
The STOA Platform Team

Corrections & Updates (2026-04-16): An earlier version of this guide included a Total Cost of Ownership table with specific monthly Euro ranges for self-hosted and managed options. Those figures were illustrative but presented with a precision the underlying inputs did not support. This version replaces the table with a qualitative framework and links to each vendor's public pricing page so readers can plug in their own request volumes and infrastructure rates. The qualitative conclusion — self-hosted open source wins at moderate-to-high scale, managed options stay competitive at low volume — is unchanged.

Choosing an API gateway as a small or medium business in 2026 is harder than it should be. Most comparison articles assume you have a dedicated platform team, a six-figure infrastructure budget, and months to spare on evaluation. Most SaaS companies have none of those things.

This buying guide is written for SMB teams: typically 5-30 engineers, a product that is live (or nearly live), and a need for production-grade API management without enterprise complexity and enterprise pricing.

Multi-Tenancy 101: SaaS Tenant Isolation That Scales

· 11 min read
STOA Team
The STOA Platform Team

Multi-tenancy is the architectural backbone of every SaaS product. Done well, it lets you serve thousands of organizations from a single deployment with strong isolation, predictable costs, and zero cross-contamination. Done poorly, it is the source of your worst production incidents — the kind where tenant A's data appears in tenant B's response.

This is Part 1 of the SaaS Playbook series. We cover the foundational concepts and how STOA handles multi-tenancy at the API gateway layer. Later parts go deep on rate limiting strategies, audit and compliance, scaling, and production checklists.

Rate Limiting That Works (Freelancer Security Part 2)

· 10 min read
STOA Team
The STOA Platform Team

You set up rate limiting: 100 requests per minute. Done, right?

Not quite. A fixed limit of 100 req/min breaks legitimate users during burst activity, lets bots abuse you with slow trickle attacks, and doesn't differentiate between your free users and your paying customers.

This is Part 2 of the series. We'll go deep on rate limiting — the strategies that work in practice.