Deploy an MCP Gateway with Docker Compose in 10 Minutes

February 12, 2026 · 10 min read

The STOA Platform Team

AI agents need a secure, standardized way to access your APIs. The Model Context Protocol (MCP) provides that bridge, and STOA Platform makes it trivial to deploy. In this tutorial, you'll learn how to set up a production-ready MCP gateway using Docker Compose in under 10 minutes.

New to MCP gateways? Start with our comprehensive guide: What is an MCP Gateway? to understand the architecture and security model before deploying.

By the end of this guide, you'll have a running gateway that exposes your existing REST APIs to AI agents like Claude, connects to authentication, and enforces runtime policies.

What You'll Build

You'll deploy a complete MCP gateway stack with:

stoa-gateway: Rust-based MCP server (edge-mcp mode)
Keycloak: OAuth2/OIDC authentication
PostgreSQL: Metadata storage for tools, subscriptions, and policies

This setup focuses on the gateway component with hands-on examples. For the full platform (Portal, observability, demo data), see the STOA quickstart guide.

Prerequisites

Before you start, make sure you have:

Docker 24+ and Docker Compose 2.x installed
curl and jq for testing
A REST API endpoint to expose (we'll use JSONPlaceholder as a demo)
Basic knowledge of Docker, REST APIs, and OAuth2

If you're migrating from an existing API gateway, check out our API Gateway Migration Guide first.

Step 1: Launch the Stack with Docker Compose

Create a new directory for your MCP gateway project:

mkdir mcp-gateway-demo
cd mcp-gateway-demo

Create a docker-compose.yml file:

services:
  postgres:
    image: postgres:16-alpine
    environment:
      POSTGRES_DB: stoa
      POSTGRES_USER: stoa
      POSTGRES_PASSWORD: changeme
    volumes:
      - postgres_data:/var/lib/postgresql/data
    healthcheck:
      test: ["CMD-SHELL", "pg_isready -U stoa"]
      interval: 10s
      timeout: 5s
      retries: 5

  keycloak:
    image: quay.io/keycloak/keycloak:24.0
    environment:
      KC_DB: postgres
      KC_DB_URL: jdbc:postgresql://postgres:5432/stoa
      KC_DB_USERNAME: stoa
      KC_DB_PASSWORD: changeme
      KEYCLOAK_ADMIN: admin
      KEYCLOAK_ADMIN_PASSWORD: admin
      KC_HTTP_ENABLED: "true"
      KC_HOSTNAME_STRICT: "false"
      KC_HEALTH_ENABLED: "true"
    command: start-dev
    ports:
      - "8081:8080"
    depends_on:
      postgres:
        condition: service_healthy
    healthcheck:
      test: ["CMD-SHELL", "exec 3<>/dev/tcp/localhost/8080 && echo -e 'GET /health/ready HTTP/1.1\\r\\nHost: localhost\\r\\nConnection: close\\r\\n\\r\\n' >&3 && cat <&3 | grep -q '200 OK'"]
      interval: 10s
      timeout: 5s
      retries: 10
      start_period: 30s

  stoa-gateway:
    image: ghcr.io/stoa-platform/stoa-gateway:latest
    environment:
      STOA_PORT: "8080"
      STOA_HOST: "0.0.0.0"
      STOA_KEYCLOAK_URL: http://keycloak:8080
      STOA_KEYCLOAK_REALM: stoa
      STOA_KEYCLOAK_CLIENT_ID: stoa-mcp-gateway
      STOA_LOG_LEVEL: info
      STOA_LOG_FORMAT: text
      STOA_GATEWAY_MODE: edge-mcp
      STOA_NATIVE_TOOLS_ENABLED: "true"
    ports:
      - "8082:8080"
    depends_on:
      postgres:
        condition: service_healthy
      keycloak:
        condition: service_healthy
    healthcheck:
      test: ["CMD", "curl", "-f", "http://localhost:8080/health"]
      interval: 10s
      timeout: 5s
      retries: 5
      start_period: 15s

volumes:
  postgres_data:

Launch the stack:

docker compose up -d

Verify all services are running:

docker compose ps

You should see three containers: postgres, keycloak, and stoa-gateway all in the "Up" state. Keycloak can take 30-60 seconds to become healthy.

Access Keycloak: Navigate to http://localhost:8081 and log in with admin/admin. You'll need to create the stoa realm and stoa-mcp-gateway client manually in this quickstart. For production, use the full Helm chart which includes automated realm setup.

Step 2: Verify the Gateway

Once all services are healthy, verify the gateway is running:

# Health check
curl -s http://localhost:8082/health | jq .
# Expected: {"status":"ok"}

# MCP discovery
curl -s http://localhost:8082/mcp | jq .

# MCP capabilities
curl -s http://localhost:8082/mcp/capabilities | jq .

You should see MCP capabilities including tools, resources, and prompts support.

Step 3: Register Your First MCP Tool

An MCP "tool" is a function that AI agents can call. Each tool maps to one of your REST API endpoints. Let's register a tool that searches contacts.

First, get an access token from Keycloak:

export KEYCLOAK_TOKEN=$(curl -s -X POST \
  'http://localhost:8081/realms/stoa/protocol/openid-connect/token' \
  -H 'Content-Type: application/x-www-form-urlencoded' \
  -d 'client_id=stoa-mcp-gateway' \
  -d 'client_secret=your-client-secret-here' \
  -d 'grant_type=client_credentials' \
  | jq -r '.access_token')

Now register the tool via the gateway's admin API:

curl -X POST http://localhost:8082/admin/tools \
  -H "Authorization: Bearer $KEYCLOAK_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "name": "search-contacts",
    "display_name": "Search Contacts",
    "description": "Search for contacts by name or email",
    "tenant_id": "acme",
    "endpoint": "https://jsonplaceholder.typicode.com/users",
    "method": "GET",
    "input_schema": {
      "type": "object",
      "properties": {
        "query": {
          "type": "string",
          "description": "Search query"
        }
      },
      "required": ["query"]
    }
  }'

Response:

{
  "id": "550e8400-e29b-41d4-a716-446655440000",
  "name": "search-contacts",
  "display_name": "Search Contacts",
  "status": "active",
  "created_at": "2026-02-12T10:30:00Z"
}

Your tool is now registered and ready to be called by AI agents.

Step 4: Connect an AI Agent

The gateway exposes an MCP-compliant server endpoint at http://localhost:8082/mcp. AI agents connect via Server-Sent Events (SSE) or Streamable HTTP transport.

Here's how Claude Desktop connects to your gateway. Add this to your Claude Desktop MCP settings (~/Library/Application Support/Claude/claude_desktop_config.json on macOS):

{
  "mcpServers": {
    "stoa-acme": {
      "url": "http://localhost:8082/mcp",
      "transport": "sse",
      "headers": {
        "Authorization": "Bearer YOUR_GATEWAY_API_KEY",
        "X-Tenant-ID": "acme"
      }
    }
  }
}

Restart Claude Desktop. You should now see "search-contacts" available as a tool in Claude's interface.

Test the tool from Claude:

User: Find contacts with the name "Leanne"
Claude: [calls search-contacts tool with query="Leanne"]

The gateway proxies the request to https://jsonplaceholder.typicode.com/users?query=Leanne, transforms the response, and returns it to Claude in MCP format.

Python SDK Example

If you're building a custom AI agent, use the STOA Python SDK:

from stoa_sdk import STOAClient

client = STOAClient(
    gateway_url="http://localhost:8082",
    api_key="YOUR_GATEWAY_API_KEY",
    tenant_id="acme"
)

# List available tools
tools = client.list_tools()
print(f"Available tools: {[t['name'] for t in tools]}")

# Call a tool
result = client.call_tool(
    name="search-contacts",
    arguments={"query": "Leanne"}
)
print(result)

This approach decouples your agent code from the underlying API structure. If you switch from JSONPlaceholder to your own CRM API, the agent code remains unchanged.

Step 5: Add a Runtime Policy

The gateway supports Open Policy Agent (OPA) policies for dynamic authorization, rate limiting, and data filtering.

Create a policy file policies/rate-limit.rego:

package stoa.policies

import future.keywords.if
import future.keywords.in

default allow := false

# Allow up to 100 requests per hour per tenant
allow if {
    input.method == "GET"
    count(quota_usage[input.tenant_id]) < 100
}

quota_usage[tenant_id] := requests if {
    tenant_id := input.tenant_id
    requests := [r | r := data.requests[_]; r.tenant_id == tenant_id]
}

Load the policy into the gateway:

curl -X POST http://localhost:8082/admin/policies \
  -H "Authorization: Bearer $KEYCLOAK_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "name": "rate-limit-acme",
    "tenant_id": "acme",
    "policy": "'"$(cat policies/rate-limit.rego)"'",
    "enabled": true
  }'

Now, tenant "acme" is limited to 100 tool calls per hour. The 101st call returns:

{
  "error": "quota_exceeded",
  "message": "Tenant acme has exceeded rate limit",
  "retry_after": 3600
}

Policies are evaluated before proxying to your backend API, so you never waste backend capacity on unauthorized requests.

Step 6: Monitor and Debug

The gateway exposes Prometheus metrics at http://localhost:8082/metrics:

curl -s http://localhost:8082/metrics | grep stoa_tool_calls_total

Sample metrics:

stoa_tool_calls_total{tenant_id="acme",tool_name="search-contacts",status="success"} 42
stoa_tool_calls_total{tenant_id="acme",tool_name="search-contacts",status="error"} 2
stoa_policy_evaluations_total{tenant_id="acme",policy_name="rate-limit-acme",result="allow"} 40

For structured logs, check the gateway container:

docker compose logs -f stoa-gateway

You'll see:

{
  "level": "info",
  "timestamp": "2026-02-12T10:45:00Z",
  "tenant_id": "acme",
  "tool_name": "search-contacts",
  "duration_ms": 123,
  "status": 200
}

For production observability, integrate with Grafana and Loki. See ADR-023: Zero Blind Spot Observability for details.

What You've Learned

You now have a working MCP gateway that:

Exposes REST APIs as MCP tools for AI agents
Authenticates via OAuth2/OIDC (Keycloak)
Enforces policies (rate limiting, authorization) with OPA
Monitors tool usage with Prometheus metrics

This is the foundation for production deployments. Next steps:

Add more tools: Each backend API endpoint becomes a tool
Enable mTLS: Secure gateway-to-backend communication (ADR-039)
Deploy to Kubernetes: Use the Helm chart
Integrate with your Control Plane: Manage tools via UI instead of curl

Production Checklist

Before deploying to production:

Replace changeme passwords with secrets from a vault
Enable TLS for all services (gateway, Keycloak, Postgres)
Configure Keycloak realm export for disaster recovery
Set up log aggregation (ELK, Loki, or CloudWatch)
Configure Prometheus alerts for quota exceeded, auth failures
Test failover scenarios (Postgres down, Keycloak unavailable)
Document your tool catalog in a Git repository

Version Note

This tutorial uses:

STOA Gateway: latest (Rust, edge-mcp mode)
Keycloak: 24.0
PostgreSQL: 16

For the latest versions, check the STOA releases page.

Frequently Asked Questions

What are the minimum Docker requirements for running an MCP gateway?

You need Docker 24+ and Docker Compose 2.x. For development/testing, allocate at least 4GB RAM to Docker Desktop. For production, deploy on a Linux server with 8GB+ RAM and persistent volumes for PostgreSQL data. The gateway itself is lightweight (Rust binary, ~50MB RAM under load), but Keycloak and PostgreSQL require more resources. See the quickstart guide for Kubernetes deployment options.

Is this setup production-ready?

The Docker Compose setup is suitable for development, testing, and low-traffic production deployments. Before production use, replace all changeme passwords with secrets from a vault, enable TLS for all services, configure log aggregation, and set up Prometheus alerts. For high-availability production deployments with multiple replicas, use the Helm chart on Kubernetes.

What's the next step after completing this quickstart?

After the quickstart, add more tools by registering additional REST API endpoints, enable mTLS for gateway-to-backend security, integrate with your Control Plane for UI-based tool management instead of curl, and deploy to Kubernetes for production use. Explore the full platform capabilities in the MCP gateway concepts guide and architecture documentation.

About STOA Platform: STOA is the open-source API gateway built for AI agents. Define your API contract once with the Universal API Contract (UAC), and expose it everywhere: MCP, REST, GraphQL, gRPC. Apache 2.0 licensed. Get started today.

Need help? Join our Discord community or check the troubleshooting guide.

Disclaimer: Product capabilities and configuration options may change between versions. This guide reflects the state of STOA Platform as of March 2026. For the most current setup instructions, refer to the official documentation.

What You'll Build​

Prerequisites​

Step 1: Launch the Stack with Docker Compose​

Step 2: Verify the Gateway​

Step 3: Register Your First MCP Tool​

Step 4: Connect an AI Agent​

Python SDK Example​

Step 5: Add a Runtime Policy​

Step 6: Monitor and Debug​

What You've Learned​

Further Reading​

Production Checklist​

Version Note​

Frequently Asked Questions​

What are the minimum Docker requirements for running an MCP gateway?​

Is this setup production-ready?​

What's the next step after completing this quickstart?​