API Gateway

Scope

This file covers API gateway architecture decisions including gateway selection, authentication enforcement, traffic management, request transformation, and API lifecycle concerns at the gateway layer. API gateways handle north-south traffic (external clients to internal services) and optionally east-west traffic between internal services. For API design conventions (versioning, pagination, error formats), see general/api-design.md. For service-to-service communication and mTLS via sidecar proxies, see general/service-mesh.md. For deployment strategies, see general/deployment.md.

Checklist

Why This Matters

The API gateway is the front door to every service in the architecture. Every external request passes through it, which makes it both the most critical infrastructure component and the place where a misconfiguration does the most damage. A gateway that fails to enforce authentication exposes every backend service. A gateway without rate limiting allows a single client to overwhelm the entire platform. A gateway with excessive transformation logic becomes an opaque bottleneck that is difficult to test and debug.

Cost is a frequently underestimated concern. Managed API gateways such as AWS API Gateway charge per request -- at $3.50 per million requests, an API handling 100 million requests per month costs $350 in gateway fees alone, before accounting for data transfer and caching. At higher volumes, self-hosted gateways such as Kong or APISIX running on existing Kubernetes infrastructure become significantly cheaper in direct costs, though they require operational investment in upgrades, monitoring, and scaling. The cost model decision should be made early because migrating between gateway platforms requires rewriting routing rules, authentication integration, and transformation logic.
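The arithmetic above can be sketched as a small calculation. This is a minimal illustration that assumes a single flat per-million price (real managed-gateway pricing is usually tiered and varies by region and API type). The function names and the self-hosted cost figure in the usage lines are hypothetical, chosen only to show the break-even idea.

```python
def managed_gateway_cost(requests_per_month: int,
                         price_per_million: float = 3.50) -> float:
    """Monthly per-request fee in USD, assuming one flat per-million price."""
    return requests_per_month / 1_000_000 * price_per_million


def break_even_requests(self_hosted_monthly_cost: float,
                        price_per_million: float = 3.50) -> float:
    """Monthly request volume at which the managed fee equals a given
    self-hosted cost (infrastructure plus operations, supplied by the caller)."""
    return self_hosted_monthly_cost / price_per_million * 1_000_000


if __name__ == "__main__":
    # The figure from the text: 100 million requests at $3.50 per million.
    print(managed_gateway_cost(100_000_000))
    # Hypothetical self-hosted cost of $1,400 per month, for illustration only.
    print(break_even_requests(1_400.0))
```

Plugging in a team's own estimate of self-hosted cost, including engineering time for upgrades and monitoring, gives a rough request volume above which the self-hosted option starts to win on direct costs.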

The boundary between API gateway, load balancer, and service mesh is the most important architectural boundary to make explicit. Without clear responsibilities, teams duplicate rate limiting in the gateway and the mesh, apply authentication inconsistently across layers, and debug routing issues across three different configuration systems. The general principle is: the API gateway handles external-facing concerns (client authentication, rate limiting, API versioning, request validation), the service mesh handles internal service-to-service concerns (mTLS, retries, circuit breaking), and the load balancer handles raw traffic distribution. When a team cannot articulate which layer handles which concern, incidents last longer and are harder to diagnose.
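One way to keep these boundaries explicit is to record them as data and check for overlaps. The sketch below is illustrative only: the `LAYER_CONCERNS` map and the `find_overlaps` helper are hypothetical names, and the concern labels simply mirror the general principle stated above.

```python
from collections import defaultdict

# Hypothetical ownership map mirroring the general principle in the text.
LAYER_CONCERNS = {
    "api_gateway": {"client_authentication", "rate_limiting",
                    "api_versioning", "request_validation"},
    "service_mesh": {"mtls", "retries", "circuit_breaking"},
    "load_balancer": {"traffic_distribution"},
}


def find_overlaps(layer_concerns: dict[str, set[str]]) -> dict[str, list[str]]:
    """Return every concern claimed by more than one layer, with its owners."""
    owners: dict[str, list[str]] = defaultdict(list)
    for layer, concerns in layer_concerns.items():
        for concern in concerns:
            owners[concern].append(layer)
    return {c: sorted(layers) for c, layers in owners.items() if len(layers) > 1}


if __name__ == "__main__":
    # With the map above, no concern has two owners.
    print(find_overlaps(LAYER_CONCERNS))
```

A check like this could run against a team's documented responsibilities during architecture review; an empty result means each concern has exactly one owning layer.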

Common Decisions (ADR Triggers)

Managed vs. self-hosted gateway -- managed gateways (AWS API Gateway, Azure APIM, Apigee) reduce operational burden but introduce per-request costs and vendor lock-in for routing configuration; self-hosted gateways (Kong, Traefik, APISIX, Envoy Gateway) offer full control and predictable costs but require operational capacity for upgrades, scaling, and high availability
Gateway authentication strategy -- gateway-terminated JWT validation vs. token passthrough to backends, API key management approach, whether to use OAuth 2.0 introspection (real-time but adds latency) or local JWT validation (fast but cannot revoke tokens until expiry)
Rate limiting architecture -- local rate limiting (per-instance counters, simple but inaccurate with multiple gateway instances) vs. distributed rate limiting (Redis or similar shared counter, accurate but adds latency and external dependency); fixed window vs. sliding window vs. token bucket algorithm
API versioning strategy at the gateway -- URL path versioning with separate route rules per version vs. header-based versioning with content negotiation; how to sunset old versions and redirect clients
Gateway per team vs. shared gateway -- single shared gateway simplifies operations but creates a bottleneck for configuration changes; per-team or per-domain gateways give autonomy but multiply operational overhead and complicate cross-cutting concerns like authentication
BFF pattern adoption -- dedicated gateway configuration per client type vs. single unified API; BFF reduces payload sizes and call counts for mobile clients but multiplies the number of API surfaces to maintain
Kubernetes Ingress vs. Gateway API -- legacy Ingress with controller-specific annotations vs. Gateway API standard with richer native routing; Gateway API is the future standard but may have less mature tooling for specific controllers
GraphQL gateway adoption -- Apollo Federation vs. schema stitching vs. single GraphQL server; federation gives service teams ownership but adds complexity in schema composition, error handling, and query planning across subgraphs
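To make the rate limiting decision above more concrete, here is a minimal sketch of the token bucket algorithm used as a local (per-instance) limiter. The class name, parameters, and injectable `clock` are assumptions for illustration and are not taken from any particular gateway. Because the counter lives in one process, several gateway instances would each allow the full rate, which is the inaccuracy the local option trades for simplicity.

```python
import time
from typing import Callable


class TokenBucket:
    """Per-instance token bucket: allows bursts up to `capacity` and a
    sustained rate of `refill_rate` tokens per second."""

    def __init__(self, capacity: float, refill_rate: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.capacity = capacity
        self.refill_rate = refill_rate
        self.clock = clock            # injectable so tests can control time
        self.tokens = capacity        # start with a full bucket
        self.last_refill = clock()

    def allow(self, cost: float = 1.0) -> bool:
        """Refill for the elapsed time, then spend `cost` tokens if available."""
        now = self.clock()
        elapsed = now - self.last_refill
        self.tokens = min(self.capacity, self.tokens + elapsed * self.refill_rate)
        self.last_refill = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False


if __name__ == "__main__":
    bucket = TokenBucket(capacity=5, refill_rate=1.0)
    # Seven back-to-back requests against a burst capacity of five.
    print([bucket.allow() for _ in range(7)])
```

A distributed variant would keep `tokens` and `last_refill` in a shared store such as Redis and update them atomically, which gives accurate limits across instances at the cost of an extra network round trip per request.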
