PostgreSQL¶

Scope¶

This file covers PostgreSQL architecture decisions: version selection, streaming and logical replication, HA solutions (Patroni, repmgr), connection pooling (PgBouncer, Pgpool-II), vacuum tuning and autovacuum configuration, memory tuning (shared_buffers, work_mem, effective_cache_size, maintenance_work_mem), pgvector for AI/ML vector search, extensions ecosystem, migration from Oracle and SQL Server, and managed service options (Amazon RDS/Aurora PostgreSQL, Google Cloud SQL/AlloyDB, Azure Database for PostgreSQL). For general database strategy (engine selection, replication patterns, encryption), see general/data.md. For migration methodology and cutover planning, see general/database-migration.md.

Checklist¶

Why This Matters¶

PostgreSQL has become the default relational database for new projects and the most common migration target for organizations leaving Oracle or SQL Server, but its "easy to start, hard to master" nature creates operational risks that surface only at scale. A PostgreSQL instance with default settings will run acceptably for small workloads, but default autovacuum settings are insufficient for tables with millions of rows, default connection limits assume a handful of users, and default memory settings leave most available RAM unused. The difference between a well-tuned PostgreSQL deployment and a default one can be orders of magnitude in query performance and reliability.

The most dangerous PostgreSQL failure mode is transaction ID wraparound, which occurs when autovacuum cannot keep up with the rate of dead tuple accumulation. When the database approaches the 2-billion transaction limit without successful vacuuming, PostgreSQL refuses all write operations to prevent data corruption — an intentional safety mechanism that presents as a complete outage. This is entirely preventable with proper autovacuum configuration and monitoring, but organizations that treat PostgreSQL as "install and forget" routinely encounter this in production. Similarly, connection pooling is not optional for production PostgreSQL — the process-per-connection model means that even 500 idle connections consume significant memory and OS resources, and connection storms from application restarts or autoscaling events can saturate the server.

Common Decisions (ADR Triggers)¶

ADR: HA Solution Selection¶

Context: PostgreSQL requires an external tool to manage automatic failover, as it does not include built-in cluster management.

Options:

Criterion	Patroni	repmgr	Cloud Managed (RDS/Cloud SQL)
Automatic Failover	Yes (via DCS consensus)	Yes (with repmgrd daemon)	Yes (built-in)
Consensus Store	etcd, ZooKeeper, or Consul required	Optional (witness server)	Not applicable
Split-Brain Prevention	Strong (DCS-based leader lock)	Moderate (depends on configuration)	Strong (provider-managed)
Operational Complexity	Moderate (manage DCS cluster)	Lower (fewer moving parts)	Lowest (fully managed)
Customization	High (extensive configuration)	Moderate	Limited (provider-controlled)

Decision drivers: Operational team expertise, existing infrastructure (etcd/Consul already deployed for other services), split-brain risk tolerance, and whether the deployment is self-hosted or cloud-managed.

ADR: Connection Pooling Strategy¶

Context: PostgreSQL's process-per-connection model requires a connection pooler to support high-concurrency workloads.

Options: - PgBouncer (transaction mode): Lightweight, single-purpose pooler. Multiplexes hundreds of application connections onto a small number of database connections. Transaction mode releases connections between transactions, maximizing reuse. Does not support session-level features like prepared statements (in transaction mode) or LISTEN/NOTIFY. - PgBouncer (session mode): Maintains a 1:1 mapping between client and server sessions. Supports all PostgreSQL features. Provides connection queueing but minimal multiplexing. Useful when prepared statements or session variables are required. - Pgpool-II: Provides both connection pooling and query load balancing across read replicas. Supports automatic read/write splitting. Higher resource consumption and configuration complexity than PgBouncer. Better suited when a single tool must handle both pooling and routing. - Application-side pooling: Built-in pooling in frameworks (e.g., SQLAlchemy pool, HikariCP). No additional infrastructure. Limits pool per application instance, not globally. Does not prevent connection storms during application scaling events.

Decision drivers: Concurrency requirements, need for read/write splitting, use of session-level features (prepared statements, temp tables), and operational preference for infrastructure-level vs. application-level pooling.

ADR: Self-Hosted vs. Managed PostgreSQL¶

Context: The organization must decide between managing PostgreSQL on VMs/containers or using a cloud-managed service.

Options: - Self-hosted (VM or Kubernetes): Full control over version, extensions, configuration, and upgrade timing. Requires DBA expertise for patching, backup, HA configuration, and performance tuning. Can use any PostgreSQL extension. Lowest per-instance cost but highest operational cost. - Amazon RDS / Aurora PostgreSQL: Managed HA, automated backups, and push-button read replicas. Aurora offers storage auto-scaling and faster replication. Limited extension set and parameter group restrictions. Aurora Serverless v2 for variable workloads. - Google Cloud SQL / AlloyDB: Cloud SQL for standard managed PostgreSQL. AlloyDB for analytics-heavy workloads with columnar engine and PostgreSQL compatibility. AlloyDB provides better performance for mixed OLTP/OLAP but is GCP-only. - Azure Database for PostgreSQL Flexible Server: Managed service with zone-redundant HA. Supports most extensions. Integrates with Azure AD for authentication. Comparable to RDS in feature set.

Decision drivers: Extension requirements, DBA availability, cloud provider commitment, need for advanced features (custom WAL handling, specific extensions), and total cost of ownership including operational overhead.

Reference Links¶

PostgreSQL Documentation -- configuration, replication, performance tuning, and extension development
PostgreSQL Wiki: Tuning -- shared_buffers, work_mem, effective_cache_size, and autovacuum configuration
Patroni Documentation -- HA cluster management with automatic leader election and failover