After 1.5 years of running Postgres, here are the 8 operational mistakes that cause outages. Each one describes a failure mode so you can spot the landmine before you hit it. Give it a read…
This article discusses best practices for versioned schema changes, safe column modifications, and managing replica lag across database clusters. It includes practical patterns for transactions, data batching, and rollback strategies to maintain high availability in production.
Recently, I was working on a custom observability suite where I needed to monitor long-running and blocking queries on an HA Postgres cluster deployed on Kubernetes with Patroni.