You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Since Postgres 17, the pg_replication_slots returns an "inactive_since" field for replication slots where active = FALSE. It would be super helpful to monitor for streaming replicas that go offline.
Description
^^
Importance
really want
Value proposition
(1) This is very useful information to monitor for DB admins. A replica going offline or losing connection with the primary can cause downtime, stale data, table bloat on the primary, among other things.
Proposed implementation
A simple chart or monitor that just shows the number of expected active replica slots: select count(*) from pg_replication_slots; vs the number of actual active slots select count(*) from pg_replication_slots where active=TRUE; would be great. I think it would work exactly like your monitor for RAID configurations (Storage -> Management -> md.health, and md.disks). Where it shows the number of faulty (inactive replication slots), and the total number (total replication slots).
Displaying the new inactive_since (timestamp or in seconds/mins) field for inactive replicas would also be neat if possible.
The text was updated successfully, but these errors were encountered:
Problem
Hi!
Since Postgres 17, the pg_replication_slots returns an "inactive_since" field for replication slots where active = FALSE. It would be super helpful to monitor for streaming replicas that go offline.
Description
^^
Importance
really want
Value proposition
(1) This is very useful information to monitor for DB admins. A replica going offline or losing connection with the primary can cause downtime, stale data, table bloat on the primary, among other things.
Proposed implementation
A simple chart or monitor that just shows the number of expected active replica slots:
select count(*) from pg_replication_slots;
vs the number of actual active slotsselect count(*) from pg_replication_slots where active=TRUE;
would be great. I think it would work exactly like your monitor for RAID configurations (Storage -> Management -> md.health, and md.disks). Where it shows the number of faulty (inactive replication slots), and the total number (total replication slots).Displaying the new inactive_since (timestamp or in seconds/mins) field for inactive replicas would also be neat if possible.
The text was updated successfully, but these errors were encountered: