Learning path
HA & Failover On-Call
Detect a crash, promote a replica, avoid split-brain, validate after failover.
SRE · Advanced · 1 course · 10 simulations
The path
Courses, then incidents
Work through the course first, then practise the incidents. Each step links to its page.
- Course: Streaming Replication and Failover
- ha-failover: Primary crash detection
- ha-failover: Manual replica promotion
- ha-failover: Failed failover due to lag
- ha-failover: Split-brain risk
- ha-failover: Old primary returns
- ha-failover: Read/write endpoint confusion
- ha-failover: Failover with slot cleanup
- ha-failover: Timeline divergence detected
- ha-failover: Backup before rejoin
- ha-failover: Post-failover validation