fivenines

Practice / target 80%

Failover Controller Practice

01

One monitor cannot reach the primary, but two others can. What state should the controller avoid entering too early?

intro

Scenario

Three monitor processes watch one primary; a network blip makes the primary unreachable from exactly one of them.

02

Why should replication offset influence replica promotion?

intro

Scenario

A failover is starting and the controller must pick which of several replicas to promote.

03

A promoted replica is missing the last acknowledged write. Which replication property made this possible?

applied

Scenario

After failover, a client reports that a write the old primary acknowledged is missing from the new primary.

04

How do epochs reduce the risk of stale controllers undoing a newer topology?

applied

Scenario

A controller that was network-partitioned during a failover comes back and tries to push its old view of the topology.

05

What should happen when the old primary returns after a successful failover?

advanced

Scenario

The failed primary finishes rebooting an hour after a replica was promoted in its place.

Ready to build

Design the system.

Put the learned theory into real-life practice and get architectural feedback.