HEAD:          DRBD state machine and recovery strategies
AUTHOR:        Lars Ellenberg
LAST_MODIFIED: 2004-09-27T08:03:49

definitions

Event
In this context, events are either administrative requests or failures. Administrative requests may be refused by DRBD and are therefore allowed to fail. Failures (and self-healing, think of a network hiccup) are "force majeure", so they must be handled. Though this probably implies that we need some emergency catch-all handler, we prefer to have a specific recovery strategy for each possible failure case.
DRBD
the driver
drbd
one device
CM
cluster manager/operator
active
drbd in writable state
coordinator
drbd coordinator for two (or more, once implemented) active nodes.
mirror
drbd in "slave" state, only mirroring/serving requests from the peer. I don't like the term "slave", and it is not strictly "passive" either...
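
As a rough illustration of the Event definition above, the following Python sketch separates administrative requests (which may be refused) from failures (which must always be handled, by a specific strategy where one exists and by an emergency catch-all otherwise). Everything in it is an assumption made for illustration: the names `Role`, `handle_admin_request`, `handle_failure` and `RECOVERY`, the refusal rule, and the `disk_error` failure are hypothetical and are not taken from the DRBD driver.

```python
from enum import Enum, auto


class Role(Enum):
    ACTIVE = auto()  # drbd in writable state
    MIRROR = auto()  # drbd only mirroring/serving requests from the peer


def handle_admin_request(requested_role, current_role):
    """Administrative requests are allowed to fail.

    Hypothetical refusal rule, for illustration only: a request is
    refused when the device already has the requested role.
    Returns (accepted, resulting_role).
    """
    if requested_role is current_role:
        return False, current_role
    return True, requested_role


def recover_communication_fails(role):
    # Placeholder: stands in for a real per-failure recovery strategy.
    return f"specific recovery for communication_fails on {role.name.lower()} node"


def emergency_catch_all(failure, role):
    # Last resort for failures that have no specific strategy.
    return f"emergency handler for {failure} on {role.name.lower()} node"


# One specific recovery strategy per known failure case.
RECOVERY = {"communication_fails": recover_communication_fails}


def handle_failure(failure, role):
    """Failures cannot be refused: something always handles them."""
    handler = RECOVERY.get(failure)
    if handler is None:
        return emergency_catch_all(failure, role)
    return handler(role)


if __name__ == "__main__":
    print(handle_admin_request(Role.ACTIVE, Role.MIRROR))
    print(handle_failure("communication_fails", Role.MIRROR))
    print(handle_failure("disk_error", Role.ACTIVE))  # hypothetical failure name
```

The lookup table keeps the preferred case (one strategy per failure) explicit, while the fallback ensures that an unanticipated failure is never silently dropped.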

init DRBD driver

device creation

configuration

Internal DRBD states

simplified states

recovery

communication fails

recovery actions on mirror node

recovery actions on active node

recovery actions of CM


... To be continued ...



References

[1] http://www.linux-ha.org/SplitBrain


This information provided courtesy of the Linux-HA project at http://linux-ha.org/