Hi,
I'm trying to run a couple of ha-clusters using drbd to keep disk content
synchonized; requirements sould be fairly common:
- 2.2.x kernel
- most stable drbd version
- a journaling filesystem to use on top of drbd
Is there a general consensus on what kernel version, patches (Andreas VM
patches?), drbd version and journaling filesystem should be least likely to
curl up and die?
So far, I've seen a nice collection of
* drbd loosing the connection in case of higher system/disk activity
* heartbeat thinking the other side was down and triggering an erroneous
takeover
* kernel crashes with infamous "try to free pages" messages
* ext3 exceptions (ext3 thinks write ordering wasn't observed). this mostly
happened near the moment drbd lost its connection
* system lockups when running large data copies onto ext3 on drbd (system
CPU goes to 100%, cp in "D" state can no longer be killed. reset button
time.)
Currently I'd expect drbd 5.7 + late 2.218pre kernel + Andreas VM_global_7
patches + reiserfs to be as near to a stable system as I can get. I'm still
seeing drbd disconnects/hangs however; especialy on systems with fast
harddisks and slow (10Mbit) network interconnect.
Grateful for any configuration hints,
Martin
"you have moved your mouse, please reboot to make this change take effect"
--------------------------------------------------
Martin Bene vox: +43-316-813824
simon media fax: +43-316-813824-6
Nikolaiplatz 4 e-mail: mb@example.com
8020 Graz, Austria
--------------------------------------------------
finger mb@example.com for PGP public key
I'm trying to run a couple of ha-clusters using drbd to keep disk content
synchonized; requirements sould be fairly common:
- 2.2.x kernel
- most stable drbd version
- a journaling filesystem to use on top of drbd
Is there a general consensus on what kernel version, patches (Andreas VM
patches?), drbd version and journaling filesystem should be least likely to
curl up and die?
So far, I've seen a nice collection of
* drbd loosing the connection in case of higher system/disk activity
* heartbeat thinking the other side was down and triggering an erroneous
takeover
* kernel crashes with infamous "try to free pages" messages
* ext3 exceptions (ext3 thinks write ordering wasn't observed). this mostly
happened near the moment drbd lost its connection
* system lockups when running large data copies onto ext3 on drbd (system
CPU goes to 100%, cp in "D" state can no longer be killed. reset button
time.)
Currently I'd expect drbd 5.7 + late 2.218pre kernel + Andreas VM_global_7
patches + reiserfs to be as near to a stable system as I can get. I'm still
seeing drbd disconnects/hangs however; especialy on systems with fast
harddisks and slow (10Mbit) network interconnect.
Grateful for any configuration hints,
Martin
"you have moved your mouse, please reboot to make this change take effect"
--------------------------------------------------
Martin Bene vox: +43-316-813824
simon media fax: +43-316-813824-6
Nikolaiplatz 4 e-mail: mb@example.com
8020 Graz, Austria
--------------------------------------------------
finger mb@example.com for PGP public key