Hi all,
I am running heavy traffic on the primary and then kill the secondary
(power-off). After the primary has recognized what happened it throws these
messages -
drbd0: ack timeout detected (pc=30)!
drbd : timeout detected! (pid=3)
drbd0: Connection lost.(pc=30,uc=0)
and then dies!!!
Not pingable any more, no console input - simply dead. This happens every
now and then. Rebooting both nodes gets the system in a good condition
again.
I have mounted the disks with sync and I am continuously copying files to
the disk to generate the load
Any idea what happened? Unfortunately I am not the kernel-wizard :-/
/Wolfram
Here are the system details:
----------------------------------------
700 MHz Pentium III
512MB RAM
18GB IDE Disk
Lynuxworks BlueCat 3.0 (Kernel 2.2.12-1)
Heartbeat 0.4.9
DRBD 5.8.1 (reproduced with 6.1-pre2 as well)
eth0 (100BaseT) is used for heartbeat
eth1 (100BaseT) is used for heartbeat and drbd-sync
eth2 (100BaseT) is used for cluster-IP and client access
Here is the drbdsetup:
----------------------------------
Node1# drbdsetup /dev/nb0 show
Lower device: 22:03 (/dev/hdc3)
Disk options:
do-panic
Local address: 172.21.1.1:7788
Remote address: 172.21.2.1:7788
Wire protocol: B
Net options:
timeout = 6.0 sec
sync-rate = 3000 KB/sec
tl-size = 256
connect-int = 10 sec
ping-int = 10 sec
Node2# drbdsetup /dev/nb0 show
Lower device: 22:03 (/dev/hdc3)
Disk options:
do-panic
Local address: 172.21.2.1:7788
Remote address: 172.21.1.1:7788
Wire protocol: B
Net options:
timeout = 6.0 sec
sync-rate = 3000 KB/sec
tl-size = 256
connect-int = 10 sec
ping-int = 10 sec
=======================================================================
Wolfram Weyer FORCE COMPUTERS GmbH
Staff Engineer - Systems Engineering A Solectron Subsidiary
phone: +49 89 60814-523 Street: Prof.-Messerschmitt-Str. 1
fax: +49 89 60814-112 City: D-85579 Neubiberg/Muenchen
mailto:Wolfram.Weyer@example.com <mailto:Wolfram.Weyer@force.de>
http://www.forcecomputers.com <http://www.forcecomputers.com/>
=======================================================================
I am running heavy traffic on the primary and then kill the secondary
(power-off). After the primary has recognized what happened it throws these
messages -
drbd0: ack timeout detected (pc=30)!
drbd : timeout detected! (pid=3)
drbd0: Connection lost.(pc=30,uc=0)
and then dies!!!
Not pingable any more, no console input - simply dead. This happens every
now and then. Rebooting both nodes gets the system in a good condition
again.
I have mounted the disks with sync and I am continuously copying files to
the disk to generate the load
Any idea what happened? Unfortunately I am not the kernel-wizard :-/
/Wolfram
Here are the system details:
----------------------------------------
700 MHz Pentium III
512MB RAM
18GB IDE Disk
Lynuxworks BlueCat 3.0 (Kernel 2.2.12-1)
Heartbeat 0.4.9
DRBD 5.8.1 (reproduced with 6.1-pre2 as well)
eth0 (100BaseT) is used for heartbeat
eth1 (100BaseT) is used for heartbeat and drbd-sync
eth2 (100BaseT) is used for cluster-IP and client access
Here is the drbdsetup:
----------------------------------
Node1# drbdsetup /dev/nb0 show
Lower device: 22:03 (/dev/hdc3)
Disk options:
do-panic
Local address: 172.21.1.1:7788
Remote address: 172.21.2.1:7788
Wire protocol: B
Net options:
timeout = 6.0 sec
sync-rate = 3000 KB/sec
tl-size = 256
connect-int = 10 sec
ping-int = 10 sec
Node2# drbdsetup /dev/nb0 show
Lower device: 22:03 (/dev/hdc3)
Disk options:
do-panic
Local address: 172.21.2.1:7788
Remote address: 172.21.1.1:7788
Wire protocol: B
Net options:
timeout = 6.0 sec
sync-rate = 3000 KB/sec
tl-size = 256
connect-int = 10 sec
ping-int = 10 sec
=======================================================================
Wolfram Weyer FORCE COMPUTERS GmbH
Staff Engineer - Systems Engineering A Solectron Subsidiary
phone: +49 89 60814-523 Street: Prof.-Messerschmitt-Str. 1
fax: +49 89 60814-112 City: D-85579 Neubiberg/Muenchen
mailto:Wolfram.Weyer@example.com <mailto:Wolfram.Weyer@force.de>
http://www.forcecomputers.com <http://www.forcecomputers.com/>
=======================================================================