Holger Kiehl wrote:
>
> On Tue, 1 Aug 2000, Alan Robertson wrote:
>
> > Holger Kiehl wrote:
> > >
> > > Hello
> > >
> > > When pulling the ethernet cable off the active node, the secondary will
> > > not take over. I have heartbeat 0.4.8 and nice_failback is set. There
> > > are two network cards but I pulled the cable of both cards but it does
> > > not switch over.
> > >
> > > What do I have to do that this works, ie when the active node is no longer
> > > reachable because, say the switch is broken, that the secondary node will
> > > take over?
> >
> So this must work and there is no need for mon?
>
> I think I have setup everything correct. But could someone please check it:
>
> There are two nodes yoda (192.168.124.126) and presto (192.168.124.125)
> which both have the same haresources file:
>
> yoda 192.168.124.127
>
> With only one tab inside!
>
> The ha.cf of yoda looks as follows:
>
> debugfile /var/log/ha-debug
> logfile /var/log/ha-log
> keepalive 2
> deadtime 10
> serial /dev/ttyS1
> baud 19200
> udpport 694
> udp eth0
> node yoda
> node presto.dwd.de
>
> And that of presto:
>
> debugfile /var/log/ha-debug
> logfile /var/log/ha-log
> keepalive 2
> deadtime 10
> serial /dev/ttyS1
> baud 19200
> udpport 694
> udp eth0
> node presto.dwd.de
> node yoda
Ok, looks good, just one thing as mentioned in the sample ha.cf file:
The host name in the node statement has to match the output of 'uname
-n'
>
> I did check the serial line by catting some data from one node to the other.
> So the serial line is working. But when I pull the ethernet cable from
>
> The last line is after I plugged in the ethernet cable. 192.168.124.127
> will not answer to any pings. I did this several times also waiting longer,
> always with the same result. The ha-log of presto looks nearly the same:
And to see if I understand, what you are trying to achieve:
You have setup a ha-cluster with two nodes, connected by two different
ethernet cable and additionally with a serial connection. You are now
trying to simulate a connectivity loss of one node by pulling the
primary network cable. You even pulled both ethernet cables. And you are
expecting that the other node would notice that the primary node is no
longer reachable by the clients and therefor do a fail over - right?
If my assumption is right, than this won't work, as stated in my
previous post, because the nodes can still see the heartbeat via the
serial connection and because heartbeat does not test the reachability
of the service IP address. This must be done with mon or any other
similar software.
juri
--
juri.haberland@innominate.de
system engineer innominate AG
clustering & security networking people
phone: +49-30-308806-45 fax: -77
http://innominate.de