My front-end machine has this:
<Files ~ "\.(cgi|pl)">
Backhand byAge
Backhand byHostname (back1|back2|back3|back4|back5|back6|back7)
Backhand byRandom
Backhand byLogWindow
Backhand byLoad
</Files>
Everything works just fine until one of the "back" machines bails or locks
up. Once a machine crashes and is unavailable (i.e. can't even ping the
IP/hostname because it's dead at the network level), the front-end machine
just doesn't respond anymore.
I just verified that this is true by taking down eth0 on "back4" -- and
the "Age" column on ALL of the servers in the backhand-status table
started growing higher and higher until finally, the page wouldn't load up
anymore. When I re-initialized eth0 on "back4", the entire cluster came
back up. Just FYI, it locks up the front end so badly that even normal web
requests for static "non-backhand" content don't come up.
What I'd really like is that if a server becomes unavailable, it simply
falls off the available list and everything continues as normal. When I
kill the Apache process on any of the backend boxes, it behaves as it
should -- the Age value increases on the single server, and eventually,
the status color turns red on the backhand-status table.
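To illustrate what I mean by "falls off the available list", here is a
rough Python sketch of the behavior I'm after. This is not mod_backhand
code, and I don't know how mod_backhand does this internally. The names
probe() and available(), port 80, and the 2-second timeout are all
assumptions made up for the example. The point is only that a dead host
costs one bounded timeout instead of blocking everything.

```python
import socket


def probe(host, port=80, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout seconds."""
    try:
        # create_connection gives up after `timeout` seconds, so a host that
        # is dead at the network level cannot block the caller indefinitely.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False


def available(hosts, port=80, timeout=2.0):
    """Keep only the hosts that currently accept connections."""
    return [h for h in hosts if probe(h, port, timeout)]


if __name__ == "__main__":
    # Hostnames taken from the byHostname line in my config above.
    backends = ["back%d" % i for i in range(1, 8)]
    print(available(backends))
```

With "back4" unplugged, I would expect something like this to return the
other hosts after a short delay rather than hanging.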
Thanks for any tips.
--Neil