Hi!
I think the current cluster stack for SLES11 SP3 has several problems. I have been fighting for a day to get my test cluster up again after installing the latest updates. I still cannot find out what is going on, but I suspect there are too many bugs in the software (again). For example, I just saw these messages:
Jan 15 08:39:50 o4 attrd[13911]: error: plugin_dispatch: Receiving message body failed: (2) Library error: Success (0)
Jan 15 08:39:50 o4 attrd[13911]: crit: attrd_cs_destroy: Lost connection to Corosync service!
Jan 15 08:39:50 o4 cib[13908]: error: plugin_dispatch: Receiving message body failed: (2) Library error: Success (0)
Jan 15 08:39:50 o4 cib[13908]: error: cib_cs_destroy: Corosync connection lost! Exiting.
Jan 15 08:39:50 o4 attrd[13911]: notice: main: Exiting...
Jan 15 08:39:50 o4 attrd[13911]: notice: main: Disconnecting client 0x611eb0, pid=13913...
Jan 15 08:39:50 o4 stonith-ng[13909]: error: plugin_dispatch: Receiving message body failed: (2) Library error: Success (0)
Jan 15 08:39:50 o4 stonith-ng[13909]: error: stonith_peer_cs_destroy: Corosync connection terminated
Jan 15 08:39:50 o4 attrd[13911]: error: attrd_cib_connection_destroy: Connection to the CIB terminated...
Jan 15 08:39:50 o4 crmd[13913]: error: plugin_dispatch: Receiving message body failed: (2) Library error: Success (0)
Jan 15 08:39:50 o4 crmd[13913]: error: crmd_cs_destroy: connection terminated
I'm afraid nobody except the developers can make any sense of these error messages, beyond the fact that there is an error.
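To make the sequence a bit easier to read, here is a minimal sketch that groups such syslog lines by daemon. It assumes the line layout shown above (timestamp, host, daemon[pid], level, function, message); `group_by_daemon` and `LINE_RE` are hypothetical names used for illustration and are not part of Pacemaker or Corosync.

```python
import re
from collections import OrderedDict

# Assumed layout: "Jan 15 08:39:50 o4 attrd[13911]: error: func: message"
LINE_RE = re.compile(
    r"^(?P<ts>\w{3}\s+\d+\s[\d:]{8})\s(?P<host>\S+)\s"
    r"(?P<daemon>[\w-]+)\[(?P<pid>\d+)\]:\s+"
    r"(?P<level>\w+):\s(?P<func>\w+):\s(?P<msg>.*)$"
)

def group_by_daemon(lines):
    """Return an ordered mapping daemon -> list of (level, func, msg)."""
    groups = OrderedDict()
    for line in lines:
        m = LINE_RE.match(line.strip())
        if m is None:
            continue  # skip lines that do not follow the assumed layout
        entry = (m.group("level"), m.group("func"), m.group("msg"))
        groups.setdefault(m.group("daemon"), []).append(entry)
    return groups

# A few lines copied from the log excerpt above.
SAMPLE = [
    "Jan 15 08:39:50 o4 attrd[13911]: error: plugin_dispatch: Receiving message body failed: (2) Library error: Success (0)",
    "Jan 15 08:39:50 o4 attrd[13911]: crit: attrd_cs_destroy: Lost connection to Corosync service!",
    "Jan 15 08:39:50 o4 cib[13908]: error: plugin_dispatch: Receiving message body failed: (2) Library error: Success (0)",
    "Jan 15 08:39:50 o4 stonith-ng[13909]: error: plugin_dispatch: Receiving message body failed: (2) Library error: Success (0)",
]

if __name__ == "__main__":
    for daemon, entries in group_by_daemon(SAMPLE).items():
        print(daemon)
        for level, func, msg in entries:
            print("  %-6s %s: %s" % (level, func, msg))
```

Grouped this way, the excerpt shows every daemon reporting the same `plugin_dispatch` failure within the same second, which only tells me that they all lost their Corosync connection at once, not why.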
Here's the list of recently installed software:
drbd-pacemaker-8.4.4-0.20.2 Mon Jan 13 12:58:49 2014
drbd-8.4.4-0.20.2 Mon Jan 13 12:58:49 2014
sleha-bootstrap-0.3-0.26.1 Mon Jan 13 12:58:48 2014
pacemaker-mgmt-2.1.2-0.11.4 Mon Jan 13 12:58:48 2014
crmsh-1.2.6-0.25.4 Mon Jan 13 12:58:48 2014
pacemaker-1.1.10-0.9.28 Mon Jan 13 12:58:47 2014
pacemaker-mgmt-client-2.1.2-0.11.4 Mon Jan 13 12:58:43 2014
cluster-glue-1.0.11-0.19.4 Mon Jan 13 12:58:43 2014
sbd-1.2.1-0.7.22 Mon Jan 13 12:58:42 2014
libpacemaker3-1.1.10-0.9.28 Mon Jan 13 12:58:41 2014
resource-agents-3.9.5-0.32.22 Mon Jan 13 12:58:38 2014
openais-1.1.4-5.17.5 Mon Jan 13 11:12:19 2014
libglue2-1.0.11-0.19.4 Mon Jan 13 11:12:18 2014
drbd-xen-8.4.4-0.20.2 Mon Jan 13 11:12:18 2014
drbd-udev-8.4.4-0.20.2 Mon Jan 13 11:12:18 2014
drbd-heartbeat-8.4.4-0.20.2 Mon Jan 13 11:12:18 2014
drbd-bash-completion-8.4.4-0.20.2 Mon Jan 13 11:12:18 2014
libqb0-0.16.0-0.7.4 Mon Jan 13 11:12:17 2014
libopenais3-1.1.4-5.17.5 Mon Jan 13 11:12:17 2014
drbd-utils-8.4.4-0.20.2 Mon Jan 13 11:12:17 2014
Regards,
Ulrich
_______________________________________________
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems