Mailing List Archive

SLE11 SP3: attrd[13911]: error: plugin_dispatch: Receiving message body failed: (2) Library error: Success (0)
Hi!

I feel the current cluster stack for SLES11 SP3 has several problems. I've been fighting for a day to get my test cluster up again after installing the latest updates. I still cannot find out what's going on, but I suspect there are too many bugs in the software (again). For example, I just saw these messages:

Jan 15 08:39:50 o4 attrd[13911]: error: plugin_dispatch: Receiving message body failed: (2) Library error: Success (0)
Jan 15 08:39:50 o4 attrd[13911]: crit: attrd_cs_destroy: Lost connection to Corosync service!
Jan 15 08:39:50 o4 cib[13908]: error: plugin_dispatch: Receiving message body failed: (2) Library error: Success (0)
Jan 15 08:39:50 o4 cib[13908]: error: cib_cs_destroy: Corosync connection lost! Exiting.
Jan 15 08:39:50 o4 attrd[13911]: notice: main: Exiting...
Jan 15 08:39:50 o4 attrd[13911]: notice: main: Disconnecting client 0x611eb0, pid=13913...
Jan 15 08:39:50 o4 stonith-ng[13909]: error: plugin_dispatch: Receiving message body failed: (2) Library error: Success (0)
Jan 15 08:39:50 o4 stonith-ng[13909]: error: stonith_peer_cs_destroy: Corosync connection terminated
Jan 15 08:39:50 o4 attrd[13911]: error: attrd_cib_connection_destroy: Connection to the CIB terminated...
Jan 15 08:39:50 o4 crmd[13913]: error: plugin_dispatch: Receiving message body failed: (2) Library error: Success (0)
Jan 15 08:39:50 o4 crmd[13913]: error: crmd_cs_destroy: connection terminated

I'm afraid nobody except the developers can make any sense of these error messages, beyond the fact that there is an error.

Here's the list of software installed recently:
drbd-pacemaker-8.4.4-0.20.2 Mon Jan 13 12:58:49 2014
drbd-8.4.4-0.20.2 Mon Jan 13 12:58:49 2014
sleha-bootstrap-0.3-0.26.1 Mon Jan 13 12:58:48 2014
pacemaker-mgmt-2.1.2-0.11.4 Mon Jan 13 12:58:48 2014
crmsh-1.2.6-0.25.4 Mon Jan 13 12:58:48 2014
pacemaker-1.1.10-0.9.28 Mon Jan 13 12:58:47 2014
pacemaker-mgmt-client-2.1.2-0.11.4 Mon Jan 13 12:58:43 2014
cluster-glue-1.0.11-0.19.4 Mon Jan 13 12:58:43 2014
sbd-1.2.1-0.7.22 Mon Jan 13 12:58:42 2014
libpacemaker3-1.1.10-0.9.28 Mon Jan 13 12:58:41 2014
resource-agents-3.9.5-0.32.22 Mon Jan 13 12:58:38 2014
openais-1.1.4-5.17.5 Mon Jan 13 11:12:19 2014
libglue2-1.0.11-0.19.4 Mon Jan 13 11:12:18 2014
drbd-xen-8.4.4-0.20.2 Mon Jan 13 11:12:18 2014
drbd-udev-8.4.4-0.20.2 Mon Jan 13 11:12:18 2014
drbd-heartbeat-8.4.4-0.20.2 Mon Jan 13 11:12:18 2014
drbd-bash-completion-8.4.4-0.20.2 Mon Jan 13 11:12:18 2014
libqb0-0.16.0-0.7.4 Mon Jan 13 11:12:17 2014
libopenais3-1.1.4-5.17.5 Mon Jan 13 11:12:17 2014
drbd-utils-8.4.4-0.20.2 Mon Jan 13 11:12:17 2014
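
The timestamps in the list above fall into two separate update runs: the messaging layer (openais, libqb) at about 11:12 and Pacemaker with its tools at about 12:58. A minimal sketch of how such a list could be grouped into runs, assuming each line has the form "name-version-release <date>" as shown here. The function name install_batches and the 10-minute gap are illustrative choices, not part of any tool:

```python
from datetime import datetime

def install_batches(lines, gap_seconds=600):
    """Group 'name-version-release <date>' lines into update runs.

    Two packages belong to the same run when their install times are at
    most gap_seconds apart (the 600 s default is an arbitrary assumption).
    Returns a list of runs, oldest first, each a list of package names.
    """
    entries = []
    for line in lines:
        pkg, stamp = line.split(None, 1)  # package name, then the date
        when = datetime.strptime(stamp.strip(), "%a %b %d %H:%M:%S %Y")
        entries.append((when, pkg))
    entries.sort()

    batches = []
    last_time = None
    for when, pkg in entries:
        if last_time is not None and (when - last_time).total_seconds() <= gap_seconds:
            batches[-1].append(pkg)
        else:
            batches.append([pkg])
        last_time = when
    return batches

# A few lines copied from the list in the post.
SAMPLE = [
    "pacemaker-1.1.10-0.9.28 Mon Jan 13 12:58:47 2014",
    "sbd-1.2.1-0.7.22 Mon Jan 13 12:58:42 2014",
    "openais-1.1.4-5.17.5 Mon Jan 13 11:12:19 2014",
    "libqb0-0.16.0-0.7.4 Mon Jan 13 11:12:17 2014",
]

if __name__ == "__main__":
    for number, batch in enumerate(install_batches(SAMPLE), start=1):
        print("run", number, ":", ", ".join(batch))
```

The date parsing relies on English day and month names, so it assumes the list was produced in a C or English locale.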

Regards,
Ulrich


_______________________________________________
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Re: SLE11 SP3: attrd[13911]: error: plugin_dispatch: Receiving message body failed: (2) Library error: Success (0)
Looks like corosync is dying underneath pacemaker.
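
One way to check this reading against the quoted log is to see which daemons reported a lost Corosync connection, and when. The following is a rough sketch, not a Pacemaker tool: the line pattern and the "_cs_destroy" suffix are inferred only from the log lines in this thread, and the name lost_connections is made up for illustration.

```python
import re
from collections import defaultdict

# Matches lines such as:
# "Jan 15 08:39:50 o4 attrd[13911]: crit: attrd_cs_destroy: Lost connection ..."
# The layout is assumed from the log excerpt in this thread.
LINE_RE = re.compile(
    r"^(?P<ts>\w{3}\s+\d+\s+\d\d:\d\d:\d\d)\s+(?P<host>\S+)\s+"
    r"(?P<daemon>[\w-]+)\[(?P<pid>\d+)\]:\s+(?P<level>\w+):\s+"
    r"(?P<func>\w+):\s+(?P<msg>.*)$"
)

def lost_connections(lines):
    """Map each timestamp to the sorted daemons that logged a *_cs_destroy call."""
    hits = defaultdict(set)
    for line in lines:
        m = LINE_RE.match(line.strip())
        if m and m.group("func").endswith("_cs_destroy"):
            hits[m.group("ts")].add(m.group("daemon"))
    return {ts: sorted(daemons) for ts, daemons in hits.items()}

# Lines copied from the log excerpt in the original post.
SAMPLE = [
    "Jan 15 08:39:50 o4 attrd[13911]: error: plugin_dispatch: Receiving message body failed: (2) Library error: Success (0)",
    "Jan 15 08:39:50 o4 attrd[13911]: crit: attrd_cs_destroy: Lost connection to Corosync service!",
    "Jan 15 08:39:50 o4 cib[13908]: error: cib_cs_destroy: Corosync connection lost! Exiting.",
    "Jan 15 08:39:50 o4 attrd[13911]: notice: main: Exiting...",
    "Jan 15 08:39:50 o4 stonith-ng[13909]: error: stonith_peer_cs_destroy: Corosync connection terminated",
    "Jan 15 08:39:50 o4 crmd[13913]: error: crmd_cs_destroy: connection terminated",
]

if __name__ == "__main__":
    for ts, daemons in lost_connections(SAMPLE).items():
        print(ts, "lost Corosync connection:", ", ".join(daemons))
```

On the sample lines, all four daemons (attrd, cib, crmd, stonith-ng) report the loss within the same second, which fits a failure of the shared Corosync process rather than of any one Pacemaker daemon. It does not say why Corosync went away; that would need Corosync's own log output from before 08:39:50.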

On 15 Jan 2014, at 6:49 pm, Ulrich Windl <Ulrich.Windl@rz.uni-regensburg.de> wrote:

> Hi!
>
> I feel the current cluster stack for SLES11 SP3 has several problems. I've been fighting for a day to get my test cluster up again after installing the latest updates. I still cannot find out what's going on, but I suspect there are too many bugs in the software (again). For example, I just saw these messages:
>
> Jan 15 08:39:50 o4 attrd[13911]: error: plugin_dispatch: Receiving message body failed: (2) Library error: Success (0)
> Jan 15 08:39:50 o4 attrd[13911]: crit: attrd_cs_destroy: Lost connection to Corosync service!
> Jan 15 08:39:50 o4 cib[13908]: error: plugin_dispatch: Receiving message body failed: (2) Library error: Success (0)
> Jan 15 08:39:50 o4 cib[13908]: error: cib_cs_destroy: Corosync connection lost! Exiting.
> Jan 15 08:39:50 o4 attrd[13911]: notice: main: Exiting...
> Jan 15 08:39:50 o4 attrd[13911]: notice: main: Disconnecting client 0x611eb0, pid=13913...
> Jan 15 08:39:50 o4 stonith-ng[13909]: error: plugin_dispatch: Receiving message body failed: (2) Library error: Success (0)
> Jan 15 08:39:50 o4 stonith-ng[13909]: error: stonith_peer_cs_destroy: Corosync connection terminated
> Jan 15 08:39:50 o4 attrd[13911]: error: attrd_cib_connection_destroy: Connection to the CIB terminated...
> Jan 15 08:39:50 o4 crmd[13913]: error: plugin_dispatch: Receiving message body failed: (2) Library error: Success (0)
> Jan 15 08:39:50 o4 crmd[13913]: error: crmd_cs_destroy: connection terminated
>
> I'm afraid nobody except the developers can make any sense of these error messages, beyond the fact that there is an error.

Seriously? "Lost connection to Corosync service!" and "Corosync connection lost! Exiting." are too cryptic for you?


Re: SLE11 SP3: attrd[13911]: error: plugin_dispatch: Receiving message body failed: (2) Library error: Success (0)
On 2014-01-15T08:49:55, Ulrich Windl <Ulrich.Windl@rz.uni-regensburg.de> wrote:

> I feel the current cluster stack for SLES11 SP3 has several problems. I've been fighting for a day to get my test cluster up again after installing the latest updates. I still cannot find out what's going on, but I suspect there are too many bugs in the software (again). For example, I just saw these messages:

Please raise your issues with support. We cannot provide support for
enterprise software via upstream mailing lists.

Also, your scenario description doesn't include enough context to debug
what's going on and why.


Regards,
Lars

--
Architect Storage/HA
SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB 21284 (AG Nürnberg)
"Experience is the name everyone gives to their mistakes." -- Oscar Wilde
