ç¦ç”°ã•ã‚“
ã“ã‚“ã°ã‚“ã¯ã€å±±å†…ã§ã™ã€‚
変ã‚らãªã„よã†ã§ã™ã。。。
ã¨ã‚Šã‚ãˆãšã€æ˜Žæ—¥ãらã„ã«ã€RHEL上ã§ã™ãŒã€
Heartbeat3.0.6
Pacemakerã®æœ€æ–°
組ã¿åˆã‚ã›ã§ã€åŒã˜ã‚ˆã†ãªè¨å®š(リソースã¯Dummyã€external/xen0ã¯external/sshã«ãªã‚Šã¾ã™ãŒï¼‰stonith-helperãŒå‹•ãã‹ã©ã†ã‹ã‚’確èªã—ã¦ã¿ã¾ã™ã€‚
#stonith-helperã®-x指定ã®å‡ºåŠ›ãŒç¢ºèªå‡ºæ¥ã‚‹ã¨ã€ã‚‚ã†å°‘ã—å•é¡ŒãŒçµžã‚Šã‚„ã™ã„ã®ã§ã™ãŒãƒ»ãƒ»ãƒ»
以上ã§ã™ã€‚
----- Original Message -----
>From: Masamichi Fukuda - elf-systems <masamichi_fukuda@elf-systems.com>
>To: 山内英生 <renayama19661014@ybb.ne.jp>; "linux-ha-japan@lists.sourceforge.jp" <linux-ha-japan@lists.sourceforge.jp>
>Date: 2015/3/17, Tue 21:24
>Subject: Re: [Linux-ha-jp] スプリットブレイン時ã®STONITHエラーã«ã¤ã„ã¦
>
>
>山内ã•ã‚“
>
>ã“ã‚“ã°ã‚“ã¯ã€ç¦ç”°ã§ã™ã€‚
>最新版ã®æƒ…å ±ã‚’ã‚ã‚ŠãŒã¨ã†ã”ã–ã„ã¾ã—ãŸã€‚
>
>早速インストールã—ã¦ã¿ã¾ã—ãŸã€‚
>
>起動後ã®çŠ¶æ…‹ã§ã™ã€‚
>
>failed actionsã¯å¤‰ã‚ã‚Šãªã„よã†ã§ã™ã€‚
>
>
>
># crm_mon -rfA
>Last updated: Tue Mar 17 21:03:49 2015
>Last change: Tue Mar 17 20:30:58 2015
>Stack: heartbeat
>Current DC: lbv1.beta.com (38b0f200-83ea-8633-6f37-047d36cd39c6) - parti
>tion with quorum
>Version: 1.1.12-e32080b
>2 Nodes configured
>8 Resources configured
>
>
>Online: [ lbv1.beta.com lbv2.beta.com ]
>
>Full list of resources:
>
>Â Resource Group: HAvarnish
>Â Â Â Â vip_208Â Â Â (ocf::heartbeat:IPaddr2):Â Â Â Â Â Â Started lbv1.beta.com
>    varnishd  (lsb:varnish): Started lbv1.beta.com
>Â Resource Group: grpStonith1
>Â Â Â Â Stonith1-1 (stonith:external/stonith-helper):Â Â Â Â Â Stopped
>Â Â Â Â Stonith1-2 (stonith:external/xen0):Â Â Â Â Â Â Â Stopped
>Â Resource Group: grpStonith2
>Â Â Â Â Stonith2-1 (stonith:external/stonith-helper):Â Â Â Â Â Stopped
>Â Â Â Â Stonith2-2 (stonith:external/xen0):Â Â Â Â Â Â Â Stopped
>Â Clone Set: clone_ping [ping]
>Â Â Â Â Started: [ lbv1.beta.com lbv2.beta.com ]
>
>Node Attributes:
>* Node lbv1.beta.com:
>   + default_ping_set                 : 100
>* Node lbv2.beta.com:
>   + default_ping_set                 : 100
>
>Migration summary:
>* Node lbv1.beta.com:
>Â Â Stonith2-1: migration-threshold=1 fail-count=1000000 last-failure='Tue Mar 17
>Â 21:03:39 2015'
>* Node lbv2.beta.com:
>Â Â Stonith1-1: migration-threshold=1 fail-count=1000000 last-failure='Tue Mar 17
>Â 21:03:32 2015'
>
>Failed actions:
>Â Â Â Stonith2-1_start_0 on lbv1.beta.com 'unknown error' (1): call=31, st
>atus=Error, exit-reason='none', last-rc-change='Tue Mar 17 21:03:37 2015', queue
>d=0ms, exec=1085ms
>Â Â Â Stonith1-1_start_0 on lbv2.beta.com 'unknown error' (1): call=18, st
>atus=Error, exit-reason='none', last-rc-change='Tue Mar 17 21:03:30 2015', queue
>d=0ms, exec=1061ms
>
>
>
>
>ãƒã‚°ã§ã™ã€‚
>
>
># less /var/log/ha-debug
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: info: Pacemaker support: yes
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: WARN: File /etc/ha.d//haresources exists.
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: WARN: This file is not used because pacemaker is enabled
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: debug: Checking access of: /usr/local/heartbeat/libexec/heartbeat/ccm
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: debug: Checking access of: /usr/local/heartbeat/libexec/pacemaker/cib
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: debug: Checking access of: /usr/local/heartbeat/libexec/pacemaker/stonithd
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: debug: Checking access of: /usr/local/heartbeat/libexec/pacemaker/lrmd
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: debug: Checking access of: /usr/local/heartbeat/libexec/pacemaker/attrd
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: debug: Checking access of: /usr/local/heartbeat/libexec/pacemaker/crmd
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: WARN: Core dumps could be lost if multiple dumps occur.
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: WARN: Consider setting non-default value in /proc/sys/kernel/core_pattern (or equivalent) for maximum supportability
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: WARN: Consider setting /proc/sys/kernel/core_uses_pid (or equivalent) to 1 for maximum supportability
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: WARN: Logging daemon is disabled --enabling logging daemon is recommended
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: info: **************************
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: info: Configuration validated. Starting heartbeat 3.0.6
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4236]: info: heartbeat: version 3.0.6
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4236]: info: Heartbeat generation: 1423534116
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4236]: info: seed is -1702799346
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4236]: info: glib: ucast: write socket priority set to IPTOS_LOWDELAY on eth1
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4236]: info: glib: ucast: bound send socket to device: eth1
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4236]: info: glib: ucast: set SO_REUSEADDR
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4236]: info: glib: ucast: bound receive socket to device: eth1
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4236]: info: glib: ucast: started on port 694 interface eth1 to 10.0.17.133
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4236]: info: Local status now set to: 'up'
>Mar 17 21:02:46 lbv1.beta.com heartbeat: [4236]: info: Link lbv2.beta.com:eth1 up.
>Mar 17 21:02:46 lbv1.beta.com heartbeat: [4236]: info: Status update for node lbv2.beta.com: status up
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: Comm_now_up(): updating status to active
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: Local status now set to: 'active'
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: Starting child client "/usr/local/heartbeat/libexec/heartbeat/ccm" (109,113)
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: Starting child client "/usr/local/heartbeat/libexec/pacemaker/cib" (109,113)
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: Starting child client "/usr/local/heartbeat/libexec/pacemaker/stonithd" (0,0)
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: Starting child client "/usr/local/heartbeat/libexec/pacemaker/lrmd" (0,0)
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: Starting child client "/usr/local/heartbeat/libexec/pacemaker/attrd" (109,113)
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: Starting child client "/usr/local/heartbeat/libexec/pacemaker/crmd" (109,113)
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: debug: get_delnodelist: delnodelist=
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4250]: info: Starting "/usr/local/heartbeat/libexec/pacemaker/crmd" as uid 109Â gid 113 (pid 4250)
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4246]: info: Starting "/usr/local/heartbeat/libexec/pacemaker/cib" as uid 109Â gid 113 (pid 4246)
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4249]: info: Starting "/usr/local/heartbeat/libexec/pacemaker/attrd" as uid 109Â gid 113 (pid 4249)
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4245]: info: Starting "/usr/local/heartbeat/libexec/heartbeat/ccm" as uid 109Â gid 113 (pid 4245)
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4248]: info: Starting "/usr/local/heartbeat/libexec/pacemaker/lrmd" as uid 0Â gid 0 (pid 4248)
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4247]: info: Starting "/usr/local/heartbeat/libexec/pacemaker/stonithd" as uid 0Â gid 0 (pid 4247)
>Mar 17 21:02:47 lbv1.beta.com ccm: [4245]: info: Hostname: lbv1.beta.com
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: the send queue length from heartbeat to client ccm is set to 1024
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: the send queue length from heartbeat to client attrd is set to 1024
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: the send queue length from heartbeat to client stonith-ng is set to 1024
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: Status update for node lbv2.beta.com: status active
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: the send queue length from heartbeat to client cib is set to 1024
>Mar 17 21:02:51 lbv1.beta.com heartbeat: [4236]: WARN: 1 lost packet(s) for [lbv2.beta.com] [15:17]
>Mar 17 21:02:51 lbv1.beta.com heartbeat: [4236]: info: No pkts missing from lbv2.beta.com!
>Mar 17 21:02:52 lbv1.beta.com heartbeat: [4236]: WARN: 1 lost packet(s) for [lbv2.beta.com] [19:21]
>Mar 17 21:02:52 lbv1.beta.com heartbeat: [4236]: info: No pkts missing from lbv2.beta.com!
>Mar 17 21:02:52 lbv1.beta.com heartbeat: [4236]: info: the send queue length from heartbeat to client crmd is set to 1024
>Mar 17 21:02:53 lbv1.beta.com heartbeat: [4236]: WARN: 1 lost packet(s) for [lbv2.beta.com] [24:26]
>Mar 17 21:02:53 lbv1.beta.com heartbeat: [4236]: info: No pkts missing from lbv2.beta.com!
>Mar 17 21:02:54 lbv1.beta.com heartbeat: [4236]: WARN: 1 lost packet(s) for [lbv2.beta.com] [26:28]
>Mar 17 21:02:54 lbv1.beta.com heartbeat: [4236]: info: No pkts missing from lbv2.beta.com!
>Mar 17 21:02:54 lbv1.beta.com heartbeat: [4236]: WARN: 1 lost packet(s) for [lbv2.beta.com] [30:32]
>Mar 17 21:02:54 lbv1.beta.com heartbeat: [4236]: info: No pkts missing from lbv2.beta.com!
>
>
>
># less /var/log/error
>
>Mar 17 21:02:47 lbv1 attrd[4249]:Â Â Â error: ha_msg_dispatch: Ignored incoming message. Please set_msg_callback on hbclstat
>Mar 17 21:02:48 lbv1 attrd[4249]:Â Â Â error: ha_msg_dispatch: Ignored incoming message. Please set_msg_callback on hbclstat
>Mar 17 21:02:53 lbv1 stonith-ng[4247]:Â Â Â error: ha_msg_dispatch: Ignored incoming message. Please set_msg_callback on hbclstat
>Mar 17 21:02:53 lbv1 stonith-ng[4247]:Â Â Â error: ha_msg_dispatch: Ignored incoming message. Please set_msg_callback on hbclstat
>Mar 17 21:03:39 lbv1 crmd[4250]:Â Â Â error: process_lrm_event: Operation Stonith2-1_start_0 (node=lbv1.beta.com, call=31, status=4, cib-update=42, confirmed=true) Error
>
># cat syslog|egrep 'Mar 17 21:03|Mar 17 21:02' |egrep 'heartbeat|stonith|pacemaker|error'
>Mar 17 21:03:24 lbv1 pengine[4253]:Â Â notice: process_pe_message: Calculated Transition 0: /var/lib/pacemaker/pengine/pe-input-115.bz2
>Mar 17 21:03:27 lbv1 crmd[4250]:Â Â notice: run_graph: Transition 0 (Complete=15, Pending=0, Fired=0, Skipped=16, Incomplete=2, Source=/var/lib/pacemaker/pengine/pe-input-115.bz2): Stopped
>Mar 17 21:03:29 lbv1 pengine[4253]:Â Â notice: process_pe_message: Calculated Transition 1: /var/lib/pacemaker/pengine/pe-input-116.bz2
>Mar 17 21:03:34 lbv1 crmd[4250]:Â Â notice: run_graph: Transition 1 (Complete=8, Pending=0, Fired=0, Skipped=12, Incomplete=1, Source=/var/lib/pacemaker/pengine/pe-input-116.bz2): Stopped
>Mar 17 21:03:37 lbv1 pengine[4253]:Â warning: unpack_rsc_op_failure: Processing failed op start for Stonith1-1 on lbv2.beta.com: unknown error (1)
>Mar 17 21:03:37 lbv1 pengine[4253]:Â warning: unpack_rsc_op_failure: Processing failed op start for Stonith1-1 on lbv2.beta.com: unknown error (1)
>Mar 17 21:03:37 lbv1 pengine[4253]:Â Â notice: process_pe_message: Calculated Transition 2: /var/lib/pacemaker/pengine/pe-input-117.bz2
>Mar 17 21:03:39 lbv1 stonith-ng[4247]:Â Â notice: log_operation: Operation 'monitor' [4377] for device 'Stonith2-1' returned: -201 (Generic Pacemaker error)
>Mar 17 21:03:39 lbv1 stonith-ng[4247]:Â warning: log_operation: Stonith2-1:4377 [ Performing: stonith -t external/stonith-helper -S ]
>Mar 17 21:03:39 lbv1 stonith-ng[4247]:Â warning: log_operation: Stonith2-1:4377 [ failed to exec "stonith" ]
>Mar 17 21:03:39 lbv1 stonith-ng[4247]:Â warning: log_operation: Stonith2-1:4377 [ failed:Â 2 ]
>Mar 17 21:03:39 lbv1 crmd[4250]:Â Â Â error: process_lrm_event: Operation Stonith2-1_start_0 (node=lbv1.beta.com, call=31, status=4, cib-update=42, confirmed=true) Error
>Mar 17 21:03:40 lbv1 crmd[4250]:Â Â notice: run_graph: Transition 2 (Complete=12, Pending=0, Fired=0, Skipped=3, Incomplete=0, Source=/var/lib/pacemaker/pengine/pe-input-117.bz2): Stopped
>Mar 17 21:03:42 lbv1 pengine[4253]:Â warning: unpack_rsc_op_failure: Processing failed op start for Stonith2-1 on lbv1.beta.com: unknown error (1)
>Mar 17 21:03:42 lbv1 pengine[4253]:Â warning: unpack_rsc_op_failure: Processing failed op start for Stonith2-1 on lbv1.beta.com: unknown error (1)
>Mar 17 21:03:42 lbv1 pengine[4253]:Â warning: unpack_rsc_op_failure: Processing failed op start for Stonith1-1 on lbv2.beta.com: unknown error (1)
>Mar 17 21:03:42 lbv1 pengine[4253]:Â Â notice: process_pe_message: Calculated Transition 3: /var/lib/pacemaker/pengine/pe-input-118.bz2
>Mar 17 21:03:42 lbv1 IPaddr2(vip_208)[4448]: INFO: /usr/libexec/heartbeat/send_arp -i 200 -r 5 -p /var/run/resource-agents/send_arp-192.168.17.208 eth0 192.168.17.208 auto not_used not_used
>Mar 17 21:03:47 lbv1 crmd[4250]:Â Â notice: run_graph: Transition 3 (Complete=10, Pending=0, Fired=0, Skipped=0, Incomplete=0, Source=/var/lib/pacemaker/pengine/pe-input-118.bz2): Complete
>
>宜ã—ããŠé¡˜ã„ã—ã¾ã™ã€‚
>
>以上
>
>
>
>2015年3月17日 18:31 <renayama19661014@ybb.ne.jp>:
>
>ç¦ç”°ã•ã‚“
>>
>>ã“ã‚“ã°ã‚“ã¯ã€å±±å†…ã§ã™ã€‚
>>
>>tag付ã‘ã•ã‚Œã¦ã„ãªã„ã®ã§ã€æœ¬æ—¥ã®æœ€æ–°ç‰ˆã¯ã€
>>
>>Â * https://github.com/ClusterLabs/pacemaker/tree/e32080b460f81486b85d08ec958582b3e72d858c
>>
>>
>>ã«ãªã‚Šã¾ã™ã€‚
>>å³å´ã®[Download ZIP]ã‹ã‚‰ãƒ€ã‚¦ãƒ³ãƒãƒ¼ãƒ‰å‡ºæ¥ã¾ã™ã€‚
>>
>>以上ã§ã™ã€‚
>>
>>
>>----- Original Message -----
>>>From: Masamichi Fukuda - elf-systems <masamichi_fukuda@elf-systems.com>
>>
>>>To: "renayama19661014@ybb.ne.jp" <renayama19661014@ybb.ne.jp>; "linux-ha-japan@lists.sourceforge.jp" <linux-ha-japan@lists.sourceforge.jp>
>>>Date: 2015/3/17, Tue 18:07
>>>Subject: スプリットブレイン時ã®STONITHエラーã«ã¤ã„ã¦
>>>
>>>
>>>山内ã•ã‚“
>>>
>>>
>>>ãŠç–²ã‚Œæ§˜ã§ã™ã€ç¦ç”°ã§ã™ã€‚
>>>
>>>
>>>ã“ã¡ã‚‰ã‚’見ãŸã®ã§ã™ãŒã€
>>>https://github.com/ClusterLabs/pacemaker/tags
>>>
>>>
>>>
>>>pacemaker 1.1.12 561c4cf ãŒæœ€æ–°ã®ã‚ˆã†ãªã®ã§ã™ãŒã€‚
>>>済ã¿ã¾ã›ã‚“ãŒã€ã“れ以é™ã®æœ€æ–°ç‰ˆã¯ã©ã¡ã‚‰ã«ã‚ã‚‹ã‹æ•™ãˆã¦é ‚ã‘ã¾ã™ã‹ã€‚
>>>
>>>
>>>宜ã—ããŠé¡˜ã„ã—ã¾ã™ã€‚
>>>
>>>
>>>以上
>>>
>>>
>>>
>>>2015å¹´3月17æ—¥ç«æ›œæ—¥ã€<renayama19661014@ybb.ne.jp>ã•ã‚“ã¯æ›¸ãã¾ã—ãŸ:
>>>
>>>ç¦ç”°ã•ã‚“
>>>>
>>>>ãŠç–²ã‚Œæ§˜ã§ã™ã€‚山内ã§ã™ã€‚
>>>>
>>>>ã¯ã„。å¤ã„ã§ã™ã€‚
>>>>
>>>>PacemakerãŒHeartbeat3.0.6ã«å¯¾å¿œã—ãŸã®ã¯æ„外ã¨æœ€è¿‘ã§ã™ã€‚
>>>>ã‚‚ã£ã¨æ–°ã—ã„ã‚‚ã®ã‚’入れã¦ãã ã•ã„。(ã¾ãŸã€ã‚½ãƒ¼ã‚¹ã‹ã‚‰æ§‹ç¯‰ã™ã‚‹å¿…è¦ãŒã‚ã‚Šã¾ã™ãŒãƒ»ãƒ»ãƒ»ãƒ»)
>>>>
>>>>
>>>>
>>>>本家ã®githubã‹ã‚‰å…¥æ‰‹å¯èƒ½ã§ã™ã€‚
>>>>Â * https://github.com/ClusterLabs/pacemaker
>>>>
>>>>
>>>>å ´åˆã«ã‚ˆã£ã¦ã¯ã€æœ€æ–°ã®masterã¯ã‚¨ãƒ©ãƒ¼ãªã©ãŒå‡ºã‚‹å ´åˆãŒã‚ã‚Šã¾ã™ã®ã§ã€ãã®å ´åˆã¯ã€ãƒãƒ¼ã‚¸ãƒ§ãƒ³ã‚’å¤ã„æ–¹ã«ãŸãã£ã¦
>>>>ã„ãã®ãŒè‰¯ã„ã¨æ€ã„ã¾ã™ã€‚
>>>>
>>>>以上ã§ã™ã€‚
>>>>
>>>>
>>>>
>>>>----- Original Message -----
>>>>>From: Masamichi Fukuda - elf-systems <masamichi_fukuda@elf-systems.com>
>>>>>To: 山内英生 <renayama19661014@ybb.ne.jp>; "linux-ha-japan@lists.sourceforge.jp" <linux-ha-japan@lists.sourceforge.jp>
>>>>>Date: 2015/3/17, Tue 16:06
>>>>>Subject: Re: [Linux-ha-jp] スプリットブレイン時ã®STONITHエラーã«ã¤ã„ã¦
>>>>>
>>>>>
>>>>>山内ã•ã‚“
>>>>>
>>>>>ãŠç–²ã‚Œæ§˜ã§ã™ã€ç¦ç”°ã§ã™ã€‚
>>>>>
>>>>>以å‰ã®ãƒ¡ãƒ¼ãƒ«ã§heartbeatã¨pacemakerを最新版を入れãŸã»ã†ãŒè‰¯ã„ã¨å›žç”é ‚ãã¾ã—ãŸã€‚
>>>>>ãã“ã§ä»Šå›žã€heartbeat3.0.6ã¨pacemaker1.1.12を入れãŸã®ã§ã™ãŒã€‚
>>>>>
>>>>>heartbeat configuration: Version = "3.0.6"
>>>>>pacemaker configuration: Version = 1.1.12 (Build: 561c4cf)pacemakerãŒã¾ã å¤ã„ã¨ã„ã†ã“ã¨ã§ã—ょã†ã‹ã€‚
>>>>>
>>>>>済ã¿ã¾ã›ã‚“ãŒã€å®œã—ããŠé¡˜ã„ã—ã¾ã™ã€‚
>>>>>
>>>>>以上
>>>>>
>>>>>
>>>>>
>>>>>2015年3月17日 14:59 <renayama19661014@ybb.ne.jp>:
>>>>>
>>>>>ç¦ç”°ã•ã‚“
>>>>>>
>>>>>>ãŠç–²ã‚Œæ§˜ã§ã™ã€‚山内ã§ã™ã€‚
>>>>>>
>>>>>>ãµã¨æ€ã£ãŸã®ã™ãŒã€ä»¥å‰ã®ã‚„ã‚Šå–ã‚Šã®ãƒ¡ãƒ¼ãƒ«ã§ä»¥ä¸‹ã¨å›žç”ã—ã¦ã¾ã™ãŒã€å•é¡Œãªã„ã§ã—ょã†ã‹ï¼Ÿ
>>>>>>
>>>>>>
>>>>>>>>>>>> 2)Heartbeat3.0.6+Pacemaker最新 : OK
>>>>>>>>>>>> Â Â
>>>>>>>>>>>> ã©ã†ã‚„らã€Heartbeatも最新版3.0.6を組åˆã›ã‚‹å¿…è¦ãŒã‚るよã†ã§ã™ã€‚
>>>>>>>>>>>> Â *Â http://hg.linux-ha.org/heartbeat-STABLE_3_0/rev/cceeb47a7d8f
>>>>>>
>>>>>>以下ã®crm_monã®ãƒãƒ¼ã‚¸ãƒ§ãƒ³ã‚’見るã¨ã€1.1.12ã®ã‚ˆã†ã§ã™ã€‚
>>>>>>Heartbeat3.0.6ã¨çµ„ã¿åˆã‚ã›ã‚‹ã«ã¯ã€ã‹ãªã‚Šæ–°ã—ã‚ã®PacemakerãŒå¿…è¦ã§ã™ã€‚
>>>>>>
>>>>>>># crm_mon -rfA
>>>>>>>
>>>>>>>Last updated: Tue Mar 17 14:14:39 2015
>>>>>>>Last change: Tue Mar 17 14:01:43 2015
>>>>>>>Stack: heartbeat
>>>>>>>Current DC: lbv2.beta.com (82ffc36f-1ad8-8686-7db0-35686465c624) - parti
>>>>>>>tion with quorum
>>>>>>>Version: 1.1.12-561c4cf
>>>>>>
>>>>>>ãŸã¶ã‚“ã€ä»¥ä¸‹ã®å¤‰æ›´ä»¥é™ã¯å°‘ãªãã¨ã‚‚å¿…è¦ã‹ã¨æ€ã„ã¾ã™ã€‚
>>>>>>
>>>>>>https://github.com/ClusterLabs/pacemaker/commit/f2302da063d08719d28367d8e362b8bfb0f85bf3
>>>>>>
>>>>>>
>>>>>>
>>>>>>以上ã§ã™ã€‚
>>>>>>
>>>>>>
>>>>>>
>>>>>>----- Original Message -----
>>>>>>>From: Masamichi Fukuda - elf-systems <masamichi_fukuda@elf-systems.com>
>>>>>>>To: 山内英生 <renayama19661014@ybb.ne.jp>; "linux-ha-japan@lists.sourceforge.jp" <linux-ha-japan@lists.sourceforge.jp>
>>>>>>
>>>>>>>Date: 2015/3/17, Tue 14:38
>>>>>>>Subject: Re: [Linux-ha-jp] スプリットブレイン時ã®STONITHエラーã«ã¤ã„ã¦
>>>>>>>
>>>>>>>
>>>>>>>山内ã•ã‚“
>>>>>>>
>>>>>>>ãŠç–²ã‚Œæ§˜ã§ã™ã€ç¦ç”°ã§ã™ã€‚
>>>>>>>
>>>>>>>stonith-helperã®ã‚·ã‚§ãƒãƒ³ã‚°è¡Œã«-xã‚’è¿½åŠ ã™ã‚Œã°è‰¯ã„ã®ã§ã—ょã†ã‹ï¼Ÿ
>>>>>>>stonith-helperã®å…ˆé 行を#!/bin/bash -xã«ã—ã¦ã‚¯ãƒ©ã‚¹ã‚¿ã‚’èµ·å‹•ã—ã¦ã¿ã¾ã—ãŸã€‚
>>>>>>>
>>>>>>>crm_monã§ã¯å…ˆã»ã©ã¨å¤‰ã‚ã‚Šã¯ãªã„よã†ã§ã™ã€‚
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>># crm_mon -rfA
>>>>>>>
>>>>>>>Last updated: Tue Mar 17 14:14:39 2015
>>>>>>>Last change: Tue Mar 17 14:01:43 2015
>>>>>>>Stack: heartbeat
>>>>>>>Current DC: lbv2.beta.com (82ffc36f-1ad8-8686-7db0-35686465c624) - parti
>>>>>>>tion with quorum
>>>>>>>Version: 1.1.12-561c4cf
>>>>>>>2 Nodes configured
>>>>>>>8 Resources configured
>>>>>>>
>>>>>>>Online: [ lbv1.beta.com lbv2.beta.com ]
>>>>>>>
>>>>>>>Full list of resources:
>>>>>>>
>>>>>>>Â Resource Group: HAvarnish
>>>>>>>Â Â Â Â vip_208Â Â Â (ocf::heartbeat:IPaddr2):Â Â Â Â Â Â Started lbv1.beta.com
>>>>>>>    varnishd  (lsb:varnish): Started lbv1.beta.com
>>>>>>>Â Resource Group: grpStonith1
>>>>>>>Â Â Â Â Stonith1-1 (stonith:external/stonith-helper):Â Â Â Â Â Stopped
>>>>>>>Â Â Â Â Stonith1-2 (stonith:external/xen0):Â Â Â Â Â Â Â Stopped
>>>>>>>Â Resource Group: grpStonith2
>>>>>>>Â Â Â Â Stonith2-1 (stonith:external/stonith-helper):Â Â Â Â Â Stopped
>>>>>>>Â Â Â Â Stonith2-2 (stonith:external/xen0):Â Â Â Â Â Â Â Stopped
>>>>>>>Â Clone Set: clone_ping [ping]
>>>>>>>Â Â Â Â Started: [ lbv1.beta.com lbv2.beta.com ]
>>>>>>>
>>>>>>>Node Attributes:
>>>>>>>* Node lbv1.beta.com:
>>>>>>>   + default_ping_set                 : 100
>>>>>>>* Node lbv2.beta.com:
>>>>>>>   + default_ping_set                 : 100
>>>>>>>
>>>>>>>Migration summary:
>>>>>>>* Node lbv2.beta.com:
>>>>>>>Â Â Stonith1-1: migration-threshold=1 fail-count=1000000 last-failure='Tue Mar 17
>>>>>>>Â 14:12:16 2015'
>>>>>>>* Node lbv1.beta.com:
>>>>>>>Â Â Stonith2-1: migration-threshold=1 fail-count=1000000 last-failure='Tue Mar 17
>>>>>>>Â 14:12:21 2015'
>>>>>>>
>>>>>>>Failed actions:
>>>>>>>Â Â Â Stonith1-1_start_0 on lbv2.beta.com 'unknown error' (1): call=31, st
>>>>>>>atus=Error, last-rc-change='Tue Mar 17 14:12:14 2015', queued=0ms, exec=1065ms
>>>>>>>Â Â Â Stonith2-1_start_0 on lbv1.beta.com 'unknown error' (1): call=26, st
>>>>>>>atus=Error, last-rc-change='Tue Mar 17 14:12:19 2015', queued=0ms, exec=1081ms
>>>>>>>
>>>>>>>ãã®ä»–ã®ãƒã‚°ã‚’探ã—ã¦ã¿ã¾ã—ãŸã€‚
>>>>>>>
>>>>>>>heartbeat起動時ã§ã™ã€‚
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>># less /var/log/pm_logconv.out
>>>>>>>Mar 17 14:11:28 lbv1.beta.com info: Starting Heartbeat 3.0.6.
>>>>>>>Mar 17 14:11:33 lbv1.beta.com info: Link lbv2.beta.com:eth1 is up.
>>>>>>>Mar 17 14:11:34 lbv1.beta.com info: Start "ccm" process. (pid=13264)
>>>>>>>Mar 17 14:11:34 lbv1.beta.com info: Start "lrmd" process. (pid=13267)
>>>>>>>Mar 17 14:11:34 lbv1.beta.com info: Start "attrd" process. (pid=13268)
>>>>>>>Mar 17 14:11:34 lbv1.beta.com info: Start "stonithd" process. (pid=13266)
>>>>>>>Mar 17 14:11:34 lbv1.beta.com info: Start "cib" process. (pid=13265)
>>>>>>>Mar 17 14:11:34 lbv1.beta.com info: Start "crmd" process. (pid=13269)
>>>>>>>
>>>>>>>
>>>>>>># less /var/log/error
>>>>>>>Mar 17 14:12:20 lbv1 crmd[13269]:Â Â Â error: process_lrm_event: Operation Stonith2-1_start_0 (node=lbv1.beta.com, call=26, status=4, cib-update=19, confirmed=true) Error
>>>>>>>
>>>>>>>
>>>>>>>syslogã‹ã‚‰stonithã‚’grepã—ãŸã‚‚ã®ã§ã™
>>>>>>>
>>>>>>>Mar 17 14:11:34 lbv1 heartbeat: [13255]: info: Starting child client "/usr/local/heartbeat/libexec/pacemaker/stonithd" (0,0)
>>>>>>>Mar 17 14:11:34 lbv1 heartbeat: [13266]: info: Starting "/usr/local/heartbeat/libexec/pacemaker/stonithd" as uid 0Â gid 0 (pid 13266)
>>>>>>>Mar 17 14:11:34 lbv1 stonithd[13266]:Â Â notice: crm_cluster_connect: Connecting to cluster infrastructure: heartbeat
>>>>>>>Mar 17 14:11:34 lbv1 heartbeat: [13255]: info: the send queue length from heartbeat to client stonithd is set to 1024
>>>>>>>Mar 17 14:11:40 lbv1 stonithd[13266]:Â Â notice: setup_cib: Watching for stonith topology changes
>>>>>>>Mar 17 14:11:40 lbv1 stonithd[13266]:Â Â notice: unpack_config: On loss of CCM Quorum: Ignore
>>>>>>>Mar 17 14:11:40 lbv1 stonithd[13266]:Â warning: handle_startup_fencing: Blind faith: not fencing unseen nodes
>>>>>>>Mar 17 14:11:40 lbv1 stonithd[13266]:Â warning: handle_startup_fencing: Blind faith: not fencing unseen nodes
>>>>>>>Mar 17 14:11:41 lbv1 stonithd[13266]:Â Â notice: stonith_device_register: Added 'Stonith2-1' to the device list (1 active devices)
>>>>>>>Mar 17 14:11:41 lbv1 stonithd[13266]:Â Â notice: stonith_device_register: Added 'Stonith2-2' to the device list (2 active devices)
>>>>>>>Mar 17 14:12:04 lbv1 stonithd[13266]:Â Â notice: xml_patch_version_check: Versions did not change in patch 0.5.0
>>>>>>>Mar 17 14:12:20 lbv1 stonithd[13266]:Â Â notice: log_operation: Operation 'monitor' [13386] for device 'Stonith2-1' returned: -201 (Generic Pacemaker error)
>>>>>>>Mar 17 14:12:20 lbv1 stonithd[13266]:Â warning: log_operation: Stonith2-1:13386 [ Performing: stonith -t external/stonith-helper -S ]
>>>>>>>Mar 17 14:12:20 lbv1 stonithd[13266]:Â warning: log_operation: Stonith2-1:13386 [ failed to exec "stonith" ]
>>>>>>>Mar 17 14:12:20 lbv1 stonithd[13266]:Â warning: log_operation: Stonith2-1:13386 [ failed:Â 2 ]
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>宜ã—ããŠé¡˜ã„ã—ã¾ã™ã€‚
>>>>>>>
>>>>>>>以上
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>2015年3月17日 13:32 <renayama19661014@ybb.ne.jp>:
>>>>>>>
>>>>>>>ç¦ç”°ã•ã‚“
>>>>>>>>
>>>>>>>>ãŠç–²ã‚Œæ§˜ã§ã™ã€‚山内ã§ã™ã€‚
>>>>>>>>
>>>>>>>>ã¨ã„ã†ã“ã¨ã¯ã€stonith-helperã®startã«å•é¡ŒãŒã‚るよã†ã§ã™ã。
>>>>>>>>
>>>>>>>>stonith-helperã®å…ˆé ã«
>>>>>>>>
>>>>>>>>#!/bin/bash -x
>>>>>>>>
>>>>>>>>
>>>>>>>>を入れã¦ã€ã‚¯ãƒ©ã‚¹ã‚¿ã‚’èµ·å‹•ã™ã‚‹ã¨ä½•ã‹ã‚ã‹ã‚‹ã‹ã‚‚知れã¾ã›ã‚“。
>>>>>>>>
>>>>>>>>ã¡ãªã¿ã«ã€stonith-helperã®ãƒã‚°ã‚‚ã©ã“ã‹ã«å‡ºã¦ã„ã‚‹ã¨æ€ã†ã®ã§ã™ãŒã€‚。。
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>以上ã§ã™ã€‚
>>>>>>>>
>>>>>>>>----- Original Message -----
>>>>>>>>>From: Masamichi Fukuda - elf-systems <masamichi_fukuda@elf-systems.com>
>>>>>>>>>To: 山内英生 <renayama19661014@ybb.ne.jp>; "linux-ha-japan@lists.sourceforge.jp" <linux-ha-japan@lists.sourceforge.jp>
>>>>>>>>
>>>>>>>>>Date: 2015/3/17, Tue 12:31
>>>>>>>>>Subject: Re: [Linux-ha-jp] スプリットブレイン時ã®STONITHエラーã«ã¤ã„ã¦
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>山内ã•ã‚“
>>>>>>>>>cc:æ¾å³¶ã•ã‚“
>>>>>>>>>
>>>>>>>>>ã“ã‚“ã«ã¡ã¯ã€ç¦ç”°ã§ã™ã€‚
>>>>>>>>>
>>>>>>>>>åŒã˜ãƒ‡ã‚£ãƒ¬ã‚¯ãƒˆãƒªã«xen0ã¯ã‚ã‚Šã¾ã—ãŸã€‚
>>>>>>>>>
>>>>>>>>># pwd
>>>>>>>>>/usr/local/heartbeat/lib/stonith/plugins/external
>>>>>>>>>
>>>>>>>>># ls
>>>>>>>>>drac5         ibmrsa        kdumpcheck riloe       vmware
>>>>>>>>>dracmc-telnet ibmrsa-telnet libvirt    ssh       xen0
>>>>>>>>>hetzner       ipmi        nut    stonith-helper xen0-ha
>>>>>>>>>hmchttp       ippower9258   rackpdu    vcenter
>>>>>>>>>
>>>>>>>>>宜ã—ããŠé¡˜ã„ã—ã¾ã™ã€‚
>>>>>>>>>
>>>>>>>>>以上
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>2015-03-17 10:53 GMT+09:00 <renayama19661014@ybb.ne.jp>:
>>>>>>>>>
>>>>>>>>>ç¦ç”°ã•ã‚“
>>>>>>>>>>cc:æ¾å³¶ã•ã‚“
>>>>>>>>>>
>>>>>>>>>>ãŠç–²ã‚Œæ§˜ã§ã™ã€‚山内ã§ã™ã€‚
>>>>>>>>>>
>>>>>>>>>>>標準出力や標準エラー出力ã¯ã‚ã‚Šã¾ã›ã‚“ã§ã—ãŸã€‚
>>>>>>>>>>>
>>>>>>>>>>>stonith-helperãŒãŠã‹ã—ã„ã®ã§ã—ょã†ã‹ã€‚
>>>>>>>>>>>stonith-helperã¯ã‚·ã‚§ãƒ«ã‚¹ã‚¯ãƒªãƒ—トãªã®ã§ã‚¤ãƒ³ã‚¹ãƒˆãƒ¼ãƒ«ã¯ã‚ã¾ã‚Šæ°—ã«ã—ã¦ã„ãªã‹ã£ãŸã®ã§ã™ãŒã€‚
>>>>>>>>>>>stonith-helperã¯ã“ã“ã«é…ç½®ã•ã‚Œã¦ã„ã¾ã™ã€‚
>>>>>>>>>>>/usr/local/heartbeat/lib/stonith/plugins/external/stonith-helper
>>>>>>>>>>
>>>>>>>>>>ã“ã®ãƒ‡ã‚£ãƒ¬ã‚¯ãƒˆãƒªã«xen0ã‚‚ã‚ã‚Šã¾ã™ã‹ï¼Ÿ
>>>>>>>>>>ç„¡ã„よã†ã§ã—ãŸã‚‰ã€å•é¡ŒãŒã‚ã‚Šã¾ã™ã®ã§ã€ä¸€åº¦ã€stonith-helperã®ãƒ•ã‚¡ã‚¤ãƒ«ã‚’属性ãªã©ã¯ãã®ã¾ã¾ã€xen0ã¨åŒã˜ãƒ‡ã‚£ãƒ¬ã‚¯ãƒˆãƒªã«
>>>>>>>>>>コピーã—ã¦ã¿ã¦ãã ã•ã„。
>>>>>>>>>>
>>>>>>>>>>ãã‚Œã§ç¨¼åƒã™ã‚‹ãªã‚‰ã€pm_extrasã®ã‚¤ãƒ³ã‚¹ãƒˆãƒ¼ãƒ«ã«å•é¡ŒãŒã‚ã‚‹ã¨ã„ã†ã“ã¨ã«ãªã‚Šã¾ã™ã€‚
>>>>>>>>>>
>>>>>>>>>>以上ã§ã™ã€‚
>>>>>>>>>>
>>>>>>>>>>----- Original Message -----
>>>>>>>>>>>From: Masamichi Fukuda - elf-systems <masamichi_fukuda@elf-systems.com>
>>>>>>>>>>>To: 山内英生 <renayama19661014@ybb.ne.jp>; "linux-ha-japan@lists.sourceforge.jp" <linux-ha-japan@lists.sourceforge.jp>
>>>>>>>>>>
>>>>>>>>>>>Date: 2015/3/17, Tue 10:31
>>>>>>>>>>>Subject: Re: [Linux-ha-jp] スプリットブレイン時ã®STONITHエラーã«ã¤ã„ã¦
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>山内ã•ã‚“
>>>>>>>>>>>cc:æ¾å³¶ã•ã‚“
>>>>>>>>>>>
>>>>>>>>>>>ãŠã¯ã‚ˆã†ã”ã–ã„ã¾ã™ã€ç¦ç”°ã§ã™ã€‚
>>>>>>>>>>>crmã®ä¾‹ã‚’ã‚ã‚ŠãŒã¨ã†ã”ã–ã„ã¾ã™ã€‚
>>>>>>>>>>>
>>>>>>>>>>>早速ã€ã“ã¡ã‚‰ã®ç’°å¢ƒã«åˆã‚ã›ã¦ã¿ã¾ã—ãŸã€‚
>>>>>>>>>>>
>>>>>>>>>>>$ cat test.crm
>>>>>>>>>>>### Cluster Option ###
>>>>>>>>>>>property \
>>>>>>>>>>>Â Â Â no-quorum-policy="ignore" \
>>>>>>>>>>>Â Â Â stonith-enabled="true" \
>>>>>>>>>>>Â Â Â startup-fencing="false" \
>>>>>>>>>>>Â Â Â stonith-timeout="710s" \
>>>>>>>>>>>Â Â Â crmd-transition-delay="2s"
>>>>>>>>>>>
>>>>>>>>>>>### Resource Default ###
>>>>>>>>>>>rsc_defaults \
>>>>>>>>>>>Â Â Â resource-stickiness="INFINITY" \
>>>>>>>>>>>Â Â Â migration-threshold="1"
>>>>>>>>>>>
>>>>>>>>>>>### Group Configuration ###
>>>>>>>>>>>group HAvarnish \
>>>>>>>>>>>Â Â Â vip_208 \
>>>>>>>>>>>Â Â Â varnishd
>>>>>>>>>>>
>>>>>>>>>>>group grpStonith1 \
>>>>>>>>>>>Â Â Â Stonith1-1 \
>>>>>>>>>>>Â Â Â Stonith1-2
>>>>>>>>>>>
>>>>>>>>>>>group grpStonith2 \
>>>>>>>>>>>Â Â Â Stonith2-1 \
>>>>>>>>>>>Â Â Â Stonith2-2
>>>>>>>>>>>
>>>>>>>>>>>### Clone Configuration ###
>>>>>>>>>>>clone clone_ping \
>>>>>>>>>>>Â Â Â ping
>>>>>>>>>>>
>>>>>>>>>>>### Fencing Topology ###
>>>>>>>>>>>fencing_topology \
>>>>>>>>>>>Â Â Â lbv1.beta.com: Stonith1-1 Stonith1-2 \
>>>>>>>>>>>Â Â Â lbv2.beta.com: Stonith2-1 Stonith2-2
>>>>>>>>>>>
>>>>>>>>>>>### Primitive Configuration ###
>>>>>>>>>>>primitive vip_208 ocf:heartbeat:IPaddr2 \
>>>>>>>>>>>Â Â Â params \
>>>>>>>>>>>Â Â Â Â Â Â Â ip="192.168.17.208" \
>>>>>>>>>>>Â Â Â Â Â Â Â nic="eth0" \
>>>>>>>>>>>Â Â Â Â Â Â Â cidr_netmask="24" \
>>>>>>>>>>>Â Â Â op start interval="0s" timeout="90s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op monitor interval="5s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op stop interval="0s" timeout="100s" on-fail="fence"
>>>>>>>>>>>
>>>>>>>>>>>primitive varnishd lsb:varnish \
>>>>>>>>>>>Â Â Â op start interval="0s" timeout="90s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op monitor interval="10s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op stop interval="0s" timeout="100s" on-fail="fence"
>>>>>>>>>>>
>>>>>>>>>>>primitive ping ocf:pacemaker:ping \
>>>>>>>>>>>Â Â Â params \
>>>>>>>>>>>Â Â Â Â Â Â Â name="default_ping_set" \
>>>>>>>>>>>Â Â Â Â Â Â Â host_list="192.168.17.254" \
>>>>>>>>>>>Â Â Â Â Â Â Â multiplier="100" \
>>>>>>>>>>>Â Â Â Â Â Â Â dampen="1" \
>>>>>>>>>>>Â Â Â op start interval="0s" timeout="90s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op monitor interval="10s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op stop interval="0s" timeout="100s" on-fail="fence"
>>>>>>>>>>>
>>>>>>>>>>>primitive Stonith1-1 stonith:external/stonith-helper \
>>>>>>>>>>>Â Â Â params \
>>>>>>>>>>>Â Â Â Â Â Â Â pcmk_reboot_retries="1" \
>>>>>>>>>>>Â Â Â Â Â Â Â pcmk_reboot_timeout="40s" \
>>>>>>>>>>>Â Â Â Â Â Â Â hostlist="lbv1.beta.com" \
>>>>>>>>>>>Â Â Â Â Â Â Â dead_check_target="192.168.17.132 10.0.17.132" \
>>>>>>>>>>>Â Â Â Â Â Â Â standby_check_command="/usr/local/sbin/crm_resource -r varnishd -W | grep -q `hostname`" \
>>>>>>>>>>>Â Â Â Â Â Â Â run_online_check="yes" \
>>>>>>>>>>>Â Â Â op start interval="0s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op stop interval="0s" timeout="60s" on-fail="ignore"
>>>>>>>>>>>
>>>>>>>>>>>primitive Stonith1-2 stonith:external/xen0 \
>>>>>>>>>>>Â Â Â params \
>>>>>>>>>>>Â Â Â Â Â Â Â pcmk_reboot_timeout="60s" \
>>>>>>>>>>>Â Â Â Â Â Â Â hostlist="lbv1.beta.com:/etc/xen/lbv1.cfg" \
>>>>>>>>>>>Â Â Â Â Â Â Â dom0="xen0.beta.com" \
>>>>>>>>>>>Â Â Â op start interval="0s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op monitor interval="3600s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op stop interval="0s" timeout="60s" on-fail="ignore"
>>>>>>>>>>>
>>>>>>>>>>>primitive Stonith2-1 stonith:external/stonith-helper \
>>>>>>>>>>>Â Â Â params \
>>>>>>>>>>>Â Â Â Â Â Â Â pcmk_reboot_retries="1" \
>>>>>>>>>>>Â Â Â Â Â Â Â pcmk_reboot_timeout="40s" \
>>>>>>>>>>>Â Â Â Â Â Â Â hostlist="lbv2.beta.com" \
>>>>>>>>>>>Â Â Â Â Â Â Â dead_check_target="192.168.17.133 10.0.17.133" \
>>>>>>>>>>>Â Â Â Â Â Â Â standby_check_command="/usr/local/sbin/crm_resource -r varnishd -W | grep -q `hostname`" \
>>>>>>>>>>>Â Â Â Â Â Â Â run_online_check="yes" \
>>>>>>>>>>>Â Â Â op start interval="0s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op stop interval="0s" timeout="60s" on-fail="ignore"
>>>>>>>>>>>
>>>>>>>>>>>primitive Stonith2-2 stonith:external/xen0 \
>>>>>>>>>>>Â Â Â params \
>>>>>>>>>>>Â Â Â Â Â Â Â pcmk_reboot_timeout="60s" \
>>>>>>>>>>>Â Â Â Â Â Â Â hostlist="lbv2.beta.com:/etc/xen/lbv2.cfg" \
>>>>>>>>>>>Â Â Â Â Â Â Â dom0="xen0.beta.com" \
>>>>>>>>>>>Â Â Â op start interval="0s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op monitor interval="3600s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op stop interval="0s" timeout="60s" on-fail="ignore"
>>>>>>>>>>>
>>>>>>>>>>>### Resource Location ###
>>>>>>>>>>>location HA_location-1 HAvarnish \
>>>>>>>>>>>Â Â Â rule 200: #uname eq lbv1.beta.com \
>>>>>>>>>>>Â Â Â rule 100: #uname eq lbv2.beta.com
>>>>>>>>>>>
>>>>>>>>>>>location HA_location-2 HAvarnish \
>>>>>>>>>>>Â Â Â rule -INFINITY: not_defined default_ping_set or default_ping_set lt 100
>>>>>>>>>>>
>>>>>>>>>>>location HA_location-3 grpStonith1 \
>>>>>>>>>>>Â Â Â rule -INFINITY: #uname eq lbv1.beta.com
>>>>>>>>>>>
>>>>>>>>>>>location HA_location-4 grpStonith2 \
>>>>>>>>>>>Â Â Â rule -INFINITY: #uname eq lbv2.beta.com
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>ã“れをæµã—ã“ã‚“ã ã¨ã“ã‚ã€æ˜¨æ—¥ã¨ã¯ãƒ¡ãƒƒã‚»ãƒ¼ã‚¸ãŒç•°ãªã‚Šã¾ã™ã€‚
>>>>>>>>>>>pingã®ãƒ¡ãƒƒã‚»ãƒ¼ã‚¸ã¯ãªããªã£ã¦ã„ã¾ã—ãŸã€‚
>>>>>>>>>>>
>>>>>>>>>>># crm_mon -rfA
>>>>>>>>>>>Last updated: Tue Mar 17 10:21:28 2015
>>>>>>>>>>>Last change: Tue Mar 17 10:21:09 2015
>>>>>>>>>>>Stack: heartbeat
>>>>>>>>>>>Current DC: lbv2.beta.com (82ffc36f-1ad8-8686-7db0-35686465c624) - parti
>>>>>>>>>>>tion with quorum
>>>>>>>>>>>Version: 1.1.12-561c4cf
>>>>>>>>>>>2 Nodes configured
>>>>>>>>>>>8 Resources configured
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>Online: [ lbv1.beta.com lbv2.beta.com ]
>>>>>>>>>>>
>>>>>>>>>>>Full list of resources:
>>>>>>>>>>>
>>>>>>>>>>>Â Resource Group: HAvarnish
>>>>>>>>>>>Â Â Â Â vip_208Â Â Â (ocf::heartbeat:IPaddr2):Â Â Â Â Â Â Started lbv1.beta.com
>>>>>>>>>>>    varnishd  (lsb:varnish): Started lbv1.beta.com
>>>>>>>>>>>Â Resource Group: grpStonith1
>>>>>>>>>>>Â Â Â Â Stonith1-1 (stonith:external/stonith-helper):Â Â Â Â Â Stopped
>>>>>>>>>>>Â Â Â Â Stonith1-2 (stonith:external/xen0):Â Â Â Â Â Â Â Stopped
>>>>>>>>>>>Â Resource Group: grpStonith2
>>>>>>>>>>>Â Â Â Â Stonith2-1 (stonith:external/stonith-helper):Â Â Â Â Â Stopped
>>>>>>>>>>>Â Â Â Â Stonith2-2 (stonith:external/xen0):Â Â Â Â Â Â Â Stopped
>>>>>>>>>>>Â Clone Set: clone_ping [ping]
>>>>>>>>>>>Â Â Â Â Started: [ lbv1.beta.com lbv2.beta.com ]
>>>>>>>>>>>
>>>>>>>>>>>Node Attributes:
>>>>>>>>>>>* Node lbv1.beta.com:
>>>>>>>>>>>   + default_ping_set                 : 100
>>>>>>>>>>>* Node lbv2.beta.com:
>>>>>>>>>>>   + default_ping_set                 : 100
>>>>>>>>>>>
>>>>>>>>>>>Migration summary:
>>>>>>>>>>>* Node lbv2.beta.com:
>>>>>>>>>>>Â Â Stonith1-1: migration-threshold=1 fail-count=1000000 last-failure='Tue Mar 17
>>>>>>>>>>>Â 10:21:17 2015'
>>>>>>>>>>>* Node lbv1.beta.com:
>>>>>>>>>>>Â Â Stonith2-1: migration-threshold=1 fail-count=1000000 last-failure='Tue Mar 17
>>>>>>>>>>>Â 10:21:17 2015'
>>>>>>>>>>>
>>>>>>>>>>>Failed actions:
>>>>>>>>>>>Â Â Â Stonith1-1_start_0 on lbv2.beta.com 'unknown error' (1): call=31, st
>>>>>>>>>>>atus=Error, last-rc-change='Tue Mar 17 10:21:15 2015', queued=0ms, exec=1082ms
>>>>>>>>>>>Â Â Â Stonith2-1_start_0 on lbv1.beta.com 'unknown error' (1): call=31, st
>>>>>>>>>>>atus=Error, last-rc-change='Tue Mar 17 10:21:16 2015', queued=0ms, exec=1079ms
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>/var/log/ha-debugã®ãƒã‚°ã§ã™ã€‚
>>>>>>>>>>>
>>>>>>>>>>>IPaddr2(vip_208)[7851]: 2015/03/17_10:21:22 INFO: Adding inet address 192.168.17.208/24 with broadcast address 192.168.17.255 to device eth0
>>>>>>>>>>>IPaddr2(vip_208)[7851]: 2015/03/17_10:21:22 INFO: Bringing device eth0 up
>>>>>>>>>>>IPaddr2(vip_208)[7851]: 2015/03/17_10:21:22 INFO: /usr/libexec/heartbeat/send_arp -i 200 -r 5 -p /var/run/resource-agents/send_arp-192.168.17.208 eth0 192.168.17.208 auto not_used not_used
>>>>>>>>>>>
>>>>>>>>>>>標準出力や標準エラー出力ã¯ã‚ã‚Šã¾ã›ã‚“ã§ã—ãŸã€‚
>>>>>>>>>>>
>>>>>>>>>>>stonith-helperãŒãŠã‹ã—ã„ã®ã§ã—ょã†ã‹ã€‚
>>>>>>>>>>>stonith-helperã¯ã‚·ã‚§ãƒ«ã‚¹ã‚¯ãƒªãƒ—トãªã®ã§ã‚¤ãƒ³ã‚¹ãƒˆãƒ¼ãƒ«ã¯ã‚ã¾ã‚Šæ°—ã«ã—ã¦ã„ãªã‹ã£ãŸã®ã§ã™ãŒã€‚
>>>>>>>>>>>stonith-helperã¯ã“ã“ã«é…ç½®ã•ã‚Œã¦ã„ã¾ã™ã€‚
>>>>>>>>>>>/usr/local/heartbeat/lib/stonith/plugins/external/stonith-helper
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>宜ã—ããŠé¡˜ã„ã—ã¾ã™ã€‚
>>>>>>>>>>>
>>>>>>>>>>>以上
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>2015-03-17 9:45 GMT+09:00 <renayama19661014@ybb.ne.jp>:
>>>>>>>>>>>
>>>>>>>>>>>ç¦ç”°ã•ã‚“
>>>>>>>>>>>>
>>>>>>>>>>>>ãŠã¯ã‚ˆã†ã”ã–ã„ã¾ã™ã€‚山内ã§ã™ã€‚
>>>>>>>>>>>>
>>>>>>>>>>>>念ã®ç‚ºã€æ‰‹å…ƒã«ã‚る複数ã®stonithを利用ã—ãŸå ´åˆã®ä¾‹ã‚’抜粋ã—ã¦ãŠé€ã‚Šã—ã¾ã™ã€‚
>>>>>>>>>>>>(実際ã«ã¯ã€æ”¹è¡Œã«æ°—を付ã‘ã¦ãã ã•ã„)
>>>>>>>>>>>>
>>>>>>>>>>>>以下ã®ä¾‹ã¯ã€PM1.1ç³»ã§ã®è¨å®šã§ã€
>>>>>>>>>>>>nodeaã¯ã€prmStonith1-1ã€Â prmStonith1-2ã®é †ã§stonithãŒå®Ÿè¡Œã•ã‚Œã¾ã™ã€‚
>>>>>>>>>>>>nodebã¯ã€prmStonith2-1ã€Â prmStonith2-2ã®é †ã§stonithãŒå®Ÿè¡Œã•ã‚Œã¾ã™ã€‚
>>>>>>>>>>>>
>>>>>>>>>>>>stonith自体ã¯ã€helperã¨sshã§ã™ã€‚
>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>>(snip)
>>>>>>>>>>>>### Group Configuration ###
>>>>>>>>>>>>group grpStonith1 \
>>>>>>>>>>>>prmStonith1-1 \
>>>>>>>>>>>>prmStonith1-2
>>>>>>>>>>>>
>>>>>>>>>>>>group grpStonith2 \
>>>>>>>>>>>>prmStonith2-1 \
>>>>>>>>>>>>prmStonith2-2
>>>>>>>>>>>>
>>>>>>>>>>>>### Fencing Topology ###
>>>>>>>>>>>>fencing_topology \
>>>>>>>>>>>>nodea: prmStonith1-1 prmStonith1-2 \
>>>>>>>>>>>>nodeb: prmStonith2-1 prmStonith2-2
>>>>>>>>>>>>(snp)
>>>>>>>>>>>>primitive prmStonith1-1 stonith:external/stonith-helper \
>>>>>>>>>>>>params \
>>>>>>>>>>>>
>>>>>>>>>>>>pcmk_reboot_retries="1" \
>>>>>>>>>>>>pcmk_reboot_timeout="40s" \
>>>>>>>>>>>>hostlist="nodea" \
>>>>>>>>>>>>dead_check_target="192.168.28.60 192.168.28.70" \
>>>>>>>>>>>>standby_check_command="/usr/sbin/crm_resource -r prmRES -W | grep -qi `hostname`" \
>>>>>>>>>>>>run_online_check="yes" \
>>>>>>>>>>>>op start interval="0s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>>op stop interval="0s" timeout="60s" on-fail="ignore"
>>>>>>>>>>>>
>>>>>>>>>>>>primitive prmStonith1-2 stonith:external/ssh \
>>>>>>>>>>>>params \
>>>>>>>>>>>>pcmk_reboot_timeout="60s" \
>>>>>>>>>>>>hostlist="nodea" \
>>>>>>>>>>>>op start interval="0s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>>op monitor interval="3600s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>>op stop interval="0s" timeout="60s" on-fail="ignore"
>>>>>>>>>>>>
>>>>>>>>>>>>primitive prmStonith2-1 stonith:external/stonith-helper \
>>>>>>>>>>>>params \
>>>>>>>>>>>>pcmk_reboot_retries="1" \
>>>>>>>>>>>>pcmk_reboot_timeout="40s" \
>>>>>>>>>>>>hostlist="nodeb" \
>>>>>>>>>>>>dead_check_target="192.168.28.61 192.168.28.71" \
>>>>>>>>>>>>standby_check_command="/usr/sbin/crm_resource -r prmRES -W | grep -qi `hostname`" \
>>>>>>>>>>>>run_online_check="yes" \
>>>>>>>>>>>>op start interval="0s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>>op stop interval="0s" timeout="60s" on-fail="ignore"
>>>>>>>>>>>>
>>>>>>>>>>>>primitive prmStonith2-2 stonith:external/ssh \
>>>>>>>>>>>>params \
>>>>>>>>>>>>pcmk_reboot_timeout="60s" \
>>>>>>>>>>>>hostlist="nodeb" \
>>>>>>>>>>>>op start interval="0s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>>op monitor interval="3600s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>>op stop interval="0s" timeout="60s" on-fail="ignore"
>>>>>>>>>>>>(snip)
>>>>>>>>>>>>location rsc_location-grpStonith1-2 grpStonith1 \
>>>>>>>>>>>>rule -INFINITY: #uname eq nodea
>>>>>>>>>>>>location rsc_location-grpStonith2-3 grpStonith2 \
>>>>>>>>>>>>rule -INFINITY: #uname eq nodeb
>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>>以上ã§ã™ã€‚
>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>--
>>>>>>>>>>>
>>>>>>>>>>>ELF Systems
>>>>>>>>>>>Masamichi Fukuda
>>>>>>>>>>>mail to: masamichi_fukuda@elf-systems.com
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>_______________________________________________
>>>>>>>>>>Linux-ha-japan mailing list
>>>>>>>>>>Linux-ha-japan@lists.sourceforge.jp
>>>>>>>>>>http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>--
>>>>>>>>>
>>>>>>>>>ELF Systems
>>>>>>>>>Masamichi Fukuda
>>>>>>>>>mail to: masamichi_fukuda@elf-systems.com
>>>>>>>>>
>>>>>>>>>
>>>>>>>>
>>>>>>>>_______________________________________________
>>>>>>>>Linux-ha-japan mailing list
>>>>>>>>Linux-ha-japan@lists.sourceforge.jp
>>>>>>>>http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>--
>>>>>>>
>>>>>>>ELF Systems
>>>>>>>Masamichi Fukuda
>>>>>>>mail to: masamichi_fukuda@elf-systems.com
>>>>>>>
>>>>>>>
>>>>>>
>>>>>>_______________________________________________
>>>>>>Linux-ha-japan mailing list
>>>>>>Linux-ha-japan@lists.sourceforge.jp
>>>>>>http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>>>>>
>>>>>
>>>>>
>>>>>--
>>>>>
>>>>>ELF Systems
>>>>>Masamichi Fukuda
>>>>>mail to: masamichi_fukuda@elf-systems.com
>>>>>
>>>>>
>>>>
>>>>_______________________________________________
>>>>Linux-ha-japan mailing list
>>>>Linux-ha-japan@lists.sourceforge.jp
>>>>http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>>>
>>>
>>>--
>>>
>>>ELF Systems
>>>Masamichi Fukuda
>>>mail to: masamichi_fukuda@elf-systems.com
>>>
>>>
>>>
>>
>>_______________________________________________
>>Linux-ha-japan mailing list
>>Linux-ha-japan@lists.sourceforge.jp
>>http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>
>
>
>--
>
>ELF Systems
>Masamichi Fukuda
>mail to: masamichi_fukuda@elf-systems.com
>
>
_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan@lists.sourceforge.jp
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
ã“ã‚“ã°ã‚“ã¯ã€å±±å†…ã§ã™ã€‚
変ã‚らãªã„よã†ã§ã™ã。。。
ã¨ã‚Šã‚ãˆãšã€æ˜Žæ—¥ãらã„ã«ã€RHEL上ã§ã™ãŒã€
Heartbeat3.0.6
Pacemakerã®æœ€æ–°
組ã¿åˆã‚ã›ã§ã€åŒã˜ã‚ˆã†ãªè¨å®š(リソースã¯Dummyã€external/xen0ã¯external/sshã«ãªã‚Šã¾ã™ãŒï¼‰stonith-helperãŒå‹•ãã‹ã©ã†ã‹ã‚’確èªã—ã¦ã¿ã¾ã™ã€‚
#stonith-helperã®-x指定ã®å‡ºåŠ›ãŒç¢ºèªå‡ºæ¥ã‚‹ã¨ã€ã‚‚ã†å°‘ã—å•é¡ŒãŒçµžã‚Šã‚„ã™ã„ã®ã§ã™ãŒãƒ»ãƒ»ãƒ»
以上ã§ã™ã€‚
----- Original Message -----
>From: Masamichi Fukuda - elf-systems <masamichi_fukuda@elf-systems.com>
>To: 山内英生 <renayama19661014@ybb.ne.jp>; "linux-ha-japan@lists.sourceforge.jp" <linux-ha-japan@lists.sourceforge.jp>
>Date: 2015/3/17, Tue 21:24
>Subject: Re: [Linux-ha-jp] スプリットブレイン時ã®STONITHエラーã«ã¤ã„ã¦
>
>
>山内ã•ã‚“
>
>ã“ã‚“ã°ã‚“ã¯ã€ç¦ç”°ã§ã™ã€‚
>最新版ã®æƒ…å ±ã‚’ã‚ã‚ŠãŒã¨ã†ã”ã–ã„ã¾ã—ãŸã€‚
>
>早速インストールã—ã¦ã¿ã¾ã—ãŸã€‚
>
>起動後ã®çŠ¶æ…‹ã§ã™ã€‚
>
>failed actionsã¯å¤‰ã‚ã‚Šãªã„よã†ã§ã™ã€‚
>
>
>
># crm_mon -rfA
>Last updated: Tue Mar 17 21:03:49 2015
>Last change: Tue Mar 17 20:30:58 2015
>Stack: heartbeat
>Current DC: lbv1.beta.com (38b0f200-83ea-8633-6f37-047d36cd39c6) - parti
>tion with quorum
>Version: 1.1.12-e32080b
>2 Nodes configured
>8 Resources configured
>
>
>Online: [ lbv1.beta.com lbv2.beta.com ]
>
>Full list of resources:
>
>Â Resource Group: HAvarnish
>Â Â Â Â vip_208Â Â Â (ocf::heartbeat:IPaddr2):Â Â Â Â Â Â Started lbv1.beta.com
>    varnishd  (lsb:varnish): Started lbv1.beta.com
>Â Resource Group: grpStonith1
>Â Â Â Â Stonith1-1 (stonith:external/stonith-helper):Â Â Â Â Â Stopped
>Â Â Â Â Stonith1-2 (stonith:external/xen0):Â Â Â Â Â Â Â Stopped
>Â Resource Group: grpStonith2
>Â Â Â Â Stonith2-1 (stonith:external/stonith-helper):Â Â Â Â Â Stopped
>Â Â Â Â Stonith2-2 (stonith:external/xen0):Â Â Â Â Â Â Â Stopped
>Â Clone Set: clone_ping [ping]
>Â Â Â Â Started: [ lbv1.beta.com lbv2.beta.com ]
>
>Node Attributes:
>* Node lbv1.beta.com:
>   + default_ping_set                 : 100
>* Node lbv2.beta.com:
>   + default_ping_set                 : 100
>
>Migration summary:
>* Node lbv1.beta.com:
>Â Â Stonith2-1: migration-threshold=1 fail-count=1000000 last-failure='Tue Mar 17
>Â 21:03:39 2015'
>* Node lbv2.beta.com:
>Â Â Stonith1-1: migration-threshold=1 fail-count=1000000 last-failure='Tue Mar 17
>Â 21:03:32 2015'
>
>Failed actions:
>Â Â Â Stonith2-1_start_0 on lbv1.beta.com 'unknown error' (1): call=31, st
>atus=Error, exit-reason='none', last-rc-change='Tue Mar 17 21:03:37 2015', queue
>d=0ms, exec=1085ms
>Â Â Â Stonith1-1_start_0 on lbv2.beta.com 'unknown error' (1): call=18, st
>atus=Error, exit-reason='none', last-rc-change='Tue Mar 17 21:03:30 2015', queue
>d=0ms, exec=1061ms
>
>
>
>
>ãƒã‚°ã§ã™ã€‚
>
>
># less /var/log/ha-debug
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: info: Pacemaker support: yes
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: WARN: File /etc/ha.d//haresources exists.
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: WARN: This file is not used because pacemaker is enabled
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: debug: Checking access of: /usr/local/heartbeat/libexec/heartbeat/ccm
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: debug: Checking access of: /usr/local/heartbeat/libexec/pacemaker/cib
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: debug: Checking access of: /usr/local/heartbeat/libexec/pacemaker/stonithd
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: debug: Checking access of: /usr/local/heartbeat/libexec/pacemaker/lrmd
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: debug: Checking access of: /usr/local/heartbeat/libexec/pacemaker/attrd
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: debug: Checking access of: /usr/local/heartbeat/libexec/pacemaker/crmd
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: WARN: Core dumps could be lost if multiple dumps occur.
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: WARN: Consider setting non-default value in /proc/sys/kernel/core_pattern (or equivalent) for maximum supportability
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: WARN: Consider setting /proc/sys/kernel/core_uses_pid (or equivalent) to 1 for maximum supportability
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: WARN: Logging daemon is disabled --enabling logging daemon is recommended
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: info: **************************
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4235]: info: Configuration validated. Starting heartbeat 3.0.6
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4236]: info: heartbeat: version 3.0.6
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4236]: info: Heartbeat generation: 1423534116
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4236]: info: seed is -1702799346
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4236]: info: glib: ucast: write socket priority set to IPTOS_LOWDELAY on eth1
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4236]: info: glib: ucast: bound send socket to device: eth1
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4236]: info: glib: ucast: set SO_REUSEADDR
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4236]: info: glib: ucast: bound receive socket to device: eth1
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4236]: info: glib: ucast: started on port 694 interface eth1 to 10.0.17.133
>Mar 17 21:02:39 lbv1.beta.com heartbeat: [4236]: info: Local status now set to: 'up'
>Mar 17 21:02:46 lbv1.beta.com heartbeat: [4236]: info: Link lbv2.beta.com:eth1 up.
>Mar 17 21:02:46 lbv1.beta.com heartbeat: [4236]: info: Status update for node lbv2.beta.com: status up
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: Comm_now_up(): updating status to active
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: Local status now set to: 'active'
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: Starting child client "/usr/local/heartbeat/libexec/heartbeat/ccm" (109,113)
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: Starting child client "/usr/local/heartbeat/libexec/pacemaker/cib" (109,113)
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: Starting child client "/usr/local/heartbeat/libexec/pacemaker/stonithd" (0,0)
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: Starting child client "/usr/local/heartbeat/libexec/pacemaker/lrmd" (0,0)
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: Starting child client "/usr/local/heartbeat/libexec/pacemaker/attrd" (109,113)
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: Starting child client "/usr/local/heartbeat/libexec/pacemaker/crmd" (109,113)
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: debug: get_delnodelist: delnodelist=
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4250]: info: Starting "/usr/local/heartbeat/libexec/pacemaker/crmd" as uid 109Â gid 113 (pid 4250)
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4246]: info: Starting "/usr/local/heartbeat/libexec/pacemaker/cib" as uid 109Â gid 113 (pid 4246)
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4249]: info: Starting "/usr/local/heartbeat/libexec/pacemaker/attrd" as uid 109Â gid 113 (pid 4249)
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4245]: info: Starting "/usr/local/heartbeat/libexec/heartbeat/ccm" as uid 109Â gid 113 (pid 4245)
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4248]: info: Starting "/usr/local/heartbeat/libexec/pacemaker/lrmd" as uid 0Â gid 0 (pid 4248)
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4247]: info: Starting "/usr/local/heartbeat/libexec/pacemaker/stonithd" as uid 0Â gid 0 (pid 4247)
>Mar 17 21:02:47 lbv1.beta.com ccm: [4245]: info: Hostname: lbv1.beta.com
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: the send queue length from heartbeat to client ccm is set to 1024
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: the send queue length from heartbeat to client attrd is set to 1024
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: the send queue length from heartbeat to client stonith-ng is set to 1024
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: Status update for node lbv2.beta.com: status active
>Mar 17 21:02:47 lbv1.beta.com heartbeat: [4236]: info: the send queue length from heartbeat to client cib is set to 1024
>Mar 17 21:02:51 lbv1.beta.com heartbeat: [4236]: WARN: 1 lost packet(s) for [lbv2.beta.com] [15:17]
>Mar 17 21:02:51 lbv1.beta.com heartbeat: [4236]: info: No pkts missing from lbv2.beta.com!
>Mar 17 21:02:52 lbv1.beta.com heartbeat: [4236]: WARN: 1 lost packet(s) for [lbv2.beta.com] [19:21]
>Mar 17 21:02:52 lbv1.beta.com heartbeat: [4236]: info: No pkts missing from lbv2.beta.com!
>Mar 17 21:02:52 lbv1.beta.com heartbeat: [4236]: info: the send queue length from heartbeat to client crmd is set to 1024
>Mar 17 21:02:53 lbv1.beta.com heartbeat: [4236]: WARN: 1 lost packet(s) for [lbv2.beta.com] [24:26]
>Mar 17 21:02:53 lbv1.beta.com heartbeat: [4236]: info: No pkts missing from lbv2.beta.com!
>Mar 17 21:02:54 lbv1.beta.com heartbeat: [4236]: WARN: 1 lost packet(s) for [lbv2.beta.com] [26:28]
>Mar 17 21:02:54 lbv1.beta.com heartbeat: [4236]: info: No pkts missing from lbv2.beta.com!
>Mar 17 21:02:54 lbv1.beta.com heartbeat: [4236]: WARN: 1 lost packet(s) for [lbv2.beta.com] [30:32]
>Mar 17 21:02:54 lbv1.beta.com heartbeat: [4236]: info: No pkts missing from lbv2.beta.com!
>
>
>
># less /var/log/error
>
>Mar 17 21:02:47 lbv1 attrd[4249]:Â Â Â error: ha_msg_dispatch: Ignored incoming message. Please set_msg_callback on hbclstat
>Mar 17 21:02:48 lbv1 attrd[4249]:Â Â Â error: ha_msg_dispatch: Ignored incoming message. Please set_msg_callback on hbclstat
>Mar 17 21:02:53 lbv1 stonith-ng[4247]:Â Â Â error: ha_msg_dispatch: Ignored incoming message. Please set_msg_callback on hbclstat
>Mar 17 21:02:53 lbv1 stonith-ng[4247]:Â Â Â error: ha_msg_dispatch: Ignored incoming message. Please set_msg_callback on hbclstat
>Mar 17 21:03:39 lbv1 crmd[4250]:Â Â Â error: process_lrm_event: Operation Stonith2-1_start_0 (node=lbv1.beta.com, call=31, status=4, cib-update=42, confirmed=true) Error
>
># cat syslog|egrep 'Mar 17 21:03|Mar 17 21:02' |egrep 'heartbeat|stonith|pacemaker|error'
>Mar 17 21:03:24 lbv1 pengine[4253]:Â Â notice: process_pe_message: Calculated Transition 0: /var/lib/pacemaker/pengine/pe-input-115.bz2
>Mar 17 21:03:27 lbv1 crmd[4250]:Â Â notice: run_graph: Transition 0 (Complete=15, Pending=0, Fired=0, Skipped=16, Incomplete=2, Source=/var/lib/pacemaker/pengine/pe-input-115.bz2): Stopped
>Mar 17 21:03:29 lbv1 pengine[4253]:Â Â notice: process_pe_message: Calculated Transition 1: /var/lib/pacemaker/pengine/pe-input-116.bz2
>Mar 17 21:03:34 lbv1 crmd[4250]:Â Â notice: run_graph: Transition 1 (Complete=8, Pending=0, Fired=0, Skipped=12, Incomplete=1, Source=/var/lib/pacemaker/pengine/pe-input-116.bz2): Stopped
>Mar 17 21:03:37 lbv1 pengine[4253]:Â warning: unpack_rsc_op_failure: Processing failed op start for Stonith1-1 on lbv2.beta.com: unknown error (1)
>Mar 17 21:03:37 lbv1 pengine[4253]:Â warning: unpack_rsc_op_failure: Processing failed op start for Stonith1-1 on lbv2.beta.com: unknown error (1)
>Mar 17 21:03:37 lbv1 pengine[4253]:Â Â notice: process_pe_message: Calculated Transition 2: /var/lib/pacemaker/pengine/pe-input-117.bz2
>Mar 17 21:03:39 lbv1 stonith-ng[4247]:Â Â notice: log_operation: Operation 'monitor' [4377] for device 'Stonith2-1' returned: -201 (Generic Pacemaker error)
>Mar 17 21:03:39 lbv1 stonith-ng[4247]:Â warning: log_operation: Stonith2-1:4377 [ Performing: stonith -t external/stonith-helper -S ]
>Mar 17 21:03:39 lbv1 stonith-ng[4247]:Â warning: log_operation: Stonith2-1:4377 [ failed to exec "stonith" ]
>Mar 17 21:03:39 lbv1 stonith-ng[4247]:Â warning: log_operation: Stonith2-1:4377 [ failed:Â 2 ]
>Mar 17 21:03:39 lbv1 crmd[4250]:Â Â Â error: process_lrm_event: Operation Stonith2-1_start_0 (node=lbv1.beta.com, call=31, status=4, cib-update=42, confirmed=true) Error
>Mar 17 21:03:40 lbv1 crmd[4250]:Â Â notice: run_graph: Transition 2 (Complete=12, Pending=0, Fired=0, Skipped=3, Incomplete=0, Source=/var/lib/pacemaker/pengine/pe-input-117.bz2): Stopped
>Mar 17 21:03:42 lbv1 pengine[4253]:Â warning: unpack_rsc_op_failure: Processing failed op start for Stonith2-1 on lbv1.beta.com: unknown error (1)
>Mar 17 21:03:42 lbv1 pengine[4253]:Â warning: unpack_rsc_op_failure: Processing failed op start for Stonith2-1 on lbv1.beta.com: unknown error (1)
>Mar 17 21:03:42 lbv1 pengine[4253]:Â warning: unpack_rsc_op_failure: Processing failed op start for Stonith1-1 on lbv2.beta.com: unknown error (1)
>Mar 17 21:03:42 lbv1 pengine[4253]:Â Â notice: process_pe_message: Calculated Transition 3: /var/lib/pacemaker/pengine/pe-input-118.bz2
>Mar 17 21:03:42 lbv1 IPaddr2(vip_208)[4448]: INFO: /usr/libexec/heartbeat/send_arp -i 200 -r 5 -p /var/run/resource-agents/send_arp-192.168.17.208 eth0 192.168.17.208 auto not_used not_used
>Mar 17 21:03:47 lbv1 crmd[4250]:Â Â notice: run_graph: Transition 3 (Complete=10, Pending=0, Fired=0, Skipped=0, Incomplete=0, Source=/var/lib/pacemaker/pengine/pe-input-118.bz2): Complete
>
>宜ã—ããŠé¡˜ã„ã—ã¾ã™ã€‚
>
>以上
>
>
>
>2015年3月17日 18:31 <renayama19661014@ybb.ne.jp>:
>
>ç¦ç”°ã•ã‚“
>>
>>ã“ã‚“ã°ã‚“ã¯ã€å±±å†…ã§ã™ã€‚
>>
>>tag付ã‘ã•ã‚Œã¦ã„ãªã„ã®ã§ã€æœ¬æ—¥ã®æœ€æ–°ç‰ˆã¯ã€
>>
>>Â * https://github.com/ClusterLabs/pacemaker/tree/e32080b460f81486b85d08ec958582b3e72d858c
>>
>>
>>ã«ãªã‚Šã¾ã™ã€‚
>>å³å´ã®[Download ZIP]ã‹ã‚‰ãƒ€ã‚¦ãƒ³ãƒãƒ¼ãƒ‰å‡ºæ¥ã¾ã™ã€‚
>>
>>以上ã§ã™ã€‚
>>
>>
>>----- Original Message -----
>>>From: Masamichi Fukuda - elf-systems <masamichi_fukuda@elf-systems.com>
>>
>>>To: "renayama19661014@ybb.ne.jp" <renayama19661014@ybb.ne.jp>; "linux-ha-japan@lists.sourceforge.jp" <linux-ha-japan@lists.sourceforge.jp>
>>>Date: 2015/3/17, Tue 18:07
>>>Subject: スプリットブレイン時ã®STONITHエラーã«ã¤ã„ã¦
>>>
>>>
>>>山内ã•ã‚“
>>>
>>>
>>>ãŠç–²ã‚Œæ§˜ã§ã™ã€ç¦ç”°ã§ã™ã€‚
>>>
>>>
>>>ã“ã¡ã‚‰ã‚’見ãŸã®ã§ã™ãŒã€
>>>https://github.com/ClusterLabs/pacemaker/tags
>>>
>>>
>>>
>>>pacemaker 1.1.12 561c4cf ãŒæœ€æ–°ã®ã‚ˆã†ãªã®ã§ã™ãŒã€‚
>>>済ã¿ã¾ã›ã‚“ãŒã€ã“れ以é™ã®æœ€æ–°ç‰ˆã¯ã©ã¡ã‚‰ã«ã‚ã‚‹ã‹æ•™ãˆã¦é ‚ã‘ã¾ã™ã‹ã€‚
>>>
>>>
>>>宜ã—ããŠé¡˜ã„ã—ã¾ã™ã€‚
>>>
>>>
>>>以上
>>>
>>>
>>>
>>>2015å¹´3月17æ—¥ç«æ›œæ—¥ã€<renayama19661014@ybb.ne.jp>ã•ã‚“ã¯æ›¸ãã¾ã—ãŸ:
>>>
>>>ç¦ç”°ã•ã‚“
>>>>
>>>>ãŠç–²ã‚Œæ§˜ã§ã™ã€‚山内ã§ã™ã€‚
>>>>
>>>>ã¯ã„。å¤ã„ã§ã™ã€‚
>>>>
>>>>PacemakerãŒHeartbeat3.0.6ã«å¯¾å¿œã—ãŸã®ã¯æ„外ã¨æœ€è¿‘ã§ã™ã€‚
>>>>ã‚‚ã£ã¨æ–°ã—ã„ã‚‚ã®ã‚’入れã¦ãã ã•ã„。(ã¾ãŸã€ã‚½ãƒ¼ã‚¹ã‹ã‚‰æ§‹ç¯‰ã™ã‚‹å¿…è¦ãŒã‚ã‚Šã¾ã™ãŒãƒ»ãƒ»ãƒ»ãƒ»)
>>>>
>>>>
>>>>
>>>>本家ã®githubã‹ã‚‰å…¥æ‰‹å¯èƒ½ã§ã™ã€‚
>>>>Â * https://github.com/ClusterLabs/pacemaker
>>>>
>>>>
>>>>å ´åˆã«ã‚ˆã£ã¦ã¯ã€æœ€æ–°ã®masterã¯ã‚¨ãƒ©ãƒ¼ãªã©ãŒå‡ºã‚‹å ´åˆãŒã‚ã‚Šã¾ã™ã®ã§ã€ãã®å ´åˆã¯ã€ãƒãƒ¼ã‚¸ãƒ§ãƒ³ã‚’å¤ã„æ–¹ã«ãŸãã£ã¦
>>>>ã„ãã®ãŒè‰¯ã„ã¨æ€ã„ã¾ã™ã€‚
>>>>
>>>>以上ã§ã™ã€‚
>>>>
>>>>
>>>>
>>>>----- Original Message -----
>>>>>From: Masamichi Fukuda - elf-systems <masamichi_fukuda@elf-systems.com>
>>>>>To: 山内英生 <renayama19661014@ybb.ne.jp>; "linux-ha-japan@lists.sourceforge.jp" <linux-ha-japan@lists.sourceforge.jp>
>>>>>Date: 2015/3/17, Tue 16:06
>>>>>Subject: Re: [Linux-ha-jp] スプリットブレイン時ã®STONITHエラーã«ã¤ã„ã¦
>>>>>
>>>>>
>>>>>山内ã•ã‚“
>>>>>
>>>>>ãŠç–²ã‚Œæ§˜ã§ã™ã€ç¦ç”°ã§ã™ã€‚
>>>>>
>>>>>以å‰ã®ãƒ¡ãƒ¼ãƒ«ã§heartbeatã¨pacemakerを最新版を入れãŸã»ã†ãŒè‰¯ã„ã¨å›žç”é ‚ãã¾ã—ãŸã€‚
>>>>>ãã“ã§ä»Šå›žã€heartbeat3.0.6ã¨pacemaker1.1.12を入れãŸã®ã§ã™ãŒã€‚
>>>>>
>>>>>heartbeat configuration: Version = "3.0.6"
>>>>>pacemaker configuration: Version = 1.1.12 (Build: 561c4cf)pacemakerãŒã¾ã å¤ã„ã¨ã„ã†ã“ã¨ã§ã—ょã†ã‹ã€‚
>>>>>
>>>>>済ã¿ã¾ã›ã‚“ãŒã€å®œã—ããŠé¡˜ã„ã—ã¾ã™ã€‚
>>>>>
>>>>>以上
>>>>>
>>>>>
>>>>>
>>>>>2015年3月17日 14:59 <renayama19661014@ybb.ne.jp>:
>>>>>
>>>>>ç¦ç”°ã•ã‚“
>>>>>>
>>>>>>ãŠç–²ã‚Œæ§˜ã§ã™ã€‚山内ã§ã™ã€‚
>>>>>>
>>>>>>ãµã¨æ€ã£ãŸã®ã™ãŒã€ä»¥å‰ã®ã‚„ã‚Šå–ã‚Šã®ãƒ¡ãƒ¼ãƒ«ã§ä»¥ä¸‹ã¨å›žç”ã—ã¦ã¾ã™ãŒã€å•é¡Œãªã„ã§ã—ょã†ã‹ï¼Ÿ
>>>>>>
>>>>>>
>>>>>>>>>>>> 2)Heartbeat3.0.6+Pacemaker最新 : OK
>>>>>>>>>>>> Â Â
>>>>>>>>>>>> ã©ã†ã‚„らã€Heartbeatも最新版3.0.6を組åˆã›ã‚‹å¿…è¦ãŒã‚るよã†ã§ã™ã€‚
>>>>>>>>>>>> Â *Â http://hg.linux-ha.org/heartbeat-STABLE_3_0/rev/cceeb47a7d8f
>>>>>>
>>>>>>以下ã®crm_monã®ãƒãƒ¼ã‚¸ãƒ§ãƒ³ã‚’見るã¨ã€1.1.12ã®ã‚ˆã†ã§ã™ã€‚
>>>>>>Heartbeat3.0.6ã¨çµ„ã¿åˆã‚ã›ã‚‹ã«ã¯ã€ã‹ãªã‚Šæ–°ã—ã‚ã®PacemakerãŒå¿…è¦ã§ã™ã€‚
>>>>>>
>>>>>>># crm_mon -rfA
>>>>>>>
>>>>>>>Last updated: Tue Mar 17 14:14:39 2015
>>>>>>>Last change: Tue Mar 17 14:01:43 2015
>>>>>>>Stack: heartbeat
>>>>>>>Current DC: lbv2.beta.com (82ffc36f-1ad8-8686-7db0-35686465c624) - parti
>>>>>>>tion with quorum
>>>>>>>Version: 1.1.12-561c4cf
>>>>>>
>>>>>>ãŸã¶ã‚“ã€ä»¥ä¸‹ã®å¤‰æ›´ä»¥é™ã¯å°‘ãªãã¨ã‚‚å¿…è¦ã‹ã¨æ€ã„ã¾ã™ã€‚
>>>>>>
>>>>>>https://github.com/ClusterLabs/pacemaker/commit/f2302da063d08719d28367d8e362b8bfb0f85bf3
>>>>>>
>>>>>>
>>>>>>
>>>>>>以上ã§ã™ã€‚
>>>>>>
>>>>>>
>>>>>>
>>>>>>----- Original Message -----
>>>>>>>From: Masamichi Fukuda - elf-systems <masamichi_fukuda@elf-systems.com>
>>>>>>>To: 山内英生 <renayama19661014@ybb.ne.jp>; "linux-ha-japan@lists.sourceforge.jp" <linux-ha-japan@lists.sourceforge.jp>
>>>>>>
>>>>>>>Date: 2015/3/17, Tue 14:38
>>>>>>>Subject: Re: [Linux-ha-jp] スプリットブレイン時ã®STONITHエラーã«ã¤ã„ã¦
>>>>>>>
>>>>>>>
>>>>>>>山内ã•ã‚“
>>>>>>>
>>>>>>>ãŠç–²ã‚Œæ§˜ã§ã™ã€ç¦ç”°ã§ã™ã€‚
>>>>>>>
>>>>>>>stonith-helperã®ã‚·ã‚§ãƒãƒ³ã‚°è¡Œã«-xã‚’è¿½åŠ ã™ã‚Œã°è‰¯ã„ã®ã§ã—ょã†ã‹ï¼Ÿ
>>>>>>>stonith-helperã®å…ˆé 行を#!/bin/bash -xã«ã—ã¦ã‚¯ãƒ©ã‚¹ã‚¿ã‚’èµ·å‹•ã—ã¦ã¿ã¾ã—ãŸã€‚
>>>>>>>
>>>>>>>crm_monã§ã¯å…ˆã»ã©ã¨å¤‰ã‚ã‚Šã¯ãªã„よã†ã§ã™ã€‚
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>># crm_mon -rfA
>>>>>>>
>>>>>>>Last updated: Tue Mar 17 14:14:39 2015
>>>>>>>Last change: Tue Mar 17 14:01:43 2015
>>>>>>>Stack: heartbeat
>>>>>>>Current DC: lbv2.beta.com (82ffc36f-1ad8-8686-7db0-35686465c624) - parti
>>>>>>>tion with quorum
>>>>>>>Version: 1.1.12-561c4cf
>>>>>>>2 Nodes configured
>>>>>>>8 Resources configured
>>>>>>>
>>>>>>>Online: [ lbv1.beta.com lbv2.beta.com ]
>>>>>>>
>>>>>>>Full list of resources:
>>>>>>>
>>>>>>>Â Resource Group: HAvarnish
>>>>>>>Â Â Â Â vip_208Â Â Â (ocf::heartbeat:IPaddr2):Â Â Â Â Â Â Started lbv1.beta.com
>>>>>>>    varnishd  (lsb:varnish): Started lbv1.beta.com
>>>>>>>Â Resource Group: grpStonith1
>>>>>>>Â Â Â Â Stonith1-1 (stonith:external/stonith-helper):Â Â Â Â Â Stopped
>>>>>>>Â Â Â Â Stonith1-2 (stonith:external/xen0):Â Â Â Â Â Â Â Stopped
>>>>>>>Â Resource Group: grpStonith2
>>>>>>>Â Â Â Â Stonith2-1 (stonith:external/stonith-helper):Â Â Â Â Â Stopped
>>>>>>>Â Â Â Â Stonith2-2 (stonith:external/xen0):Â Â Â Â Â Â Â Stopped
>>>>>>>Â Clone Set: clone_ping [ping]
>>>>>>>Â Â Â Â Started: [ lbv1.beta.com lbv2.beta.com ]
>>>>>>>
>>>>>>>Node Attributes:
>>>>>>>* Node lbv1.beta.com:
>>>>>>>   + default_ping_set                 : 100
>>>>>>>* Node lbv2.beta.com:
>>>>>>>   + default_ping_set                 : 100
>>>>>>>
>>>>>>>Migration summary:
>>>>>>>* Node lbv2.beta.com:
>>>>>>>Â Â Stonith1-1: migration-threshold=1 fail-count=1000000 last-failure='Tue Mar 17
>>>>>>>Â 14:12:16 2015'
>>>>>>>* Node lbv1.beta.com:
>>>>>>>Â Â Stonith2-1: migration-threshold=1 fail-count=1000000 last-failure='Tue Mar 17
>>>>>>>Â 14:12:21 2015'
>>>>>>>
>>>>>>>Failed actions:
>>>>>>>Â Â Â Stonith1-1_start_0 on lbv2.beta.com 'unknown error' (1): call=31, st
>>>>>>>atus=Error, last-rc-change='Tue Mar 17 14:12:14 2015', queued=0ms, exec=1065ms
>>>>>>>Â Â Â Stonith2-1_start_0 on lbv1.beta.com 'unknown error' (1): call=26, st
>>>>>>>atus=Error, last-rc-change='Tue Mar 17 14:12:19 2015', queued=0ms, exec=1081ms
>>>>>>>
>>>>>>>ãã®ä»–ã®ãƒã‚°ã‚’探ã—ã¦ã¿ã¾ã—ãŸã€‚
>>>>>>>
>>>>>>>heartbeat起動時ã§ã™ã€‚
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>># less /var/log/pm_logconv.out
>>>>>>>Mar 17 14:11:28 lbv1.beta.com info: Starting Heartbeat 3.0.6.
>>>>>>>Mar 17 14:11:33 lbv1.beta.com info: Link lbv2.beta.com:eth1 is up.
>>>>>>>Mar 17 14:11:34 lbv1.beta.com info: Start "ccm" process. (pid=13264)
>>>>>>>Mar 17 14:11:34 lbv1.beta.com info: Start "lrmd" process. (pid=13267)
>>>>>>>Mar 17 14:11:34 lbv1.beta.com info: Start "attrd" process. (pid=13268)
>>>>>>>Mar 17 14:11:34 lbv1.beta.com info: Start "stonithd" process. (pid=13266)
>>>>>>>Mar 17 14:11:34 lbv1.beta.com info: Start "cib" process. (pid=13265)
>>>>>>>Mar 17 14:11:34 lbv1.beta.com info: Start "crmd" process. (pid=13269)
>>>>>>>
>>>>>>>
>>>>>>># less /var/log/error
>>>>>>>Mar 17 14:12:20 lbv1 crmd[13269]:Â Â Â error: process_lrm_event: Operation Stonith2-1_start_0 (node=lbv1.beta.com, call=26, status=4, cib-update=19, confirmed=true) Error
>>>>>>>
>>>>>>>
>>>>>>>syslogã‹ã‚‰stonithã‚’grepã—ãŸã‚‚ã®ã§ã™
>>>>>>>
>>>>>>>Mar 17 14:11:34 lbv1 heartbeat: [13255]: info: Starting child client "/usr/local/heartbeat/libexec/pacemaker/stonithd" (0,0)
>>>>>>>Mar 17 14:11:34 lbv1 heartbeat: [13266]: info: Starting "/usr/local/heartbeat/libexec/pacemaker/stonithd" as uid 0Â gid 0 (pid 13266)
>>>>>>>Mar 17 14:11:34 lbv1 stonithd[13266]:Â Â notice: crm_cluster_connect: Connecting to cluster infrastructure: heartbeat
>>>>>>>Mar 17 14:11:34 lbv1 heartbeat: [13255]: info: the send queue length from heartbeat to client stonithd is set to 1024
>>>>>>>Mar 17 14:11:40 lbv1 stonithd[13266]:Â Â notice: setup_cib: Watching for stonith topology changes
>>>>>>>Mar 17 14:11:40 lbv1 stonithd[13266]:Â Â notice: unpack_config: On loss of CCM Quorum: Ignore
>>>>>>>Mar 17 14:11:40 lbv1 stonithd[13266]:Â warning: handle_startup_fencing: Blind faith: not fencing unseen nodes
>>>>>>>Mar 17 14:11:40 lbv1 stonithd[13266]:Â warning: handle_startup_fencing: Blind faith: not fencing unseen nodes
>>>>>>>Mar 17 14:11:41 lbv1 stonithd[13266]:Â Â notice: stonith_device_register: Added 'Stonith2-1' to the device list (1 active devices)
>>>>>>>Mar 17 14:11:41 lbv1 stonithd[13266]:Â Â notice: stonith_device_register: Added 'Stonith2-2' to the device list (2 active devices)
>>>>>>>Mar 17 14:12:04 lbv1 stonithd[13266]:Â Â notice: xml_patch_version_check: Versions did not change in patch 0.5.0
>>>>>>>Mar 17 14:12:20 lbv1 stonithd[13266]:Â Â notice: log_operation: Operation 'monitor' [13386] for device 'Stonith2-1' returned: -201 (Generic Pacemaker error)
>>>>>>>Mar 17 14:12:20 lbv1 stonithd[13266]:Â warning: log_operation: Stonith2-1:13386 [ Performing: stonith -t external/stonith-helper -S ]
>>>>>>>Mar 17 14:12:20 lbv1 stonithd[13266]:Â warning: log_operation: Stonith2-1:13386 [ failed to exec "stonith" ]
>>>>>>>Mar 17 14:12:20 lbv1 stonithd[13266]:Â warning: log_operation: Stonith2-1:13386 [ failed:Â 2 ]
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>宜ã—ããŠé¡˜ã„ã—ã¾ã™ã€‚
>>>>>>>
>>>>>>>以上
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>2015年3月17日 13:32 <renayama19661014@ybb.ne.jp>:
>>>>>>>
>>>>>>>ç¦ç”°ã•ã‚“
>>>>>>>>
>>>>>>>>ãŠç–²ã‚Œæ§˜ã§ã™ã€‚山内ã§ã™ã€‚
>>>>>>>>
>>>>>>>>ã¨ã„ã†ã“ã¨ã¯ã€stonith-helperã®startã«å•é¡ŒãŒã‚るよã†ã§ã™ã。
>>>>>>>>
>>>>>>>>stonith-helperã®å…ˆé ã«
>>>>>>>>
>>>>>>>>#!/bin/bash -x
>>>>>>>>
>>>>>>>>
>>>>>>>>を入れã¦ã€ã‚¯ãƒ©ã‚¹ã‚¿ã‚’èµ·å‹•ã™ã‚‹ã¨ä½•ã‹ã‚ã‹ã‚‹ã‹ã‚‚知れã¾ã›ã‚“。
>>>>>>>>
>>>>>>>>ã¡ãªã¿ã«ã€stonith-helperã®ãƒã‚°ã‚‚ã©ã“ã‹ã«å‡ºã¦ã„ã‚‹ã¨æ€ã†ã®ã§ã™ãŒã€‚。。
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>以上ã§ã™ã€‚
>>>>>>>>
>>>>>>>>----- Original Message -----
>>>>>>>>>From: Masamichi Fukuda - elf-systems <masamichi_fukuda@elf-systems.com>
>>>>>>>>>To: 山内英生 <renayama19661014@ybb.ne.jp>; "linux-ha-japan@lists.sourceforge.jp" <linux-ha-japan@lists.sourceforge.jp>
>>>>>>>>
>>>>>>>>>Date: 2015/3/17, Tue 12:31
>>>>>>>>>Subject: Re: [Linux-ha-jp] スプリットブレイン時ã®STONITHエラーã«ã¤ã„ã¦
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>山内ã•ã‚“
>>>>>>>>>cc:æ¾å³¶ã•ã‚“
>>>>>>>>>
>>>>>>>>>ã“ã‚“ã«ã¡ã¯ã€ç¦ç”°ã§ã™ã€‚
>>>>>>>>>
>>>>>>>>>åŒã˜ãƒ‡ã‚£ãƒ¬ã‚¯ãƒˆãƒªã«xen0ã¯ã‚ã‚Šã¾ã—ãŸã€‚
>>>>>>>>>
>>>>>>>>># pwd
>>>>>>>>>/usr/local/heartbeat/lib/stonith/plugins/external
>>>>>>>>>
>>>>>>>>># ls
>>>>>>>>>drac5         ibmrsa        kdumpcheck riloe       vmware
>>>>>>>>>dracmc-telnet ibmrsa-telnet libvirt    ssh       xen0
>>>>>>>>>hetzner       ipmi        nut    stonith-helper xen0-ha
>>>>>>>>>hmchttp       ippower9258   rackpdu    vcenter
>>>>>>>>>
>>>>>>>>>宜ã—ããŠé¡˜ã„ã—ã¾ã™ã€‚
>>>>>>>>>
>>>>>>>>>以上
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>2015-03-17 10:53 GMT+09:00 <renayama19661014@ybb.ne.jp>:
>>>>>>>>>
>>>>>>>>>ç¦ç”°ã•ã‚“
>>>>>>>>>>cc:æ¾å³¶ã•ã‚“
>>>>>>>>>>
>>>>>>>>>>ãŠç–²ã‚Œæ§˜ã§ã™ã€‚山内ã§ã™ã€‚
>>>>>>>>>>
>>>>>>>>>>>標準出力や標準エラー出力ã¯ã‚ã‚Šã¾ã›ã‚“ã§ã—ãŸã€‚
>>>>>>>>>>>
>>>>>>>>>>>stonith-helperãŒãŠã‹ã—ã„ã®ã§ã—ょã†ã‹ã€‚
>>>>>>>>>>>stonith-helperã¯ã‚·ã‚§ãƒ«ã‚¹ã‚¯ãƒªãƒ—トãªã®ã§ã‚¤ãƒ³ã‚¹ãƒˆãƒ¼ãƒ«ã¯ã‚ã¾ã‚Šæ°—ã«ã—ã¦ã„ãªã‹ã£ãŸã®ã§ã™ãŒã€‚
>>>>>>>>>>>stonith-helperã¯ã“ã“ã«é…ç½®ã•ã‚Œã¦ã„ã¾ã™ã€‚
>>>>>>>>>>>/usr/local/heartbeat/lib/stonith/plugins/external/stonith-helper
>>>>>>>>>>
>>>>>>>>>>ã“ã®ãƒ‡ã‚£ãƒ¬ã‚¯ãƒˆãƒªã«xen0ã‚‚ã‚ã‚Šã¾ã™ã‹ï¼Ÿ
>>>>>>>>>>ç„¡ã„よã†ã§ã—ãŸã‚‰ã€å•é¡ŒãŒã‚ã‚Šã¾ã™ã®ã§ã€ä¸€åº¦ã€stonith-helperã®ãƒ•ã‚¡ã‚¤ãƒ«ã‚’属性ãªã©ã¯ãã®ã¾ã¾ã€xen0ã¨åŒã˜ãƒ‡ã‚£ãƒ¬ã‚¯ãƒˆãƒªã«
>>>>>>>>>>コピーã—ã¦ã¿ã¦ãã ã•ã„。
>>>>>>>>>>
>>>>>>>>>>ãã‚Œã§ç¨¼åƒã™ã‚‹ãªã‚‰ã€pm_extrasã®ã‚¤ãƒ³ã‚¹ãƒˆãƒ¼ãƒ«ã«å•é¡ŒãŒã‚ã‚‹ã¨ã„ã†ã“ã¨ã«ãªã‚Šã¾ã™ã€‚
>>>>>>>>>>
>>>>>>>>>>以上ã§ã™ã€‚
>>>>>>>>>>
>>>>>>>>>>----- Original Message -----
>>>>>>>>>>>From: Masamichi Fukuda - elf-systems <masamichi_fukuda@elf-systems.com>
>>>>>>>>>>>To: 山内英生 <renayama19661014@ybb.ne.jp>; "linux-ha-japan@lists.sourceforge.jp" <linux-ha-japan@lists.sourceforge.jp>
>>>>>>>>>>
>>>>>>>>>>>Date: 2015/3/17, Tue 10:31
>>>>>>>>>>>Subject: Re: [Linux-ha-jp] スプリットブレイン時ã®STONITHエラーã«ã¤ã„ã¦
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>山内ã•ã‚“
>>>>>>>>>>>cc:æ¾å³¶ã•ã‚“
>>>>>>>>>>>
>>>>>>>>>>>ãŠã¯ã‚ˆã†ã”ã–ã„ã¾ã™ã€ç¦ç”°ã§ã™ã€‚
>>>>>>>>>>>crmã®ä¾‹ã‚’ã‚ã‚ŠãŒã¨ã†ã”ã–ã„ã¾ã™ã€‚
>>>>>>>>>>>
>>>>>>>>>>>早速ã€ã“ã¡ã‚‰ã®ç’°å¢ƒã«åˆã‚ã›ã¦ã¿ã¾ã—ãŸã€‚
>>>>>>>>>>>
>>>>>>>>>>>$ cat test.crm
>>>>>>>>>>>### Cluster Option ###
>>>>>>>>>>>property \
>>>>>>>>>>>Â Â Â no-quorum-policy="ignore" \
>>>>>>>>>>>Â Â Â stonith-enabled="true" \
>>>>>>>>>>>Â Â Â startup-fencing="false" \
>>>>>>>>>>>Â Â Â stonith-timeout="710s" \
>>>>>>>>>>>Â Â Â crmd-transition-delay="2s"
>>>>>>>>>>>
>>>>>>>>>>>### Resource Default ###
>>>>>>>>>>>rsc_defaults \
>>>>>>>>>>>Â Â Â resource-stickiness="INFINITY" \
>>>>>>>>>>>Â Â Â migration-threshold="1"
>>>>>>>>>>>
>>>>>>>>>>>### Group Configuration ###
>>>>>>>>>>>group HAvarnish \
>>>>>>>>>>>Â Â Â vip_208 \
>>>>>>>>>>>Â Â Â varnishd
>>>>>>>>>>>
>>>>>>>>>>>group grpStonith1 \
>>>>>>>>>>>Â Â Â Stonith1-1 \
>>>>>>>>>>>Â Â Â Stonith1-2
>>>>>>>>>>>
>>>>>>>>>>>group grpStonith2 \
>>>>>>>>>>>Â Â Â Stonith2-1 \
>>>>>>>>>>>Â Â Â Stonith2-2
>>>>>>>>>>>
>>>>>>>>>>>### Clone Configuration ###
>>>>>>>>>>>clone clone_ping \
>>>>>>>>>>>Â Â Â ping
>>>>>>>>>>>
>>>>>>>>>>>### Fencing Topology ###
>>>>>>>>>>>fencing_topology \
>>>>>>>>>>>Â Â Â lbv1.beta.com: Stonith1-1 Stonith1-2 \
>>>>>>>>>>>Â Â Â lbv2.beta.com: Stonith2-1 Stonith2-2
>>>>>>>>>>>
>>>>>>>>>>>### Primitive Configuration ###
>>>>>>>>>>>primitive vip_208 ocf:heartbeat:IPaddr2 \
>>>>>>>>>>>Â Â Â params \
>>>>>>>>>>>Â Â Â Â Â Â Â ip="192.168.17.208" \
>>>>>>>>>>>Â Â Â Â Â Â Â nic="eth0" \
>>>>>>>>>>>Â Â Â Â Â Â Â cidr_netmask="24" \
>>>>>>>>>>>Â Â Â op start interval="0s" timeout="90s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op monitor interval="5s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op stop interval="0s" timeout="100s" on-fail="fence"
>>>>>>>>>>>
>>>>>>>>>>>primitive varnishd lsb:varnish \
>>>>>>>>>>>Â Â Â op start interval="0s" timeout="90s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op monitor interval="10s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op stop interval="0s" timeout="100s" on-fail="fence"
>>>>>>>>>>>
>>>>>>>>>>>primitive ping ocf:pacemaker:ping \
>>>>>>>>>>>Â Â Â params \
>>>>>>>>>>>Â Â Â Â Â Â Â name="default_ping_set" \
>>>>>>>>>>>Â Â Â Â Â Â Â host_list="192.168.17.254" \
>>>>>>>>>>>Â Â Â Â Â Â Â multiplier="100" \
>>>>>>>>>>>Â Â Â Â Â Â Â dampen="1" \
>>>>>>>>>>>Â Â Â op start interval="0s" timeout="90s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op monitor interval="10s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op stop interval="0s" timeout="100s" on-fail="fence"
>>>>>>>>>>>
>>>>>>>>>>>primitive Stonith1-1 stonith:external/stonith-helper \
>>>>>>>>>>>Â Â Â params \
>>>>>>>>>>>Â Â Â Â Â Â Â pcmk_reboot_retries="1" \
>>>>>>>>>>>Â Â Â Â Â Â Â pcmk_reboot_timeout="40s" \
>>>>>>>>>>>Â Â Â Â Â Â Â hostlist="lbv1.beta.com" \
>>>>>>>>>>>Â Â Â Â Â Â Â dead_check_target="192.168.17.132 10.0.17.132" \
>>>>>>>>>>>Â Â Â Â Â Â Â standby_check_command="/usr/local/sbin/crm_resource -r varnishd -W | grep -q `hostname`" \
>>>>>>>>>>>Â Â Â Â Â Â Â run_online_check="yes" \
>>>>>>>>>>>Â Â Â op start interval="0s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op stop interval="0s" timeout="60s" on-fail="ignore"
>>>>>>>>>>>
>>>>>>>>>>>primitive Stonith1-2 stonith:external/xen0 \
>>>>>>>>>>>Â Â Â params \
>>>>>>>>>>>Â Â Â Â Â Â Â pcmk_reboot_timeout="60s" \
>>>>>>>>>>>Â Â Â Â Â Â Â hostlist="lbv1.beta.com:/etc/xen/lbv1.cfg" \
>>>>>>>>>>>Â Â Â Â Â Â Â dom0="xen0.beta.com" \
>>>>>>>>>>>Â Â Â op start interval="0s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op monitor interval="3600s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op stop interval="0s" timeout="60s" on-fail="ignore"
>>>>>>>>>>>
>>>>>>>>>>>primitive Stonith2-1 stonith:external/stonith-helper \
>>>>>>>>>>>Â Â Â params \
>>>>>>>>>>>Â Â Â Â Â Â Â pcmk_reboot_retries="1" \
>>>>>>>>>>>Â Â Â Â Â Â Â pcmk_reboot_timeout="40s" \
>>>>>>>>>>>Â Â Â Â Â Â Â hostlist="lbv2.beta.com" \
>>>>>>>>>>>Â Â Â Â Â Â Â dead_check_target="192.168.17.133 10.0.17.133" \
>>>>>>>>>>>Â Â Â Â Â Â Â standby_check_command="/usr/local/sbin/crm_resource -r varnishd -W | grep -q `hostname`" \
>>>>>>>>>>>Â Â Â Â Â Â Â run_online_check="yes" \
>>>>>>>>>>>Â Â Â op start interval="0s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op stop interval="0s" timeout="60s" on-fail="ignore"
>>>>>>>>>>>
>>>>>>>>>>>primitive Stonith2-2 stonith:external/xen0 \
>>>>>>>>>>>Â Â Â params \
>>>>>>>>>>>Â Â Â Â Â Â Â pcmk_reboot_timeout="60s" \
>>>>>>>>>>>Â Â Â Â Â Â Â hostlist="lbv2.beta.com:/etc/xen/lbv2.cfg" \
>>>>>>>>>>>Â Â Â Â Â Â Â dom0="xen0.beta.com" \
>>>>>>>>>>>Â Â Â op start interval="0s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op monitor interval="3600s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>Â Â Â op stop interval="0s" timeout="60s" on-fail="ignore"
>>>>>>>>>>>
>>>>>>>>>>>### Resource Location ###
>>>>>>>>>>>location HA_location-1 HAvarnish \
>>>>>>>>>>>Â Â Â rule 200: #uname eq lbv1.beta.com \
>>>>>>>>>>>Â Â Â rule 100: #uname eq lbv2.beta.com
>>>>>>>>>>>
>>>>>>>>>>>location HA_location-2 HAvarnish \
>>>>>>>>>>>Â Â Â rule -INFINITY: not_defined default_ping_set or default_ping_set lt 100
>>>>>>>>>>>
>>>>>>>>>>>location HA_location-3 grpStonith1 \
>>>>>>>>>>>Â Â Â rule -INFINITY: #uname eq lbv1.beta.com
>>>>>>>>>>>
>>>>>>>>>>>location HA_location-4 grpStonith2 \
>>>>>>>>>>>Â Â Â rule -INFINITY: #uname eq lbv2.beta.com
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>ã“れをæµã—ã“ã‚“ã ã¨ã“ã‚ã€æ˜¨æ—¥ã¨ã¯ãƒ¡ãƒƒã‚»ãƒ¼ã‚¸ãŒç•°ãªã‚Šã¾ã™ã€‚
>>>>>>>>>>>pingã®ãƒ¡ãƒƒã‚»ãƒ¼ã‚¸ã¯ãªããªã£ã¦ã„ã¾ã—ãŸã€‚
>>>>>>>>>>>
>>>>>>>>>>># crm_mon -rfA
>>>>>>>>>>>Last updated: Tue Mar 17 10:21:28 2015
>>>>>>>>>>>Last change: Tue Mar 17 10:21:09 2015
>>>>>>>>>>>Stack: heartbeat
>>>>>>>>>>>Current DC: lbv2.beta.com (82ffc36f-1ad8-8686-7db0-35686465c624) - parti
>>>>>>>>>>>tion with quorum
>>>>>>>>>>>Version: 1.1.12-561c4cf
>>>>>>>>>>>2 Nodes configured
>>>>>>>>>>>8 Resources configured
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>Online: [ lbv1.beta.com lbv2.beta.com ]
>>>>>>>>>>>
>>>>>>>>>>>Full list of resources:
>>>>>>>>>>>
>>>>>>>>>>>Â Resource Group: HAvarnish
>>>>>>>>>>>Â Â Â Â vip_208Â Â Â (ocf::heartbeat:IPaddr2):Â Â Â Â Â Â Started lbv1.beta.com
>>>>>>>>>>>    varnishd  (lsb:varnish): Started lbv1.beta.com
>>>>>>>>>>>Â Resource Group: grpStonith1
>>>>>>>>>>>Â Â Â Â Stonith1-1 (stonith:external/stonith-helper):Â Â Â Â Â Stopped
>>>>>>>>>>>Â Â Â Â Stonith1-2 (stonith:external/xen0):Â Â Â Â Â Â Â Stopped
>>>>>>>>>>>Â Resource Group: grpStonith2
>>>>>>>>>>>Â Â Â Â Stonith2-1 (stonith:external/stonith-helper):Â Â Â Â Â Stopped
>>>>>>>>>>>Â Â Â Â Stonith2-2 (stonith:external/xen0):Â Â Â Â Â Â Â Stopped
>>>>>>>>>>>Â Clone Set: clone_ping [ping]
>>>>>>>>>>>Â Â Â Â Started: [ lbv1.beta.com lbv2.beta.com ]
>>>>>>>>>>>
>>>>>>>>>>>Node Attributes:
>>>>>>>>>>>* Node lbv1.beta.com:
>>>>>>>>>>>   + default_ping_set                 : 100
>>>>>>>>>>>* Node lbv2.beta.com:
>>>>>>>>>>>   + default_ping_set                 : 100
>>>>>>>>>>>
>>>>>>>>>>>Migration summary:
>>>>>>>>>>>* Node lbv2.beta.com:
>>>>>>>>>>>Â Â Stonith1-1: migration-threshold=1 fail-count=1000000 last-failure='Tue Mar 17
>>>>>>>>>>>Â 10:21:17 2015'
>>>>>>>>>>>* Node lbv1.beta.com:
>>>>>>>>>>>Â Â Stonith2-1: migration-threshold=1 fail-count=1000000 last-failure='Tue Mar 17
>>>>>>>>>>>Â 10:21:17 2015'
>>>>>>>>>>>
>>>>>>>>>>>Failed actions:
>>>>>>>>>>>Â Â Â Stonith1-1_start_0 on lbv2.beta.com 'unknown error' (1): call=31, st
>>>>>>>>>>>atus=Error, last-rc-change='Tue Mar 17 10:21:15 2015', queued=0ms, exec=1082ms
>>>>>>>>>>>Â Â Â Stonith2-1_start_0 on lbv1.beta.com 'unknown error' (1): call=31, st
>>>>>>>>>>>atus=Error, last-rc-change='Tue Mar 17 10:21:16 2015', queued=0ms, exec=1079ms
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>/var/log/ha-debugã®ãƒã‚°ã§ã™ã€‚
>>>>>>>>>>>
>>>>>>>>>>>IPaddr2(vip_208)[7851]: 2015/03/17_10:21:22 INFO: Adding inet address 192.168.17.208/24 with broadcast address 192.168.17.255 to device eth0
>>>>>>>>>>>IPaddr2(vip_208)[7851]: 2015/03/17_10:21:22 INFO: Bringing device eth0 up
>>>>>>>>>>>IPaddr2(vip_208)[7851]: 2015/03/17_10:21:22 INFO: /usr/libexec/heartbeat/send_arp -i 200 -r 5 -p /var/run/resource-agents/send_arp-192.168.17.208 eth0 192.168.17.208 auto not_used not_used
>>>>>>>>>>>
>>>>>>>>>>>標準出力や標準エラー出力ã¯ã‚ã‚Šã¾ã›ã‚“ã§ã—ãŸã€‚
>>>>>>>>>>>
>>>>>>>>>>>stonith-helperãŒãŠã‹ã—ã„ã®ã§ã—ょã†ã‹ã€‚
>>>>>>>>>>>stonith-helperã¯ã‚·ã‚§ãƒ«ã‚¹ã‚¯ãƒªãƒ—トãªã®ã§ã‚¤ãƒ³ã‚¹ãƒˆãƒ¼ãƒ«ã¯ã‚ã¾ã‚Šæ°—ã«ã—ã¦ã„ãªã‹ã£ãŸã®ã§ã™ãŒã€‚
>>>>>>>>>>>stonith-helperã¯ã“ã“ã«é…ç½®ã•ã‚Œã¦ã„ã¾ã™ã€‚
>>>>>>>>>>>/usr/local/heartbeat/lib/stonith/plugins/external/stonith-helper
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>宜ã—ããŠé¡˜ã„ã—ã¾ã™ã€‚
>>>>>>>>>>>
>>>>>>>>>>>以上
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>2015-03-17 9:45 GMT+09:00 <renayama19661014@ybb.ne.jp>:
>>>>>>>>>>>
>>>>>>>>>>>ç¦ç”°ã•ã‚“
>>>>>>>>>>>>
>>>>>>>>>>>>ãŠã¯ã‚ˆã†ã”ã–ã„ã¾ã™ã€‚山内ã§ã™ã€‚
>>>>>>>>>>>>
>>>>>>>>>>>>念ã®ç‚ºã€æ‰‹å…ƒã«ã‚る複数ã®stonithを利用ã—ãŸå ´åˆã®ä¾‹ã‚’抜粋ã—ã¦ãŠé€ã‚Šã—ã¾ã™ã€‚
>>>>>>>>>>>>(実際ã«ã¯ã€æ”¹è¡Œã«æ°—を付ã‘ã¦ãã ã•ã„)
>>>>>>>>>>>>
>>>>>>>>>>>>以下ã®ä¾‹ã¯ã€PM1.1ç³»ã§ã®è¨å®šã§ã€
>>>>>>>>>>>>nodeaã¯ã€prmStonith1-1ã€Â prmStonith1-2ã®é †ã§stonithãŒå®Ÿè¡Œã•ã‚Œã¾ã™ã€‚
>>>>>>>>>>>>nodebã¯ã€prmStonith2-1ã€Â prmStonith2-2ã®é †ã§stonithãŒå®Ÿè¡Œã•ã‚Œã¾ã™ã€‚
>>>>>>>>>>>>
>>>>>>>>>>>>stonith自体ã¯ã€helperã¨sshã§ã™ã€‚
>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>>(snip)
>>>>>>>>>>>>### Group Configuration ###
>>>>>>>>>>>>group grpStonith1 \
>>>>>>>>>>>>prmStonith1-1 \
>>>>>>>>>>>>prmStonith1-2
>>>>>>>>>>>>
>>>>>>>>>>>>group grpStonith2 \
>>>>>>>>>>>>prmStonith2-1 \
>>>>>>>>>>>>prmStonith2-2
>>>>>>>>>>>>
>>>>>>>>>>>>### Fencing Topology ###
>>>>>>>>>>>>fencing_topology \
>>>>>>>>>>>>nodea: prmStonith1-1 prmStonith1-2 \
>>>>>>>>>>>>nodeb: prmStonith2-1 prmStonith2-2
>>>>>>>>>>>>(snp)
>>>>>>>>>>>>primitive prmStonith1-1 stonith:external/stonith-helper \
>>>>>>>>>>>>params \
>>>>>>>>>>>>
>>>>>>>>>>>>pcmk_reboot_retries="1" \
>>>>>>>>>>>>pcmk_reboot_timeout="40s" \
>>>>>>>>>>>>hostlist="nodea" \
>>>>>>>>>>>>dead_check_target="192.168.28.60 192.168.28.70" \
>>>>>>>>>>>>standby_check_command="/usr/sbin/crm_resource -r prmRES -W | grep -qi `hostname`" \
>>>>>>>>>>>>run_online_check="yes" \
>>>>>>>>>>>>op start interval="0s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>>op stop interval="0s" timeout="60s" on-fail="ignore"
>>>>>>>>>>>>
>>>>>>>>>>>>primitive prmStonith1-2 stonith:external/ssh \
>>>>>>>>>>>>params \
>>>>>>>>>>>>pcmk_reboot_timeout="60s" \
>>>>>>>>>>>>hostlist="nodea" \
>>>>>>>>>>>>op start interval="0s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>>op monitor interval="3600s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>>op stop interval="0s" timeout="60s" on-fail="ignore"
>>>>>>>>>>>>
>>>>>>>>>>>>primitive prmStonith2-1 stonith:external/stonith-helper \
>>>>>>>>>>>>params \
>>>>>>>>>>>>pcmk_reboot_retries="1" \
>>>>>>>>>>>>pcmk_reboot_timeout="40s" \
>>>>>>>>>>>>hostlist="nodeb" \
>>>>>>>>>>>>dead_check_target="192.168.28.61 192.168.28.71" \
>>>>>>>>>>>>standby_check_command="/usr/sbin/crm_resource -r prmRES -W | grep -qi `hostname`" \
>>>>>>>>>>>>run_online_check="yes" \
>>>>>>>>>>>>op start interval="0s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>>op stop interval="0s" timeout="60s" on-fail="ignore"
>>>>>>>>>>>>
>>>>>>>>>>>>primitive prmStonith2-2 stonith:external/ssh \
>>>>>>>>>>>>params \
>>>>>>>>>>>>pcmk_reboot_timeout="60s" \
>>>>>>>>>>>>hostlist="nodeb" \
>>>>>>>>>>>>op start interval="0s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>>op monitor interval="3600s" timeout="60s" on-fail="restart" \
>>>>>>>>>>>>op stop interval="0s" timeout="60s" on-fail="ignore"
>>>>>>>>>>>>(snip)
>>>>>>>>>>>>location rsc_location-grpStonith1-2 grpStonith1 \
>>>>>>>>>>>>rule -INFINITY: #uname eq nodea
>>>>>>>>>>>>location rsc_location-grpStonith2-3 grpStonith2 \
>>>>>>>>>>>>rule -INFINITY: #uname eq nodeb
>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>>以上ã§ã™ã€‚
>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>>--
>>>>>>>>>>>
>>>>>>>>>>>ELF Systems
>>>>>>>>>>>Masamichi Fukuda
>>>>>>>>>>>mail to: masamichi_fukuda@elf-systems.com
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>_______________________________________________
>>>>>>>>>>Linux-ha-japan mailing list
>>>>>>>>>>Linux-ha-japan@lists.sourceforge.jp
>>>>>>>>>>http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>--
>>>>>>>>>
>>>>>>>>>ELF Systems
>>>>>>>>>Masamichi Fukuda
>>>>>>>>>mail to: masamichi_fukuda@elf-systems.com
>>>>>>>>>
>>>>>>>>>
>>>>>>>>
>>>>>>>>_______________________________________________
>>>>>>>>Linux-ha-japan mailing list
>>>>>>>>Linux-ha-japan@lists.sourceforge.jp
>>>>>>>>http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>--
>>>>>>>
>>>>>>>ELF Systems
>>>>>>>Masamichi Fukuda
>>>>>>>mail to: masamichi_fukuda@elf-systems.com
>>>>>>>
>>>>>>>
>>>>>>
>>>>>>_______________________________________________
>>>>>>Linux-ha-japan mailing list
>>>>>>Linux-ha-japan@lists.sourceforge.jp
>>>>>>http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>>>>>
>>>>>
>>>>>
>>>>>--
>>>>>
>>>>>ELF Systems
>>>>>Masamichi Fukuda
>>>>>mail to: masamichi_fukuda@elf-systems.com
>>>>>
>>>>>
>>>>
>>>>_______________________________________________
>>>>Linux-ha-japan mailing list
>>>>Linux-ha-japan@lists.sourceforge.jp
>>>>http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>>>
>>>
>>>--
>>>
>>>ELF Systems
>>>Masamichi Fukuda
>>>mail to: masamichi_fukuda@elf-systems.com
>>>
>>>
>>>
>>
>>_______________________________________________
>>Linux-ha-japan mailing list
>>Linux-ha-japan@lists.sourceforge.jp
>>http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan
>>
>
>
>--
>
>ELF Systems
>Masamichi Fukuda
>mail to: masamichi_fukuda@elf-systems.com
>
>
_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan@lists.sourceforge.jp
http://lists.sourceforge.jp/mailman/listinfo/linux-ha-japan