飯田ã•ã‚“
æ± ç”°ã§ã™ã€‚
ã”連絡ã‚ã‚ŠãŒã¨ã†ã”ã–ã„ã¾ã™ã€‚
VMware環境ã§ã¯ä¸‹è¨˜ã®ã‚ˆã†ãªå†ç¾æ€§ãŒã‚ã‚Šã¾ã—ãŸã€‚
ãã‚Œãžã‚Œ10回試行ã—ã€10回ã¨ã‚‚åŒä¸€ã®çµæžœã¨ãªã‚Šã¾ã—ãŸã€‚
ãªãŠã€ãƒªã‚½ãƒ¼ã‚¹ã¯Dummy1個ã«ã—ã¦å‹•ä½œã‚’確èªã—ã¦ã„ã¾ã™ã€‚
(1) vSphereClient ã‹ã‚‰ä»®æƒ³ãƒžã‚·ãƒ³ã‚’「リセットã€
- リソースã®ãƒ•ã‚§ã‚¤ãƒ«ã‚ªãƒ¼ãƒã¯æˆåŠŸ
- logconvã®å‡ºåŠ›ã§ã¯ãƒ•ã‚§ã‚¤ãƒ«ã‚ªãƒ¼ãƒå¤±æ•—
例)
Jun 13 11:44:15 acdbv-ha02 warning: Node acdbv-ha01 is lost
Jun 13 11:44:15 acdbv-ha02 info: Set DC node to acdbv-ha02.
Jun 13 11:44:16 acdbv-ha02 error: Start to fail-over.
Jun 13 11:44:16 acdbv-ha02 info: Resource dummy tries to start.
Jun 13 11:44:16 acdbv-ha02 info: Resource dummy started. (rc=0)
Jun 13 11:44:16 acdbv-ha02 error: fail-over failed.
ãƒã‚°ãƒ•ã‚¡ã‚¤ãƒ«ï¼š20160613-logconv/reset
(2) OSコマンド(reboot -nf)ã§ä»®æƒ³ãƒžã‚·ãƒ³ã‚’å†èµ·å‹•
- リソースã®ãƒ•ã‚§ã‚¤ãƒ«ã‚ªãƒ¼ãƒã¯æˆåŠŸ
- logconvã®å‡ºåŠ›ã§ã¯ãƒ•ã‚§ã‚¤ãƒ«ã‚ªãƒ¼ãƒå¤±æ•—
例)
Jun 13 13:02:31 acdbv-ha02 warning: Node acdbv-ha01 is lost
Jun 13 13:02:31 acdbv-ha02 info: Set DC node to acdbv-ha02.
Jun 13 13:02:32 acdbv-ha02 error: Start to fail-over.
Jun 13 13:02:32 acdbv-ha02 info: Resource dummy tries to start.
Jun 13 13:02:32 acdbv-ha02 info: Resource dummy started. (rc=0)
Jun 13 13:02:32 acdbv-ha02 error: fail-over failed.
ãƒã‚°ãƒ•ã‚¡ã‚¤ãƒ«ï¼š20160613-logconv/reboot
(3) initctlコマンドã§Pacemakerã‚’åœæ¢(initctl stop pacemaker.combined)
- リソースã®ãƒ•ã‚§ã‚¤ãƒ«ã‚ªãƒ¼ãƒã¯æˆåŠŸ
- logconvã«ã¯ã€ŒStart to fail-over.ã€ãŒå‡ºåŠ›ã•ã‚Œãªã„
→ コマンドオペレーションã«ã‚ˆã‚‹Pacemakerã®åœæ¢ãªã®ã§ã€ã“ã‚Œã¯ä»•æ§˜ã§ã™ã‹ï¼Ÿ
例)
Jun 13 13:25:53 acdbv-ha02 info: Resource dummy tries to start.
Jun 13 13:25:53 acdbv-ha02 info: Resource dummy started. (rc=0)
ãƒã‚°ãƒ•ã‚¡ã‚¤ãƒ«ï¼š20160613-logconv/initctl
(4) Dummyリソースã®ç›£è¦–æ•…éšœ(ステータスファイルã®å‰Šé™¤)
- リソースã®ãƒ•ã‚§ã‚¤ãƒ«ã‚ªãƒ¼ãƒã¯æˆåŠŸ
- DCノードã§ãƒªã‚½ãƒ¼ã‚¹æ•…éšœ
例)
Jun 13 13:41:04 acdbv-ha02 error: Start to fail-over.
Jun 13 13:41:04 acdbv-ha02 info: Resource dummy tries to stop.
Jun 13 13:41:04 acdbv-ha02 info: Resource dummy stopped. (rc=0)
Jun 13 13:41:04 acdbv-ha02 info: Resource dummy : Move acdbv-ha02 -> acdbv-ha01
Jun 13 13:41:04 acdbv-ha02 info: fail-over succeeded.
- éžDCノードã§ãƒªã‚½ãƒ¼ã‚¹æ•…éšœ
例)
Jun 13 13:42:04 acdbv-ha02 error: Resource dummy does not work. (rc=7)
Jun 13 13:42:04 acdbv-ha02 info: Resource dummy tries to stop.
Jun 13 13:42:04 acdbv-ha02 info: Resource dummy stopped. (rc=0)
Jun 13 13:42:04 acdbv-ha02 info: Resource dummy tries to start.
Jun 13 13:42:04 acdbv-ha02 info: Resource dummy started. (rc=0)
DCã«ä¾å˜ã›ãšã€Œerror: Start to fail-over.ã€ã€Œinfo: fail-over succeeded.ã€ã¨ã„ã†
出力ãŒå¾—られるã“ã¨ã‚’想定ã—ã¦ã„ã¾ã—ãŸãŒã€æœŸå¾…通りã«ãªã‚Šã¾ã›ã‚“ã§ã—ãŸã€‚
ãƒã‚°ãƒ•ã‚¡ã‚¤ãƒ«ï¼š20160613-logconv/monitor_ng
å‰å›žã®ãƒ¡ãƒ¼ãƒ«ã§æŒ‡æ‘˜ã—ã¦ã„ãŸã ã„ãŸã¨ãŠã‚Šã€logconvãŒå¤‰æ›ã«åˆ©ç”¨ã—ã¦ã„る下記メッセージãŒ
ha-logã«å‡ºåŠ›ã•ã‚Œã¦ã„ãªã„ã“ã¨ãŒæ ¹æœ¬åŽŸå› ã ã¨æ€ã„ã¾ã™ã€‚
notice: te_rsc_command: Initiating action <num>: start <resource name>_start_0 on <node name> (local)
ãŸã ã—
- te_rsc_command関数ã‹ã‚‰å½“該メッセージãŒå‡ºåŠ›ã•ã‚Œãªã„ç†ç”±ãŒä¸æ˜Ž(環境ä¾å˜ã‚„ãƒãƒ¼ã‚¸ãƒ§ãƒ³ã®çµ„ã¿åˆã‚ã›ï¼Ÿè¨å®šä¸è¶³ï¼Ÿ)
- te_rsc_command関数ã‹ã‚‰å½“該メッセージãŒå‡ºåŠ›ã•ã‚Œãªã„パターンãŒå¤šå²ã«ã‚ãŸã‚‹
ã¨ã„ã†æ¡ä»¶ãŒã‚ã‚‹ã“ã¨ã‹ã‚‰ã€ä»Šå›žæ§‹ç¯‰ã™ã‚‹ç’°å¢ƒ(ãŠã‚ˆã³é¡žä¼¼ã®ãƒãƒ¼ã‚¸ãƒ§ãƒ³ã‚’使用ã—ã¦ã„る環境)ã§ã¯
logconvã®å‡ºåŠ›çµæžœã‹ã‚‰ãƒ•ã‚§ã‚¤ãƒ«ã‚ªãƒ¼ãƒã®æˆå¦ã‚’判æ–ã›ãš
ha-logã®å‡ºåŠ›çµæžœã‚’システム監視(Hinemos, Zabbix, JP1ãªã©)ã«ç™»éŒ²ã—
イベント発生時ã®é€šçŸ¥(ç™ºå ±)ã¸ã¤ãªã’ã‚‹ã“ã¨ã¨ã—ã¾ã™ã€‚
# 今回ã¯Pacemaker 1.1.12ã‚’å°Žå…¥ã—ã¾ã™ã€‚
# ãŸã ã€Pacemaker 1.1.13/RHEL6/VMwareã®ç’°å¢ƒã§ã‚‚åŒæ§˜ã®å‹•ä½œã¯ç™ºç”Ÿã—ãã†ãªæ°—ãŒã—ã¾ã™ãŒã€‚。。
# 1.1.12ã¨1.1.13ã§ãƒã‚°å‡ºåŠ›å‘¨ã‚Šã§å¤§å¹…ãªå¤‰æ›´ã¯ãªã„ã§ã™ã‚ˆã。
以上よã‚ã—ããŠé¡˜ã„ã„ãŸã—ã¾ã™ã€‚
æ± ç”°æ·³å
差出人: 飯田 雄介
é€ä¿¡æ—¥æ™‚: 2016å¹´6月10æ—¥ 16:07
宛先: linux-ha-japan@lists.osdn.me
件å: Re: [Linux-ha-jp]DCノード故障時ã®logconvã®å‡ºåŠ›ã«ã¤ã„ã¦
æ± ç”°ã•ã‚“
ãŠä¸–話ã«ãªã‚Šã¾ã™ã€‚
飯田ã§ã™ã€‚
> 故障発生時ã®DCã¯1å·æ©Ÿã§ã™ã€‚
> DC故障を伴ã†ãƒ•ã‚§ã‚¤ãƒ«ã‚ªãƒ¼ãƒç™ºç”Ÿæ™‚ã«
> logconvã«ä¸Šè¨˜ã®ãƒ¡ãƒƒã‚»ãƒ¼ã‚¸ãŒå‡ºåŠ›ã•ã‚Œã‚‹ã®ã¯ä»•æ§˜ã§ã—ょã†ã‹ã€‚
フェイルオーãƒãƒ¼å¤±æ•—ã¨ãªã‚‹ã®ã¯æœŸå¾…ã•ã‚Œã‚‹å‹•ä½œã§ã¯ã‚ã‚Šã¾ã›ã‚“。
期待ã•ã‚Œã‚‹å‹•ãã¯ãƒ•ã‚§ã‚¤ãƒ«ã‚ªãƒ¼ãƒãƒ¼æˆåŠŸã¨ãªã‚‹ã“ã¨ã§ã™ã€‚
ç§ã®æ‰‹å…ƒã®ç’°å¢ƒã§ã‚‚é ‚ã„ãŸè¨å®šã‚’使ã£ã¦ä¼¼ãŸã‚ˆã†ãªæ§‹æˆã‚’å–ã‚Šå†ç¾ã—ã¦ã¿ã¾ã—ãŸãŒã€
下記ã®é€šã‚Šãƒ•ã‚§ã‚¤ãƒ«ã‚ªãƒ¼ãƒãƒ¼æˆåŠŸã¨ãªã‚Šã¾ã—ãŸã€‚
Jun 10 05:42:20 cento7-logconv-2.novalocal info: Set DC node to cento7-logconv-1.novalocal.
Jun 10 05:42:47 cento7-logconv-2.novalocal warning: Node cento7-logconv-1.novalocal is lost
Jun 10 05:42:47 cento7-logconv-2.novalocal info: Unset DC node cento7-logconv-1.novalocal.
Jun 10 05:42:47 cento7-logconv-2.novalocal info: Set DC node to cento7-logconv-2.novalocal.
Jun 10 05:42:48 cento7-logconv-2.novalocal error: Start to fail-over.
Jun 10 05:42:48 cento7-logconv-2.novalocal info: Resource dummy01 tries to start.
Jun 10 05:42:48 cento7-logconv-2.novalocal info: Resource dummy01 started. (rc=0)
Jun 10 05:42:48 cento7-logconv-2.novalocal info: Resource dummy02 tries to start.
Jun 10 05:42:48 cento7-logconv-2.novalocal info: Resource dummy02 started. (rc=0)
Jun 10 05:42:48 cento7-logconv-2.novalocal info: Resource dummy03 tries to start.
Jun 10 05:42:48 cento7-logconv-2.novalocal info: Resource dummy03 started. (rc=0)
Jun 10 05:42:48 cento7-logconv-2.novalocal info: Resource dummy01 : Started on cento7-logconv-2novalocal
Jun 10 05:42:48 cento7-logconv-2.novalocal info: Resource dummy03 : Started on cento7-logconv-2novalocal
Jun 10 05:42:48 cento7-logconv-2.novalocal info: fail-over succeeded.
å†ç¾ç’°å¢ƒã¨é ‚ã„ãŸha-logを比較ã—ãŸã¨ã“ã‚ã€æ± ç”°ã•ã‚“ã®ç’°å¢ƒã§ã¯ä¸‹è¨˜ã®ã‚ˆã†ãªãƒã‚°ãŒå‡ºåŠ›ã•ã‚Œã¦ã„ãªã„よã†ã§ã™ã€‚
Jun 10 05:42:48 cento7-logconv-2 crmd[2249]: notice: te_rsc_command: Initiating action 4: start dummy01_start_0 on cento7-logconv-2.novalocal (local)
logconvã§ã¯ã“ã®ãƒã‚°ã‚’使ã£ã¦ãƒªã‚½ãƒ¼ã‚¹ã®ã‚¹ãƒ†ãƒ¼ã‚¿ã‚¹ã‚’管ç†ã—ã¦ã¾ã™ã®ã§ã€
ãƒã‚°ã®å‡ºåŠ›ãŒãªã„ã¨ãƒ•ã‚§ã‚¤ãƒ«ã‚ªãƒ¼ãƒãƒ¼ãŒæˆåŠŸã—ãŸã¨åˆ¤å®šã§ãã¾ã›ã‚“。
ãªãœã“ã®ãƒã‚°ãŒå‡ºåŠ›ã•ã‚Œãªã‹ã£ãŸã®ã‹ã¾ã§ã¯ã‚ã‹ã‚Šã¾ã›ã‚“ã§ã—ãŸã€‚
以上ã€ã”確èªã‚ˆã‚ã—ããŠé¡˜ã„ã„ãŸã—ã¾ã™ã€‚
> -----Original Message-----
> From: linux-ha-japan-bounces@lists.osdn.me
> [mailto:linux-ha-japan-bounces@lists.osdn.me] On Behalf Of
> tsukishima.ha@gmail.com
> Sent: Thursday, June 09, 2016 9:19 AM
> To: linux-ha-japan@lists.osdn.me
> Subject: [Linux-ha-jp] DCノード故障時ã®logconvã®å‡ºåŠ›ã«ã¤ã„ã¦
>
> ãŠä¸–話ã«ãªã£ã¦ãŠã‚Šã¾ã™ã€‚
>
> æ± ç”°ã§ã™ã€‚
>
>
>
> 下記ã®ç’°å¢ƒã§2ノードクラスタを構築ã—ã¦ã„ã¾ã™ã€‚
>
>
>
> # cat /etc/redhat-release
>
> Red Hat Enterprise Linux Server release 6.5 (Santiago)
>
>
>
> # rpm -qa | grep pacemaker-all
>
> pacemaker-all-1.1.12-1.1.el6.noarch
>
>
>
> # rpm -qa | grep pm_logconv-cs
>
> pm_logconv-cs-2.0-1.el6.noarch
>
>
>
>
>
> DummyリソースãŒ3ã¤è¨å®šã•ã‚ŒãŸgroupã‚’1å·æ©Ÿã§èµ·å‹•ã•ã›ãŸçŠ¶æ…‹ã§
>
> 1å·æ©Ÿã‚’åœæ¢(パワーオフ)ã™ã‚‹ã¨ã€æœŸå¾…通り2å·æ©Ÿã«ãƒ•ã‚§ã‚¤ãƒ«ã‚ªãƒ¼ãƒã—ã¾ã—ãŸãŒ
>
> 2å·æ©Ÿã®logconvã«ä¸‹è¨˜ã®ãƒ¡ãƒƒã‚»ãƒ¼ã‚¸ãŒå‡ºåŠ›ã•ã‚Œã¾ã—ãŸã€‚
>
> logconvã®ãƒ¡ãƒƒã‚»ãƒ¼ã‚¸ã ã‘を確èªã™ã‚‹ã¨ã€ãƒ•ã‚§ã‚¤ãƒ«ã‚ªãƒ¼ãƒã«å¤±æ•—ã—ã¦ã„るよã†ã«ã¿
> ãˆã¾ã™ã€‚
>
>
>
> Jun 8 19:25:58 acdbv-ha02 warning: Node acdbv-ha01 is lost
>
> Jun 8 19:25:58 acdbv-ha02 info: Set DC node to acdbv-ha02.
>
> Jun 8 19:26:00 acdbv-ha02 error: Start to fail-over.
>
> Jun 8 19:26:00 acdbv-ha02 info: Resource dummy01 tries to start.
>
> Jun 8 19:26:00 acdbv-ha02 info: Resource dummy01 started. (rc=0)
>
> Jun 8 19:26:00 acdbv-ha02 info: Resource dummy02 tries to start.
>
> Jun 8 19:26:00 acdbv-ha02 info: Resource dummy02 started. (rc=0)
>
> Jun 8 19:26:00 acdbv-ha02 info: Resource dummy03 tries to start.
>
> Jun 8 19:26:00 acdbv-ha02 info: Resource dummy03 started. (rc=0)
>
> Jun 8 19:26:00 acdbv-ha02 error: fail-over failed.
>
>
>
> 故障発生時ã®DCã¯1å·æ©Ÿã§ã™ã€‚
>
> DC故障を伴ã†ãƒ•ã‚§ã‚¤ãƒ«ã‚ªãƒ¼ãƒç™ºç”Ÿæ™‚ã«
>
> logconvã«ä¸Šè¨˜ã®ãƒ¡ãƒƒã‚»ãƒ¼ã‚¸ãŒå‡ºåŠ›ã•ã‚Œã‚‹ã®ã¯ä»•æ§˜ã§ã—ょã†ã‹ã€‚
>
> ãã‚Œã¨ã‚‚logconvã®è¨å®šãŒä¸è¶³ã—ã¦ã„ã‚‹ãŸã‚DCæ•…éšœã«å¯¾å¿œã§ãã¦ã„ãªã„ã®ã§ã—ょã†
> ã‹ã€‚
>
> 動作確èªã«ä½¿ç”¨ã—ãŸlogconvã®è¨å®šãŠã‚ˆã³ha-logを添付ã„ãŸã—ã¾ã™ã€‚
>
>
>
> 以上よã‚ã—ããŠé¡˜ã„ã„ãŸã—ã¾ã™ã€‚
>
>
>
> NTT先端技術
>
> æ± ç”°æ·³å
_______________________________________________
Linux-ha-japan mailing list
Linux-ha-japan@lists.osdn.me
http://lists.osdn.me/mailman/listinfo/linux-ha-japan