Pacemaker : Set Fence Device2024/07/23 |
Set Fence Device on Cluster. (see about Fencing on the site below) https://access.redhat.com/documentation/ja-jp/red_hat_enterprise_linux/8/html/configuring_and_managing_high_availability_clusters/s1-fencing-haao
It's possible to use many kinds of devices for fencing, APC or IPMI and so on. +--------------------+ | [ ISCSI Target ] | | dlp.srv.world | +---------+----------+ 10.0.0.30| | +----------------------+ | +----------------------+ | [ Cluster Node#1 ] |10.0.0.51 | 10.0.0.52| [ Cluster Node#2 ] | | node01.srv.world +----------+----------+ node02.srv.world | | | | | +----------------------+ +----------------------+ |
[1] |
Configure ISCSI Target and Create a storage for fence device, refer to here.
On this example, it created ISCSI storage as IQN [iqn.2024-04.world.srv:dlp.target01] with [1M] size. |
[2] | |
[3] | On all Cluster Nodes, Install SCSI Fence Agent. |
root@node01:~#
root@node01:~# apt -y install resource-agents-base watchdog mkdir /etc/watchdog.d root@node01:~# cp /usr/share/cluster/fence_scsi_check /etc/watchdog.d/ root@node01:~# systemctl stop watchdog.service root@node01:~# systemctl start watchdog.service |
[4] | Configure Fencing on a Node. [sda] of the example below is the storage from ISCSI target. |
# confirm disk ID root@node01:~# ll /dev/disk/by-id | grep sda | grep wwn lrwxrwxrwx 1 root root 9 Jul 23 00:08 wwn-0x6001405fd08aa7cd2fe4f8cad7b28412 -> ../../sda # set fencing # [scsi-shooter] : any name # [pcmk_host_list=***] : specify cluster nodes # [devices=***] : disk ID root@node01:~# pcs stonith create scsi-shooter fence_scsi pcmk_host_list="node01.srv.world node02.srv.world" devices=/dev/disk/by-id/wwn-0x6001405fd08aa7cd2fe4f8cad7b28412 meta provides=unfencing
# show config root@node01:~# pcs stonith config scsi-shooter Resource: scsi-shooter (class=stonith type=fence_scsi) Attributes: scsi-shooter-instance_attributes devices=/dev/disk/by-id/wwn-0x6001405fd08aa7cd2fe4f8cad7b28412 pcmk_host_list="node01.srv.world node02.srv.world" Meta Attributes: scsi-shooter-meta_attributes provides=unfencing Operations: monitor: scsi-shooter-monitor-interval-60s interval=60s # show status # OK if the status of fence device is [Started] root@node01:~# pcs status Cluster name: ha_cluster Cluster Summary: * Stack: corosync (Pacemaker is running) * Current DC: node01.srv.world (version 2.1.6-6fdc9deea29) - partition with quorum * Last updated: Tue Jul 23 00:13:10 2024 on node01.srv.world * Last change: Tue Jul 23 00:12:36 2024 by root via cibadmin on node01.srv.world * 2 nodes configured * 1 resource instance configured Node List: * Online: [ node01.srv.world node02.srv.world ] Full List of Resources: * scsi-shooter (stonith:fence_scsi): Started node01.srv.world Daemon Status: corosync: active/enabled pacemaker: active/enabled pcsd: active/enabled |
[5] | Try to test fencing. |
root@node02:~# pcs status Cluster name: ha_cluster Cluster Summary: * Stack: corosync (Pacemaker is running) * Current DC: node01.srv.world (version 2.1.6-6fdc9deea29) - partition with quorum * Last updated: Tue Jul 23 00:13:58 2024 on node02.srv.world * Last change: Tue Jul 23 00:12:36 2024 by root via cibadmin on node01.srv.world * 2 nodes configured * 1 resource instance configured Node List: * Online: [ node01.srv.world node02.srv.world ] Full List of Resources: * scsi-shooter (stonith:fence_scsi): Started node01.srv.world Daemon Status: corosync: active/enabled pacemaker: active/enabled pcsd: active/enabled # fencing root@node02:~# pcs stonith fence node01.srv.world Node: node01.srv.world fenced # target node turns to [OFFLINE] and it will be restarted root@node02:~# pcs status Cluster name: ha_cluster Cluster Summary: * Stack: corosync (Pacemaker is running) * Current DC: node02.srv.world (version 2.1.6-6fdc9deea29) - partition with quorum * Last updated: Tue Jul 23 00:14:35 2024 on node02.srv.world * Last change: Tue Jul 23 00:12:36 2024 by root via cibadmin on node01.srv.world * 2 nodes configured * 1 resource instance configured Node List: * Online: [ node02.srv.world ] * OFFLINE: [ node01.srv.world ] Full List of Resources: * scsi-shooter (stonith:fence_scsi): Started node02.srv.world Daemon Status: corosync: active/enabled pacemaker: active/enabled pcsd: active/enabled # after rebooting, if you manually start the node, do like follows root@node02:~# pcs cluster start node01.srv.world |
Sponsored Link |
|