Pacemaker : Set Fence Device2025/10/09 |
|
Set Fence Device on Cluster. (see about Fencing on the site below) https://access.redhat.com/documentation/ja-jp/red_hat_enterprise_linux/8/html/configuring_and_managing_high_availability_clusters/s1-fencing-haao
It's possible to use many kinds of devices for fencing, APC or IPMI and so on.
+--------------------+
| [ ISCSI Target ] |
| dlp.srv.world |
+---------+----------+
10.0.0.30|
|
+----------------------+ | +----------------------+
| [ Cluster Node#1 ] |10.0.0.51 | 10.0.0.52| [ Cluster Node#2 ] |
| node01.srv.world +----------+----------+ node02.srv.world |
| | | |
+----------------------+ +----------------------+
|
| [1] |
Configure ISCSI Target and Create a storage for fence device, refer to here. |
| [2] | |
| [3] | On all Cluster Nodes, Install SCSI Fence Agent. |
|
root@node01:~#
root@node01:~# apt -y install pacemaker-resource-agents watchdog mkdir /etc/watchdog.d root@node01:~# cp /usr/share/cluster/fence_scsi_check /etc/watchdog.d/ root@node01:~# systemctl stop watchdog.service root@node01:~# systemctl start watchdog.service |
| [4] | Configure Fencing on a Node. [sda] of the example below is the storage from ISCSI target. |
|
# confirm disk ID root@node01:~# ll /dev/disk/by-id | grep sda | grep wwn lrwxrwxrwx 1 root root 9 Oct 9 08:50 wwn-0x60014052e2280b723e447989b527dd74 -> ../../sda # set fencing # [scsi-shooter] : any name # [pcmk_host_list=***] : specify cluster nodes # [devices=***] : disk ID root@node01:~# pcs stonith create scsi-shooter fence_scsi pcmk_host_list="node01.srv.world node02.srv.world" devices=/dev/disk/by-id/wwn-0x60014052e2280b723e447989b527dd74 meta provides=unfencing
# show config root@node01:~# pcs stonith config scsi-shooter
Resource: scsi-shooter (class=stonith type=fence_scsi)
Attributes: scsi-shooter-instance_attributes
devices=/dev/disk/by-id/wwn-0x60014052e2280b723e447989b527dd74
pcmk_host_list="node01.srv.world node02.srv.world"
Meta Attributes: scsi-shooter-meta_attributes
provides=unfencing
Operations:
monitor: scsi-shooter-monitor-interval-60s
interval=60s
# show status # OK if the status of fence device is [Started] root@node01:~# pcs status Cluster name: ha_cluster Cluster Summary: * Stack: corosync (Pacemaker is running) * Current DC: node01.srv.world (version 3.0.0-3.0.0) - partition with quorum * Last updated: Thu Oct 9 08:55:57 2025 on node01.srv.world * Last change: Thu Oct 9 08:55:26 2025 by root via root on node01.srv.world * 2 nodes configured * 1 resource instance configured Node List: * Online: [ node01.srv.world node02.srv.world ] Full List of Resources: * scsi-shooter (stonith:fence_scsi): Started node01.srv.world Daemon Status: corosync: active/enabled pacemaker: active/enabled pcsd: active/enabled |
| [5] | Try to test fencing. |
|
root@node02:~# pcs status Cluster name: ha_cluster Cluster Summary: * Stack: corosync (Pacemaker is running) * Current DC: node01.srv.world (version 3.0.0-3.0.0) - partition with quorum * Last updated: Thu Oct 9 08:55:57 2025 on node01.srv.world * Last change: Thu Oct 9 08:55:26 2025 by root via root on node01.srv.world * 2 nodes configured * 1 resource instance configured Node List: * Online: [ node01.srv.world node02.srv.world ] Full List of Resources: * scsi-shooter (stonith:fence_scsi): Started node01.srv.world Daemon Status: corosync: active/enabled pacemaker: active/enabled pcsd: active/enabled # fencing root@node02:~# pcs stonith fence node01.srv.world Node: node01.srv.world fenced # target node turns to [OFFLINE] and it will be restarted root@node02:~# pcs status Cluster name: ha_cluster Cluster Summary: * Stack: corosync (Pacemaker is running) * Current DC: node02.srv.world (version 3.0.0-3.0.0) - partition with quorum * Last updated: Thu Oct 9 08:56:52 2025 on node02.srv.world * Last change: Thu Oct 9 08:55:26 2025 by root via root on node01.srv.world * 2 nodes configured * 1 resource instance configured Node List: * Online: [ node02.srv.world ] * OFFLINE: [ node01.srv.world ] Full List of Resources: * scsi-shooter (stonith:fence_scsi): Started node02.srv.world Daemon Status: corosync: active/enabled pacemaker: active/enabled pcsd: active/enabled # after rebooting, if you manually start the node, do like follows root@node02:~# pcs cluster start node01.srv.world |
| Sponsored Link |
|
|