CentOS Stream 9
Sponsored Link

Pacemaker : Set Fence Device2022/03/25

 
Set Fence Device on Cluster. (see about Fencing on the site below)
https://access.redhat.com/documentation/ja-jp/red_hat_enterprise_linux/8/html/configuring_and_managing_high_availability_clusters/s1-fencing-haao

It's possible to use many kinds of devices for fencing, APC or IPMI and so on.
On this example, Configure storage based fencing by using SCSI storage like follows.
Configure basic Cluster settings first, refer to here.

                        +--------------------+
                        | [  ISCSI Target  ] |
                        |    dlp.srv.world   |
                        +---------+----------+
                         10.0.0.30|
                                  |
+----------------------+          |          +----------------------+
| [  Cluster Node#1  ] |10.0.0.51 | 10.0.0.52| [  Cluster Node#2  ] |
|   node01.srv.world   +----------+----------+   node02.srv.world   |
|                      |                     |                      |
+----------------------+                     +----------------------+

[1]
Configure ISCSI Target and Create a storage for fence device, refer to here.
On this example, it created ISCSI storage as IQN [iqn.2021-06.world.srv:dlp.target01] with [1M] size.
[2]
[3] On all Cluster Nodes, Install SCSI Fence Agent.
[root@node01 ~]#
dnf --enablerepo=highavailability -y install fence-agents-scsi watchdog pcp-zeroconf
[root@node01 ~]#
cp /usr/share/cluster/fence_scsi_check /etc/watchdog.d/

[root@node01 ~]#
systemctl enable --now watchdog

[4] Configure Fencing on a Node.
[sda] of the example below is the storage from ISCSI target.
# confirm disk ID

[root@node01 ~]#
ll /dev/disk/by-id | grep sda | grep wwn

lrwxrwxrwx. 1 root root  9 Mar 25 09:12 wwn-0x60014054dab1945c29a455882f8da738 -> ../../sda

# set fencing
# [scsi-shooter] : any name
# [pcmk_host_list=***] : specify cluster nodes
# [devices=***] : disk ID

[root@node01 ~]#
pcs stonith create scsi-shooter fence_scsi pcmk_host_list="node01.srv.world node02.srv.world" devices=/dev/disk/by-id/wwn-0x60014054dab1945c29a455882f8da738 meta provides=unfencing
# show config

[root@node01 ~]#
pcs stonith config scsi-shooter

 Resource: scsi-shooter (class=stonith type=fence_scsi)
  Attributes: devices=/dev/disk/by-id/wwn-0x60014054dab1945c29a455882f8da738 pcmk_host_list="node01.srv.world node02.srv.world"
  Meta Attrs: provides=unfencing
  Operations: monitor interval=60s (scsi-shooter-monitor-interval-60s)

# show status
# OK if the status of fence device is [Started]

[root@node01 ~]#
pcs status

Cluster name: ha_cluster
Cluster Summary:
  * Stack: corosync
  * Current DC: node01.srv.world (version 2.1.2-4.el9-ada5c3b36e2) - partition with quorum
  * Last updated: Fri Mar 25 09:18:32 2022
  * Last change:  Fri Mar 25 09:18:11 2022 by root via cibadmin on node01.srv.world
  * 2 nodes configured
  * 1 resource instance configured

Node List:
  * Online: [ node01.srv.world node02.srv.world ]

Full List of Resources:
  * scsi-shooter        (stonith:fence_scsi):    Started node01.srv.world

Daemon Status:
  corosync: active/enabled
  pacemaker: active/enabled
  pcsd: active/enabled
[5] Try to test fencing.
[root@node02 ~]#
pcs status

Cluster name: ha_cluster
Cluster Summary:
  * Stack: corosync
  * Current DC: node01.srv.world (version 2.1.2-4.el9-ada5c3b36e2) - partition with quorum
  * Last updated: Fri Mar 25 09:19:38 2022
  * Last change:  Fri Mar 25 09:18:11 2022 by root via cibadmin on node01.srv.world
  * 2 nodes configured
  * 1 resource instance configured

Node List:
  * Online: [ node01.srv.world node02.srv.world ]

Full List of Resources:
  * scsi-shooter        (stonith:fence_scsi):    Started node01.srv.world

Daemon Status:
  corosync: active/enabled
  pacemaker: active/enabled
  pcsd: active/enabled

# fencing

[root@node02 ~]#
pcs stonith fence node01.srv.world

Node: node01.srv.world fenced
# target node turns to [OFFLINE] and it will be restarted
# after rebooting, status wull be back to [Online]

[root@node02 ~]#
pcs status

Cluster name: ha_cluster
Cluster Summary:
  * Stack: corosync
  * Current DC: node02.srv.world (version 2.1.2-4.el9-ada5c3b36e2) - partition with quorum
  * Last updated: Fri Mar 25 09:20:07 2022
  * Last change:  Fri Mar 25 09:18:11 2022 by root via cibadmin on node01.srv.world
  * 2 nodes configured
  * 1 resource instance configured

Node List:
  * Online: [ node02.srv.world ]
  * OFFLINE: [ node01.srv.world ]

Full List of Resources:
  * scsi-shooter        (stonith:fence_scsi):    Started node02.srv.world

Daemon Status:
  corosync: active/enabled
  pacemaker: active/enabled
  pcsd: active/enabled
Matched Content