CentOS Stream 9
Sponsored Link

OpenStack Epoxy : Compute ノードを追加する (GPU)2025/05/23

 

GPU を搭載した Compute ノードを追加して、仮想マシンインスタンスで GPU が利用できるように設定します。

当例では以下のような環境を例に、新たに GPU を搭載した [node02.srv.world] を Compute ノードとして追加します。

------------+--------------------------+--------------------------+------------
            |                          |                          |
        eth0|10.0.0.30             eth0|10.0.0.50             eth0|10.0.0.51
+-----------+-----------+  +-----------+-----------+  +-----------+-----------+
|   [ dlp.srv.world ]   |  | [ network.srv.world ] |  |  [ node01.srv.world ] |
|     (Control Node)    |  |     (Network Node)    |  |     (Compute Node)    |
|                       |  |                       |  |                       |
|  MariaDB    RabbitMQ  |  |      Open vSwitch     |  |        Libvirt        |
|  Memcached  Nginx     |  |     Neutron Server    |  |      Nova Compute     |
|  Keystone   httpd     |  |      OVN-Northd       |  |      Open vSwitch     |
|  Glance     Nova API  |  |         Nginx         |  |   OVN Metadata Agent  |
|                       |  |                       |  |     OVN-Controller    |
+-----------------------+  +-----------------------+  +-----------------------+

------------+------------
            |
        eth0|10.0.0.52
+-----------+-----------+
|  [ node02.srv.world ] |
|  (Compute Node (GPU)) |
|                       |
|        Libvirt        |
|      Nova Compute     |
|      Open vSwitch     |
|   OVN Metadata Agent  |
|     OVN-Controller    |
+-----------------------+

[1]

追加する ノードを こちらを参考にして Openstack クラスターに Compute ノードとして追加しておきます

[2]

追加する Compute ノードに こちらの [1] を参考にして GPU パススルーの設定を適用しておきます

[3] 追加した Nova-Compute に、GPU パススルー用の設定をします。
[root@node02 ~]#
lspci -nn | grep -i nvidia

81:00.0 VGA compatible controller [0300]: NVIDIA Corporation GA104 [GeForce RTX 3060] [10de:2487] (rev a1)
81:00.1 Audio device [0403]: NVIDIA Corporation GA104 High Definition Audio Controller [10de:228b] (rev a1)

[root@node02 ~]#
vi /etc/nova/nova.conf
# 最終行に追記
# パススルーしたいデバイスの [vendor_id], [product_id] を追記
[pci]
passthrough_whitelist = { "vendor_id": "10de", "product_id": "2487" }

[root@node02 ~]#
systemctl restart openstack-nova-compute
[4] Control ノードで Nova の設定を変更します。
[root@dlp ~(keystone)]#
vi /etc/nova/nova.conf
# 最終行に追記
# 対象の Compute ノードでパススルー設定したデバイスの [vendor_id], [product_id] を追記
# [name] は任意の名称
[pci]
alias: { "vendor_id":"10de", "product_id":"2487", "device_type":"type-PCI", "name":"RTX-3060" }

[filter_scheduler]
enabled_filters = PciPassthroughFilter

[root@dlp ~(keystone)]#
systemctl restart openstack-nova-api openstack-nova-conductor openstack-nova-scheduler

[root@dlp ~(keystone)]#
su -s /bin/bash nova -c "nova-manage cell_v2 discover_hosts"
# GPU 用の [flavor] 作成

[root@dlp ~(keystone)]#
openstack flavor create --id 6 --vcpus 4 --ram 8192 --disk 20 --property "pci_passthrough:alias"="RTX-3060:1" gpu1.small

+----------------------------+------------------------------------+
| Field                      | Value                              |
+----------------------------+------------------------------------+
| OS-FLV-DISABLED:disabled   | False                              |
| OS-FLV-EXT-DATA:ephemeral  | 0                                  |
| description                | None                               |
| disk                       | 20                                 |
| id                         | 6                                  |
| name                       | gpu1.small                         |
| os-flavor-access:is_public | True                               |
| properties                 | pci_passthrough:alias='RTX-3060:1' |
| ram                        | 8192                               |
| rxtx_factor                | 1.0                                |
| swap                       | 0                                  |
| vcpus                      | 4                                  |
+----------------------------+------------------------------------+

[root@dlp ~(keystone)]#
openstack flavor list

+----+------------+-------+------+-----------+-------+-----------+
| ID | Name       |   RAM | Disk | Ephemeral | VCPUs | Is Public |
+----+------------+-------+------+-----------+-------+-----------+
| 1  | m1.tiny    |  2048 |   10 |         0 |     1 | True      |
| 2  | m1.small   |  4096 |   10 |         0 |     2 | True      |
| 3  | m1.medium  |  8192 |   10 |         0 |     4 | True      |
| 4  | m1.large   | 16384 |   10 |         0 |     8 | True      |
| 5  | m2.large   | 16384 |   10 |        10 |     8 | True      |
| 6  | gpu1.small |  8192 |   20 |         0 |     4 | True      |
+----+------------+-------+------+-----------+-------+-----------+
[5] 任意の Openstack ユーザーで GPU インスタンスを作成して動作確認します。
[cent@dlp ~(keystone)]$
openstack network list

+--------------------------------------+---------+--------------------------------------+
| ID                                   | Name    | Subnets                              |
+--------------------------------------+---------+--------------------------------------+
| bd05d358-4e42-4bd5-93a3-4bf3cdb8382a | private | e98202e3-a16f-4262-94f1-3bfd5662253a |
| da68b9d2-83b3-4723-8ff1-43561841698c | public  | 79170337-cfa9-4df3-8dff-1202af9a8bba |
+--------------------------------------+---------+--------------------------------------+

[cent@dlp ~(keystone)]$
netID=$(openstack network list | grep private | awk '{ print $2 }')

[cent@dlp ~(keystone)]$
openstack server create --flavor gpu1.small --image CentOS-Stream9 --security-group secgroup01 --nic net-id=$netID --key-name mykey CentOS-9GPU
+-------------------------------------+----------------------------------------------------------+
| Field                               | Value                                                    |
+-------------------------------------+----------------------------------------------------------+
| OS-DCF:diskConfig                   | MANUAL                                                   |
| OS-EXT-AZ:availability_zone         | None                                                     |
| OS-EXT-SRV-ATTR:host                | None                                                     |
| OS-EXT-SRV-ATTR:hostname            | centos-9gpu                                              |
| OS-EXT-SRV-ATTR:hypervisor_hostname | None                                                     |
| OS-EXT-SRV-ATTR:instance_name       | None                                                     |
| OS-EXT-SRV-ATTR:kernel_id           | None                                                     |
| OS-EXT-SRV-ATTR:launch_index        | None                                                     |
| OS-EXT-SRV-ATTR:ramdisk_id          | None                                                     |
| OS-EXT-SRV-ATTR:reservation_id      | None                                                     |
| OS-EXT-SRV-ATTR:root_device_name    | None                                                     |
| OS-EXT-SRV-ATTR:user_data           | None                                                     |
| OS-EXT-STS:power_state              | N/A                                                      |
| OS-EXT-STS:task_state               | scheduling                                               |
| OS-EXT-STS:vm_state                 | building                                                 |
| OS-SRV-USG:launched_at              | None                                                     |
| OS-SRV-USG:terminated_at            | None                                                     |
| accessIPv4                          | None                                                     |
| accessIPv6                          | None                                                     |
| addresses                           | N/A                                                      |
| adminPass                           | wWEnMSn6vkha                                             |
| config_drive                        | None                                                     |
| created                             | 2025-05-23T02:15:52Z                                     |
| description                         | None                                                     |
| flavor                              | description=, disk='20', ephemeral='0', extra_specs..... |
| hostId                              | None                                                     |
| host_status                         | None                                                     |
| id                                  | 76d1d4da-337a-419c-a4df-a66a51b31854                     |
| image                               | CentOS-Stream9 (8bbf2495-d1b0-4c12-89d3-bf6ca71b2a2d)    |
| key_name                            | mykey                                                    |
| locked                              | None                                                     |
| locked_reason                       | None                                                     |
| name                                | CentOS-9GPU                                              |
| pinned_availability_zone            | None                                                     |
| progress                            | None                                                     |
| project_id                          | 6cd379304e2447da8514a66bb6cdfda5                         |
| properties                          | None                                                     |
| security_groups                     | name='84a1471f-8577-4556-8ebc-5ef39cce5fe0'              |
| server_groups                       | None                                                     |
| status                              | BUILD                                                    |
| tags                                |                                                          |
| trusted_image_certificates          | None                                                     |
| updated                             | 2025-05-23T02:15:52Z                                     |
| user_id                             | b6e98dc2822541dd8c4571ac4ed54778                         |
| volumes_attached                    |                                                          |
+-------------------------------------+----------------------------------------------------------+

[cent@dlp ~(keystone)]$
openstack server list

+--------------------------------------+-------------+---------+-------------------------------------+----------------+------------+
| ID                                   | Name        | Status  | Networks                            | Image          | Flavor     |
+--------------------------------------+-------------+---------+-------------------------------------+----------------+------------+
| 76d1d4da-337a-419c-a4df-a66a51b31854 | CentOS-9GPU | ACTIVE  | private=192.168.100.153             | CentOS-Stream9 | gpu1.small |
| 6ca15593-ee68-4d5b-9838-f24704448cfa | CentOS-9    | SHUTOFF | private=10.0.0.245, 192.168.100.189 | CentOS-Stream9 | m1.small   |
+--------------------------------------+-------------+---------+-------------------------------------+----------------+------------+

[cent@dlp ~(keystone)]$
openstack floating ip create public

+---------------------+--------------------------------------+
| Field               | Value                                |
+---------------------+--------------------------------------+
| created_at          | 2025-05-23T02:17:22Z                 |
| description         |                                      |
| dns_domain          |                                      |
| dns_name            |                                      |
| fixed_ip_address    | None                                 |
| floating_ip_address | 10.0.0.230                           |
| floating_network_id | da68b9d2-83b3-4723-8ff1-43561841698c |
| id                  | abd2d0b7-5ec0-4557-8d8f-d2ac285fc311 |
| name                | 10.0.0.230                           |
| port_details        | None                                 |
| port_id             | None                                 |
| project_id          | 6cd379304e2447da8514a66bb6cdfda5     |
| qos_policy_id       | None                                 |
| revision_number     | 0                                    |
| router_id           | None                                 |
| status              | DOWN                                 |
| subnet_id           | None                                 |
| tags                | []                                   |
| updated_at          | 2025-05-23T02:17:22Z                 |
+---------------------+--------------------------------------+

[cent@dlp ~(keystone)]$
openstack server add floating ip CentOS-9GPU 10.0.0.230

[cent@dlp ~(keystone)]$
ssh centos@10.0.0.230

The authenticity of host '10.0.0.239 (10.0.0.239)' can't be established.
ED25519 key fingerprint is SHA256:ATOWwdcS70oUn3PGRpAyDgOOWKNVfynDaycBr/9HO8o.
This key is not known by any other names
Are you sure you want to continue connecting (yes/no/[fingerprint])? yes
Warning: Permanently added '10.0.0.239' (ED25519) to the list of known hosts.
[centos@centos-9gpu ~]$
[centos@centos-9gpu ~]$
lspci | grep -i nvidia

05:00.0 VGA compatible controller: NVIDIA Corporation GA104 [GeForce RTX 3060] (rev a1)
関連コンテンツ