OpenStack Dalmatian : Compute ノードを追加する (GPU)2024/10/10 |
GPU を搭載した Compute ノードを追加して、仮想マシンインスタンスで GPU が利用できるように設定します。 当例では以下のような環境を例に、新たに GPU を搭載した [node02.srv.world] を Compute ノードとして追加します。 ------------+--------------------------+--------------------------+------------ | | | eth0|10.0.0.30 eth0|10.0.0.50 eth0|10.0.0.51 +-----------+-----------+ +-----------+-----------+ +-----------+-----------+ | [ dlp.srv.world ] | | [ network.srv.world ] | | [ node01.srv.world ] | | (Control Node) | | (Network Node) | | (Compute Node) | | | | | | | | MariaDB RabbitMQ | | Open vSwitch | | Libvirt | | Memcached Nginx | | Neutron Server | | Nova Compute | | Keystone httpd | | OVN-Northd | | Open vSwitch | | Glance Nova API | | Nginx | | OVN Metadata Agent | | | | | | OVN-Controller | +-----------------------+ +-----------------------+ +-----------------------+ ------------+------------ | eth0|10.0.0.52 +-----------+-----------+ | [ node02.srv.world ] | | (Compute Node (GPU)) | | | | Libvirt | | Nova Compute | | Open vSwitch | | OVN Metadata Agent | | OVN-Controller | +-----------------------+ |
[1] |
追加する Compute ノードに こちらの [1] を参考にして GPU パススルーの設定を適用しておきます。 |
[2] |
追加する ノードを こちらを参考にして Openstack クラスターに Compute ノードとして追加しておきます。 |
[3] | 追加した Nova-Compute に、GPU パススルー用の設定をします。 |
[root@node02 ~]# lspci -nn | grep -i nvidia 81:00.0 VGA compatible controller [0300]: NVIDIA Corporation GA104 [GeForce RTX 3060] [10de:2487] (rev a1) 81:00.1 Audio device [0403]: NVIDIA Corporation GA104 High Definition Audio Controller [10de:228b] (rev a1)
[root@node02 ~]#
vi /etc/nova/nova.conf
# 最終行に追記
# パススルーしたいデバイスの [vendor_id], [product_id] を追記
[pci]
passthrough_whitelist = { "vendor_id": "10de", "product_id": "2487" }
systemctl restart openstack-nova-compute
|
[4] | Control ノードで Nova の設定を変更します。 |
[root@dlp ~(keystone)]#
vi /etc/nova/nova.conf
# 最終行に追記
# 対象の Compute ノードでパススルー設定したデバイスの [vendor_id], [product_id] を追記
# [name] は任意の名称
[pci]
alias: { "vendor_id":"10de", "product_id":"2487", "device_type":"type-PCI", "name":"RTX-3060" }
[filter_scheduler]
enabled_filters = PciPassthroughFilter
[root@dlp ~(keystone)]#
systemctl restart openstack-nova-api openstack-nova-conductor openstack-nova-scheduler
# GPU 用の [flavor] 作成 [root@dlp ~(keystone)]# openstack flavor create --id 6 --vcpus 4 --ram 8192 --disk 20 --property "pci_passthrough:alias"="RTX-3060:1" gpu1.small +----------------------------+------------------------------------+ | Field | Value | +----------------------------+------------------------------------+ | OS-FLV-DISABLED:disabled | False | | OS-FLV-EXT-DATA:ephemeral | 0 | | description | None | | disk | 20 | | id | 6 | | name | gpu1.small | | os-flavor-access:is_public | True | | properties | pci_passthrough:alias='RTX-3060:1' | | ram | 8192 | | rxtx_factor | 1.0 | | swap | 0 | | vcpus | 4 | +----------------------------+------------------------------------+[root@dlp ~(keystone)]# openstack flavor list +----+------------+-------+------+-----------+-------+-----------+ | ID | Name | RAM | Disk | Ephemeral | VCPUs | Is Public | +----+------------+-------+------+-----------+-------+-----------+ | 1 | m1.tiny | 2048 | 10 | 0 | 1 | True | | 2 | m1.small | 4096 | 10 | 0 | 2 | True | | 3 | m1.medium | 8192 | 10 | 0 | 4 | True | | 4 | m1.large | 16384 | 10 | 0 | 8 | True | | 5 | m2.large | 16384 | 10 | 10 | 8 | True | | 6 | gpu1.small | 8192 | 20 | 0 | 4 | True | +----+------------+-------+------+-----------+-------+-----------+ |
[5] | 任意の Openstack ユーザーで GPU インスタンスを作成して動作確認します。 |
[cent@dlp ~(keystone)]$ openstack network list +--------------------------------------+---------+--------------------------------------+ | ID | Name | Subnets | +--------------------------------------+---------+--------------------------------------+ | 7302c07e-c543-4a85-a955-d6cc8a4e4011 | public | 8fbc394a-e821-47d8-8efc-d5f7c7db0ff8 | | aa200aa9-ddb2-4d89-aa37-ce3e3a939421 | private | dfb0949d-d775-4c17-9b07-b01ac7d71aac | +--------------------------------------+---------+--------------------------------------+[cent@dlp ~(keystone)]$ netID=$(openstack network list | grep private | awk '{ print $2 }')
[cent@dlp ~(keystone)]$
[cent@dlp ~(keystone)]$ openstack server create --flavor gpu1.small --image CentOS-Stream9 --security-group secgroup01 --nic net-id=$netID --key-name mykey CentOS-St9GPU
openstack server list +--------------------------------------+---------------+---------+-------------------------------------+----------------+------------+ | ID | Name | Status | Networks | Image | Flavor | +--------------------------------------+---------------+---------+-------------------------------------+----------------+------------+ | 23eae780-5600-4045-aebc-eaa5d2beb092 | CentOS-St9GPU | ACTIVE | private=192.168.100.171 | CentOS-Stream9 | gpu1.small | | 3d753408-6705-4cbf-acd8-2e09d975b973 | CentOS-St9 | SHUTOFF | private=10.0.0.206, 192.168.100.103 | CentOS-Stream9 | m1.small | +--------------------------------------+---------------+---------+-------------------------------------+----------------+------------+[cent@dlp ~(keystone)]$ openstack floating ip create public +---------------------+--------------------------------------+ | Field | Value | +---------------------+--------------------------------------+ | created_at | 2024-10-10T05:00:47Z | | description | | | dns_domain | | | dns_name | | | fixed_ip_address | None | | floating_ip_address | 10.0.0.239 | | floating_network_id | 7302c07e-c543-4a85-a955-d6cc8a4e4011 | | id | 16f419bb-936f-4faa-bedb-5b4829cd5b8a | | name | 10.0.0.239 | | port_details | None | | port_id | None | | project_id | 756e2b4937c44d3991028e294f92d89a | | qos_policy_id | None | | revision_number | 0 | | router_id | None | | status | DOWN | | subnet_id | None | | tags | [] | | updated_at | 2024-10-10T05:00:47Z | +---------------------+--------------------------------------+[cent@dlp ~(keystone)]$ openstack server add floating ip CentOS-St9GPU 10.0.0.239 [cent@dlp ~(keystone)]$ ssh centos@10.0.0.239
The authenticity of host '10.0.0.239 (10.0.0.239)' can't be established.
ED25519 key fingerprint is SHA256:ATOWwdcS70oUn3PGRpAyDgOOWKNVfynDaycBr/9HO8o.
This key is not known by any other names
Are you sure you want to continue connecting (yes/no/[fingerprint])? yes
Warning: Permanently added '10.0.0.239' (ED25519) to the list of known hosts.
[centos@centos-st9gpu ~]$
[centos@centos-st9gpu ~]$ lspci | grep -i nvidia 00:05.0 VGA compatible controller: NVIDIA Corporation GA104 [GeForce RTX 3060] (rev a1) |
Sponsored Link |
|