nova compute with allocated vgpu device failed to start after host reboot

Bug #1940641 reported by norman shen
16
This bug affects 3 people
Affects Status Importance Assigned to Milestone
OpenStack Compute (nova)
New
Undecided
Unassigned

Bug Description

Description
=====================

nova compute service failed to start after reboot, if there are vgpu
virtual machines beforehand.

Error log

2021-08-20 09:37:30.331 284159 DEBUG nova.virt.libvirt.volume.mount [None req-6ad4e06c-980e-4759-8b36-6c696e596dab - - - - -] Initialising _HostMountState generation 0 host_up /var/lib/openstack/lib/python3.8/site-packages/nova/virt/libvirt/volume/mount.py:131
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service [-] Error starting thread.: libvirt.libvirtError: Node device not found: no node device with matching name 'mdev_74527849_d08c_4243_b868_f84a1437c9b5'
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service Traceback (most recent call last):
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service File "/var/lib/openstack/lib/python3.8/site-packages/oslo_service/service.py", line 807, in run_service
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service service.start()
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service File "/var/lib/openstack/lib/python3.8/site-packages/nova/service.py", line 159, in start
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service self.manager.init_host()
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service File "/var/lib/openstack/lib/python3.8/site-packages/nova/compute/manager.py", line 1414, in init_host
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service self.driver.init_host(host=self.host)
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service File "/var/lib/openstack/lib/python3.8/site-packages/nova/virt/libvirt/driver.py", line 733, in init_host
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service self._recreate_assigned_mediated_devices()
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service File "/var/lib/openstack/lib/python3.8/site-packages/nova/virt/libvirt/driver.py", line 862, in _recreate_assigned_mediated_devices
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service dev_info = self._get_mediated_device_information(dev_name)
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service File "/var/lib/openstack/lib/python3.8/site-packages/nova/virt/libvirt/driver.py", line 7380, in _get_mediated_device_information
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service virtdev = self._host.device_lookup_by_name(devname)
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service File "/var/lib/openstack/lib/python3.8/site-packages/nova/virt/libvirt/host.py", line 1153, in device_lookup_by_name
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service return self.get_connection().nodeDeviceLookupByName(name)
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service File "/var/lib/openstack/lib/python3.8/site-packages/eventlet/tpool.py", line 190, in doit
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service result = proxy_call(self._autowrap, f, *args, **kwargs)
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service File "/var/lib/openstack/lib/python3.8/site-packages/eventlet/tpool.py", line 148, in proxy_call
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service rv = execute(f, *args, **kwargs)
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service File "/var/lib/openstack/lib/python3.8/site-packages/eventlet/tpool.py", line 129, in execute
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service six.reraise(c, e, tb)
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service File "/var/lib/openstack/lib/python3.8/site-packages/six.py", line 703, in reraise
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service raise value
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service File "/var/lib/openstack/lib/python3.8/site-packages/eventlet/tpool.py", line 83, in tworker
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service rv = meth(*args, **kwargs)
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service File "/var/lib/openstack/lib/python3.8/site-packages/libvirt.py", line 4614, in nodeDeviceLookupByName
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service if ret is None:raise libvirtError('virNodeDeviceLookupByName() failed', conn=self)
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service libvirt.libvirtError: Node device not found: no node device with matching name 'mdev_74527849_d08c_4243_b868_f84a1437c9b5'
2021-08-20 09:37:30.421 284159 ERROR oslo_service.service

Environment
============

nova: victoria
os ubuntu 20.04

Steps to Reproduce
===================

create vgpu virtual machines (mdev) and then reboot host.

Revision history for this message
norman shen (jshen28) wrote :
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.