Attaching virtual GPU devices to guests in nova
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
OpenStack Compute (nova) | Expired | Undecided | Unassigned |
Bug Description
This bug tracker is for errors with the documentation. Use the following as a template, and remove or add fields as you see fit. Convert [ ] into [x] to check boxes:
- [X] This is a doc addition request.
Hi, we ran into a problem while using nova (Queens), configured with the vGPU feature, to create several instances. Multiple instances appear to contend for the same vGPU resource: in our case, a new instance is assigned the exact vGPU that an existing instance has already acquired. Here is the error reported in the log:
"libvirt.
Apparently, nova is trying to allocate a vGPU resource that is already in use by another instance. We also ruled out the possibility that the host simply does not have enough vGPU resources: 25% of the instances ended up in an error state during creation, even though the instances we created needed only 50% of the host's vGPU resources in total. From our perspective, the problem lies with nova-scheduler. Any idea how to resolve this?
Thanks
Ruien Zhang
<email address hidden>
-------
Release: 21.1.0.dev214 on 2020-04-28 20:09:00
SHA: d19f1ac47b0a5fe
Source: https:/
URL: https:/
tags: added: compute libvirt vgpu
Hi, thanks for the bug report.
To better understand what's going on, we need additional details:
1. Your nova.conf file (specifically your [devices] section, as well as each specific device section)
2. Hardware details (which GPU model you're using)
3. nova-compute and nova-scheduler logs
4. The flavor(s) the instances were created with.
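For reference, the configuration asked for in item 1 usually looks something like the fragment below. This is only an illustrative sketch: the vGPU type name and PCI addresses are placeholders, not values from this report, and the per-type section exists only in releases newer than Queens.

```ini
# Illustrative nova.conf fragment for a compute node.
# Every value here is a placeholder (assumption), not taken from this bug.

[devices]
# vGPU type(s) the compute node should expose. The name must match one of
# the mdev types reported by the host for the physical GPU.
enabled_vgpu_types = nvidia-35

# Per-type device section (not available in Queens): restricts the type
# above to specific physical GPUs, identified by PCI address.
[vgpu_nvidia-35]
device_addresses = 0000:84:00.0,0000:85:00.0
```

A flavor that requests a vGPU normally carries the `resources:VGPU=1` extra spec, so posting the flavor properties alongside this section makes it easier to compare what was requested with what the host exposes.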
I've set this bug as incomplete for now. Please set it back to NEW when you reply, to make sure it gets looked at.
Thanks!