Charm doesn't initialise SRIOV gpu devices on nvidia-gpu versions >= 11.0 (was: charm does not create gpu virtual functions)
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
OpenStack Nova Compute NVIDIA vGPU Plugin Charm | Triaged | Medium | Unassigned |
Bug Description
Tested this on a node with an Nvidia Tesla A10 card with vGPU software: nvidia-
channel : yoga/stable
OS: jammy
After attaching the vGPU driver to nova-compute-
Executing nvidia-smi on the node confirms the driver is installed successfully
However juju run-action --wait nova-compute-
ubuntu@
unit-nova-
UnitId: nova-compute-
id: "346"
results:
output: ""
status: completed
Inside the node, gpu card bus info is 25:00.0
ubuntu@
25:00.0 3D controller [0302]: NVIDIA Corporation GA102GL [A10] [10de:2236] (rev a1)
But no virtual functions are created
cd /sys/bus/
ls | grep virtfn
I need to create the virtual functions manually
/usr/
After that I can see the virtual functions
ls | grep virtfn
virtfn0
virtfn1
virtfn10
virtfn11
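The check above can be sketched as a small shell helper. This is only an illustration, assuming the standard Linux sysfs layout in which an SR-IOV capable PCI device directory (for example /sys/bus/pci/devices/0000:25:00.0, which is an assumption because the path in this report is truncated) holds `virtfnN` links and the kernel attributes `sriov_totalvfs` and `sriov_numvfs`. The function names `count_virtfns` and `show_sriov_state` are hypothetical and are not part of the charm or the NVIDIA driver.

```shell
#!/bin/sh
# Hypothetical helper: count the virtfn* entries under a PCI device directory.
# dev_dir is assumed to be the device's sysfs directory (an assumption, since
# the path used in the report is truncated).
count_virtfns() {
    dev_dir=$1
    count=0
    for link in "$dev_dir"/virtfn*; do
        # An unmatched glob stays literal, so check the entry really exists.
        if [ -e "$link" ] || [ -L "$link" ]; then
            count=$((count + 1))
        fi
    done
    echo "$count"
}

# Hypothetical helper: print the kernel SR-IOV counters when the device
# exposes them, followed by the number of virtfn links found.
show_sriov_state() {
    dev_dir=$1
    for attr in sriov_totalvfs sriov_numvfs; do
        if [ -r "$dev_dir/$attr" ]; then
            echo "$attr=$(cat "$dev_dir/$attr")"
        fi
    done
    echo "virtfn_links=$(count_virtfns "$dev_dir")"
}
```

Calling `show_sriov_state` on the device directory before and after the manual step would show whether the virtual functions were created; a `virtfn_links=0` result matches the empty `ls | grep virtfn` output above.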
Re-run list-vgpu-types
ubuntu@
unit-nova-
UnitId: nova-compute-
id: "348"
results:
output: |-
nvidia-588, 0000:25:02.3, NVIDIA A10-1B, num_heads=4, frl_config=45, framebuffer=1024M, max_resolution=
nvidia-589, 0000:25:02.3, NVIDIA A10-2B, num_heads=4, frl_config=45, framebuffer=2048M, max_resolution=
nvidia-590, 0000:25:02.3, NVIDIA A10-1Q, num_heads=4, frl_config=60, framebuffer=1024M, max_resolution=
nvidia-591, 0000:25:02.3, NVIDIA A10-2Q, num_heads=4, frl_config=60, framebuffer=2048M, max_resolution=
description: updated
summary: charm does not create vgpu functions → charm does not create gpu virtual functions
summary: charm does not create gpu virtual functions → can not list vgpu types
summary: can not list vgpu types → charm does not create gpu virtual functions
Changed in charm-nova-compute-nvidia-vgpu:
status: Incomplete → New
summary: Charm doesn't initialise the driver fully on nvidia-gpu versions >= 11.0 (was: charm does not create gpu virtual functions) → Charm doesn't initialise SRIOV gpu devices on nvidia-gpu versions >= 11.0 (was: charm does not create gpu virtual functions)
Hi Andy
Unfortunately, there's some more information we need. Please could you have a read of https://docs.openstack.org/charm-guide/latest/community/software-bug.html? Obviously, not everything will be relevant, but it would be good to get log files from all the relevant software, etc.
Thanks.