Activity log for bug #2060899

Date Who What changed Old value New value Message
2024-04-11 03:04:28 Weichen Wu bug added bug
2024-04-11 03:04:50 Weichen Wu description [Summary] the test will check autonomous power state transition on the target disk [Steps to reproduce] 1. Install ubuntu 22.04 2. Install linux-nvidia and boot with nvidia kernel 3. Install checkbox 4. Run checkbox desktop certify test case $ checkbox-cli run com.canonical.certification::device com.canonical.certification::disk/apste_support_on_nvme0 [Expected result] test passed [Actual result] failed with error log below [Failure rate] 2/2 [Checkbox job `com.canonical.certification::disk/apste_support_on_nvme0` output] stderr ------ NVMe status: INVALID_FIELD: A reserved coded value or an unsupported value in a defined field(0x2) [Additional information] CID: 202307-31886 SKU: DGX Station A100 system-manufacturer: NVIDIA system-product-name: DGX Station A100 920-23487-2531-0R0 bios-version: L10.16 CPU: AMD EPYC 7742 64-Core Processor (128x) GPU: 01:00.0 3D controller [0302]: NVIDIA Corporation GA100 [A100 SXM4 80GB] [10de:20b2] (rev a1) 46:00.0 VGA compatible controller [0300]: ASPEED Technology, Inc. ASPEED Graphics Family [1a03:2000] (rev 41) 47:00.0 3D controller [0302]: NVIDIA Corporation GA100 [A100 SXM4 80GB] [10de:20b2] (rev a1) 81:00.0 3D controller [0302]: NVIDIA Corporation GA100 [A100 SXM4 80GB] [10de:20b2] (rev a1) c1:00.0 VGA compatible controller [0300]: NVIDIA Corporation TU117GLM [Quadro T1000 Mobile] [10de:1fb0] (rev a1) c2:00.0 3D controller [0302]: NVIDIA Corporation GA100 [A100 SXM4 80GB] [10de:20b2] (rev a1) nvidia-driver: 535.161.07 nvidia-vbios: 92.00.38.00.01 kernel-version: 5.15.0-1047-nvidia [Stage] Issue reported and logs collected right after it happened [Summary] the test will check autonomous power state transition on the target disk [Steps to reproduce] 1. Install ubuntu 22.04 2. Install linux-nvidia and boot with nvidia kernel 3. Install checkbox 4. Run checkbox desktop certify test case $ checkbox-cli run com.canonical.certification::device com.canonical.certification::disk/apste_support_on_nvme0 [Expected result] test passed [Actual result] failed with error log below [Failure rate] 2/2 [Checkbox job `com.canonical.certification::disk/apste_support_on_nvme0` output] stderr ------ NVMe status: INVALID_FIELD: A reserved coded value or an unsupported value in a defined field(0x2) [Additional information] CID: 202307-31886 SKU: DGX Station A100 system-manufacturer: NVIDIA system-product-name: DGX Station A100 920-23487-2531-0R0 bios-version: L10.16 CPU: AMD EPYC 7742 64-Core Processor (128x) GPU: 01:00.0 3D controller [0302]: NVIDIA Corporation GA100 [A100 SXM4 80GB] [10de:20b2] (rev a1) 46:00.0 VGA compatible controller [0300]: ASPEED Technology, Inc. ASPEED Graphics Family [1a03:2000] (rev 41) 47:00.0 3D controller [0302]: NVIDIA Corporation GA100 [A100 SXM4 80GB] [10de:20b2] (rev a1) 81:00.0 3D controller [0302]: NVIDIA Corporation GA100 [A100 SXM4 80GB] [10de:20b2] (rev a1) c1:00.0 VGA compatible controller [0300]: NVIDIA Corporation TU117GLM [Quadro T1000 Mobile] [10de:1fb0] (rev a1) c2:00.0 3D controller [0302]: NVIDIA Corporation GA100 [A100 SXM4 80GB] [10de:20b2] (rev a1) nvidia-driver: 535.161.07 nvidia-vbios: 92.00.38.00.01 kernel-version: 5.15.0-1047-nvidia [Stage] Issue reported and logs collected right after it happened
2024-04-11 03:07:01 Weichen Wu attachment added checkbox-session.tgz https://bugs.launchpad.net/bugs/2060899/+attachment/5763569/+files/checkbox-session.tgz
2024-04-11 03:07:03 Weichen Wu attachment added snap_list.log https://bugs.launchpad.net/bugs/2060899/+attachment/5763570/+files/snap_list.log
2024-04-11 03:07:18 Weichen Wu attachment added acpidump.log https://bugs.launchpad.net/bugs/2060899/+attachment/5763571/+files/acpidump.log
2024-04-11 04:18:21 Ubuntu Foundations Team Bug Bot tags bot-comment
2024-04-11 06:30:31 Weichen Wu description [Summary] the test will check autonomous power state transition on the target disk [Steps to reproduce] 1. Install ubuntu 22.04 2. Install linux-nvidia and boot with nvidia kernel 3. Install checkbox 4. Run checkbox desktop certify test case $ checkbox-cli run com.canonical.certification::device com.canonical.certification::disk/apste_support_on_nvme0 [Expected result] test passed [Actual result] failed with error log below [Failure rate] 2/2 [Checkbox job `com.canonical.certification::disk/apste_support_on_nvme0` output] stderr ------ NVMe status: INVALID_FIELD: A reserved coded value or an unsupported value in a defined field(0x2) [Additional information] CID: 202307-31886 SKU: DGX Station A100 system-manufacturer: NVIDIA system-product-name: DGX Station A100 920-23487-2531-0R0 bios-version: L10.16 CPU: AMD EPYC 7742 64-Core Processor (128x) GPU: 01:00.0 3D controller [0302]: NVIDIA Corporation GA100 [A100 SXM4 80GB] [10de:20b2] (rev a1) 46:00.0 VGA compatible controller [0300]: ASPEED Technology, Inc. ASPEED Graphics Family [1a03:2000] (rev 41) 47:00.0 3D controller [0302]: NVIDIA Corporation GA100 [A100 SXM4 80GB] [10de:20b2] (rev a1) 81:00.0 3D controller [0302]: NVIDIA Corporation GA100 [A100 SXM4 80GB] [10de:20b2] (rev a1) c1:00.0 VGA compatible controller [0300]: NVIDIA Corporation TU117GLM [Quadro T1000 Mobile] [10de:1fb0] (rev a1) c2:00.0 3D controller [0302]: NVIDIA Corporation GA100 [A100 SXM4 80GB] [10de:20b2] (rev a1) nvidia-driver: 535.161.07 nvidia-vbios: 92.00.38.00.01 kernel-version: 5.15.0-1047-nvidia [Stage] Issue reported and logs collected right after it happened [Summary] the test will check autonomous power state transition on the target disk affected test case: disk/apste_support_on_nvme0 [Steps to reproduce] 1. Install Ubuntu 22.04 desktop 2. Install linux-nvidia and boot with nnvidia-kernel 5.15.0-1047-nvidia 3. Follow the instructions to modify /etc/X11/xorg.conf to Section "Devie" Identifier "Device0" Driver "nvidia" VendorName "NVIDIA Corporation" BusID "PCI:193:0:0" EndSection 4. Run test command $ nvme get-feature -f 0x0c -H /dev/nvme0 | grep '(APSTE).*Enabled' && test -e /sys/class/nvme/nvme0/power/pm_qos_latency_tolerance_us [Expected result] test passed [Actual result] failed with error log below [Failure rate] 2/2 [Checkbox job `com.canonical.certification::disk/apste_support_on_nvme0` output] stderr ------ NVMe status: INVALID_FIELD: A reserved coded value or an unsupported value in a defined field(0x2) [Additional information] CID: 202307-31886 SKU: DGX Station A100 system-manufacturer: NVIDIA system-product-name: DGX Station A100 920-23487-2531-0R0 bios-version: L10.16 CPU: AMD EPYC 7742 64-Core Processor (128x) GPU: 01:00.0 3D controller [0302]: NVIDIA Corporation GA100 [A100 SXM4 80GB] [10de:20b2] (rev a1) 46:00.0 VGA compatible controller [0300]: ASPEED Technology, Inc. ASPEED Graphics Family [1a03:2000] (rev 41) 47:00.0 3D controller [0302]: NVIDIA Corporation GA100 [A100 SXM4 80GB] [10de:20b2] (rev a1) 81:00.0 3D controller [0302]: NVIDIA Corporation GA100 [A100 SXM4 80GB] [10de:20b2] (rev a1) c1:00.0 VGA compatible controller [0300]: NVIDIA Corporation TU117GLM [Quadro T1000 Mobile] [10de:1fb0] (rev a1) c2:00.0 3D controller [0302]: NVIDIA Corporation GA100 [A100 SXM4 80GB] [10de:20b2] (rev a1) nvidia-driver: 535.161.07 nvidia-vbios: 92.00.38.00.01 kernel-version: 5.15.0-1047-nvidia [Stage] Issue reported and logs collected right after it happened