Activity log for bug #1900668

Date Who What changed Old value New value Message
2020-10-20 08:17:49 Michał Ajduk bug added bug
2020-10-20 08:29:12 Michał Ajduk bug added subscriber Canonical Field High
2020-10-20 08:41:57 Alberto Donato bug task added grub (Ubuntu)
2020-10-21 07:57:29 Gábor Mészáros description

Old value:

# ENVIRONMENT

MAAS version (SNAP):
  maas 2.8.2-8577-g.a3e674063 8980 2.8/stable canonical✓ -

MAAS was cleanly installed. KVM POD setup works.

MAAS status:
  bind9    RUNNING  pid 9258, uptime 15:13:02
  dhcpd    RUNNING  pid 26173, uptime 15:09:30
  dhcpd6   STOPPED  Not started
  http     RUNNING  pid 19526, uptime 15:10:49
  ntp      RUNNING  pid 27147, uptime 14:02:18
  proxy    RUNNING  pid 25909, uptime 15:09:33
  rackd    RUNNING  pid 7219, uptime 15:13:20
  regiond  RUNNING  pid 7221, uptime 15:13:20
  syslog   RUNNING  pid 19634, uptime 15:10:48

Servers: HPE DL380 Gen10 configured to UEFI boot via PXE (PXE legacy mode), Secure boot disabled. All servers (18) experience the described problem.

The UEFI boot menu contains 2 entries allowing one to select the PXE mode:
- HPE Ethernet 1Gb 4-port 366FLR Adapter - NIC (HTTP(S) IPv4)
- HPE Ethernet 1Gb 4-port 366FLR Adapter - NIC (PXE IPv4)

# PROBLEM DESCRIPTION

Similar to https://bugs.launchpad.net/maas/+bug/1899840

PXE boot stalls after downloading grubx64.efi but before downloading grub.cfg:
  2020-10-20 07:18:21 provisioningserver.rackdservices.tftp: [info] bootx64.efi requested by 10.216.240.69
  2020-10-20 07:18:21 provisioningserver.rackdservices.tftp: [info] bootx64.efi requested by 10.216.240.69
  2020-10-20 07:18:21 provisioningserver.rackdservices.tftp: [info] grubx64.efi requested by 10.216.240.69

Grub drops to the grub prompt. Within the grub prompt:
- net_ls_addr shows the correct IP address
- net_ls_routes shows the correct routing
- net_bootps (that should initiate a DHCP request from grub) fails with the message: failed to send packet

We've also noticed that in a working scenario grub, just after start-up but before downloading the grub configuration, sends an ARP request for the MAAS IP:
  13517 2020-10-19 13:53:38.864937 HewlettP_02:3d:e8 Broadcast ARP 60 Who has 10.216.240.1? Tell 10.216.240.51
and MAAS replies.

When the boot stalls, one of the symptoms is that grub does not send the ARP request for the MAAS IP. It also does not reply to MAAS ARP requests. It looks as if the EFI_NET stack was failing.

# WORKAROUNDS

1) During the PXE boot, send ARP requests from MAAS to query the node IP. This seems to prevent the node from losing connectivity. Tested 4 times on independent nodes.
2) Custom-built grub:
  grub-mkimage -c grub.conf -o grubx64.efi -O x86_64-efi -p /grub normal configfile tftp memdisk boot diskfilter efifwsetup efi_gop efinet ls net normal part_gpt tar ext2 linuxefi http echo chain search search_fs_uuid search_label search_fs_file test tr true minicmd
  Grub version: 2.02-2ubuntu8.18
  The grub PXE image built in the way described above works on all nodes (18) all the time (4 times tested).

When I included the grub module linix.mod, I managed to reproduce the described problem. It seems that the issue may be related to https://savannah.gnu.org/bugs/?func=detailitem&item_id=50715

New value:

# ENVIRONMENT

MAAS version (SNAP):
  maas 2.8.2-8577-g.a3e674063 8980 2.8/stable canonical✓ -

MAAS was cleanly installed. KVM POD setup works.

MAAS status:
  bind9    RUNNING  pid 9258, uptime 15:13:02
  dhcpd    RUNNING  pid 26173, uptime 15:09:30
  dhcpd6   STOPPED  Not started
  http     RUNNING  pid 19526, uptime 15:10:49
  ntp      RUNNING  pid 27147, uptime 14:02:18
  proxy    RUNNING  pid 25909, uptime 15:09:33
  rackd    RUNNING  pid 7219, uptime 15:13:20
  regiond  RUNNING  pid 7221, uptime 15:13:20
  syslog   RUNNING  pid 19634, uptime 15:10:48

Servers: HPE DL380 Gen10 configured to UEFI boot via PXE (PXE legacy mode), Secure boot disabled. All servers (18) experience the described problem.

The UEFI boot menu contains 2 entries allowing one to select the PXE mode:
- HPE Ethernet 1Gb 4-port 366FLR Adapter - NIC (HTTP(S) IPv4)
- HPE Ethernet 1Gb 4-port 366FLR Adapter - NIC (PXE IPv4)

# PROBLEM DESCRIPTION

Similar to https://bugs.launchpad.net/maas/+bug/1899840

PXE boot stalls after downloading grubx64.efi but before downloading grub.cfg:
  2020-10-20 07:18:21 provisioningserver.rackdservices.tftp: [info] bootx64.efi requested by 10.216.240.69
  2020-10-20 07:18:21 provisioningserver.rackdservices.tftp: [info] bootx64.efi requested by 10.216.240.69
  2020-10-20 07:18:21 provisioningserver.rackdservices.tftp: [info] grubx64.efi requested by 10.216.240.69

Grub drops to the grub prompt. Within the grub prompt:
- net_ls_addr shows the correct IP address
- net_ls_routes shows the correct routing
- net_bootps (that should initiate a DHCP request from grub) fails with the message: failed to send packet

We've also noticed that in a working scenario grub, just after start-up but before downloading the grub configuration, sends an ARP request for the MAAS IP:
  13517 2020-10-19 13:53:38.864937 HewlettP_02:3d:e8 Broadcast ARP 60 Who has 10.216.240.1? Tell 10.216.240.51
and MAAS replies.

When the boot stalls, one of the symptoms is that grub does not send the ARP request for the MAAS IP. It also does not reply to MAAS ARP requests. It looks as if the EFI_NET stack was failing.

# WORKAROUNDS

1) During the PXE boot, send ARP requests from MAAS to query the node IP. This seems to prevent the node from losing connectivity. Tested 4 times on independent nodes.
2) Custom-built grub:
  grub-mkimage -c grub.conf -o grubx64.efi -O x86_64-efi -p /grub normal configfile tftp memdisk boot diskfilter efifwsetup efi_gop efinet ls net normal part_gpt tar ext2 linuxefi http echo chain search search_fs_uuid search_label search_fs_file test tr true minicmd
  Grub version: 2.02-2ubuntu8.18
  The grub PXE image built in the way described above works on all nodes (18) all the time (4 times tested).

When I included the grub module linux.mod, I managed to reproduce the described problem. It seems that the issue may be related to https://savannah.gnu.org/bugs/?func=detailitem&item_id=50715
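The stall signature in the description (grubx64.efi is fetched over TFTP, but no grub.cfg request follows) can be checked mechanically against rackd logs. The following is a minimal sketch, not part of the original report. It assumes that grub.cfg requests are logged in the same "<file> requested by <client>" form as the lines quoted above and that the requested path contains "grub.cfg". The function name stalled_clients is hypothetical.

```python
import re

# Pattern modelled on the rackd TFTP log lines quoted in the description.
# Assumption: grub.cfg requests are logged in the same form.
TFTP_LINE = re.compile(
    r"provisioningserver\.rackdservices\.tftp: \[info\] "
    r"(?P<file>\S+) requested by (?P<client>\S+)"
)


def stalled_clients(log_lines):
    """Return clients whose latest grubx64.efi request was never
    followed by a request for a file whose path contains 'grub.cfg'."""
    pending = {}  # client -> log line of the unanswered grubx64.efi request
    for line in log_lines:
        match = TFTP_LINE.search(line)
        if match is None:
            continue  # not a TFTP request line
        name, client = match.group("file"), match.group("client")
        if name.endswith("grubx64.efi"):
            pending[client] = line      # boot reached grub; config not yet seen
        elif "grub.cfg" in name:
            pending.pop(client, None)   # config was requested; boot progressed
    return list(pending)


if __name__ == "__main__":
    prefix = "2020-10-20 07:18:21 provisioningserver.rackdservices.tftp: [info] "
    sample = [
        prefix + "bootx64.efi requested by 10.216.240.69",
        prefix + "grubx64.efi requested by 10.216.240.69",
    ]
    print(stalled_clients(sample))
```

A client listed by this check has only reached the point where grub was downloaded. Whether it actually dropped to the grub prompt still has to be confirmed on the node console.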
2020-10-21 12:51:13 Jouni Mettala affects grub (Ubuntu) grub2 (Ubuntu)
2020-11-12 13:57:01 Junien F bug added subscriber The Canonical Sysadmins
2020-11-13 06:39:03 Nobuto Murata bug added subscriber Nobuto Murata
2020-11-13 16:04:06 David Britton bug added subscriber Canonical Field Critical
2020-11-13 16:04:10 David Britton removed subscriber Canonical Field High
2020-11-13 16:20:18 David Britton maas: status New Invalid
2020-11-13 16:33:27 Nobuto Murata bug task added maas-images
2020-11-16 15:21:48 Matthieu Clemenceau tags rls-hh-incoming
2020-11-16 22:38:03 Stéphane Graber attachment added Screenshot from 2020-11-16 17-36-41.png https://bugs.launchpad.net/maas-images/+bug/1900668/+attachment/5434920/+files/Screenshot%20from%202020-11-16%2017-36-41.png
2020-11-16 22:38:20 Stéphane Graber attachment added Screenshot from 2020-11-16 17-36-52.png https://bugs.launchpad.net/maas-images/+bug/1900668/+attachment/5434921/+files/Screenshot%20from%202020-11-16%2017-36-52.png
2020-11-18 15:43:54 Julian Andres Klode grub2 (Ubuntu): status New Incomplete
2020-11-19 14:34:35 Rod Smith bug added subscriber Rod Smith
2020-11-19 16:15:31 Matthieu Clemenceau tags rls-hh-incoming fr-943 rls-hh-incoming
2020-11-19 16:20:24 Brian Murray tags fr-943 rls-hh-incoming fr-943
2020-11-19 17:33:20 Jeff Lane tags fr-943 fr-943 hwcert-server
2020-11-27 16:15:51 Julian Andres Klode grub2 (Ubuntu): status Incomplete New
2020-12-07 14:41:46 Dimitri John Ledkov grub2 (Ubuntu): status New Incomplete
2020-12-07 14:41:48 Dimitri John Ledkov maas-images: status New Incomplete
2020-12-08 00:53:14 Nobuto Murata maas-images: status Incomplete New
2020-12-08 00:53:17 Nobuto Murata grub2 (Ubuntu): status Incomplete New
2020-12-08 16:19:10 Dimitri John Ledkov grub2 (Ubuntu): status New Confirmed
2020-12-08 19:21:15 Joelene M. Wheat bug watch added http://savannah.gnu.org/bugs/?50715
2020-12-09 13:57:53 Adam Collard bug task deleted maas
2020-12-10 00:24:00 Nobuto Murata tags fr-943 hwcert-server fr-943 hwcert-server ps5
2021-01-13 11:42:15 Dimitri John Ledkov grub2 (Ubuntu): status Confirmed Incomplete
2021-01-14 15:18:07 Dimitri John Ledkov maas-images: status New Incomplete
2021-02-10 11:32:41 Dimitri John Ledkov grub2 (Ubuntu): status Incomplete Fix Released
2021-02-10 11:32:43 Dimitri John Ledkov maas-images: status Incomplete Fix Released
2022-11-04 15:59:51 Mattias Andersson bug watch added http://savannah.gnu.org/bugs/?63245