Ephemeral environments using the wrong kernel

Bug #1656425 reported by Lee Trager on 2017-01-13
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
MAAS
Critical
Lee Trager
2.1
Critical
Lee Trager
Trunk
Critical
Lee Trager

Bug Description

Intermittently the ephemeral environment booted by MAAS uses the wrong kernel. The deployed system does use the right kernel. Clearing the boot-resources on the rack(rm -rf /var/lib/maas/boot-resources/*) normally fixes the problem.

Related branches

Lee Trager (ltrager) wrote :

This appears to be a problem with how the rack controller is creating the hard links in /var/lib/maas/boot-resources/current. The attached script stops the rack, clears the boot-resources, restarts the rack, once the rack finishes checks that the ga-16.04 kernel is hard linked to the correct file using the SHA256 found in metadata stream. 32/100 times the rack controller links the ga-16.04 kernel incorrectly. I've confirmed the region is correctly downloading and serving all files.

Lee Trager (ltrager) wrote :

Updated test script which validates all Xenial kernels using the latest hashes as of today.

Changed in maas:
status: Confirmed → Fix Committed
Changed in maas:
milestone: 2.1.3 → 2.1.4
To post a comment you must log in.
This report contains Public information  Edit
Everyone can see this information.

Other bug subscribers