subiquity crashes upon reusing a failed-to-assemble raid member partition

Bug #1835091 reported by Dimitri John Ledkov
This bug affects 1 person
Affects           Status        Importance  Assigned to  Milestone
curtin            Fix Released  High        Unassigned
subiquity         Fix Released  Medium      Unassigned
probert (Ubuntu)  Fix Released  High        Unassigned

Bug Description

subiquity crashes upon reusing a failed-to-assemble raid member partition

So, following up from the previous bug #1835087, I removed the second drive, such that I only had:
- grub-partition
- /boot ext4 partition
- a partition holding just one half of a raid0 (a single member)

That raid0 member got added to the failed-to-start md127 raid0, but otherwise it failed to assemble into a functioning raid.

Upon reusing that partition for ext4 /, mke2fs failed, as vda3 is "in use" by mdadm.

Somehow, the partial raid needs to be represented. Or we should try harder: remove the device from the raid, wipe the raid signatures, then run mke2fs.
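
For reference, the by-hand sequence would look roughly like this (a sketch only, assuming the stale member is /dev/vda3 held by md127, as in this report):

  mdadm --stop /dev/md127             # stop the partially assembled array
  mdadm --zero-superblock /dev/vda3   # erase the raid superblock on the member
  wipefs --all /dev/vda3              # clear any other leftover signatures
  mkfs.ext4 /dev/vda3                 # now mke2fs no longer sees the device as busy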

Attaching screenshots.


Revision history for this message
Dimitri John Ledkov (xnox) wrote :

The partition marked as unused is in fact used by an active md127 that failed to start.

Revision history for this message
Ryan Harper (raharper) wrote :

Interesting. Let's look at the json and see if we can figure out a way to indicate we have a partial raid, and either exclude partial raid members, or include the raid in the config with partial members so that members of the raid are forced to be wiped.

Revision history for this message
Dimitri John Ledkov (xnox) wrote :

Also the "wipe & do full disk install" crashes too. As it tries to do exclusive open of /dev/vda and fails as it is part of raid.

I guess I can destroy the raid by hand and unbreak the setup, but ew.

If a random person picks up an unused disk off the shelf, they should be able to install onto it, irrespective of whether it used to be a partial raid member or not.

Similarly with lvm2 / btrfs / zfs.
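
For the record, a quick way to see what is still holding the disk (illustrative commands, not taken from the attached logs):

  cat /proc/mdstat                    # the inactive md127 still claims vda3
  ls /sys/class/block/vda3/holders/   # md127 shows up as the holder blocking the exclusive open
  lsblk -f /dev/vda                   # vda3 still reports linux_raid_member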

Revision history for this message
Dimitri John Ledkov (xnox) wrote :

probe-data.json indicates that it detected vda3 as a raid member, but the merged storage config does not mention vda3 at all =(
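
The raid-member signature is visible from udev, which is presumably where the probe data picks it up. An illustrative check (not output from the attached probe-data.json):

  udevadm info --query=property /dev/vda3 | grep ID_FS_
  # expect ID_FS_TYPE=linux_raid_member for the stale member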

Revision history for this message
Dimitri John Ledkov (xnox) wrote :

So I did mdadm --manage --stop /dev/md127 to stop the raid and continued with reusing the existing partitions...

... however, curtin was helpful enough to assemble md0 back again, even though we didn't ask for that to happen.

Revision history for this message
Dimitri John Ledkov (xnox) wrote :
Revision history for this message
Dimitri John Ledkov (xnox) wrote :
Revision history for this message
Ryan Harper (raharper) wrote :

> If a random person picks up an unused disk off the shelf, they should be able to install onto it, irrespective of whether it used to be a partial raid member or not.
>
> Similarly with lvm2 / btrfs / zfs.

We're not in disagreement. We're figuring out how best to communicate between curtin block-discover and subiquity.

The only answer to your request is to *wipe* the underlying partition or device.

> So I did mdadm --manage --stop /dev/md127 to stop the raid and continued with reusing the existing partitions...
>
> ... however, curtin was helpful enough to assemble md0 back again, even though we didn't ask for that to happen.

Curtin needs to "awaken" any possible block layer so that it can remove/wipe/clean the data, so that when you boot into the target you don't have a surprise md127 that starts recovering. In this case, subiquity doesn't yet know, from the curtin discover data, that it cannot use preserve on a partition that is a raid member.

Revision history for this message
Michael Hudson-Doyle (mwhudson) wrote :

This is all a bit messy. I don't see a clean way to indicate, in the model we currently use, "this partition|disk is a raid member but we don't know where the rest of the raid is". That said, I'm inclined to think of this as a curtin bug too, I'm afraid. From the journal.txt in the tarball xnox attached:

Jul 02 16:31:38 ubuntu-server curtin_log.1573[1727]: Current device storage tree:
Jul 02 16:31:38 ubuntu-server curtin_log.1573[1727]:
Jul 02 16:31:38 ubuntu-server curtin_log.1573[1727]: Shutdown Plan:
Jul 02 16:31:38 ubuntu-server curtin_log.1573[1727]:
Jul 02 16:31:38 ubuntu-server curtin_log.1573[1727]: finish: cmd-install/stage-partitioning/builtin/cmd-block-meta/clear-holders: SUCCESS:

This doesn't seem right? There should be some kind of shutdown plan for half-a-RAID?

Revision history for this message
Ryan Harper (raharper) wrote :

The shutdown plan requires that the config for a device being cleared not include preserve: true. The "raid" partition, vda3, is explicitly marked with preserve: true:

  - device: disk-vda
    size: 20947402752
    flag: linux
    preserve: true
    type: partition
    id: partition-vda3

Curtin would not be able to clear raid metadata from this partition without wipe being set and preserve absent.

That said, I do think that curtin can do a few things to resolve this:

1) include partial raids in the discovered config, and either
   a) add a field to indicate whether the array is healthy/partial/degraded (array_state, maybe), or
   b) defer to subiquity to use curtin.block.mdadm.md_check to determine whether it wants to include the array, or to mark members of the array with wipe so they can be used in other configs.

2) Update how we run clear-holders. Currently we only pass in a list of block devices of type disk which have 'wipe' set and do not have 'preserve' enabled. This fails the case here, where we'd like to wipe vda3 but it has a holder.

Concretely, I'd expect curtin discover to return:

    - type: raid
      level: 0
      name: md127
      devices:
      - partition-vda3
      spare_devices: []
      array_state: failed

Now, to do that, probert will need to include partial raids as well. I'm not sure why it doesn't today; it's odd that pyudev didn't have a /dev/md127 entry in the context.

The alternative for probert is to run some mdadm commands on the devices which have ID_FS_TYPE set to raid. I'll add a probert task to this bug as well.
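
Something along these lines, presumably (illustrative commands, not a statement of what probert runs today):

  mdadm --examine /dev/vda3   # member superblock: array UUID, raid level, expected device count
  mdadm --detail /dev/md127   # array view: shows whether it assembled (here it would be inactive)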

For clear-holders, curtin will also accept type: partition. I expect the final config to set wipe: superblock on vda3, since it's a raid member, and that clear-holders is called with devices=['/dev/vda3']:

  - device: disk-vda
    size: 20947402752
    flag: linux
    wipe: superblock
    type: partition
    id: partition-vda3

From there, clear-holders would find /dev/md127 as a holder of type raid, and then the normal curtin shutdown plan would show us stopping md127 and wiping each array member (/dev/vda3).
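
For illustration, the holder relationship clear-holders would be walking looks like this on such a system (generic sysfs paths, not taken from the attached logs):

  ls /sys/class/block/vda3/holders/   # -> md127, the holder curtin must shut down first
  ls /sys/class/block/md127/slaves/   # -> vda3, the member to wipe once md127 is stopped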

Changed in probert (Ubuntu):
importance: Undecided → High
status: New → Triaged
Changed in curtin:
importance: Undecided → High
status: New → Confirmed
Revision history for this message
Michael Hudson-Doyle (mwhudson) wrote :

I fixed subiquity to wipe disks harder, which might well have fixed this. Will check today.

Revision history for this message
Michael Hudson-Doyle (mwhudson) wrote :

So, no, neither case here is fixed in the latest subiquity :/ I'll try to make vmtest testcases for curtin.

Revision history for this message
Michael Hudson-Doyle (mwhudson) wrote :

So https://code.launchpad.net/~mwhudson/curtin/+git/curtin/+merge/369918 now has matching test cases. Will attach failures from the tests (warning, contains sparse disk images) and from my testing in KVM.

Changed in subiquity:
status: New → Triaged
importance: Undecided → Medium
tags: added: reuse
tags: added: id-5d40f920ea9865754db787bb
Revision history for this message
Server Team CI bot (server-team-bot) wrote :

This bug is fixed with commit 7a22938d to curtin on branch master.
To view that commit see the following URL:
https://git.launchpad.net/curtin/commit/?id=7a22938d

Changed in curtin:
status: Confirmed → Fix Committed
Revision history for this message
Ryan Harper (raharper) wrote : Fixed in curtin version 19.3.

This bug is believed to be fixed in curtin version 19.3. If this is still a problem for you, please make a comment and set the state back to New.

Thank you.

Changed in curtin:
status: Fix Committed → Fix Released
Changed in subiquity:
status: Triaged → Fix Released
Changed in probert (Ubuntu):
status: Triaged → Fix Released