Storage node comes up as failed on disk swap
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX | Invalid | Low | Tingjie Chen | | |
Bug Description
Brief Description
-----------------
Storage node comes up as failed on disk swap
Severity
--------
Major
Steps to Reproduce
------------------
1. storage-0 had one OSD provisioned. That OSD was using a collocated journal of 1024 MiB.
2. storage-0 was locked. A journal disk was provisioned. The OSD was updated to use a dedicated journal of 2048 MiB on that journal disk instead of the collocated journal.
3. storage-0 was unlocked. This was repeated for all storage nodes within the same peer group.
4. storage-0 was then locked and powered off. The OSD and journal disks were swapped.
5. The node was then powered on, and unlocked after it came online.
6. The storage node eventually went into the Failed state. I'm seeing the following reported in puppet.log:
n[check]
0000:84:00.0 is the OSD
0000:05:00.0 is the journal
According to a designer, the system may need to re-create the partitions according to what is in the database.
Workaround: lock the host, do a host reinstall, and unlock it.
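The workaround sequence can be sketched as a short Python helper that only builds the CLI invocations, without running them. This is a minimal sketch, not a tested procedure: the function name `workaround_commands` is hypothetical, and it assumes the StarlingX `system` CLI subcommands `host-lock`, `host-reinstall` and `host-unlock` are available on the active controller.

```python
import shlex


def workaround_commands(hostname):
    """Return the lock / reinstall / unlock command sequence for one host.

    Assumption: the subcommand names match the StarlingX `system` CLI.
    The commands are returned as argument lists and are not executed here.
    """
    return [
        ["system", "host-lock", hostname],
        # The reinstall has to finish and the host has to come back
        # online before the unlock is issued; that wait is not modeled.
        ["system", "host-reinstall", hostname],
        ["system", "host-unlock", hostname],
    ]


if __name__ == "__main__":
    # Print the commands for the node from this report.
    for cmd in workaround_commands("storage-0"):
        print(shlex.join(cmd))
```

Returning argument lists rather than running the commands keeps the sketch free of side effects; an operator would still need to confirm the host state between steps.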
Expected Behavior
------------------
Storage node recovers
Actual Behavior
----------------
Storage node fails
Reproducibility
---------------
Tried once so far
System Configuration
-------
Storage system
Branch/Pull Time/Commit
-------
stx.10.2018 as of 2018-10-12_01-52-00
Timestamp/Logs
--------------
2018-10-12 17:56:57.855
Changed in starlingx: | |
assignee: | Ovidiu Poncea (ovidiu.poncea) → Bruce Jones (brucej) |
Changed in starlingx: | |
assignee: | Bruce Jones (brucej) → Cindy Xie (xxie1) |
Changed in starlingx: | |
assignee: | Cindy Xie (xxie1) → Zhuweiwei (vivian.zhu) |
Changed in starlingx: | |
assignee: | Zhuweiwei (vivian.zhu) → Changcheng Intel (liuc-intel) |
tags: | added: stx.2019.05 removed: stx.2019.03 |
tags: | added: stx.2.0 removed: stx.2019.05 |
tags: | added: stx.retestneeded |
Changed in starlingx: | |
assignee: | Changcheng Intel (liuc-intel) → Tingjie Chen (silverhandy) |
tags: | added: stx.storage |
tags: | added: stx.distro.other |
tags: | removed: stx.distro.other |
Changed in starlingx: | |
status: | Incomplete → Invalid |
tags: | removed: stx.retestneeded |
Targeting stx.2019.03 as this is a corner case during disk replacement