Storage nodes go for extra reboot after unlock (manifest apply failure)

Bug #1797150 reported by Maria Yousaf
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
StarlingX
Fix Released
Medium
Daniel Badea

Bug Description

Brief Description
-----------------
Storage nodes go for extra reboot after unlock, after assigning OSD in newly created storage tier

Severity
--------
Major

Steps to Reproduce
------------------
1. Create a new storage tier
2. Lock a storage node
3. Assign an unused OSD on that storage node to the newly created storage tier
4. Unlock the storage node*
5. Repeat for all storage nodes

*On reboot, there is a failure displayed on the console: failed to apply puppet manifest. The storage node goes for a reboot again. Eventually, the node does come up.

Looking at the puppet manifest on storage-0, the following is seen:

2018-10-10T13:35:25.481 ^[[mNotice: 2018-10-10 13:35:25 +0000 /Stage[post]/Platform::Config::Storage::Post/File[/etc/platform/.initial_storage_config_complete]: Dependency Exec[ceph-osd-prepare-/dev/disk/by-path/pci-0000:85:00.0-nvme-1] has failures: true

Expected Behavior
------------------
A single reboot cycle should bring up the node and no manifest failures should be seen.

Actual Behavior
----------------
Manifest failures seen on unlock resulting in additional reboot.

Reproducibility
---------------
Extra reboot seen on multiple storage nodes in the system.

System Configuration
--------------------
Multi-node storage system

Branch/Pull Time/Commit
-----------------------
master as of 2018-10-09_01-52-01

Timestamp/Logs
--------------
 2018-10-10T13:35:25.481

Revision history for this message
Ghada Khalil (gkhalil) wrote :

stx.2019.03 - specific to storage tiers; system still comes up after extra reboot. Not required for stx.2018.10

Changed in starlingx:
assignee: nobody → Daniel Badea (daniel.badea)
importance: Undecided → Medium
status: New → Triaged
tags: added: stx. stx.config
tags: added: stx.2019.03
removed: stx.
Revision history for this message
Ghada Khalil (gkhalil) wrote :
Revision history for this message
Ghada Khalil (gkhalil) wrote :

Fix merged as of Nov 1/2018

Changed in starlingx:
status: Triaged → Fix Released
Ken Young (kenyis)
tags: added: stx.2019.05
removed: stx.2019.03
Ken Young (kenyis)
tags: added: stx.2.0
removed: stx.2019.05
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.