StarlingX

Storage nodes go for extra reboot after unlock (manifest apply failure)

Bug #1797150 reported by Maria Yousaf on 2018-10-10

This bug report is a duplicate of: Bug #1800889: Storage node fails to unlock when installing.

This bug affects 1 person

Affects      Status          Importance   Assigned to     Milestone
StarlingX    Fix Released    Medium       Daniel Badea

Bug Description

Brief Description
-----------------
Storage nodes go through an extra reboot after unlock once an OSD has been assigned to a newly created storage tier.

Severity
--------
Major

Steps to Reproduce
------------------
1. Create a new storage tier
2. Lock a storage node
3. Assign an unused OSD on that storage node to the newly created storage tier
4. Unlock the storage node*
5. Repeat for all storage nodes

*On reboot, the console displays a failure: failed to apply puppet manifest. The storage node then reboots a second time. Eventually, the node does come up.

The puppet log on storage-0 shows the following:

2018-10-10T13:35:25.481 Notice: 2018-10-10 13:35:25 +0000 /Stage[post]/Platform::Config::Storage::Post/File[/etc/platform/.initial_storage_config_complete]: Dependency Exec[ceph-osd-prepare-/dev/disk/by-path/pci-0000:85:00.0-nvme-1] has failures: true
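
For anyone triaging a similar failure, the following is a minimal sketch of how such "Dependency ... has failures" notices could be pulled out of a puppet log. The pattern is an assumption inferred from the single log line quoted in this report, and the names failed_dependencies, DEP_FAILURE and SAMPLE_LINE are hypothetical helpers, not part of StarlingX or puppet.

```python
import re

# Assumed pattern, derived only from the one log line quoted in this report:
#   <resource path>: Dependency <resource> has failures: true
DEP_FAILURE = re.compile(
    r"(?P<resource>/Stage\[.*?)\s*:\s+Dependency\s+"
    r"(?P<dependency>.+?)\s+has failures: true"
)

# The log line from this report, used here as sample input.
SAMPLE_LINE = (
    "2018-10-10T13:35:25.481 Notice: 2018-10-10 13:35:25 +0000 "
    "/Stage[post]/Platform::Config::Storage::Post/"
    "File[/etc/platform/.initial_storage_config_complete]: "
    "Dependency Exec[ceph-osd-prepare-/dev/disk/by-path/"
    "pci-0000:85:00.0-nvme-1] has failures: true"
)


def failed_dependencies(lines):
    """Return (skipped_resource, failed_dependency) pairs found in log lines."""
    found = []
    for line in lines:
        match = DEP_FAILURE.search(line)
        if match:
            found.append((match.group("resource"), match.group("dependency")))
    return found


if __name__ == "__main__":
    for resource, dependency in failed_dependencies([SAMPLE_LINE]):
        print("skipped:", resource)
        print("failed dependency:", dependency)
```

Run against the line above, this would report the ceph-osd-prepare Exec for the NVMe device as the failed dependency, which is the resource to look for earlier in the same log when searching for the underlying error.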

Expected Behavior
------------------
A single reboot cycle should bring up the node and no manifest failures should be seen.

Actual Behavior
----------------
Manifest failures seen on unlock resulting in additional reboot.

Reproducibility
---------------
Extra reboot seen on multiple storage nodes in the system.

System Configuration
--------------------
Multi-node storage system

Branch/Pull Time/Commit
-----------------------
master as of 2018-10-09_01-52-01

Timestamp/Logs
--------------
2018-10-10T13:35:25.481

Ghada Khalil (gkhalil) wrote on 2018-10-10:

stx.2019.03 - specific to storage tiers; system still comes up after extra reboot. Not required for stx.2018.10

Changed in starlingx:
assignee:	nobody → Daniel Badea (daniel.badea)
importance:	Undecided → Medium
status:	New → Triaged
tags:	added: stx. stx.config
tags:	added: stx.2019.03 removed: stx.

Ghada Khalil (gkhalil) wrote on 2018-11-01:

Duplicate of https://bugs.launchpad.net/starlingx/+bug/1800889

Ghada Khalil (gkhalil) wrote on 2018-11-02:

Fix merged as of Nov 1/2018

Changed in starlingx:
status:	Triaged → Fix Released

Ken Young (kenyis) on 2019-01-18

tags:	added: stx.2019.05 removed: stx.2019.03

Ken Young (kenyis) on 2019-04-05

tags:	added: stx.2.0 removed: stx.2019.05
