Error on system restore attempt with swift enabled

Bug #1799271 reported by Jose Perez Carranza
This bug affects 1 person
Affects    Status     Importance  Assigned to  Milestone
StarlingX  Won't Fix  Medium      Wei Zhou

Bug Description

Title
-----
Failing to apply system restore with error "Failed to execute controller manifest"

Brief Description
-----------------
Restoring the system fails at step 8 with "Failed to execute controller manifest". More information is provided under Timestamp/Logs.

Severity
--------
Major

Steps to Reproduce
------------------
1. Have a functional and working deployment installed
2. Generate the backup files
   $ sudo config_controller --backup <backup_name>
3. Save the generated files
4. Reinstall the system
5. Attempt to restore the system
   $ sudo config_controller --restore-system <backup_name_system.tgz>

Expected Behavior
------------------
The system restore should complete successfully.

Actual Behavior
----------------
Step 8 of 25 [############## ] [32%] Failed to execute controller manifest

Reproducibility
---------------
Reproducible 100%

System Configuration
--------------------
Duplex Bare-Metal Two node system

Branch/Pull Time/Commit
-----------------------
2018.10

Timestamp/Logs
--------------
Applying the manifest raises warnings about missing files, but the files are already present. This looks like a race condition on the system. More information below:

controller-0:~$ sudo grep -E '^(.......)?Warning|^....-..-..T..:..:..([.]...)?(.......)?.Warning|^(.......)?Error|^....-..-..T..:..:..([.]...)?(.......)?.Error' /var/log/puppet/latest/puppet.log
2018-10-21T21:34:20.225 Warning: 2018-10-21 21:34:20 +0000 Unexpected line: Ring file /etc/swift/object.ring.gz not found, probably it hasn't been written yet
2018-10-21T21:34:20.904 Warning: 2018-10-21 21:34:20 +0000 Unexpected line: Ring file /etc/swift/container.ring.gz not found, probably it hasn't been written yet
2018-10-21T21:34:23.024 Warning: 2018-10-21 21:34:23 +0000 Unexpected line: Ring file /etc/swift/account.ring.gz not found, probably it hasn't been written yet

controller-0:~$ ll /etc/swift/ |grep gz
-rw-r--r-- 1 root root 204 Oct 21 21:34 account.ring.gz
-rw-r--r-- 1 root root 206 Oct 21 21:34 container.ring.gz
-rw-r--r-- 1 root root 202 Oct 21 21:34 object.ring.gz
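
The comparison above (warnings about missing ring files versus the files actually present) can also be scripted. The following is only a minimal Python sketch, assuming the puppet.log line layout seen in the excerpt (an ISO-style timestamp followed by "Warning:" or "Error:"). The names find_ring_warnings and report are hypothetical helpers, not part of StarlingX or Puppet.

```python
import os
import re

# Matches lines like:
#   2018-10-21T21:34:20.225 Warning: 2018-10-21 21:34:20 +0000 Unexpected line: ...
# The layout is assumed from the log excerpt above, not from a Puppet specification.
LOG_LINE = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}(?:\.\d+)?)\s+"
    r"(?P<level>Warning|Error):\s*(?P<msg>.*)$"
)
RING_FILE = re.compile(r"Ring file (?P<path>\S+) not found")


def find_ring_warnings(lines):
    """Return (timestamp, level, ring_path) for each 'Ring file ... not found' line."""
    hits = []
    for line in lines:
        m = LOG_LINE.match(line.strip())
        if not m:
            continue
        ring = RING_FILE.search(m.group("msg"))
        if ring:
            hits.append((m.group("ts"), m.group("level"), ring.group("path")))
    return hits


def report(lines, exists=os.path.exists):
    """Pair each warned-about ring file with whether it exists at check time."""
    return [(ts, path, exists(path)) for ts, _level, path in find_ring_warnings(lines)]
```

On the controller, report() could be fed the open /var/log/puppet/latest/puppet.log file object. A ring file that was reported as not found but exists at check time matches the behavior described above. The exists parameter is there only so the check can be exercised without touching the real filesystem.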

Bruce Jones (brucej) wrote :

Only fails on duplex bare metal setup. Might be setup specific. Erich to investigate this more deeply.

Changed in starlingx:
assignee: nobody → Erich Cordoba (ericho)
Bruce Jones (brucej)
tags: added: stx.2018.10
tags: removed: stx.2018.10
Ghada Khalil (gkhalil)
tags: added: stx.config
Wei Zhou (wzhou007) wrote :

I guess swift was enabled at the time system backup was done. Just wondering if swift was enabled through system parameters, like this:

system service-parameter-modify swift config service_enabled=true
system service-parameter-modify swift config fs_size_mb=50 (optional)
system service-parameter-apply swift

Jose Perez Carranza (jgperezc) wrote :

I was running some tests before the backup, and yes, one of those test cases enabled swift with those commands.

Ghada Khalil (gkhalil) wrote :

stx.2019.03 - confirmed to be an issue, but only with swift enabled (which is not a common configuration)

summary: - error on system restore attempt
+ Error on system restore attempt with swift enabled
Changed in starlingx:
importance: Undecided → Medium
status: New → Triaged
assignee: Erich Cordoba (ericho) → Wei Zhou (wzhou007)
tags: added: stx.2019.03
Frank Miller (sensfan22) wrote :

Backup & restore with Swift is only supported when Swift is configured on Ceph. Backup & restore with Swift configured on the controllers is not supported [1].

[1] https://storyboard.openstack.org/#!/story/2003518

Changed in starlingx:
status: Triaged → Won't Fix
assignee: Wei Zhou (wzhou007) → Jose Perez Carranza (jgperezc)
Ghada Khalil (gkhalil)
Changed in starlingx:
assignee: Jose Perez Carranza (jgperezc) → Wei Zhou (wzhou007)
Ken Young (kenyis)
tags: added: stx.2019.05
removed: stx.2019.03
Ken Young (kenyis)
tags: added: stx.2.0
removed: stx.2019.05