Aborting the upgrade for controller-0 in a duplex env fails
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX | Fix Released | Low | Al Bailey | | |
Bug Description
Brief Description
-----------------
When performing a software upgrade of an AIO-DX system, the documentation indicates that it should be possible to abort the upgrade after both controllers have been upgraded, and then downgrade the controllers.
It turns out that there is a file permission issue blocking the host-downgrade command for controller-0.
Note: this is a very low priority issue. An abort would normally never happen, and if it did, it would normally occur much earlier in the procedure. Also, very few people are doing upgrades.
A similar issue was encountered (and fixed) for another flag by this submission:
https:/
Severity
--------
Minor
Steps to Reproduce
------------------
Perform a platform upgrade of an AIO-DX system, up to the point where both controllers are unlocked.
Then do the following:
- system upgrade-abort
- (swact from controller-0 to controller-1 to make controller-1 active)
- system host-lock controller-0
- (wait for controller-0 to lock)
- system host-downgrade controller-0
Expected Behavior
------------------
It should initiate the downgrade.
Actual Behavior
----------------
system host-downgrade controller-0
[Errno 13] Permission denied: '/etc/platform/
Reproducibility
---------------
Reproducible
System Configuration
-------
AIO-DX (upgrading from custom r4 to custom r5)
Branch/Pull Time/Commit
-------
Custom Load built May 25
Last Pass
---------
Unknown.
Timestamp/Logs
--------------
sysinv 2021-05-27 23:12:31.547 98746 ERROR wsme.api [-] Server-side error: "[Errno 13] Permission denied: '/etc/platform/
Traceback (most recent call last):
File "/usr/lib/
result = f(self, *args, **kwargs)
File "/usr/lib64/
open(
IOError: [Errno 13] Permission denied: '/etc/platform/
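For readers unfamiliar with this failure mode, the following is a minimal sketch of how opening a flag file for writing surfaces an errno such as 13 (EACCES). It is not the sysinv code: the helper name `try_create_flag` and the temporary path are hypothetical, and the real flag path is truncated in the log above. The sketch assumes Python 3, where IOError is an alias of OSError.

```python
import errno
import os
import tempfile


def try_create_flag(path):
    """Try to create an empty flag file.

    Returns None on success, or the symbolic errno name (for example
    "EACCES" for errno 13) when the open() call fails.
    """
    try:
        with open(path, "w"):
            pass
    except OSError as exc:  # IOError is an alias of OSError on Python 3
        return errno.errorcode.get(exc.errno, str(exc.errno))
    return None


if __name__ == "__main__":
    # Hypothetical stand-in directory; the real file lives under /etc/platform/.
    demo_dir = tempfile.mkdtemp()
    print(try_create_flag(os.path.join(demo_dir, "demo_flag")))
```

A process without write access to the target directory would get "EACCES" back from this helper, which corresponds to the `[Errno 13] Permission denied` reported by sysinv.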
Test Activity
-------------
Developer Testing
Workaround
----------
sudo touch /etc/platform/
sudo chmod 666 /etc/platform/
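The effect of the two commands above can be sketched in Python as follows. This is only an illustration, assuming a temporary directory in place of the real flag file (whose path is truncated in this report); the helper name `precreate_flag` is hypothetical.

```python
import os
import stat
import tempfile


def precreate_flag(path, mode=0o666):
    """Create an empty file (like `touch`) and set its mode (like `chmod`)."""
    # Append mode creates the file if missing and never truncates existing content.
    with open(path, "a"):
        pass
    # chmod sets the mode explicitly, so the result does not depend on the umask.
    os.chmod(path, mode)
    # Return only the permission bits of the resulting file mode.
    return stat.S_IMODE(os.stat(path).st_mode)


if __name__ == "__main__":
    # Hypothetical stand-in for the real flag file.
    demo_dir = tempfile.mkdtemp()
    demo_flag = os.path.join(demo_dir, "demo_flag")
    print(oct(precreate_flag(demo_flag)))
```

Because the file already exists and is writable by any user, a later open-for-write by an unprivileged service succeeds without needing write access to the parent directory. Mode 666 is deliberately permissive, which is acceptable for a workaround but not necessarily for a permanent fix.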
Changed in starlingx: | |
importance: | Undecided → Low |
Changed in starlingx: | |
status: | In Progress → Fix Committed |
Changed in starlingx: | |
status: | Fix Committed → Fix Released |
tags: | added: stx.config |
With the workaround, the code fails further on. I will address that as well.
system host-downgrade controller-0
Remote error: SysinvException Unable to shut down ceph storage monitor.
[u'Traceback (most recent call last):\n', u' File "/usr/lib64/python2.7/site-packages/sysinv/openstack/common/rpc/amqp.py", line 437, in _process_data\n **args)\n'
, u' File "/usr/lib64/python2.7/site-packages/sysinv/openstack/common/rpc/dispatcher.py", line 172, in dispatch\n result = getattr(proxyobj, method)(ctxt, **kwargs)\n'
, u' File "/usr/lib64/python2.7/site-packages/sysinv/conductor/manager.py", line 6486, in kill_ceph_storage_monitor\n _("Unable to shut down ceph storage monitor."))\n'
, u'SysinvException: Unable to shut down ceph storage monitor.\n'].