Identity sync issues after subcloud is first managed
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Fix Released
|
Medium
|
Andy |
Bug Description
Brief Description
-----------------
When a subcloud is first managed, it can sometimes take up to an hour for the identity and platform endpoints to go in-sync. This is intermittent.
Severity
--------
Major: subcloud can stay out of sync for up to an hour which will be confusing to the user
Steps to Reproduce
------------------
Configure a DC system
Install and add a subcloud
Manage the subcloud
Expected Behavior
------------------
All the endpoints for the subcloud should go in-sync within a minute or two.
Actual Behavior
----------------
Sometimes the identity and platform endpoints don't go in sync for up to an hour.
Reproducibility
---------------
Intermittent - this may have something to do with how soon the subcloud is managed after it goes online. I suspect that the sooner it is managed, the more likely this is to happen, but that is just a theory.
System Configuration
-------
Distributed Cloud
Branch/Pull Time/Commit
-------
Designer load built from starlingx master:
BUILD_DATE=
Last Pass
---------
I suspect this was broken by https:/
Timestamp/Logs
--------------
When the failure occurs, the dcorch initial identity sync is failing every minute with logs like this:
2020-07-16 13:15:06.295 102948 INFO dcorch.
2020-07-16 13:15:06.342 102948 INFO dcorch.
2020-07-16 13:15:06.342 102948 INFO dcorch.
2020-07-16 13:15:06.351 102948 INFO dcorch.
2020-07-16 13:15:07.770 102948 ERROR dcorch.
2020-07-16 13:15:07.770 102948 ERROR dcorch.
2020-07-16 13:15:07.770 102948 ERROR dcorch.
2020-07-16 13:15:07.770 102948 ERROR dcorch.
2020-07-16 13:15:07.770 102948 ERROR dcorch.
2020-07-16 13:15:07.770 102948 ERROR dcorch.
2020-07-16 13:15:07.770 102948 ERROR dcorch.
2020-07-16 13:15:07.770 102948 ERROR dcorch.
2020-07-16 13:15:07.770 102948 ERROR dcorch.
2020-07-16 13:15:07.770 102948 ERROR dcorch.
2020-07-16 13:15:07.770 102948 ERROR dcorch.
2020-07-16 13:15:07.770 102948 ERROR dcorch.
2020-07-16 13:15:07.770 102948 ERROR dcorch.
2020-07-16 13:15:07.770 102948 ERROR dcorch.
2020-07-16 13:15:07.770 102948 ERROR dcorch.
2020-07-16 13:15:07.770 102948 ERROR dcorch.
2020-07-16 13:15:07.770 102948 ERROR dcorch.
2020-07-16 13:15:07.770 102948 ERROR dcorch.
2020-07-16 13:15:07.770 102948 ERROR dcorch.
2020-07-16 13:15:07.770 102948 ERROR dcorch.
2020-07-16 13:15:07.770 102948 ERROR dcorch.
2020-07-16 13:15:07.770 102948 ERROR dcorch.
2020-07-16 13:15:07.772 102948 INFO dcorch.
Test Activity
-------------
Developer Testing
Workaround
----------
Wait for about an hour and the subcloud should go in sync.
tags: | added: stx.distcloud |
Changed in starlingx: | |
status: | New → Triaged |
importance: | Undecided → Medium |
tags: | added: stx.5.0 |
Changed in starlingx: | |
assignee: | nobody → Andy (andy.wrs) |
Fix proposed to branch: master /review. opendev. org/742974
Review: https:/