IPv6 Distributed Cloud: "loss of redundancy" alarm 400.002 found in SX subcloud

Bug #1846415 reported by Peng Peng
This bug affects 1 person
Affects: StarlingX
Status: Fix Released
Importance: Medium
Assigned to: Tao Liu

Bug Description

Brief Description
-----------------
Brought up an AIO-SX subcloud; alarm 400.002 was raised:
400.002 | Service group distributed-cloud-services loss of redundancy; expected 1 standby member but no standby members available

Severity
--------
Major

Steps to Reproduce
------------------
Bring up an AIO-SX subcloud, as in the Brief Description.

TC-name:

Expected Behavior
------------------
Alarm 400.002 is not raised.

Actual Behavior
----------------
Alarm 400.002 is raised on the SX subcloud.

Reproducibility
---------------
Reproducible

System Configuration
--------------------
DC
IPv6

Lab-name: WCP_89

Branch/Pull Time/Commit
-----------------------
19.10 master as of 2019-09-22_20-00-00

Last Pass
---------

Timestamp/Logs
--------------
[sysadmin@controller-0 ~(keystone_admin)]$ fm alarm-list
+----------+-------------------------------------------------------------------------------------------------------+---------------------------------------+----------+-------------------+
| Alarm ID | Reason Text | Entity ID | Severity | Time Stamp |
+----------+-------------------------------------------------------------------------------------------------------+---------------------------------------+----------+-------------------+
| 400.002 | Service group distributed-cloud-services loss of redundancy; expected 1 standby member but no standby | service_domain=controller. | major | 2019-09-30T20:36: |
| | members available | service_group=distributed-cloud- | | 37.163105 |
| | | services | | |
| | | | | |
| 800.011 | Loss of replication in replication group group-0: peer host down | cluster= | major | 2019-09-26T15:24: |
| | | 3a0bb9c0-4622-4da4-9d54-ab3c27b20438. | | 31.603640 |
| | | peergroup=group-0 | | |
| | | | | |
+----------+-------------------------------------------------------------------------------------------------------+---------------------------------------+----------+-------------------+
[sysadmin@controller-0 ~(keystone_admin)]$
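
For regression runs, a check like this could confirm whether 400.002 is present in captured `fm alarm-list` output. This is only a minimal sketch: the function name `alarm_ids` is hypothetical, and it assumes the table layout shown above, where wrapped continuation rows leave the first column empty.

```python
import re

# An alarm ID looks like "400.002": digits, a dot, digits.
ALARM_ID = re.compile(r"^\d+\.\d+$")


def alarm_ids(table_text):
    """Return the alarm IDs found in the first column of the table text.

    Hypothetical helper; assumes the `fm alarm-list` layout shown in this
    report. Border lines, shell prompts, the header row, and wrapped
    continuation rows are skipped.
    """
    ids = []
    for line in table_text.splitlines():
        line = line.strip()
        if not line.startswith("|"):
            continue  # border line, prompt, or blank line
        cells = [cell.strip() for cell in line.strip("|").split("|")]
        if cells and ALARM_ID.match(cells[0]):
            ids.append(cells[0])
    return ids


if __name__ == "__main__":
    sample = """
+----------+-------------+-----------+----------+------------+
| Alarm ID | Reason Text | Entity ID | Severity | Time Stamp |
+----------+-------------+-----------+----------+------------+
| 400.002 | Service group distributed-cloud-services loss of redundancy | service_domain=controller. | major | 2019-09-30T20:36: |
| | members available | service_group=distributed-cloud- | | 37.163105 |
| 800.011 | Loss of replication in replication group group-0: peer host down | cluster= | major | 2019-09-26T15:24: |
+----------+-------------+-----------+----------+------------+
"""
    print("400.002" in alarm_ids(sample))
```

Matching on the first column only keeps the check independent of how the Reason Text and Time Stamp columns wrap.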

Test Activity
-------------
Regression Testing

Ghada Khalil (gkhalil)
Changed in starlingx:
assignee: nobody → Tao Liu (tliu88)
Ghada Khalil (gkhalil)
tags: added: stx.3.0 stx.distcloud
Tao Liu (tliu88)
Changed in starlingx:
status: New → In Progress
Ghada Khalil (gkhalil) wrote :

Marking as stx.3.0 - issue related to Distributed Cloud which is an stx.3.0 deliverable

Changed in starlingx:
status: In Progress → Triaged
importance: Undecided → Medium
OpenStack Infra (hudson-openstack) wrote : Fix proposed to stx-puppet (master)

Fix proposed to branch: master
Review: https://review.opendev.org/686471

Changed in starlingx:
status: Triaged → In Progress
OpenStack Infra (hudson-openstack) wrote : Fix merged to stx-puppet (master)

Reviewed: https://review.opendev.org/686471
Committed: https://git.openstack.org/cgit/starlingx/stx-puppet/commit/?id=ea9c64b22c2467852b2d9a44c9a63ae1069d1ec7
Submitter: Zuul
Branch: master

commit ea9c64b22c2467852b2d9a44c9a63ae1069d1ec7
Author: Tao Liu <email address hidden>
Date: Thu Oct 3 14:10:56 2019 -0400

    Fix "loss of redundancy" alarm in SX subcloud

    Service group distributed-cloud-services loss of redundancy alarm
    is raised in AIO-SX subclouds. This is due to the redundancy model
    expects 1 standby member but no standby members available.

    This update changes the distributed-cloud-services redundancy model
    to no standby member in AIO-SX subclouds.

    Change-Id: I72eee6218b0b233f630f913c50b9554cd772e43c
    Closes-Bug:1846415
    Signed-off-by: Tao Liu <email address hidden>
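
The reasoning in the commit message can be illustrated with a small sketch. This is not the stx-puppet change itself: the function names are hypothetical, and the system mode strings `simplex` and `duplex` are assumptions used for illustration. It only shows why a model that expects one standby member always alarms on a single-controller subcloud, and why expecting zero standby members there clears the alarm.

```python
def expected_standby_members(system_mode):
    """Return how many standby members the service group should expect.

    Hypothetical helper. Assumes "simplex" means a single-controller
    (AIO-SX) system, which can never provide a standby member.
    """
    return 0 if system_mode == "simplex" else 1


def loss_of_redundancy(system_mode, available_standby):
    """Return True when fewer standby members are available than expected."""
    return available_standby < expected_standby_members(system_mode)


if __name__ == "__main__":
    # Before the fix (conceptually): one standby expected everywhere, so an
    # SX subcloud with zero standby members would always alarm.
    print(0 < 1)
    # After the fix: no standby member is expected on simplex.
    print(loss_of_redundancy("simplex", 0))
    # A duplex system that has lost its standby still alarms.
    print(loss_of_redundancy("duplex", 0))
```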

Changed in starlingx:
status: In Progress → Fix Released
Yang Liu (yliu12)
tags: added: stx.retestneeded
Peng Peng (ppeng) wrote :

Verified on build 2019-11-02_08-39-54; alarm 400.002 is no longer raised:
[sysadmin@controller-0 ~(keystone_admin)]$ fm alarm-list
+----------+---------------------------+-----------------------------+----------+-------------------+
| Alarm ID | Reason Text | Entity ID | Severity | Time Stamp |
+----------+---------------------------+-----------------------------+----------+-------------------+
| 750.002 | Application Apply Failure | k8s_application=hello-kitty | major | 2019-11-03T05:00: |
| | | | | 02.479201 |
| | | | | |
| 750.002 | Application Apply Failure | k8s_application=platform- | major | 2019-11-03T04:17: |
| | | integ-apps | | 10.292328 |
| | | | | |
+----------+---------------------------+-----------------------------+----------+-------------------+
[sysadmin@controller-0 ~(keystone_admin)]$
[sysadmin@controller-0 ~(keystone_admin)]$ cat /etc/build.info
###
### Wind River Cloud Platform
### Release 19.10
###
### Wind River Systems, Inc.
###

SW_VERSION="19.10"
BUILD_TARGET="Host Installer"
BUILD_TYPE="Formal"
BUILD_ID="2019-11-02_08-39-54"
SRC_BUILD_ID="74"

JOB="TC_19.10_Build"
BUILD_BY="jenkins"
BUILD_NUMBER="74"
BUILD_HOST="yow-cgts4-lx.wrs.com"
BUILD_DATE="2019-11-02 08:41:48 -0400"

tags: removed: stx.retestneeded