Alarm 800.010 Potential data loss. No available OSDs in storage replication group group-0
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Fix Released
|
Critical
|
Jia Hu |
Bug Description
Brief Description
-----------------
After provisioning Virtual STANDARD and Virtual STANDARD-EXTERNAL systems, the alarm 800.010 is listed and it is CRITICAL
Severity
--------
Steps to Reproduce
------------------
Provision Virtual STANDARD and Virtual STANDARD-EXTERNAL
Expected Behavior
------------------
No alarms
Actual Behavior
----------------
============= VIRTUAL STANDARD
[sysadmin@
+------
| Alarm ID | Reason Text | Entity ID | Severity | Time Stamp |
+------
| 800.001 | Storage Alarm Condition: HEALTH_WARN [PGs are degraded/stuck or undersized]. Please check 'ceph | cluster= | warning | 2021-09-02T05:49: |
| | -s' for more details. | 2b7074f0-
| | | | | |
| 800.010 | Potential data loss. No available OSDs in storage replication group group-0: no OSDs | cluster= | critical | 2021-09-02T05:44: |
| | | 2b7074f0-
| | | peergroup=group-0 | | |
| | | | | |
+------
[sysadmin@
cluster:
id: 2b7074f0-
health: HEALTH_WARN
1 MDSs report slow metadata IOs
Reduced data availability: 192 pgs inactive
services:
mon: 3 daemons, quorum controller-
mgr: controller-
mds: kube-cephfs-1/1/1 up {0=controller-
osd: 0 osds: 0 up, 0 in
data:
pools: 3 pools, 192 pgs
objects: 0 objects, 0 B
usage: 0 B used, 0 B / 0 B avail
pgs: 100.000% pgs unknown
192 unknown
================= VIRTUAL STANDARD-EXTERNAL
[sysadmin@
+------
| Alarm ID | Reason Text | Entity ID | Severity | Time Stamp |
+------
| 800.001 | Storage Alarm Condition: HEALTH_WARN [PGs are degraded/stuck or undersized]. Please check 'ceph | cluster= | warning | 2021-09-02T12:26: |
| | -s' for more details. | 720df95d-
| | | | | |
| 800.010 | Potential data loss. No available OSDs in storage replication group group-0: no OSDs | cluster= | critical | 2021-09-02T12:20: |
| | | 720df95d-
| | | peergroup=group-0 | | |
| | | | | |
| 200.001 | storage-1 was administratively locked to take it out-of-service. | host=storage-1 | warning | 2021-09-02T11:57: |
| | | | | 12.267591 |
| | | | | |
| 200.001 | storage-0 was administratively locked to take it out-of-service. | host=storage-0 | warning | 2021-09-02T11:57: |
| | | | | 06.516257 |
| | | | | |
| 250.001 | compute-1 Configuration is out-of-date. | host=compute-1 | major | 2021-09-02T11:56: |
| | | | | 47.769518 |
| | | | | |
| 250.001 | compute-0 Configuration is out-of-date. | host=compute-0 | major | 2021-09-02T11:56: |
| | | | | 47.065148 |
| | | | | |
| 200.001 | compute-1 was administratively locked to take it out-of-service. | host=compute-1 | warning | 2021-09-02T11:46: |
| | | | | 23.538292 |
| | | | | |
| 200.001 | compute-0 was administratively locked to take it out-of-service. | host=compute-0 | warning | 2021-09-02T11:46: |
| | | | | 17.311249 |
| | | | | |
+------
[sysadmin@
cluster:
id: 720df95d-
health: HEALTH_WARN
1 MDSs report slow metadata IOs
Reduced data availability: 192 pgs inactive
services:
mon: 2 daemons, quorum controller-
mgr: controller-
mds: kube-cephfs-1/1/1 up {0=controller-
osd: 0 osds: 0 up, 0 in
data:
pools: 3 pools, 192 pgs
objects: 0 objects, 0 B
usage: 0 B used, 0 B / 0 B avail
pgs: 100.000% pgs unknown
192 unknown
Reproducibility
---------------
Reproducible
System Configuration
-------
OS="centos"
SW_VERSION="21.12"
BUILD_TARGET="Host Installer"
BUILD_TYPE="Formal"
BUILD_ID=
JOB="STX_
<email address hidden>"
BUILD_NUMBER="600"
BUILD_HOST=
BUILD_DATE=
FLOCK_OS="centos"
FLOCK_JOB=
<email address hidden>"
FLOCK_BUILD_
FLOCK_BUILD_
FLOCK_BUILD_
DISTRO_OS="centos"
DISTRO_
<email address hidden>"
DISTRO_
DISTRO_
DISTRO_
COMPILER_
COMPILER_
<email address hidden>"
COMPILER_
COMPILER_
COMPILER_
tags: | added: stx.storage |
Changed in starlingx: | |
importance: | Undecided → Medium |
Changed in starlingx: | |
status: | Triaged → In Progress |
Screening: stx.6.0 / high - results in a sanity issue