Rook-ceph documentation - wrong values.yaml for rook-ceph-apps

Bug #1919162 reported by Nicolae Jascanu
This bug affects 2 people
Affects: StarlingX
Status: Fix Released
Importance: Medium
Assigned to: M Camp
Milestone: (none listed)

Bug Description

Brief Description
-----------------
The install documentation for Rook-Ceph in STX 5.0 gives the wrong device name value in values.yaml.
Instead of:
cluster:
  storage:
    nodes:
    - name: controller-0
      devices:
      - name: /dev/disk/by-path/pci-0000:00:03.0-ata-2.0

it should be:
cluster:
  storage:
    nodes:
    - name: controller-0
      devices:
      - name: sdb

This affects all virtual and bare-metal Rook-Ceph documentation.

Severity
--------
Critical

Steps to Reproduce
------------------
Install rook-ceph app using the documented values.yaml file

Expected Behavior
------------------
All pods should be running

Actual Behavior
----------------
The app fails to parse the device name:
failed to parse device list ("/dev/disk/by-path/pci-0000:00:17.0-ata-1.0:1:::"): osds per device should be greater than 0 ("00")
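
The "00" in the message is consistent with the entry being split on colons: a by-path name contains colons of its own, so the piece after "pci-0000" lands where a count is expected. The Python sketch below illustrates that reading. It is an assumption about the parsing, not Rook's actual code; the function names are hypothetical, and only the meaning of the second field (OSDs per device) is inferred from the error text.

```python
from typing import List


def split_device_entry(entry: str) -> List[str]:
    """Split a colon-delimited device entry into its fields (assumed format)."""
    return entry.split(":")


def osds_per_device(entry: str) -> int:
    """Read the second field as the OSD count, rejecting values below 1."""
    fields = split_device_entry(entry)
    count = int(fields[1])
    if count < 1:
        raise ValueError(
            f'osds per device should be greater than 0 ("{fields[1]}")'
        )
    return count


# The entry from the error message: the path's own colons shift the fields.
by_path_entry = "/dev/disk/by-path/pci-0000:00:17.0-ata-1.0:1:::"
# The same entry written with a short kernel name, as the report proposes.
short_entry = "sdb:1:::"

print(split_device_entry(by_path_entry))
print(osds_per_device(short_entry))
```

With the short name, the second field is the intended "1"; with the by-path name, it is the "00" taken from the PCI address.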

Reproducibility
---------------
100%

System Configuration
--------------------
All configs
OS="centos"
SW_VERSION="20.12"
BUILD_TARGET="Host Installer"
BUILD_TYPE="Formal"
BUILD_ID="20210311T065832Z"

JOB="STX_build_layer_flock_master_master"
<email address hidden>"
BUILD_NUMBER="421"
BUILD_HOST="starlingx_mirror"
BUILD_DATE="2021-03-11 06:58:32 +0000"

FLOCK_OS="centos"
FLOCK_JOB="STX_build_layer_flock_master_master"
<email address hidden>"
FLOCK_BUILD_NUMBER="421"
FLOCK_BUILD_HOST="starlingx_mirror"
FLOCK_BUILD_DATE="2021-03-11 06:58:32 +0000"

DISTRO_OS="centos"
DISTRO_JOB="STX_build_layer_distro_master_master"
<email address hidden>"
DISTRO_BUILD_NUMBER="426"
DISTRO_BUILD_HOST="starlingx_mirror"
DISTRO_BUILD_DATE="2021-03-10 02:31:50 +0000"

COMPILER_OS="centos"
COMPILER_JOB="STX_build_layer_compiler_master_master"
<email address hidden>"
COMPILER_BUILD_NUMBER="485"
COMPILER_BUILD_HOST="starlingx_mirror"
COMPILER_BUILD_DATE="2021-03-11 02:30:00 +0000"

Test Activity
-------------
Install based on documentation

summary: - Rook-ceph documentation wrong values.yaml for rook-ceph-apps
+ Rook-ceph documentation - wrong values.yaml for rook-ceph-apps
M Camp (mcamp859)
Changed in starlingx:
assignee: nobody → M Camp (mcamp859)
importance: Undecided → Medium
status: New → Triaged
M Camp (mcamp859) wrote :
Changed in starlingx:
status: Triaged → In Progress
Giorgos K (bigbud41) wrote :

I had the same issue. I did not notice that rook-ceph-apps had failed, so I repeated the procedure, this time checking the device mapping with:

ls -ltr /dev/disk/by-path/
total 0
lrwxrwxrwx 1 root root 9 Mar 17 14:51 pci-0000:00:1f.2-ata-4.0 -> ../../sr0
lrwxrwxrwx 1 root root 9 Mar 17 21:33 pci-0000:00:1f.2-ata-1.0 -> ../../sda
lrwxrwxrwx 1 root root 10 Mar 17 21:33 pci-0000:00:1f.2-ata-1.0-part5 -> ../../sda5
lrwxrwxrwx 1 root root 10 Mar 17 21:33 pci-0000:00:1f.2-ata-1.0-part2 -> ../../sda2
lrwxrwxrwx 1 root root 10 Mar 17 21:33 pci-0000:00:1f.2-ata-1.0-part6 -> ../../sda6
lrwxrwxrwx 1 root root 10 Mar 17 21:33 pci-0000:00:1f.2-ata-1.0-part3 -> ../../sda3
lrwxrwxrwx 1 root root 10 Mar 17 21:33 pci-0000:00:1f.2-ata-1.0-part4 -> ../../sda4
lrwxrwxrwx 1 root root 10 Mar 17 21:33 pci-0000:00:1f.2-ata-1.0-part1 -> ../../sda1
lrwxrwxrwx 1 root root 9 Mar 17 21:33 pci-0000:00:1f.2-ata-2.0 -> ../../sdb
lrwxrwxrwx 1 root root 9 Mar 17 21:33 pci-0000:00:1f.2-ata-3.0 -> ../../sdc

So for sdb, I replaced the device name in values.yaml with the corresponding by-path entry, i.e.:

/dev/disk/by-path/pci-0000:00:1f.2-ata-2.0
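
The mapping shown in the listing can also be read programmatically. A minimal Python sketch, assuming a Linux host where /dev/disk/by-path entries are symlinks to the kernel device nodes; kernel_name is a hypothetical helper, not part of StarlingX or Rook:

```python
import os


def kernel_name(by_path_link: str) -> str:
    """Follow a /dev/disk/by-path symlink and return the kernel device name."""
    # realpath resolves the relative link target (e.g. ../../sdb) to an
    # absolute path; basename then strips the leading directories.
    return os.path.basename(os.path.realpath(by_path_link))
```

On the host in the listing, kernel_name("/dev/disk/by-path/pci-0000:00:1f.2-ata-2.0") would be expected to return sdb, the short form the bug description proposes for values.yaml.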

Giorgos K (bigbud41) wrote :

I had an issue with rook-ceph-apps and changed values.yaml to use the by-path name that maps to sdb, which I found by running:

ls -ltr /dev/disk/by-path/
total 0
lrwxrwxrwx 1 root root 9 Mar 17 14:51 pci-0000:00:1f.2-ata-4.0 -> ../../sr0
lrwxrwxrwx 1 root root 9 Mar 17 21:33 pci-0000:00:1f.2-ata-1.0 -> ../../sda
lrwxrwxrwx 1 root root 10 Mar 17 21:33 pci-0000:00:1f.2-ata-1.0-part5 -> ../../sda5
lrwxrwxrwx 1 root root 10 Mar 17 21:33 pci-0000:00:1f.2-ata-1.0-part2 -> ../../sda2
lrwxrwxrwx 1 root root 10 Mar 17 21:33 pci-0000:00:1f.2-ata-1.0-part6 -> ../../sda6
lrwxrwxrwx 1 root root 10 Mar 17 21:33 pci-0000:00:1f.2-ata-1.0-part3 -> ../../sda3
lrwxrwxrwx 1 root root 10 Mar 17 21:33 pci-0000:00:1f.2-ata-1.0-part4 -> ../../sda4
lrwxrwxrwx 1 root root 10 Mar 17 21:33 pci-0000:00:1f.2-ata-1.0-part1 -> ../../sda1
lrwxrwxrwx 1 root root 9 Mar 17 21:33 pci-0000:00:1f.2-ata-2.0 -> ../../sdb
lrwxrwxrwx 1 root root 9 Mar 17 21:33 pci-0000:00:1f.2-ata-3.0 -> ../../sdc

and taking the pci-0000:00:1f.2-ata-2.0 value.

Giorgos K (bigbud41) wrote :

Nope, it failed again. I will try sdb.

M Camp (mcamp859) wrote :

Review has merged, docs are updated, closing this LP.

Changed in starlingx:
status: In Progress → Fix Released
Giorgos K (bigbud41) wrote :

I tried again with a new VM, but it fails even with sdb. It was stuck at 67% for a long time and then failed. OpenStack would probably fail as well, because there is no mount for rook-ceph.

 rook-ceph-apps | 1.0-3 | rook-ceph-manifest | manifest.yaml | apply-failed | operation aborted, check logs for details
