[MultiOS][Yocto] collectd.service failed to start after unlock

Bug #1897667 reported by Jackie Huang
Affects    Status         Importance  Assigned to   Milestone
StarlingX  Fix Committed  Low         Jackie Huang

Bug Description

Brief Description
-----------------
collectd.service fails to start after unlock, leaving the system in a degraded state.

Severity
--------
Major: System/Feature is usable but degraded

Steps to Reproduce
------------------
1. Build the image according to https://opendev.org/starlingx/meta-starlingx/src/branch/master/README.md
2. Install AIO simplex with the built image
3. Run the Ansible playbook and unlock the host
4. Check the system status after unlock

Expected Behavior
------------------
collectd.service should be started and the system should be in the available state.

Actual Behavior
----------------

[root@controller-0 ~(keystone_admin)]# system host-list
+----+--------------+-------------+----------------+-------------+--------------+
| id | hostname     | personality | administrative | operational | availability |
+----+--------------+-------------+----------------+-------------+--------------+
| 1  | controller-0 | controller  | unlocked       | enabled     | degraded     |
+----+--------------+-------------+----------------+-------------+--------------+

controller-0:~$ systemctl status collectd
● collectd.service - Collectd statistics daemon and extension services
   Loaded: loaded (/etc/systemd/system/collectd.service; enabled; vendor preset: enabled)
   Active: failed (Result: signal) since Tue 2020-09-29 03:36:31 UTC; 37min ago
     Docs: man:collectd(1)
           man:collectd.conf(5)
  Process: 377267 ExecStart=/usr/sbin/collectd (code=killed, signal=KILL)
  Process: 377453 ExecStartPost=/bin/bash -c echo $MAINPID > /var/run/collectd.pid (code=exited, status=0/SUCCESS)
  Process: 377490 ExecStopPost=/bin/rm -f /var/run/collectd.pid (code=exited, status=0/SUCCESS)
 Main PID: 377267 (code=killed, signal=KILL)
      CPU: 999ms

Reproducibility
---------------
Reproducible

System Configuration
--------------------
One node system, All-in-one simplex

Branch/Pull Time/Commit
-----------------------
Branch: master
Time: Sep 25 2020
Commit: 595181564502bc04028132b3af68ff551b6cbe47

Last Pass
---------

Timestamp/Logs
--------------

Test Activity
-------------

Workaround
----------

Changed in starlingx:
assignee: nobody → Jackie Huang (jackie-huang)
status: New → Confirmed
Jackie Huang (jackie-huang) wrote :

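Root cause: the path /bin/systemctll is incorrect. pmond tries to recover collectd through that path and the restart fails with "No such file or directory":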
user.log:2020-09-29T04:21:47.000 controller-0 collectd[544571]: info [544571.45216] controller-0 pmond mon pmonHdlr.cpp (1246) respawn_process : Info : Service:collectd
user.log:2020-09-29T04:21:47.000 controller-0 collectd[544571]: warning collectd recovery failed with method '/etc/systemd/system/collectd.service': (/bin/systemctll restart ) (2:No such file or directory)
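One way to avoid depending on a single hard-coded location is to probe a short list of candidate paths and use the first one that is executable. The sketch below is only an illustration of that idea, not the fix that was committed for this bug; the helper name resolve_bin and the candidate paths /bin/systemctl and /usr/bin/systemctl are assumptions.

```shell
#!/bin/sh
# Hypothetical helper: print the first candidate path that exists and is
# executable; return non-zero if none of them is.
resolve_bin() {
    for candidate in "$@"; do
        if [ -x "$candidate" ]; then
            printf '%s\n' "$candidate"
            return 0
        fi
    done
    return 1
}

# Assumed candidates: distributions differ on /bin versus /usr/bin.
SYSTEMCTL=$(resolve_bin /bin/systemctl /usr/bin/systemctl) || SYSTEMCTL=""

if [ -n "$SYSTEMCTL" ]; then
    echo "systemctl found at: $SYSTEMCTL"
else
    echo "systemctl not found in the candidate paths" >&2
fi
```

Running this on the affected host would show which location is valid on the Yocto-based image, which can then be compared with the path reported in the pmond log above.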

Changed in starlingx:
status: Confirmed → In Progress
description: updated
Changed in starlingx:
status: In Progress → Fix Committed
Ghada Khalil (gkhalil) wrote :

Low / doesn't gate the next release, as this appears to be prep work for multi-OS support

Changed in starlingx:
importance: Undecided → Low