Write a diagnostics script
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StackLight |
Fix Released
|
Wishlist
|
Swann Croiset |
Bug Description
It would be useful to have a utility script that would report the status of the various LMA services as well as the logs, the system metrics and the hardware characteristics of the node. This script should be installed on every deployed node during the deployment of the LMA collector. The script should be generic so that it supports all StackLight roles.
In addition, another script should be written: it would be executed from the Fuel node, run the first script on every node and gather all the data on the Fuel node.
List of things to collect (more can be added of course):
- log files from the various StackLight services: /var/log/
- configuration directories for the various StackLight services.
- output of /proc/cpuinfo
- output of the following commands (*):
- uptime
- dmesg | tail -100
- vmstat 1 10
- mpstat -P ALL 1 10
- pidstat 1 10
- iostat -xz 1 10
- sar -n DEV 1 10
- sar -n TCP,ETCP 1 10
- lshw
- df -h
- pcs status
- status of the StackLight services
(*) List inspired by http://
All the data should be put into an archive that the Fuel node (or something else) can retrieve later on.
Note that there's already a blueprint for tracking this feature [1] but this feature request is specifically for LMA.
[1] https:/
summary: |
- Write a utility script reporting the status of the LMA services + Write a utility script for helping with diagnosis |
description: | updated |
Changed in lma-toolchain: | |
milestone: | none → 1.0.0 |
summary: |
- Write a utility script for helping with diagnosis + Write a diagnostics script |
Changed in lma-toolchain: | |
milestone: | 1.0.0 → 0.10.0 |
Changed in lma-toolchain: | |
status: | Fix Committed → Fix Released |
I am not sure it is relevant but a diagnostics tool was recently developed by a Mirantis employee: https:/ /github. com/adobdin/ timmy.