default IPMI retry timeout is too long
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Ironic |
Fix Released
|
Low
|
John Trowbridge |
Bug Description
If a node is enrolled with an IPMI IP address that is unresponsive, this causes the sync power state periodic task to hang for about 10 minutes by default. While it is configurable, the default is much too high.
Here are logs with the default [ipmi] retry_interval of 60 seconds. Note the timestamps in this log - 12 minutes for the failure to be logged, with no periodic task activity during that time, and the node remains locked for the duration.
2014-10-20 11:02:14.544 DEBUG ironic.
2014-10-20 11:14:51.653 WARNING ironic.
Command: ipmitool -I lanplus -H 1.2.3.4 -L ADMINISTRATOR -U admin -R 12 -N 5 -f /tmp/tmpPuuGU4 power status
Exit code: 1
Stdout: ''
Stderr: 'Error: Unable to establish IPMI v2 / RMCP+ session\nError: Unable to establish IPMI v2 / RMCP+ session\nError: Unable to establish IPMI v2 / RMCP+ session\nUnable to get Chassis Power Status\n'.
2014-10-20 11:14:51.654 WARNING ironic.
2014-10-20 11:15:51.694 DEBUG ironic.
Here are logs with an [ipmi] retry_interval of 5 seconds. Again, note the timestamps - 30 seconds for the failure to be logged. I suggest changing the default value to 5 seconds.
2014-10-20 11:23:56.798 DEBUG ironic.
2014-10-20 11:24:26.954 WARNING ironic.
Command: ipmitool -I lanplus -H 1.2.3.4 -L ADMINISTRATOR -U admin -R 1 -N 5 -f /tmp/tmp2tg79p power status
Exit code: 1
Stdout: ''
Stderr: 'Error: Unable to establish IPMI v2 / RMCP+ session\nError: Unable to establish IPMI v2 / RMCP+ session\nError: Unable to establish IPMI v2 / RMCP+ session\nUnable to get Chassis Power Status\n'.
2014-10-20 11:24:26.958 WARNING ironic.
2014-10-20 11:25:26.983 DEBUG ironic.
For reference, version information:
$ ipmitool -V
ipmitool version 1.8.13
$ dpkg-query --list 'ipmi*' | grep ipmitool | awk '{print $3}'
1.8.13-1ubuntu0.
$ lsb_release -a | grep Desc
Description: Ubuntu 14.04.1 LTS
Ironic version is commit SHA 4589ba37077687b
Changed in ironic: | |
status: | New → Confirmed |
tags: | added: ipmi |
Changed in ironic: | |
importance: | Undecided → Low |
Changed in ironic: | |
milestone: | none → kilo-3 |
status: | Fix Committed → Fix Released |
Changed in ironic: | |
milestone: | kilo-3 → 2015.1.0 |
Fix proposed to branch: master /review. openstack. org/131296
Review: https:/