CentOS fails due to deployment timeout
Bug #1384332 reported by Ryan Moe
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Fuel for OpenStack | Fix Committed | High | Vladimir Sharshov | | |
Bug Description
This happens on CentOS with Neutron VLAN.
The deployment times out because Astute keeps re-running puppet on the primary controller. There are no apparent errors in the puppet logs, but Astute re-runs puppet because the lockfile created by daemonize is never removed.
From the astute logs:
"Process not running but not empty lockfile is present. Trying to remove lockfile...ok."
After that Astute re-runs puppet and the same error occurs until finally the deployment times out.
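To make the reported condition concrete, here is a minimal sketch of a stale-lockfile check. It is not the code from the MCollective 'puppetd' agent or from Astute. It assumes the lockfile holds the PID of the process that created it, which this report does not confirm, and the names `pid_is_running` and `lockfile_state` are hypothetical.

```python
import os


def pid_is_running(pid: int) -> bool:
    """Return True if a process with this PID exists (POSIX only)."""
    try:
        os.kill(pid, 0)  # signal 0 only checks for existence, nothing is delivered
    except ProcessLookupError:
        return False
    except PermissionError:
        return True  # the process exists but belongs to another user
    return True


def lockfile_state(path: str) -> str:
    """Classify a lockfile as 'absent', 'empty', 'running' or 'stale'.

    Assumption: a non-empty lockfile contains the PID of its owner.
    """
    try:
        with open(path) as f:
            content = f.read().strip()
    except FileNotFoundError:
        return "absent"
    if not content:
        return "empty"
    try:
        pid = int(content)
    except ValueError:
        return "stale"  # non-empty, but no usable PID inside
    if pid <= 0:
        return "stale"  # 0 and negative values address process groups, not a PID
    return "running" if pid_is_running(pid) else "stale"
```

Under that assumption, the "stale" result corresponds to the "Process not running but not empty lockfile is present" message quoted above.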
Changed in fuel: | |
status: | New → Confirmed |
importance: | Undecided → High |
milestone: | none → 6.0 |
tags: | added: astute |
Changed in fuel: | |
status: | In Progress → Fix Committed |
Ryan Moe (rmoe) is right.
For some reason daemonize does not remove the lock file.
Sun Oct 19 18:08:16 +0000 2014 Puppet (debug): Finishing transaction 70038095814780
Sun Oct 19 18:08:16 +0000 2014 Puppet (debug): Storing state
Sun Oct 19 18:08:16 +0000 2014 Puppet (info): Creating state file /var/lib/puppet/state/state.yaml
Sun Oct 19 18:08:16 +0000 2014 Puppet (debug): Stored state in 0.09 seconds
Sun Oct 19 18:08:16 +0000 2014 Puppet (notice): Finished catalog run in 3802.51 seconds
Sun Oct 19 18:08:19 +0000 2014 Puppet (info): Loading facts in /etc/puppet/modules/corosync/lib/facter/pacemaker_hostname.rb
2014-10-19T18:08:18 debug: [411] 6b8d7ce1-9af5-45f7-89d5-7f4ef28ee238: MC agent 'puppetd', method 'last_run_summary', results: {:sender=>"3", :statuscode=>0, :statusmsg=>"OK", <...>, :runtime=>2, :enabled=>1, :err_msg=>"Process not running but not empty lockfile is present. Trying to remove lockfile...ok.", :version=>{"config"=>1413738274, "puppet"=>"3.4.2"}, :idling=>0}}
And again:
Sun Oct 19 18:33:08 +0000 2014 Puppet (debug): Finishing transaction 70257381676520
Sun Oct 19 18:33:08 +0000 2014 Puppet (debug): Storing state
Sun Oct 19 18:33:08 +0000 2014 Puppet (debug): Stored state in 0.36 seconds
Sun Oct 19 18:33:08 +0000 2014 Puppet (notice): Finished catalog run in 1461.06 seconds
Sun Oct 19 18:33:11 +0000 2014 Puppet (info): Loading facts in /etc/puppet/modules/corosync/lib/facter/pacemaker_hostname.rb
2014-10-19T18:33:11 debug: [411] 6b8d7ce1-9af5-45f7-89d5-7f4ef28ee238: MC agent 'puppetd', method 'last_run_summary', results: {:sender=>"3", :statuscode=>0, :statusmsg=>"OK", <...>, :runtime=>3, :enabled=>1, :err_msg=>"Process not running but not empty lockfile is present. Trying to remove lockfile...ok.", :version=>{"config"=>1413742101, "puppet"=>"3.4.2"}, :idling=>0}}
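For anyone trying to reproduce this, a small helper along these lines could list when the stale-lockfile message shows up in an Astute log. This is only a sketch. The function name `stale_lock_reports` is hypothetical, and the timestamp pattern assumes each Astute line starts like the two excerpts above.

```python
import re

# Message text copied from the :err_msg field in the excerpts above.
STALE_MSG = "Process not running but not empty lockfile is present"

# Leading timestamp of an Astute log line, e.g. 2014-10-19T18:08:18
# (assumed format, based only on the lines quoted in this comment).
TS_RE = re.compile(r"^(\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2})")


def stale_lock_reports(lines):
    """Return the timestamps of log lines that report a stale lockfile.

    A line that contains the message but has no leading timestamp
    is recorded as None.
    """
    hits = []
    for line in lines:
        if STALE_MSG in line:
            match = TS_RE.match(line)
            hits.append(match.group(1) if match else None)
    return hits
```

Passing an open log file (or any iterable of lines) returns one entry per report, so repeated entries spaced roughly one puppet run apart would match the re-run loop described in this bug.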
On the other hand, the puppet logs contain many errors. This is unexpected behavior and we should try to reproduce it.
release: "6.0"
build_number: "104"
build_id: "2014-10-19_18-46-46"