incorrect PID matching for neutron-*-agent in ocf scripts

Bug #1269334 reported by Sergey Vasilenko
20
This bug affects 3 people
Affects Status Importance Assigned to Milestone
Fuel for OpenStack
Fix Released
Critical
Sergey Vasilenko

Bug Description

Its possible in some situations for the pgrep to return process ID from the wrong process. After this Pacemaker kill one of the agents when it thinks it is no longer running.

We should use pgrep with -o for catch only oldest's process ID.

Changed in fuel:
assignee: nobody → Sergey Vasilenko (xenolog)
importance: Undecided → Medium
status: New → Triaged
tags: added: library neutron
Changed in fuel:
milestone: none → 4.1
Andrew Woodward (xarses)
summary: - incorrect PID katching for neutron-*-agent in ocf scripts
+ incorrect PID matching for neutron-*-agent in ocf scripts
tags: added: backport-4.0
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to fuel-library (master)

Fix proposed to branch: master
Review: https://review.openstack.org/67101

Changed in fuel:
status: Triaged → In Progress
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to fuel-library (stable/4.0)

Fix proposed to branch: stable/4.0
Review: https://review.openstack.org/67118

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to fuel-library (stable/4.0)

Reviewed: https://review.openstack.org/67118
Committed: https://git.openstack.org/cgit/stackforge/fuel-library/commit/?id=fbfe4c087476092fb7a4b93a95166594f5e58c57
Submitter: Jenkins
Branch: stable/4.0

commit fbfe4c087476092fb7a4b93a95166594f5e58c57
Author: Sergey Vasilenko <email address hidden>
Date: Thu Jan 16 14:36:03 2014 +0400

    Fixed incorrectt PID matching for neutron-*-agents

    Change-Id: I15ea6cce54135f9d91080c736ba496c3002aa182
    Closes-bug: #1269334

Changed in fuel:
status: In Progress → Fix Committed
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to fuel-library (master)

Reviewed: https://review.openstack.org/67101
Committed: https://git.openstack.org/cgit/stackforge/fuel-library/commit/?id=c677020ed7412b6eac092e795b84fb7a13cb12e1
Submitter: Jenkins
Branch: master

commit c677020ed7412b6eac092e795b84fb7a13cb12e1
Author: Sergey Vasilenko <email address hidden>
Date: Thu Jan 16 14:36:03 2014 +0400

    Fixed incorrectt PID matching for neutron-*-agents

    Change-Id: I15ea6cce54135f9d91080c736ba496c3002aa182
    Closes-bug: #1269334

Andrew Woodward (xarses)
description: updated
description: updated
tags: added: customer-found
Changed in fuel:
importance: Medium → Critical
tags: added: release-notes
Changed in fuel:
status: Fix Committed → Fix Released
Revision history for this message
Pavel Vaylov (pvaylov) wrote :

Does this solution tested on live 4.0 environment ?
After I've made changes in 4 ocf files and restarted corosync I can start dhcp and l3 agents only by:

"crm_resource --resource p_neutron-l3-agent --force-start"

"crm status" does not show agents as "started"

I'm not sure if it related to patch or not.
Will continue debugging.

Revision history for this message
Pavel Vaylov (pvaylov) wrote :

I was able to start dhcp and l3 agents.

By some reason (due my experiments I think) "clone_p_neutron-metadata-agent" has been stopped.
But dhcp and l3 agents depends on "clone_p_neutron-metadata-agent".

[root@node-6 ~]# crm_resource --resource p_neutron-dhcp-agent -A
* p_neutron-dhcp-agent
    clone_p_neutron-metadata-agent (score=INFINITY, id=dhcp-with-metadata)
    clone_p_neutron-openvswitch-agent (score=INFINITY, id=dhcp-with-ovs)
    p_neutron-l3-agent (score=-100, id=dhcp-without-l3)
        clone_p_neutron-metadata-agent (id=l3-with-metadata - loop)
        clone_p_neutron-openvswitch-agent (id=l3-with-ovs - loop)

After I've started "clone_p_neutron-metadata-agent" dhcp and l3 agents started to.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Duplicates of this bug

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.