Neutron DHCP agent failed to survive AMQP failover: UX after reboot of controller and compute instance can not get an address from DHCP

Bug #1484396 reported by Łukasz Oleś
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Fuel for OpenStack
Won't Fix
High
Fuel Library (Deprecated)
Mirantis OpenStack
Invalid
Medium
Łukasz Oleś
7.0.x
Won't Fix
Medium
MOS Oslo
8.0.x
Won't Fix
Medium
Łukasz Oleś

Bug Description

Steps to reproduce:

- Deploy Fuel ISO using VirtualBox
- Cteate env using default settings with one Controller and one Compute
- login to Horizon and create one instance and assign it net04 network
- lunch an instance
- instance should get ip from 192.168.111.0 network
- reboot Controller and Compute

After reboot the instance is started but it can not get the addres. New instances also starts but do not get address from DHCP.

{
   "build_id":"2015-08-10_19-24-50",
   "build_number":"157",
   "release_versions":{
      "2015.1.0-7.0":{
         "VERSION":{
            "build_id":"2015-08-10_19-24-50",
            "build_number":"157",
            "api":"1.0",
            "fuel-library_sha":"c4b2b9a1b464d7f538635cbfc9304a714ededc41",
            "nailgun_sha":"9a6ac9b08733ded67bc65345a37787886ff4249d",
            "feature_groups":[
               "mirantis"
            ],
            "fuel-nailgun-agent_sha":"e01693992d7a0304d926b922b43f3b747c35964c",
            "openstack_version":"2015.1.0-7.0",
            "fuel-agent_sha":"57145b1d8804389304cd04322ba0fb3dc9d30327",
            "production":"docker",
            "python-fuelclient_sha":"ec5c02b3848fe1d15e5b50b323c3dda030f8e1ac",
            "astute_sha":"e1d3a435e5df5b40cbfb1a3acf80b4176d15a2dc",
            "fuel-ostf_sha":"c7f745431aa3c147f2491c865e029e0ffea91c47",
            "release":"7.0",
            "fuelmain_sha":"826387d1a05a40e4849433598442024bb1e3b7c7"
         }
      }
   },
   "auth_required":true,
   "api":"1.0",
   "fuel-library_sha":"c4b2b9a1b464d7f538635cbfc9304a714ededc41",
   "nailgun_sha":"9a6ac9b08733ded67bc65345a37787886ff4249d",
   "feature_groups":[
      "mirantis"
   ],
   "fuel-nailgun-agent_sha":"e01693992d7a0304d926b922b43f3b747c35964c",
   "openstack_version":"2015.1.0-7.0",
   "fuel-agent_sha":"57145b1d8804389304cd04322ba0fb3dc9d30327",
   "production":"docker",
   "python-fuelclient_sha":"ec5c02b3848fe1d15e5b50b323c3dda030f8e1ac",
   "astute_sha":"e1d3a435e5df5b40cbfb1a3acf80b4176d15a2dc",
   "fuel-ostf_sha":"c7f745431aa3c147f2491c865e029e0ffea91c47",
   "release":"7.0",
   "fuelmain_sha":"826387d1a05a40e4849433598442024bb1e3b7c7"
}

Revision history for this message
Łukasz Oleś (loles) wrote :
Changed in fuel:
status: New → Confirmed
Revision history for this message
Bogdan Dobrelya (bogdando) wrote :

The repro steps seem doesn't match the logs, there was a "time jump":
2015-08-12T23:22:57.773286+00:00 info: INFO: p_rabbitmq-server: get_monitor(): get_monitor function ready to return 8
2015-08-13T06:32:43.689554+00:00 info: INFO: p_rabbitmq-server: get_monitor(): CHECK LEVEL IS: 0

Please elaborate which exactly actions were taken?

Changed in fuel:
status: Confirmed → Incomplete
Revision history for this message
Bogdan Dobrelya (bogdando) wrote :

The RC is DHCP agent's Oslo messaging code failed to survive AMQP failover: http://pastebin.com/9i0FH6Rv

Revision history for this message
Bogdan Dobrelya (bogdando) wrote :
Changed in fuel:
status: Incomplete → Won't Fix
Changed in mos:
assignee: nobody → MOS Oslo (mos-oslo)
milestone: none → 7.0
importance: Undecided → High
status: New → Confirmed
summary: - After reboot of controller and compute instance can not get an address
- from DHCP
+ Neutron DHCP agent failed to survive AMQP failover: UX after reboot of
+ controller and compute instance can not get an address from DHCP
Revision history for this message
Davanum Srinivas (DIMS) (dims-v) wrote :

Please reply to Bogdan's query in comment #2

Changed in mos:
status: Confirmed → Incomplete
Revision history for this message
Łukasz Oleś (loles) wrote :

Time jump has nothing to do with this bug. I installed Openstack on one day. Hibernated my laptop and on second day I started the tests.

Changed in mos:
status: Incomplete → New
Revision history for this message
Davanum Srinivas (DIMS) (dims-v) wrote :

How many times has this happened? Is this recreate able?

Revision history for this message
Łukasz Oleś (loles) wrote :

For me it happens every time after I reboot controller and compute nodes.

Revision history for this message
Davanum Srinivas (DIMS) (dims-v) wrote :

@loles, thanks for the info. Since this is a virtualbox environment, i am not sure if this merits a "High" Importance. If you do get a chance, please do a fresh install from the latest ISO, try doing what you are doing immediately after the install is over (and NOT hibernate the laptop etc). if that works and the problems start happening after hibernate, that would give us some data. Also, we would not expect production installations to hibernate either :)

Revision history for this message
Dmitry Mescheryakov (dmitrymex) wrote :

Lukasz, please move the bug to new state in case it is reproducible without hibernation. I am afraid that OpenStack is too imperfect yet for us to support hibernation of the host OS :-)

Changed in mos:
status: New → Incomplete
Revision history for this message
Vitaly Sedelnik (vsedelnik) wrote :

This issue happens in VB env only after hibernation. Changing the importance to Medium, Won't Fix for 7.0, Confirmed for 8.0

Revision history for this message
Dmitry Mescheryakov (dmitrymex) wrote :

Somehow the issue became confirmed again. Moving it back to incomplete. Łukasz, please move the bug to new state in case it is reproducible without hibernation.

Revision history for this message
Maciej Relewicz (rlu) wrote :

After talk to Lukasz, I closed the bug.

Revision history for this message
Dina Belova (dbelova) wrote :

Moving to Invalid after more than a month in the incomplete state.

Changed in mos:
status: Incomplete → Invalid
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.