after upgrade, vms nic is detached from br-int
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
kolla |
Fix Released
|
High
|
Jeffrey Zhang | ||
Mitaka |
Fix Released
|
Undecided
|
Unassigned | ||
Newton |
Fix Released
|
Undecided
|
Unassigned |
Bug Description
I was running the 2 week old code
tried upgrading the neutron containers to latest trunk code
kolla-ansible upgrade -t neutron
Neutron containers upgraded successfully to latest code
Issue:
I have a VM which is created before upgrade lost the connectivity and is no more rechable
restarted the VM seems VM is not able to get the IP address
from console log of VM
<snip>
WARN: /etc/rc3.
Initializing random number generator... done.
Starting acpid: OK
cirros-ds 'local' up at 1.19
no results found for mode=local. up 1.22. searched: nocloud configdrive ec2
Starting network...
udhcpc (v1.20.1) started
Sending discover...
Sending discover...
Sending discover...
Usage: /sbin/cirros-dhcpc <up|down>
No lease, failing
WARN: /etc/rc3.
cirros-ds 'net' up at 181.43
checking http://
failed 1/20: up 181.44. request failed
failed 2/20: up 183.52. request failed
failed 3/20: up 185.52. request failed
failed 4/20: up 187.53. request failed
failed 5/20: up 189.53. request failed
failed 6/20: up 191.54. request failed
failed 7/20: up 193.54. request failed
failed 8/20: up 195.55. request failed
failed 9/20: up 197.55. request failed
failed 10/20: up 199.55. request failed
failed 11/20: up 201.56. request failed
failed 12/20: up 203.57. request failed
failed 13/20: up 205.57. request failed
failed 14/20: up 207.58. request failed
failed 15/20: up 209.58. request failed
failed 16/20: up 211.59. request failed
failed 17/20: up 213.59. request failed
failed 18/20: up 215.60. request failed
failed 19/20: up 217.60. request failed
failed 20/20: up 219.61. request failed
failed to read iid from metadata. tried 20
no results found for mode=net. up 221.61. searched: nocloud configdrive ec2
failed to get instance-id of datasource
Starting dropbear sshd: OK
=== system information ===
Platform: OpenStack Foundation OpenStack Nova
Container: none
Arch: x86_64
CPU(s): 1 @ 2693.508 MHz
Cores/Sockets/
Virt-type:
RAM Size: 2003MB
Disks:
NAME MAJ:MIN SIZE LABEL MOUNTPOINT
vda 253:0 21474836480
vda1 253:1 21459755520 cirros-rootfs /
=== sshd host keys ===
-----BEGIN SSH HOST KEY KEYS-----
ssh-rsa AAAAB3NzaC1yc2E
ssh-dss AAAAB3NzaC1kc3M
-----END SSH HOST KEY KEYS-----
=== network info ===
if-info: lo,up,127.
if-info: eth0,up,
=== datasource: None None ===
=== cirros: current=0.3.4 uptime=221.71 ===
route: fscanf
=== pinging gateway failed, debugging connection ===
############ debug start ##############
### /etc/init.d/sshd start
Starting dropbear sshd: OK
route: fscanf
### ifconfig -a
eth0 Link encap:Ethernet HWaddr FA:16:3E:26:B7:C4
inet6 addr: fe80::f816:
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:8 errors:0 dropped:0 overruns:0 frame:0
TX packets:8 errors:0 dropped:0 overruns:0 carrier:0
RX bytes:648 (648.0 B) TX bytes:1124 (1.0 KiB)
lo Link encap:Local Loopback
inet addr:127.0.0.1 Mask:255.0.0.0
inet6 addr: ::1/128 Scope:Host
UP LOOPBACK RUNNING MTU:16436 Metric:1
RX packets:0 errors:0 dropped:0 overruns:0 frame:0
TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
RX bytes:0 (0.0 B) TX bytes:0 (0.0 B)
</snip>
But the VMs created after the upgrade is reachable and able to get IP
summary: |
- after kolla-ansible upgrade of neutron containers OLD vms fails to gett - IP + after kolla-ansible upgrade of neutron containers OLD vms fails to get + IPAddress |
Changed in kolla: | |
status: | New → Triaged |
Changed in kolla: | |
status: | Triaged → Confirmed |
I reproduced this. after upgrade neutron. the vms is lost. but new created vm is pingable. physical OS reboot will save all vms.
After some debug, i found that the vms nic is detached from br-int during upgrade. If add the nic to br-int manually again. the vms are back.
still no idea why and how this happens. will continue debug on this.