Activity log for bug #2012104

Date Who What changed Old value New value Message
2023-03-17 18:49:20 Peter Sabaini bug added bug
2023-03-17 18:50:34 Peter Sabaini description For one of our compute machines I'm seeing two network agents that appear unhealthy: $ os network agent list | fgrep "register deleted" | compute1 | OVN Controller agent | ("Chassis" register deleted) | | XXX | UP | ovn-controller | | c085d57a-3a2b-4f97-8250-23d3f914b078 | OVN Metadata agent | ("Chassis" register deleted) | | XXX | UP | neutron-ovn-metadata-agent | The ("Chassis" register deleted) message appears to come from the fix for this: https://bugs.launchpad.net/neutron/+bug/1951149 Searching for that external id I can find this private chassis and it's chassis indeed seems empty: $ sudo ovn-sbctl find chassis-private | grep -A 5 e621e0fb-83d3-4a18-82b3-c842996548ed' _uuid : e621e0fb-83d3-4a18-82b3-c842996548ed chassis : [] external_ids : {"neutron:liveness_check_at"="2022-06-17T08:43:33.393639+00:00", "neutron:metadata_liveness_check_at"="2022-06-17T02:27:21.309718+00:00", "neutron:ovn-metadata-id"="c085d57a-3a2b-4f97-8250-23d3f914b078", "ne utron:ovn-metadata-sb-cfg"="150397"} name : compute1 nb_cfg : 150397 nb_cfg_timestamp : 1657729945956 But there's also: $ sudo ovn-sbctl find chassis hostname=compute1.stack _uuid : 164cb56b-1a3c-4401-bc52-6fa5e58d8f2a encaps : [c442312a-9dfa-4ffe-9db7-afe5f9055962] external_ids : {datapath-type=system, iface-types="bareudp,erspan,geneve,gre,gtpu,internal,ip6erspan,ip6gre,lisp,patch,stt,system,tap,vxlan", is-interconn="false", "neutron:ovn-metadata-sb-cfg"="250161", ovn-bridge-mappings="", ovn-chassis-mac-mappings="", ovn-cms-options="", ovn-enable-lflow-cache="true", ovn-limit-lflow-cache="", ovn-memlimit-lflow-cache-kb="", ovn-monitor-all="false", ovn-trim-limit-lflow-cache="", ovn-trim-timeout-ms="", ovn-trim-wmark-perc-lflow-cache="", port-up-notif="true"} hostname : compute1.stack name : compute1.stack nb_cfg : 0 other_config : {datapath-type=system, iface-types="bareudp,erspan,geneve,gre,gtpu,internal,ip6erspan,ip6gre,lisp,patch,stt,system,tap,vxlan", is-interconn="false", ovn-bridge-mappings="", ovn-chassis-mac-mappings="", ovn-cms-options="", ovn-enable-lflow-cache="true", ovn-limit-lflow-cache="", ovn-memlimit-lflow-cache-kb="", ovn-monitor-all="false", ovn-trim-limit-lflow-cache="", ovn-trim-timeout-ms="", ovn-trim-wmark-perc-lflow-cache="", port-up-notif="true"} transport_zones : [] vtep_logical_switches: [] $ sudo ovn-sbctl find chassis-private chassis=164cb56b-1a3c-4401-bc52-6fa5e58d8f2a _uuid : cbec617d-19dc-481c-ba99-b4132244773c chassis : 164cb56b-1a3c-4401-bc52-6fa5e58d8f2a external_ids : {"neutron:ovn-metadata-id"="3328a0c7-081b-58a9-9e91-baf5c8c259cd", "neutron:ovn-metadata-sb-cfg"="312321"} name : compute1.stack nb_cfg : 312321 nb_cfg_timestamp : 1679042105359 Which seems to be a correct entry -- should neutron not pick up this entry rather than the one with "chassis : []"? Software versions: ii neutron-server 2:20.2.0-0ubuntu1~cloud0 all Neutron is a virtual network service for Openstack - server ii ovn-central 22.03.0-0ubuntu1~cloud0 amd64 OVN central components Distributor ID: Ubuntu Description: Ubuntu 20.04.4 LTS Release: 20.04 Codename: focal Please let me know if I can provide more diagnostics. For one of our compute machines I'm seeing two network agents that appear unhealthy: ``` $ os network agent list | fgrep "register deleted" | compute1 | OVN Controller agent | ("Chassis" register deleted) | | XXX | UP | ovn-controller | | c085d57a-3a2b-4f97-8250-23d3f914b078 | OVN Metadata agent | ("Chassis" register deleted) | | XXX | UP | neutron-ovn-metadata-agent | ``` The ("Chassis" register deleted) message appears to come from the fix for this: https://bugs.launchpad.net/neutron/+bug/1951149 Searching for that external id I can find this private chassis and it's chassis indeed seems empty: ``` $ sudo ovn-sbctl find chassis-private | grep -A 5 e621e0fb-83d3-4a18-82b3-c842996548ed' _uuid : e621e0fb-83d3-4a18-82b3-c842996548ed chassis : [] external_ids : {"neutron:liveness_check_at"="2022-06-17T08:43:33.393639+00:00", "neutron:metadata_liveness_check_at"="2022-06-17T02:27:21.309718+00:00", "neutron:ovn-metadata-id"="c085d57a-3a2b-4f97-8250-23d3f914b078", "ne utron:ovn-metadata-sb-cfg"="150397"} name : compute1 nb_cfg : 150397 nb_cfg_timestamp : 1657729945956 ``` But there's also: ``` $ sudo ovn-sbctl find chassis hostname=compute1.stack _uuid : 164cb56b-1a3c-4401-bc52-6fa5e58d8f2a encaps : [c442312a-9dfa-4ffe-9db7-afe5f9055962] external_ids : {datapath-type=system, iface-types="bareudp,erspan,geneve,gre,gtpu,internal,ip6erspan,ip6gre,lisp,patch,stt,system,tap,vxlan", is-interconn="false", "neutron:ovn-metadata-sb-cfg"="250161", ovn-bridge-mappings="", ovn-chassis-mac-mappings="", ovn-cms-options="", ovn-enable-lflow-cache="true", ovn-limit-lflow-cache="", ovn-memlimit-lflow-cache-kb="", ovn-monitor-all="false", ovn-trim-limit-lflow-cache="", ovn-trim-timeout-ms="", ovn-trim-wmark-perc-lflow-cache="", port-up-notif="true"} hostname : compute1.stack name : compute1.stack nb_cfg : 0 other_config : {datapath-type=system, iface-types="bareudp,erspan,geneve,gre,gtpu,internal,ip6erspan,ip6gre,lisp,patch,stt,system,tap,vxlan", is-interconn="false", ovn-bridge-mappings="", ovn-chassis-mac-mappings="", ovn-cms-options="", ovn-enable-lflow-cache="true", ovn-limit-lflow-cache="", ovn-memlimit-lflow-cache-kb="", ovn-monitor-all="false", ovn-trim-limit-lflow-cache="", ovn-trim-timeout-ms="", ovn-trim-wmark-perc-lflow-cache="", port-up-notif="true"} transport_zones : [] vtep_logical_switches: [] $ sudo ovn-sbctl find chassis-private chassis=164cb56b-1a3c-4401-bc52-6fa5e58d8f2a _uuid : cbec617d-19dc-481c-ba99-b4132244773c chassis : 164cb56b-1a3c-4401-bc52-6fa5e58d8f2a external_ids : {"neutron:ovn-metadata-id"="3328a0c7-081b-58a9-9e91-baf5c8c259cd", "neutron:ovn-metadata-sb-cfg"="312321"} name : compute1.stack nb_cfg : 312321 nb_cfg_timestamp : 1679042105359 ``` Which seems to be a correct entry -- should neutron not pick up this entry rather than the one with "chassis : []"? Software versions: ii neutron-server 2:20.2.0-0ubuntu1~cloud0 all Neutron is a virtual network service for Openstack - server ii ovn-central 22.03.0-0ubuntu1~cloud0 amd64 OVN central components Distributor ID: Ubuntu Description: Ubuntu 20.04.4 LTS Release: 20.04 Codename: focal Please let me know if I can provide more diagnostics.
2023-03-17 21:37:07 Brian Haley tags ovn
2023-03-17 21:42:51 Brian Haley bug added subscriber Brian Haley
2023-03-17 21:43:10 Brian Haley neutron: status New Incomplete
2023-03-20 09:37:06 Peter Sabaini neutron: status Incomplete New
2023-03-20 14:18:24 Brian Haley neutron: status New Confirmed
2023-03-20 14:18:33 Brian Haley neutron: importance Undecided Medium
2023-03-21 14:27:29 Rodolfo Alonso neutron: assignee Rodolfo Alonso (rodolfo-alonso-hernandez)
2023-03-29 10:56:52 Rodolfo Alonso bug added subscriber Rodolfo Alonso
2023-04-21 10:20:38 OpenStack Infra neutron: status Confirmed In Progress
2023-04-27 14:21:58 OpenStack Infra neutron: status In Progress Fix Released