Activity log for bug #1914807

Date Who What changed Old value New value Message
2021-02-05 18:29:33 Dan Streetman bug added bug
2021-02-05 18:30:01 Dan Streetman bug added subscriber Victor Tapia
2021-02-05 18:30:12 Dan Streetman tags seg sts
2021-02-05 18:31:19 Dan Streetman description Seen both with maas 2.8 as well as maas 2.9; after running for a while, deployments stop working, and the rackd log has many messages like: 2021-02-05 18:13:56 provisioningserver.rpc.clusterservice: [critical] Failed to contact region. (While requesting RPC info at http://10.230.56.2:5240/MAAS). Traceback (most recent call last): File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 460, in callback self._startRunCallbacks(result) File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 568, in _startRunCallbacks self._runCallbacks() File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 654, in _runCallbacks current.result = callback(current.result, *args, **kw) File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 1475, in gotResult _inlineCallbacks(r, g, status) --- <exception caught here> --- File "/snap/maas/11322/lib/python3.8/site-packages/provisioningserver/rpc/clusterservice.py", line 1367, in _doUpdate eventloops, maas_url = yield self._get_rpc_info(urls) File "/snap/maas/11322/lib/python3.8/site-packages/provisioningserver/rpc/clusterservice.py", line 1631, in _get_rpc_info raise config_exc File "/snap/maas/11322/lib/python3.8/site-packages/provisioningserver/rpc/clusterservice.py", line 1602, in _get_rpc_info eventloops, maas_url = yield self._parallel_fetch_rpc_info(urls) File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 654, in _runCallbacks current.result = callback(current.result, *args, **kw) File "/snap/maas/11322/lib/python3.8/site-packages/provisioningserver/rpc/clusterservice.py", line 1576, in handle_responses errors[0].raiseException() File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/python/failure.py", line 467, in raiseException raise self.value.with_traceback(self.tb) File "/snap/maas/11322/lib/python3.8/site-packages/provisioningserver/rpc/clusterservice.py", line 1537, in _serial_fetch_rpc_info raise last_exc File "/snap/maas/11322/lib/python3.8/site-packages/provisioningserver/rpc/clusterservice.py", line 1529, in _serial_fetch_rpc_info response = yield self._fetch_rpc_info(url, orig_url) File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 1416, in _inlineCallbacks result = result.throwExceptionIntoGenerator(g) File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/python/failure.py", line 491, in throwExceptionIntoGenerator return g.throw(self.type, self.value, self.tb) File "/snap/maas/11322/lib/python3.8/site-packages/provisioningserver/rpc/clusterservice.py", line 1631, in _get_rpc_info raise config_exc File "/snap/maas/11322/lib/python3.8/site-packages/provisioningserver/rpc/clusterservice.py", line 1602, in _get_rpc_info eventloops, maas_url = yield self._parallel_fetch_rpc_info(urls) File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 654, in _runCallbacks current.result = callback(current.result, *args, **kw) File "/snap/maas/11322/lib/python3.8/site-packages/provisioningserver/rpc/clusterservice.py", line 1576, in handle_responses errors[0].raiseException() File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/python/failure.py", line 467, in raiseException raise self.value.with_traceback(self.tb) File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 1416, in _inlineCallbacks result = result.throwExceptionIntoGenerator(g) File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/python/failure.py", line 491, in throwExceptionIntoGenerator return g.throw(self.type, self.value, self.tb) File "/snap/maas/11322/lib/python3.8/site-packages/provisioningserver/rpc/clusterservice.py", line 1537, in _serial_fetch_rpc_info raise last_exc File "/snap/maas/11322/lib/python3.8/site-packages/provisioningserver/rpc/clusterservice.py", line 1529, in _serial_fetch_rpc_info response = yield self._fetch_rpc_info(url, orig_url) twisted.internet.error.ConnectingCancelledError: HostnameAddress(hostname=b'10.230.56.2', port=5240) The region controller appears to be working fine and there are no errors in the regiond log. To get maas working again, the system must be rebooted, or the maas snap service must be restarted. However, the problem being occurring again after some number of hours or days. Seen both with maas 2.8 as well as maas 2.9; after running for a while, deployments stop working, and the rackd log has many messages like: 2021-02-05 18:13:56 provisioningserver.rpc.clusterservice: [critical] Failed to contact region. (While requesting RPC info at http://10.230.56.2:5240/MAAS).         Traceback (most recent call last):           File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 460, in callback             self._startRunCallbacks(result)           File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 568, in _startRunCallbacks             self._runCallbacks()           File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 654, in _runCallbacks             current.result = callback(current.result, *args, **kw)           File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 1475, in gotResult             _inlineCallbacks(r, g, status)         --- <exception caught here> ---           File "/snap/maas/11322/lib/python3.8/site-packages/provisioningserver/rpc/clusterservice.py", line 1367, in _doUpdate             eventloops, maas_url = yield self._get_rpc_info(urls)           File "/snap/maas/11322/lib/python3.8/site-packages/provisioningserver/rpc/clusterservice.py", line 1631, in _get_rpc_info             raise config_exc           File "/snap/maas/11322/lib/python3.8/site-packages/provisioningserver/rpc/clusterservice.py", line 1602, in _get_rpc_info             eventloops, maas_url = yield self._parallel_fetch_rpc_info(urls)           File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 654, in _runCallbacks             current.result = callback(current.result, *args, **kw)           File "/snap/maas/11322/lib/python3.8/site-packages/provisioningserver/rpc/clusterservice.py", line 1576, in handle_responses             errors[0].raiseException()           File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/python/failure.py", line 467, in raiseException             raise self.value.with_traceback(self.tb)           File "/snap/maas/11322/lib/python3.8/site-packages/provisioningserver/rpc/clusterservice.py", line 1537, in _serial_fetch_rpc_info             raise last_exc           File "/snap/maas/11322/lib/python3.8/site-packages/provisioningserver/rpc/clusterservice.py", line 1529, in _serial_fetch_rpc_info             response = yield self._fetch_rpc_info(url, orig_url)           File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 1416, in _inlineCallbacks             result = result.throwExceptionIntoGenerator(g)           File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/python/failure.py", line 491, in throwExceptionIntoGenerator             return g.throw(self.type, self.value, self.tb)           File "/snap/maas/11322/lib/python3.8/site-packages/provisioningserver/rpc/clusterservice.py", line 1631, in _get_rpc_info             raise config_exc           File "/snap/maas/11322/lib/python3.8/site-packages/provisioningserver/rpc/clusterservice.py", line 1602, in _get_rpc_info             eventloops, maas_url = yield self._parallel_fetch_rpc_info(urls)           File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 654, in _runCallbacks             current.result = callback(current.result, *args, **kw)           File "/snap/maas/11322/lib/python3.8/site-packages/provisioningserver/rpc/clusterservice.py", line 1576, in handle_responses             errors[0].raiseException()           File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/python/failure.py", line 467, in raiseException             raise self.value.with_traceback(self.tb)           File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/internet/defer.py", line 1416, in _inlineCallbacks             result = result.throwExceptionIntoGenerator(g)           File "/snap/maas/11322/usr/lib/python3/dist-packages/twisted/python/failure.py", line 491, in throwExceptionIntoGenerator             return g.throw(self.type, self.value, self.tb)           File "/snap/maas/11322/lib/python3.8/site-packages/provisioningserver/rpc/clusterservice.py", line 1537, in _serial_fetch_rpc_info             raise last_exc           File "/snap/maas/11322/lib/python3.8/site-packages/provisioningserver/rpc/clusterservice.py", line 1529, in _serial_fetch_rpc_info             response = yield self._fetch_rpc_info(url, orig_url)         twisted.internet.error.ConnectingCancelledError: HostnameAddress(hostname=b'10.230.56.2', port=5240) The region controller appears to be working fine and there are no errors in the regiond log. This deployment uses a single region and single rack, which are both located on a single VM. To get maas working again, the system must be rebooted, or the maas snap service must be restarted. However, the problem being occurring again after some number of hours or days.
2021-02-18 21:22:21 Dominique Poulain bug added subscriber Dominique Poulain
2021-02-19 01:38:38 Launchpad Janitor maas (Ubuntu): status New Confirmed
2021-03-03 17:36:50 Adam Collard bug task added maas
2021-03-04 11:04:34 Björn Tillenius maas: status New Incomplete
2021-03-04 14:32:00 Dan Streetman attachment added rackd.log https://bugs.launchpad.net/ubuntu/+source/maas/+bug/1914807/+attachment/5472846/+files/rackd.log
2021-03-04 14:32:25 Dan Streetman attachment removed rackd.log https://bugs.launchpad.net/ubuntu/+source/maas/+bug/1914807/+attachment/5472846/+files/rackd.log
2021-03-04 14:33:33 Dan Streetman attachment added logs.tar.bz2 https://bugs.launchpad.net/ubuntu/+source/maas/+bug/1914807/+attachment/5472847/+files/logs.tar.bz2
2021-03-04 14:36:27 Dan Streetman maas: status Incomplete New
2021-07-07 22:33:36 Bill Wear maas: status New Triaged
2021-08-16 12:19:25 Björn Tillenius maas: status Triaged Incomplete
2021-11-11 10:09:41 Alberto Donato maas: status Incomplete New
2021-11-11 10:09:45 Alberto Donato maas: status New Incomplete
2023-01-30 20:21:44 Joel Davidow bug added subscriber Joel Davidow
2023-03-09 18:59:45 Mauricio Faria de Oliveira maas (Ubuntu): status Confirmed Incomplete
2023-03-10 01:40:08 Dan Streetman removed subscriber Dan Streetman
2023-03-12 17:26:36 Mauricio Faria de Oliveira maas: status Incomplete Invalid
2023-03-12 17:26:40 Mauricio Faria de Oliveira maas (Ubuntu): status Incomplete Invalid