2020-09-24 03:55:06
Michael Chapman
description
In a local test environment I misconfigured pools.yaml to point at bind servers that don't exist, then ran tempest. The result was about 1500 rndc processes that never appear to exit, causing significant load on the system. rndc doesn't have a timeout option that I can see in its man page, so it probably makes sense to add one via oslo processutils.
The environment is deployed using tripleo, so the processes are all in the designate-worker container:
[root@controller-1 ~]# podman stats designate_worker
ID NAME CPU % MEM USAGE / LIMIT MEM % NET IO BLOCK IO PIDS
18103f368b4e designate_worker 1.73% 19.13GB / 33.56GB 57.02% -- / -- 4.882GB / 8.536MB 2076
Steps to reproduce:
1. Deploy devstack with the bind backend.
2. Edit /etc/designate/pools.yaml so rndc_host doesn't point at a bind server
3. Update the pools: designate-manage pool update --file /etc/designate/pools.yaml
4. Run tempest: tox -e all-plugin -- designate
This happens despite designate-worker having only 2 configured workers:
/etc/designate.conf:
[service:worker]
workers = 2
It might make sense to maintain a task queue for the rndc commands so that only a limited number can be active at any given time.
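The task-queue idea could be sketched with a bounded semaphore around each rndc invocation; MAX_RNDC, run_rndc, and the runner callable below are hypothetical names for illustration, not actual designate internals:

```python
import threading

# Hypothetical sketch: cap how many rndc commands can be in flight at once.
# MAX_RNDC and run_rndc are illustrative names, not designate code.
MAX_RNDC = 4
_rndc_slots = threading.BoundedSemaphore(MAX_RNDC)

def run_rndc(args, runner):
    """Run one rndc command via `runner`, blocking while MAX_RNDC are active."""
    with _rndc_slots:
        return runner(args)
```

With something like this in place, an unreachable backend would leave at most MAX_RNDC rndc children running at any moment instead of ~1500.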
As noted in the description, rndc doesn't have a timeout option that I can see in the man page, so it might make sense to add one via oslo processutils. I think the built-in timeout is about 30 seconds.
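A minimal sketch of the desired timeout behavior, using Python's stdlib subprocess rather than the actual oslo processutils API (the function name and the 30-second default are assumptions based on the suggestion above):

```python
import subprocess

# Sketch only: kill a hung rndc after `timeout` seconds so children can't
# pile up. The 30s default mirrors the suggestion above; the function name
# is illustrative, not a real designate or processutils API.
def rndc_with_timeout(cmd, timeout=30):
    try:
        return subprocess.run(cmd, capture_output=True, timeout=timeout)
    except subprocess.TimeoutExpired:
        # subprocess.run kills the child before raising TimeoutExpired.
        return None
```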