Activity log for bug #1772601

Date Who What changed Old value New value Message
2018-05-22 08:14:02 Stuart Bishop bug added bug
2018-05-22 08:15:40 William Grant bug added subscriber William Grant
2018-05-22 08:29:42 Stuart Bishop description Upgrading a model to 2.3.7 caused an unexpected outage when PostgreSQL was shutdown. The charm thankfully recovered when the first hook was run post-upgrade, so the outage was limited to 60 seconds. It seems that upgradin jujud also sends kill signals to all processes spawned by its subprocesses, such as the PostgreSQL daemon started by running 'pg_ctlcluster 9.5 main start' from a hook. We think everything linked by the cgroup gets affected, so even though pg_ctlcluster switches user id and group it is killed: # cat /proc/4776/cgroup # The root PostgreSQL pid 11:cpu,cpuacct:/system.slice/jujud-unit-postgresql-1.service 10:pids:/system.slice/jujud-unit-postgresql-1.service 9:devices:/system.slice/jujud-unit-postgresql-1.service 8:perf_event:/ 7:memory:/system.slice/jujud-unit-postgresql-1.service 6:freezer:/ 5:net_cls,net_prio:/ 4:hugetlb:/ 3:blkio:/system.slice/jujud-unit-postgresql-1.service 2:cpuset:/ 1:name=systemd:/system.slice/jujud-unit-postgresql-1.service To reproduce, deploy a cs:postgresql unit with an older version of juju, and upgrade to 2.3.7. /var/log/postgresql-9.5-main.log will show the shutdown signal being received, and starting up again when the next hook is run. systemd services seem unaffected, so pgbouncer (started by systemctl start pgbouncer) did not get signaled. Upgrading a model to 2.3.7 caused an unexpected outage when PostgreSQL was shutdown. The charm thankfully recovered when the first hook was run post-upgrade, so the outage was limited to 60 seconds. It seems that upgradeing jujud also sends kill signals to all processes spawned by its subprocesses, such as the PostgreSQL daemon started by running 'pg_ctlcluster 9.5 main start' from a hook. We think everything linked by the cgroup gets affected, so even though pg_ctlcluster switches user id and group it is killed: # cat /proc/4776/cgroup # The root PostgreSQL pid 11:cpu,cpuacct:/system.slice/jujud-unit-postgresql-1.service 10:pids:/system.slice/jujud-unit-postgresql-1.service 9:devices:/system.slice/jujud-unit-postgresql-1.service 8:perf_event:/ 7:memory:/system.slice/jujud-unit-postgresql-1.service 6:freezer:/ 5:net_cls,net_prio:/ 4:hugetlb:/ 3:blkio:/system.slice/jujud-unit-postgresql-1.service 2:cpuset:/ 1:name=systemd:/system.slice/jujud-unit-postgresql-1.service To reproduce, deploy a cs:postgresql unit with an older version of juju, and upgrade to 2.3.7. /var/log/postgresql-9.5-main.log will show the shutdown signal being received, and starting up again when the next hook is run. systemd services seem unaffected, so pgbouncer (started by systemctl start pgbouncer) did not get signaled.
2018-05-22 09:05:00 Nobuto Murata bug added subscriber Nobuto Murata
2018-05-22 14:53:39 Stuart Bishop tags canonical-is
2018-05-22 14:53:49 Stuart Bishop bug added subscriber The Canonical Sysadmins
2018-06-06 17:48:16 John A Meinel juju: importance Undecided High
2018-06-06 17:48:16 John A Meinel juju: status New Incomplete
2018-06-11 09:47:59 Stuart Bishop juju: status Incomplete New
2018-06-12 12:38:16 Richard Harding juju: status New Triaged
2018-06-12 12:45:37 Richard Harding bug added subscriber Richard Harding
2022-11-03 16:41:44 Canonical Juju QA Bot juju: importance High Low
2022-11-03 16:41:45 Canonical Juju QA Bot tags canonical-is canonical-is expirebugs-bot