snmpd stop on host stops snmpd on LXD containers

Bug #1720109 reported by Arno van Huyssteen
14
This bug affects 1 person
Affects Status Importance Assigned to Milestone
net-snmp (Ubuntu)
Fix Released
Medium
Arno van Huyssteen
Xenial
Fix Released
Medium
Unassigned

Bug Description

[Impact]

 * Stopping snmpd in Xenial can stop container processes of the same
   binary

 * Fix by matching to the PIDFILE that is already tracked by snmpd

[Test Case]

 * Set up a host (e.g. a VM being Xenial)
 * Set up at least one (or more) containers in it
 * Install and start snmpd in the Host
 * Install and start snmpd in the containers
 * you'll now see like:
    root@ubuntu-vm:/var/log# ps fxaw|grep snmp
    29382 ? S 0:00 \_ /usr/sbin/snmpd -Lsd -Lf /dev/null -u snmp -g snmp
    29308 ? S 0:00 /usr/sbin/snmpd -Lsd -Lf /dev/null -u snmp -g snmp
  # One of these is from the container
 * now on the host stop your service, chances are you stop your guests instead
   root@ubuntu-vm:/var/log# service snmpd stop
   root@ubuntu-vm:/var/log# ps fxaw|grep snmp
   29426 pts/0 S+ 0:00 \_ grep --color=auto snmp
   # 29382 was from a lxd container in this case

[Regression Potential]

 * In case the pidfile stored by the snmpd daemon would be incorrect then
   the stop/restart actions would not match the process and fail to
   stop/restart. We didn't see that in our tests, but that is the
   regression I could think of.

[Other Info]

 * n/a

----

If you have ubuntu 16.04 containers running snmpd and you stop the snmpd on the LXD host system, it will also shut down all the snmpd instances on the containers (but will not restart them if you restart them on the LXD host) .. in fact, you even need to go back into the container, stop the snmpd before you can start it again.

See log below, viepovzat17 is the LXD host, viezmaaat10 is the ubuntu container:

  driver: lxc
  driver_version: 2.0.8
  kernel: Linux
  kernel_architecture: x86_64
  kernel_version: 4.4.0-87-generic
  server: lxd
  server_pid: 4426
  server_version: "2.16"
  storage: zfs
  storage_version: 0.6.5.6-0ubuntu16

root@viezmaaat10:~# /etc/init.d/snmpd start
[ ok ] Starting snmpd (via systemctl): snmpd.service.
root@viezmaaat10:~# ps -eaf | grep snmp
snmp 1271 1 0 10:50 ? 00:00:00 /usr/sbin/snmpd -Lsd -Lf /dev/null -u snmp -g snmp -I -smux mteTrigger mteTriggerConf -p /run/snmpd.pid
root 1291 757 0 10:50 ? 00:00:00 grep --color=auto snmp
root@viezmaaat10:~#
root@viezmaaat10:~# ps -eaf | grep snmp
snmp 1271 1 0 10:50 ? 00:00:00 /usr/sbin/snmpd -Lsd -Lf /dev/null -u snmp -g snmp -I -smux mteTrigger mteTriggerConf -p /run/snmpd.pid
root 1293 757 0 10:50 ? 00:00:00 grep --color=auto snmp
root@viezmaaat10:~# exit

root@viepovzat17:~# ps -eaf | grep snmp
snmp 22757 1 0 10:45 ? 00:00:00 /usr/sbin/snmpd -Lsd -Lf /dev/null -u snmp -g snmp -I -smux mteTrigger mteTriggerConf -p /run/snmpd.pid
100116 24118 5222 0 10:50 ? 00:00:00 /usr/sbin/snmpd -Lsd -Lf /dev/null -u snmp -g snmp -I -smux mteTrigger mteTriggerConf -p /run/snmpd.pid
root 24153 21842 0 10:50 pts/1 00:00:00 grep --color=auto snmp
root@viepovzat17:~# /etc/init.d/snmpd stop
[ ok ] Stopping snmpd (via systemctl): snmpd.service.
root@viepovzat17:~# ps -eaf | grep snmp
root 24286 21842 0 10:50 pts/1 00:00:00 grep --color=auto snmp
root@viepovzat17:~#

description: updated
tags: added: cpe-onsite
Revision history for this message
Stéphane Graber (stgraber) wrote :

This is a bug in the snmpd init script, moving bug over there.

affects: lxd (Ubuntu) → net-snmp (Ubuntu)
Ante Karamatić (ivoks)
Changed in net-snmp (Ubuntu):
assignee: nobody → Ante Karamatić (ivoks)
Revision history for this message
Arno van Huyssteen (avanhuys) wrote :

Has anyone had a chance to look into this bug in net-snmp yet?

Ante Karamatić (ivoks)
Changed in net-snmp (Ubuntu):
assignee: Ante Karamatić (ivoks) → Arno van Huyssteen (avanhuys)
importance: Undecided → Medium
status: New → Confirmed
Revision history for this message
Andreas Hasenack (ahasenack) wrote :

It's just using an executable match (--exec), instead of a pid:
  stop)
    log_daemon_msg "Stopping SNMP services:"
    start-stop-daemon --quiet --stop --oknodo --retry 3 --exec /usr/sbin/snmpd && [ ! -f $SNMP_PID ] || rm $SNMP_PID

That's the --exec bit.

Changed in net-snmp (Ubuntu):
status: Confirmed → Triaged
summary: - snmpd stop on host stops snmpd on LXD containers, but host-restart of
- snmpd doesnt restart lxd process
+ snmpd stop on host stops snmpd on LXD containers
tags: added: server-next
Revision history for this message
Christian Ehrhardt  (paelzer) wrote :

Note: since 5.7.3+dfsg-1.4 a native systemd file is used which uses type=simple and MAINPID tracking.
So >= Artful this is already fixed.

Setting the status to fixed, but adding a Xenial task as there the issue still exists.

Changed in net-snmp (Ubuntu Xenial):
status: New → Triaged
importance: Undecided → Medium
Changed in net-snmp (Ubuntu):
status: Triaged → Fix Released
Revision history for this message
Christian Ehrhardt  (paelzer) wrote :

The old Xenial init file does actually already PID tracking via "-p $SNMP_PID" of snmpd itself.
That makes sure that the real daemon pid is in there and not other spawned tasks.

It has some cleanup to remove said file after stop (even if the daemon itself did not).
But in general it seems we can use that file as-is to use --pidfile

Interestingly, the reload action already does --pidfile "$SNMP_PID" --exec /usr/sbin/snmpd as I'd suggest.

A quick test modifying the script on a system worked, I'll propose the change for an SRU.

Revision history for this message
Christian Ehrhardt  (paelzer) wrote :
Revision history for this message
Christian Ehrhardt  (paelzer) wrote :

- Ack on the MP (and some more tests)
- Updated the SRU template
- pushed the git ubuntu tag

Thereby uploaded the proposed change for the SRU team to evaluate.

description: updated
Revision history for this message
Chris J Arges (arges) wrote : Please test proposed package

Hello Arno, or anyone else affected,

Accepted net-snmp into xenial-proposed. The package will build now and be available at https://launchpad.net/ubuntu/+source/net-snmp/5.7.3+dfsg-1ubuntu4.1 in a few hours, and then in the -proposed repository.

Please help us by testing this new package. See https://wiki.ubuntu.com/Testing/EnableProposed for documentation on how to enable and use -proposed.Your feedback will aid us getting this update out to other Ubuntu users.

If this package fixes the bug for you, please add a comment to this bug, mentioning the version of the package you tested and change the tag from verification-needed-xenial to verification-done-xenial. If it does not fix the bug for you, please add a comment stating that, and change the tag to verification-failed-xenial. In either case, without details of your testing we will not be able to proceed.

Further information regarding the verification process can be found at https://wiki.ubuntu.com/QATeam/PerformingSRUVerification . Thank you in advance!

Changed in net-snmp (Ubuntu Xenial):
status: Triaged → Fix Committed
tags: added: verification-needed verification-needed-xenial
Revision history for this message
Christian Ehrhardt  (paelzer) wrote :
Download full text (3.3 KiB)

Xenial as-is
You can see on the +100000 UID the container is the first line:
$ ps axlf | grep snmpd | grep -v grep
5 100112 4553 2723 20 0 55228 5952 - S ? 0:00 \_ /usr/sbin/snmpd -Lsd -Lf /dev/null -u snmp -g snmp -I -smux mteTrigger mteTriggerConf -p /run/snmpd.pid
5 112 3398 1 20 0 63340 7672 - S ? 0:00 /usr/sbin/snmpd -Lsd -Lf /dev/null -u snmp -g snmp -I -smux mteTrigger mteTriggerConf -p /run/snmpd.pid

$ sudo systemctl restart snmpd
$ ps axlf | grep snmpd | grep -v grep
5 112 4656 1 20 0 63344 7820 - S ? 0:00 /usr/sbin/snmpd -Lsd -Lf /dev/null -u snmp -g snmp -I -smux mteTrigger mteTriggerConf -p /run/snmpd.pid

Containers view:
Feb 21 16:06:35 x1 systemd[1]: Started LSB: SNMP agents.
Feb 21 16:06:35 x1 snmpd[1208]: NET-SNMP version 5.7.3
Feb 21 16:08:03 x1 snmpd[1208]: Received TERM or STOP signal... shutting down...

Upgrade to snmpd from proposed:
$ sudo apt install snmpd
Reading package lists... Done
Building dependency tree
Reading state information... Done
The following additional packages will be installed:
  libsnmp30
Suggested packages:
  snmptrapd
The following packages will be upgraded:
  libsnmp30 snmpd
2 upgraded, 0 newly installed, 0 to remove and 39 not upgraded.
Need to get 868 kB of archives.
After this operation, 0 B of additional disk space will be used.
Do you want to continue? [Y/n] Y
Get:1 http://archive.ubuntu.com/ubuntu xenial-proposed/main amd64 snmpd amd64 5.7.3+dfsg-1ubuntu4.1 [57,2 kB]
Get:2 http://archive.ubuntu.com/ubuntu xenial-proposed/main amd64 libsnmp30 amd64 5.7.3+dfsg-1ubuntu4.1 [811 kB]
Fetched 868 kB in 0s (2.779 kB/s)
Preconfiguring packages ...
(Reading database ... 54148 files and directories currently installed.)
Preparing to unpack .../snmpd_5.7.3+dfsg-1ubuntu4.1_amd64.deb ...
Unpacking snmpd (5.7.3+dfsg-1ubuntu4.1) over (5.7.3+dfsg-1ubuntu4) ...
Preparing to unpack .../libsnmp30_5.7.3+dfsg-1ubuntu4.1_amd64.deb ...
Unpacking libsnmp30:amd64 (5.7.3+dfsg-1ubuntu4.1) over (5.7.3+dfsg-1ubuntu4) ...
Processing triggers for systemd (229-4ubuntu21.1) ...
Processing triggers for ureadahead (0.100.0-19) ...
Processing triggers for man-db (2.7.5-1) ...
Processing triggers for libc-bin (2.23-0ubuntu10) ...
Setting up libsnmp30:amd64 (5.7.3+dfsg-1ubuntu4.1) ...
Setting up snmpd (5.7.3+dfsg-1ubuntu4.1) ...

$ ps axlf | grep snmpd | grep -v grep
5 100112 5601 2723 20 0 55228 5668 - S ? 0:00 \_ /usr/sbin/snmpd -Lsd -Lf /dev/null -u snmp -g snmp -I -smux mteTrigger mteTriggerConf -p /run/snmpd.pid
5 112 5573 1 20 0 63344 7684 - S ? 0:00 /usr/sbin/snmpd -Lsd -Lf /dev/null -u snmp -g snmp -I -smux mteTrigger mteTriggerConf -p /run/snmpd.pid

Host:
$ sudo systemctl restart snmpd
Container still:
Active: active (running) since Wed 2018-02-21 16:10:39 UTC; 1min 3s ago

$ ps axlf | grep snmpd | grep -v grep
5 100112 5601 2723 20 0 55228 5668 - S ? 0:00 \_ /usr/sbin/snmpd -Lsd -Lf /dev/null -u snmp -g snmp -I -smux mteTrigger mteTriggerConf -p /run/snmpd.pid
5 112 5660 1 20 0 63340 7656 - S ? 0:00 /usr/sbi...

Read more...

tags: added: verification-done verification-done-xenial
removed: verification-needed verification-needed-xenial
Revision history for this message
Launchpad Janitor (janitor) wrote :

This bug was fixed in the package net-snmp - 5.7.3+dfsg-1ubuntu4.1

---------------
net-snmp (5.7.3+dfsg-1ubuntu4.1) xenial; urgency=medium

  * d/snmpd.init: also match start-stop-daemon against pidfile to avoid
    killing extra snmpd processes for example in container (LP: #1720109)

 -- Christian Ehrhardt <email address hidden> Tue, 20 Feb 2018 14:53:51 +0100

Changed in net-snmp (Ubuntu Xenial):
status: Fix Committed → Fix Released
Revision history for this message
Brian Murray (brian-murray) wrote : Update Released

The verification of the Stable Release Update for net-snmp has completed successfully and the package has now been released to -updates. Subsequently, the Ubuntu Stable Release Updates Team is being unsubscribed and will not receive messages about this bug report. In the event that you encounter a regression using the package from -updates please report a new bug using ubuntu-bug and tag the bug report regression-update so we can easily find any regressions.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.