cyborg agent failed to start privsep daemon
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
kolla-ansible |
Fix Released
|
High
|
Radosław Piliszek | ||
Train |
Fix Released
|
High
|
Unassigned | ||
Ussuri |
Fix Released
|
High
|
Radosław Piliszek |
Bug Description
What happened:
The cyborg agent failed to scan PCI devices by using 'lspci', because the privesp daemon failed to start.
The log is bellow:
2020-03-25 17:06:00.555 6 DEBUG oslo_service.
2020-03-25 17:06:00.556 6 DEBUG oslo_concurrenc
2020-03-25 17:06:00.557 6 INFO oslo.privsep.daemon [-] Running privsep helper: ['sudo', 'privsep-helper', '--config-file', '/etc/cyborg/
2020-03-25 17:06:01.247 6 WARNING oslo.privsep.daemon [-] privsep log: [Errno 1] Operation not permitted2020-03-25 17:06:01.285 6 INFO oslo.privsep.daemon [-] Spawned new privsep daemon via rootwrap
2020-03-25 17:06:01.285 6 DEBUG oslo.privsep.daemon [-] Accepted privsep connection to /tmp/tmpLXVeAf/
2020-03-25 17:06:01.243 2631 INFO oslo.privsep.daemon [-] privsep process running with uid/gid: 0/0
2020-03-25 17:06:01.246 2631 ERROR oslo.privsep.daemon [-] [Errno 1] Operation not permitted
Traceback (most recent call last):
File "/var/lib/
Daemon(channel, context).run()
File "/var/lib/
self.
File "/var/lib/
capabilitie
File "/var/lib/
raise OSError(errno, os.strerror(errno))
OSError: [Errno 1] Operation not permitted
2020-03-25 17:06:01.296 6 DEBUG oslo_privsep.comm [-] EOF on privsep read channel _reader_main /var/lib/
2020-03-25 17:06:01.297 6 ERROR oslo.privsep.daemon [-] Error while sending initial PING to privsep: Premature eof waiting for privileged process: IOError: Premature eof waiting for privileged process
2020-03-25 17:06:01.297 6 ERROR oslo.privsep.daemon Traceback (most recent call last):
2020-03-25 17:06:01.297 6 ERROR oslo.privsep.daemon File "/var/lib/
2020-03-25 17:06:01.297 6 ERROR oslo.privsep.daemon reply = self.send_
2020-03-25 17:06:01.297 6 ERROR oslo.privsep.daemon File "/var/lib/
2020-03-25 17:06:01.297 6 ERROR oslo.privsep.daemon reply = future.result()
2020-03-25 17:06:01.297 6 ERROR oslo.privsep.daemon File "/var/lib/
2020-03-25 17:06:01.297 6 ERROR oslo.privsep.daemon raise self.error
2020-03-25 17:06:01.297 6 ERROR oslo.privsep.daemon IOError: Premature eof waiting for privileged process
2020-03-25 17:06:01.297 6 ERROR oslo.privsep.daemon
2020-03-25 17:06:01.297 6 CRITICAL oslo.privsep.daemon [-] Privsep daemon failed to start: IOError: Premature eof waiting for privileged process
2020-03-25 17:06:01.298 6 DEBUG oslo_concurrenc
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
2020-03-25 17:06:01.298 6 ERROR oslo_service.
What you expected to happen:
The cyborg agent can scan the PCI devices successfully.
How to reproduct it:
1. Config the cyborg agent's driver:
[agent]
enabled_drivers = nvidia_gpu_driver
2. Restart cyborg agent container.
**Environment**:
OS: CentOS7.7.1908
Kernel: 3.10.0-
Docker version: 18.09.9
Kolla-Ansible version: 9.0.1
Docker image install type: source
Docker image distribution: CentOS
Are you using official images from Docker Hub or self built: Docker Hub
Changed in kolla-ansible: | |
importance: | Undecided → High |
summary: |
- cyborg agent failed to start privesp daemon + cyborg agent failed to start privsep daemon |
tags: | added: cyborg cyborg-agent privsep |
Changed in kolla-ansible: | |
assignee: | ya.wang (ya.wang) → Radosław Piliszek (yoctozepto) |
Fix proposed to branch: master /review. opendev. org/721139
Review: https:/