contrail-vrouter-nodemgr fails with Exited status after deleting tor-agent
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Juniper Openstack | Status tracked in Trunk | | |
R2.20 | Fix Committed | Medium | Nikhil Bansal |
R2.21.x | Fix Committed | Medium | Nikhil Bansal |
R2.22.x | Fix Committed | Medium | Nikhil Bansal |
R3.0 | Fix Committed | Medium | Nikhil Bansal |
Trunk | Fix Committed | Medium | Nikhil Bansal |
Bug Description
Contrail-
The customer tested adding and deleting tor-agents by running add_tor_
After the deletion of a tor-agent, they found that contrail-
on a TSN had failed with "EXITED" status.
-------
root@openc-14:~# contrail-status
== Contrail vRouter ==
supervisor-vrouter: active
contrail-
~snip~
contrail-
contrail-
contrail-
contrail-
contrail-
contrail-
contrail-
contrail-
-------
Following traceback was logged on contrail-
-------
process:
wokeup and found a line
process:
Sending UVE:NodeStatusU
File "/usr/lib/
result = self._run(
File "/usr/lib/
self.
File "/usr/lib/
process_
File "/usr/lib/
process_
File "/usr/lib/
return VrouterProcessS
File "/usr/lib/
(self.group, self.name) = self.get_
File "/usr/lib/
for line in open(filename)))
IOError: [Errno 2] No such file or directory: '/etc/contrail/
<Greenlet at 0x7f3a335e0730: <bound method VrouterEventMan
-------
The customer thinks the cause of this issue is a missing check or missing error handling in vrouter_
It periodically opens the .ini files of the tor-agents to check the process status. Depending on timing, however, a file may already have been deleted by the fabric task, in which case the open raises an IOError.
Restarting the supervisor-vrouter service is necessary to recover the node status. Issue timestamp: Mar 25 11:30:00
Restart only contrail-
Logs are uploaded in the attachments
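
The race described above (an .ini file is listed, then deleted before it is opened) can be tolerated by catching the "No such file" error at the point of the open. The following is only a minimal sketch of that idea, not the committed fix: the function names `read_ini_lines`, `parse_program_name` and `collect_process_names` are hypothetical, and the `[program:NAME]` section header is assumed to follow the usual supervisor configuration layout.

```python
import errno


def read_ini_lines(filename):
    """Return the non-empty lines of an ini file, or None if it no longer exists."""
    try:
        with open(filename) as ini_file:
            return [line.strip() for line in ini_file if line.strip()]
    except IOError as exc:  # on Python 3, IOError is an alias of OSError
        if exc.errno == errno.ENOENT:
            # The file was removed between discovery and open, for example
            # by a concurrent delete task. Report it as gone, do not crash.
            return None
        raise  # any other I/O problem is still an error


def parse_program_name(lines):
    """Hypothetical helper: extract NAME from a '[program:NAME]' header."""
    for line in lines:
        if line.startswith("[program:") and line.endswith("]"):
            return line[len("[program:"):-1]
    return None


def collect_process_names(ini_paths):
    """Collect program names, skipping ini files that have vanished."""
    names = []
    for path in ini_paths:
        lines = read_ini_lines(path)
        if lines is None:
            continue  # deleted in the meantime; nothing to monitor
        name = parse_program_name(lines)
        if name is not None:
            names.append(name)
    return names
```

With this shape, a deleted tor-agent file simply drops out of the monitored set on the next poll, while unrelated I/O errors (permissions, for instance) still propagate.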
Steps to Reproduce
1. Execute the following script on the build server.
-------
#!/bin/bash
cd /opt/contrail/utils
while [ 1 ]; do
fab add_tor_
fab delete_
done
-------
2. Then watch contrail-
If the problem occurs, the same traceback message appears in it.
In the customer's test environment, it usually happens within 3 hours.
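
Since the reproduction can take hours, scanning the log for the error line may be easier than watching it by hand. This is a rough sketch only: the log path is not given in full in this report, so it is passed in as a parameter, and the function name `find_marker_lines` is hypothetical. The marker string is the beginning of the IOError line from the traceback above.

```python
MARKER = "IOError: [Errno 2] No such file or directory"


def find_marker_lines(log_path, marker=MARKER):
    """Return (line_number, text) pairs for log lines containing the marker."""
    hits = []
    # errors="replace" keeps the scan going if the log has undecodable bytes.
    with open(log_path, errors="replace") as log_file:
        for number, line in enumerate(log_file, start=1):
            if marker in line:
                hits.append((number, line.rstrip("\n")))
    return hits
```

Calling it periodically on the node manager log while the add/delete loop runs shows whether, and at which line, the traceback has appeared.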
Changed in juniperopenstack:
assignee: nobody → Raj Reddy (rajreddy)
tags: added: analytics
Changed in juniperopenstack:
importance: Undecided → Medium
assignee: Raj Reddy (rajreddy) → Nikhil Bansal (nikhilb-u)
information type: Proprietary → Public
Review in progress for https://review.opencontrail.org/19577
Submitter: Nikhil Bansal (<email address hidden>)