python-tripleoclient exits with status code 0 when overcloud deploy fails

Bug #1674982 reported by Sagi (Sergey) Shnaidman
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
tripleo
Fix Released
High
Adriano Petrich

Bug Description

Python-tripleoclient exits with code 0 when overcloud deploy fails.
It's better to use error status code in these cases.

Changed in tripleo:
status: New → Triaged
importance: Undecided → Medium
milestone: none → pike-2
Revision history for this message
Alex Schultz (alex-schultz) wrote :

Raising to high because this makes it very hard for automation to work right

Changed in tripleo:
importance: Medium → High
Revision history for this message
Ben Nemec (bnemec) wrote :

This is definitely not always the case, for example: http://logs.openstack.org/60/448460/2/check-tripleo/gate-tripleo-ci-centos-7-ovb-nonha/c877061/console.html#_2017-03-22_18_54_15_747619

Do you have a link to a job where it failed and returned 0?

Revision history for this message
Alex Schultz (alex-schultz) wrote :

I think this spawned from Bug 1674955

Revision history for this message
Sagi (Sergey) Shnaidman (sshnaidm) wrote :
Changed in tripleo:
assignee: nobody → Adriano Petrich (apetrich)
Revision history for this message
Adriano Petrich (apetrich) wrote :

It looks like related or duplicate of https://bugs.launchpad.net/tripleo/+bug/1675709

Revision history for this message
Michele Baldessari (michele) wrote :

Adriano I am not 100% sure they are duplicate, although they *might* be somehow related.

I mean in this bug here we see that the deployment failed http://logs.openstack.org/79/445479/3/check/gate-tripleo-ci-centos-7-nonha-multinode-oooq/2da08a1/logs/undercloud/home/jenkins/overcloud_deploy.log.txt.gz#_2017-03-22_09_38_06 but oooq got a return code == 0.

bug 1675709 actually shows that the deployment succeeded and it seems the output of the deploy command is confirming that with all the steps turning to COMPLETE, but we have an issue in the generation of the overcloudrc.

It might turn out that both problems have the same root cause, but until then I would think it is best to keep them separate.

Revision history for this message
Adriano Petrich (apetrich) wrote :

The source for this problem is still eluding us.

Some experiments show that some RuntimeErrors might be silently catching somewhere and not propagating but we are not able to reproduce it consistently. We are moving our exceptions that are subclassed as RuntimeErrors into pure Exceptions. Although we are not 100% sure it will solve the problem.

Changed in tripleo:
milestone: pike-2 → pike-3
Changed in tripleo:
status: Triaged → Fix Released
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.