baremetal bond-with-vlans in pike is failing

Bug #1722596 reported by wes hayutin
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
tripleo
Invalid
High
wes hayutin
Tags: quickstart
Revision history for this message
Alex Schultz (alex-schultz) wrote :

Looks infrastructure related

fatal: [host15.beaker.tripleo.lab.eng.rdu2.redhat.com]: FAILED! => {
    "changed": true,
    "cmd": [
        "curl",
        "-sfL",
        "http://rhos-release.virt.bos.redhat.com/ci-images/centos-org-image-cache/pike/rdo_trunk//undercloud.qcow2.md5"
    ],
    "delta": "0:00:00.053607",
    "end": "2017-10-10 11:55:59.143923",
    "failed": true,
    "invocation": {
        "module_args": {
            "_raw_params": "curl -sfL http://rhos-release.virt.bos.redhat.com/ci-images/centos-org-image-cache/pike/rdo_trunk//undercloud.qcow2.md5",
            "_uses_shell": false,
            "chdir": null,
            "creates": null,
            "executable": null,
            "removes": null,
            "warn": true
        },
        "module_name": "command"
    },
    "rc": 22,
    "start": "2017-10-10 11:55:59.090316",
    "stderr": "",
    "stdout": "",
    "stdout_lines": [],
    "warnings": [
        "Consider using get_url or uri module rather than running curl"
    ]
}

Revision history for this message
Emilien Macchi (emilienm) wrote :

Please improve the bug report, change the title and add some description.

Your servant triage bot

Revision history for this message
wes hayutin (weshayutin) wrote :

full baremetal deployments have been failing, we have no passes on bond-with-vlans since August 15 2017. This bug is to track getting this environment back into a stable state.

Revision history for this message
wes hayutin (weshayutin) wrote :

Current issue is that the "hash" variable is empty.

15:53:14 ++ get_delorean_hash_from_url
15:53:14 ++ uri=
15:53:14 ++ echo
15:53:14 ++ awk -F / '{ print $7 }'
15:53:14 + hash=
15:53:14 + cached_image=http://internal-server/ci-images/centos-org-image-cache/pike/rdo_trunk//undercloud.qcow2

wes hayutin (weshayutin)
Changed in tripleo:
assignee: wes hayutin (weshayutin) → Matthew Young (myoung)
Revision history for this message
Matt Young (halcyondude) wrote :

I suspect what's happening here is that the job is being launched such that current_build isn't being set.

There's a few ways it can be set, and a way to manually launch the job such that it's not set (bad). Just running the individual job will not set the parameter, which will cause the error being observed above.

There is an existing item [1] in the CI team's backlog that IMHO we should just address to make it clear when this is occurring.

<note>
The *only* reason that the jobs are set up using these property files (vs. other mechanism to pass parameters) is that at the time these pipelines were created, "being like RDO upstream" was a critical concern. RDO Phase 1 is still structured in a similar way, with property files being specified in the wrapping multijobs. This is less of an acute issue as most of those are set up to pull from a symlink, so whenever the jobs are simply run they always pull the current symlink - at the cost of not being reproducible once the symlinks have been updated.
</note>

Ways to launch the existing jobs such that current_build is defined:

1. launch wrapping multijob, which launch these jobs with a property file, containing "current_build=$urlHash", which is injected as a jenkins 'parameter' (an environment variable)

2. Using "Retry" on a failed build, which leverages the naginator plugin to re-use the same parameters which include "current_build" - and in the case of sbtest jobs contain additional parameters.

3. Using the "Rebuild" option (again on a specific numbered build job), which will present a UI page pre-populated with the previous parameters and their values, which could be changed or kept the same

4. using curl in the form "http://server/job/myjob/buildWithParameters?PARAMETER=Value"

[1] https://trello.com/c/rtFYw1dW/28-rdophase2ospphase0-make-jobs-fail-early-and-clearly-if-currentbuild-is-not-defined

Revision history for this message
Matt Young (halcyondude) wrote :
Changed in tripleo:
assignee: Matthew Young (myoung) → nobody
assignee: nobody → wes hayutin (weshayutin)
Revision history for this message
wes hayutin (weshayutin) wrote :

bah.. thanks Matt!

Changed in tripleo:
milestone: queens-1 → queens-2
Changed in tripleo:
milestone: queens-2 → queens-3
Changed in tripleo:
milestone: queens-3 → queens-rc1
Changed in tripleo:
milestone: queens-rc1 → rocky-1
Changed in tripleo:
milestone: rocky-1 → rocky-2
Matt Young (halcyondude)
Changed in tripleo:
status: Triaged → Invalid
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.