check-grenade-dsvm marked as FAILED - n-api/g-api Logs have errors

Bug #1259907 reported by Davanum Srinivas (DIMS)
24
This bug affects 4 people
Affects Status Importance Assigned to Milestone
Glance
Invalid
Undecided
Unassigned
OpenStack Compute (nova)
Invalid
Undecided
Unassigned
OpenStack Core Infrastructure
Invalid
Undecided
Unassigned
tempest
Fix Released
High
Sean Dague

Bug Description

Example:
http://logs.openstack.org/81/61281/1/check/check-grenade-dsvm/f42b658/console.html

2013-12-11 00:14:33.892 | Log File: g-api
2013-12-11 00:14:33.893 | 2013-12-11 00:04:32.459 9398 ERROR glance.api.v1.upload_utils [a954dd77-c926-4ef5-916c-0589e852bb1b 4c3bf2863784478e8fc3dec275a7bdef 4af50376a7f44390b0d5790b0f3aa1f1] Received HTTP error while uploading image 88236e20-ced9-4868-b9bb-570d97edc446
2013-12-11 00:14:33.893 |
2013-12-11 00:14:33.893 | 2013-12-11 00:04:32.472 9398 ERROR glance.api.v1.upload_utils [a954dd77-c926-4ef5-916c-0589e852bb1b 4c3bf2863784478e8fc3dec275a7bdef 4af50376a7f44390b0d5790b0f3aa1f1] Unable to kill image 88236e20-ced9-4868-b9bb-570d97edc446:
2013-12-11 00:14:33.893 |
2013-12-11 00:14:34.044 | Log File: n-api
2013-12-11 00:14:34.044 | 2013-12-11 00:04:32.475 ERROR nova.image.s3 [req-64000d25-93ec-43d8-817b-f62ec9a17a16 demo demo] Failed to upload testbucket/bundle.img.manifest.xml to /tmp/tmpsS2EHo
2013-12-11 00:14:34.045 |
2013-12-11 00:14:35.542 | Logs have errors
2013-12-11 00:14:35.542 | FAILED

Tags: gate-failure
Revision history for this message
Abhishek Chanda (abhishek-i) wrote :

This might actually be a glance issue. Marked as critical since it is blocking gates.

Changed in nova:
status: New → Triaged
importance: Undecided → Critical
Revision history for this message
Joe Gordon (jogo) wrote :

If you think it might be a glance issue and not a nova issue, then this bug isn't triaged for nova and shouldn't be marked as critical yet.

Changed in nova:
status: Triaged → New
importance: Critical → Undecided
Revision history for this message
Matt Riedemann (mriedem) wrote :

These errors are also in the glance API logs:

2013-12-11 16:28:25.864 10055 ERROR glance.api.v1.upload_utils [ba4c79c1-ea44-4f93-9e77-6ad3e77a97c4 c528309692694ed69e2a65f623c5b220 657967d64d7a44689a3093e6838a6fa4] Received HTTP error while uploading image 4681af21-56d3-4abf-8d54-f08de29dbc5f
2013-12-11 16:28:25.881 10055 ERROR glance.api.v1.upload_utils [ba4c79c1-ea44-4f93-9e77-6ad3e77a97c4 c528309692694ed69e2a65f623c5b220 657967d64d7a44689a3093e6838a6fa4] Unable to kill image 4681af21-56d3-4abf-8d54-f08de29dbc5f:

Revision history for this message
Matt Riedemann (mriedem) wrote :
summary: - check-grenade-dsvm marked as FAILED - "Failed to upload
- testbucket/bundle.img.manifest.xml" in n-api
+ check-grenade-dsvm marked as FAILED - n-api/g-api Logs have errors
Revision history for this message
Jeremy Stanley (fungi) wrote :

Also:

Image cache contained image file for image '303d6d16-e2ee-4b71-8900-61820e087714', however the registry did not contain metadata for that image!

http://logs.openstack.org/19/61419/1/check/check-tempest-dsvm-full/c9cc87f/logs/screen-g-api.txt.gz?level=ERROR

Revision history for this message
Mark Washenberger (markwash) wrote :

What appears to be happening is that the image is being deleted before the PUT of data has finished. But I can't find any recent glance behavior changes related to that. Is it possible that the PUT is just taking longer than before? Is it possible that what is driving the tests for this has changed slightly?

Revision history for this message
Mark Washenberger (markwash) wrote :

Related to https://bugs.launchpad.net/glance/+bug/1236868 but that bug has been active for a very long time so again there might be other factors in place causing this bug to surface in our tests.

Revision history for this message
Mark Washenberger (markwash) wrote :

Actually that bug above appears like it may be the result for admin users. For regular users, it seems like this current behavior has been in place at least since grizzly. So one other thing that might have changed here was the presence or absence of the admin role on the test account.

Changed in glance:
status: New → Incomplete
Revision history for this message
Mark Washenberger (markwash) wrote :

Marking as incomplete just because I need a little more info here about why this just started to break in order to figure out what should be fixed. AFAICT this is *normal* glance behavior, so maybe we just need to clean up the log levels?

Revision history for this message
Davanum Srinivas (DIMS) (dims-v) wrote :

@markwash - this is a gate bug, all the logs are in available as well as the code for the test cases...What else is needed? Failure rates are hovering between 30-45%, I don't think we should be marking this as "Incomplete" IMHO.

James E. Blair (corvus)
tags: added: gate-failure
Revision history for this message
Matt Riedemann (mriedem) wrote :
Revision history for this message
Matt Riedemann (mriedem) wrote :

Hmm, "however the registry did not contain metadata for that image" for g-api logs is in the whitelist here:

http://git.openstack.org/cgit/openstack/tempest/tree/etc/whitelist.yaml#n50

So that shouldn't be causing failures.

Changed in nova:
status: New → Triaged
Revision history for this message
Matt Riedemann (mriedem) wrote :

I don't know that I'd consider this 'triaged' necessarily, that implies that we know what the root cause is.

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Related fix proposed to elastic-recheck (master)

Related fix proposed to branch: master
Review: https://review.openstack.org/61826

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Related fix merged to elastic-recheck (master)

Reviewed: https://review.openstack.org/61826
Committed: https://git.openstack.org/cgit/openstack-infra/elastic-recheck/commit/?id=7790fabe2e99cc2d41cfdb147a18f10c4dd9aa4f
Submitter: Jenkins
Branch: master

commit 7790fabe2e99cc2d41cfdb147a18f10c4dd9aa4f
Author: Matt Riedemann <email address hidden>
Date: Thu Dec 12 09:14:05 2013 -0800

    Add e-r query for bug 1259907

    There are errors in the screen-g-api.txt log but they are whitelisted
    errors already so we can't query on them since they would result in
    false negatives. See the bug report for more details.

    Related-Bug: #1259907

    Change-Id: I8553358e7316d466cbb88e1a3809dc17b5baf53f

Revision history for this message
Sean Dague (sdague) wrote :

Part of the problem here is that the log fail detector in tempest is actually breaking the grenade runs when it probably shouldn't. Grenade was not a primary use case for that script.

no longer affects: grenade
Changed in tempest:
status: New → Confirmed
importance: Undecided → High
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to tempest (master)

Reviewed: https://review.openstack.org/62107
Committed: https://git.openstack.org/cgit/openstack/tempest/commit/?id=1159e52a2c9c51976bc3be5ad504c88fb94c2fe1
Submitter: Jenkins
Branch: master

commit 1159e52a2c9c51976bc3be5ad504c88fb94c2fe1
Author: Sean Dague <email address hidden>
Date: Fri Dec 13 18:46:21 2013 -0500

    don't fail on dirty logs with grenade

    because grenade is upgrading from old to new we might actually
    expect the logs to be dirtier than in upstream tempest. The grenade
    logs weren't scrubbed in the same ways during the development here
    as the tempest regular runs.

    Change-Id: Id1bcc2cc85e73a414d382756a65ea1d80dc10b00
    Closes-Bug: #1259907

Changed in tempest:
status: Confirmed → Fix Released
Sean Dague (sdague)
Changed in tempest:
assignee: nobody → Sean Dague (sdague)
Revision history for this message
Jeremy Stanley (fungi) wrote :

Seems to have been fixed in grenade.

Changed in openstack-ci:
status: New → Invalid
Revision history for this message
Joe Gordon (jogo) wrote :

this bug is fairly old, if it comes up again we should open a new bug.

Changed in nova:
status: Triaged → Invalid
Revision history for this message
Erno Kuvaja (jokke) wrote :

I do agree with Joe. Marking as Invalid for glance as well.

Changed in glance:
status: Incomplete → Invalid
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Duplicates of this bug

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.