Checksum error\n Volume group for uuid not found; libdevmapper exiting with 2 device(s) still suspended

Bug #1359428 reported by Nathan Kinder
16
This bug affects 3 people
Affects Status Importance Assigned to Milestone
Cinder
Invalid
High
Unassigned

Bug Description

I'm encountering a number of failed volume related tempest tests in the gate-tempest-dsvm-full job. The errors all look similar to this:

2014-08-20 07:09:32.347 | setUpClass (tempest.api.volume.admin.test_snapshots_actions.SnapshotsActionsTestXML)
2014-08-20 07:09:32.347 | ------------------------------------------------------------------------------------
2014-08-20 07:09:32.347 |
2014-08-20 07:09:32.347 | Captured traceback:
2014-08-20 07:09:32.347 | ~~~~~~~~~~~~~~~~~~~
2014-08-20 07:09:32.347 | Traceback (most recent call last):
2014-08-20 07:09:32.347 | File "tempest/test.py", line 76, in decorator
2014-08-20 07:09:32.347 | f(cls)
2014-08-20 07:09:32.348 | File "tempest/api/volume/admin/test_snapshots_actions.py", line 46, in setUpClass
2014-08-20 07:09:32.348 | 'available')
2014-08-20 07:09:32.348 | File "tempest/services/volume/xml/snapshots_client.py", line 138, in wait_for_snapshot_status
2014-08-20 07:09:32.348 | raise exceptions.TimeoutException(message)
2014-08-20 07:09:32.348 | TimeoutException: Request timed out
2014-08-20 07:09:32.348 | Details: Request timed out
2014-08-20 07:09:32.348 | Details: Time Limit Exceeded! (196s)while waiting for available, but we got creating.

The full log is available here:

  http://logs.openstack.org/08/104408/5/gate/gate-tempest-dsvm-full/f3081d0/console.html

There are some cinder errors seem in the c-vol log:

2014-08-20 06:24:51.343 26531 ERROR cinder.brick.local_dev.lvm [req-374faa66-0448-4db0-912a-da1682aedb18 d0db1e0cb50a44f8b0ce9797b8289862 32bee14e421845b1bf68ce61cf092a4c - - -] Error creating snapshot
2014-08-20 06:24:51.349 26531 ERROR cinder.brick.local_dev.lvm [req-374faa66-0448-4db0-912a-da1682aedb18 d0db1e0cb50a44f8b0ce9797b8289862 32bee14e421845b1bf68ce61cf092a4c - - -] Cmd :sudo cinder-rootwrap /etc/cinder/rootwrap.conf lvcreate --name _snapshot-da519713-3038-4f8a-ac3a-5446d63711fd --snapshot stack-volumes-lvmdriver-1/volume-ebe39b05-7768-4a7f-872f-134e888eabd8 -L 1.00g
2014-08-20 06:24:51.349 26531 ERROR cinder.brick.local_dev.lvm [req-374faa66-0448-4db0-912a-da1682aedb18 d0db1e0cb50a44f8b0ce9797b8289862 32bee14e421845b1bf68ce61cf092a4c - - -] StdOut :
2014-08-20 06:24:51.350 26531 ERROR cinder.brick.local_dev.lvm [req-374faa66-0448-4db0-912a-da1682aedb18 d0db1e0cb50a44f8b0ce9797b8289862 32bee14e421845b1bf68ce61cf092a4c - - -] StdErr : /dev/loop1: Checksum error
  Volume group for uuid not found: eBM7ZaAMcl3tfLoPn79OvlT02cAd30LxD6bozjscwcMF6RX3dGkLiayOO52BkABV
  Problem reactivating origin volume-ebe39b05-7768-4a7f-872f-134e888eabd8
  libdevmapper exiting with 2 device(s) still suspended.

See the full c-vol log for more details:

    http://logs.openstack.org/08/104408/5/gate/gate-tempest-dsvm-full/f3081d0/logs/screen-c-vol.txt.gz?level=ERROR

Revision history for this message
John Griffith (john-griffith) wrote :

this is odd, seems to be happening exactly 5x a day every day since the 17'th. Query in Kibana: "libdevmapper exiting with 2 device(s) still suspended"

Changed in cinder:
status: New → Confirmed
Revision history for this message
John Griffith (john-griffith) wrote :

Not sure, but I think this change in Tempest:
https://github.com/openstack/tempest/commit/d9df38c867d6447dc41ad3fa7bc8b3d732e751b0

May have introduced an issue where we don't reliably get the current status prior to the delete operation.

Revision history for this message
Matt Riedemann (mriedem) wrote :
Revision history for this message
Matt Riedemann (mriedem) wrote :
Changed in cinder:
importance: Undecided → High
summary: - tempest volume tests fail with timeouts
+ Checksum error\n Volume group for uuid not found
summary: - Checksum error\n Volume group for uuid not found
+ Checksum error\n Volume group for uuid not found; libdevmapper exiting
+ with 2 device(s) still suspended
Revision history for this message
Jordan Pittier (jordan-pittier) wrote :

...84 failures in the past 7 days....

Revision history for this message
Jordan Pittier (jordan-pittier) wrote :
Revision history for this message
Ivan Kolodyazhny (e0ne) wrote :

This issue also affects gate-rally-dsvm-cinder job

Revision history for this message
Sean McGinnis (sean-mcginnis) wrote :

Appears to have since been fixed indirectly.

Changed in cinder:
status: Confirmed → Invalid
Revision history for this message
Mike Perez (thingee) wrote :

Appears to still be happening.

Changed in cinder:
status: Invalid → Confirmed
Revision history for this message
Matt Riedemann (mriedem) wrote :

This is gone again.

Changed in cinder:
status: Confirmed → Invalid
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.