libvirt: blockcommit fails - disk not ready for pivot yet

Bug #1681839 reported by Patrick Best on 2017-04-11
This bug affects 6 people
Affects            Importance  Assigned to
libvirt (Ubuntu)   Medium      Unassigned
Xenial             Medium      Matthew Ruffell
Artful             Undecided   Unassigned
Bionic             Medium      Unassigned

Bug Description

[Impact]

On xenial, if you manually invoke blockcommit through virsh, the command immediately fails: it reports the blockcommit as supposedly 100% complete, yet says the disk is not ready for pivot:

root@xenial-apparmor:~# virsh blockcommit snapvm vda --active --verbose --pivot --wait
Block commit: [100 %]
error: failed to pivot job for disk vda
error: block copy still active: disk 'vda' not ready for pivot yet

However, if you look at the status of the active blockjob, you can see that the blockcommit is still active in the background:

root@xenial-apparmor:~# virsh blockjob snapvm vda --info
Active Block Commit: [0 %]

root@xenial-apparmor:~# virsh blockjob snapvm vda --info
Active Block Commit: [2 %]

root@xenial-apparmor:~# virsh blockjob snapvm vda --info
Active Block Commit: [6 %]

The job progresses until it reaches 100%, where it gets stuck. To un-stick things, you must then manually --abort the blockjob.

root@xenial-apparmor:~# virsh blockjob snapvm vda --info
Active Block Commit: [100 %]

This happens in VMs which are experiencing load, and is caused by a race condition in libvirt. Users are not able to commit their snapshots to disk.
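Until fixed packages are installed, the only way out is the manual --abort described above. A rough sketch of automating that recovery follows; the function names are mine and not part of any libvirt tooling, and the polling interval is arbitrary:

```shell
# Sketch of a manual workaround: poll the blockjob and abort it once it
# reports 100 %, since the stuck job never finishes on its own.

# parse_percent: extract the numeric progress from a virsh blockjob line,
# e.g. "Active Block Commit: [42 %]" -> "42".
parse_percent() {
    printf '%s\n' "$1" | sed -n 's/.*\[ *\([0-9][0-9]*\) *%\].*/\1/p'
}

# abort_when_done: poll until the job shows 100 %, then abort it so the
# guest is no longer left with a hung block copy.
abort_when_done() {
    domain=$1 disk=$2
    while :; do
        pct=$(parse_percent "$(virsh blockjob "$domain" "$disk" --info)")
        [ "$pct" = "100" ] && break
        sleep 5
    done
    virsh blockjob "$domain" "$disk" --abort
}
```

Note that aborting an active blockcommit leaves the guest on the overlay image, so the commit has to be retried afterwards.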

[Test Case]

Credit goes to Fabio Martins, who determined how to reproduce this issue.

On an Ubuntu 16.04 host with libvirt 1.3.1-1ubuntu10.27:

1) Create a VG and define an LVM pool:

root@xenial-apparmor:~# cat lvmpool.xml
<pool type="logical">
<name>LVMpool_vg</name>
<source>
<device path="/dev/sdb"/>
</source>
<target>
<path>/dev/LVMpool_vg</path>
</target>
</pool>

# virsh pool-define lvmpool.xml
# virsh pool-start LVMpool_vg
# virsh pool-autostart LVMpool_vg

2) Create a config file to use as a cdrom device with the new VM (to be created in the next steps), just to inject a password with cloud-init:

# cat > config <<EOF
> #cloud-config
> password: passw0rd
> chpasswd: { expire: False }
> ssh_pwauth: True
> EOF

# apt install cloud-image-utils

# cloud-localds config.img config

# mv config.img /var/lib/libvirt/images/
# chown libvirt-qemu:kvm /var/lib/libvirt/images/config.img
# chmod 664 /var/lib/libvirt/images/config.img

3) Create one VM using this pool:

# virt-install --connect=qemu:///system --name snapvm --ram 2048 --vcpus=1 --os-type=linux --disk pool=LVMpool_vg,size=15,bus=virtio --disk /var/lib/libvirt/images/config.img,device=cdrom --network network=kvm-br0 --graphics none --import --noautoconsole

4) Stop the VM

# virsh destroy snapvm

5) Download an Ubuntu cloud image, convert it to raw, and restore it into the LV used as a disk by our VM:

# wget https://cloud-images.ubuntu.com/bionic/current/bionic-server-cloudimg-amd64.img
# qemu-img convert ./bionic-server-cloudimg-amd64.img ./bionic-server-cloudimg-amd64.raw
# dd if=./bionic-server-cloudimg-amd64.raw of=/dev/LVMpool_vg/snapvm bs=8M conv=sparse

6) Start the VM and connect to it in another window

# virsh start snapvm

7) Check that the VM is using the LV as the disk:

root@xenial-apparmor:~# virsh domblklist snapvm
Target Source
------------------------------------------------
vda /dev/LVMpool_vg/snapvm
hda /var/lib/libvirt/images/config.img

8) Create a snapshot and check that the new domblklist points to the snapshot file:

# virsh snapshot-create-as --domain snapvm --diskspec vda,file=/var/lib/libvirt/images/xenial-snapvm.qcow2,snapshot=external --disk-only --atomic

root@xenial-apparmor:~# virsh domblklist snapvm
Target Source
------------------------------------------------
vda /var/lib/libvirt/images/xenial-snapvm.qcow2
hda /var/lib/libvirt/images/config.img

9) Connect to your VM and start an I/O intensive job. In this case I'm starting a 'dd' writing zeroes to a file until it gets to 10GBs:

ubuntu@ubuntu:~$ dd if=/dev/zero of=file.txt count=1024 bs=10240000

10) Back on the host, monitor the snapshot file and let it grow until it is at least a bit more than 1GB, as in the example below (where the file has reached 3.9G):

root@xenial-apparmor:~# ls -lh /var/lib/libvirt/images/
total 5.2G
-rw-rw-r-- 1 libvirt-qemu kvm 329M Sep 3 03:18 bionic-server-cloudimg-amd64.img
-rw-r--r-- 1 root root 10G Sep 3 03:28 bionic-server-cloudimg-amd64.raw
-rw-rw-r-- 1 libvirt-qemu kvm 366K Sep 3 03:19 config.img
-rw------- 1 libvirt-qemu kvm 3.9G Sep 3 04:41 xenial-snapvm.qcow2

11) Start a blockcommit job with --active --verbose --pivot --wait; you will hit the error when the job gets to 100%:

root@xenial-apparmor:~# virsh blockcommit snapvm vda --active --verbose --pivot --wait
Block commit: [100 %]
error: failed to pivot job for disk vda
error: block copy still active: disk 'vda' not ready for pivot yet

12) The blockjob will continue in the background, and its status increments:

root@xenial-apparmor:~# virsh blockjob snapvm vda --info
Active Block Commit: [0 %]

root@xenial-apparmor:~# virsh blockjob snapvm vda --info
Active Block Commit: [2 %]

root@xenial-apparmor:~# virsh blockjob snapvm vda --info
Active Block Commit: [6 %]

13) The blockjob then shows it is stuck at 100% until you --abort it:

root@xenial-apparmor:~# virsh blockjob snapvm vda --info
Active Block Commit: [100 %]

I have created a test package with the commits needed to solve the problem; it is available here:

https://launchpad.net/~mruffell/+archive/ubuntu/sf242822-test

What should happen:

If you install the test libvirt-bin and libvirt0 packages from the above PPA and run through the test case, blockcommit will not fail immediately when invoked; instead, it will continue until it reaches 100%, at which point the blockjob completes successfully.

[Regression Potential]

While four commits are required to fix this issue, all of them are fairly minor and only modify how the current status percentage is counted and how states are changed upon reaching 100% blockcommit. All changes are localised to one file.

Most of the commits are limited to blockcommit; in the event of a regression, only blockcommit and, by extension, some blockjobs would be impacted.

The commits have been present upstream for a long time, have been well tested by the community, and come from a release of libvirt with a very small delta to the one in xenial (1.3.2 versus 1.3.1). I believe there is little risk of regression.

[Other Info]

The following commits were identified in the upstream bug:
https://bugzilla.redhat.com/show_bug.cgi?id=1197592

which are also listed in comment #6.

commit 86c4df83b913dd73b79caeed2038291374384dc5
Author: Michael Chapman <email address hidden>
Date: Wed Jan 27 13:24:54 2016 +1100
Subject: virsh: improve waiting for block job readiness

commit 8fa216bbb40df33e7fce5d727aa3dc334480878a
Author: Michael Chapman <email address hidden>
Date: Wed Jan 27 13:24:53 2016 +1100
Subject: virsh: ensure SIGINT action is reset on all errors

commit 15dee2ef24f2f19f6dcd30d997b81c8a14582361
Author: Michael Chapman <email address hidden>
Date: Wed Jan 27 13:24:52 2016 +1100
Subject: virsh: be consistent with style of loop exit

commit 704dfd6b0fafe7eafca93a03793389239f8ab869
Author: Michael Chapman <email address hidden>
Date: Wed Jan 27 13:24:51 2016 +1100
Subject: virsh: avoid unnecessary progress updates

These fix the problem and were introduced upstream in libvirt 1.3.2. All commits are clean cherry-picks, and the code is still present in B, D, E and F.
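For reference, applying those four commits (oldest first, per their author dates) onto the 1.3.1 source could be sketched as below. The repository URL is an assumption on my part, and the actual SRU carries the fixes as patches in debian/patches rather than via a direct git tree:

```shell
# Sketch: cherry-pick the four fixes, oldest first, onto the libvirt 1.3.1
# tree. Hash order follows the author dates listed above.
COMMITS="704dfd6b 15dee2ef 8fa216bb 86c4df83"   # oldest -> newest

backport_fixes() {
    # Assumed upstream mirror; the canonical location may differ.
    git clone https://gitlab.com/libvirt/libvirt.git &&
    cd libvirt &&
    git checkout v1.3.1 &&
    for c in $COMMITS; do
        git cherry-pick "$c" || return 1
    done
}
```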

Joshua Powers (powersj) on 2017-04-12
Changed in libvirt (Ubuntu):
status: New → Incomplete
status: Incomplete → New
Joshua Powers (powersj) wrote :

Hi and thanks for reporting this bug! I am going to see if someone else from the team can also take a look at this to see how big of a change this would require.

Also, sorry for marking this as incomplete and then new again as I was on the wrong tab.

many thanks!

On Wed, Apr 12, 2017 at 2:38 PM, Joshua Powers <email address hidden>
wrote:

> Hi and thanks for reporting this bug! I am going to see if someone else
> from the team can also take a look at this to see how big of a change
> this would require.
>
> Also, sorry for marking this as incomplete and then new again as I was
> on the wrong tab.
>
> --
> You received this bug notification because you are subscribed to the bug
> report.
> https://bugs.launchpad.net/bugs/1681839
>
> Title:
> libvirt - disk not ready for pivot yet
>
> Status in libvirt package in Ubuntu:
> New
>
> Bug description:
> root@thewind:/home/bestpa/scripts# virsh blockcommit mail vda --active
> --verbose --pivot
> Block commit: [100 %]error: failed to pivot job for disk vda
> error: block copy still active: disk 'vda' not ready for pivot yet
>
> found related bugfix at redhat... can i get 1.3.2 pushed into ubuntu
> 16.04 release?
>
> bestpa@thewind:~$ cat /etc/os-release
> NAME="Ubuntu"
> VERSION="16.04.2 LTS (Xenial Xerus)"
> ID=ubuntu
> ID_LIKE=debian
> PRETTY_NAME="Ubuntu 16.04.2 LTS"
>
> bestpa@thewind:~$ libvirtd --version
> libvirtd (libvirt) 1.3.1
>
> To manage notifications about this bug go to:
> https://bugs.launchpad.net/ubuntu/+source/libvirt/+bug/
> 1681839/+subscriptions
>

Info on the repro I tried:
# create a simple system via uvtool-libvirt
$ uvt-kvm create [...]
$ virsh dumpxml <guest> > t1.xml
$ virsh undefine <guest>

# need to be transient for the blockcopy test
$ virsh create t1.xml

# Now we have a transient domain, and can copy them around:

$ virsh domblklist xenial-zfspool-libvirt
Target Source
------------------------------------------------
vda /var/lib/uvtool/libvirt/images/xenial-zfspool-libvirt.qcow
vdb /var/lib/uvtool/libvirt/images/xenial-zfspool-libvirt-ds-clone.qcow

# Since the referred bug reported this as being racy, I tried in a loop:
$ for idx in $(seq 1 20); do virsh blockcopy xenial-zfspool-libvirt vdb /var/lib/uvtool/libvirt/images/xenial-zfspool-libvirt-ds-clone${idx}.qcow --pivot --verbose --wait; done

It worked fine in 20/20 cases for me - I also checked on the bigger vda image but it worked as well.
That might only be due to less load, a smaller file, or whatever else defines the race window.
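A variant of that loop which stops at the first failure might make a race easier to catch in the act; a sketch, with the guest and path names taken from the attempt above and the function name being mine:

```shell
# run_until_failure: repeat the blockcopy up to $1 times and stop with a
# nonzero status as soon as one invocation fails, leaving the failing
# state in place for inspection with `virsh blockjob ... --info`.
run_until_failure() {
    max=$1 idx=1
    while [ "$idx" -le "$max" ]; do
        virsh blockcopy xenial-zfspool-libvirt vdb \
            "/var/lib/uvtool/libvirt/images/xenial-zfspool-libvirt-ds-clone${idx}.qcow" \
            --pivot --verbose --wait || {
                echo "failed at iteration $idx"
                return 1
            }
        idx=$((idx + 1))
    done
}
```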

You reported your issue on commit rather than copy, as in the RH bug.
So I looked into that more specifically.

$ virsh snapshot-create-as --domain testguest snap1 --diskspec vda,file=/var/lib/uvtool/libvirt/images/vda-snap1.qcow2 --disk-only --atomic --no-metadata
# touch a file in guest
$ virsh snapshot-create-as --domain testguest snap2 --diskspec vda,file=/var/lib/uvtool/libvirt/images/vda-snap2.qcow2 --disk-only --atomic --no-metadata
# touch a file in guest

This gave me a two-stage snapshot chain:
$ sudo qemu-img info --backing-chain /var/lib/uvtool/libvirt/images/vda-snap2.qcow2
image: /var/lib/uvtool/libvirt/images/vda-snap2.qcow2
[...]
backing file: /var/lib/uvtool/libvirt/images/vda-snap1.qcow2
[...]
image: /var/lib/uvtool/libvirt/images/vda-snap1.qcow2
[...]
backing file: /var/lib/uvtool/libvirt/images/testguest-clone5.qcow
backing file format: qcow2
[...]
image: /var/lib/uvtool/libvirt/images/testguest-clone5.qcow

Committing those onto the base worked as well:

virsh blockcommit testguest vda --active --verbose --pivot
Block commit: [100 %]
Successfully pivoted

In "virsh domblklist testguest" this moved me back from:
vda /var/lib/uvtool/libvirt/images/vda-snap1.qcow2
to
vda /var/lib/uvtool/libvirt/images/testguest-clone5.qcow

On the changes:
The first set of patches is in 1.2.18, so we already have those:
faa14391 virsh: Refactor block job waiting in cmdBlockCopy
74084035 virsh: Refactor block job waiting in cmdBlockCommit
2e782763 virsh: Refactor block job waiting in cmdBlockPull
eae59247 qemu: Update state of block job to READY only if it actually is ready

The second set went upstream in 1.3.2 and would need to be backported:
86c4df83 virsh: improve waiting for block job readiness
8fa216bb virsh: ensure SIGINT action is reset on all errors
15dee2ef virsh: be consistent with style of loop exit
704dfd6b virsh: avoid unnecessary progress updates

This set seems almost backportable at first look, but I didn't check all of the dependencies that are not so obvious.

On the original request: we won't just move to 1.3.2 in Xenial, as that would be against the SRU policy [1], which protects stability for many/all other use cases.
Instead, we could work on this together to backport, test, and verify a fix for Xenial.
Or you could use the Ubuntu Cloud Archive [2], which is like a special backport pocket that provides the latest cloud/virtualization-related packages. With that you could get the libvirt/qemu stack of Yakkety or Zesty and should be good as well.

If you pick the former, someone needs to find the time to create the backports.
I don't know if I will immediately get to this, given that it is a rather rarely used use-case and also only triggers on a race, as it seems to me. If you want to work with me on preparing it, I'll try to help as best I can - and there is also the USBSD [3] that might help if you are unsure.

[1]: https://wiki.ubuntu.com/StableReleaseUpdates
[2]: https://wiki.ubuntu.com/OpenStack/CloudArchive
[3]: https://naccblog.wordpress.com/2017/03/24/usbsd-1-goals-inaugural-ubuntu-server-bug-squashing-day/

For now I'll mark it as incomplete, waiting for any further info you can provide.
To better triage and confirm your case, I'd like to understand whether:
- you can reliably trigger this (if you have steps to do so, please report them as well)
- it was a one-time failure
- it happens every now and then in your environment

Also, if you have identified anything about the creation of the images (size, format, ...) that affects the chance to reproduce, please let us know.

Changed in libvirt (Ubuntu):
status: New → Incomplete
importance: Undecided → Medium
Patrick Best (bestpa) wrote :

Well, I'll be. It seems to work with no problem on subsequent tries.
Here's my methodology and run-through.

virsh # list
 Id Name State
----------------------------------------------------

 17 mail running

virsh #
virsh #
virsh #
virsh # domblklist mail
Target Source
------------------------------------------------
vda /images2/mail.img
hdb /home/bestpa/iso/ubuntu-14.04.5-server-amd64.iso

virsh #
virsh #
virsh #
virsh #
virsh # list
 Id Name State
----------------------------------------------------

 17 mail running

virsh # domblklist mail
Target Source
------------------------------------------------
vda /images2/mail.img
hdb /home/bestpa/iso/ubuntu-14.04.5-server-amd64.iso

virsh # snapshot-list mail
 Name Creation Time State
------------------------------------------------------------

virsh # snapshot-create-as --domain mail mail-snap1 --disk-only --atomic
Domain snapshot mail-snap1 created
virsh # snapshot-list mail
 Name Creation Time State
------------------------------------------------------------
 mail-snap1 2017-04-20 16:03:06 -0400 disk-snapshot

virsh # domblklist mail
Target Source
------------------------------------------------
vda /images2/mail.mail-snap1
hdb /home/bestpa/iso/ubuntu-14.04.5-server-amd64.iso

virsh # blockcommit mail vda --active --verbose --pivot
Block commit: [100 %]
Successfully pivoted
virsh # domblklist mail
Target Source
------------------------------------------------
vda /images2/mail.img
hdb /home/bestpa/iso/ubuntu-14.04.5-server-amd64.iso

virsh # snapshot-list mail
 Name Creation Time State
------------------------------------------------------------
 mail-snap1 2017-04-20 16:03:06 -0400 disk-snapshot

virsh # snapshot-delete mail mail-snap1 --metadata
Domain snapshot mail-snap1 deleted

virsh # snapshot-list mail
 Name Creation Time State
------------------------------------------------------------

virsh # domblklist mail
Target Source
------------------------------------------------
vda /images2/mail.img
hdb /home/bestpa/iso/ubuntu-14.04.5-server-amd64.iso

So as you see, I had no problems doing this. I wish I could delete the snapshot files, but for some reason we can only use --metadata and must delete the files at the filesystem level.

Patrick Best (bestpa) wrote :

At this point I'll consider it a one-time occurrence, but I'm thinking it may have happened other times, given a jammed-up backup script I see once in a while. I don't wish to pursue it further until it's too infuriating; then I'll just run a host OS on a different long-term distro with a fresher version available.

Thanks for looking.

P

Thanks for reporting back.
There are a few races with block jobs that we are looking into at the moment which might well affect this.
Unfortunately - as it always is with races - they are hard to trigger/confirm, and I had hoped you might have found a case to reliably trigger it.

If you happen to find any coincidence like the jammed backup that helps to somewhat reliably recreate please reach out.

Otherwise, as I said, anything from Yakkety onward already has the fixes. And while this was not the purpose it was meant for, I've seen people choose [1] sometimes just to run the base LTS with a newer virt stack.

[1]: https://wiki.ubuntu.com/OpenStack/CloudArchive

Patrick Best (bestpa) wrote :

Happened again. Same VM, too. My backup script made it through 3 of these, one hda, and one vda as well. Then this:

-------------BEGIN backup for VM called mail
Sat Apr 22 00:07:51 EDT 2017
current snapshots mail - should be empty
 Name Creation Time State
------------------------------------------------------------

initial blklist and snapshot list mail
Target Source
------------------------------------------------
vda /images2/mail.img
hdb /home/bestpa/iso/ubuntu-14.04.5-server-amd64.iso

 Name Creation Time State
------------------------------------------------------------

block type is vda
image location is /images2/mail.img
creating snapshot for mail
Domain snapshot mail-snap1 created
current snapshots for mail
 Name Creation Time State
------------------------------------------------------------
 mail-snap1 2017-04-22 00:07:52 -0400 disk-snapshot

  performing FIRST TIME SPARSE rsync for mail
mail.img

sent 77.70G bytes received 35 bytes 75.11M bytes/sec
total size is 77.68G speedup is 1.00
I am done with the rsync.
current blklist mail
Target Source
------------------------------------------------
vda /images2/mail.mail-snap1
hdb /home/bestpa/iso/ubuntu-14.04.5-server-amd64.iso

blockcommit mail
Block commit: [100 %]error: failed to pivot job for disk vda
error: block copy still active: disk 'vda' not ready for pivot yet

Patrick Best (bestpa) wrote :

what's the proper way to keep running the base LTS with a newer virt stack? Do i need to point to a particular repo? Don't even know where to start with this...

Hi Patrick,
sorry to see you ran into it again - but for now I consider it a great chance to find something that allows us to reproduce and catch the issue.

You said:
"My backup script made it through 3 of these, one hda, and one vda as well. Then this"
It seems your backup script does:
1. check old snapshots
2. create a snapshot
3. copies off the now stable base image
4. blockcommits image and snapshot together
=> Is your comment saying that three of these cycles worked (like one a day or such), but then on the fourth you triggered the bug again?
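That four-step cycle could be sketched as below; this is a hypothetical reconstruction, since the actual script had not been posted at this point in the thread, and all names are illustrative:

```shell
# backup_cycle: hypothetical reconstruction of the backup cycle described
# above. Arguments: domain, disk target, base image path, backup destination.
backup_cycle() {
    dom=$1 disk=$2 base=$3 dest=$4
    virsh snapshot-list "$dom"                    # 1. check old snapshots
    virsh snapshot-create-as --domain "$dom" "${dom}-snap1" \
        --disk-only --atomic                      # 2. snapshot; writes go to overlay
    rsync --sparse "$base" "$dest"                # 3. copy the now-stable base image
    virsh blockcommit "$dom" "$disk" \
        --active --verbose --pivot                # 4. commit overlay back and pivot
    virsh snapshot-delete "$dom" "${dom}-snap1" --metadata
}
```

It is step 4 that fails intermittently in this report, leaving the guest running on the overlay.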

To improve the chance of recreating it, might I ask you a bunch of questions about your disk/system setup:
1. could you share your guest xml
2. could you share your backup script
3. could you elaborate on your base filesystem setup on the Host
4. is your system overall under a lot of CPU consumption - if so what kind of load?
5. is your system overall under a lot of Disk I/O - if so what kind of load?
6. is your guest that is failing under a lot of CPU consumption - if so what kind of load?
7. is your guest that is failing under a lot of Disk I/O - if so what kind of load?
8. Any changes coming to your mind that explain why this happens recently - is there any new HW/Software/Scripts or a changed workload in place now?

On your question about using the Ubuntu Cloud Archive: as I mentioned, people sometimes "mis-use" it just for a newer virtualization stack, but one has to keep in mind that this is not its original purpose.
If you want to give it a try to check whether the newer releases in there give you the stability you need for your use-case, go to [1]. It explains the basics; as a TL;DR, it is a special PPA [2]. Therefore, to "use" it you add that PPA, and an apt update/upgrade will then pull in the newer software packages. Given that you seem to be on a production system, you might want to test that ahead of time, almost as you would a major OS upgrade.

[1]: https://wiki.ubuntu.com/OpenStack/CloudArchive#The_Ubuntu_Cloud_Archive
[2]: https://help.launchpad.net/Packaging/PPA

Patrick Best (bestpa) wrote :
Download full text (16.1 KiB)

Happy to share what I can.

I should have mentioned that the backup script goes through all my VMs, and my ambiguous comment meant that it went through 3 of the VMs before stalling on this, the fourth. The system is low utilisation for RAM, CPU and disk: a ProLiant G5, dual-chip quad core (no HT), using a P400 RAID card (transparent to the system), with /images on a RAID-1 SATA spindle and /images2 on a RAID-1 SATA SSD. While there are some I/O wait indicators on my hypervisor (very low though), there's no steal time recorded on any of my VMs.
The ten or so VMs are low-utilisation, administrative machines (my mail server, a zabbix server, a landscape server, etc). The system is LTS with no tweaks, kept up to date on a regular basis.

/backup has been an NFS mount point and an external USB drive; I witnessed the failure condition on both.
The failing VM is on my SSD RAID drive at /images2.

Smart Array P400 in Slot 0 (Embedded)
   Bus Interface: PCI
   Slot: 0
   Serial Number: PA2240J9SU5360
   Cache Serial Number: PA2270D9SU21FK
   Controller Status: OK
   Hardware Revision: B
   Firmware Version: 1.18
   Rebuild Priority: Low
   Surface Scan Delay: 15 secs
   Surface Scan Mode: Idle
   Parallel Surface Scan Supported: No
   Elevator Sort: Enabled
   Post Prompt Timeout: 0 secs
   Cache Board Present: True
   Cache Status: OK
   Cache Ratio: 100% Read / 0% Write
   Drive Write Cache: Disabled
   Total Cache Size: 512 MB
   Total Cache Memory Available: 464 MB
   No-Battery Write Cache: Disabled
   Battery/Capacitor Count: 0
   SATA NCQ Supported: False
   Number of Ports: 2 Internal only
   Driver Name: cciss
   Driver Version: 3.6.26
   PCI Address (Domain:Bus:Device.Function): 0000:06:00.0
   Host Serial Number: 2UX70501S6
   Sanitize Erase Supported: False

   Array: A
      Interface Type: SATA
      Unused Space: 0 MB (0.0%)
      Used Space: 931.5 GB (100.0%)
      Status: OK
      Array Type: Data

   Array: B
      Interface Type: SATA
      Unused Space: 0 MB (0.0%)
      Used Space: 447.1 GB (100.0%)
      Status: OK
      Array Type: Data

      logicaldrive 1 (465.7 GB, RAID 1, OK)
      logicaldrive 2 (223.5 GB, RAID 1, OK)
      physicaldrive 2I:1:1 (port 2I:box 1:bay 1, SATA, 500 GB, OK)
      physicaldrive 2I:1:2 (port 2I:box 1:bay 2, SATA, 500 GB, OK)
      physicaldrive 2I:1:3 (port 2I:box 1:bay 3, SATA, 240.0 GB, OK)
      physicaldrive 2I:1:4 (port 2I:box 1:bay 4, SATA, 250 GB, OK)

root@thewind:~#

root@thewind:~# top
top - 09:10:18 up 38 days, 14:09, 1 user, load average: 4.81, 4.53, 4.60
Tasks: 280 total, 1 running, 279 sleeping, 0 stopped, 0 zombie
%Cpu(s): 10.4 us, 13.1 sy, 0.0 ni, 75.5 id, 1.1 wa, 0.0 hi, 0.0 si, 0.0 st
KiB Mem : 64943112 total, 394580 free, 25359596 used, 39188936 buff/cache
KiB Swap: 66056188 total, 64023212 free, 2032976 used. 37839620 avail Mem

root@thewind:~# cat /proc/cpuinfo | grep Xe
model name : Intel(R) Xeon(R) CPU X5450 @ 3.00GHz
model name : Intel(R) Xeon(R) CPU X5450 @ 3.00GHz
model name : Intel(R) Xeon(R) CPU X5450 @ 3.00GHz
model name : Intel(R) Xeon(R) CPU X5450 @ 3.00GHz
model name : Intel(R) Xe...

Thanks Patrick,
unfortunately I haven't found anything in there that was missing from my attempt to recreate :-/

Btw: I think this should be $i and not mail, although if the guests are all the same it doesn't matter:
"IMAGE_DIR=`virsh domblklist mail" -> "IMAGE_DIR=`virsh domblklist $i"

What is the amount of data you sync each time - maybe it is far bigger in your case than in mine.
Do you happen to know how much that is? I mean, we only talk about the I/O that sums up while you do the backup; how long is that - a few minutes? It should not be too much, right?

If you end up trying the newer libvirt, let me know if that solves your issue at least.
There would also be a way in between - I have a ppa where we tried to backport some of the blockjob changes here:
https://launchpad.net/~ci-train-ppa-service/+archive/ubuntu/2619

Chances are that this might affect you, yet OTOH, since this is your production system, I'd not use it as it is experimental. That said, while I fail to reproduce: do you have a test env where this triggers as well and which you could use to try such experimental libvirt packages?

kritek (kritek) wrote :

I can reproduce this reliably on Server 16.04.3 LTS.

virsh version
Compiled against library: libvirt 1.3.1
Using library: libvirt 1.3.1
Using API: QEMU 1.3.1
Running hypervisor: QEMU 2.5.0

I have 4 VMs; the one that consistently fails is write-heavy - it's a carbon/graphite server.
Steps to reproduce:

virsh snapshot-create-as --domain centos7-graphite centos7-graphite-SNAP1 --diskspec vda,file=/var/lib/libvirt/images/centos7-graphite.img-SNAP1 --disk-only --atomic

sleep 300 (approximate time of rsync of base img to destination)

virsh blockcommit centos7-graphite vda --active --pivot --shallow --verbose
This is where it fails:

Block commit: [100 %]error: failed to pivot job for disk vda
error: block copy still active: disk 'vda' not ready for pivot yet

I end up having to shut down the VM, delete the snapshot metadata, delete the disk attachment (the SNAP disk), re-attach the original disk, and then boot the VM again to restore it.
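The manual recovery described above might be sketched as follows; this is a hypothetical function of mine, the paths follow the naming in the reproduction steps, and details like the disk subdriver would need checking against the actual guest:

```shell
# recover_guest: sketch of the manual recovery described above - shut the
# guest down, drop the snapshot metadata, swap the SNAP overlay back for
# the original disk, and boot again. Names are illustrative.
recover_guest() {
    dom=$1
    virsh destroy "$dom"                                    # shut down the VM
    virsh snapshot-delete "$dom" "${dom}-SNAP1" --metadata  # drop snapshot metadata
    virsh detach-disk "$dom" vda --config                   # remove the SNAP disk
    virsh attach-disk "$dom" \
        "/var/lib/libvirt/images/${dom}.img" vda \
        --config --subdriver raw                            # re-attach original disk
    virsh start "$dom"                                      # boot the VM again
}
```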

I increased the write load in my reproducer, but still can't trigger it here :-/

Did you have any chance to try the Cloud Archive versions mentioned in c#13, or (I know it is based on an older version, but you could force it in) the ppa from c#15?

From the comments it seems this is your production environment; any chance you can set up an equivalent test environment to test the packages I mentioned, without causing too much trouble for your main load?

Dominik Psenner (dpsenner) wrote :

We see this same issue on one of our production systems. The live backup scripts fail every few days, and it is then necessary to manually run a blockjob abort; a subsequent blockcommit usually passes. The backup scripts can be found here: https://github.com/dpsenner/libvirt-administration-tools

On <email address hidden> a dev suggested upgrading libvirt to a newer version. He indicated that virsh 1.3.1 is ancient and that artful is actually already at 3.6. It would seem that they have addressed several issues and fixed several race conditions, as indicated in earlier comments. Unfortunately there's no way to upgrade the production systems' OS to a newer Ubuntu release just for gigs. It would however help if a newer version of libvirt were backported to 16.04 LTS. Are there any dependency issues that prevent the backport of a newer libvirt?

Hi Dominik,
thanks for the links.

Yes, 1.3.1 is ancient in the sense of "as old as the 16.04 Ubuntu release", plus fix backports as they are identifiable and qualify for the SRU process [1].

We have three options here, but at the moment not all are feasible:

1. Backport the fix to Xenial
I beg your pardon, but for this particular case I have so far been unable to recreate it in order to debug further on my side or to identify the fix it would need for the SRU - I provided a test ppa in c#15 to help with that, but I understand that it can be unwanted to shove that onto production systems to test.

2. Update the packages in Xenial to newer versions
The complexity of the virtualization stack (and all the potential regressions from just upgrading the versions for everyone out there) can be high, so simply bumping those in Xenial to the level we have in e.g. artful does not qualify for an SRU update.

3. Use a backport
Since the virtualization stack is one of those places where the "newer vs stable" issue comes up often, and even more so because newer OpenStack releases should go along with a newer virt stack, there is a way out, just as you assume. Via the Ubuntu Cloud Archive [2] you can get access to a backport of the most recent versions on the latest LTS. While it wasn't created for that purpose, I'd think this is the "backport" to 16.04 LTS you might be looking for.

[1]: https://wiki.ubuntu.com/StableReleaseUpdates
[2]: https://wiki.ubuntu.com/OpenStack/CloudArchive

Dominik Psenner (dpsenner) wrote :

Hi Christian,

thanks for your insights. The Ubuntu Cloud Archive is completely new to me. Am I right in the assumption that adding the ocata cloud-archive repository with 'sudo add-apt-repository cloud-archive:ocata' and a subsequent 'apt update && apt upgrade' would effectively upgrade libvirt to 2.5.0, respectively 3.5.0 if I would add cloud-archive:pike? What implications does this have when upgrading the production system to a newer LTS in roughly two years? Will a dist-upgrade even work out fine without actually bashing the production system or would you advice to plan the reinstallation of the hosting machine from scratch?

pike would be 3.6, not 3.5 - other than that, yes.

In general upgrading from 16.04+UCA-Pike -> 18.04 shouldn't be very different to 17.10->18.04.
It is supposed to work.

As you know, it is generally good advice to go with test systems, backups, phased upgrades, ... as there always could be something - but in general, yeah, there is nothing blocking your usual upgrade path.

If you want to test how that will be when 18.04 is out: on a test system, take Trusty + UCA-Mitaka (which is on the level of 16.04) and then upgrade to Ubuntu 16.04.
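The release-to-version mapping discussed in this thread could be captured in a small helper; the versions below are as quoted here and not independently verified against the archive, and the enablement commands are the standard Cloud Archive flow:

```shell
# uca_libvirt_version: hypothetical helper encoding the UCA-release ->
# libvirt-version mapping as quoted in this thread.
uca_libvirt_version() {
    case "$1" in
        mitaka) echo 1.3.1 ;;   # "on the level of 16.04", per the comment above
        ocata)  echo 2.5.0 ;;
        pike)   echo 3.6 ;;
        *)      return 1 ;;
    esac
}

# Enabling a pocket is then, e.g. (requires root; not run here):
#   sudo add-apt-repository cloud-archive:pike
#   sudo apt update && sudo apt upgrade
```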

Dominik Psenner (dpsenner) wrote :

Thanks for pointing out that pike would be 3.6. To me it is still hard to track which version each UCA release includes, because those resources are actually quite hard to find.

Given the proposed solution [3], do you consider the Ubuntu Cloud Archive package repository the recommended way of getting a more recent libvirt package on an Ubuntu LTS? Or would you rather recommend a dist-upgrade to the next stable Ubuntu release? As of today that would be 17.10. Of course, that would require us to schedule major updates more frequently.

<personal_opinion>
On "static" systems I'm usually a slow upgrader; on other systems I use daily cloud images right away.
So if you have a complex (custom/manual) setup, I'd likely go with LTS+UCA.
It means fewer major changes to your system than doing a release upgrade every 6 months, but would keep your virt stack up to date.
</personal_opinion>

Note: I'm not sure whether for official support (read: Ubuntu Advantage) there are special constraints around that.

falstaff (falstaff) wrote :

Observed the same issue on Ubuntu 16.04.4 with a Dell R440 and a RAID 5 consisting of 3 10k SAS disks. Using 16.04+UCA-Pike resolved the issue just fine.

I’ve given up on qcow and on ubuntu for my hypervisor needs. See ya!


Thanks Falstaff - yes, we knew the fixes are in later releases; they were just hard to backport while keeping the general regression risk low (for all other users).
UCA, as you used it, is a valid way to get fixes onto the last LTS ahead of time. Thanks for verifying this again.

@bestpa - sad to hear :-/ but see ya another day on another case.

Changed in libvirt (Ubuntu Bionic):
status: Incomplete → Fix Released
Changed in libvirt (Ubuntu Artful):
status: New → Fix Released
Changed in libvirt (Ubuntu Xenial):
status: New → Won't Fix
summary: - libvirt - disk not ready for pivot yet
+ libvirt: blockcommit fails - disk not ready for pivot yet
description: updated
tags: added: sts
Changed in libvirt (Ubuntu Xenial):
status: Won't Fix → In Progress
importance: Undecided → Medium
assignee: nobody → Matthew Ruffell (mruffell)
Matthew Ruffell (mruffell) wrote :

Attached is the debdiff for xenial to fix this issue.

I was not sure if the patches in debian/patches should be placed in the debian/patches/ubuntu directory or not, so I left them outside. Feel free to move them if necessary.

Thanks++
Now that I had (new) steps to reproduce, I could work on those.
I wondered if an LVM is really strictly needed - dropping it would also ease the initialization.
So I simplified the steps to:

$ apt install uvtool-libvirt
$ uvt-simplestreams-libvirt sync --source http://cloud-images.ubuntu.com/daily arch=amd64 label=daily release=xenial
$ uvt-kvm create xsnaptest arch=amd64 release=xenial label=daily
# depending on your apparmor config you might want to add something like this TEMPORARY to /etc/apparmor.d/abstractions/libvirt-qemu '/var/lib/uvtool/libvirt/images/* rwk,'
$ virsh snapshot-create-as --domain xsnaptest --diskspec vda,file=/var/lib/libvirt/images/xsnaptest-snapshot.qcow2,snapshot=external --disk-only --atomic

I started a pair of loops: one dirtying the snapshot, the other committing and pivoting it.
# make dirty:
$ while /bin/true; do uvt-kvm ssh --insecure xsnaptest "dd if=/dev/urandom of=file.txt count=4096 bs=1M"; done
# snapshot, wait and pivot blockcommit
$ while virsh blockcommit xsnaptest vda --active --verbose --pivot --wait; do rm /var/lib/libvirt/images/xsnaptest-snapshot.qcow2; sleep 2s; virsh snapshot-create-as --domain xsnaptest --diskspec vda,file=/var/lib/libvirt/images/xsnaptest-snapshot.qcow2,snapshot=external --disk-only --atomic; sleep $(( RANDOM % 30 ))s; ll -h /var/lib/libvirt/images/xsnaptest-snapshot.qcow2; done

The snapshots to commit were about 200M to 6.9G, but none triggered the issue (about 40 tries in the loop).
So maybe it really only happens (or is much more likely) when the original backing image being written back is an LVM volume.
Glad you found that - it makes your test a reliable reproducer.

For the sake of seeing it trigger at least once, I redeployed a machine with Xenial and created LVMs on a free /dev/sdb disk, as your example needs.
# create guest
$ uvt-simplestreams-libvirt --verbose sync --source http://cloud-images.ubuntu.com/daily arch=amd64 label=daily release=xenial
$ uvt-kvm create xsnaptest arch=amd64 release=xenial label=daily

# create Volume
$ sudo pvcreate /dev/sdb
$ sudo vgcreate LVMpool_vg /dev/sdb
$ cat > lvmpool.xml <<EOF
<pool type="logical">
<name>LVMpool_vg</name>
<source>
<device path="/dev/sdb"/>
</source>
<target>
<path>/dev/LVMpool_vg</path>
</target>
</pool>
EOF
$ virsh pool-define lvmpool.xml
$ virsh pool-start LVMpool_vg
$ virsh vol-create-as LVMpool_vg lvvol1 15G

# Use volume in the guest
$ cat > lvmdisk.xml <<EOF
<disk type='block' device='disk'>
  <driver name='qemu' type='raw'/>
  <source dev='/dev/LVMpool_vg/lvvol1'/>
  <target dev='vdc' bus='virtio'/>
</disk>
EOF
$ virsh attach-device xsnaptest lvmdisk.xml

# Prep initial snapshot
$ virsh snapshot-create-as --domain xsnaptest --diskspec vdc,file=/var/lib/libvirt/images/xsnaptest-snapshot.qcow2,snapshot=external --disk-only --atomic

# Check snapshot being backed by lvmdisk
$ sudo qemu-img info /var/lib/libvirt/images/xsnaptest-snapshot.qcow2
image: /var/lib/libvirt/images/xsnaptest-snapshot.qcow2
file format: qcow2
virtual size: 15G (16106127360 bytes)
disk size: 196K
cluster_size: 65536
backing file: /dev/LVMpool_vg/lvvol1
backing file format: raw
Format specific information:
    compat: 1.1
    lazy refcounts: false
    refcount bits: 16
    corrupt: false

# dump I/O onto that device from inside the guest
$ while /bin/true; do uvt-kvm ssh --insecure xsnaptest "sudo dd if=/dev/urandom of=/dev/vdc count=8192 bs=1M"; done

# Iterate on it while the disk/snapshot keeps getting dirty
$ while virsh blockcommit xsnaptest vdc --active --verbose --pivot --wait; do sudo rm /var/lib/libvirt/images/xsnaptest-snapshot.qcow2; sleep 2s; virsh snapshot-create-as --domain xsnaptest --diskspec vdc,file=/var/lib/libvirt/images/xsnaptest-snapshot.qcow2,snapshot=external --disk-only --atomic; sleep $(( RANDOM % 30 + 20 ))s; sudo ls -laFh /var/lib/libvirt/images/xsnaptest-snapshot.qcow2; done

Finally I saw it in action:
Block commit: [100 %]error: failed to pivot job for disk vdc
error: block copy still active: disk 'vdc' not ready for pivot yet

I retried and this was reproducible.

I upgraded to the PPA (more about that later) and ran my loop.
It reached 100% and then the pivot itself got slow (due to the ongoing I/O).
I needed to either wait quite a while or slow down the ongoing I/O a bit.

I ran the loop about 10 times, and with the fix it never failed again (snapshots sized between 519M and 7.1G).
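Since the pivot can lag well behind the reported 100 % under sustained guest I/O, a tiny helper can check whether a `virsh blockjob ... --info` line actually reports completion before a pivot is attempted. This is a hypothetical sketch, not part of the scripts above; `job_ready` and the sample strings are illustrative:

```shell
# Hypothetical helper: succeed once a `virsh blockjob ... --info`
# line reports 100 %, i.e. the job has converged enough to try a pivot.
job_ready() {
  case "$1" in
    *"[100 %]"*) return 0 ;;
    *)           return 1 ;;
  esac
}

# Usage with captured output; in practice the argument would come from
# "$(virsh blockjob xsnaptest vdc --info)".
if job_ready "Active Block Commit: [100 %]"; then echo ready; else echo waiting; fi
```

Note that, as this bug shows, on unfixed xenial even a job at 100 % may refuse the pivot while new writes keep arriving, so this check reduces but does not eliminate the race.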

Hi Matthew,
thanks for picking up the torch again on this issue, which affected quite a few people but so far never reached a state where it was really fixable.

## Verification ##
Most important was getting a reproducer for test and verification.
This was formerly a big issue - it affected plenty of people on the bug - but we never reached a state where we could reliably reproduce it to verify a fix. I have read your update to the description - thanks for adding all that.
I gave the new repro steps outlined there a try, and as you have seen above I can confirm that they are good \o/

## Patches ##
You included 4 patches which exactly match what I identified a while ago - thanks.
As I said back then in comment #6, they seemed rather backportable; thanks for doing that in the attached debdiff.
They are missing proper DEP-3 tagging, but I can fix that up ahead of sponsoring for you.
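For reference, DEP-3 tagging means each file in debian/patches carries a header block along these lines (values here are illustrative, not the actual headers used):

```
Description: blockjob: don't time out once the commit has reached 100%
Origin: upstream, <commit hash/url>
Bug-Ubuntu: https://bugs.launchpad.net/bugs/1681839
Last-Update: 2019-10-31
```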

## Regression Risk ##
I don't fully agree with the regression assessment. You didn't say anything wrong, but from lessons learned in the past, blockjobs have turned out to be a source of unexpected and sometimes strange regressions.
I agree that it should (tm) be safe, but we should be extra cautious as well.
Once it is in -proposed we'll want more time there and should probably do some extra tests.

## Testing in proposed ##
1) I can provide some regression testing on my own, with a focus on (but not exclusively) migration. My tests aren't that heavy on snapshots, which is where this change has the biggest chance of an impact.

2) @Matthew - if you could provide more tests (maybe SEG has some on top) for regressions in general, that would be great.

3) @Matthew - we might consider asking e.g. the OpenStack team to run a test set on it as well, just to be on the safe side. Will you ping and ask them, or should I?

## PPA ##
The old PPA I had is long dead.
I opened a new one (like yours but with my minimal patch header updates and builds on all architectures) at:
=> https://launchpad.net/~paelzer/+archive/ubuntu/bug-1681839-blockjob-timeout-xenial/+packages

## Sponsoring ##
This LGTM as-is from the patches, but as mentioned we should do tests 1+2+3.
The SRU team can already take a look at accepting it; we can test either from the PPA or against xenial-proposed once accepted. Only the actual verification of the case strictly has to be done on -proposed.

Tagged and sponsored to Xenial-unapproved.
Now it is up to the SRU Team.

@Matthew - please try to get as much testing in place as possible.
As I said, all but the final verification can be done either on the PPA in advance or once it is in -proposed - whatever fits your time and setup.

I'll setup a test on my own as I mentioned ...

FYI: my pre-checks on the PPA build 1.3.1-1ubuntu10.29~ppa1 look good.

prep (x86_64) : Pass 25 F/S/N 0/0/0 - RC 0 (17 min 84141 lin)
migrate (x86_64) : Pass 232 F/S/N 0/12/0 - RC 0 (63 min 104302 lin)
cross (x86_64) : Pass 64 F/S/N 0/1/0 - RC 0 (70 min 94458 lin)
misc (x86_64) : Pass 76 F/S/N 0/0/0 - RC 0 (18 min 29606 lin)

Hello Patrick, or anyone else affected,

Accepted libvirt into xenial-proposed. The package will build now and be available at https://launchpad.net/ubuntu/+source/libvirt/1.3.1-1ubuntu10.29 in a few hours, and then in the -proposed repository.

Please help us by testing this new package. See https://wiki.ubuntu.com/Testing/EnableProposed for documentation on how to enable and use -proposed. Your feedback will aid us getting this update out to other Ubuntu users.

If this package fixes the bug for you, please add a comment to this bug, mentioning the version of the package you tested and change the tag from verification-needed-xenial to verification-done-xenial. If it does not fix the bug for you, please add a comment stating that, and change the tag to verification-failed-xenial. In either case, without details of your testing we will not be able to proceed.

Further information regarding the verification process can be found at https://wiki.ubuntu.com/QATeam/PerformingSRUVerification . Thank you in advance for helping!

N.B. The updated package will be released to -updates after the bug(s) fixed by this package have been verified and the package has been in -proposed for a minimum of 7 days.
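For anyone verifying: a sketch (assuming a Xenial host) of enabling -proposed and installing just the fixed build, so nothing else is pulled from -proposed; remove the source entry again afterwards:

```shell
# Sketch, assuming Ubuntu 16.04: enable xenial-proposed and install only
# the fixed libvirt build; drop the sources entry again when done.
echo "deb http://archive.ubuntu.com/ubuntu xenial-proposed main" | \
  sudo tee /etc/apt/sources.list.d/xenial-proposed.list
sudo apt-get update
sudo apt-get install libvirt-bin=1.3.1-1ubuntu10.29
```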

Changed in libvirt (Ubuntu Xenial):
status: In Progress → Fix Committed
tags: added: verification-needed verification-needed-xenial

I thank you for all your work.

I have since moved away from this block architecture and am no longer able
to verify with an existing configuration.


@mruffell / @fabiomartins - would you be so kind as to do the SRU verification on this one?

Matthew Ruffell (mruffell) wrote :

The following is verification performed by Fabio in a lab:

- Tested with the original libvirt to make sure I was able to reproduce:

root@ubuntu:~# apt-cache policy libvirt-bin
libvirt-bin:
Installed: 1.3.1-1ubuntu10.27
Candidate: 1.3.1-1ubuntu10.27
Version table:
*** 1.3.1-1ubuntu10.27 500
500 http://archive.ubuntu.com/ubuntu xenial-updates/main amd64 Packages
500 http://security.ubuntu.com/ubuntu xenial-security/main amd64 Packages
100 /var/lib/dpkg/status
1.3.1-1ubuntu10 500
500 http://archive.ubuntu.com/ubuntu xenial/main amd64 Packages

root@ubuntu:~# virsh blockcommit testvm vda --active --verbose --pivot --wait
Block commit: [100 %]error: failed to pivot job for disk vda
error: block copy still active: disk 'vda' not ready for pivot yet

- Upgraded to proposed and tested again, and problem is gone:

root@ubuntu:~# apt-cache policy libvirt-bin
libvirt-bin:
Installed: 1.3.1-1ubuntu10.29
Candidate: 1.3.1-1ubuntu10.29
Version table:
*** 1.3.1-1ubuntu10.29 500
500 http://archive.ubuntu.com/ubuntu xenial-proposed/main amd64 Packages
100 /var/lib/dpkg/status
1.3.1-1ubuntu10.27 500
500 http://archive.ubuntu.com/ubuntu xenial-updates/main amd64 Packages
500 http://security.ubuntu.com/ubuntu xenial-security/main amd64 Packages
1.3.1-1ubuntu10 500
500 http://archive.ubuntu.com/ubuntu xenial/main amd64 Packages

root@ubuntu:~# virsh blockcommit testvm vda --active --verbose --pivot --wait
Block commit: [100 %]
Successfully pivoted

End of test by Fabio.

The package is looking good. We have also asked the customer to install the test package and verify that it works under their workload. We might just wait for their confirmation before marking this as verified, in order to give this a little more time to soak in -proposed.

Will update again soon.

Matthew Ruffell (mruffell) wrote :

The customer has been unresponsive in testing the package in -proposed in their environment, so we will move on with verification.

In my previous comment on 2019-11-27, we showed that libvirt 1.3.1-1ubuntu10.29 can successfully execute a blockcommit on an LVM-backed volume with virsh.

This still holds today, and this bug has had ample time to soak in -proposed, so I am happy to mark it as verified.

tags: added: verification-done-xenial
removed: verification-needed verification-needed-xenial

The verification of the Stable Release Update for libvirt has completed successfully and the package is now being released to -updates. Subsequently, the Ubuntu Stable Release Updates Team is being unsubscribed and will not receive messages about this bug report. In the event that you encounter a regression using the package from -updates please report a new bug using ubuntu-bug and tag the bug report regression-update so we can easily find any regressions.

Launchpad Janitor (janitor) wrote :

This bug was fixed in the package libvirt - 1.3.1-1ubuntu10.29

---------------
libvirt (1.3.1-1ubuntu10.29) xenial; urgency=medium

  * debian/patches/lp1681839-*.patch: Fix block commit timeout
    races, and ensure that once commit has reached 100%, timeouts
    no longer apply. (LP: #1681839)

 -- Matthew Ruffell <email address hidden> Thu, 31 Oct 2019 10:52:41 +1300

Changed in libvirt (Ubuntu Xenial):
status: Fix Committed → Fix Released