qemu2.0.0 should be upgraded to 2.4 on Jenkins Product CI

Bug #1484080 reported by Dennis Dmitriev
34
This bug affects 8 people
Affects Status Importance Assigned to Milestone
Fuel for OpenStack
Fix Released
High
Mateusz Matuszkowiak

Bug Description

There are several types of issues faced using qemu2.0.0:

1) Fuel admin node freezes after suspend / resume sequence (even without taking a snapshot) eating 100% CPU: [1], [2]
Reproduced in ~70% system tests when snapshot is taken after deploy of cluster.
We use workaround for this issue in system tests: destroy and start Fuel admin node if it inaccessible after revert. Duration of each job that perform system tests can be increased more than on an hour because of this issue.

2) Docker containers on Fuel master node can be completely broken after destroy Fuel admin node that freezes by the reason from the issue №1 : [3]
No workaround here because there is an unrecoverable data loss. Tests are failed.

3) Sometimes libvirt unable to take a snapshot because a volume can be busy for unknown reason at the moment of taking the snapshot: [4]
No workaround here. Tests are failed.

To investigate the issue, was performed the following actions:
  - Stop docker containers before taking a snapshot. Issue №1 is reproduced.
  - Reduce IO on Fuel admin node by stopping docker containers and rsyslogd. Issue №1 is reproduced.
  - Snapshot slave nodes first in order to get Fuel admin node snapshoted when there is no other activity on the host. Issue №1 is reproduced.
  - Force closing all SSH sessions on Fuel admin node to reduce the amount of ESTABLISHED connections before taking a snapshot. Issue №1 is reproduced.
  - All the above methods at the same time. No success.

Issue №1 can be not reproduced or reproducing constantly on the same host, when no other activity on the host was performed.

--------------

As the possible solution, we can test qemu v2.3 on Jenkins, which doesn't show issues with snapshoting or suspending on local hosts, and upgrade to qemu2.3 in case if the tests did not detect new problems.

[1] https://bugs.launchpad.net/fuel/+bug/1418204
[2] https://bugs.launchpad.net/fuel/+bug/1450508
[3] https://bugs.launchpad.net/fuel/+bug/1457802
[4] https://bugs.launchpad.net/fuel/+bug/1415079

description: updated
information type: Public → Private
information type: Private → Public
Andrey Nikitin (heos)
Changed in fuel:
status: New → Triaged
Revision history for this message
Alexander Evseev (aevseev) wrote :

BTW 2.4.0 just released...

Revision history for this message
Igor Shishkin (teran) wrote :

QEMU 2.3 is not present in Ubuntu repository.
Reassigning to build team.

Changed in fuel:
assignee: Fuel DevOps (fuel-devops) → Fuel build team (fuel-build)
Revision history for this message
Dmitry Borodaenko (angdraug) wrote :

You may be able to take qemu 1:2.3+dfsg-6a from Debian/sid as is:
https://packages.debian.org/sid/qemu

Revision history for this message
Aleksandra Fedorova (bookwar) wrote :

This issue becomes urgent. We have several failures on CI due to peak CPU usage during vm's bootstrap process.

 This load is not seen on zabbix graph, but can reach about 12 according to atop data.

Changed in fuel:
importance: Medium → High
Revision history for this message
Fuel Devops McRobotson (fuel-devops-robot) wrote : Fix proposed to fuel-infra/jeepyb-config (master)

Fix proposed to branch: master
Change author: Sergey Otpuschennikov <email address hidden>
Review: https://review.fuel-infra.org/11169

Changed in fuel:
status: Triaged → In Progress
Changed in fuel:
assignee: Fuel build team (fuel-build) → Sergey Otpuschennikov (sotpuschennikov)
Revision history for this message
Fuel Devops McRobotson (fuel-devops-robot) wrote : Fix merged to fuel-infra/jeepyb-config (master)

Reviewed: https://review.fuel-infra.org/11169
Submitter: Andrey Nikitin <email address hidden>
Branch: master

Commit: 7878dc95c82bb0086b52a52c635b76a8f33160c7
Author: Sergey Otpuschennikov <email address hidden>
Date: Sat Sep 5 11:12:33 2015

Added new repository

- fuel-infra-packages/libseccomp

Change-Id: Ic3f64fe90fe8ba479456cbda628887b10cb4ac98
Partial-Bug: #1484080

Revision history for this message
Fuel Devops McRobotson (fuel-devops-robot) wrote : Fix proposed to fuel-infra-packages/libseccomp (master)

Fix proposed to branch: master
Change author: Sergey Otpuschennikov <email address hidden>
Review: https://review.fuel-infra.org/11248

Revision history for this message
Sergey Otpuschennikov (sotpuschennikov) wrote : Re: qemu2.0.0 should be upgraded to 2.3 on Jenkins Product CI

New versions of qemu successfully built in temporary repositories
qemu-2.3 http://obs-1.mirantis.com:82/qa-ubuntu-10882/
qemu-2.4 http://obs-1.mirantis.com:82/qa-ubuntu-11241/

Revision history for this message
Fuel Devops McRobotson (fuel-devops-robot) wrote : Fix proposed to fuel-infra/jeepyb-config (master)

Fix proposed to branch: master
Change author: Sergey Otpuschennikov <email address hidden>
Review: https://review.fuel-infra.org/11274

Revision history for this message
Fuel Devops McRobotson (fuel-devops-robot) wrote : Fix merged to fuel-infra/jeepyb-config (master)

Reviewed: https://review.fuel-infra.org/11274
Submitter: Igor Shishkin <email address hidden>
Branch: master

Commit: 9ae0e11a8ad13680270f411f8bb26f2a70aae3fa
Author: Sergey Otpuschennikov <email address hidden>
Date: Tue Sep 8 13:52:47 2015

Added new project

- fuel-infra-packages/libiscsi
- fuel-infra-packages/openbios

We need upgrade version libiscsi, openbios-ppc and openbios-sparc for QEMU 2.4

Change-Id: Ib1d61dd1ba0146c15ea5d47ededaee9aeae71b47
Partial-Bug: #1484080

Revision history for this message
Fuel Devops McRobotson (fuel-devops-robot) wrote : Fix proposed to fuel-infra-packages/openbios (master)

Fix proposed to branch: master
Change author: Sergey Otpuschennikov <email address hidden>
Review: https://review.fuel-infra.org/11303

Revision history for this message
Dennis Dmitriev (ddmitriev) wrote : Re: qemu2.0.0 should be upgraded to 2.3 on Jenkins Product CI

I've tested qemu2.4 on Ubuntu 14.04 LTS, using these repos:

qemu2.4: http://obs-1.mirantis.com:82/qa-ubuntu-11241/ubuntu/
openbios_1.1+svn1334: http://obs-1.mirantis.com:82/qa-ubuntu-11303/ubuntu/
,and the package http://launchpadlibrarian.net/199538382/seabios_1.7.5-1ubuntu1_all.deb

Several system tests were passed on qemu2.4 without any issues.

Revision history for this message
Dennis Dmitriev (ddmitriev) wrote :

Please provide any CI server (or 2-3 servers if it is possible), from old or new CI, where in console output of system tests appeared such message:

"From first time admin isn't reverted: [Errno 113] No route to host"

Could be found, for example, almost in any run of https://product-ci.infra.mirantis.net/view/7.0_swarm/job/7.0.system_test.ubuntu.thread_3/

Revision history for this message
Dennis Dmitriev (ddmitriev) wrote :

Upd. to #13 - please upgrade qemu to 2.4 on the provided server(s) so we could test it during acceptance.

Changed in fuel:
assignee: Sergey Otpuschennikov (sotpuschennikov) → Dennis Dmitriev (ddmitriev)
Revision history for this message
Dennis Dmitriev (ddmitriev) wrote :

Servers for testing are requested in https://bugs.launchpad.net/fuel/+bug/1494278

Revision history for this message
Fuel Devops McRobotson (fuel-devops-robot) wrote : Fix proposed to fuel-infra/jeepyb-config (master)

Fix proposed to branch: master
Change author: Sergey Otpuschennikov <email address hidden>
Review: https://review.fuel-infra.org/11435

Revision history for this message
Fuel Devops McRobotson (fuel-devops-robot) wrote : Fix merged to fuel-infra/jeepyb-config (master)

Reviewed: https://review.fuel-infra.org/11435
Submitter: Igor Shishkin <email address hidden>
Branch: master

Commit: ca3cf047b56377fc5afcc34bcb876349f66b5b12
Author: Sergey Otpuschennikov <email address hidden>
Date: Fri Sep 11 10:25:59 2015

Added new project

-fuel-infra-packages/seabios

We need update version seabios for upgrade qemu to 2.4
Partial-Bug: #1484080

Change-Id: I8558792d34836729016dda59c9e64071237287ae

Revision history for this message
Fuel Devops McRobotson (fuel-devops-robot) wrote : Fix proposed to fuel-infra-packages/seabios (master)

Fix proposed to branch: master
Change author: Sergey Otpuschennikov <email address hidden>
Review: https://review.fuel-infra.org/11518

summary: - qemu2.0.0 should be upgraded to 2.3 on Jenkins Product CI
+ qemu2.0.0 should be upgraded to 2.4 on Jenkins Product CI
Revision history for this message
Dennis Dmitriev (ddmitriev) wrote :

qemu2.4 was testing during two weeks (see https://bugs.launchpad.net/fuel/+bug/1494278).

Any of the issues described in this bug description wasn't appeared.

qemu2.4 was working quite stable for snapshoting/reverting; for running a HA environment for days without unexpected issues; for deploying multiple environments in the same time.

I propose to update qemu to the version 2.4 on all Jenkins slaves that run tests for Fuel 8.0.

Changed in fuel:
assignee: Dennis Dmitriev (ddmitriev) → Fuel DevOps (fuel-devops)
status: In Progress → Confirmed
Revision history for this message
Igor Shishkin (teran) wrote :

Assigning back to Build team because of -2 from CI in qemu review.
So Built team work is not done yet.

Changed in fuel:
assignee: Fuel DevOps (fuel-devops) → Fuel build team (fuel-build)
Changed in fuel:
status: Confirmed → In Progress
assignee: Fuel build team (fuel-build) → Sergey Otpuschennikov (sotpuschennikov)
Revision history for this message
Fuel Devops McRobotson (fuel-devops-robot) wrote : Fix merged to fuel-infra-packages/openbios (master)

Reviewed: https://review.fuel-infra.org/11303
Submitter: Igor Shishkin <email address hidden>
Branch: master

Commit: ee0dfd289aeed397993e2b6f0de1b46104e62825
Author: Sergey Otpuschennikov <email address hidden>
Date: Tue Sep 8 15:20:33 2015

Upgrade packages openbios-ppc and openbios-sparc for qemu

Change-Id: I72897bd531f224e61ff00d958c2b715280cb33a5
Partial-Bug: #1484080

Revision history for this message
Fuel Devops McRobotson (fuel-devops-robot) wrote : Fix merged to fuel-infra-packages/seabios (master)

Reviewed: https://review.fuel-infra.org/11518
Submitter: Igor Shishkin <email address hidden>
Branch: master

Commit: 3a35bac6a7007d0c65cd507cdc6fbe660619b17a
Author: Sergey Otpuschennikov <email address hidden>
Date: Mon Sep 14 07:17:08 2015

Added seabios 1.7.5 for upgrade qemu to 2.4

Change-Id: Icd85cf43dbe81355a78f271ee5959e0f418d40aa
Partial-Bug: #1484080

Revision history for this message
Fuel Devops McRobotson (fuel-devops-robot) wrote : Fix merged to fuel-infra-packages/libseccomp (master)

Reviewed: https://review.fuel-infra.org/11248
Submitter: Igor Shishkin <email address hidden>
Branch: master

Commit: 02bfad12bb3c64cb462c9f7a034f99419213e278
Author: Sergey Otpuschennikov <email address hidden>
Date: Mon Oct 12 16:33:18 2015

Upgrade version to 2.1.1 for qemu 2.4

Change-Id: I29011fd0b7bf1463a40249ce6f4f2462cd19cdb0
Partial-Bug: #1484080

Revision history for this message
Fuel Devops McRobotson (fuel-devops-robot) wrote : Fix merged to fuel-infra-packages/qemu (master)

Reviewed: https://review.fuel-infra.org/11241
Submitter: Igor Shishkin <email address hidden>
Branch: master

Commit: cc8384be6405e5aba69d46dc411e29179ccb9057
Author: Sergey Otpuschennikov <email address hidden>
Date: Mon Oct 12 15:14:30 2015

Update version to 2.4

Change-Id: I74107272e0707369f21bdb4d9968dcf68616013e
Closes-Bug: #1484080

Changed in fuel:
status: In Progress → Fix Committed
Changed in fuel:
assignee: Sergey Otpuschennikov (sotpuschennikov) → Fuel DevOps (fuel-devops)
status: Fix Committed → Confirmed
tags: added: swarm-blocker
Revision history for this message
Yuriy Shamray (ishamrai) wrote :
Dmitry Pyzhov (dpyzhov)
tags: added: devops
Revision history for this message
Andrey Nikitin (heos) wrote :

We've faced the problem is described in the bug: https://bugs.launchpad.net/fuel/+bug/1506582. So, we should be sure that it's not affected our tests to start update the slaves on new version of qemu.

Revision history for this message
Andrey Nikitin (heos) wrote :

That's why current bug is hold on time of investigation.

Dmitry Pyzhov (dpyzhov)
tags: added: area-devops
Revision history for this message
Dmitry Klenov (dklenov) wrote :

Andrey, can you please tell the latest status? This bug is still marked as a swarm blocker.

Revision history for this message
Fuel Devops McRobotson (fuel-devops-robot) wrote : Related fix proposed to fuel-infra/puppet-manifests (master)

Related fix proposed to branch: master
Change author: Igor Shishkin <email address hidden>
Review: https://review.fuel-infra.org/13683

Changed in fuel:
status: Confirmed → In Progress
Revision history for this message
Fuel Devops McRobotson (fuel-devops-robot) wrote : Related fix merged to fuel-infra/puppet-manifests (master)

Reviewed: https://review.fuel-infra.org/13683
Submitter: Igor Shishkin <email address hidden>
Branch: master

Commit: 1e2a8f7b1bc7bb5efd3afa61a350ad2da06db8ab
Author: Igor Shishkin <email address hidden>
Date: Thu Nov 5 15:34:07 2015

Adding stub for qemu=2.4 missed modprobe file

Change-Id: I05d4c3cc79b98c7d72f51cae3dd1d47c5b8acb03
Closes-Bug: #1506582
Closes-Bug: #1513394
Related-Bug: #1484080
Signed-off-by: Igor Shishkin <email address hidden>

Igor Shishkin (teran)
Changed in fuel:
status: In Progress → Triaged
Changed in fuel:
assignee: Fuel DevOps (fuel-devops) → Mateusz Matuszkowiak (mmatuszkowiak)
Changed in fuel:
status: Triaged → In Progress
Revision history for this message
Mateusz Matuszkowiak (mmatuszkowiak) wrote :

Three servers left from these connected to product-ci (3 remains of 35 in total).

srv39-bud.infra.mirantis.net
srv74-bud.infra.mirantis.net
srv75-bud.infra.mirantis.net

^^ these cannot be now upgraded because still jobs are running on them.

Revision history for this message
Mateusz Matuszkowiak (mmatuszkowiak) wrote :

Hi.

All of the remaining servers are done. This means all product-ci slaves are running now qemu2.4 with nested=1.

Regards,
Mateusz

Changed in fuel:
status: In Progress → Fix Committed
status: Fix Committed → Fix Released
Revision history for this message
Fuel Devops McRobotson (fuel-devops-robot) wrote : Change abandoned on fuel-infra-packages/qemu (master)

Change abandoned by Sergey Otpuschennikov <email address hidden> on branch: master
Review: https://review.fuel-infra.org/10882

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Duplicates of this bug

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.