Nova Tempest test test_server_actions_resize_server_revert failed

Bug #1405658 reported by Alexander Gubanov
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Mirantis OpenStack
Fix Released
Medium
Eugeniya Kudryashova
6.0.x
Won't Fix
Medium
Eugeniya Kudryashova
6.1.x
Fix Released
Medium
Eugeniya Kudryashova

Bug Description

Deployment:

ubuntu / net neutron vlan / 3 controller / 2 compute / ceph

[root@nailgun mos-scale]# fuel --f
api: '1.0'
astute_sha: 16b252d93be6aaa73030b8100cf8c5ca6a970a91
auth_required: true
build_id: 2014-12-18_01-32-01
build_number: '56'
feature_groups:
- mirantis
fuellib_sha: 73332192a257ea02c40a39885c502ad1ebdf3eda
fuelmain_sha: 45caacadb878abfbd9d60e134d72229698b469c9
nailgun_sha: 5f91157daa6798ff522ca9f6d34e7e135f150a90
ostf_sha: a9afb68710d809570460c29d6c3293219d3624d4
production: docker
release: '6.0'
release_versions:
  2014.2-6.0:
    VERSION:
      api: '1.0'
      astute_sha: 16b252d93be6aaa73030b8100cf8c5ca6a970a91
      build_id: 2014-12-18_01-32-01
      build_number: '56'
      feature_groups:
      - mirantis
      fuellib_sha: 73332192a257ea02c40a39885c502ad1ebdf3eda
      fuelmain_sha: 45caacadb878abfbd9d60e134d72229698b469c9
      nailgun_sha: 5f91157daa6798ff522ca9f6d34e7e135f150a90
      ostf_sha: a9afb68710d809570460c29d6c3293219d3624d4
      production: docker
      release: '6.0'

Steps to reproduce:

tempest tempest.api.compute.servers.test_server_actions.ServerActionsTestJSON.test_resize_server_revert

Result:

Test fail with the exception

Traceback (most recent call last):
_StringException: pythonlogging:'': {{{
2014-12-25 15:31:55,062 21996 ERROR [tempest.api.compute.base] Server a93e6533-b624-4e79-98e0-40c726f1e02e failed to build and is in ERROR status
Details: {u'message': u'internal error: early end of file from monitor: possible problem:\nqemu-system-x86_64: -drive file=rbd:compute/a93e6533-b624-4e79-98e0-40c726f1e02e_disk:id=compute:key=AQARGJxUYNGzEBAA/jHo/kj9BXU1N3+HF8UQ3w==:auth_supported=cephx\\;none:mon_host=192.168.0.3', u'code': 500, u'created': u'2014-12-25T15:31:52Z'}
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base Traceback (most recent call last):
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base File "/opt/stack/tempest/tempest/api/compute/base.py", line 136, in server_check_teardown
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base 'ACTIVE')
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base File "/opt/stack/tempest/tempest/services/compute/json/servers_client.py", line 183, in wait_for_server_status
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base ready_wait=ready_wait)
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base File "/opt/stack/tempest/tempest/common/waiters.py", line 77, in wait_for_server_status
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base server_id=server_id)
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base BuildErrorException: Server a93e6533-b624-4e79-98e0-40c726f1e02e failed to build and is in ERROR status
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base Details: {u'message': u'internal error: early end of file from monitor: possible problem:\nqemu-system-x86_64: -drive file=rbd:compute/a93e6533-b624-4e79-98e0-40c726f1e02e_disk:id=compute:key=AQARGJxUYNGzEBAA/jHo/kj9BXU1N3+HF8UQ3w==:auth_supported=cephx\\;none:mon_host=192.168.0.3', u'code': 500, u'created': u'2014-12-25T15:31:52Z'}
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base
}}}

Traceback (most recent call last):
  File "/opt/stack/tempest/tempest/api/compute/servers/test_server_actions.py", line 240, in test_resize_server_revert
    self.client.wait_for_server_status(self.server_id, 'ACTIVE')
  File "/opt/stack/tempest/tempest/services/compute/json/servers_client.py", line 183, in wait_for_server_status
    ready_wait=ready_wait)
  File "/opt/stack/tempest/tempest/common/waiters.py", line 77, in wait_for_server_status
    server_id=server_id)
BuildErrorException: Server a93e6533-b624-4e79-98e0-40c726f1e02e failed to build and is in ERROR status
Details: {u'message': u'internal error: early end of file from monitor: possible problem:\nqemu-system-x86_64: -drive file=rbd:compute/a93e6533-b624-4e79-98e0-40c726f1e02e_disk:id=compute:key=AQARGJxUYNGzEBAA/jHo/kj9BXU1N3+HF8UQ3w==:auth_supported=cephx\\;none:mon_host=192.168.0.3', u'code': 500, u'created': u'2014-12-25T15:31:52Z'}
Traceback (most recent call last):
_StringException: Empty attachments:
  stderr
  stdout

pythonlogging:'': {{{
2014-12-25 15:31:55,062 21996 ERROR [tempest.api.compute.base] Server a93e6533-b624-4e79-98e0-40c726f1e02e failed to build and is in ERROR status
Details: {u'message': u'internal error: early end of file from monitor: possible problem:\nqemu-system-x86_64: -drive file=rbd:compute/a93e6533-b624-4e79-98e0-40c726f1e02e_disk:id=compute:key=AQARGJxUYNGzEBAA/jHo/kj9BXU1N3+HF8UQ3w==:auth_supported=cephx\\;none:mon_host=192.168.0.3', u'code': 500, u'created': u'2014-12-25T15:31:52Z'}
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base Traceback (most recent call last):
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base File "/opt/stack/tempest/tempest/api/compute/base.py", line 136, in server_check_teardown
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base 'ACTIVE')
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base File "/opt/stack/tempest/tempest/services/compute/json/servers_client.py", line 183, in wait_for_server_status
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base ready_wait=ready_wait)
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base File "/opt/stack/tempest/tempest/common/waiters.py", line 77, in wait_for_server_status
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base server_id=server_id)
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base BuildErrorException: Server a93e6533-b624-4e79-98e0-40c726f1e02e failed to build and is in ERROR status
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base Details: {u'message': u'internal error: early end of file from monitor: possible problem:\nqemu-system-x86_64: -drive file=rbd:compute/a93e6533-b624-4e79-98e0-40c726f1e02e_disk:id=compute:key=AQARGJxUYNGzEBAA/jHo/kj9BXU1N3+HF8UQ3w==:auth_supported=cephx\\;none:mon_host=192.168.0.3', u'code': 500, u'created': u'2014-12-25T15:31:52Z'}
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base
}}}

traceback-1: {{{
Traceback (most recent call last):
  File "/opt/stack/tempest/tempest/api/compute/servers/test_server_actions.py", line 51, in tearDown
    self.server_check_teardown()
  File "/opt/stack/tempest/tempest/api/compute/base.py", line 140, in server_check_teardown
    cls.servers_client.wait_for_server_termination(cls.server_id)
  File "/opt/stack/tempest/tempest/services/compute/json/servers_client.py", line 196, in wait_for_server_termination
    raise exceptions.BuildErrorException(server_id=server_id)
BuildErrorException: Server a93e6533-b624-4e79-98e0-40c726f1e02e failed to build and is in ERROR status
}}}

Traceback (most recent call last):
  File "/opt/stack/tempest/tempest/api/compute/servers/test_server_actions.py", line 240, in test_resize_server_revert
    self.client.wait_for_server_status(self.server_id, 'ACTIVE')
  File "/opt/stack/tempest/tempest/services/compute/json/servers_client.py", line 183, in wait_for_server_status
    ready_wait=ready_wait)
  File "/opt/stack/tempest/tempest/common/waiters.py", line 77, in wait_for_server_status
    server_id=server_id)
BuildErrorException: Server a93e6533-b624-4e79-98e0-40c726f1e02e failed to build and is in ERROR status
Details: {u'message': u'internal error: early end of file from monitor: possible problem:\nqemu-system-x86_64: -drive file=rbd:compute/a93e6533-b624-4e79-98e0-40c726f1e02e_disk:id=compute:key=AQARGJxUYNGzEBAA/jHo/kj9BXU1N3+HF8UQ3w==:auth_supported=cephx\\;none:mon_host=192.168.0.3', u'code': 500, u'created': u'2014-12-25T15:31:52Z'}

Traceback (most recent call last):
_StringException: Empty attachments:
  stderr
  stdout

pythonlogging:'': {{{
2014-12-25 15:31:55,062 21996 ERROR [tempest.api.compute.base] Server a93e6533-b624-4e79-98e0-40c726f1e02e failed to build and is in ERROR status
Details: {u'message': u'internal error: early end of file from monitor: possible problem:\nqemu-system-x86_64: -drive file=rbd:compute/a93e6533-b624-4e79-98e0-40c726f1e02e_disk:id=compute:key=AQARGJxUYNGzEBAA/jHo/kj9BXU1N3+HF8UQ3w==:auth_supported=cephx\\;none:mon_host=192.168.0.3', u'code': 500, u'created': u'2014-12-25T15:31:52Z'}
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base Traceback (most recent call last):
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base File "/opt/stack/tempest/tempest/api/compute/base.py", line 136, in server_check_teardown
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base 'ACTIVE')
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base File "/opt/stack/tempest/tempest/services/compute/json/servers_client.py", line 183, in wait_for_server_status
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base ready_wait=ready_wait)
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base File "/opt/stack/tempest/tempest/common/waiters.py", line 77, in wait_for_server_status
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base server_id=server_id)
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base BuildErrorException: Server a93e6533-b624-4e79-98e0-40c726f1e02e failed to build and is in ERROR status
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base Details: {u'message': u'internal error: early end of file from monitor: possible problem:\nqemu-system-x86_64: -drive file=rbd:compute/a93e6533-b624-4e79-98e0-40c726f1e02e_disk:id=compute:key=AQARGJxUYNGzEBAA/jHo/kj9BXU1N3+HF8UQ3w==:auth_supported=cephx\\;none:mon_host=192.168.0.3', u'code': 500, u'created': u'2014-12-25T15:31:52Z'}
2014-12-25 15:31:55.062 21996 TRACE tempest.api.compute.base
}}}

traceback-1: {{{
Traceback (most recent call last):
  File "/opt/stack/tempest/tempest/api/compute/servers/test_server_actions.py", line 51, in tearDown
    self.server_check_teardown()
  File "/opt/stack/tempest/tempest/api/compute/base.py", line 140, in server_check_teardown
    cls.servers_client.wait_for_server_termination(cls.server_id)
  File "/opt/stack/tempest/tempest/services/compute/json/servers_client.py", line 196, in wait_for_server_termination
    raise exceptions.BuildErrorException(server_id=server_id)
BuildErrorException: Server a93e6533-b624-4e79-98e0-40c726f1e02e failed to build and is in ERROR status
}}}

Traceback (most recent call last):
  File "/opt/stack/tempest/tempest/api/compute/servers/test_server_actions.py", line 240, in test_resize_server_revert
    self.client.wait_for_server_status(self.server_id, 'ACTIVE')
  File "/opt/stack/tempest/tempest/services/compute/json/servers_client.py", line 183, in wait_for_server_status
    ready_wait=ready_wait)
  File "/opt/stack/tempest/tempest/common/waiters.py", line 77, in wait_for_server_status
    server_id=server_id)
BuildErrorException: Server a93e6533-b624-4e79-98e0-40c726f1e02e failed to build and is in ERROR status
Details: {u'message': u'internal error: early end of file from monitor: possible problem:\nqemu-system-x86_64: -drive file=rbd:compute/a93e6533-b624-4e79-98e0-40c726f1e02e_disk:id=compute:key=AQARGJxUYNGzEBAA/jHo/kj9BXU1N3+HF8UQ3w==:auth_supported=cephx\\;none:mon_host=192.168.0.3', u'code': 500, u'created': u'2014-12-25T15:31:52Z'}

I can provide diagnostic snaphot ( 84 Mb )

Tags: nova tempest
tags: added: tempest
Revision history for this message
Timur Nurlygayanov (tnurlygayanov) wrote :
Changed in mos:
importance: Undecided → Medium
assignee: nobody → MOS Nova (mos-nova)
Revision history for this message
Roman Podoliaka (rpodolyaka) wrote :

The problem is that during the revert of the resize, instance disks are destroyed here - https://github.com/openstack/nova/blob/a7c105aadb97fb411a5e46646a42927df4fbe5a9/nova/compute/manager.py#L3551-L3552

When Ceph is used as a backend for ephemeral disks, the corresponding object will be removed from the pool on destroy() and apparently a new qemu process will fail to start due to a missing disk image.

Revision history for this message
Eugeniya Kudryashova (ekudryashova) wrote :

This bug is already fixed in upstream, fix is merged in kilo: https://review.openstack.org/#/c/139693/

But fix is broken, so we can't merge backport to juno until master would be fixed. Fix for master now is on review here: https://review.openstack.org/#/c/154761/2

So we are blocked to merge backport to juno, and should wait for this stuff to be merged.

Revision history for this message
Yaroslav Lobankov (ylobankov) wrote :
Download full text (22.3 KiB)

VERSION:
  feature_groups:
    - mirantis
  production: "docker"
  release: "6.0.1"
  api: "1.0"
  build_number: "97"
  build_id: "2015-02-18_20-49-44"
  astute_sha: "f7cda2171b0b677dfaeb59693d980a2d3ee4c3e0"
  fuellib_sha: "faf0a06933fb816a8baf55f208144ea286cc5856"
  ostf_sha: "3b57985d4d2155510894a1f6d03b478b201f7780"
  nailgun_sha: "4eda56b68851e5bf26feedee8ecbdf338a45cc72"
  fuelmain_sha: "c799e3a6d88289e58db764a6be7910aab7da3149"

I have NOT reproduced this issue on
ENVIRONMENT:
HA mode, CentOS, Neutron with GRE segmentation, Cinder LVM, 1 controller, 1 (compute + cinder)

I also have NOT reproduced this issue on
ENVIRONMENT:
HA mode, CentOS, Neutron with GRE segmentation, Ceph RBD for images, volumes, Ceph RadosGW for objects, 1 controller, 1 (compute + ceph)

But I have reproduced this issue on
ENVIRONMENT:
HA mode, CentOS, Neutron with GRE segmentation, Ceph RBD for images, volumes, EPHEMERAL VOLUMES, Ceph RadosGW for objects, 1 controller, 1 (compute + ceph)

It looks like the problem is related to using Ceph RBD for ephemeral volumes. In compute.log I have seen the following error:

2015-02-19 17:02:09.573 20535 ERROR nova.virt.libvirt.driver [req-987aa887-9e48-4985-97e3-f9ff851255ea None] Error launching a defined domain with XML: <domain type='kvm'>
  <name>instance-00000003</name>
  <uuid>7ee97c46-69e4-40ca-a41d-876a6faa68bc</uuid>
  <metadata>
    <nova:instance xmlns:nova="http://openstack.org/xmlns/libvirt/nova/1.0">
      <nova:package version="2014.2-fuel6.0.1.mira20"/>
      <nova:name>ServerActionsTestJSON-instance-1146312509</nova:name>
      <nova:creationTime>2015-02-19 17:02:08</nova:creationTime>
      <nova:flavor name="m1.nano">
        <nova:memory>64</nova:memory>
        <nova:disk>0</nova:disk>
        <nova:swap>0</nova:swap>
        <nova:ephemeral>0</nova:ephemeral>
        <nova:vcpus>1</nova:vcpus>
      </nova:flavor>
      <nova:owner>
        <nova:user uuid="8d808e3a0cef4e27afdeb73f37116872">ServerActionsTestJSON-2098800342</nova:user>
        <nova:project uuid="7c79a9d0484744dcbf98b174d91c59b3">ServerActionsTestJSON-1650317639</nova:project>
      </nova:owner>
      <nova:root type="image" uuid="ee54c1d3-5a84-4131-b45f-352c37850a5e"/>
    </nova:instance>
  </metadata>
  <memory unit='KiB'>65536</memory>
  <currentMemory unit='KiB'>65536</currentMemory>
  <vcpu placement='static' cpuset='0-7'>1</vcpu>
    <sysinfo type='smbios'>
      <system>
        <entry name='manufacturer'>Red Hat Inc.</entry>
        <entry name='product'>OpenStack Nova</entry>
        <entry name='version'>2014.2-fuel6.0.1.mira20</entry>
        <entry name='serial'>bd073957-5927-4733-a6c8-9ac8d07c9c04</entry>
        <entry name='uuid'>7ee97c46-69e4-40ca-a41d-876a6faa68bc</entry>
      </system>
    </sysinfo>
  <os>
    <type arch='x86_64' machine='pc-i440fx-2.0'>hvm</type>
    <boot dev='hd'/>
    <smbios mode='sysinfo'/>
  </os>
  <features>
    <acpi/>
    <apic/>
  </features>
  <cpu mode='host-model'>
    <model fallback='allow'/>
    <topology sockets='1' cores='1' threads='1'/>
  </cpu>
  <clock offset='utc'>
    <timer name='pit' tickpolicy='delay'/>
    <timer name='rtc' tickpolicy='catchup'/>
    <timer name='hpet...

Revision history for this message
Yaroslav Lobankov (ylobankov) wrote :
Revision history for this message
Roman Podoliaka (rpodolyaka) wrote :

Not enough priority to be backported to 6.0.1

Revision history for this message
Yaroslav Lobankov (ylobankov) wrote :

Verified on
VERSION:
  feature_groups:
    - mirantis
  production: "docker"
  release: "6.1"
  openstack_version: "2014.2.2-6.1"
  api: "1.0"
  build_number: "338"
  build_id: "2015-04-22_22-54-31"
  nailgun_sha: "4972120d5a5d41efe9be2d10b81329adf3258a78"
  python-fuelclient_sha: "2b311b3b82a1e2df1dc3484a0f37e282273cd988"
  astute_sha: "5be3506e6a4f8b7f0a858db2e162dabcc07347df"
  fuel-library_sha: "9b26cb98e7ceb905fe85b744eee2f808f7e4737b"
  fuel-ostf_sha: "dcda5a869d1780efc265b6f6b7b8c14c8324a68b"
  fuelmain_sha: "2f8be90209e709810674f73abb02bb54a700ce5e"

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.