Bug #1741409 “stress smoke test hang with dev test on AWS Xenial...” : Bugs : linux-aws package : Ubuntu

Revision history for this message

Po-Hsu Lin (cypressyew) wrote on 2018-01-05:

#1

Dependencies.txt Edit (2.1 KiB, text/plain; charset="utf-8")
JournalErrors.txt Edit (2.8 KiB, text/plain; charset="utf-8")
ProcCpuinfoMinimal.txt Edit (835 bytes, text/plain; charset="utf-8")

description:

updated

Revision history for this message

Po-Hsu Lin (cypressyew) wrote on 2018-01-05:

#2

Manually tested with older kernel (4.4.0-1043-aws), this issue still can be reproduced.

The node will get rebooted when bumping into this test.

Colin Ian King (colin-king) on 2018-01-11

Changed in linux-aws (Ubuntu):
importance:	Undecided → High
status:	New → In Progress

Revision history for this message

Colin Ian King (colin-king) wrote on 2018-01-11:

#3

This is locking up on opening a specific device. It is not a race condition as I originally suspected, but a lockup on a simple read open of a device on just AWS.

Revision history for this message

Colin Ian King (colin-king) wrote on 2018-01-11:

#4

4.4.0-73 has the same issue, so it's not an aws specific kernel issue per se.

Revision history for this message

Colin Ian King (colin-king) wrote on 2018-01-11:

#5

issue occurs with v4.15-rc7 upstream kernel too

Revision history for this message

Colin Ian King (colin-king) wrote on 2018-01-11:

#6

..and way back to v4.0

Revision history for this message

Colin Ian King (colin-king) wrote on 2018-01-12:

#7

Do you mind re-running the test to see if we get passed this stress test now?

Revision history for this message

Po-Hsu Lin (cypressyew) wrote on 2018-01-12:

#8

Tested with 4.4.0-109 lowlatency kernel, this dev test can pass now.

I will leave this bug open as discussed on the IRC.

Revision history for this message

Colin Ian King (colin-king) wrote on 2018-02-20:

#9

I can reproduce this with 4.16-rc2, I've debugged this down to:

drivers/char/hpet.c, hpet_timer_set_irq():

        if (irq < HPET_MAX_IRQ) {
                spin_lock_irq(&hpet_lock);
                v = readl(&timer->hpet_config);
                v |= irq << Tn_INT_ROUTE_CNF_SHIFT;
                writel(v, &timer->hpet_config);

.. the writel to hpet_config causes the reboot.

How to reproduce this issue:

git clone git://kernel.ubuntu.com/cking/stress-ng
cd stress-ng
git revert 0124b250ec205ea3cd6d9d68fb96c03ac294d12f
make
sudo ./stress-ng --dev 1

.. wait a while and it will eventually get around to the /dev/hpet and opening this causes the hang.

The minimal reproducer is:

#include <sys/types.h>
#include <sys/stat.h>
#include <fcntl.h>
#include <unistd.h>
#include <stdlib.h>

int main(void)
{
int fd;

fd = open("/dev/hpet", O_RDONLY | O_NONBLOCK);
if (fd > 0)
close(fd);

exit(0);
}

run this as root and it will cause the reboot.

Revision history for this message

Colin Ian King (colin-king) wrote on 2018-02-20:

#10

Download full text (31.6 KiB)

demsg of guest:

[ 0.000000] Linux version 4.16.0-rc2+ (cking@gloin) (gcc version 7.3.0 (Ubuntu 7.3.0-3ubuntu1)) #7 SMP Tue Feb 20 14:27:20 UTC 2018
[ 0.000000] Command line: BOOT_IMAGE=/boot/vmlinuz-4.16.0-rc2+ root=UUID=b6adc449-5e3d-4331-ba6b-6e99a75fa48e ro console=tty1 console=ttyS0 nvme.io_timeout=4294967295
[ 0.000000] KERNEL supported cpus:
[ 0.000000] Intel GenuineIntel
[ 0.000000] AMD AuthenticAMD
[ 0.000000] Centaur CentaurHauls
[ 0.000000] x86/fpu: Supporting XSAVE feature 0x001: 'x87 floating point registers'
[ 0.000000] x86/fpu: Supporting XSAVE feature 0x002: 'SSE registers'
[ 0.000000] x86/fpu: Supporting XSAVE feature 0x004: 'AVX registers'
[ 0.000000] x86/fpu: xstate_offset[2]: 576, xstate_sizes[2]: 256
[ 0.000000] x86/fpu: Enabled xstate features 0x7, context size is 832 bytes, using 'standard' format.
[ 0.000000] e820: BIOS-provided physical RAM map:
[ 0.000000] BIOS-e820: [mem 0x0000000000000000-0x000000000009dfff] usable
[ 0.000000] BIOS-e820: [mem 0x000000000009e000-0x000000000009ffff] reserved
[ 0.000000] BIOS-e820: [mem 0x00000000000e0000-0x00000000000fffff] reserved
[ 0.000000] BIOS-e820: [mem 0x0000000000100000-0x000000003fffffff] usable
[ 0.000000] BIOS-e820: [mem 0x00000000fc000000-0x00000000ffffffff] reserved
[ 0.000000] NX (Execute Disable) protection: active
[ 0.000000] random: fast init done
[ 0.000000] SMBIOS 2.7 present.
[ 0.000000] DMI: Xen HVM domU, BIOS 4.2.amazon 08/24/2006
[ 0.000000] Hypervisor detected: Xen HVM
[ 0.000000] Xen version 4.2.
[ 0.000000] Xen Platform PCI: I/O protocol version 1
[ 0.000000] Netfront and the Xen platform PCI driver have been compiled for this kernel: unplug emulated NICs.
[ 0.000000] Blkfront and the Xen platform PCI driver have been compiled for this kernel: unplug emulated disks.
               You might have to change the root device
               from /dev/hd[a-d] to /dev/xvd[a-d]
               in your root= kernel command line option
[ 0.000000] HVMOP_pagetable_dying not supported
[ 0.000000] e820: update [mem 0x00000000-0x00000fff] usable ==> reserved
[ 0.000000] e820: remove [mem 0x000a0000-0x000fffff] usable
[ 0.000000] e820: last_pfn = 0x40000 max_arch_pfn = 0x400000000
[ 0.000000] MTRR default type: write-back
[ 0.000000] MTRR fixed ranges enabled:
[ 0.000000] 00000-9FFFF write-back
[ 0.000000] A0000-BFFFF write-combining
[ 0.000000] C0000-FFFFF write-back
[ 0.000000] MTRR variable ranges enabled:
[ 0.000000] 0 base 0000F0000000 mask 3FFFF8000000 uncachable
[ 0.000000] 1 base 0000F8000000 mask 3FFFFC000000 uncachable
[ 0.000000] 2 disabled
[ 0.000000] 3 disabled
[ 0.000000] 4 disabled
[ 0.000000] 5 disabled
[ 0.000000] 6 disabled
[ 0.000000] 7 disabled
[ 0.000000] x86/PAT: Configuration [0-7]: WB WC UC- UC WB WP UC- WT
[ 0.000000] found SMP MP-table at [mem 0x000fbc20-0x000fbc2f] mapped at [ (ptrval)]
[ 0.000000] Scanning 1 areas for low memory corruption
[ 0.000000] Base memory trampoline at [ (ptrval)] 98000 size 24576
[ 0.000000] BRK [0x0ff42000, 0x0ff42fff] PGTAB...

demsg of guest:

[    0.000000] Linux version 4.16.0-rc2+ (cking@gloin) (gcc version 7.3.0 (Ubuntu 7.3.0-3ubuntu1)) #7 SMP Tue Feb 20 14:27:20 UTC 2018
[    0.000000] Command line: BOOT_IMAGE=/boot/vmlinuz-4.16.0-rc2+ root=UUID=b6adc449-5e3d-4331-ba6b-6e99a75fa48e ro console=tty1 console=ttyS0 nvme.io_timeout=4294967295
[    0.000000] KERNEL supported cpus:
[    0.000000]   Intel GenuineIntel
[    0.000000]   AMD AuthenticAMD
[    0.000000]   Centaur CentaurHauls
[    0.000000] x86/fpu: Supporting XSAVE feature 0x001: 'x87 floating point registers'
[    0.000000] x86/fpu: Supporting XSAVE feature 0x002: 'SSE registers'
[    0.000000] x86/fpu: Supporting XSAVE feature 0x004: 'AVX registers'
[    0.000000] x86/fpu: xstate_offset[2]:  576, xstate_sizes[2]:  256
[    0.000000] x86/fpu: Enabled xstate features 0x7, context size is 832 bytes, using 'standard' format.
[    0.000000] e820: BIOS-provided physical RAM map:
[    0.000000] BIOS-e820: [mem 0x0000000000000000-0x000000000009dfff] usable
[    0.000000] BIOS-e820: [mem 0x000000000009e000-0x000000000009ffff] reserved
[    0.000000] BIOS-e820: [mem 0x00000000000e0000-0x00000000000fffff] reserved
[    0.000000] BIOS-e820: [mem 0x0000000000100000-0x000000003fffffff] usable
[    0.000000] BIOS-e820: [mem 0x00000000fc000000-0x00000000ffffffff] reserved
[    0.000000] NX (Execute Disable) protection: active
[    0.000000] random: fast init done
[    0.000000] SMBIOS 2.7 present.
[    0.000000] DMI: Xen HVM domU, BIOS 4.2.amazon 08/24/2006
[    0.000000] Hypervisor detected: Xen HVM
[    0.000000] Xen version 4.2.
[    0.000000] Xen Platform PCI: I/O protocol version 1
[    0.000000] Netfront and the Xen platform PCI driver have been compiled for this kernel: unplug emulated NICs.
[    0.000000] Blkfront and the Xen platform PCI driver have been compiled for this kernel: unplug emulated disks.
               You might have to change the root device
               from /dev/hd[a-d] to /dev/xvd[a-d]
               in your root= kernel command line option
[    0.000000] HVMOP_pagetable_dying not supported
[    0.000000] e820: update [mem 0x00000000-0x00000fff] usable ==> reserved
[    0.000000] e820: remove [mem 0x000a0000-0x000fffff] usable
[    0.000000] e820: last_pfn = 0x40000 max_arch_pfn = 0x400000000
[    0.000000] MTRR default type: write-back
[    0.000000] MTRR fixed ranges enabled:
[    0.000000]   00000-9FFFF write-back
[    0.000000]   A0000-BFFFF write-combining
[    0.000000]   C0000-FFFFF write-back
[    0.000000] MTRR variable ranges enabled:
[    0.000000]   0 base 0000F0000000 mask 3FFFF8000000 uncachable
[    0.000000]   1 base 0000F8000000 mask 3FFFFC000000 uncachable
[    0.000000]   2 disabled
[    0.000000]   3 disabled
[    0.000000]   4 disabled
[    0.000000]   5 disabled
[    0.000000]   6 disabled
[    0.000000]   7 disabled
[    0.000000] x86/PAT: Configuration [0-7]: WB  WC  UC- UC  WB  WP  UC- WT  
[    0.000000] found SMP MP-table at [mem 0x000fbc20-0x000fbc2f] mapped at [        (ptrval)]
[    0.000000] Scanning 1 areas for low memory corruption
[    0.000000] Base memory trampoline at [        (ptrval)] 98000 size 24576
[    0.000000] BRK [0x0ff42000, 0x0ff42fff] PGTABLE
[    0.000000] BRK [0x0ff43000, 0x0ff43fff] PGTABLE
[    0.000000] BRK [0x0ff44000, 0x0ff44fff] PGTABLE
[    0.000000] BRK [0x0ff45000, 0x0ff45fff] PGTABLE
[    0.000000] RAMDISK: [mem 0x346e2000-0x36368fff]
[    0.000000] ACPI: Early table checksum verification disabled
[    0.000000] ACPI: RSDP 0x00000000000EA020 000024 (v02 Xen   )
[    0.000000] ACPI: XSDT 0x00000000FC00DDC0 000054 (v01 Xen    HVM      00000000 HVML 00000000)
[    0.000000] ACPI: FACP 0x00000000FC00DA80 0000F4 (v04 Xen    HVM      00000000 HVML 00000000)
[    0.000000] ACPI: DSDT 0x00000000FC001CE0 00BD19 (v02 Xen    HVM      00000000 INTL 20090123)
[    0.000000] ACPI: FACS 0x00000000FC001CA0 000040
[    0.000000] ACPI: FACS 0x00000000FC001CA0 000040
[    0.000000] ACPI: APIC 0x00000000FC00DB80 0000D8 (v02 Xen    HVM      00000000 HVML 00000000)
[    0.000000] ACPI: HPET 0x00000000FC00DCD0 000038 (v01 Xen    HVM      00000000 HVML 00000000)
[    0.000000] ACPI: WAET 0x00000000FC00DD10 000028 (v01 Xen    HVM      00000000 HVML 00000000)
[    0.000000] ACPI: SSDT 0x00000000FC00DD40 000031 (v02 Xen    HVM      00000000 INTL 20090123)
[    0.000000] ACPI: SSDT 0x00000000FC00DD80 000031 (v02 Xen    HVM      00000000 INTL 20090123)
[    0.000000] ACPI: Local APIC address 0xfee00000
[    0.000000] No NUMA configuration found
[    0.000000] Faking a node at [mem 0x0000000000000000-0x000000003fffffff]
[    0.000000] NODE_DATA(0) allocated [mem 0x3ffd5000-0x3fffffff]
[    0.000000] tsc: Fast TSC calibration using PIT
[    0.000000] Zone ranges:
[    0.000000]   DMA      [mem 0x0000000000001000-0x0000000000ffffff]
[    0.000000]   DMA32    [mem 0x0000000001000000-0x000000003fffffff]
[    0.000000]   Normal   empty
[    0.000000]   Device   empty
[    0.000000] Movable zone start for each node
[    0.000000] Early memory node ranges
[    0.000000]   node   0: [mem 0x0000000000001000-0x000000000009dfff]
[    0.000000]   node   0: [mem 0x0000000000100000-0x000000003fffffff]
[    0.000000] Initmem setup node 0 [mem 0x0000000000001000-0x000000003fffffff]
[    0.000000] On node 0 totalpages: 262045
[    0.000000]   DMA zone: 64 pages used for memmap
[    0.000000]   DMA zone: 21 pages reserved
[    0.000000]   DMA zone: 3997 pages, LIFO batch:0
[    0.000000]   DMA32 zone: 4032 pages used for memmap
[    0.000000]   DMA32 zone: 258048 pages, LIFO batch:31
[    0.000000] Reserved but unavailable: 99 pages
[    0.000000] ACPI: PM-Timer IO Port: 0xb008
[    0.000000] ACPI: Local APIC address 0xfee00000
[    0.000000] IOAPIC[0]: apic_id 1, version 17, address 0xfec00000, GSI 0-47
[    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
[    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 5 global_irq 5 low level)
[    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 10 global_irq 10 low level)
[    0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 11 global_irq 11 low level)
[    0.000000] ACPI: IRQ0 used by override.
[    0.000000] ACPI: IRQ5 used by override.
[    0.000000] ACPI: IRQ9 used by override.
[    0.000000] ACPI: IRQ10 used by override.
[    0.000000] ACPI: IRQ11 used by override.
[    0.000000] Using ACPI (MADT) for SMP configuration information
[    0.000000] ACPI: HPET id: 0x8086a201 base: 0xfed00000
[    0.000000] smpboot: Allowing 15 CPUs, 14 hotplug CPUs
[    0.000000] PM: Registered nosave memory: [mem 0x00000000-0x00000fff]
[    0.000000] PM: Registered nosave memory: [mem 0x0009e000-0x0009ffff]
[    0.000000] PM: Registered nosave memory: [mem 0x000a0000-0x000dffff]
[    0.000000] PM: Registered nosave memory: [mem 0x000e0000-0x000fffff]
[    0.000000] e820: [mem 0x40000000-0xfbffffff] available for PCI devices
[    0.000000] Booting paravirtualized kernel on Xen HVM
[    0.000000] clocksource: refined-jiffies: mask: 0xffffffff max_cycles: 0xffffffff, max_idle_ns: 7645519600211568 ns
[    0.000000] setup_percpu: NR_CPUS:8192 nr_cpumask_bits:15 nr_cpu_ids:15 nr_node_ids:1
[    0.000000] percpu: Embedded 46 pages/cpu @        (ptrval) s151552 r8192 d28672 u262144
[    0.000000] pcpu-alloc: s151552 r8192 d28672 u262144 alloc=1*2097152
[    0.000000] pcpu-alloc: [0] 00 01 02 03 04 05 06 07 [0] 08 09 10 11 12 13 14 -- 
[    0.000000] xen: PV spinlocks enabled
[    0.000000] PV qspinlock hash table entries: 256 (order: 0, 4096 bytes)
[    0.000000] Built 1 zonelists, mobility grouping on.  Total pages: 257928
[    0.000000] Policy zone: DMA32
[    0.000000] Kernel command line: BOOT_IMAGE=/boot/vmlinuz-4.16.0-rc2+ root=UUID=b6adc449-5e3d-4331-ba6b-6e99a75fa48e ro console=tty1 console=ttyS0 nvme.io_timeout=4294967295
[    0.000000] Calgary: detecting Calgary via BIOS EBDA area
[    0.000000] Calgary: Unable to locate Rio Grande table in EBDA - bailing!
[    0.000000] Memory: 971544K/1048180K available (12300K kernel code, 2480K rwdata, 4208K rodata, 2408K init, 2416K bss, 76636K reserved, 0K cma-reserved)
[    0.000000] SLUB: HWalign=64, Order=0-3, MinObjects=0, CPUs=15, Nodes=1
[    0.000000] Kernel/User page tables isolation: enabled
[    0.000000] ftrace: allocating 39258 entries in 154 pages
[    0.004000] Hierarchical RCU implementation.
[    0.004000] 	RCU restricting CPUs from NR_CPUS=8192 to nr_cpu_ids=15.
[    0.004000] 	Tasks RCU enabled.
[    0.004000] RCU: Adjusting geometry for rcu_fanout_leaf=16, nr_cpu_ids=15
[    0.004000] NR_IRQS: 524544, nr_irqs: 952, preallocated irqs: 16
[    0.004000] xen:events: Using 2-level ABI
[    0.004000] xen:events: Xen HVM callback vector for event delivery is enabled
[    0.004000] Console: colour VGA+ 80x25
[    0.004000] console [tty1] enabled
[    0.004000] Cannot get hvm parameter CONSOLE_EVTCHN (18): -22!
[    0.004000] console [ttyS0] enabled
[    0.004000] ACPI: Core revision 20180105
[    0.004000] ACPI: 3 ACPI AML tables successfully acquired and loaded
[    0.004000] clocksource: hpet: mask: 0xffffffff max_cycles: 0xffffffff, max_idle_ns: 30580167144 ns
[    0.004000] hpet clockevent registered
[    0.004012] APIC: Switch to symmetric I/O mode setup
[    0.007467] x2apic: IRQ remapping doesn't support X2APIC mode
[    0.008003] Switched APIC routing to physical flat.
[    0.012000] ..TIMER: vector=0x30 apic1=0 pin1=2 apic2=0 pin2=0
[    0.040000] tsc: Fast TSC calibration using PIT
[    0.052002] tsc: Detected 2400.118 MHz processor
[    0.054548] tsc: Detected 2400.078 MHz TSC
[    0.054549] clocksource: tsc-early: mask: 0xffffffffffffffff max_cycles: 0x22988164a9a, max_idle_ns: 440795290212 ns
[    0.060005] Calibrating delay loop (skipped), value calculated using timer frequency.. 4800.15 BogoMIPS (lpj=9600312)
[    0.068002] pid_max: default: 32768 minimum: 301
[    0.070697] Security Framework initialized
[    0.072004] Yama: becoming mindful.
[    0.076028] AppArmor: AppArmor initialized
[    0.078913] Dentry cache hash table entries: 131072 (order: 8, 1048576 bytes)
[    0.084089] Inode-cache hash table entries: 65536 (order: 7, 524288 bytes)
[    0.088022] Mount-cache hash table entries: 2048 (order: 2, 16384 bytes)
[    0.096008] Mountpoint-cache hash table entries: 2048 (order: 2, 16384 bytes)
[    0.100259] CPU: Physical Processor ID: 0
[    0.104017] mce: CPU supports 2 MCE banks
[    0.108026] Last level iTLB entries: 4KB 1024, 2MB 1024, 4MB 1024
[    0.112004] Last level dTLB entries: 4KB 1024, 2MB 1024, 4MB 1024, 1GB 4
[    0.120004] Spectre V2 : Mitigation: Full generic retpoline
[    0.133814] clocksource: xen: mask: 0xffffffffffffffff max_cycles: 0x1cd42e4dffb, max_idle_ns: 881590591483 ns
[    0.140015] Xen: using vcpuop timer interface
[    0.140021] installing Xen timer for CPU 0
[    0.144071] smpboot: CPU0: Intel(R) Xeon(R) CPU E5-2676 v3 @ 2.40GHz (family: 0x6, model: 0x3f, stepping: 0x2)
[    0.148044] cpu 0 spinlock event irq 53
[    0.151719] Performance Events: unsupported p6 CPU model 63 no PMU driver, software events only.
[    0.152055] Hierarchical SRCU implementation.
[    0.156497] NMI watchdog: Perf event create on CPU 0 failed with -2
[    0.160006] NMI watchdog: Perf NMI watchdog permanently disabled
[    0.164151] smp: Bringing up secondary CPUs ...
[    0.168006] smp: Brought up 1 node, 1 CPU
[    0.171679] smpboot: Max logical packages: 15
[    0.172007] smpboot: Total of 1 processors activated (4800.15 BogoMIPS)
[    0.176241] devtmpfs: initialized
[    0.179596] x86/mm: Memory block size: 128MB
[    0.180209] evm: security.selinux
[    0.183446] evm: security.SMACK64
[    0.184006] evm: security.SMACK64EXEC
[    0.187508] evm: security.SMACK64TRANSMUTE
[    0.188007] evm: security.SMACK64MMAP
[    0.191780] evm: security.apparmor
[    0.192006] evm: security.ima
[    0.194990] evm: security.capability
[    0.196176] clocksource: jiffies: mask: 0xffffffff max_cycles: 0xffffffff, max_idle_ns: 7645041785100000 ns
[    0.200028] futex hash table entries: 4096 (order: 6, 262144 bytes)
[    0.204139] pinctrl core: initialized pinctrl subsystem
[    0.208133] RTC time: 15:01:12, date: 02/20/18
[    0.212163] NET: Registered protocol family 16
[    0.216114] audit: initializing netlink subsys (disabled)
[    0.220154] audit: type=2000 audit(1519138872.764:1): state=initialized audit_enabled=0 res=1
[    0.224115] cpuidle: using governor ladder
[    0.228005] cpuidle: using governor menu
[    0.231800] ACPI: bus type PCI registered
[    0.232005] acpiphp: ACPI Hot Plug PCI Controller Driver version: 0.5
[    0.236428] PCI: Using configuration type 1 for base access
[    0.241172] HugeTLB registered 2.00 MiB page size, pre-allocated 0 pages
[    0.244270] ACPI: Added _OSI(Module Device)
[    0.248007] ACPI: Added _OSI(Processor Device)
[    0.251102] ACPI: Added _OSI(3.0 _SCP Extensions)
[    0.252007] ACPI: Added _OSI(Processor Aggregator Device)
[    0.256000] xen: --> pirq=16 -> irq=9 (gsi=9)
[    0.260862] ACPI: Interpreter enabled
[    0.263528] ACPI: (supports S0 S3 S4 S5)
[    0.264005] ACPI: Using IOAPIC for interrupt routing
[    0.267285] PCI: Using host bridge windows from ACPI; if necessary, use "pci=nocrs" and report a bug
[    0.268494] ACPI: Enabled 2 GPEs in block 00 to 0F
[    0.328686] ACPI: PCI Root Bridge [PCI0] (domain 0000 [bus 00-ff])
[    0.332013] acpi PNP0A03:00: _OSC: OS supports [ASPM ClockPM Segments MSI]
[    0.336011] acpi PNP0A03:00: _OSC failed (AE_NOT_FOUND); disabling ASPM
[    0.340030] acpi PNP0A03:00: fail to add MMCONFIG information, can't access extended PCI configuration space under this bridge.
[    0.345393] acpiphp: Slot [0] registered
[    0.348932] acpiphp: Slot [3] registered
[    0.352013] acpiphp: Slot [4] registered
[    0.355129] acpiphp: Slot [5] registered
[    0.356307] acpiphp: Slot [6] registered
[    0.359394] acpiphp: Slot [7] registered
[    0.360297] acpiphp: Slot [8] registered
[    0.363258] acpiphp: Slot [9] registered
[    0.364313] acpiphp: Slot [10] registered
[    0.367391] acpiphp: Slot [11] registered
[    0.368328] acpiphp: Slot [12] registered
[    0.371499] acpiphp: Slot [13] registered
[    0.372285] acpiphp: Slot [14] registered
[    0.375654] acpiphp: Slot [15] registered
[    0.376347] acpiphp: Slot [16] registered
[    0.379677] acpiphp: Slot [17] registered
[    0.380307] acpiphp: Slot [18] registered
[    0.383553] acpiphp: Slot [19] registered
[    0.384329] acpiphp: Slot [20] registered
[    0.387460] acpiphp: Slot [21] registered
[    0.388350] acpiphp: Slot [22] registered
[    0.391531] acpiphp: Slot [23] registered
[    0.392293] acpiphp: Slot [24] registered
[    0.395501] acpiphp: Slot [25] registered
[    0.396292] acpiphp: Slot [26] registered
[    0.399855] acpiphp: Slot [27] registered
[    0.400290] acpiphp: Slot [28] registered
[    0.403301] acpiphp: Slot [29] registered
[    0.404310] acpiphp: Slot [30] registered
[    0.407454] acpiphp: Slot [31] registered
[    0.408315] PCI host bridge to bus 0000:00
[    0.411370] pci_bus 0000:00: root bus resource [io  0x0000-0x0cf7 window]
[    0.412007] pci_bus 0000:00: root bus resource [io  0x0d00-0xffff window]
[    0.416005] pci_bus 0000:00: root bus resource [mem 0x000a0000-0x000bffff window]
[    0.420006] pci_bus 0000:00: root bus resource [mem 0xf0000000-0xfbffffff window]
[    0.424007] pci_bus 0000:00: root bus resource [bus 00-ff]
[    0.428095] pci 0000:00:00.0: [8086:1237] type 00 class 0x060000
[    0.430549] pci 0000:00:01.0: [8086:7000] type 00 class 0x060100
[    0.433373] pci 0000:00:01.1: [8086:7010] type 00 class 0x010180
[    0.434717] pci 0000:00:01.1: reg 0x20: [io  0xc100-0xc10f]
[    0.435236] pci 0000:00:01.1: legacy IDE quirk: reg 0x10: [io  0x01f0-0x01f7]
[    0.436008] pci 0000:00:01.1: legacy IDE quirk: reg 0x14: [io  0x03f6]
[    0.440008] pci 0000:00:01.1: legacy IDE quirk: reg 0x18: [io  0x0170-0x0177]
[    0.444004] pci 0000:00:01.1: legacy IDE quirk: reg 0x1c: [io  0x0376]
[    0.448856] pci 0000:00:01.3: [8086:7113] type 00 class 0x068000
[    0.448890] * Found PM-Timer Bug on the chipset. Due to workarounds for a bug,
               * this clock source is slow. Consider trying other clock sources
[    0.453397] pci 0000:00:01.3: quirk: [io  0xb000-0xb03f] claimed by PIIX4 ACPI
[    0.457183] pci 0000:00:02.0: [1013:00b8] type 00 class 0x030000
[    0.457620] pci 0000:00:02.0: reg 0x10: [mem 0xf0000000-0xf1ffffff pref]
[    0.457833] pci 0000:00:02.0: reg 0x14: [mem 0xf3000000-0xf3000fff]
[    0.459895] pci 0000:00:03.0: [5853:0001] type 00 class 0xff8000
[    0.460284] pci 0000:00:03.0: reg 0x10: [io  0xc000-0xc0ff]
[    0.460514] pci 0000:00:03.0: reg 0x14: [mem 0xf2000000-0xf2ffffff pref]
[    0.463692] ACPI: PCI Interrupt Link [LNKA] (IRQs *5 10 11)
[    0.464259] ACPI: PCI Interrupt Link [LNKB] (IRQs 5 *10 11)
[    0.468242] ACPI: PCI Interrupt Link [LNKC] (IRQs 5 10 *11)
[    0.472259] ACPI: PCI Interrupt Link [LNKD] (IRQs *5 10 11)
[    0.496269] xen:balloon: Initialising balloon driver
[    0.504172] pci 0000:00:02.0: vgaarb: setting as boot VGA device
[    0.508000] pci 0000:00:02.0: vgaarb: VGA device added: decodes=io+mem,owns=io+mem,locks=none
[    0.508011] pci 0000:00:02.0: vgaarb: bridge control possible
[    0.511660] vgaarb: loaded
[    0.512235] SCSI subsystem initialized
[    0.515091] libata version 3.00 loaded.
[    0.515110] ACPI: bus type USB registered
[    0.516032] usbcore: registered new interface driver usbfs
[    0.520026] usbcore: registered new interface driver hub
[    0.523824] usbcore: registered new device driver usb
[    0.524049] pps_core: LinuxPPS API ver. 1 registered
[    0.527674] pps_core: Software ver. 5.3.6 - Copyright 2005-2007 Rodolfo Giometti <giometti@linux.it>
[    0.528026] PTP clock support registered
[    0.531365] EDAC MC: Ver: 3.0.0
[    0.532512] PCI: Using ACPI for IRQ routing
[    0.535850] PCI: pci_cache_line_size set to 64 bytes
[    0.536194] e820: reserve RAM buffer [mem 0x0009e000-0x0009ffff]
[    0.536313] NetLabel: Initializing
[    0.539020] NetLabel:  domain hash size = 128
[    0.540004] NetLabel:  protocols = UNLABELED CIPSOv4 CALIPSO
[    0.543820] NetLabel:  unlabeled traffic allowed by default
[    0.544146] HPET: 3 timers in total, 0 timers will be used for per-cpu timer
[    0.548018] hpet0: at MMIO 0xfed00000, IRQs 2, 8, 0
[    0.551377] hpet0: 3 comparators, 64-bit 62.500000 MHz counter
[    0.555077] clocksource: Switched to clocksource xen
[    0.566007] VFS: Disk quotas dquot_6.6.0
[    0.568915] VFS: Dquot-cache hash table entries: 512 (order 0, 4096 bytes)
[    0.573381] AppArmor: AppArmor Filesystem Enabled
[    0.576617] pnp: PnP ACPI init
[    0.579017] system 00:00: [mem 0x00000000-0x0009ffff] could not be reserved
[    0.583386] system 00:00: Plug and Play ACPI device, IDs PNP0c02 (active)
[    0.583491] system 00:01: [io  0x08a0-0x08a3] has been reserved
[    0.587324] system 00:01: [io  0x0cc0-0x0ccf] has been reserved
[    0.591304] system 00:01: [io  0x04d0-0x04d1] has been reserved
[    0.595223] system 00:01: Plug and Play ACPI device, IDs PNP0c02 (active)
[    0.595260] xen: --> pirq=17 -> irq=8 (gsi=8)
[    0.595287] pnp 00:02: Plug and Play ACPI device, IDs PNP0b00 (active)
[    0.595318] xen: --> pirq=18 -> irq=12 (gsi=12)
[    0.595333] pnp 00:03: Plug and Play ACPI device, IDs PNP0f13 (active)
[    0.595358] xen: --> pirq=19 -> irq=1 (gsi=1)
[    0.595369] pnp 00:04: Plug and Play ACPI device, IDs PNP0303 PNP030b (active)
[    0.595391] xen: --> pirq=20 -> irq=6 (gsi=6)
[    0.595393] pnp 00:05: [dma 2]
[    0.595407] pnp 00:05: Plug and Play ACPI device, IDs PNP0700 (active)
[    0.595438] xen: --> pirq=21 -> irq=4 (gsi=4)
[    0.595449] pnp 00:06: Plug and Play ACPI device, IDs PNP0501 (active)
[    0.595503] system 00:07: [io  0x10c0-0x1141] has been reserved
[    0.599263] system 00:07: [io  0xb044-0xb047] has been reserved
[    0.603050] system 00:07: Plug and Play ACPI device, IDs PNP0c02 (active)
[    0.622677] pnp: PnP ACPI: found 8 devices
[    0.631106] clocksource: acpi_pm: mask: 0xffffff max_cycles: 0xffffff, max_idle_ns: 2085701024 ns
[    0.637242] pci_bus 0000:00: resource 4 [io  0x0000-0x0cf7 window]
[    0.637243] pci_bus 0000:00: resource 5 [io  0x0d00-0xffff window]
[    0.637245] pci_bus 0000:00: resource 6 [mem 0x000a0000-0x000bffff window]
[    0.637246] pci_bus 0000:00: resource 7 [mem 0xf0000000-0xfbffffff window]
[    0.637481] NET: Registered protocol family 2
[    0.641037] tcp_listen_portaddr_hash hash table entries: 512 (order: 1, 8192 bytes)
[    0.646319] TCP established hash table entries: 8192 (order: 4, 65536 bytes)
[    0.650915] TCP bind hash table entries: 8192 (order: 5, 131072 bytes)
[    0.655231] TCP: Hash tables configured (established 8192 bind 8192)
[    0.659377] UDP hash table entries: 512 (order: 2, 16384 bytes)
[    0.663472] UDP-Lite hash table entries: 512 (order: 2, 16384 bytes)
[    0.667681] NET: Registered protocol family 1
[    0.670718] pci 0000:00:00.0: Limiting direct PCI/PCI transfers
[    0.674468] pci 0000:00:01.0: PIIX3: Enabling Passive Release
[    0.678226] pci 0000:00:01.0: Activating ISA DMA hang workarounds
[    0.682245] pci 0000:00:02.0: Video device with shadowed ROM at [mem 0x000c0000-0x000dffff]
[    0.687889] PCI: CLS 0 bytes, default 64
[    0.687948] Unpacking initramfs...
[    1.129013] Freeing initrd memory: 29212K
[    1.132773] Scanning for low memory corruption every 60 seconds
[    1.138934] Initialise system trusted keyrings
[    1.143090] Key type blacklist registered
[    1.147084] workingset: timestamp_bits=36 max_order=18 bucket_order=0
[    1.153875] zbud: loaded
[    1.157128] squashfs: version 4.0 (2009/01/31) Phillip Lougher
[    1.170620] fuse init (API version 7.26)
[    1.176595] Key type asymmetric registered
[    1.180701] Asymmetric key parser 'x509' registered
[    1.184837] Block layer SCSI generic (bsg) driver version 0.4 loaded (major 244)
[    1.191108] io scheduler noop registered
[    1.194656] io scheduler deadline registered
[    1.198541] io scheduler cfq registered (default)
[    1.202713] intel_idle: Please enable MWAIT in BIOS SETUP
[    1.202837] input: Power Button as /devices/LNXSYSTM:00/LNXPWRBN:00/input/input0
[    1.209175] ACPI: Power Button [PWRF]
[    1.212597] input: Sleep Button as /devices/LNXSYSTM:00/LNXSLPBN:00/input/input1
[    1.218676] ACPI: Sleep Button [SLPF]
[    1.222491] xen: --> pirq=22 -> irq=28 (gsi=28)
[    1.222600] xen:grant_table: Grant tables using version 1 layout
[    1.227602] Grant table initialized
[    1.230930] Cannot get hvm parameter CONSOLE_EVTCHN (18): -22!
[    1.235617] Serial: 8250/16550 driver, 32 ports, IRQ sharing enabled
[    1.272045] 00:06: ttyS0 at I/O 0x3f8 (irq = 4, base_baud = 115200) is a 16550A
[    1.280210] Linux agpgart interface v0.103
[    1.285677] loop: module loaded
[    1.288740] Invalid max_queues (4), will use default max: 1.
[    1.312812] ata_piix 0000:00:01.1: version 2.13
[    1.314385] scsi host0: ata_piix
[    1.319073] scsi host1: ata_piix
[    1.323019] ata1: PATA max MWDMA2 cmd 0x1f0 ctl 0x3f6 bmdma 0xc100 irq 14
[    1.329961] ata2: PATA max MWDMA2 cmd 0x170 ctl 0x376 bmdma 0xc108 irq 15
[    1.338778] libphy: Fixed MDIO Bus: probed
[    1.343211] tun: Universal TUN/TAP device driver, 1.6
[    1.348399] PPP generic driver version 2.4.2
[    1.354082] xen_netfront: Initialising Xen virtual ethernet driver
[    1.361140] ehci_hcd: USB 2.0 'Enhanced' Host Controller (EHCI) Driver
[    1.367521] ehci-pci: EHCI PCI platform driver
[    1.372703] ehci-platform: EHCI generic platform driver
[    1.378017] ohci_hcd: USB 1.1 'Open' Host Controller (OHCI) Driver
[    1.384093] ohci-pci: OHCI PCI platform driver
[    1.388644] ohci-platform: OHCI generic platform driver
[    1.394046] uhci_hcd: USB Universal Host Controller Interface driver
[    1.400413] i8042: PNP: PS/2 Controller [PNP0303:PS2K,PNP0f13:PS2M] at 0x60,0x64 irq 1,12
[    1.411200] serio: i8042 KBD port at 0x60,0x64 irq 1
[    1.416379] serio: i8042 AUX port at 0x60,0x64 irq 12
[    1.421573] mousedev: PS/2 mouse device common for all mice
[    1.428337] input: AT Translated Set 2 keyboard as /devices/platform/i8042/serio0/input/input2
[    1.436437] rtc_cmos 00:02: rtc core: registered rtc_cmos as rtc0
[    1.442819] rtc_cmos 00:02: alarms up to one day, 114 bytes nvram, hpet irqs
[    1.449797] i2c /dev entries driver
[    1.454000] device-mapper: uevent: version 1.0.3
[    1.459186] device-mapper: ioctl: 4.37.0-ioctl (2017-09-20) initialised: dm-devel@redhat.com
[    1.468335] ledtrig-cpu: registered to indicate activity on CPUs
[    1.474906] NET: Registered protocol family 10
[    1.481247] blkfront: xvda: barrier or flush: disabled; persistent grants: disabled; indirect descriptors: enabled;
[    1.492675] Segment Routing with IPv6
[    1.497921] NET: Registered protocol family 17
[    1.502308] Key type dns_resolver registered
[    1.506869] intel_rdt: Intel RDT L3 allocation detected
[    1.512381] RAS: Correctable Errors collector initialized.
[    1.517495] sched_clock: Marking stable (1517442981, 0)->(7712735054, -6195292073)
[    1.523963]  xvda: xvda1
[    1.527051] registered taskstats version 1
[    1.531128] Loading compiled-in X.509 certificates
[    1.538315] Loaded X.509 cert 'Build time autogenerated kernel key: abff7770be19caaa295930d745e8934b20fa4874'
[    1.547955] zswap: loaded using pool lzo/zbud
[    1.554869] Key type big_key registered
[    1.559005] Key type trusted registered
[    1.564154] Key type encrypted registered
[    1.568299] AppArmor: AppArmor sha1 policy hashing enabled
[    1.573399] ima: No TPM chip found, activating TPM-bypass! (rc=-19)
[    1.579208] evm: HMAC attrs: 0x1
[    1.583113]   Magic number: 6:31:33
[    1.586946] rtc_cmos 00:02: setting system clock to 2018-02-20 15:01:14 UTC (1519138874)
[    1.594417] BIOS EDD facility v0.16 2004-Jun-25, 0 devices found
[    1.600225] EDD information not available.
[    1.607683] Freeing unused kernel memory: 2408K
[    1.612341] Write protecting the kernel read-only data: 20480k
[    1.618243] Freeing unused kernel memory: 2008K
[    1.626516] Freeing unused kernel memory: 1936K
[    1.636430] x86/mm: Checked W+X mappings: passed, no W+X pages found.
[    1.642344] x86/mm: Checking user space page tables
[    1.652947] x86/mm: Checked W+X mappings: passed, no W+X pages found.
[    1.790490] FDC 0 is a S82078B
[    1.830347] cryptd: max_cpu_qlen set to 1000
[    1.872314] AVX2 version of gcm_enc/dec engaged.
[    1.877094] AES CTR mode by8 optimization enabled
[    1.893891] [TTM] Zone  kernel: Available graphics memory: 503554 kiB
[    1.900087] [TTM] Initializing pool allocator
[    1.904648] [TTM] Initializing DMA pool allocator
[    1.908663] [drm] fb mappable at 0xF0000000
[    1.911441] [drm] vram aper at 0xF0000000
[    1.914298] [drm] size 33554432
[    1.916829] [drm] fb depth is 24
[    1.919414] [drm]    pitch is 3072
[    1.922166] fbcon: cirrusdrmfb (fb0) is primary device
[    1.929684] Console: switching to colour frame buffer device 128x48
[    1.939227] cirrus 0000:00:02.0: fb0: cirrusdrmfb frame buffer device
[    1.942663] [drm] Initialized cirrus 1.0.0 20110418 for 0000:00:02.0 on minor 0
[    2.104065] raid6: sse2x1   gen()  9397 MB/s
[    2.172061] raid6: sse2x1   xor()  6685 MB/s
[    2.240062] raid6: sse2x2   gen() 11216 MB/s
[    2.308064] raid6: sse2x2   xor()  7297 MB/s
[    2.376028] raid6: sse2x4   gen() 13267 MB/s
[    2.444013] raid6: sse2x4   xor()  8341 MB/s
[    2.512061] raid6: avx2x1   gen() 18541 MB/s
[    2.580062] raid6: avx2x1   xor() 12585 MB/s
[    2.648064] raid6: avx2x2   gen() 20939 MB/s
[    2.716064] raid6: avx2x2   xor() 13050 MB/s
[    2.784059] raid6: avx2x4   gen() 24310 MB/s
[    2.852059] raid6: avx2x4   xor() 15081 MB/s
[    2.853967] raid6: using algorithm avx2x4 gen() 24310 MB/s
[    2.856348] raid6: .... xor() 15081 MB/s, rmw enabled
[    2.858541] raid6: using avx2x2 recovery algorithm
[    2.862843] tsc: Refined TSC clocksource calibration: 2400.032 MHz
[    2.865487] clocksource: tsc: mask: 0xffffffffffffffff max_cycles: 0x229855db145, max_idle_ns: 440795257019 ns
[    2.872203] xor: automatically using best checksumming function   avx       
[    2.877255] async_tx: api initialized (async)
[    2.937718] Btrfs loaded, crc32c=crc32c-intel
[    2.973662] EXT4-fs (xvda1): INFO: recovery required on readonly filesystem
[    2.978435] EXT4-fs (xvda1): write access will be enabled during recovery
[    3.035381] EXT4-fs (xvda1): recovery complete
[    3.041714] EXT4-fs (xvda1): mounted filesystem with ordered data mode. Opts: (null)
[    3.511494] input: ImExPS/2 Generic Explorer Mouse as /devices/platform/i8042/serio1/input/input4
[    3.837527] systemd[1]: systemd 229 running in system mode. (+PAM +AUDIT +SELINUX +IMA +APPARMOR +SMACK +SYSVINIT +UTMP +LIBCRYPTSETUP +GCRYPT +GNUTLS +ACL +XZ -LZ4 +SECCOMP +BLKID +ELFUTILS +KMOD -IDN)
[    3.848809] systemd[1]: Detected virtualization xen.
[    3.852537] systemd[1]: Detected architecture x86-64.
[    3.861039] systemd[1]: Set hostname to <ip-172-31-46-101>.
[    4.020222] systemd[1]: Listening on Device-mapper event daemon FIFOs.
[    4.031179] systemd[1]: Listening on Journal Audit Socket.
[    4.039398] systemd[1]: Reached target User and Group Name Lookups.
[    4.056012] systemd[1]: Listening on udev Kernel Socket.
[    4.065542] systemd[1]: Listening on Journal Socket.
[    4.073499] systemd[1]: Set up automount Arbitrary Executable File Formats File System Automount Point.
[    4.175630] Loading iSCSI transport class v2.0-870.
[    4.224199] iscsi: registered transport (tcp)
[    4.252932] EXT4-fs (xvda1): re-mounted. Opts: discard
[    4.333135] iscsi: registered transport (iser)
[    4.523400] systemd-journald[406]: Received request to flush runtime journal from PID 1
[    4.975843] audit: type=1400 audit(1519138877.884:2): apparmor="STATUS" operation="profile_load" profile="unconfined" name="lxc-container-default" pid=603 comm="apparmor_parser"
[    4.975846] audit: type=1400 audit(1519138877.884:3): apparmor="STATUS" operation="profile_load" profile="unconfined" name="lxc-container-default-cgns" pid=603 comm="apparmor_parser"
[    4.975848] audit: type=1400 audit(1519138877.884:4): apparmor="STATUS" operation="profile_load" profile="unconfined" name="lxc-container-default-with-mounting" pid=603 comm="apparmor_parser"
[    4.975849] audit: type=1400 audit(1519138877.884:5): apparmor="STATUS" operation="profile_load" profile="unconfined" name="lxc-container-default-with-nesting" pid=603 comm="apparmor_parser"
[    4.988736] audit: type=1400 audit(1519138877.900:6): apparmor="STATUS" operation="profile_load" profile="unconfined" name="/sbin/dhclient" pid=605 comm="apparmor_parser"
[    4.988738] audit: type=1400 audit(1519138877.900:7): apparmor="STATUS" operation="profile_load" profile="unconfined" name="/usr/lib/NetworkManager/nm-dhcp-client.action" pid=605 comm="apparmor_parser"
[    4.988740] audit: type=1400 audit(1519138877.900:8): apparmor="STATUS" operation="profile_load" profile="unconfined" name="/usr/lib/NetworkManager/nm-dhcp-helper" pid=605 comm="apparmor_parser"
[    4.988742] audit: type=1400 audit(1519138877.900:9): apparmor="STATUS" operation="profile_load" profile="unconfined" name="/usr/lib/connman/scripts/dhclient-script" pid=605 comm="apparmor_parser"
[    4.990634] audit: type=1400 audit(1519138877.900:10): apparmor="STATUS" operation="profile_load" profile="unconfined" name="/usr/bin/lxc-start" pid=606 comm="apparmor_parser"
[    5.335995] piix4_smbus 0000:00:01.3: SMBus base address uninitialized - upgrade BIOS or use force_addr=0xaddr
[    5.410390] RAPL PMU: API unit is 2^-32 Joules, 3 fixed counters, 655360 ms ovfl timer
[    5.410391] RAPL PMU: hw unit of domain pp0-core 2^-14 Joules
[    5.410392] RAPL PMU: hw unit of domain package 2^-14 Joules
[    5.410393] RAPL PMU: hw unit of domain dram 2^-16 Joules
[    5.435476] EDAC sbridge: Seeking for: PCI ID 8086:2fa0
[    5.435479] EDAC sbridge:  Ver: 1.1.2 
[    5.452269] intel_rapl: Found RAPL domain package
[    5.452279] intel_rapl: Found RAPL domain dram
[    5.452280] intel_rapl: DRAM domain energy unit 15300pj
[    7.408992] new mount options do not match the existing superblock, will be ignored
[   32.243684] random: crng init done

Revision history for this message

Stefan Bader (smb) wrote on 2018-02-20:

#11

I was able to observe the crash on a Ubuntu Xenial Xen host which produced the following text on the host console:

(XEN) domain_crash called from hpet.c:387
(XEN) Domain 2 (vcpu#1) crashed on cpu#4:
(XEN) ----[ Xen-4.6.5 x86_64 debug=n Not tainted ]----
(XEN) CPU: 4
(XEN) RIP: 0010:[<ffffffff81532dc1>]
(XEN) RFLAGS: 0000000000010002 CONTEXT: hvm guest (d2v1)
(XEN) rax: 0000000000000032 rbx: ffff880034e41a00 rcx: 0000000000000000
(XEN) rdx: 0000000000000001 rsi: 0000000000000032 rdi: ffffffff821fdfb0
(XEN) rbp: ffff8800e90afc10 rsp: ffff8800e90afbd8 r8: 0000000000000003
(XEN) r9: 0000000000000000 r10: 000000000000000a r11: 0000000000000000
(XEN) r12: ffff880107a4a8f8 r13: ffffffff821fdfb0 r14: ffffc90000002140
(XEN) r15: ffffffff81a7c600 cr0: 0000000080050033 cr4: 0000000000360670
(XEN) cr3: 000000003492c000 cr2: 00007fcc1156a030
(XEN) ds: 0000 es: 0000 fs: 0000 gs: 0000 ss: 0018 cs: 0010

Will investigate further (whether this persists in newer xen versions)

Revision history for this message

Stefan Bader (smb) wrote on 2018-02-21:

#12

I was booting the same Xenial based HVM guest on the same host (but this time running Bionic / Xen 4.9). This combination does not crash the domain when opening HPET. Though the check and code that would do it is still there. I also found a bug report against xenserver which I believe is based on the same Xen version as we have in Xenial (4.6.5): https://bugs.xenserver.org/browse/XSO-809?focusedCommentId=16484&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel

This and the code say that the crash is done because HPET is set to use an unsupported interrupt method (edge/level). Since the Linux guest is the same in both cases, and also the test + crash code, either the hypervisor or maybe the seabios seem to use a different default.

Revision history for this message

Stefan Bader (smb) wrote on 2018-02-21:

#13

Darn, ok I take everything back. Somehow the compiled reproducer was mangled in such a way it did maybe no longer do what it was intended to do. Anyhow, with freshly generated reproducers, even Xen 4.9 has the crash. :(

Colin Ian King (colin-king) on 2018-02-24

Changed in linux-aws (Ubuntu):
assignee:	Colin Ian King (colin-king) → Stefan Bader (smb)

Revision history for this message

Stefan Bader (smb) wrote on 2018-02-26:

#14

Right now I do not think there is much choice to fix this (other than not touch /dev/hpet on AWS). The linux kernel deliberately wants to set a level triggered interrupt. The xen hypervisor has no support for that (there might be some addition done but certainly not in any released version of Xen). And as "error handling" forcefully crashes the domain.

Revision history for this message

Sean Feole (sfeole) wrote on 2018-09-12:

#15

Been sorting through many of the ubuntu-kernel-tests bugs.

This is one of the few that actually is being worked.

Stefan, any update on this? Should this be/ Has it been fixed? I can revisit once i finish cleaning up the list

Changed in ubuntu-kernel-tests:
status:	New → In Progress
assignee:	nobody → Sean Feole (sfeole)
importance:	Undecided → Medium

Revision history for this message

Stefan Bader (smb) wrote on 2018-09-12:

#16

It might be fixed if AWS runs a Xen hypervisor which has the following patch included (this is from the development tree of upstream Xen, so will be part of Xen-4.12).

commit be07023be115c94b7fbb51d2ef6f421ddd680de8
Author: Roger Pau Monné <email address hidden>
Date: Tue Jul 24 15:54:18 2018 +0200

x86/vhpet: add support for level triggered interrupts

One can never say for sure what AWS runs, so whether its fixed or not can only be found out by trial and error.

Revision history for this message

Sean Feole (sfeole) wrote on 2018-11-26:

#17

We updated our test instances to run on the latest hardware made available in AWS, I have not seen this reoccur in the xenial testing.

closing bug.

Changed in ubuntu-kernel-tests:
status:	In Progress → Invalid
Changed in linux-aws (Ubuntu):
status:	In Progress → Invalid
Changed in ubuntu-kernel-tests:
assignee:	Sean Feole (sfeole) → nobody
Changed in linux-aws (Ubuntu):
assignee:	Stefan Bader (smb) → nobody

Affects		Status	Importance	Assigned to	Milestone
	ubuntu-kernel-tests	Invalid	Medium	Unassigned
	linux-aws (Ubuntu)	Invalid	High	Unassigned

Ubuntu
linux-aws package

stress smoke test hang with dev test on AWS Xenial kernel

Bug Description

Other bug subscribers

Bug attachments

Remote bug watches

Ubuntulinux-aws package

stress smoke test hang with dev test on AWS Xenial kernel

Bug Description

Other bug subscribers

Bug attachments

Remote bug watches

Ubuntu
linux-aws package