Activity log for bug #1817918

Date Who What changed Old value New value Message
2019-02-27 15:09:04 Guilherme G. Piccoli bug added bug
2019-02-27 15:09:53 Guilherme G. Piccoli nominated for series Ubuntu Xenial
2019-02-27 15:09:53 Guilherme G. Piccoli bug task added linux (Ubuntu Xenial)
2019-02-27 15:09:53 Guilherme G. Piccoli nominated for series Ubuntu Bionic
2019-02-27 15:09:53 Guilherme G. Piccoli bug task added linux (Ubuntu Bionic)
2019-02-27 15:10:00 Guilherme G. Piccoli linux (Ubuntu Bionic): status New Fix Released
2019-02-27 15:10:09 Guilherme G. Piccoli linux (Ubuntu Bionic): importance Undecided Low
2019-02-27 15:10:11 Guilherme G. Piccoli linux (Ubuntu Bionic): assignee Guilherme G. Piccoli (gpiccoli)
2019-02-27 15:10:15 Guilherme G. Piccoli linux (Ubuntu Xenial): status New Confirmed
2019-02-27 15:10:18 Guilherme G. Piccoli linux (Ubuntu Xenial): importance Undecided High
2019-02-27 15:10:20 Guilherme G. Piccoli linux (Ubuntu Xenial): assignee Guilherme G. Piccoli (gpiccoli)
2019-02-27 19:10:32 Guilherme G. Piccoli description There is a report of hard lockup induced by a long delay in lapic expiration timer. We'll provide SRU request here for merging the fixes in 4.4 kernel. [Impact] * There is a long-time report of an issue with the TSC delay present in wait_lapic_expire() - basically the guest could have an expiration timer configured in a way it induces host to wait a long time (with preemption disabled), so there's a potential scenario for host lockups. * The stack trace we have access (from an user report of this issue) is (summarized) below: NMI watchdog: Watchdog detected hard LOCKUP on cpu 16 [...] CPU: 16 PID: 3024910 Comm: CPU 0/KVM Not tainted 4.4.0-139-generic #165-Ubuntu RIP: 0010:[<addr>] [<addr>] delay_tsc+0x20/0x60 [...] __delay+0x15/0x20 wait_lapic_expire+0xc3/0x150 [kvm] vcpu_enter_guest+0x743/0x11d0 [kvm] kvm_arch_vcpu_ioctl_run+0xe6/0x410 [kvm] kvm_vcpu_ioctl+0x33d/0x620 [kvm] do_vfs_ioctl+0x2af/0x4b0 ? __do_page_fault+0x1c1/0x410 ? fire_user_return_notifiers+0x3e/0x50 SyS_ioctl+0x79/0x90 entry_SYSCALL_64_fastpath+0x22/0xc1 This matches the reported problem in the KVM mailing-list: https://marc.info/?l=kvm&m=146374488028339 * A fix was proposed in the above thread, but discarded in favor of the following approach: https://marc.info/?l=kvm&m=146647260109315 The patch was merged in Linus tree, hence we hereby request the SRU: b606f189c7d5 ("KVM: LAPIC: cap __delay at lapic_timer_advance_ns"). There's one additional patch needed, which is just the header adjustment for exporting a necessary function. * The patch is missing only in 4.4 kernel series; Bionic (4.15) and the other newer releases have the patch already. [Test Case] * Unfortunately this is a hard to reproduce issue; we have reports of this lockup from an user, hence the SRU request here. Also, the patch was introduced originally in kernel 4.7, approx. 2.5 years ago. So, we are confident that community is running this code long enough without errors reported. Also, checked in the Linus tree and no fixes for this code were introduced since kernel 4.7. [Regression Potential] * The code modification requested here affects the amount of delay in a specific timer; the patch introduces a maximum time for delay, preventing unbounded delays in host. The regression potential is considered low, and given the nature of the modification, latency issues in guests are likely to be the most problematic regression potential we have.
2019-02-27 22:35:41 Khaled El Mously linux (Ubuntu Xenial): status Confirmed Fix Committed
2019-03-15 20:04:28 Brad Figg tags sts sts verification-needed-xenial
2019-03-18 15:52:10 Guilherme G. Piccoli tags sts verification-needed-xenial sts verification-done-xenial
2019-03-28 07:34:03 Wouter van Bommel bug added subscriber Wouter van Bommel
2019-04-02 10:26:27 Launchpad Janitor linux (Ubuntu Xenial): status Fix Committed Fix Released
2019-04-02 10:26:27 Launchpad Janitor cve linked 2018-9517
2019-04-02 10:26:27 Launchpad Janitor cve linked 2019-3459
2019-04-02 10:26:27 Launchpad Janitor cve linked 2019-3460
2019-04-02 10:26:27 Launchpad Janitor cve linked 2019-6974
2019-04-02 10:26:27 Launchpad Janitor cve linked 2019-7221
2019-04-02 10:26:27 Launchpad Janitor cve linked 2019-7222
2019-04-02 10:26:27 Launchpad Janitor cve linked 2019-9213
2020-05-10 13:16:47 Guilherme G. Piccoli linux (Ubuntu): status Confirmed Fix Released