[linux-azure] panic in ext4_resize_fs() found during storage testing

Bug #1835322 reported by Joseph Salisbury
14
This bug affects 2 people
Affects Status Importance Assigned to Milestone
linux-azure (Ubuntu)
Fix Released
Undecided
Unassigned
Xenial
Fix Released
Undecided
Unassigned
Cosmic
Invalid
Undecided
Unassigned
Disco
Invalid
Undecided
Unassigned

Bug Description

A panic was observed during file system testing. The trace is the following:

[ 8783.243586] kernel BUG at /build/linux-azure-3iFJ9j/linux-azure-4.18.0/fs/ext4/resize.c:266!
[ 8783.252751] invalid opcode: 0000 [#1] SMP PTI
[ 8783.256735] CPU: 7 PID: 39476 Comm: resize2fs Not tainted 4.18.0-1023-azure #24~18.04.1-Ubuntu
[ 8783.256735] Hardware name: Microsoft Corporation Virtual Machine/Virtual Machine, BIOS 090007 06/02/2017
[ 8783.256735] RIP: 0010:ext4_resize_fs+0x73b/0xf10
[ 8783.256735] Code: 50 ff ff ff 41 8b 75 10 4d 8b 65 00 85 f6 0f 94 c0 4d 85 e4 0f 94 c1 09 c8 83 bd 5c ff ff ff 01 7e 48 84 c0 0f 84 43 06 00 00 <0f> 0b 48 c7 c2 68 a7 8d 8f 48 c7 c6 00 fb 88 8f 4c 89 f7 e8 0d f8
[ 8783.256735] RSP: 0018:ffff984e8dce7cb0 EFLAGS: 00010202
[ 8783.256735] RAX: 0000000000205c01 RBX: 000000000000001f RCX: 0000000000000000
[ 8783.256735] RDX: ffff8b1dbe1367d0 RSI: 0000000000000000 RDI: 0000000000000000
[ 8783.256735] RBP: ffff984e8dce7d88 R08: ffff984e8dce7d4c R09: ffff984e8dce7d54
[ 8783.256735] R10: 0000000000000120 R11: 0000000000000001 R12: ffff8b1dbe136800
[ 8783.256735] R13: ffff8b1d74aefe80 R14: ffff8b1dbdeb9000 R15: 0000000000000000
[ 8783.256735] FS: 00007f213fed30c0(0000) GS:ffff8b1ded7c0000(0000) knlGS:0000000000000000
[ 8783.256735] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 8783.256735] CR2: 0000556aa08ae9b8 CR3: 0000001b8e324005 CR4: 00000000003606e0
[ 8783.256735] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 8783.256735] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 8783.256735] Call Trace:
[ 8783.256735] ? security_capable+0x3c/0x60
[ 8783.256735] ext4_ioctl+0xf91/0x14d0
[ 8783.256735] ? audit_filter_rules.constprop.14+0x325/0xf90
[ 8783.256735] ? audit_filter_rules.constprop.14+0x24b/0xf90
[ 8783.256735] do_vfs_ioctl+0xa8/0x630
[ 8783.256735] ksys_ioctl+0x75/0x80
[ 8783.256735] __x64_sys_ioctl+0x1a/0x20
[ 8783.256735] do_syscall_64+0x6a/0x1a0
[ 8783.256735] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[ 8783.256735] RIP: 0033:0x7f213f3825d7
[ 8783.256735] Code: b3 66 90 48 8b 05 b1 48 2d 00 64 c7 00 26 00 00 00 48 c7 c0 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 81 48 2d 00 f7 d8 64 89 01 48
[ 8783.256735] RSP: 002b:00007ffe8effd688 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
[ 8783.256735] RAX: ffffffffffffffda RBX: 0000556aa08aa980 RCX: 00007f213f3825d7
[ 8783.256735] RDX: 00007ffe8effd7d0 RSI: 0000000040086610 RDI: 0000000000000004
[ 8783.256735] RBP: 0000000000000004 R08: 0000000000000000 R09: 0000000000000000
[ 8783.256735] R10: 0000000000000000 R11: 0000000000000246 R12: 0000556aa08ac980
[ 8783.256735] R13: 00007ffe8effd7d0 R14: 0000556aa08a92d0 R15: 0000000000000000

This issue is resolved by the following upstream commit:
f96c3ac8dfc2 ("ext4: fix crash during online resizing")

Commit f96c3ac8dfc2 is in mainline as of v5.1-rc1. This commit was requested in the upstream stable kernels. However, the Ubuntu kernels are EOL upstream. Please include this commit in the 16.04 and 18.04 linux-azure kernels.

Revision history for this message
Launchpad Janitor (janitor) wrote :

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in linux-azure (Ubuntu):
status: New → Confirmed
Marcelo Cerri (mhcerri)
Changed in linux-azure (Ubuntu Xenial):
status: New → Fix Committed
Changed in linux-azure (Ubuntu Cosmic):
status: New → Fix Committed
Revision history for this message
Ubuntu Kernel Bot (ubuntu-kernel-bot) wrote :

This bug is awaiting verification that the kernel in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed-bionic' to 'verification-done-bionic'. If the problem still exists, change the tag 'verification-needed-bionic' to 'verification-failed-bionic'.

If verification is not done by 5 working days from today, this fix will be dropped from the source code, and this bug will be closed.

See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to enable and use -proposed. Thank you!

tags: added: verification-needed-bionic
tags: added: verification-done-bionic
removed: verification-needed-bionic
Revision history for this message
Launchpad Janitor (janitor) wrote :
Download full text (11.4 KiB)

This bug was fixed in the package linux-azure - 4.15.0-1051.56

---------------
linux-azure (4.15.0-1051.56) xenial; urgency=medium

  * linux-azure: 4.15.0-1051.56 -proposed tracker (LP: #1834950)

  * [linux-azure] panic in ext4_resize_fs() found during storage testing
    (LP: #1835322)
    - ext4: fix crash during online resizing

  [ Ubuntu: 4.15.0-55.60 ]

  * linux: 4.15.0-55.60 -proposed tracker (LP: #1834954)
  * Request backport of ceph commits into bionic (LP: #1834235)
    - ceph: use atomic_t for ceph_inode_info::i_shared_gen
    - ceph: define argument structure for handle_cap_grant
    - ceph: flush pending works before shutdown super
    - ceph: send cap releases more aggressively
    - ceph: single workqueue for inode related works
    - ceph: avoid dereferencing invalid pointer during cached readdir
    - ceph: quota: add initial infrastructure to support cephfs quotas
    - ceph: quota: support for ceph.quota.max_files
    - ceph: quota: don't allow cross-quota renames
    - ceph: fix root quota realm check
    - ceph: quota: support for ceph.quota.max_bytes
    - ceph: quota: update MDS when max_bytes is approaching
    - ceph: quota: add counter for snaprealms with quota
    - ceph: avoid iput_final() while holding mutex or in dispatch thread
  * QCA9377 isn't being recognized sometimes (LP: #1757218)
    - SAUCE: USB: Disable USB2 LPM at shutdown
  * hns: fix ICMP6 neighbor solicitation messages discard problem (LP: #1833140)
    - net: hns: fix ICMP6 neighbor solicitation messages discard problem
    - net: hns: fix unsigned comparison to less than zero
  * Fix occasional boot time crash in hns driver (LP: #1833138)
    - net: hns: Fix probabilistic memory overwrite when HNS driver initialized
  * use-after-free in hns_nic_net_xmit_hw (LP: #1833136)
    - net: hns: fix KASAN: use-after-free in hns_nic_net_xmit_hw()
  * hns: attempt to restart autoneg when disabled should report error
    (LP: #1833147)
    - net: hns: Restart autoneg need return failed when autoneg off
  * systemd 237-3ubuntu10.14 ADT test failure on Bionic ppc64el (test-seccomp)
    (LP: #1821625)
    - powerpc: sys_pkey_alloc() and sys_pkey_free() system calls
    - powerpc: sys_pkey_mprotect() system call
  * [UBUNTU] pkey: Indicate old mkvp only if old and curr. mkvp are different
    (LP: #1832625)
    - pkey: Indicate old mkvp only if old and current mkvp are different
  * [UBUNTU] kernel: Fix gcm-aes-s390 wrong scatter-gather list processing
    (LP: #1832623)
    - s390/crypto: fix gcm-aes-s390 selftest failures
  * System crashes on hot adding a core with drmgr command (4.15.0-48-generic)
    (LP: #1833716)
    - powerpc/numa: improve control of topology updates
    - powerpc/numa: document topology_updates_enabled, disable by default
  * Kernel modules generated incorrectly when system is localized to a non-
    English language (LP: #1828084)
    - scripts: override locale from environment when running recordmcount.pl
  * [UBUNTU] kernel: Fix wrong dispatching for control domain CPRBs
    (LP: #1832624)
    - s390/zcrypt: Fix wrong dispatching for control domain CPRBs
  * CVE-2019-11815
    - net: rds: force to destroy connection if t_sock ...

Changed in linux-azure (Ubuntu Xenial):
status: Fix Committed → Fix Released
Revision history for this message
Launchpad Janitor (janitor) wrote :
Download full text (7.5 KiB)

This bug was fixed in the package linux-azure - 4.18.0-1025.27~18.04.1

---------------
linux-azure (4.18.0-1025.27~18.04.1) bionic; urgency=medium

  * linux-azure: 4.18.0-1025.27~18.04.1 -proposed tracker (LP: #1835436)

  [ Ubuntu: 4.18.0-1025.27 ]

  * linux-azure: 4.18.0-1025.27 -proposed tracker (LP: #1835438)
  * [linux-azure] panic in ext4_resize_fs() found during storage testing
    (LP: #1835322)
    - ext4: fix crash during online resizing
  * linux: 4.18.0-26.27 -proposed tracker (LP: #1834904)
  * QCA9377 isn't being recognized sometimes (LP: #1757218)
    - SAUCE: USB: Disable USB2 LPM at shutdown
  * hns: fix ICMP6 neighbor solicitation messages discard problem (LP: #1833140)
    - net: hns: fix ICMP6 neighbor solicitation messages discard problem
    - net: hns: fix unsigned comparison to less than zero
  * Fix occasional boot time crash in hns driver (LP: #1833138)
    - net: hns: Fix probabilistic memory overwrite when HNS driver initialized
  * use-after-free in hns_nic_net_xmit_hw (LP: #1833136)
    - net: hns: fix KASAN: use-after-free in hns_nic_net_xmit_hw()
  * hns: attempt to restart autoneg when disabled should report error
    (LP: #1833147)
    - net: hns: Restart autoneg need return failed when autoneg off
  * [UBUNTU] pkey: Indicate old mkvp only if old and curr. mkvp are different
    (LP: #1832625)
    - pkey: Indicate old mkvp only if old and current mkvp are different
  * [UBUNTU] kernel: Fix gcm-aes-s390 wrong scatter-gather list processing
    (LP: #1832623)
    - s390/crypto: fix gcm-aes-s390 selftest failures
  * rtlwifi: aggressive memory leak (LP: #1831751)
    - rtlwifi: Fix leak of skb when processing C2H_BT_INFO
  * Kernel modules generated incorrectly when system is localized to a non-
    English language (LP: #1828084)
    - scripts: override locale from environment when running recordmcount.pl
  * CVE-2019-11815
    - net: rds: force to destroy connection if t_sock is NULL in
      rds_tcp_kill_sock().
  * Sound device not detected after resume from hibernate (LP: #1826868)
    - drm/i915: Force 2*96 MHz cdclk on glk/cnl when audio power is enabled
    - drm/i915: Save the old CDCLK atomic state
    - drm/i915: Remove redundant store of logical CDCLK state
    - drm/i915: Skip modeset for cdclk changes if possible
  * Handle overflow in proc_get_long of sysctl (LP: #1833935)
    - sysctl: handle overflow in proc_get_long
  * Dell XPS 13 (9370) defaults to s2idle sleep/suspend instead of deep, NVMe
    drains lots of power under s2idle (LP: #1808957)
    - Revert "UBUNTU: SAUCE: pci/nvme: prevent WDC PC SN720 NVMe from entering D3
      and being disabled"
    - Revert "UBUNTU: SAUCE: nvme: add quirk to not call disable function when
      suspending"
    - Revert "UBUNTU: SAUCE: pci: prevent Intel NVMe SSDPEKKF from entering D3"
    - Revert "SAUCE: nvme: add quirk to not call disable function when suspending"
    - Revert "SAUCE: pci: prevent sk hynix nvme from entering D3"
    - PCI: PM: Avoid possible suspend-to-idle issue
    - PCI: PM: Skip devices in D0 for suspend-to-idle
    - nvme-pci: Sync queues on reset
    - nvme: Export get and set features
    - nvme-pci: Use host managed power s...

Read more...

Changed in linux-azure (Ubuntu):
status: Confirmed → Fix Released
Terry Rudd (terrykrudd)
Changed in linux-azure (Ubuntu Cosmic):
status: Fix Committed → Invalid
Marcelo Cerri (mhcerri)
Changed in linux-azure (Ubuntu Disco):
status: New → Incomplete
status: Incomplete → Invalid
Revision history for this message
Marcelo Cerri (mhcerri) wrote :

Disco got the fix via the v5.0.4 upstream stable update (LP: #1821607).

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.