ubuntu_nbd_smoke test failed on f1-micro in Google Cloud with 5.8 / 5.11 (kernel NULL pointer dereference)
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
ubuntu-kernel-tests | Won't Fix | Undecided | Unassigned |
Bug Description
Issue found on 5.8.0-51.57-generic, and only on the f1-micro instance among all of our Google Cloud instances.
Test failed with:
Running '/home/
creating backing nbd image /tmp/nbd_image.img
------
Image path: /tmp/nbd_image.img
Mount point: /mnt/nbd-test-7613
Date: Tue Apr 20 15:43:54 UTC 2021
Host: g-l-generic-
Kernel: 5.8.0-51-generic #57-Ubuntu SMP Wed Apr 14 16:02:45 UTC 2021
Machine: g-l-generic-
CPUs online: 1
CPUs total: 1
Page size: 4096
Pages avail: 1336
Pages total: 146583
Free space:
Filesystem Size Used Avail Use% Mounted on
tmpfs 58M 996K 57M 2% /run
/dev/sda1 9.6G 3.4G 6.2G 35% /
tmpfs 287M 0 287M 0% /dev/shm
tmpfs 5.0M 0 5.0M 0% /run/lock
tmpfs 4.0M 0 4.0M 0% /sys/fs/cgroup
/dev/sda15 105M 7.9M 97M 8% /boot/efi
tmpfs 58M 4.0K 58M 1% /run/user/1007
------
NBD device /dev/nbd0 created
found nbd export
NBD exports found:
test
starting client with NBD device /dev/nbd0
Negotiation: ..size = 128MB
Error: Failed to setup device, check dmesg
nbd-client failed to start
unmounting /mnt/nbd-test-7613
Exiting.
umount: /mnt/nbd-test-7613: no mount point specified.
stopping client
/home/
Found kernel warning, IO error and/or call trace
echo
[ 155.925988] creating backing nbd image /tmp/nbd_image.img
[ 159.733368] NBD device /dev/nbd0 created
[ 161.978549] found nbd export
[ 163.003727] starting client with NBD device /dev/nbd0
[ 163.464838] nbd: nbd0 already in use
[ 163.473599] nbd-client failed to start
[ 163.473665] unmounting /mnt/nbd-test-7613
[ 164.712115] stopping client
[ 164.715177] block nbd0: NBD_DISCONNECT
[ 164.716596] BUG: kernel NULL pointer dereference, address: 0000000000000020
[ 164.723829] #PF: supervisor write access in kernel mode
[ 164.729171] #PF: error_code(0x0002) - not-present page
[ 164.734418] PGD 0 P4D 0
[ 164.737065] Oops: 0002 [#1] SMP PTI
[ 164.740665] CPU: 0 PID: 7782 Comm: nbd-client Not tainted 5.8.0-51-generic #57-Ubuntu
[ 164.748604] Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 01/01/2011
[ 164.957574] RIP: 0010:mutex_
[ 164.961800] Code: c3 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 55 48 89 e5 41 54 49 89 fc e8 9d e4 ff ff 31 c0 65 48 8b 14 25 c0 7b 01 00 <3e> 49 0f b1 14 24 75 06 4c 8b 65 f8 c9 c3 4c 89 e7 e8 ac ff ff ff
[ 164.980863] RSP: 0000:ffffbce700
[ 164.986196] RAX: 0000000000000000 RBX: ffffffffb41fdd00 RCX: 0000000000000000
[ 164.994549] RDX: ffff9f7700f6c680 RSI: ffffffffb39a8103 RDI: 0000000000000020
[ 165.002012] RBP: ffffbce7008779a0 R08: 0000000000000000 R09: ffff9f7794c64000
[ 165.208121] R10: ffffffffb4069a00 R11: 0000000000000005 R12: 0000000000000020
[ 165.215568] R13: ffffbce7008779c0 R14: 0000000000000068 R15: ffffffffc078ae88
[ 165.222840] FS: 00007f02e2ac0f8
[ 165.231062] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 165.237027] CR2: 0000000000000020 CR3: 0000000014c64004 CR4: 00000000003606f0
[ 165.244562] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 165.251829] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 165.457633] Call Trace:
[ 165.460272] flush_workqueue
[ 165.464233] nbd_disconnect_
[ 165.469256] nbd_genl_
[ 165.474180] genl_family_
[ 165.479051] genl_rcv_
[ 165.482740] ? genl_family_
[ 165.487415] netlink_
[ 165.491362] genl_rcv+0x29/0x40
[ 165.494643] netlink_
[ 165.498832] netlink_
[ 165.502868] sock_sendmsg+
[ 165.506483] ____sys_
[ 165.709230] ? sendmsg_
[ 165.713616] ___sys_
[ 165.717391] ? lru_cache_
[ 165.723160] ? do_anonymous_
[ 165.727553] ? handle_
[ 165.731931] ? __handle_
[ 165.736501] __sys_sendmsg+
[ 165.740185] __x64_sys_
[ 165.744240] do_syscall_
[ 165.747957] entry_SYSCALL_
[ 165.753138] RIP: 0033:0x7f02e3012777
[ 165.756839] Code: 0c 00 f7 d8 64 89 02 48 c7 c0 ff ff ff ff eb b7 0f 1f 00 f3 0f 1e fa 64 8b 04 25 18 00 00 00 85 c0 75 10 b8 2e 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 51 c3 48 83 ec 28 89 54 24 1c 48 89 74 24 10
[ 165.974269] RSP: 002b:00007fff23
[ 165.981963] RAX: ffffffffffffffda RBX: 00005571d0cb7300 RCX: 00007f02e3012777
[ 165.989331] RDX: 0000000000000000 RSI: 00007fff23050040 RDI: 0000000000000008
[ 165.996573] RBP: 00005571d0cb7420 R08: 0000000000000014 R09: 00005571d0cb8690
[ 166.003826] R10: 00007f02e30dc210 R11: 0000000000000246 R12: 00005571d0cb7210
[ 166.209742] R13: 00007fff23050040 R14: 0000000000000001 R15: 00007fff230503b0
[ 166.217009] Modules linked in: nbd nls_iso8859_1 dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua intel_rapl_msr intel_rapl_common sb_edac rapl input_leds serio_raw pvpanic efi_pstore mac_hid sch_fq_codel drm virtio_rng ip_tables x_tables autofs4 btrfs blake2b_generic raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel crypto_simd cryptd glue_helper psmouse virtio_net net_failover failover virtio_scsi i2c_piix4
[ 166.465182] CR2: 0000000000000020
[ 166.468656] ---[ end trace 469eaeb4bef09dfe ]---
[ 166.497719] RIP: 0010:mutex_
[ 166.502034] Code: c3 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 55 48 89 e5 41 54 49 89 fc e8 9d e4 ff ff 31 c0 65 48 8b 14 25 c0 7b 01 00 <3e> 49 0f b1 14 24 75 06 4c 8b 65 f8 c9 c3 4c 89 e7 e8 ac ff ff ff
[ 166.720537] RSP: 0000:ffffbce700
[ 166.725902] RAX: 0000000000000000 RBX: ffffffffb41fdd00 RCX: 0000000000000000
[ 166.733155] RDX: ffff9f7700f6c680 RSI: ffffffffb39a8103 RDI: 0000000000000020
[ 166.740531] RBP: ffffbce7008779a0 R08: 0000000000000000 R09: ffff9f7794c64000
[ 166.747889] R10: ffffffffb4069a00 R11: 0000000000000005 R12: 0000000000000020
[ 166.755142] R13: ffffbce7008779c0 R14: 0000000000000068 R15: ffffffffc078ae88
[ 166.961921] FS: 00007f02e2ac0f8
[ 166.970133] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 166.976030] CR2: 0000000000000020 CR3: 0000000014c64004 CR4: 00000000003606f0
[ 166.983291] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 166.990539] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 168.985943] Found kernel warning, IO error and/or call trace
[ 169.001889] echo
killing server
This issue does not occur on this instance with 5.8.0-49. However, klebers found that the instance got only ~512MB of RAM in this cycle, whereas it got ~3G in the last cycle. That may be why the test is failing like this in this cycle.
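As a quick cross-check of the memory figure, the page counts printed in the test header above can be converted to sizes. This is only a minimal sketch: `pages_to_mib` is a hypothetical helper and not part of ubuntu_nbd_smoke, and it assumes "Pages avail" counts free physical pages at the time the test started.

```python
# Values copied from the test header in the log above.
PAGE_SIZE = 4096       # "Page size", in bytes
PAGES_TOTAL = 146583   # "Pages total"
PAGES_AVAIL = 1336     # "Pages avail" (assumed to mean free physical pages)


def pages_to_mib(pages: int, page_size: int = PAGE_SIZE) -> float:
    """Convert a page count to MiB (hypothetical helper for illustration)."""
    return pages * page_size / (1024 * 1024)


total_mib = pages_to_mib(PAGES_TOTAL)
avail_mib = pages_to_mib(PAGES_AVAIL)

# Prints: total: 572.6 MiB, available: 5.2 MiB
print(f"total: {total_mib:.1f} MiB, available: {avail_mib:.1f} MiB")
```

That works out to roughly 573 MiB total with only about 5 MiB available when the test started, which is in line with the ~512MB observation and suggests the instance was already under memory pressure before nbd-client was launched.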