Bug #1977919 “Docker container creation causes kernel oops on li...” : Bugs : linux-gcp-5.13 package : Ubuntu

Revision history for this message

Steven Davidovitz (steven.davidovitz-ddl) wrote on 2022-06-08:

#1

install-and-run-docker.sh Edit (1.0 KiB, text/x-sh)

Revision history for this message

Launchpad Janitor (janitor) wrote on 2022-06-08:

#2

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in linux-aws (Ubuntu):
status:	New → Confirmed

Revision history for this message

Roger Sikorski (rogersik) wrote on 2022-06-08 (last edit on 2022-06-08):

#3

Can confirm it. A restore from last week 03.06.2022 on one node fixed it.

Another node i reinstalled Ubuntu 20.04 and i had still the same issue. Here i fixed it with a reinstall of Ubuntu 22.04

Revision history for this message

Peter (pagelypete) wrote on 2022-06-08:

#4

Also can confirm - very easy to reproduce.

Revision history for this message

Alex Thomson (lxgaming) wrote on 2022-06-08:

#5

I'm also having this issue but on Oracle Cloud (linux-oracle v5.13.0-1033.39)

Revision history for this message

Johannes Postler (johannespostler) wrote on 2022-06-08:

#6

Google Compute Engine seems to be affected as well for Ubuntu 20.04. Using kernel 5.13.0-1030-gcp #36~20.04.1-Ubuntu

Revision history for this message

Alastair McClelland (alastairmcc) wrote on 2022-06-08:

#7

Also seeing this on AWS Ubuntu 20.04 after an update to linux-image-aws/focal-updates 5.13.0.1028.31~20.04.22

Revision history for this message

Marvin Beckers (embik) wrote on 2022-06-08:

#8

Perhaps this is obvious, but same thing happens when using containerd directly, without docker as intermediate.

Revision history for this message

Nigel (nigel-sim) wrote on 2022-06-08:

#9

Trace Edit (8.2 KiB, text/plain)

I believe I've got the same issue on Azure 5.13.0-1028-azure.

Revision history for this message

Samuel Gregorovič (samgre1881) wrote on 2022-06-08 (last edit on 2022-06-08):

#10

Confirmed on AWS AMI ubuntu/images/hvm-ssd/ubuntu-focal-20.04-amd64-server-20211129. We fixed it by reverting to kernel GNU/Linux 5.13.0-1025-aws x86_64, forcing GRUB to load it instead of a corrupted one.

P.S.: We faced loop rebooting and unkillable docker process. After the kernel downgrade, everything seems ok.

Revision history for this message

Roger Sikorski (rogersik) wrote on 2022-06-08:

#11

I think it has something to do with docker network / volumes. Because with the container watchtower which doen'st use any open network ports or volumes don't make the system crashing.

Revision history for this message

Alastair McClelland (alastairmcc) wrote on 2022-06-08:

#12

`docker run -it ubuntu bash` is enough to cause it to crash.

Revision history for this message

Szymon Lubieniecki (antares81) wrote on 2022-06-08:

#13

Can't even build the image:
kernel:[ 221.374595] Kernel panic - not syncing: Fatal exception in interrupt

Revision history for this message

Kempsu (kneitola) wrote on 2022-06-08 (last edit on 2022-06-08):

#14

Can confirm, one of my AWS EC2 instance running Ubuntu 20.04 is dying during reboot after installing the update. Also running docker on this instance.

Revision history for this message

John Chittum (jchittum) wrote on 2022-06-08:

#15

We are actively working on the issue. This also affects more than the `linux-aws` kernel, as we've been able to reproduce on 5.13 versions of:

linux-oracle
linux-azure
linux-gcp
linux-aws

This appears to be confined to the latest 5.13 kernel update. We will provide more updates shortly on all kernels affected and changes

Revision history for this message

Kevin Keijzer (kkeijzer) wrote on 2022-06-08 (last edit on 2022-06-08):

#16

This broke a lot of our AWS t2 servers running Docker, which I all had to restore by adding the root volume to a different instance and then changing /boot/grub/grub.cfg in order to boot 5.13.0-1025-aws again.

So another "I can confirm this" from me.

Revision history for this message

Launchpad Janitor (janitor) wrote on 2022-06-08:

#17

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in linux-gcp (Ubuntu):
status:	New → Confirmed

Connor Brewster (cbrewster) on 2022-06-08

affects:

linux-meta-gcp-5.13 (Ubuntu) → linux-gcp (Ubuntu)

Revision history for this message

Gerard Kok (g-kok) wrote on 2022-06-08:

#18

This happened to two of our instances in AWS. In the hope that this is helpful to anyone: in an attempt to avoid having to mount the root volumes on another instances, we disabled docker and containerd in the small timeframe between SSH becoming accessible and the kernel panic, by running something like this from a laptop:

while true; do
ssh <instance> "sudo systemctl disable docker.service; sudo systemctl disable containerd.service"
done

This allowed us to revert to the previous kernel without having to mount the root volume on a different instance.

Revision history for this message

James Benkart (benkartjkb) wrote on 2022-06-08:

#19

I have similar lernel panics launching docker-ce instances on the google cloud platform after recent ubuntu update, 20.04 LTS. 22.04 is unaffected.

Revision history for this message

Launchpad Janitor (janitor) wrote on 2022-06-08:

#20

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in linux-gcp (Ubuntu):
status:	New → Confirmed

Revision history for this message

Francis Ginther (fginther) wrote on 2022-06-08 (last edit on 2022-06-08):

#21

Work on this issue continues. We have identified the following impacted kernels and versions:

focal linux-aws-5.13 5.13.0-1028.31~20.04.1
focal linux-azure-5.13 5.13.0-1028.33~20.04.1
focal linux-gcp-5.13 5.13.0-1030.36~20.04.1
focal linux-oracle-5.13 5.13.0-1033.39~20.04.1

Revision history for this message

Tim Gardner (timg-tpi) wrote on 2022-06-08:

#22

Please try this test kernel at https://kernel.ubuntu.com/~rtg/focal-docker-crash-lp1977919/5.13.0-1029.32~lp1977919.1/

wget https://kernel.ubuntu.com/~rtg/focal-docker-crash-lp1977919/5.13.0-1029.32~lp1977919.1/amd64/linux-image-unsigned-5.13.0-1029-aws_5.13.0-1029.32~lp1977919.1_amd64.deb
wget https://kernel.ubuntu.com/~rtg/focal-docker-crash-lp1977919/5.13.0-1029.32~lp1977919.1/amd64/linux-modules-5.13.0-1029-aws_5.13.0-1029.32~lp1977919.1_amd64.deb
sudo dpkg -i *.deb

Revision history for this message

Fabio Augusto Miranda Martins (fabio.martins) wrote on 2022-06-08:

#23

Just tested this 5.13.0-1029.32~lp1977919.1 kernel and confirmed that it fixes the issue (doesn't crash when running the same docker container that would crash in the -1028 kernel)

Revision history for this message

Launchpad Janitor (janitor) wrote on 2022-06-08:

#24

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in linux-aws-5.13 (Ubuntu Focal):
status:	New → Confirmed

Tim Gardner (timg-tpi) on 2022-06-08

affects:

linux-aws (Ubuntu) → linux-aws-5.13 (Ubuntu)

Launchpad Janitor (janitor) on 2022-06-08

Changed in linux-gcp-5.13 (Ubuntu Focal):
status:	New → Confirmed

Tim Gardner (timg-tpi) on 2022-06-08

affects:	linux-gcp (Ubuntu) → linux-gcp-5.13 (Ubuntu)
Changed in linux-azure-5.13 (Ubuntu Focal):
assignee:	nobody → Tim Gardner (timg-tpi)
importance:	Undecided → High
status:	New → In Progress
Changed in linux-aws-5.13 (Ubuntu Focal):
assignee:	nobody → Tim Gardner (timg-tpi)
importance:	Undecided → High
status:	Confirmed → In Progress
Changed in linux-aws-5.13 (Ubuntu):
status:	Confirmed → New

Revision history for this message

Tim Gardner (timg-tpi) wrote on 2022-06-08:

#26

The fix commit is impish/linux 6a6dd081d512c812a937503d5949e4479340accb ("UBUNTU: SAUCE: overlayfs: prevent dereferencing struct file in ovl_vm_prfile_set()")

Revision history for this message

Launchpad Janitor (janitor) wrote on 2022-06-08:

#27

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in linux-aws-5.13 (Ubuntu):
status:	New → Confirmed
Changed in linux-azure-5.13 (Ubuntu):
status:	New → Confirmed
Changed in linux-oracle-5.13 (Ubuntu Focal):
status:	New → Confirmed
Changed in linux-oracle-5.13 (Ubuntu):
status:	New → Confirmed

Revision history for this message

Dave Chiluk (chiluk) wrote on 2022-06-08 (last edit on 2022-06-08):

#31

What are the chances we can remove the the affected kernels from the archives so more people don't get bit by this.

Dave Chiluk (chiluk) on 2022-06-08

tags:

added: indeed

Revision history for this message

Jake Edwards (jake-edwards-fenwick) wrote on 2022-06-09 (last edit on 2022-06-09):

#32

Download full text (9.1 KiB)

I believe I'm getting a similar issue on Azure with a linux & Docker (linux-azure) after updates last night.
Trying to bring up the docker network interface.

Adding stack trace for those looking for Azure-related kernel panic.

[ 37.662249] kernel BUG at include/linux/fs.h:3103!
[ 37.665024] invalid opcode: 0000 [#1] SMP PTI
[ 37.667710] CPU: 1 PID: 3383 Comm: id Not tainted 5.13.0-1028-azure #33~20.04.1-Ubuntu
[ 37.668464] device vethd7a96c6 entered promiscuous mode
[ 37.672439] Hardware name: Microsoft Corporation Virtual Machine/Virtual Machine, BIOS 090008 12/07/2018
[ 37.672441] RIP: 0010:__fput+0x247/0x250
[ 37.672446] Code: 00 48 85 ff 0f 84 8b fe ff ff f6 c7 40 0f 85 82 fe ff ff e8 ab 38 00 00 e9 78 fe ff ff 4c 89 f7 e8 2e 87 02 00 e9 b5 fe ff ff <0f> 0b 0f 1f 80 00 00 00 00 0f 1f 44 00 00 55 48 89 e5 53 31 db 48
[ 37.672448] RSP: 0018:ffffaba0c3cdbde8 EFLAGS: 00010246
[ 37.672450] RAX: 0000000000000000 RBX: 00000000000a801d RCX: ffff99ce838a8000
[ 37.672451] RDX: ffff99ce8acf6b40 RSI: 0000000000000001 RDI: 0000000000000000
[ 37.672452] RBP: ffffaba0c3cdbe10 R08: 00000000000000a9 R09: ffff99ce8cf29d58
[ 37.672453] R10: ffffaba0c3cdbde8 R11: ffff99cea3891b10 R12: ffff99cea3891b00
[ 37.672454] R13: ffff99ce8cf29d58 R14: ffff99ce8acf6b60 R15: ffff99ce8ce95600
[ 37.672455] FS: 0000000000000000(0000) GS:ffff99cff7d00000(0000) knlGS:0000000000000000
[ 37.672456] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 37.672458] CR2: 00005597f9210f2e CR3: 0000000230c10002 CR4: 00000000003706e0
[ 37.672459] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 37.672460] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 37.693338] br: port 11(vethd7a96c6) entered blocking state
[ 37.693823] Call Trace:
[ 37.693825] <TASK>
[ 37.693827] ____fput+0xe/0x10
[ 37.696897] br: port 11(vethd7a96c6) entered forwarding state
[ 37.700943] task_work_run+0x6a/0xa0
[ 37.700947] do_exit+0x371/0xad0
[ 37.700950] do_group_exit+0x43/0xb0
[ 37.700952] __x64_sys_exit_group+0x18/0x20
[ 37.700954] do_syscall_64+0x61/0xb0
[ 37.760928] ? irqentry_exit+0x19/0x30
[ 37.763226] ? exc_page_fault+0x83/0x160
[ 37.765461] ? asm_exc_page_fault+0x8/0x30
[ 37.767666] entry_SYSCALL_64_after_hwframe+0x44/0xae
[ 37.770647] RIP: 0033:0x7f4bf2ee3f0b
[ 37.772594] Code: Unable to access opcode bytes at RIP 0x7f4bf2ee3ee1.
[ 37.776165] RSP: 002b:00007ffc10383c68 EFLAGS: 00000246 ORIG_RAX: 00000000000000e7
[ 37.780576] RAX: ffffffffffffffda RBX: 0000000000000004 RCX: 00007f4bf2ee3f0b
[ 37.784941] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[ 37.788989] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
[ 37.792803] R10: 0000000000000001 R11: 0000000000000246 R12: 0000000000000000
[ 37.796666] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[ 37.800943] </TASK>
[ 37.802280] Modules linked in: veth xt_nat xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo ip6table_nat ip6table_filter ip6_tables xt_addrtype iptable_filter iptable_nat nf_nat br_netfilter bridge stp llc aufs overlay nls_iso8859_1 dm...

I believe I'm getting a similar issue on Azure with a linux & Docker (linux-azure) after updates last night.
Trying to bring up the docker network interface.

Adding stack trace for those looking for Azure-related kernel panic.

[   37.662249] kernel BUG at include/linux/fs.h:3103!
[   37.665024] invalid opcode: 0000 [#1] SMP PTI
[   37.667710] CPU: 1 PID: 3383 Comm: id Not tainted 5.13.0-1028-azure #33~20.04.1-Ubuntu
[   37.668464] device vethd7a96c6 entered promiscuous mode
[   37.672439] Hardware name: Microsoft Corporation Virtual Machine/Virtual Machine, BIOS 090008  12/07/2018
[   37.672441] RIP: 0010:__fput+0x247/0x250
[   37.672446] Code: 00 48 85 ff 0f 84 8b fe ff ff f6 c7 40 0f 85 82 fe ff ff e8 ab 38 00 00 e9 78 fe ff ff 4c 89 f7 e8 2e 87 02 00 e9 b5 fe ff ff <0f> 0b 0f 1f 80 00 00 00 00 0f 1f 44 00 00 55 48 89 e5 53 31 db 48
[   37.672448] RSP: 0018:ffffaba0c3cdbde8 EFLAGS: 00010246
[   37.672450] RAX: 0000000000000000 RBX: 00000000000a801d RCX: ffff99ce838a8000
[   37.672451] RDX: ffff99ce8acf6b40 RSI: 0000000000000001 RDI: 0000000000000000
[   37.672452] RBP: ffffaba0c3cdbe10 R08: 00000000000000a9 R09: ffff99ce8cf29d58
[   37.672453] R10: ffffaba0c3cdbde8 R11: ffff99cea3891b10 R12: ffff99cea3891b00
[   37.672454] R13: ffff99ce8cf29d58 R14: ffff99ce8acf6b60 R15: ffff99ce8ce95600
[   37.672455] FS:  0000000000000000(0000) GS:ffff99cff7d00000(0000) knlGS:0000000000000000
[   37.672456] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   37.672458] CR2: 00005597f9210f2e CR3: 0000000230c10002 CR4: 00000000003706e0
[   37.672459] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[   37.672460] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[   37.693338] br: port 11(vethd7a96c6) entered blocking state
[   37.693823] Call Trace:
[   37.693825]  <TASK>
[   37.693827]  ____fput+0xe/0x10
[   37.696897] br: port 11(vethd7a96c6) entered forwarding state
[   37.700943]  task_work_run+0x6a/0xa0
[   37.700947]  do_exit+0x371/0xad0
[   37.700950]  do_group_exit+0x43/0xb0
[   37.700952]  __x64_sys_exit_group+0x18/0x20
[   37.700954]  do_syscall_64+0x61/0xb0
[   37.760928]  ? irqentry_exit+0x19/0x30
[   37.763226]  ? exc_page_fault+0x83/0x160
[   37.765461]  ? asm_exc_page_fault+0x8/0x30
[   37.767666]  entry_SYSCALL_64_after_hwframe+0x44/0xae
[   37.770647] RIP: 0033:0x7f4bf2ee3f0b
[   37.772594] Code: Unable to access opcode bytes at RIP 0x7f4bf2ee3ee1.
[   37.776165] RSP: 002b:00007ffc10383c68 EFLAGS: 00000246 ORIG_RAX: 00000000000000e7
[   37.780576] RAX: ffffffffffffffda RBX: 0000000000000004 RCX: 00007f4bf2ee3f0b
[   37.784941] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[   37.788989] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
[   37.792803] R10: 0000000000000001 R11: 0000000000000246 R12: 0000000000000000
[   37.796666] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[   37.800943]  </TASK>
[   37.802280] Modules linked in: veth xt_nat xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo ip6table_nat ip6table_filter ip6_tables xt_addrtype iptable_filter iptable_nat nf_nat br_netfilter bridge stp llc aufs overlay nls_iso8859_1 dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua xt_conntrack nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 xt_owner iptable_security xt_tcpudp bpfilter hv_balloon serio_raw joydev sch_fq_codel ipmi_devintf ipmi_msghandler drm msr i2c_core ip_tables x_tables autofs4 btrfs blake2b_generic zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear crct10dif_pclmul crc32_pclmul ghash_clmulni_intel hid_generic aesni_intel crypto_simd hid_hyperv cryptd pata_acpi hid hyperv_keyboard hyperv_fb hv_utils hv_netvsc
[   37.846598] ---[ end trace 82874445d6a62ea4 ]---
[   37.849869] RIP: 0010:__fput+0x247/0x250
[   37.852578] Code: 00 48 85 ff 0f 84 8b fe ff ff f6 c7 40 0f 85 82 fe ff ff e8 ab 38 00 00 e9 78 fe ff ff 4c 89 f7 e8 2e 87 02 00 e9 b5 fe ff ff <0f> 0b 0f 1f 80 00 00 00 00 0f 1f 44 00 00 55 48 89 e5 53 31 db 48
[   37.864102] RSP: 0018:ffffaba0c3cdbde8 EFLAGS: 00010246
[   37.867960] RAX: 0000000000000000 RBX: 00000000000a801d RCX: ffff99ce838a8000
[   37.872447] RDX: ffff99ce8acf6b40 RSI: 0000000000000001 RDI: 0000000000000000
[   37.876912] RBP: ffffaba0c3cdbe10 R08: 00000000000000a9 R09: ffff99ce8cf29d58
[   37.881640] R10: ffffaba0c3cdbde8 R11: ffff99cea3891b10 R12: ffff99cea3891b00
[   37.886009] R13: ffff99ce8cf29d58 R14: ffff99ce8acf6b60 R15: ffff99ce8ce95600
[   37.890358] FS:  0000000000000000(0000) GS:ffff99cff7d00000(0000) knlGS:0000000000000000
[   37.895255] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   37.898867] CR2: 00005597f9210f2e CR3: 0000000230c10002 CR4: 00000000003706e0
[   37.903749] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[   37.908554] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[   37.913115] Fixing recursive fault but reboot is needed!
[   37.921422] ------------[ cut here ]------------
[   37.924679] kernel BUG at include/linux/fs.h:3103!
[   37.927869] invalid opcode: 0000 [#2] SMP PTI
[   37.930808] CPU: 1 PID: 18 Comm: ksoftirqd/1 Tainted: G      D           5.13.0-1028-azure #33~20.04.1-Ubuntu
[   37.936616] Hardware name: Microsoft Corporation Virtual Machine/Virtual Machine, BIOS 090008  12/07/2018
[   37.942683] RIP: 0010:__fput+0x247/0x250
[   37.945531] Code: 00 48 85 ff 0f 84 8b fe ff ff f6 c7 40 0f 85 82 fe ff ff e8 ab 38 00 00 e9 78 fe ff ff 4c 89 f7 e8 2e 87 02 00 e9 b5 fe ff ff <0f> 0b 0f 1f 80 00 00 00 00 0f 1f 44 00 00 55 48 89 e5 53 31 db 48
[   37.956709] RSP: 0018:ffffaba0c00abd58 EFLAGS: 00010246
[   37.960241] RAX: 0000000000000000 RBX: 00000000000a801d RCX: ffff99ce838a8000
[   37.964698] RDX: ffff99ce8acf6b40 RSI: 0000000000000001 RDI: 0000000000000000
[   37.969146] RBP: ffffaba0c00abd80 R08: 0000000000000010 R09: ffff99ce8cf29d58
[   37.973767] R10: ffffaba0c00abd58 R11: ffff99cea3891b10 R12: ffff99cea3891b00
[   37.978346] R13: ffff99ce8cf29d58 R14: ffff99ce8acf6b60 R15: ffff99ce8ce95600
[   37.983277] FS:  0000000000000000(0000) GS:ffff99cff7d00000(0000) knlGS:0000000000000000
[   37.988409] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   37.992409] CR2: 00007f3ef682c328 CR3: 000000014ad56002 CR4: 00000000003706e0
[   37.997201] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[   38.001836] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[   38.006440] Call Trace:
[   38.008503]  <TASK>
[   38.010375]  ____fput+0xe/0x10
[   38.012702]  rcu_do_batch+0x177/0x4c0
[   38.015387]  rcu_core+0x19c/0x310
[   38.017913]  rcu_core_si+0xe/0x10
[   38.020373]  __do_softirq+0xc9/0x276
[   38.022936]  run_ksoftirqd+0x1e/0x30
[   38.025516]  smpboot_thread_fn+0xd0/0x170
[   38.028414]  ? sort_range+0x30/0x30
[   38.031303]  kthread+0x12b/0x150
[   38.033760]  ? set_kthread_struct+0x40/0x40
[   38.036792]  ret_from_fork+0x22/0x30
[   38.039242]  </TASK>
[   38.041189] Modules linked in: veth xt_nat xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo ip6table_nat ip6table_filter ip6_tables xt_addrtype iptable_filter iptable_nat nf_nat br_netfilter bridge stp llc aufs overlay nls_iso8859_1 dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua xt_conntrack nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 xt_owner iptable_security xt_tcpudp bpfilter hv_balloon serio_raw joydev sch_fq_codel ipmi_devintf ipmi_msghandler drm msr i2c_core ip_tables x_tables autofs4 btrfs blake2b_generic zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear crct10dif_pclmul crc32_pclmul ghash_clmulni_intel hid_generic aesni_intel crypto_simd hid_hyperv cryptd pata_acpi hid hyperv_keyboard hyperv_fb hv_utils hv_netvsc
[   38.085672] ---[ end trace 82874445d6a62ea5 ]---
[   38.089520] RIP: 0010:__fput+0x247/0x250
[   38.092642] Code: 00 48 85 ff 0f 84 8b fe ff ff f6 c7 40 0f 85 82 fe ff ff e8 ab 38 00 00 e9 78 fe ff ff 4c 89 f7 e8 2e 87 02 00 e9 b5 fe ff ff <0f> 0b 0f 1f 80 00 00 00 00 0f 1f 44 00 00 55 48 89 e5 53 31 db 48
[   38.104446] RSP: 0018:ffffaba0c3cdbde8 EFLAGS: 00010246
[   38.107989] RAX: 0000000000000000 RBX: 00000000000a801d RCX: ffff99ce838a8000
[   38.112672] RDX: ffff99ce8acf6b40 RSI: 0000000000000001 RDI: 0000000000000000
[   38.117308] RBP: ffffaba0c3cdbe10 R08: 00000000000000a9 R09: ffff99ce8cf29d58
[   38.122530] R10: ffffaba0c3cdbde8 R11: ffff99cea3891b10 R12: ffff99cea3891b00
[   38.127267] R13: ffff99ce8cf29d58 R14: ffff99ce8acf6b60 R15: ffff99ce8ce95600
[   38.132096] FS:  0000000000000000(0000) GS:ffff99cff7d00000(0000) knlGS:0000000000000000
[   38.137175] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   38.140979] CR2: 00007f3ef682c328 CR3: 000000014ad56002 CR4: 00000000003706e0
[   38.145906] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[   38.150919] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[   38.155713] Kernel panic - not syncing: Fatal exception in interrupt
[   38.181193] Kernel Offset: 0x2be00000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
[   38.188597] ---[ end Kernel panic - not syncing: Fatal exception in interrupt ]---

Revision history for this message

Rob (robd003) wrote on 2022-06-09:

#33

Download full text (13.9 KiB)

Also seeing this on AWS with t4g instances. Kernel panic:

[ 12.489272] kernel BUG at include/linux/fs.h:3104!
[ 12.490111] Internal error: Oops - BUG: 0 [#1] SMP
[ 12.490923] Modules linked in: veth xt_nat xt_tcpudp xt_conntrack xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype iptable_filter iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c bpfilter br_netfilter bridge stp llc aufs overlay nls_iso8859_1 dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua aes_ce_blk crypto_simd cryptd aes_ce_cipher crct10dif_ce ghash_ce sha2_ce sha256_arm64 sha1_ce efi_pstore ena sch_fq_codel ipmi_devintf ipmi_msghandler drm ip_tables x_tables autofs4
[ 12.498762] CPU: 0 PID: 1349 Comm: id Not tainted 5.13.0-1028-aws #31~20.04.1-Ubuntu
[ 12.500092] Hardware name: Amazon EC2 t4g.micro/, BIOS 1.0 11/1/2018
[ 12.501189] pstate: 60400005 (nZCv daif +PAN -UAO -TCO BTYPE=--)
[ 12.502226] pc : __fput+0x240/0x248
[ 12.502844] lr : __fput+0xb0/0x248
[ 12.503451] sp : ffff80000a11baf0
[ 12.504039] x29: ffff80000a11baf0 x28: ffff000003de6c80 x27: 0000000000000000
[ 12.505287] x26: 0000000000000000 x25: 0000000000000000 x24: 0000000000000000
[ 12.506513] x23: ffff00000447ef00 x22: ffff00000ae0ca20 x21: 00000000000a801d
[ 12.507746] x20: ffff000001ce6020 x19: ffff000002df7500 x18: 0000000000000000
[ 12.508972] x17: 0000000000000000 x16: ffffcab3b1ed6968 x15: 0000000000000000
[ 12.510189] x14: 0000000000000000 x13: 0000000000000000 x12: 0000000000000000
[ 12.511411] x11: 0000000000000000 x10: 0000000000000001 x9 : ffffcab3b1ed6298
[ 12.512650] x8 : ffff00003e40b0c0 x7 : 0000000000000808 x6 : 0000000300000000
[ 12.513873] x5 : ffff000001ce6020 x4 : 0000000000008000 x3 : ffff000000f98800
[ 12.515101] x2 : ffffffffffffffff x1 : 0000000000000000 x0 : 0000000000000000
[ 12.516333] Call trace:
[ 12.516771] __fput+0x240/0x248
[ 12.517327] ____fput+0x18/0x28
[ 12.517886] task_work_run+0xc8/0x140
[ 12.518541] do_exit+0x20c/0x8e0
[ 12.519118] do_group_exit+0x4c/0xb0
[ 12.519753] __wake_up_parent+0x0/0x38
[ 12.520437] invoke_syscall+0x74/0xf0
[ 12.521085] el0_svc_common.constprop.0+0x184/0x1a8
[ 12.521939] do_el0_svc+0x2c/0x90
[ 12.522608] el0_svc+0x24/0x38
[ 12.523155] el0_sync_handler+0xb0/0xb8
[ 12.523834] el0_sync+0x19c/0x1c0
[ 12.524426] Code: 91059283 52800020 1400016f 17ffffa0 (d4210000)
[ 12.525503] ---[ end trace 8bd8624b9b8b9618 ]---
[ 12.531116] Fixing recursive fault but reboot is needed!
[ 12.537950] ------------[ cut here ]------------
[ 12.538742] WARNING: CPU: 0 PID: 0 at kernel/rcu/tree.c:638 rcu_eqs_enter.isra.0+0x68/0x70
[ 12.540129] Modules linked in: veth xt_nat xt_tcpudp xt_conntrack xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype iptable_filter iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c bpfilter br_netfilter bridge stp llc aufs overlay nls_iso8859_1 dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua aes_ce_blk crypto_simd cryptd aes_ce_cipher crct10dif_ce ghash_ce sha2_ce sha256_arm64 sha1_ce efi_pstore ena sch_fq_codel ipmi_devintf ipmi_msghandler drm ip_tables x_tabl...

Also seeing this on AWS with t4g instances. Kernel panic:

[   12.489272] kernel BUG at include/linux/fs.h:3104!
[   12.490111] Internal error: Oops - BUG: 0 [#1] SMP
[   12.490923] Modules linked in: veth xt_nat xt_tcpudp xt_conntrack xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype iptable_filter iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c bpfilter br_netfilter bridge stp llc aufs overlay nls_iso8859_1 dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua aes_ce_blk crypto_simd cryptd aes_ce_cipher crct10dif_ce ghash_ce sha2_ce sha256_arm64 sha1_ce efi_pstore ena sch_fq_codel ipmi_devintf ipmi_msghandler drm ip_tables x_tables autofs4
[   12.498762] CPU: 0 PID: 1349 Comm: id Not tainted 5.13.0-1028-aws #31~20.04.1-Ubuntu
[   12.500092] Hardware name: Amazon EC2 t4g.micro/, BIOS 1.0 11/1/2018
[   12.501189] pstate: 60400005 (nZCv daif +PAN -UAO -TCO BTYPE=--)
[   12.502226] pc : __fput+0x240/0x248
[   12.502844] lr : __fput+0xb0/0x248
[   12.503451] sp : ffff80000a11baf0
[   12.504039] x29: ffff80000a11baf0 x28: ffff000003de6c80 x27: 0000000000000000
[   12.505287] x26: 0000000000000000 x25: 0000000000000000 x24: 0000000000000000
[   12.506513] x23: ffff00000447ef00 x22: ffff00000ae0ca20 x21: 00000000000a801d
[   12.507746] x20: ffff000001ce6020 x19: ffff000002df7500 x18: 0000000000000000
[   12.508972] x17: 0000000000000000 x16: ffffcab3b1ed6968 x15: 0000000000000000
[   12.510189] x14: 0000000000000000 x13: 0000000000000000 x12: 0000000000000000
[   12.511411] x11: 0000000000000000 x10: 0000000000000001 x9 : ffffcab3b1ed6298
[   12.512650] x8 : ffff00003e40b0c0 x7 : 0000000000000808 x6 : 0000000300000000
[   12.513873] x5 : ffff000001ce6020 x4 : 0000000000008000 x3 : ffff000000f98800
[   12.515101] x2 : ffffffffffffffff x1 : 0000000000000000 x0 : 0000000000000000
[   12.516333] Call trace:
[   12.516771]  __fput+0x240/0x248
[   12.517327]  ____fput+0x18/0x28
[   12.517886]  task_work_run+0xc8/0x140
[   12.518541]  do_exit+0x20c/0x8e0
[   12.519118]  do_group_exit+0x4c/0xb0
[   12.519753]  __wake_up_parent+0x0/0x38
[   12.520437]  invoke_syscall+0x74/0xf0
[   12.521085]  el0_svc_common.constprop.0+0x184/0x1a8
[   12.521939]  do_el0_svc+0x2c/0x90
[   12.522608]  el0_svc+0x24/0x38
[   12.523155]  el0_sync_handler+0xb0/0xb8
[   12.523834]  el0_sync+0x19c/0x1c0
[   12.524426] Code: 91059283 52800020 1400016f 17ffffa0 (d4210000) 
[   12.525503] ---[ end trace 8bd8624b9b8b9618 ]---
[   12.531116] Fixing recursive fault but reboot is needed!
[   12.537950] ------------[ cut here ]------------
[   12.538742] WARNING: CPU: 0 PID: 0 at kernel/rcu/tree.c:638 rcu_eqs_enter.isra.0+0x68/0x70
[   12.540129] Modules linked in: veth xt_nat xt_tcpudp xt_conntrack xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype iptable_filter iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c bpfilter br_netfilter bridge stp llc aufs overlay nls_iso8859_1 dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua aes_ce_blk crypto_simd cryptd aes_ce_cipher crct10dif_ce ghash_ce sha2_ce sha256_arm64 sha1_ce efi_pstore ena sch_fq_codel ipmi_devintf ipmi_msghandler drm ip_tables x_tables autofs4
[   12.547909] CPU: 0 PID: 0 Comm: swapper/0 Tainted: G      D           5.13.0-1028-aws #31~20.04.1-Ubuntu
[   12.549548] Hardware name: Amazon EC2 t4g.micro/, BIOS 1.0 11/1/2018
[   12.550649] pstate: 204000c5 (nzCv daIF +PAN -UAO -TCO BTYPE=--)
[   12.551689] pc : rcu_eqs_enter.isra.0+0x68/0x70
[   12.552484] lr : rcu_idle_enter+0x18/0x28
[   12.553209] sp : ffffcab3b3f33e80
[   12.553797] x29: ffffcab3b3f33e80 x28: 0000000064c94404 x27: ffffcab3b3f48040
[   12.555032] x26: 0000000000000000 x25: 0000000000000000 x24: ffffcab3b3f3db6c
[   12.556266] x23: ffffcab3b3f48040 x22: ffffcab3b397e478 x21: ffffcab3b3f3db2c
[   12.557509] x20: ffffcab3b3f3da20 x19: ffffcab3b3969008 x18: 0000000000000000
[   12.558765] x17: 0000000000000000 x16: 0000000000000000 x15: 0000ffff83bcb658
[   12.560008] x14: 0000000000000000 x13: 0000000000000000 x12: ffffcab3b2f91468
[   12.561243] x11: ffffcab3b3f3db58 x10: 0000000000000b10 x9 : ffffcab3b2a82160
[   12.562484] x8 : ffffcab3b3f48bb0 x7 : 00000000000002ca x6 : 000000007c5faadb
[   12.563738] x5 : 00ffffffffffffff x4 : ffff354c8aa48000 x3 : ffff354c8aa48000
[   12.564977] x2 : 4000000000000002 x1 : 4000000000000000 x0 : ffff00003e3c8b00
[   12.566210] Call trace:
[   12.566650]  rcu_eqs_enter.isra.0+0x68/0x70
[   12.567384]  rcu_idle_enter+0x18/0x28
[   12.568032]  default_idle_call+0x40/0x170
[   12.568738]  do_idle+0x22c/0x280
[   12.569309]  cpu_startup_entry+0x30/0x98
[   12.569994]  rest_init+0xc8/0xd8
[   12.570563]  arch_call_rest_init+0x18/0x24
[   12.571288]  start_kernel+0x700/0x740
[   12.571949] ---[ end trace 8bd8624b9b8b9619 ]---
[   12.762479] Unable to handle kernel NULL pointer dereference at virtual address 0000000000000018
[   12.763998] Mem abort info:
[   12.764443]   ESR = 0x96000004
[   12.764955]   EC = 0x25: DABT (current EL), IL = 32 bits
[   12.765903]   SET = 0, FnV = 0
[   12.766397]   EA = 0, S1PTW = 0
[   12.766881] Data abort info:
[   12.767338]   ISV = 0, ISS = 0x00000004
[   12.767938]   CM = 0, WnR = 0
[   12.768421] user pgtable: 4k pages, 48-bit VAs, pgdp=000000004a694000
[   12.769533] [0000000000000018] pgd=0000000000000000, p4d=0000000000000000
[   12.770563] Internal error: Oops: 96000004 [#2] SMP
[   12.771322] Modules linked in: veth xt_nat xt_tcpudp xt_conntrack xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype iptable_filter iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c bpfilter br_netfilter bridge stp llc aufs overlay nls_iso8859_1 dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua aes_ce_blk crypto_simd cryptd aes_ce_cipher crct10dif_ce ghash_ce sha2_ce sha256_arm64 sha1_ce efi_pstore ena sch_fq_codel ipmi_devintf ipmi_msghandler drm ip_tables x_tables autofs4
[   12.778542] CPU: 1 PID: 1329 Comm: journalctl Tainted: G      D W         5.13.0-1028-aws #31~20.04.1-Ubuntu
[   12.780158] Hardware name: Amazon EC2 t4g.micro/, BIOS 1.0 11/1/2018
[   12.781200] pstate: 80400005 (Nzcv daif +PAN -UAO -TCO BTYPE=--)
[   12.782157] pc : __handle_mm_fault+0x6c/0x510
[   12.782854] lr : handle_mm_fault+0xcc/0x258
[   12.783515] sp : ffff800008a0bcc0
[   12.784069] x29: ffff800008a0bcc0 x28: ffff00000384ae80 x27: 0000000000000007
[   12.785185] x26: ffff00001c2fa5e8 x25: 0000000000000254 x24: ffff00000ad31820
[   12.786330] x23: 0000000000000040 x22: 0000000000000254 x21: ffff00000384ae80
[   12.787500] x20: 0000ffff7a6cc000 x19: ffff00000ad31820 x18: 0000000000000000
[   12.788651] x17: 0000000000000000 x16: 0000000000000000 x15: 0000000000000000
[   12.789797] x14: 0000000000000000 x13: 0000000000000000 x12: 0000000000000000
[   12.790956] x11: 0000000000000000 x10: 0000000000000000 x9 : ffffcab3b1e380d4
[   12.792176] x8 : 0000000000000000 x7 : 0000000000000000 x6 : ffff354c8aa6a000
[   12.793377] x5 : ffff800008a0bd70 x4 : 0000000000000285 x3 : ffff000002df7000
[   12.794598] x2 : 0000000000000254 x1 : 0000ffff7a6cc000 x0 : 0000000000000000
[   12.795810] Call trace:
[   12.796231]  __handle_mm_fault+0x6c/0x510
[   12.796921]  handle_mm_fault+0xcc/0x258
[   12.797578]  do_page_fault+0x170/0x4b8
[   12.798213]  do_translation_fault+0x68/0x78
[   12.798887]  do_mem_abort+0x48/0xb8
[   12.799461]  el0_da+0x40/0x80
[   12.799958]  el0_sync_handler+0x88/0xb8
[   12.800590]  el0_sync+0x19c/0x1c0
[   12.801131] Code: a909ffff a90affff b4000083 f9406c60 (b9401800) 
[   12.802101] ---[ end trace 8bd8624b9b8b961a ]---
[   12.819021] Unable to handle kernel NULL pointer dereference at virtual address 0000000000000030
[   12.820378] Mem abort info:
[   12.820794]   ESR = 0x96000006
[   12.821297]   EC = 0x25: DABT (current EL), IL = 32 bits
[   12.822110]   SET = 0, FnV = 0
[   12.822595]   EA = 0, S1PTW = 0
[   12.823098] Data abort info:
[   12.823535]   ISV = 0, ISS = 0x00000006
[   12.824134]   CM = 0, WnR = 0
[   12.824615] user pgtable: 4k pages, 48-bit VAs, pgdp=00000000425ec000
[   12.825628] [0000000000000030] pgd=080000005cb23003, p4d=080000005cb23003, pud=080000005c8c0003, pmd=0000000000000000
[   12.827234] Internal error: Oops: 96000006 [#3] SMP
[   12.827983] Modules linked in: veth xt_nat xt_tcpudp xt_conntrack xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype iptable_filter iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c bpfilter br_netfilter bridge stp llc aufs overlay nls_iso8859_1 dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua aes_ce_blk crypto_simd cryptd aes_ce_cipher crct10dif_ce ghash_ce sha2_ce sha256_arm64 sha1_ce efi_pstore ena sch_fq_codel ipmi_devintf ipmi_msghandler drm ip_tables x_tables autofs4
[   12.834966] CPU: 0 PID: 1329 Comm: journalctl Tainted: G      D W         5.13.0-1028-aws #31~20.04.1-Ubuntu
[   12.836476] Hardware name: Amazon EC2 t4g.micro/, BIOS 1.0 11/1/2018
[   12.837477] pstate: 60400005 (nZCv daif +PAN -UAO -TCO BTYPE=--)
[   12.838425] pc : down_write+0x38/0x88
[   12.839038] lr : down_write+0x20/0x88
[   12.839628] sp : ffff800008a0b710
[   12.840179] x29: ffff800008a0b710 x28: ffff00000384ae80 x27: 0000000000000007
[   12.841317] x26: ffff00001c2fa5e8 x25: 0000000000000000 x24: 0000000000000000
[   12.842466] x23: ffff800008a0b7e8 x22: 0000000000000000 x21: 0000000000000030
[   12.843594] x20: ffff000002df7000 x19: 0000000000000030 x18: 0000000000000000
[   12.844722] x17: 0000000000000000 x16: 0000000000000000 x15: 0000000000000000
[   12.845802] x14: 0000000000000000 x13: 0000000000000000 x12: ffffcab3b2f91468
[   12.846891] x11: ffffcab3b3f3db58 x10: 0000000000000000 x9 : ffffcab3b2a7a814
[   12.847997] x8 : 0000000000001b80 x7 : 0000000000000005 x6 : 0000ffff92481000
[   12.849070] x5 : 0000000000000000 x4 : 0000ffff7a481000 x3 : 0000000000000000
[   12.850165] x2 : 0000000000000001 x1 : 0000000000000000 x0 : 0000000000000030
[   12.851268] Call trace:
[   12.851649]  down_write+0x38/0x88
[   12.852152]  unlink_file_vma+0x38/0x68
[   12.852747]  free_pgtables+0xc0/0x138
[   12.853306]  exit_mmap+0xe4/0x1c0
[   12.853825]  mmput+0x90/0x1a8
[   12.854308]  exit_mm+0x19c/0x250
[   12.854798]  do_exit+0x198/0x8e0
[   12.855306]  die+0x2e8/0x2f8
[   12.855771]  die_kernel_fault+0x6c/0x80
[   12.856374]  __do_kernel_fault+0xbc/0x1e8
[   12.856990]  do_page_fault+0x208/0x4b8
[   12.857596]  do_translation_fault+0x68/0x78
[   12.858281]  do_mem_abort+0x48/0xb8
[   12.858846]  el1_abort+0x5c/0xe0
[   12.859371]  el1_sync_handler+0xac/0xc8
[   12.859993]  el1_sync+0x7c/0x100
[   12.860529]  __handle_mm_fault+0x6c/0x510
[   12.861187]  handle_mm_fault+0xcc/0x258
[   12.861926]  do_page_fault+0x170/0x4b8
[   12.862541]  do_translation_fault+0x68/0x78
[   12.863226]  do_mem_abort+0x48/0xb8
[   12.863787]  el0_da+0x40/0x80
[   12.864267]  el0_sync_handler+0x88/0xb8
[   12.864896]  el0_sync+0x19c/0x1c0
[   12.865427] Code: d2800001 aa1303e0 d2800022 aa0103e3 (c8e37e62) 
[   12.866373] ---[ end trace 8bd8624b9b8b961b ]---
[   12.872885] Fixing recursive fault but reboot is needed!

Ubuntu 20.04.4 LTS ip-172-31-44-191 ttyS0

ip-172-31-44-191 login: [   13.493058] ------------[ cut here ]------------
[   13.493848] kernel BUG at include/linux/fs.h:3104!
[   13.494605] Internal error: Oops - BUG: 0 [#4] SMP
[   13.495369] Modules linked in: veth xt_nat xt_tcpudp xt_conntrack xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype iptable_filter iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c bpfilter br_netfilter bridge stp llc aufs overlay nls_iso8859_1 dm_multipath scsi_dh_rdac scsi_dh_emc scsi_dh_alua aes_ce_blk crypto_simd cryptd aes_ce_cipher crct10dif_ce ghash_ce sha2_ce sha256_arm64 sha1_ce efi_pstore ena sch_fq_codel ipmi_devintf ipmi_msghandler drm ip_tables x_tables autofs4
[   13.502657] CPU: 0 PID: 1405 Comm: bash Tainted: G      D W         5.13.0-1028-aws #31~20.04.1-Ubuntu
[   13.504173] Hardware name: Amazon EC2 t4g.micro/, BIOS 1.0 11/1/2018
[   13.505200] pstate: 60400005 (nZCv daif +PAN -UAO -TCO BTYPE=--)
[   13.506197] pc : __fput+0x240/0x248
[   13.506778] lr : __fput+0xb0/0x248
[   13.507357] sp : ffff80000a1e3c40
[   13.507916] x29: ffff80000a1e3c40 x28: 0000000000000006 x27: 000000000000f000
[   13.509079] x26: ffff80000a1e3e70 x25: ffffcab3b3969000 x24: ffffcab3b3969008
[   13.510229] x23: ffff000004595300 x22: ffff000000447660 x21: 00000000480a801d
[   13.511405] x20: ffff000004972f30 x19: ffff00000636aa00 x18: ffff00003e40b180
[   13.512564] x17: ffff00003e40b180 x16: fffffc0000711d48 x15: 0000000000000068
[   13.513741] x14: 00000000000000c0 x13: 0000000000000000 x12: fffffc0000711d08
[   13.514957] x11: 0000000000000000 x10: 0000000000000001 x9 : ffffcab3b1ed6298
[   13.516150] x8 : ffff00003e40b0c0 x7 : 0000000000004cf0 x6 : 0000000000019830
[   13.517349] x5 : ffff000004972f30 x4 : 0000000000008000 x3 : ffff000000c69000
[   13.518562] x2 : ffffcab3b1fea810 x1 : 0000000000080000 x0 : 0000000000000000
[   13.519801] Call trace:
[   13.520226]  __fput+0x240/0x248
[   13.520786]  ____fput+0x18/0x28
[   13.521316]  task_work_run+0xc8/0x140
[   13.521928]  do_exit+0x20c/0x8e0
[   13.522459]  do_group_exit+0x4c/0xb0
[   13.523061]  get_signal+0x1ac/0x830
[   13.523669]  do_notify_resume+0x2c8/0x850
[   13.524314]  work_pending+0xc/0x244
[   13.524893] Code: 91059283 52800020 1400016f 17ffffa0 (d4210000) 
[   13.525865] ---[ end trace 8bd8624b9b8b961c ]---
[   13.533116] Fixing recursive fault but reboot is needed!
[   15.124518] loop10: detected capacity change from 0 to 8
2022/06/09 01:48:55Z: Amazon SSM Agent v3.1.1188.0 is running
2022/06/09 01:48:55Z: OsProductName: Ubuntu
2022/06/09 01:48:55Z: OsVersion: 20.04
[   17.253794] cloud-init[1523]: Cloud-init v. 22.2-0ubuntu1~20.04.1 running 'modules:config' at Thu, 09 Jun 2022 01:48:56 +0000. Up 17.11 seconds.
[   17.774237] cloud-init[1528]: Cloud-init v. 22.2-0ubuntu1~20.04.1 running 'modules:final' at Thu, 09 Jun 2022 01:48:56 +0000. Up 17.64 seconds.
[   17.775169] cloud-init[1528]: Cloud-init v. 22.2-0ubuntu1~20.04.1 finished at Thu, 09 Jun 2022 01:48:56 +0000. Datasource DataSourceEc2Local.  Up 17.76 seconds

Revision history for this message

Rob (robd003) wrote on 2022-06-09 (last edit on 2022-06-09):

#35

Just wondering, could we get a "run docker container" test as part of the QA process going forward before new kernels are released?

Revision history for this message

Aarni Koskela (akx) wrote on 2022-06-09:

#36

https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1967924 seems related.

"This patch is touching overlayfs, so we may see potential regressions in overlayfs." We did indeed... :)

Revision history for this message

Roger Sikorski (rogersik) wrote on 2022-06-09:

#37

Is it possible to take this kernel back / away from repository before more system are get broken?

Revision history for this message

indy (cz172638) wrote on 2022-06-09:

#38

Download full text (12.6 KiB)

hit same problem using podman in rootless on linux-image-5.15.0-1008-intel-iotg:
######################################################################
[ 1666.319425] ------------[ cut here ]------------
[ 1666.319433] kernel BUG at include/linux/fs.h:3082!
[ 1666.319443] invalid opcode: 0000 [#3] SMP NOPTI
[ 1666.319449] CPU: 0 PID: 17586 Comm: ls Tainted: G D 5.15.0-1008-intel-iotg #11~20.04.1-Ubuntu
[ 1666.319454] Hardware name: Dell Inc. Precision 5560/XXXXXX, BIOS 1.8.0 02/08/2022
[ 1666.319457] RIP: 0010:__fput+0x265/0x270
[ 1666.319466] Code: 00 48 85 ff 0f 84 6d fe ff ff f6 c7 40 0f 85 64 fe ff ff e8 6d 39 00 00 e9 5a fe ff ff 4c 89 f7 e8 70 96 02 00 e9 97 fe ff ff <0f> 0b 66 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 55 48 89 e5 53 31
[ 1666.319471] RSP: 0018:ffffb3d605127d70 EFLAGS: 00010246
[ 1666.319477] RAX: 0000000000000000 RBX: 00000000000a801d RCX: 0000000000000000
[ 1666.319480] RDX: 0000000000000000 RSI: ffffffff9ffb59f1 RDI: 0000000000000000
[ 1666.319483] RBP: ffffb3d605127d98 R08: ffff942c84c70780 R09: ffff942c8c60b520
[ 1666.319485] R10: 0000000000000010 R11: ffff9433ef5f0c40 R12: ffff942c86b08300
[ 1666.319488] R13: ffff942c8c60b520 R14: ffff942c9079d060 R15: ffff942c8a54ef00
[ 1666.319490] FS: 0000000000000000(0000) GS:ffff9433ef400000(0000) knlGS:0000000000000000
[ 1666.319494] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 1666.319497] CR2: 00007ffcbd0a85c9 CR3: 0000000236f96001 CR4: 0000000000770ef0
[ 1666.319500] PKRU: 55555554
[ 1666.319503] Call Trace:
[ 1666.319505] <TASK>
[ 1666.319510] ____fput+0xe/0x10
[ 1666.319515] task_work_run+0x6d/0xb0
[ 1666.319523] exit_to_user_mode_prepare+0x1b2/0x1c0
[ 1666.319529] syscall_exit_to_user_mode+0x27/0x50
[ 1666.319536] do_syscall_64+0x69/0xc0
[ 1666.319543] ? handle_mm_fault+0xd8/0x2b0
[ 1666.319550] ? exit_to_user_mode_prepare+0x3d/0x1c0
[ 1666.319555] ? do_user_addr_fault+0x1dc/0x650
[ 1666.319560] ? irqentry_exit_to_user_mode+0x9/0x20
[ 1666.319565] ? irqentry_exit+0x19/0x30
[ 1666.319569] ? exc_page_fault+0x89/0x160
[ 1666.319573] ? asm_exc_page_fault+0x8/0x30
[ 1666.319580] entry_SYSCALL_64_after_hwframe+0x44/0xae
[ 1666.319585] RIP: 0033:0x60003530
[ 1666.319592] Code: Unable to access opcode bytes at RIP 0x60003506.
[ 1666.319595] RSP: 002b:00007ffcbd0a83e0 EFLAGS: 00000200 ORIG_RAX: 000000000000003b
[ 1666.319599] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
[ 1666.319602] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[ 1666.319604] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
[ 1666.319606] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
[ 1666.319608] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[ 1666.319612] </TASK>
[ 1666.319614] Modules linked in: overlay uhid rfcomm ccm snd_hda_codec_hdmi cmac algif_hash algif_skcipher af_alg bnep binfmt_misc joydev snd_sof_pci_intel_tgl snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_ctl_led snd_sof_xtensa_dsp snd_sof snd_hda_codec_realtek snd_soc_hdac_hda snd_hda_ext_core snd_hda_codec_generic snd_soc_acpi_intel_match...

hit same problem using podman in rootless on linux-image-5.15.0-1008-intel-iotg:
######################################################################
[ 1666.319425] ------------[ cut here ]------------
[ 1666.319433] kernel BUG at include/linux/fs.h:3082!
[ 1666.319443] invalid opcode: 0000 [#3] SMP NOPTI
[ 1666.319449] CPU: 0 PID: 17586 Comm: ls Tainted: G      D           5.15.0-1008-intel-iotg #11~20.04.1-Ubuntu
[ 1666.319454] Hardware name: Dell Inc. Precision 5560/XXXXXX, BIOS 1.8.0 02/08/2022
[ 1666.319457] RIP: 0010:__fput+0x265/0x270
[ 1666.319466] Code: 00 48 85 ff 0f 84 6d fe ff ff f6 c7 40 0f 85 64 fe ff ff e8 6d 39 00 00 e9 5a fe ff ff 4c 89 f7 e8 70 96 02 00 e9 97 fe ff ff <0f> 0b 66 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 55 48 89 e5 53 31
[ 1666.319471] RSP: 0018:ffffb3d605127d70 EFLAGS: 00010246
[ 1666.319477] RAX: 0000000000000000 RBX: 00000000000a801d RCX: 0000000000000000
[ 1666.319480] RDX: 0000000000000000 RSI: ffffffff9ffb59f1 RDI: 0000000000000000
[ 1666.319483] RBP: ffffb3d605127d98 R08: ffff942c84c70780 R09: ffff942c8c60b520
[ 1666.319485] R10: 0000000000000010 R11: ffff9433ef5f0c40 R12: ffff942c86b08300
[ 1666.319488] R13: ffff942c8c60b520 R14: ffff942c9079d060 R15: ffff942c8a54ef00
[ 1666.319490] FS:  0000000000000000(0000) GS:ffff9433ef400000(0000) knlGS:0000000000000000
[ 1666.319494] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 1666.319497] CR2: 00007ffcbd0a85c9 CR3: 0000000236f96001 CR4: 0000000000770ef0
[ 1666.319500] PKRU: 55555554
[ 1666.319503] Call Trace:
[ 1666.319505]  <TASK>
[ 1666.319510]  ____fput+0xe/0x10
[ 1666.319515]  task_work_run+0x6d/0xb0
[ 1666.319523]  exit_to_user_mode_prepare+0x1b2/0x1c0
[ 1666.319529]  syscall_exit_to_user_mode+0x27/0x50
[ 1666.319536]  do_syscall_64+0x69/0xc0
[ 1666.319543]  ? handle_mm_fault+0xd8/0x2b0
[ 1666.319550]  ? exit_to_user_mode_prepare+0x3d/0x1c0
[ 1666.319555]  ? do_user_addr_fault+0x1dc/0x650
[ 1666.319560]  ? irqentry_exit_to_user_mode+0x9/0x20
[ 1666.319565]  ? irqentry_exit+0x19/0x30
[ 1666.319569]  ? exc_page_fault+0x89/0x160
[ 1666.319573]  ? asm_exc_page_fault+0x8/0x30
[ 1666.319580]  entry_SYSCALL_64_after_hwframe+0x44/0xae
[ 1666.319585] RIP: 0033:0x60003530
[ 1666.319592] Code: Unable to access opcode bytes at RIP 0x60003506.
[ 1666.319595] RSP: 002b:00007ffcbd0a83e0 EFLAGS: 00000200 ORIG_RAX: 000000000000003b
[ 1666.319599] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
[ 1666.319602] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[ 1666.319604] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
[ 1666.319606] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
[ 1666.319608] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[ 1666.319612]  </TASK>
[ 1666.319614] Modules linked in: overlay uhid rfcomm ccm snd_hda_codec_hdmi cmac algif_hash algif_skcipher af_alg bnep binfmt_misc joydev snd_sof_pci_intel_tgl snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_ctl_led snd_sof_xtensa_dsp snd_sof snd_hda_codec_realtek snd_soc_hdac_hda snd_hda_ext_core snd_hda_codec_generic snd_soc_acpi_intel_match snd_soc_acpi soundwire_bus snd_soc_core snd_compress intel_tcc_cooling ac97_bus snd_pcm_dmaengine x86_pkg_temp_thermal snd_hda_intel intel_powerclamp snd_intel_dspcfg snd_intel_sdw_acpi mei_dal mei_hdcp dell_laptop coretemp snd_hda_codec intel_rapl_msr kvm_intel snd_hda_core dell_wmi snd_hwdep ledtrig_audio kvm iwlmvm snd_pcm dell_smbios dcdbas snd_seq_midi mac80211 intel_cstate snd_seq_midi_event snd_rawmidi libarc4 nls_iso8859_1 dell_wmi_descriptor wmi_bmof firmware_attributes_class serio_raw uvcvideo snd_seq videobuf2_vmalloc efi_pstore hid_sensor_custom_intel_hinge
[ 1666.319704]  snd_seq_device videobuf2_memops hid_sensor_als iwlwifi videobuf2_v4l2 hid_sensor_trigger snd_timer r8153_ecm hci_uart cdc_ether industrialio_triggered_buffer iwlmei videobuf2_common usbnet kfifo_buf processor_thermal_device_pci_legacy ee1004 hid_sensor_iio_common input_leds cfg80211 industrialio r8152 processor_thermal_device videodev snd btqca btusb processor_thermal_rfim mei_me processor_thermal_mbox btrtl pmt_telemetry btbcm cros_ec_ishtp mii processor_thermal_rapl mc hid_multitouch soundcore mei pmt_class cros_ec btintel ucsi_acpi intel_rapl_common bluetooth typec_ucsi intel_soc_dts_iosf typec ecdh_generic ecc dptf_power int3403_thermal soc_button_array int340x_thermal_zone mac_hid intel_skl_int3472 intel_hid int3400_thermal acpi_thermal_rel sparse_keymap acpi_pad acpi_tad ipt_REJECT nf_reject_ipv4 xt_LOG nf_log_syslog xt_limit xt_addrtype xt_tcpudp xt_conntrack nf_conntrack sch_fq_codel nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c ipmi_devintf ip6_tables ipmi_msghandler msr
[ 1666.319801]  iptable_filter parport_pc bpfilter ppdev lp parport sunrpc ip_tables x_tables autofs4 dm_crypt usbhid hid_sensor_custom hid_sensor_hub intel_ishtp_loader intel_ishtp_hid i915 i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops hid_generic rtsx_pci_sdmmc cec rc_core crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel crypto_simd cryptd psmouse rtsx_pci nvme intel_lpss_pci drm i2c_i801 i2c_hid_acpi intel_ish_ipc i2c_hid intel_lpss thunderbolt xhci_pci i2c_smbus intel_ishtp idma64 intel_pmt xhci_pci_renesas nvme_core wmi hid video pinctrl_tigerlake
[ 1666.319866] ---[ end trace eace8679e8eed905 ]---
[ 1666.574004] RIP: 0010:__fput+0x265/0x270
[ 1666.574012] Code: 00 48 85 ff 0f 84 6d fe ff ff f6 c7 40 0f 85 64 fe ff ff e8 6d 39 00 00 e9 5a fe ff ff 4c 89 f7 e8 70 96 02 00 e9 97 fe ff ff <0f> 0b 66 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 55 48 89 e5 53 31
[ 1666.574015] RSP: 0018:ffffb3d600e73d88 EFLAGS: 00010246
[ 1666.574033] RAX: 0000000000000000 RBX: 00000000000a800d RCX: 0000000000000000
[ 1666.574034] RDX: ffff942c818e4e48 RSI: ffff942c818e4e48 RDI: 0000000000000000
[ 1666.574035] RBP: ffffb3d600e73db0 R08: ffff942c84c70780 R09: ffff942c89ac4308
[ 1666.574036] R10: 0000000000000010 R11: ffff9433ef670c40 R12: ffff942c86b08300
[ 1666.574037] R13: ffff942c89ac4308 R14: ffff942c8e371a60 R15: ffff942c89b46600
[ 1666.574038] FS:  0000000000000000(0000[ 1666.319425] ------------[ cut here ]------------
[ 1666.319433] kernel BUG at include/linux/fs.h:3082!
[ 1666.319443] invalid opcode: 0000 [#3] SMP NOPTI
[ 1666.319449] CPU: 0 PID: 17586 Comm: ls Tainted: G      D           5.15.0-1008-intel-iotg #11~20.04.1-Ubuntu
[ 1666.319454] Hardware name: Dell Inc. Precision 5560/XXXXXX, BIOS 1.8.0 02/08/2022
[ 1666.319457] RIP: 0010:__fput+0x265/0x270
[ 1666.319466] Code: 00 48 85 ff 0f 84 6d fe ff ff f6 c7 40 0f 85 64 fe ff ff e8 6d 39 00 00 e9 5a fe ff ff 4c 89 f7 e8 70 96 02 00 e9 97 fe ff ff <0f> 0b 66 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 55 48 89 e5 53 31
[ 1666.319471] RSP: 0018:ffffb3d605127d70 EFLAGS: 00010246
[ 1666.319477] RAX: 0000000000000000 RBX: 00000000000a801d RCX: 0000000000000000
[ 1666.319480] RDX: 0000000000000000 RSI: ffffffff9ffb59f1 RDI: 0000000000000000
[ 1666.319483] RBP: ffffb3d605127d98 R08: ffff942c84c70780 R09: ffff942c8c60b520
[ 1666.319485] R10: 0000000000000010 R11: ffff9433ef5f0c40 R12: ffff942c86b08300
[ 1666.319488] R13: ffff942c8c60b520 R14: ffff942c9079d060 R15: ffff942c8a54ef00
[ 1666.319490] FS:  0000000000000000(0000) GS:ffff9433ef400000(0000) knlGS:0000000000000000
[ 1666.319494] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 1666.319497] CR2: 00007ffcbd0a85c9 CR3: 0000000236f96001 CR4: 0000000000770ef0
[ 1666.319500] PKRU: 55555554
[ 1666.319503] Call Trace:
[ 1666.319505]  <TASK>
[ 1666.319510]  ____fput+0xe/0x10
[ 1666.319515]  task_work_run+0x6d/0xb0
[ 1666.319523]  exit_to_user_mode_prepare+0x1b2/0x1c0
[ 1666.319529]  syscall_exit_to_user_mode+0x27/0x50
[ 1666.319536]  do_syscall_64+0x69/0xc0
[ 1666.319543]  ? handle_mm_fault+0xd8/0x2b0
[ 1666.319550]  ? exit_to_user_mode_prepare+0x3d/0x1c0
[ 1666.319555]  ? do_user_addr_fault+0x1dc/0x650
[ 1666.319560]  ? irqentry_exit_to_user_mode+0x9/0x20
[ 1666.319565]  ? irqentry_exit+0x19/0x30
[ 1666.319569]  ? exc_page_fault+0x89/0x160
[ 1666.319573]  ? asm_exc_page_fault+0x8/0x30
[ 1666.319580]  entry_SYSCALL_64_after_hwframe+0x44/0xae
[ 1666.319585] RIP: 0033:0x60003530
[ 1666.319592] Code: Unable to access opcode bytes at RIP 0x60003506.
[ 1666.319595] RSP: 002b:00007ffcbd0a83e0 EFLAGS: 00000200 ORIG_RAX: 000000000000003b
[ 1666.319599] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
[ 1666.319602] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[ 1666.319604] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
[ 1666.319606] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
[ 1666.319608] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[ 1666.319612]  </TASK>
[ 1666.319614] Modules linked in: overlay uhid rfcomm ccm snd_hda_codec_hdmi cmac algif_hash algif_skcipher af_alg bnep binfmt_misc joydev snd_sof_pci_intel_tgl snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_ctl_led snd_sof_xtensa_dsp snd_sof snd_hda_codec_realtek snd_soc_hdac_hda snd_hda_ext_core snd_hda_codec_generic snd_soc_acpi_intel_match snd_soc_acpi soundwire_bus snd_soc_core snd_compress intel_tcc_cooling ac97_bus snd_pcm_dmaengine x86_pkg_temp_thermal snd_hda_intel intel_powerclamp snd_intel_dspcfg snd_intel_sdw_acpi mei_dal mei_hdcp dell_laptop coretemp snd_hda_codec intel_rapl_msr kvm_intel snd_hda_core dell_wmi snd_hwdep ledtrig_audio kvm iwlmvm snd_pcm dell_smbios dcdbas snd_seq_midi mac80211 intel_cstate snd_seq_midi_event snd_rawmidi libarc4 nls_iso8859_1 dell_wmi_descriptor wmi_bmof firmware_attributes_class serio_raw uvcvideo snd_seq videobuf2_vmalloc efi_pstore hid_sensor_custom_intel_hinge
[ 1666.319704]  snd_seq_device videobuf2_memops hid_sensor_als iwlwifi videobuf2_v4l2 hid_sensor_trigger snd_timer r8153_ecm hci_uart cdc_ether industrialio_triggered_buffer iwlmei videobuf2_common usbnet kfifo_buf processor_thermal_device_pci_legacy ee1004 hid_sensor_iio_common input_leds cfg80211 industrialio r8152 processor_thermal_device videodev snd btqca btusb processor_thermal_rfim mei_me processor_thermal_mbox btrtl pmt_telemetry btbcm cros_ec_ishtp mii processor_thermal_rapl mc hid_multitouch soundcore mei pmt_class cros_ec btintel ucsi_acpi intel_rapl_common bluetooth typec_ucsi intel_soc_dts_iosf typec ecdh_generic ecc dptf_power int3403_thermal soc_button_array int340x_thermal_zone mac_hid intel_skl_int3472 intel_hid int3400_thermal acpi_thermal_rel sparse_keymap acpi_pad acpi_tad ipt_REJECT nf_reject_ipv4 xt_LOG nf_log_syslog xt_limit xt_addrtype xt_tcpudp xt_conntrack nf_conntrack sch_fq_codel nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c ipmi_devintf ip6_tables ipmi_msghandler msr
[ 1666.319801]  iptable_filter parport_pc bpfilter ppdev lp parport sunrpc ip_tables x_tables autofs4 dm_crypt usbhid hid_sensor_custom hid_sensor_hub intel_ishtp_loader intel_ishtp_hid i915 i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops hid_generic rtsx_pci_sdmmc cec rc_core crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel crypto_simd cryptd psmouse rtsx_pci nvme intel_lpss_pci drm i2c_i801 i2c_hid_acpi intel_ish_ipc i2c_hid intel_lpss thunderbolt xhci_pci i2c_smbus intel_ishtp idma64 intel_pmt xhci_pci_renesas nvme_core wmi hid video pinctrl_tigerlake
[ 1666.319866] ---[ end trace eace8679e8eed905 ]---
[ 1666.574004] RIP: 0010:__fput+0x265/0x270
[ 1666.574012] Code: 00 48 85 ff 0f 84 6d fe ff ff f6 c7 40 0f 85 64 fe ff ff e8 6d 39 00 00 e9 5a fe ff ff 4c 89 f7 e8 70 96 02 00 e9 97 fe ff ff <0f> 0b 66 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 55 48 89 e5 53 31
[ 1666.574015] RSP: 0018:ffffb3d600e73d88 EFLAGS: 00010246
[ 1666.574033] RAX: 0000000000000000 RBX: 00000000000a800d RCX: 0000000000000000
[ 1666.574034] RDX: ffff942c818e4e48 RSI: ffff942c818e4e48 RDI: 0000000000000000
[ 1666.574035] RBP: ffffb3d600e73db0 R08: ffff942c84c70780 R09: ffff942c89ac4308
[ 1666.574036] R10: 0000000000000010 R11: ffff9433ef670c40 R12: ffff942c86b08300
[ 1666.574037] R13: ffff942c89ac4308 R14: ffff942c8e371a60 R15: ffff942c89b46600
[ 1666.574038] FS:  0000000000000000(0000) GS:ffff9433ef400000(0000) knlGS:0000000000000000
[ 1666.574040] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 1666.574041] CR2: 0000000060003506 CR3: 0000000236f96001 CR4: 0000000000770ef0
[ 1666.574043] PKRU: 55555554
) GS:ffff9433ef400000(0000) knlGS:0000000000000000
[ 1666.574040] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 1666.574041] CR2: 0000000060003506 CR3: 0000000236f96001 CR4: 0000000000770ef0
[ 1666.574043] PKRU: 55555554

################################################################################
steps to reproduce:
podman run --rm -it -$PWD:/root alpine:3.16 /bin/sh -c "cd; ls"

Revision history for this message

indy (cz172638) wrote on 2022-06-09:

#39

Download full text (6.4 KiB)

also present in linux-image-5.15.0-1008-intel-iotg:
##################################################
[ 1666.319425] ------------[ cut here ]------------
[ 1666.319433] kernel BUG at include/linux/fs.h:3082!
[ 1666.319443] invalid opcode: 0000 [#3] SMP NOPTI
[ 1666.319449] CPU: 0 PID: 17586 Comm: ls Tainted: G D 5.15.0-1008-intel-iotg #11~20.04.1-Ubuntu
[ 1666.319454] Hardware name: Dell Inc. Precision 5560/XXXXXX, BIOS 1.8.0 02/08/2022
[ 1666.319457] RIP: 0010:__fput+0x265/0x270
[ 1666.319466] Code: 00 48 85 ff 0f 84 6d fe ff ff f6 c7 40 0f 85 64 fe ff ff e8 6d 39 00 00 e9 5a fe ff ff 4c 89 f7 e8 70 96 02 00 e9 97 fe ff ff <0f> 0b 66 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 55 48 89 e5 53 31
[ 1666.319471] RSP: 0018:ffffb3d605127d70 EFLAGS: 00010246
[ 1666.319477] RAX: 0000000000000000 RBX: 00000000000a801d RCX: 0000000000000000
[ 1666.319480] RDX: 0000000000000000 RSI: ffffffff9ffb59f1 RDI: 0000000000000000
[ 1666.319483] RBP: ffffb3d605127d98 R08: ffff942c84c70780 R09: ffff942c8c60b520
[ 1666.319485] R10: 0000000000000010 R11: ffff9433ef5f0c40 R12: ffff942c86b08300
[ 1666.319488] R13: ffff942c8c60b520 R14: ffff942c9079d060 R15: ffff942c8a54ef00
[ 1666.319490] FS: 0000000000000000(0000) GS:ffff9433ef400000(0000) knlGS:0000000000000000
[ 1666.319494] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 1666.319497] CR2: 00007ffcbd0a85c9 CR3: 0000000236f96001 CR4: 0000000000770ef0
[ 1666.319500] PKRU: 55555554
[ 1666.319503] Call Trace:
[ 1666.319505] <TASK>
[ 1666.319510] ____fput+0xe/0x10
[ 1666.319515] task_work_run+0x6d/0xb0
[ 1666.319523] exit_to_user_mode_prepare+0x1b2/0x1c0
[ 1666.319529] syscall_exit_to_user_mode+0x27/0x50
[ 1666.319536] do_syscall_64+0x69/0xc0
[ 1666.319543] ? handle_mm_fault+0xd8/0x2b0
[ 1666.319550] ? exit_to_user_mode_prepare+0x3d/0x1c0
[ 1666.319555] ? do_user_addr_fault+0x1dc/0x650
[ 1666.319560] ? irqentry_exit_to_user_mode+0x9/0x20
[ 1666.319565] ? irqentry_exit+0x19/0x30
[ 1666.319569] ? exc_page_fault+0x89/0x160
[ 1666.319573] ? asm_exc_page_fault+0x8/0x30
[ 1666.319580] entry_SYSCALL_64_after_hwframe+0x44/0xae
[ 1666.319585] RIP: 0033:0x60003530
[ 1666.319592] Code: Unable to access opcode bytes at RIP 0x60003506.
[ 1666.319595] RSP: 002b:00007ffcbd0a83e0 EFLAGS: 00000200 ORIG_RAX: 000000000000003b
[ 1666.319599] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
[ 1666.319602] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[ 1666.319604] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
[ 1666.319606] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
[ 1666.319608] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[ 1666.319612] </TASK>
[ 1666.319614] Modules linked in: overlay uhid rfcomm ccm snd_hda_codec_hdmi cmac algif_hash algif_skcipher af_alg bnep binfmt_misc joydev snd_sof_pci_intel_tgl snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_ctl_led snd_sof_xtensa_dsp snd_sof snd_hda_codec_realtek snd_soc_hdac_hda snd_hda_ext_core snd_hda_codec_generic snd_soc_acpi_intel_match snd_soc_acpi soundwire_bus snd_soc_core snd_comp...

also present in linux-image-5.15.0-1008-intel-iotg:
##################################################
[ 1666.319425] ------------[ cut here ]------------
[ 1666.319433] kernel BUG at include/linux/fs.h:3082!
[ 1666.319443] invalid opcode: 0000 [#3] SMP NOPTI
[ 1666.319449] CPU: 0 PID: 17586 Comm: ls Tainted: G      D           5.15.0-1008-intel-iotg #11~20.04.1-Ubuntu
[ 1666.319454] Hardware name: Dell Inc. Precision 5560/XXXXXX, BIOS 1.8.0 02/08/2022
[ 1666.319457] RIP: 0010:__fput+0x265/0x270
[ 1666.319466] Code: 00 48 85 ff 0f 84 6d fe ff ff f6 c7 40 0f 85 64 fe ff ff e8 6d 39 00 00 e9 5a fe ff ff 4c 89 f7 e8 70 96 02 00 e9 97 fe ff ff <0f> 0b 66 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 55 48 89 e5 53 31
[ 1666.319471] RSP: 0018:ffffb3d605127d70 EFLAGS: 00010246
[ 1666.319477] RAX: 0000000000000000 RBX: 00000000000a801d RCX: 0000000000000000
[ 1666.319480] RDX: 0000000000000000 RSI: ffffffff9ffb59f1 RDI: 0000000000000000
[ 1666.319483] RBP: ffffb3d605127d98 R08: ffff942c84c70780 R09: ffff942c8c60b520
[ 1666.319485] R10: 0000000000000010 R11: ffff9433ef5f0c40 R12: ffff942c86b08300
[ 1666.319488] R13: ffff942c8c60b520 R14: ffff942c9079d060 R15: ffff942c8a54ef00
[ 1666.319490] FS:  0000000000000000(0000) GS:ffff9433ef400000(0000) knlGS:0000000000000000
[ 1666.319494] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 1666.319497] CR2: 00007ffcbd0a85c9 CR3: 0000000236f96001 CR4: 0000000000770ef0
[ 1666.319500] PKRU: 55555554
[ 1666.319503] Call Trace:
[ 1666.319505]  <TASK>
[ 1666.319510]  ____fput+0xe/0x10
[ 1666.319515]  task_work_run+0x6d/0xb0
[ 1666.319523]  exit_to_user_mode_prepare+0x1b2/0x1c0
[ 1666.319529]  syscall_exit_to_user_mode+0x27/0x50
[ 1666.319536]  do_syscall_64+0x69/0xc0
[ 1666.319543]  ? handle_mm_fault+0xd8/0x2b0
[ 1666.319550]  ? exit_to_user_mode_prepare+0x3d/0x1c0
[ 1666.319555]  ? do_user_addr_fault+0x1dc/0x650
[ 1666.319560]  ? irqentry_exit_to_user_mode+0x9/0x20
[ 1666.319565]  ? irqentry_exit+0x19/0x30
[ 1666.319569]  ? exc_page_fault+0x89/0x160
[ 1666.319573]  ? asm_exc_page_fault+0x8/0x30
[ 1666.319580]  entry_SYSCALL_64_after_hwframe+0x44/0xae
[ 1666.319585] RIP: 0033:0x60003530
[ 1666.319592] Code: Unable to access opcode bytes at RIP 0x60003506.
[ 1666.319595] RSP: 002b:00007ffcbd0a83e0 EFLAGS: 00000200 ORIG_RAX: 000000000000003b
[ 1666.319599] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
[ 1666.319602] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[ 1666.319604] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
[ 1666.319606] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
[ 1666.319608] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[ 1666.319612]  </TASK>
[ 1666.319614] Modules linked in: overlay uhid rfcomm ccm snd_hda_codec_hdmi cmac algif_hash algif_skcipher af_alg bnep binfmt_misc joydev snd_sof_pci_intel_tgl snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_ctl_led snd_sof_xtensa_dsp snd_sof snd_hda_codec_realtek snd_soc_hdac_hda snd_hda_ext_core snd_hda_codec_generic snd_soc_acpi_intel_match snd_soc_acpi soundwire_bus snd_soc_core snd_compress intel_tcc_cooling ac97_bus snd_pcm_dmaengine x86_pkg_temp_thermal snd_hda_intel intel_powerclamp snd_intel_dspcfg snd_intel_sdw_acpi mei_dal mei_hdcp dell_laptop coretemp snd_hda_codec intel_rapl_msr kvm_intel snd_hda_core dell_wmi snd_hwdep ledtrig_audio kvm iwlmvm snd_pcm dell_smbios dcdbas snd_seq_midi mac80211 intel_cstate snd_seq_midi_event snd_rawmidi libarc4 nls_iso8859_1 dell_wmi_descriptor wmi_bmof firmware_attributes_class serio_raw uvcvideo snd_seq videobuf2_vmalloc efi_pstore hid_sensor_custom_intel_hinge
[ 1666.319704]  snd_seq_device videobuf2_memops hid_sensor_als iwlwifi videobuf2_v4l2 hid_sensor_trigger snd_timer r8153_ecm hci_uart cdc_ether industrialio_triggered_buffer iwlmei videobuf2_common usbnet kfifo_buf processor_thermal_device_pci_legacy ee1004 hid_sensor_iio_common input_leds cfg80211 industrialio r8152 processor_thermal_device videodev snd btqca btusb processor_thermal_rfim mei_me processor_thermal_mbox btrtl pmt_telemetry btbcm cros_ec_ishtp mii processor_thermal_rapl mc hid_multitouch soundcore mei pmt_class cros_ec btintel ucsi_acpi intel_rapl_common bluetooth typec_ucsi intel_soc_dts_iosf typec ecdh_generic ecc dptf_power int3403_thermal soc_button_array int340x_thermal_zone mac_hid intel_skl_int3472 intel_hid int3400_thermal acpi_thermal_rel sparse_keymap acpi_pad acpi_tad ipt_REJECT nf_reject_ipv4 xt_LOG nf_log_syslog xt_limit xt_addrtype xt_tcpudp xt_conntrack nf_conntrack sch_fq_codel nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c ipmi_devintf ip6_tables ipmi_msghandler msr
[ 1666.319801]  iptable_filter parport_pc bpfilter ppdev lp parport sunrpc ip_tables x_tables autofs4 dm_crypt usbhid hid_sensor_custom hid_sensor_hub intel_ishtp_loader intel_ishtp_hid i915 i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops hid_generic rtsx_pci_sdmmc cec rc_core crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel crypto_simd cryptd psmouse rtsx_pci nvme intel_lpss_pci drm i2c_i801 i2c_hid_acpi intel_ish_ipc i2c_hid intel_lpss thunderbolt xhci_pci i2c_smbus intel_ishtp idma64 intel_pmt xhci_pci_renesas nvme_core wmi hid video pinctrl_tigerlake
[ 1666.319866] ---[ end trace eace8679e8eed905 ]---
[ 1666.574004] RIP: 0010:__fput+0x265/0x270
[ 1666.574012] Code: 00 48 85 ff 0f 84 6d fe ff ff f6 c7 40 0f 85 64 fe ff ff e8 6d 39 00 00 e9 5a fe ff ff 4c 89 f7 e8 70 96 02 00 e9 97 fe ff ff <0f> 0b 66 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 55 48 89 e5 53 31
[ 1666.574015] RSP: 0018:ffffb3d600e73d88 EFLAGS: 00010246
[ 1666.574033] RAX: 0000000000000000 RBX: 00000000000a800d RCX: 0000000000000000
[ 1666.574034] RDX: ffff942c818e4e48 RSI: ffff942c818e4e48 RDI: 0000000000000000
[ 1666.574035] RBP: ffffb3d600e73db0 R08: ffff942c84c70780 R09: ffff942c89ac4308
[ 1666.574036] R10: 0000000000000010 R11: ffff9433ef670c40 R12: ffff942c86b08300
[ 1666.574037] R13: ffff942c89ac4308 R14: ffff942c8e371a60 R15: ffff942c89b46600
[ 1666.574038] FS:  0000000000000000(0000) GS:ffff9433ef400000(0000) knlGS:0000000000000000
[ 1666.574040] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 1666.574041] CR2: 0000000060003506 CR3: 0000000236f96001 CR4: 0000000000770ef0
[ 1666.574043] PKRU: 55555554
##################################################
reproducer:
podman run --rm -it -v $PWD:/root alpine:3.16 /bin/sh -c "cd;ls"
executed in rootless mode

Revision history for this message

Launchpad Janitor (janitor) wrote on 2022-06-09:

#40

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in linux-intel-iotg-5.15 (Ubuntu Focal):
status:	New → Confirmed
Changed in linux-intel-iotg-5.15 (Ubuntu):
status:	New → Confirmed

Revision history for this message

Electric Daemon (electricdaemon) wrote on 2022-06-09:

#42

Test kernel posted fixes crash but has another bug with unkillable stuck defunct docker-proxy service causing more issues. Bug is not solved. Tested on Linux AWS Lightsail instance.

Revision history for this message

lilideng (lilideng) wrote on 2022-06-09 (last edit on 2022-06-09):

#43

below kernels on azure have this issue. please hold on the new images which contain these kernel releases. thanks.
focal/linux-azure-5.13: 5.13.0-1026-azure bad
focal/linux-azure-5.13: 5.13.0-1025-azure good

focal/linux-azure-5.15: 5.15.0-1008-azure bad
focal/linux-azure-5.15: 5.15.0-1007-azure good

Revision history for this message

Tim Gardner (timg-tpi) wrote on 2022-06-09:

#44

@electricdaemon - please start a new bug report with sufficient detail that someone can diagnose the problem. Is this a regression from previous versions ?

Revision history for this message

Aarni Koskela (akx) wrote on 2022-06-09:

#46

@timg-tpi Yes, in https://bugs.launchpad.net/bugs/1977973 I found 5.13.0-1027-gcp to work fine.

Revision history for this message

Northwest Nodes (northwestnodes) wrote on 2022-06-09:

#47

We can confirm on: 5.13.0-1028-azure

Revision history for this message

Sebastián García Rojas (sebagr) wrote on 2022-06-09:

#48

I can confirm going back to 5.13.0-1027-gcp from 5.13.0-1030-gcp fixed it for me.

Revision history for this message

indy (cz172638) wrote on 2022-06-09:

#49

linux-intel-iotg-5.15:
5.15.0-1003 good
5.15.0-1008 bad

also reproducer (using podman) is smaller:

podman run --rm -it alpine:3.16 ls

which knocks down system
versus

podman run --rm -it busybox ls

which doesn't

Revision history for this message

andersonrfs (andersonrfsilva) wrote on 2022-06-09:

#50

Bug confirmed on Oracle Cloud running Ubuntu 20.04.4 Kernel 5.13.0-1033-oracle.

Workaround with ssh by Gerard(g-kok) works. thx

Tim Gardner (timg-tpi) on 2022-06-09

Changed in linux-oracle-5.13 (Ubuntu Focal):
assignee:	nobody → Tim Gardner (timg-tpi)
importance:	Undecided → High
status:	Confirmed → Fix Committed
Changed in linux-aws-5.13 (Ubuntu Focal):
status:	In Progress → Fix Committed
Changed in linux-azure-5.13 (Ubuntu Focal):
status:	In Progress → Fix Committed
Changed in linux-gcp-5.13 (Ubuntu Focal):
assignee:	nobody → Tim Gardner (timg-tpi)
importance:	Undecided → High
status:	Confirmed → Fix Committed
Changed in linux-intel-iotg-5.15 (Ubuntu Focal):
assignee:	nobody → Tim Gardner (timg-tpi)
importance:	Undecided → High
status:	Confirmed → Fix Committed

Revision history for this message

Tim Gardner (timg-tpi) wrote on 2022-06-09:

#51

@cz172638 - we are aware that the 5.15 Focal backport kernels have this issue as well. However, since 5.15 is not the default -edge kernel yet, it will have to wait until the next SRU cycle due for release 20-June, 2022.

Changed in linux-intel-iotg-5.15 (Ubuntu Focal):
status:	Fix Committed → Won't Fix

Revision history for this message

Connor Riley (ctriley) wrote on 2022-06-09:

#52

Sorry for my ignorance of the software development procedure for Ubuntu, but now that this fix has been committed, how long until it is available via apt on the normal release channels?

Revision history for this message

Bill B. (n1sni) wrote on 2022-06-09:

#53

So this was painful for us. AWS hosted server running Ubuntu 20.04.4 LTS. Just for others, here are the steps we took thanks to the other comments here:

We had to force shut down the machine and wait (aws console). Then we got this script running, and started the machine back up:

while true; do
ssh -o ConnectTimeout=2 -i <.pem file> ubuntu@<host> "sudo systemctl disable docker.service; sudo systemctl disable containerd.service"
done

---
Now, ssh into machine normally, and this is what we ran in our case:
sudo -i
cd /boot/grub
grep -Ei 'submenu|menuentry ' /boot/grub/grub.cfg | sed -re "s/(.? )'([^']+)'.*/\1 \2/"
The result on mine was:
---
menuentry Ubuntu
submenu Advanced options for Ubuntu
        menuentry Ubuntu, with Linux 5.13.0-1028-aws # <-- BAD BAD BAD
        menuentry Ubuntu, with Linux 5.13.0-1028-aws (recovery mode)
        menuentry Ubuntu, with Linux 5.13.0-1025-aws # <-- what we want
        menuentry Ubuntu, with Linux 5.13.0-1025-aws (recovery mode)
        menuentry Ubuntu, with Linux 5.4.0-1029-aws
        menuentry Ubuntu, with Linux 5.4.0-1029-aws (recovery mode)
menuentry Ubuntu 20.04.4 LTS (20.04) (on /dev/nvme0n1p1)
submenu Advanced options for Ubuntu 20.04.4 LTS (20.04) (on /dev/nvme0n1p1)
        menuentry Ubuntu (on /dev/nvme0n1p1)
        menuentry Ubuntu, with Linux 5.13.0-1028-aws (on /dev/nvme0n1p1)
        menuentry Ubuntu, with Linux 5.13.0-1028-aws (recovery mode) (on /dev/nvme0n1p1)
        menuentry Ubuntu, with Linux 5.13.0-1025-aws (on /dev/nvme0n1p1)
        menuentry Ubuntu, with Linux 5.13.0-1025-aws (recovery mode) (on /dev/nvme0n1p1)
        menuentry Ubuntu, with Linux 5.4.0-1029-aws (on /dev/nvme0n1p1)
        menuentry Ubuntu, with Linux 5.4.0-1029-aws (recovery mode) (on /dev/nvme0n1p1)
---
So we wanted off 1028, and back to 1025. Edited:
vi /etc/default/grub
changed:
GRUB_DEFAULT="Advanced options for Ubuntu>Ubuntu, with Linux 5.13.0-1025-aws"

NOTE: the first half is the "MENUENTRY" from above, then ">" and then the submenu.
Saved

run:
grub-mkconfig -o /boot/grub/grub.cfg
sudo reboot

then, after reboot:

sudo systemctl enable docker.service; sudo systemctl enable containerd.service
sudo reboot

Should be back up! It seems like upgrading to the newest kernel also helps based on the above, but will try that later.

Hope this helps someone like you all helped us!!

So this was painful for us. AWS hosted server running Ubuntu 20.04.4 LTS. Just for others, here are the steps we took thanks to the other comments here:

We had to force shut down the machine and wait (aws console). Then we got this script running, and started the machine back up:

while true; do
  ssh -o ConnectTimeout=2 -i <.pem file> ubuntu@<host> "sudo systemctl disable docker.service; sudo systemctl disable containerd.service"
done

---
Now, ssh into machine normally, and this is what we ran in our case:
sudo -i
cd /boot/grub
grep -Ei 'submenu|menuentry ' /boot/grub/grub.cfg | sed -re "s/(.? )'([^']+)'.*/\1 \2/"
The result on mine was:
---
menuentry  Ubuntu
submenu  Advanced options for Ubuntu
        menuentry  Ubuntu, with Linux 5.13.0-1028-aws # <-- BAD BAD BAD
        menuentry  Ubuntu, with Linux 5.13.0-1028-aws (recovery mode)
        menuentry  Ubuntu, with Linux 5.13.0-1025-aws # <-- what we want
        menuentry  Ubuntu, with Linux 5.13.0-1025-aws (recovery mode)
        menuentry  Ubuntu, with Linux 5.4.0-1029-aws
        menuentry  Ubuntu, with Linux 5.4.0-1029-aws (recovery mode)
menuentry  Ubuntu 20.04.4 LTS (20.04) (on /dev/nvme0n1p1)
submenu  Advanced options for Ubuntu 20.04.4 LTS (20.04) (on /dev/nvme0n1p1)
        menuentry  Ubuntu (on /dev/nvme0n1p1)
        menuentry  Ubuntu, with Linux 5.13.0-1028-aws (on /dev/nvme0n1p1)
        menuentry  Ubuntu, with Linux 5.13.0-1028-aws (recovery mode) (on /dev/nvme0n1p1)
        menuentry  Ubuntu, with Linux 5.13.0-1025-aws (on /dev/nvme0n1p1)
        menuentry  Ubuntu, with Linux 5.13.0-1025-aws (recovery mode) (on /dev/nvme0n1p1)
        menuentry  Ubuntu, with Linux 5.4.0-1029-aws (on /dev/nvme0n1p1)
        menuentry  Ubuntu, with Linux 5.4.0-1029-aws (recovery mode) (on /dev/nvme0n1p1)
---
So we wanted off 1028, and back to 1025. Edited: 
vi /etc/default/grub
changed:
GRUB_DEFAULT="Advanced options for Ubuntu>Ubuntu, with Linux 5.13.0-1025-aws"

NOTE: the first half is the "MENUENTRY" from above, then ">" and then the submenu.
Saved

run:
grub-mkconfig -o /boot/grub/grub.cfg
sudo reboot

then, after reboot:

sudo systemctl enable docker.service; sudo systemctl enable containerd.service
sudo reboot

Should be back up! It seems like upgrading to the newest kernel also helps based on the above, but will try that later.

Hope this helps someone like you all helped us!!

Revision history for this message

BenC (wiq-dev-bc) wrote on 2022-06-09:

#54

@n1sni - thank you for your post.

With 5.13.0-1028-aws I could only run hello-world without killing the host.

Reverting back to 5.13.0-1025-aws from 5.13.0-1028-aws I can now run our build containers without problems.

Revision history for this message

Erik Kristensen (unhandledexception) wrote on 2022-06-09:

#55

I would like to echo earlier comments, I think that all affected kernel packages should be pulled from the APT repositories, I also think that all cloud images built with the bad kernel should be pulled too.

Revision history for this message

jd (jeff-dyke) wrote on 2022-06-09:

#56

@n1sni - wanted to extend my thanks as well, but on ubuntu 20.04 that settings was not present in /etc/default/grub, so i had to uninstall 1028 and install 1025. After adding that setting, and reloading and rebooting the change didn't take place, hence the reinstall. Going to make an AMI, until this fix is released.

sudo apt remove linux-image-5.13.0-1028-aws linux-image-aws -y
Then
sudo apt install -y linux-image-5.13.0-1025-aws
Then reboot. If you reboot between you won't have a kernel and won't be able to reboot.

Revision history for this message

Adam-morey (adam-morey) wrote on 2022-06-10:

#57

For Debian and Ubuntu, I used "sudo grub-reboot 2", which forces grub menu 2's kernel on next reboot. Once rebooted, use "dpkg-l | grep 1028" and apt remove each package relate to kernel 1028. Apt will also update grub for you.

Don't forget to uninstall or "break" unattended-upgrades, which is how my server got the new kernel in the first place.

Revision history for this message

Connor Riley (ctriley) wrote on 2022-06-10:

#58

On GCP the fix hit apt. So the easiest way to fix now is simply `sudo apt update && sudo apt upgrade`

Revision history for this message

Francis Ginther (fginther) wrote on 2022-06-10:

#59

Updated kernels are in flight. The updated kernel packages and versions are:

linux-aws-5.13 - 5.13.0-1029.32~20.04.1
linux-azure-5.13 - 5.13.0-1029.34~20.04.1
linux-gcp-5.13 - 5.13.0-1031.37~20.04.1
linux-oracle-5.13 - 5.13.0-1034.40~20.04.1

The azure and gcp kernels are already in focal-updates. The aws kernel is in focal-proposed and the oracle kernel should be there very soon.

Revision history for this message

dan the person (dantheperson) wrote on 2022-06-10 (last edit on 2022-06-10):

#60

For those who can't update, because the machine starts docker at startup and so crashes before you can get a shell open to upgrade to 1031, here's my method (on gcp)

stop and edit machine to detach disk
attach to another machine boot that and mount somewhere
edit <mountpath>/boot/grub/grub.cfg and add single as a kernel commandline parameter
shutdown temp box and detach disk
reattach disk to original machine and boot
connect via serial console
sudo systemctl disable docker
remove single from grub.cfg and reboot
ssh in and update to latest kernel and reboot
sudo systemctl enable docker

Revision history for this message

Erik Forsberg (forsberg) wrote on 2022-06-10 (last edit on 2022-06-10):

#61

I had limited success with "grub-reboot 2", but the following worked fine for me on an AWS EC2 running Ubuntu 20.04.2

sudo grub-reboot "Advanced options for Ubuntu>Ubuntu, with Linux 5.13.0-1025-aws"

Revision history for this message

dan the person (dantheperson) wrote on 2022-06-10:

#62

i'm intrigued, how do you 'sudo grub-reboot' when the machine is crashed?

And if anyone knows how to get the grub boot menu to respond to the keyboard over the serial console on GCP that'd be great, as it would have having to attach the disk to another instance to change the boot kernel or options

Revision history for this message

Jason Campanella (atlantis-stargate) wrote on 2022-06-10:

#63

Not sure if it will work on GCP but in Azure you hold escape to get into Grub while the system is booting.

Revision history for this message

Erik Forsberg (forsberg) wrote on 2022-06-10:

#64

The ability to do 'sudo grub-reboot' depends on the use-case. In my case, the docker jobs were started via crontab, and the machine didn't crash completely, so I was able to login.

Revision history for this message

Bernardo Hugo Signori (bernardos) wrote on 2022-06-10:

#65

In Oracle Cloud you can start a cloud shell console connection then force reboot the instance and in the console press esc, in the Grub menu select the previous kernel. I was able to boot with kernel 5.13.0-1030-oracle without panics.

Revision history for this message

Francis Ginther (fginther) wrote on 2022-06-10:

#66

All of the updated 5.13 kernels have now made it to the archive and into both the focal-updates and focal-security pockets. That list of kernels is:

linux-aws-5.13 - 5.13.0-1029.32~20.04.1
linux-azure-5.13 - 5.13.0-1029.34~20.04.1
linux-gcp-5.13 - 5.13.0-1031.37~20.04.1
linux-oracle-5.13 - 5.13.0-1034.40~20.04.1

Revision history for this message

Matthew Lenz (matthew-nocturnal) wrote on 2022-06-10:

#67

How did people fix this on aws instances that have no serial console access? assuming the disk was mounted and grub.cfg was edited. what did you change in the grub.cfg?

Revision history for this message

Podesta (podesta) wrote on 2022-06-10:

#68

Fixed kernel works like a charm.

@matthew-nocturnal you have to change the default GRUB that loads, so it is on /etc/default/grub. There you change the DEFAULT_GRUB with another one, as has been pointed out in the previous messages. But now you can simply run apt update / upgrade and it should get the latest kernel. If you can't access the machine to do this, you can either use a rescue machine, and do it with chroot, or try to disable docker before it crashes.

Revision history for this message

Sebastian Neumann (basti-megamorf+ubuntu-com) wrote on 2022-06-13 (last edit on 2022-06-13):

#69

Download full text (37.8 KiB)

I can confirm that the problem is indeed not fully fixed. @electricdaemon said:

> Test kernel posted fixes crash but has another bug with unkillable stuck defunct docker-proxy service causing more issues. Bug is not solved. Tested on Linux AWS Lightsail instance.

And that's the problem that I'm seeing as well. Still gathering data for a bug report.

What I'm seeing is that docker-compose stacks either don't start at all or only start partially. In both cases the affected containers cannot start due to their host port being already allocated. I can say with absolute certainty that the ports on the host are dedicated to container applications and no other service is actually bound to the affected port numbers.

# uname -a
Linux ip-10-0-69-193 5.13.0-1029-aws #32~20.04.1-Ubuntu SMP Thu Jun 9 13:03:13 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux

# docker-compose --version
docker-compose version 1.29.2, build 5becea4c

root@ip-10-0-69-193:/opt/myapp8/myappserv/int# docker-compose up -d
Creating network "myappserv-int_default" with the default driver
Creating myapp-migrator-int ... done
Creating myapp-dealer-int ...
Creating myapp-offer-int ...
Creating myapp-customer-int ...
Creating myapp-customer-int ... error
Creating myapp-dealer-int ... done
Creating myapp-offer-int ... done
: port is already allocated

ERROR: for customer Cannot start service customer: driver failed programming external connectivity on endpoint myapp8-customer-int (fe4112364528b0e7d192c793929c579e8a81af715118c8f83ad7e65e7397f3be): Bind for 0.0.0.0:9001 failed: port is already allocated
ERROR: Encountered errors while bringing up the project.

root@ip-10-0-69-193:/opt/myapp8/myappserv/int# docker-compose down
Stopping myapp8-offer-int ... done
Stopping myapp8-dealer-int ... done
Removing myapp8-customer-int ... done
Removing myapp8-offer-int ... done
Removing myapp8-dealer-int ... done
Removing myapp8-migrator-int ... done
Removing network myappserv-int_default

root@ip-10-0-69-193:/opt/myapp8/myappserv/int# docker-compose up -d
Creating network "myappserv-int_default" with the default driver
Creating myapp8-migrator-int ... done
Creating myapp8-offer-int ...
Creating myapp8-customer-int ...
Creating myapp8-customer-int ... error
WARNING: Host is already in use by another container
Creating myapp8-offer-int ... done
ERROR: for myapp8-customer-int Cannot start service customer: driver failed programming external connectivity on endpoint myapp8-customer-int (72fc08854cd278e63cd3234e7fb03c08cb045efdcfb9e42075a1250d893645d5): Bind for 0.0.0.0:9001 failed
Creating myapp8-dealer-int ... done

ERROR: for customer Cannot start service customer: driver failed programming external connectivity on endpoint myapp8-customer-int (72fc08854cd278e63cd3234e7fb03c08cb045efdcfb9e42075a1250d893645d5): Bind for 0.0.0.0:9001 failed: port is already allocated
ERROR: Encountered errors while bringing up the project.

# docker-compose config

services:
  customer:
    container_name: myapp8-customer-int
    depends_on:
      migrator:
        condition: service_completed_successfully
    image: reg.mydomain.tld/myapp8/...

I can confirm that the problem is indeed not fully fixed. @electricdaemon said:

> Test kernel posted fixes crash but has another bug with unkillable stuck defunct docker-proxy service causing more issues. Bug is not solved. Tested on Linux AWS Lightsail instance.

And that's the problem that I'm seeing as well. Still gathering data for a bug report.

What I'm seeing is that docker-compose stacks either don't start at all or only start partially. In both cases the affected containers cannot start due to their host port being already allocated.  I can say with absolute certainty that the ports on the host are dedicated to container applications and no other service is actually bound to the affected port numbers.

# uname -a
Linux ip-10-0-69-193 5.13.0-1029-aws #32~20.04.1-Ubuntu SMP Thu Jun 9 13:03:13 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux

# docker-compose --version
docker-compose version 1.29.2, build 5becea4c

root@ip-10-0-69-193:/opt/myapp8/myappserv/int# docker-compose up -d
Creating network "myappserv-int_default" with the default driver
Creating myapp-migrator-int ... done
Creating myapp-dealer-int   ... 
Creating myapp-offer-int    ... 
Creating myapp-customer-int ... 
Creating myapp-customer-int ... error
Creating myapp-dealer-int   ... done
Creating myapp-offer-int    ... done
: port is already allocated

ERROR: for customer  Cannot start service customer: driver failed programming external connectivity on endpoint myapp8-customer-int (fe4112364528b0e7d192c793929c579e8a81af715118c8f83ad7e65e7397f3be): Bind for 0.0.0.0:9001 failed: port is already allocated
ERROR: Encountered errors while bringing up the project.

root@ip-10-0-69-193:/opt/myapp8/myappserv/int# docker-compose down
Stopping myapp8-offer-int  ... done
Stopping myapp8-dealer-int ... done
Removing myapp8-customer-int ... done
Removing myapp8-offer-int    ... done
Removing myapp8-dealer-int   ... done
Removing myapp8-migrator-int ... done
Removing network myappserv-int_default

root@ip-10-0-69-193:/opt/myapp8/myappserv/int# docker-compose up -d
Creating network "myappserv-int_default" with the default driver
Creating myapp8-migrator-int ... done
Creating myapp8-offer-int    ... 
Creating myapp8-customer-int ... 
Creating myapp8-customer-int ... error
WARNING: Host is already in use by another container
Creating myapp8-offer-int    ... done
ERROR: for myapp8-customer-int  Cannot start service customer: driver failed programming external connectivity on endpoint myapp8-customer-int (72fc08854cd278e63cd3234e7fb03c08cb045efdcfb9e42075a1250d893645d5): Bind for 0.0.0.0:9001 failed
Creating myapp8-dealer-int   ... done

ERROR: for customer  Cannot start service customer: driver failed programming external connectivity on endpoint myapp8-customer-int (72fc08854cd278e63cd3234e7fb03c08cb045efdcfb9e42075a1250d893645d5): Bind for 0.0.0.0:9001 failed: port is already allocated
ERROR: Encountered errors while bringing up the project.

# docker-compose config

services:      
  customer:    
    container_name: myapp8-customer-int          
    depends_on: 
      migrator:                              
        condition: service_completed_successfully
    image: reg.mydomain.tld/myapp8/customer:430d4ca
    ports:         
    - published: 9001
      target: 9001                                                                                                     
    restart: always                                                                                                    
  dealer:
    container_name: myapp8-dealer-int
    depends_on:
      migrator:
        condition: service_completed_successfully
    image: reg.mydomain.tld/myapp8/dealer:430d4ca
    ports:
    - published: 9002
      target: 9002
    restart: always
  migrator:
    container_name: myapp8-migrator-int
    image: reg.mydomain.tld/myapp8/migrator:430d4ca
  offer:
    container_name: myapp8-offer-int
    depends_on:
      migrator:
        condition: service_completed_successfully
    image: reg.mydomain.tld/myapp8/offer:430d4ca
    ports:
    - published: 9003
      target: 9003
    restart: always

version: '3'

# ps aux | grep docker-proxy
root         997  0.0  0.0 1075148 3552 ?        Sl   08:18   0:00 /usr/bin/docker-proxy -proto tcp -host-ip 0.0.0.0 -host-port 19000 -container-ip 172.21.0.2 -container-port 9000
root        1003  0.0  0.0 1148624 3756 ?        Sl   08:18   0:00 /usr/bin/docker-proxy -proto tcp -host-ip :: -host-port 19000 -container-ip 172.21.0.2 -container-port 9000
root        1016  0.0  0.0 1148880 3716 ?        Sl   08:18   0:00 /usr/bin/docker-proxy -proto tcp -host-ip 0.0.0.0 -host-port 8065 -container-ip 172.27.0.2 -container-port 8055
root        1022  0.0  0.0 1222356 3612 ?        Sl   08:18   0:00 /usr/bin/docker-proxy -proto tcp -host-ip :: -host-port 8065 -container-ip 172.27.0.2 -container-port 8055
root        1037  0.0  0.0 1222612 3640 ?        Sl   08:18   0:00 /usr/bin/docker-proxy -proto tcp -host-ip 0.0.0.0 -host-port 8055 -container-ip 172.23.0.2 -container-port 8055
root        1043  0.0  0.0 1075148 3584 ?        Sl   08:18   0:00 /usr/bin/docker-proxy -proto tcp -host-ip :: -host-port 8055 -container-ip 172.23.0.2 -container-port 8055
root        1077  0.0  0.0 1148880 3640 ?        Sl   08:18   0:00 /usr/bin/docker-proxy -proto tcp -host-ip 127.0.0.1 -host-port 40000 -container-ip 172.18.0.2 -container-port 80
root        1090  0.0  0.0 1148880 4140 ?        Sl   08:18   0:00 /usr/bin/docker-proxy -proto tcp -host-ip 0.0.0.0 -host-port 9001 -container-ip 172.26.0.4 -container-port 9001
root        1096  0.0  0.0 1148624 3588 ?        Sl   08:18   0:00 /usr/bin/docker-proxy -proto tcp -host-ip :: -host-port 9001 -container-ip 172.26.0.4 -container-port 9001
root        4519  0.0  0.0 1222612 3896 ?        Sl   09:00   0:00 /usr/bin/docker-proxy -proto tcp -host-ip 0.0.0.0 -host-port 9002 -container-ip 172.28.0.3 -container-port 9002
root        4525  0.0  0.0 1074892 3644 ?        Sl   09:00   0:00 /usr/bin/docker-proxy -proto tcp -host-ip :: -host-port 9002 -container-ip 172.28.0.3 -container-port 9002
root        4539  0.0  0.0 1148880 3716 ?        Sl   09:00   0:00 /usr/bin/docker-proxy -proto tcp -host-ip 0.0.0.0 -host-port 9003 -container-ip 172.28.0.2 -container-port 9003
root        4544  0.0  0.0 1074892 3740 ?        Sl   09:00   0:00 /usr/bin/docker-proxy -proto tcp -host-ip :: -host-port 9003 -container-ip 172.28.0.2 -container-port 9003

# netstat -tulpn | egrep "(Foreign|docker-proxy)"
Proto Recv-Q Send-Q Local Address           Foreign Address         State       PID/Program name    
tcp        0      0 0.0.0.0:9001            0.0.0.0:*               LISTEN      1090/docker-proxy   
tcp        0      0 0.0.0.0:9002            0.0.0.0:*               LISTEN      4519/docker-proxy   
tcp        0      0 0.0.0.0:9003            0.0.0.0:*               LISTEN      4539/docker-proxy   
tcp        0      0 0.0.0.0:8055            0.0.0.0:*               LISTEN      1037/docker-proxy   
tcp        0      0 0.0.0.0:19000           0.0.0.0:*               LISTEN      997/docker-proxy    
tcp        0      0 127.0.0.1:40000         0.0.0.0:*               LISTEN      1077/docker-proxy   
tcp        0      0 0.0.0.0:8065            0.0.0.0:*               LISTEN      1016/docker-proxy   
tcp6       0      0 :::9001                 :::*                    LISTEN      1096/docker-proxy   
tcp6       0      0 :::9002                 :::*                    LISTEN      4525/docker-proxy   
tcp6       0      0 :::9003                 :::*                    LISTEN      4544/docker-proxy   
tcp6       0      0 :::8055                 :::*                    LISTEN      1043/docker-proxy   
tcp6       0      0 :::19000                :::*                    LISTEN      1003/docker-proxy   
tcp6       0      0 :::8065                 :::*                    LISTEN      1022/docker-proxy

Docker daemon startup log with port binding issues

-- Reboot --
Jun 13 08:18:27 ip-10-0-69-193 systemd[1]: Starting Docker Application Container Engine...
Jun 13 08:18:28 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:28.256199431Z" level=info msg="Starting up"
Jun 13 08:18:28 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:28.259890222Z" level=info msg="detected 127.0.0.53 nameserver, assuming systemd-resolved, so using resolv.conf: /run/systemd/resolve/resolv.conf"
Jun 13 08:18:28 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:28.338986729Z" level=info msg="parsed scheme: \"unix\"" module=grpc
Jun 13 08:18:28 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:28.339028163Z" level=info msg="scheme \"unix\" not registered, fallback to default scheme" module=grpc
Jun 13 08:18:28 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:28.339788195Z" level=info msg="ccResolverWrapper: sending update to cc: {[{unix:///run/containerd/containerd.sock  <nil> 0 <nil>}] <nil> <nil>}" module=grpc
Jun 13 08:18:28 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:28.339818518Z" level=info msg="ClientConn switching balancer to \"pick_first\"" module=grpc
Jun 13 08:18:28 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:28.370153265Z" level=info msg="parsed scheme: \"unix\"" module=grpc
Jun 13 08:18:28 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:28.373318484Z" level=info msg="scheme \"unix\" not registered, fallback to default scheme" module=grpc
Jun 13 08:18:28 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:28.373508320Z" level=info msg="ccResolverWrapper: sending update to cc: {[{unix:///run/containerd/containerd.sock  <nil> 0 <nil>}] <nil> <nil>}" module=grpc
Jun 13 08:18:28 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:28.373689058Z" level=info msg="ClientConn switching balancer to \"pick_first\"" module=grpc
Jun 13 08:18:28 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:28.521565874Z" level=info msg="[graphdriver] using prior storage driver: overlay2"
Jun 13 08:18:29 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:29.562866065Z" level=warning msg="Your kernel does not support CPU realtime scheduler"
Jun 13 08:18:29 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:29.563318522Z" level=warning msg="Your kernel does not support cgroup blkio weight"
Jun 13 08:18:29 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:29.563408595Z" level=warning msg="Your kernel does not support cgroup blkio weight_device"
Jun 13 08:18:29 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:29.563993369Z" level=info msg="Loading containers: start."
Jun 13 08:18:30 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:30.607949827Z" level=info msg="Removing stale sandbox 3b05224a2aacde133ba6e5b6b38e5958caca1cd1e25c27a4fc927fe9b0d0e64f (830dd3e1f0f166cd196e6ba7ce968331c9b54a78418cfe94411ffd29b42a2da2)"
Jun 13 08:18:30 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:30.632726976Z" level=warning msg="Error (Unable to complete atomic operation, key modified) deleting object [endpoint 7ee2dcd248f6607a560671a13f4938550bf4640565a589da68982a00817caa6f d4aa1d1b46f9d60e4c63b26d7403860f753d88791989e7a67538846016a0780b], retrying...."
Jun 13 08:18:30 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:30.791435745Z" level=info msg="Removing stale sandbox 887458aa16f64a4a07772ff6d5e154a271b73a6629704134a7c2b713bbd6d565 (eb1fb5718de82dd7719597e5cbac1091159ce4e94379dc07b6ebecfcd74d586e)"
Jun 13 08:18:30 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:30.927944615Z" level=info msg="Removing stale sandbox c73087bb5c2be5f386b2fbaef1d2594be14939e77ca63d95cd9dbe9d62e70ba9 (734e7cab764e533809b6edb6b3f7bdaef174651411e2011107cf4c55c45c8170)"
Jun 13 08:18:31 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:31.056824971Z" level=info msg="Removing stale sandbox e8148d1ffec21c1f44d2a24be5bb6c1d0c7b8c91998ff1e9bb5aa7bebe4ca6e3 (0bd839cbe4ebc01328a9ec8368395eadda7e72ffeffbfed42540a51dea68feca)"
Jun 13 08:18:31 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:31.342728937Z" level=info msg="Removing stale sandbox 0888ff9891abfe4955d610ca1f52c898553b2ef03c05bb510cc837282ca47711 (5a8feb357508e72d55e8b38231e34ecd90f8c2dc596756637e8b0c023ae63369)"
Jun 13 08:18:31 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:31.480517319Z" level=info msg="Removing stale sandbox 0c00e3eb1d3fb560231f3cfc4dd6d43b6128e82a4303e048bc3e1bce095c37e4 (ac811048aac5694c80d44bd8feff50e2baf0f7c94fa30331d93f90446960be93)"
Jun 13 08:18:31 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:31.632098665Z" level=info msg="Removing stale sandbox 210c64a62ea5bdd3d0d11039971318520c027e8af4ed7223695b469a7fa91870 (b699d29d6699dc6d25507f9b58830c614d836903ecf49ae292605021c1692c00)"
Jun 13 08:18:31 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:31.638488221Z" level=info msg="Removing stale endpoint cms-int-dummy (1d3cb1bc1a4c628af3141e00a8b67767501c53e9df41583332917985cd7aa62d)"
Jun 13 08:18:31 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:31.643475450Z" level=info msg="Removing stale endpoint cms-int-contoso (ba362dceaab5a016e32a122131e933ed2c3c4927901726dc610e0d0be26acdde)"
Jun 13 08:18:31 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:31.647756569Z" level=info msg="Removing stale endpoint portainer (18ee707396ada0df1f20b4ac0c7f2f3dad34142071b65861bfc4f921387560f8)"
Jun 13 08:18:31 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:31.659188980Z" level=info msg="Removing stale endpoint logspout-forwarder_logspout_1 (86553ec65469cdcc5d40be071a8975b7dfc2295c4ccf97023dd45bb4e284063c)"
Jun 13 08:18:31 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:31.663687874Z" level=info msg="Removing stale endpoint promtail_promtail_1 (8f0cb821a7322a7f7390c093a538e4af595c042aeb59af0423ae501c1e66cc63)"
Jun 13 08:18:31 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:31.667646467Z" level=info msg="Removing stale endpoint mapp8-customer-int (e826dee7b0d1dc6b015b563e4ebe94169d8bc36cbf57f97bdc808329677c8957)"
Jun 13 08:18:31 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:31.727213355Z" level=info msg="Default bridge (docker0) is assigned with an IP address 172.17.0.0/16. Daemon option --bip can be used to set a preferred IP address"
Jun 13 08:18:31 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:31.847480135Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 08:18:31 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:31.847673088Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 08:18:31 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:31.867425295Z" level=warning msg="Failed to allocate and map port 40000-40000: Bind for 127.0.0.1:40000 failed: port is already allocated"
Jun 13 08:18:31 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:31.903819097Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 08:18:31 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:31.903850582Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 08:18:31 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:31.961822116Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 08:18:31 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:31.961877774Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 08:18:31 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:31.967810851Z" level=warning msg="Failed to allocate and map port 8065-8065: Bind for 0.0.0.0:8065 failed: port is already allocated"
Jun 13 08:18:32 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:32.020268138Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 08:18:32 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:32.021928936Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 08:18:32 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:32.025449737Z" level=warning msg="Failed to allocate and map port 19000-19000: Bind for 0.0.0.0:19000 failed: port is already allocated"
Jun 13 08:18:32 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:32.067237638Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 08:18:32 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:32.067272652Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 08:18:32 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:32.193806632Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 08:18:32 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:32.194391464Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 08:18:32 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:32.201922365Z" level=warning msg="Failed to allocate and map port 8055-8055: Bind for 0.0.0.0:8055 failed: port is already allocated"
Jun 13 08:18:32 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:32.261898918Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 08:18:32 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:32.261925476Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 08:18:32 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:32.274600075Z" level=error msg="Container not cleaned up from containerd from previous run" container=eb1fb5718de82dd7719597e5cbac1091159ce4e94379dc07b6ebecfcd74d586e error="id already in use"
Jun 13 08:18:32 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:32.278680667Z" level=error msg="failed to start container" container=734e7cab764e533809b6edb6b3f7bdaef174651411e2011107cf4c55c45c8170 error="driver failed programming external connectivity on endpoint cms-int-dummy (140d12526fd6ac50af276e5e4bfd9ddf58ddaefe08fb2212e491c252aee9a1eb): Bind for 0.0.0.0:8065 >
Jun 13 08:18:32 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:32.280102302Z" level=error msg="failed to start container" container=5a8feb357508e72d55e8b38231e34ecd90f8c2dc596756637e8b0c023ae63369 error="driver failed programming external connectivity on endpoint portainer (4fef35183d5abc13788f16520cb2128b19e5c7568f28f90ed68a6e01a0672856): Bind for 0.0.0.0:19000 failed: >
Jun 13 08:18:32 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:32.281626508Z" level=error msg="failed to start container" container=b699d29d6699dc6d25507f9b58830c614d836903ecf49ae292605021c1692c00 error="driver failed programming external connectivity on endpoint logspout-forwarder_logspout_1 (3d12e7dee4b448e4b8ab8c0d680bfa72094d3b6fdec7accfcf02e902f33fcfc7): Bind for 12>
Jun 13 08:18:32 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:32.324772875Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 08:18:32 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:32.324804948Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 08:18:32 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:32.375847635Z" level=error msg="failed to start container" container=0bd839cbe4ebc01328a9ec8368395eadda7e72ffeffbfed42540a51dea68feca error="driver failed programming external connectivity on endpoint cms-int-contoso (81163f037821378437ba958ee1efb1062467a678dc2570106ac9b8213bcedfc5): Bind for 0.0.0.0:8055>
Jun 13 08:18:32 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:32.437882668Z" level=warning msg="Failed to allocate and map port 9001-9001: Bind for 0.0.0.0:9001 failed: port is already allocated"
Jun 13 08:18:32 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:32.541567238Z" level=error msg="Container not cleaned up from containerd from previous run" container=830dd3e1f0f166cd196e6ba7ce968331c9b54a78418cfe94411ffd29b42a2da2 error="id already in use"
Jun 13 08:18:32 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:32.634683839Z" level=error msg="failed to start container" container=ac811048aac5694c80d44bd8feff50e2baf0f7c94fa30331d93f90446960be93 error="driver failed programming external connectivity on endpoint mapp8-customer-int (f6187117263ef52b444366c3768fe5d6d2f790b5c2c67f92ebbb37ee95c3efb1): Bind for 0.0.0.0:9001>
Jun 13 08:18:33 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:33.634918366Z" level=info msg="Loading containers: done."
Jun 13 08:18:33 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:33.706060624Z" level=info msg="Docker daemon" commit=a89b842 graphdriver(s)=overlay2 version=20.10.17
Jun 13 08:18:33 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:33.707922063Z" level=info msg="Daemon has completed initialization"
Jun 13 08:18:33 ip-10-0-69-193 systemd[1]: Started Docker Application Container Engine.
Jun 13 08:18:33 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:18:33.862505842Z" level=info msg="API listen on /run/docker.sock"
Jun 13 08:22:43 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:22:43.789841985Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 08:22:43 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:22:43.790506837Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 08:22:45 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:22:45.185275300Z" level=info msg="ignoring event" container=4027e2353a12d39b7c26185341420e592a7df6422a566a031f4edef00a8ac774 module=libcontainerd namespace=moby topic=/tasks/delete type="*events.TaskDelete"
Jun 13 08:22:45 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:22:45.381127409Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 08:22:45 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:22:45.381181456Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 08:22:45 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:22:45.387012769Z" level=warning msg="Failed to allocate and map port 9001-9001: Bind for 0.0.0.0:9001 failed: port is already allocated"
Jun 13 08:22:45 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:22:45.453286174Z" level=error msg="ac811048aac5694c80d44bd8feff50e2baf0f7c94fa30331d93f90446960be93 cleanup: failed to delete container from containerd: no such container"
Jun 13 08:22:45 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:22:45.453349031Z" level=error msg="Handler for POST /v1.41/containers/ac811048aac5694c80d44bd8feff50e2baf0f7c94fa30331d93f90446960be93/start returned error: driver failed programming external connectivity on endpoint mapp8-customer-int (7fa7459091a2af4e51957c6f5dfd8ff2f5cd56b08b10e33befb16c44bb79700e): Bind for>
Jun 13 08:22:59 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:22:59.916291062Z" level=info msg="ignoring event" container=830dd3e1f0f166cd196e6ba7ce968331c9b54a78418cfe94411ffd29b42a2da2 module=libcontainerd namespace=moby topic=/tasks/delete type="*events.TaskDelete"
Jun 13 08:22:59 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:22:59.918733458Z" level=info msg="ignoring event" container=cf64e743992bbf2f2d7b1c850f20aa12651a0183cb73b5927e2ec00118b67152 module=libcontainerd namespace=moby topic=/tasks/delete type="*events.TaskDelete"
Jun 13 08:23:17 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:23:17.827418142Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 08:23:17 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:23:17.827450325Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 08:23:19 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:23:19.047386493Z" level=info msg="ignoring event" container=ae395c2538e700fd924bbec6f2b4a5b57d7e2ea6d5245a849500cef2c0ca4e60 module=libcontainerd namespace=moby topic=/tasks/delete type="*events.TaskDelete"
Jun 13 08:23:19 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:23:19.321630729Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 08:23:19 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:23:19.321746414Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 08:23:19 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:23:19.329549269Z" level=warning msg="Failed to allocate and map port 9001-9001: Bind for 0.0.0.0:9001 failed: port is already allocated"
Jun 13 08:23:19 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:23:19.382935265Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 08:23:19 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:23:19.383236299Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 08:23:19 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:23:19.463762674Z" level=error msg="dad821d5302241f31cd491d688076ae6e4f1d5e464eba12f0da280a89f9db41f cleanup: failed to delete container from containerd: no such container"
Jun 13 08:23:19 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:23:19.463820543Z" level=error msg="Handler for POST /v1.41/containers/dad821d5302241f31cd491d688076ae6e4f1d5e464eba12f0da280a89f9db41f/start returned error: driver failed programming external connectivity on endpoint mapp8-customer-int (c32484aaa31d246b8f460f19dc7ac8361701617569a113fb1bddaf4ac00723ce): Bind for>
Jun 13 08:23:19 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:23:19.487660545Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 08:23:19 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:23:19.487702291Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 08:52:09 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:52:09.862833838Z" level=info msg="ignoring event" container=37b53379da4cf48ff0242a73682a0e183b2747a91fb89afd6cab327a79efe212 module=libcontainerd namespace=moby topic=/tasks/delete type="*events.TaskDelete"
Jun 13 08:52:09 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:52:09.865856024Z" level=info msg="ignoring event" container=6199c36ad2b2227d19947abfc87e45b80b3f3a81a7f055a27223691331cab279 module=libcontainerd namespace=moby topic=/tasks/delete type="*events.TaskDelete"
Jun 13 08:52:21 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:52:21.967320843Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 08:52:21 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:52:21.967390358Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 08:52:23 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:52:23.190909775Z" level=info msg="ignoring event" container=9ccb3f701316409756de0bf0fd02a04ef8fd4216a0a41f4de720014c8c00cca7 module=libcontainerd namespace=moby topic=/tasks/delete type="*events.TaskDelete"
Jun 13 08:52:23 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:52:23.445491986Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 08:52:23 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:52:23.445927880Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 08:52:23 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:52:23.533028998Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 08:52:23 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:52:23.533068877Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 08:52:23 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:52:23.537043683Z" level=warning msg="Failed to allocate and map port 9001-9001: Bind for 0.0.0.0:9001 failed: port is already allocated"
Jun 13 08:52:23 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:52:23.622866735Z" level=error msg="064b2123bee332effd04dbd3bcffcfb8f11809ca5ab8a64a49bf51226ca7086c cleanup: failed to delete container from containerd: no such container"
Jun 13 08:52:23 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:52:23.622924463Z" level=error msg="Handler for POST /v1.41/containers/064b2123bee332effd04dbd3bcffcfb8f11809ca5ab8a64a49bf51226ca7086c/start returned error: driver failed programming external connectivity on endpoint mapp8-customer-int (9f1d38c06efb5b80aeda25a48e0e7af7f9e36c79fd87b7a055551c8fa30fae20): Bind for>
Jun 13 08:52:23 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:52:23.633530605Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 08:52:23 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:52:23.633561674Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 08:54:47 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:54:47.829285003Z" level=info msg="ignoring event" container=9a5415028dc593f9fe4523fbeefe2531b35dff84f87b21c55f58bd00b911f910 module=libcontainerd namespace=moby topic=/tasks/delete type="*events.TaskDelete"
Jun 13 08:54:47 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:54:47.834727552Z" level=info msg="ignoring event" container=7740f98f1129a3f5876cf681ccc0c01020a0b2bef6e4ff617409e2f90d7b0aea module=libcontainerd namespace=moby topic=/tasks/delete type="*events.TaskDelete"
Jun 13 08:56:54 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:56:54.728411446Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 08:56:54 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:56:54.728448756Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 08:56:55 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:56:55.897911938Z" level=info msg="ignoring event" container=0734bba3e7ee3374de333efaac9f2d2db1dca85368240b75ccd64ffbf1400ea9 module=libcontainerd namespace=moby topic=/tasks/delete type="*events.TaskDelete"
Jun 13 08:56:56 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:56:56.177063023Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 08:56:56 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:56:56.177301235Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 08:56:56 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:56:56.260179273Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 08:56:56 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:56:56.260223950Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 08:56:56 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:56:56.284922803Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 08:56:56 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:56:56.284958172Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 08:56:56 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:56:56.305397156Z" level=warning msg="Failed to allocate and map port 9001-9001: Bind for 0.0.0.0:9001 failed: port is already allocated"
Jun 13 08:56:56 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:56:56.399247702Z" level=error msg="b7eecf49152619fa109f575cd1058e0a6f6f9d80389349ab6c80c9d931570b1d cleanup: failed to delete container from containerd: no such container"
Jun 13 08:56:56 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:56:56.399293703Z" level=error msg="Handler for POST /v1.41/containers/b7eecf49152619fa109f575cd1058e0a6f6f9d80389349ab6c80c9d931570b1d/start returned error: driver failed programming external connectivity on endpoint mapp8-customer-int (fe4112364528b0e7d192c793929c579e8a81af715118c8f83ad7e65e7397f3be): Bind for>
Jun 13 08:59:57 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:59:57.527631039Z" level=info msg="ignoring event" container=90e2599350b1e86088807cf3919b297982655bb8cd5f09bec2d39535a35e4fdc module=libcontainerd namespace=moby topic=/tasks/delete type="*events.TaskDelete"
Jun 13 08:59:57 ip-10-0-69-193 dockerd[622]: time="2022-06-13T08:59:57.531088853Z" level=info msg="ignoring event" container=b9e863ae8291aeaa2d6ac7bcad1f6bb4bee35f0ccddfd88e31109ade1dc8c18d module=libcontainerd namespace=moby topic=/tasks/delete type="*events.TaskDelete"
Jun 13 09:00:21 ip-10-0-69-193 dockerd[622]: time="2022-06-13T09:00:21.007435346Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 09:00:21 ip-10-0-69-193 dockerd[622]: time="2022-06-13T09:00:21.007470812Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 09:00:22 ip-10-0-69-193 dockerd[622]: time="2022-06-13T09:00:22.308127921Z" level=info msg="ignoring event" container=36ebd6dc2a75e97526da0b5d9e638306c4a247d1e1bdb1a0f16df7af260959c9 module=libcontainerd namespace=moby topic=/tasks/delete type="*events.TaskDelete"
Jun 13 09:00:22 ip-10-0-69-193 dockerd[622]: time="2022-06-13T09:00:22.609096106Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 09:00:22 ip-10-0-69-193 dockerd[622]: time="2022-06-13T09:00:22.609134479Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 09:00:22 ip-10-0-69-193 dockerd[622]: time="2022-06-13T09:00:22.616347863Z" level=warning msg="Failed to allocate and map port 9001-9001: Bind for 0.0.0.0:9001 failed: port is already allocated"
Jun 13 09:00:22 ip-10-0-69-193 dockerd[622]: time="2022-06-13T09:00:22.700350206Z" level=error msg="02f56d9557f39c9a9851a052d3e3423192b03e96c72ca95794eff78933390533 cleanup: failed to delete container from containerd: no such container"
Jun 13 09:00:22 ip-10-0-69-193 dockerd[622]: time="2022-06-13T09:00:22.700662555Z" level=error msg="Handler for POST /v1.41/containers/02f56d9557f39c9a9851a052d3e3423192b03e96c72ca95794eff78933390533/start returned error: driver failed programming external connectivity on endpoint mapp8-customer-int (72fc08854cd278e63cd3234e7fb03c08cb045efdcfb9e42075a1250d893645d5): Bind for>
Jun 13 09:00:22 ip-10-0-69-193 dockerd[622]: time="2022-06-13T09:00:22.728698048Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 09:00:22 ip-10-0-69-193 dockerd[622]: time="2022-06-13T09:00:22.728742840Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 09:00:22 ip-10-0-69-193 dockerd[622]: time="2022-06-13T09:00:22.788951270Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 09:00:22 ip-10-0-69-193 dockerd[622]: time="2022-06-13T09:00:22.788996392Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 09:32:46 ip-10-0-69-193 dockerd[622]: time="2022-06-13T09:32:46.956730677Z" level=info msg="No non-localhost DNS nameservers are left in resolv.conf. Using default external servers: [nameserver 8.8.8.8 nameserver 8.8.4.4]"
Jun 13 09:32:46 ip-10-0-69-193 dockerd[622]: time="2022-06-13T09:32:46.956771453Z" level=info msg="IPv6 enabled; Adding default IPv6 external servers: [nameserver 2001:4860:4860::8888 nameserver 2001:4860:4860::8844]"
Jun 13 09:32:46 ip-10-0-69-193 dockerd[622]: time="2022-06-13T09:32:46.961616258Z" level=warning msg="Failed to allocate and map port 8055-8055: Bind for 0.0.0.0:8055 failed: port is already allocated"
Jun 13 09:32:47 ip-10-0-69-193 dockerd[622]: time="2022-06-13T09:32:47.077575092Z" level=error msg="0bd839cbe4ebc01328a9ec8368395eadda7e72ffeffbfed42540a51dea68feca cleanup: failed to delete container from containerd: no such container"
Jun 13 09:32:47 ip-10-0-69-193 dockerd[622]: time="2022-06-13T09:32:47.077671169Z" level=error msg="Handler for POST /v1.41/containers/0bd839cbe4ebc01328a9ec8368395eadda7e72ffeffbfed42540a51dea68feca/start returned error: driver failed programming external connectivity on endpoint cms-int-contoso (b60cfae8405d9213bd1cbc583d46fbf9f7cbcbeafd2a1d4b33fa3d6162d00267): Bind for>

Revision history for this message

Tim Gardner (timg-tpi) wrote on 2022-06-13:

#70

Sebastian Neumann (basti-megamorf+ubuntu-com) - please start a new bug report so that we can address your specific problem. It may or may not be related to the patch that fixed this kernel crash.

Changed in linux-aws-5.13 (Ubuntu Focal):
status:	Fix Committed → Fix Released
Changed in linux-azure-5.13 (Ubuntu Focal):
status:	Fix Committed → Fix Released
Changed in linux-gcp-5.13 (Ubuntu Focal):
status:	Fix Committed → Fix Released
Changed in linux-oracle-5.13 (Ubuntu Focal):
status:	Fix Committed → Fix Released

Revision history for this message

Sebastian Neumann (basti-megamorf+ubuntu-com) wrote on 2022-06-13:

#71

Created a new bug report: https://bugs.launchpad.net/ubuntu/+source/linux-aws-5.13/+bug/1978475

Hopefully @electricdaemon and other affected users can help to provide a reproducible test.

Revision history for this message

dan the person (dantheperson) wrote on 2022-06-22:

#72

@matthew-nocturnal

For me single user mode would stop docker starting, and thus avoid the crash. But if you don't have serial console, how would you then get a shell to then fix the machine?

For single user mode, find the first menuentry in grub.cfg and add single after ro

i.e change
linux /boot/vmlinuz-5.13.0-1030-gcp root=PARTUUID=3c480693-932a-4c3c-8409-1bc45cd64f32 ro console=ttyS0

to
linux /boot/vmlinuz-5.13.0-1030-gcp root=PARTUUID=3c480693-932a-4c3c-8409-1bc45cd64f32 ro single console=ttyS0

Revision history for this message

dan the person (dantheperson) wrote on 2022-06-22 (last edit on 2022-06-22):

#73

Alternatively you can apparently just stop docker starting. instead of adding 'single' add 'systemd.mask=docker.service'

That will work better for you if you don't have serial console as then networking will still come up.

https://unix.stackexchange.com/a/176406/64349

Dave Chiluk (chiluk) on 2024-06-21

tags:

removed: indeed

Ubuntu
linux-gcp-5.13 package

Docker container creation causes kernel oops on linux-aws 5.13.0.1028.31~20.04.22

Bug Description

Duplicates of this bug

Other bug subscribers

Bug attachments

Remote bug watches

	Status	Importance	Assigned to
linux-aws-5.13 (Ubuntu)	Confirmed	Undecided	Unassigned
Focal	Fix Released	High	Tim Gardner
linux-azure-5.13 (Ubuntu)	Confirmed	Undecided	Unassigned
Focal	Fix Released	High	Tim Gardner
linux-gcp-5.13 (Ubuntu)	Confirmed	Undecided	Unassigned
Focal	Fix Released	High	Tim Gardner
linux-intel-iotg-5.15 (Ubuntu)	Confirmed	Undecided	Unassigned
Focal	Won't Fix	High	Tim Gardner
linux-oracle-5.13 (Ubuntu)	Confirmed	Undecided	Unassigned
Focal	Fix Released	High	Tim Gardner

Ubuntulinux-gcp-5.13 package

Docker container creation causes kernel oops on linux-aws 5.13.0.1028.31~20.04.22

Bug Description

Duplicates of this bug

Other bug subscribers

Bug attachments

Remote bug watches

Ubuntu
linux-gcp-5.13 package