Comment 9 for bug 1921355

Revision history for this message
Nikita Nedvetskiy (n-nedvetskiy) wrote :

Thank you all for your ideas!

Sure, we do have some modules not from the kernel source tree. These are Mellanox (our NICs) and OpenvSwitch, as we've had some problems that were fixed in the newer driver versions.

We don't have apport enabled, and actually, the hypervisor nodes don't even have direct access to the internet (only some VMs on them).
I checked on a test VM what kind of info it collects, and it seems that these are the arch, kernel version, and the stack trace. That kind of info is attached manually, we have netconsole enabled that collected it.

When the issue started, it was even reproducible on the then-latest kernel (5.4.0-66), so I'm not sure that simply upgrading can help.

Currently I'm working on integrating kdump into our infrastructure, trying to reproduce again, and I'll also try to schedule migration + upgrade for our hypervisor node (that's not fast though).