Ubuntu Bionic freezes on Supermicro hardware when console redirection is configured in kernel parameters
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux (Ubuntu) |
Expired
|
Undecided
|
Unassigned |
Bug Description
STEPS TO REPRODUCE
1. Use MAAS for deploying Ubuntu on bare metal servers.
2. Enable serial-over-lan in BIOS.
3. Enable console redirection in kernel parameters:
console=tty0 console=
4. Connect to the ipmi console to watch the boot and installation process
ipmiconsole -u <user> -p <pass> -h <host>
5. Start deploying the server with MAAS (bionic, GA kernel)
It may require several re-deployments until you experience the issue. In my case, I'm deploying 9 bare metal machines at the same time. The issue typically surfaces on one random node, 50% of the time.
6. During the deployment, after first reboot, typically during execution of cloud-init scripts, the ipmi console freezes. Typically in the middle of printing out some output.
You can still SSH to the node. While being logged into the node:
- running 'ps aux' takes more than 20 seconds,
- running 'systemctl' times out,
- the number of zombie processes increases.
Sometimes, running 'sosreport' unblocks the node, so that cloud-init scripts finish executing and the node shuts down as expected.
Sometimes, only logging into the node with SSH unblocks it.
7. Eventually, after 30 minutes, MAAS marks the node as "Failed deployment".
WORKAROUND
Remove console redirection from kernel parameters (console=tty0 console=
AFFECTED HARDWARE
Supermicro SYS-2029U-TR4
https:/
OTHER
The issue does not affect Xenial with GA kernel.
This bug is missing log files that will aid in diagnosing the problem. While running an Ubuntu kernel (not a mainline or third-party kernel) please enter the following command in a terminal window:
apport-collect 1865145
and then change the status of the bug to 'Confirmed'.
If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.
This change has been made by an automated script, maintained by the Ubuntu Kernel Team.