Quanta server memory stress causes "rcu_sched detected stalls on cpus/tasks"

Bug #1796217 reported by Zhanglei Mao on 2018-10-05

This bug affects 1 person
Affects         Importance  Assigned to
linux (Ubuntu)  High        Unassigned
Bionic          High        Unassigned

Bug Description

We are facing a problem during the certification test. The system always stops at the screen shown in the attachment while running the memory stress test. When we reproduce the problem, the kernel appears to behave abnormally and cannot capture the above information in the syslog or kernel log, but the console shows the same warning as in the attached screenshot.
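
Because the warning reaches the console but not the syslog or kernel log, one option is to save the console output as text (for example from a serial console) and search it afterward. The following is a minimal sketch under that assumption. The function name `find_rcu_stalls` is hypothetical, and the pattern is based only on the message quoted in this bug's title, so other kernels may word the warning differently.

```python
import re

# Pattern taken from the warning quoted in the bug title (assumption:
# the console prints the same wording, possibly with different casing).
STALL_PATTERN = re.compile(r"rcu_sched detected stalls on cpus/tasks", re.IGNORECASE)


def find_rcu_stalls(console_text):
    """Return (line_number, line) pairs for lines containing the stall warning."""
    hits = []
    for number, line in enumerate(console_text.splitlines(), start=1):
        if STALL_PATTERN.search(line):
            hits.append((number, line.strip()))
    return hits
```

Given a saved capture, `find_rcu_stalls(open(path).read())` would list the line numbers where the stall warning appears, which may help when attaching the relevant part of the console output to the report.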

Zhanglei Mao (zhanglei-mao) wrote :

This bug is missing log files that will aid in diagnosing the problem. While running an Ubuntu kernel (not a mainline or third-party kernel) please enter the following command in a terminal window:

apport-collect 1796217

and then change the status of the bug to 'Confirmed'.

If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.

This change has been made by an automated script, maintained by the Ubuntu Kernel Team.

Changed in linux (Ubuntu):
status: New → Incomplete
Joseph Salisbury (jsalisbury) wrote :

Would it be possible for you to test the latest upstream kernel? Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Please test the latest v4.19 kernel[0].

If this bug is fixed in the mainline kernel, please add the following tag 'kernel-fixed-upstream'.

If the mainline kernel does not fix this bug, please add the tag: 'kernel-bug-exists-upstream'.

Once testing of the upstream kernel is complete, please mark this bug as "Confirmed".

Thanks in advance.

[0] http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.19-rc7

Changed in linux (Ubuntu):
importance: Undecided → High
Changed in linux (Ubuntu Bionic):
importance: Undecided → High
status: New → Incomplete
tags: added: kernel-da-key
Joseph Salisbury (jsalisbury) wrote :

Also, are there prior kernel versions that do not exhibit this bug?

Zhanglei Mao (zhanglei-mao) wrote :

Good feedback from the partner: "I just update Ubuntu kernel to v4.19 and Memory Stress can got pass result this time."

Jeff Lane (bladernr) wrote :

How about the 4.4 kernel? And are they using a normal HDD/SSD for the root filesystem, or are they using an NVMe device of some sort?

I'm currently following another bug where the memory test fails reliably when an NVMe device is used as the root filesystem but passes when an SSD or HDD is used. In that case, 4.4 passes, but 4.15 and 4.18 fail.
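
One quick way to answer the root filesystem question is to look at which device is mounted at `/`. The following is a rough sketch, assuming text in the format of `/proc/mounts`. The helper names `root_device` and `root_is_nvme` are hypothetical, and the check is only a heuristic: a root filesystem on LVM or another device-mapper layer shows up as `/dev/mapper/...` and would not be recognized as NVMe here.

```python
def root_device(mounts_text):
    """Return the device mounted at "/" from /proc/mounts-style text, or None."""
    device = None
    for line in mounts_text.splitlines():
        fields = line.split()
        if len(fields) >= 2 and fields[1] == "/":
            # Keep the last match, since later mounts on "/" override earlier ones.
            device = fields[0]
    return device


def root_is_nvme(mounts_text):
    """Heuristic: True if the root device name starts with /dev/nvme."""
    device = root_device(mounts_text)
    return device is not None and device.startswith("/dev/nvme")
```

On the system under test this could be called as `root_is_nvme(open("/proc/mounts").read())`.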
