Comment 45 for bug 1011792

Steven Noonan (steven-valvesoftware) wrote :

Out of curiosity, I tried an Arch Linux instance (running Linux 3.5.6-1-ARCH), which also has CONFIG_SCHED_AUTOGROUP enabled:

# zgrep AUTOGROUP /proc/config.gz
CONFIG_SCHED_AUTOGROUP=y
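
For anyone repeating this check on other instances, here is a rough sketch rather than a definitive script. It assumes the kernel config is exposed either at /proc/config.gz (as on Arch) or at /boot/config-$(uname -r) (where Ubuntu normally installs it), and that /proc/sys/kernel/sched_autogroup_enabled is present when autogroup support is built in. The helper name autogroup_in_config is made up for illustration.

```shell
#!/bin/sh
# Report whether CONFIG_SCHED_AUTOGROUP=y appears in a kernel config file.
# autogroup_in_config is a hypothetical helper name, not part of any tool.
autogroup_in_config() {
    cfg="$1"
    case "$cfg" in
        *.gz) gzip -dc "$cfg" ;;   # compressed config, e.g. /proc/config.gz
        *)    cat "$cfg" ;;        # plain-text config, e.g. /boot/config-*
    esac | grep -q '^CONFIG_SCHED_AUTOGROUP=y$'
}

# Try the usual locations; either one may be missing on a given distro.
for cfg in /proc/config.gz "/boot/config-$(uname -r)"; do
    [ -r "$cfg" ] || continue
    if autogroup_in_config "$cfg"; then
        echo "$cfg: CONFIG_SCHED_AUTOGROUP=y"
    else
        echo "$cfg: CONFIG_SCHED_AUTOGROUP not set"
    fi
done

# Runtime toggle: 1 means autogrouping is active, 0 means it is disabled.
if [ -r /proc/sys/kernel/sched_autogroup_enabled ]; then
    echo "sched_autogroup_enabled=$(cat /proc/sys/kernel/sched_autogroup_enabled)"
fi
```

The runtime value matters as well as the build option, since a kernel built with autogroup support can still have it switched off through that sysctl.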

I ran the same pgslam workload on it, and it filled 64G of the /var/lib/postgres md-raid without deadlocking before I stopped it. That suggests one of two possibilities: either this scheduler deadlock was fixed somewhere between 3.2.x and 3.5.x, or one of the Ubuntu-specific commits is to blame. Unfortunately this isn't easy to bisect, because it takes a lot of kernel rebuilds and a lot of patience. I'm working on some other things at the moment, but hopefully we can narrow this down further and get it fixed properly. Autogrouping is nice, and I'd hate to toss it out.

Perhaps one way to narrow it down would be to try the same workload on a 12.10 Quantal instance. I'll try to kick that off in the next couple of hours.