Comment 35 for bug 666211

Revision history for this message
David Taylor (david-taylor) wrote :

It sounds like I've bumped into the identical problem (twice so far!) as MatthiasP (mpdude).

I'm using ami-af7e2eea in us-west-1 on c1.xlarge. I have 8 x 128GB EBS volumes in a RAID10 array using mdadm. At some point /dev/md0 would "freeze" and load would shoot up from <5 to >300-400.

Any attempts to interrogate the mounted filesystem would hang and be uninterruptible.

When I ran "mdadm --examine" on each of the devices it returned "state: clean" on all of them except one. That command would never return and I'd have to Ctrl-C to interrupt it.

I had the same log messages in /var/log/syslog, so I won't re-paste them here.

Any ideas what the cause is? Better yet, the fix? I see some suggestions that kernel upgrades might help. What are the prevailing thoughts on that, has it been confirmed?

Also, how did you perform the kernel upgrade? Did you build your own AMI or are you upgrading it after launch?

Cheers,
David.