Bug #361689 “hald crashed with SIGSEGV in hotplug_event_begin_ad...” : Bugs : hal package : Ubuntu

Revision history for this message

Chris Coulson (chrisccoulson) wrote on 2009-04-15:

#1

Thank you for taking the time to report this bug and helping to make Ubuntu better. Please try to obtain a backtrace following the instructions at http://wiki.ubuntu.com/DebuggingProgramCrash and upload the backtrace (as an attachment) to the bug report. This will greatly help us in tracking down your problem.

Changed in hal (Ubuntu):
status:	New → Incomplete

Revision history for this message

Noumayos (noumayos) wrote on 2009-04-16:

#2

gdb-hald.txt (9.0 KiB, text/plain)

Thank you for your work.

Revision history for this message

StoatWblr (stoatwblr) wrote on 2009-04-16:

#3

I am seeing the same segfault when hald probes my md raid1 devices.

Revision history for this message

Chris Coulson (chrisccoulson) wrote on 2009-04-16:

#4

Thanks. It would also be useful if you could run "sudo hald --verbose=yes --daemon=no 2>&1 | tee ~/hald.log", then repeat the steps that trigger the crash and attach the log here.

Thanks

Revision history for this message

Chris Coulson (chrisccoulson) wrote on 2009-04-16:

#5

Would you also mind trying the build of HAL from my PPA [1]?

[1] - https://launchpad.net/~chrisccoulson/+archive/ppa

Noumayos (noumayos) on 2009-04-17

summary:

- hald segfault when using a raid 0 volume
+ hald segfault when using a raid volume

Revision history for this message

Noumayos (noumayos) wrote on 2009-04-17: Re: hald segfault when using a raid volume

#6

hald.log (2.5 MiB, text/plain)

Please find attached the requested log

Revision history for this message

Noumayos (noumayos) wrote on 2009-04-17:

#7

hald-chriscoulson.log (2.5 MiB, text/plain)

Your build seems to work.

Revision history for this message

Chris Coulson (chrisccoulson) wrote on 2009-04-17:

#8

Thanks. I don't know if my patch is the right way to fix it. With my build, could you also please run "lshal > lshal.log" after assembling the raid volume, and attach "lshal.log" to the bug report? Once that is done, I will send all this upstream.

Thanks

Changed in hal (Ubuntu):
importance:	Undecided → Medium

Revision history for this message

StoatWblr (stoatwblr) wrote on 2009-04-17:

#9

Your patch is working for me.

I'll leave it to the original poster to post his lshal.log unless you'd like mine as well.

This bug only manifested on 2.6.28-11 - booting 2.6.27-11 on Jaunty beta was fine.

Revision history for this message

Chris Coulson (chrisccoulson) wrote on 2009-04-17:

#10

If you can provide the output, then it would be appreciated (from both kernels).

Revision history for this message

Noumayos (noumayos) wrote on 2009-04-17:

#11

lshal.log (162.5 KiB, text/plain)

Please find the log attached.

Revision history for this message

Chris Coulson (chrisccoulson) wrote on 2009-04-17:

#12

Thanks Noumayos. That was before you assembled your raid array though wasn't it?

Revision history for this message

Noumayos (noumayos) wrote on 2009-04-17:

#13

The result of lshal is the same before and after the raid array.

Revision history for this message

Chris Coulson (chrisccoulson) wrote on 2009-04-19:

#14

Would you mind running "lshal -m", assembling your array and then posting any output?

Thanks

Revision history for this message

Noumayos (noumayos) wrote on 2009-04-20:

#15

I have no output when assembling my array.

Revision history for this message

StoatWblr (stoatwblr) wrote on 2009-04-20:

#16

lshal.log (167.8 KiB, text/plain)

Here's my lshal.log. As I said, your updated package is working for me (RAID1)

I hope this helps.

Revision history for this message

software-schlosser (software-schlosser) wrote on 2009-04-20:

#17

Works for me too. Good work, many thanks! :)

Revision history for this message

Yakov Shafranovich (launchpad-net-shaftek) wrote on 2009-04-24:

#18

Hi,

I upgraded today from Intrepid to Jaunty, and had the same problem with no mouse/keyboard in X, and a message in system log about a segfault with HAL. I am using software raid.

I downloaded and installed the package supplied by Chris and it works. I hope this patch will be applied to the official package as well.

Thanks for the help!

Revision history for this message

Chris Morgan (chmorgan) wrote on 2009-04-26:

#19

Also used the hald supplied by Chris and the keyboard and mouse work again.

Revision history for this message

BobMcD (mcbobbo) wrote on 2009-04-28:

#20

One more: I also used the hald supplied by Chris and it fixed it. Same symptom - md's causing hal to crash.

Revision history for this message

Russell Davies (russelldavies) wrote on 2009-04-29:

#21

lshal.log (101.9 KiB, text/plain)

I can confirm this when using a RAID 10 array. Chris's HAL builds also fixed the problem for me.

Revision history for this message

Sergey Nizovtsev (snizovtsev) wrote on 2009-05-01:

#22

Chris's HAL builds helped for me too. I think that the bug status should be 'In progress' instead of 'Incomplete'.

Revision history for this message

mobrien118 (mobrien118) wrote on 2009-05-06:

#23

How is it possible that this bug is only listed as "Medium" importance?!?!?

I discovered what I think is this bug when I upgraded my system using system update. Upon a later reboot I found that I had no access to the system whatsoever. After trying multiple things to get back in from the root recovery console, I re-formatted, losing a lot of configuration.

Fortunately, I am somewhat tech savvy and I didn't lose everything, but this cost me 2 days and will probably cost more. And for the average user, this could equate to complete data loss. I would think this would be a high importance bug since it disables a working system!

Does anyone agree with me, or am I missing something here?

--mobrien118

Revision history for this message

mobrien118 (mobrien118) wrote on 2009-05-06:

#24

FYI, it looks like Chris's PPA packages fixed it, though.

Thank goodness! If I had been remote from this machine (which I will be for the next few months) it would have been a nightmare!

Revision history for this message

Chris Morgan (chmorgan) wrote on 2009-05-06:

#25

Mobrien118, I agree that there should be some consideration of its importance. Maybe it isn't that big of a deal since few people have raid sets on their computers. It cost me several hours of rebooting and googling before I thought to boot into recovery console, install openssh and then log in with another machine to look at the dmesg output. I considered myself fortunate to have stumbled upon the solution here since the failure case is very confusing. If hald is so critical maybe there should be a better way of reporting these errors to the user.

Revision history for this message

Chris Coulson (chrisccoulson) wrote on 2009-05-06:

#26

I've not had much time to look any further at this, and I haven't proposed my patch as a fix yet because I don't know if it is the right way to fix it. What I need to do really is send this bug report upstream, and also have a play around with a mdraid setup myself, but I don't have a clue how to set one of those up.

Perhaps someone here could help me set one up ;)

Changed in hal (Ubuntu):
status:	Incomplete → Confirmed

Revision history for this message

In freedesktop.org Bugzilla #21603, Chris Coulson (chrisccoulson) wrote on 2009-05-06:

#27

When assembling certain MD raid devices, hald crashes:

#0  0x0000000000435b05 in hotplug_event_begin_add_blockdev (sysfs_path=0x26c0130 "/sys/devices/virtual/block/md0/md0p1", device_file=<value optimized out>, is_partition=<value optimized out>, parent=0x260bca0, end_token=0x26c0020) at blockdev.c:1501
	sysfs_path_len = <value optimized out>
	is_physical_partition = <value optimized out>
	volume_label = 0x2681390 ""
	buf = "Volume\000\0009\001l\002\000\000\000\0001\n\000\000\000\000\000\000Ù\202\\\005Ê\177\000\000\000Þ\202\005Ê\177\000\000\037^Y\005Ê\177\000\000\000\206`\002\000\000\000\000 \000l\002\000\000\000"
	major_minor = <value optimized out>
	d = (HalDevice *) 0x267de80
	major = 259
	minor = 0
	is_fakevolume = 0
	sysfs_path_real = 0x2693670 "/sys/devices/virtual/block/md0/md0p1"
	floppy_num = <value optimized out>
	is_device_mapper = 0
	is_md_device = 1
	is_cciss_device = 0
	md_number = 0
	__func__ = "hotplug_event_begin_add_blockdev"
#1  0x0000000000425d72 in hotplug_event_begin_sysfs (hotplug_event=0x26c0020) at hotplug.c:220
	parent = (HalDevice *) 0x0
	range = 1
	is_partition = 1
	d = (HalDevice *) 0x0
	subsystem = "0Rë\rÿ\177\000\000S£D\000\000\000\000\0000\001l\002\000\000\000\0008\227Á\004Ê\177\000\000\001\200û\000\000\000\0000\001l\002\000\000\000\0000\001l\002\000\000\000\0000\001l\002\000\000\000\0000\001l\002\000\000\000\000T\001l\002\000\000\000\000/\003l\002\000\000\000\0000\001l\002\000\000\000\000/\003l\002", '\0' <repeats 44 times>, " \000\000\000\004\000\000\000 \020\000\000\000\000\000\000\000\000è\004Ê\177\000\000\000\000\000\000\000\000\000\000\001\000\000\000\000\000\000\000\000ªè\004Ê\177\000\0000\020", '\0' <repeats 14 times>, "\b\000\000\000\000\000\000\000pªè\004Ê\177\000\000ÿÿÿÿ\000\000\000\000Tl\\\005Ê\177"...
	subsystem_target = <value optimized out>
	__func__ = "hotplug_event_begin_sysfs"
#2  0x00000000004261c8 in hotplug_event_process_queue () at hotplug.c:295
	hotplug_event = (HotplugEvent *) 0x26c0020
	lp = (GList *) 0x2683da0
	lp2 = (GList *) 0x0
	processing = 1
	__func__ = "hotplug_event_process_queue"
#3  0x0000000000424f82 in hald_udev_data (source=<value optimized out>, condition=<value optimized out>, user_data=<value optimized out>) at osspec.c:259
	fd = <value optimized out>
	smsg = {msg_name = 0x0, msg_namelen = 0, msg_iov = 0x7fff0deb53d0, msg_iovlen = 1, msg_control = 0x7fff0deb63e0, msg_controllen = 32, msg_flags = 0}
	cmsg = <value optimized out>
	iov = {iov_base = 0x7fff0deb53e0, iov_len = 4096}
	cred = <value optimized out>
	cred_msg = "\034\000\000\000\000\000\000\000\001\000\000\000\002\000\000\000Í\034", '\0' <repeats 13 times>
	buf = "add@/devices/virtual/block/md0/md0p1\000UDEV_LOG=3\000ACTION=add\000DEVPATH=/devices/virtual/block/md0/md0p1\000SUBSYSTEM=block\000DEVTYPE=partition\000SEQNUM=1723\000MAJOR=259\000MINOR=0\000DEVLINKS=/dev/block/259:0\000DEVNAME=/d"...
	bufpos = 209
	action = 0x7fff0deb5417 "add"
	__func__ = "hald_udev_data"
#4  0x00007fca055a420a in g_main_context_dispatch () from /usr/lib/libglib-2.0.so.0
No symbol table info available.
#5  0x00007fca055a78e0 in ?? () from /usr/lib/libglib-2.0.so.0
No symbol table info available.
#6  0x00007fca055a7dad in g_main_loop_run () from /usr/lib/libglib-2.0.so.0
No symbol table info available.
#7  0x0000000000414005 in main (argc=233531616, argv=<value optimized out>) at hald.c:821
	loop = (GMainLoop *) 0x260b2c0
	path = <value optimized out>
	newpath = "/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/usr/lib/hal:/usr/lib/hal/scripts\000\000t\t²\004Ê\177\000\000.\000\000\000ÿ\177\000\0008õØ\003\000\000\000\000pgë\rÿ\177\000\000\030ië\rÿ\177\000\000h\v²\004Ê\177\000\000\000\000\000\000\000\000\000\000\220\233é\005Ê\177\000\000¸¤é\005Ê\177\000\000~E@\000\000\000\000\000\020Ò²\004Ê\177\000\000è\024@\000\000\000\000\000\000\000\000\000\001\000\000\000"...
	opt_child_timeout = 250
	p_error = (PolKitError *) 0x0
	__func__ = "main"
	long_options = {{name = 0x43e3e3 "exit-after-probing", has_arg = 0, flag = 0x0, val = 0}, {name = 0x43e466 "daemon", has_arg = 1, flag = 0x0, val = 0}, {name = 0x43e404 "verbose", has_arg = 1, flag = 0x0, val = 0}, {name = 0x43e40c "retain-privileges", has_arg = 0, flag = 0x0, val = 0}, {name = 0x43e3f6 "child-timeout", has_arg = 1, flag = 0x0, val = 0}, {name = 0x43e41e "use-syslog", has_arg = 0, flag = 0x0, val = 0}, {name = 0x43e3c1 "help", has_arg = 0, flag = 0x0, val = 0}, {name = 0x44c958 "version", has_arg = 0, flag = 0x0, val = 0}, {name = 0x0, has_arg = 0, flag = 0x0, val = 0}}
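The `buf` value in frame #3 shows the shape of the udev event hald received: an `action@devpath` header followed by NUL-separated `KEY=value` fields. As a rough illustration only, here is a Python sketch that splits such a payload. The function name `parse_udev_event` is hypothetical, hald itself is written in C and does this differently, and the sample contains only the fields that appear complete in the backtrace (the truncated `DEVLINKS`/`DEVNAME` tail is left out).

```python
def parse_udev_event(payload: bytes):
    """Split an 'action@devpath' header and NUL-separated KEY=value fields."""
    fields = payload.split(b"\0")
    action, _, devpath = fields[0].decode().partition("@")
    env = {}
    for field in fields[1:]:
        if not field:
            continue  # skip the empty piece after a trailing NUL
        key, sep, value = field.decode().partition("=")
        if sep:  # ignore anything that is not KEY=value
            env[key] = value
    return action, devpath, env


# Fields copied from the `buf` shown in frame #3 (complete fields only).
SAMPLE_EVENT = (
    b"add@/devices/virtual/block/md0/md0p1\0"
    b"UDEV_LOG=3\0ACTION=add\0"
    b"DEVPATH=/devices/virtual/block/md0/md0p1\0"
    b"SUBSYSTEM=block\0DEVTYPE=partition\0"
    b"SEQNUM=1723\0MAJOR=259\0MINOR=0\0"
)

action, devpath, env = parse_udev_event(SAMPLE_EVENT)
print(action, devpath, env["DEVTYPE"], env["MAJOR"], env["MINOR"])
```

The decoded event is an `add` for a partition (`DEVTYPE=partition`) of an MD device, which matches `is_partition = 1` and `is_md_device = 1` in the frames above.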

Here is the output of hald --verbose=yes --daemon=no when adding the device:

19:58:32.177 [I] osspec.c:251: SEQNUM=1883, ACTION=change, SUBSYSTEM=block, DEVPATH=/sys/devices/virtual/block/md0, DEVNAME=/dev/md0, IFINDEX=0
19:58:32.177 [I] hotplug.c:435: checking event /sys/devices/virtual/block/md0
19:58:32.177 [I] blockdev.c:903: block_add: sysfs_path=/sys/devices/virtual/block/md0 dev=/dev/md0 is_part=0, parent=0x00000000
19:58:32.177 [I] blockdev.c:915: Handling /dev/md0 as MD device
19:58:32.177 [I] blockdev.c:727: In refresh_md_state() for '/sys/devices/virtual/block/md0'
19:58:32.177 [I] blockdev.c:729:  MD Level is 'raid0'
19:58:32.177 [W] blockdev.c:735: Cannot get sync_action for /sys/devices/virtual/block/md0
19:58:32.177 [W] blockdev.c:1577: Not adding device object
19:58:32.177 [D] hotplug.c:453: events queued = 0, events in progress = 0
19:58:32.177 [D] hotplug.c:458: Hotplug-queue empty now ... no hotplug events in progress
19:58:32.201 [I] osspec.c:251: SEQNUM=1884, ACTION=add, SUBSYSTEM=block, DEVPATH=/sys/devices/virtual/block/md0/md0p1, DEVNAME=/dev/md0p1, IFINDEX=0
19:58:32.201 [I] hotplug.c:435: checking event /sys/devices/virtual/block/md0/md0p1
19:58:32.201 [I] blockdev.c:903: block_add: sysfs_path=/sys/devices/virtual/block/md0/md0p1 dev=/dev/md0p1 is_part=1, parent=0x00000000
19:58:32.201 [I] blockdev.c:915: Handling /dev/md0p1 as MD device
Segmentation fault
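For anyone triaging a similar multi-megabyte hald.log, a small script can report the last source location logged before the crash. This is a minimal Python sketch, assuming the line layout seen in the excerpt above (`time [level] file:line: message`); the names `LOG_LINE` and `last_entry_before_crash` are made up for illustration.

```python
import re

# Matches hald verbose lines such as:
# 19:58:32.201 [I] blockdev.c:915: Handling /dev/md0p1 as MD device
LOG_LINE = re.compile(
    r"^(?P<time>\d{2}:\d{2}:\d{2}\.\d{3}) \[(?P<level>[A-Z])\] "
    r"(?P<file>[\w.]+):(?P<line>\d+): (?P<msg>.*)$"
)


def last_entry_before_crash(log_text):
    """Return the last parsed entry before 'Segmentation fault', or None."""
    last = None
    for raw in log_text.splitlines():
        if raw.strip() == "Segmentation fault":
            return last
        match = LOG_LINE.match(raw)
        if match:
            last = match.groupdict()
    return None  # no crash marker found


SAMPLE_LOG = """\
19:58:32.201 [I] hotplug.c:435: checking event /sys/devices/virtual/block/md0/md0p1
19:58:32.201 [I] blockdev.c:903: block_add: sysfs_path=/sys/devices/virtual/block/md0/md0p1 dev=/dev/md0p1 is_part=1, parent=0x00000000
19:58:32.201 [I] blockdev.c:915: Handling /dev/md0p1 as MD device
Segmentation fault
"""

entry = last_entry_before_crash(SAMPLE_LOG)
print(entry["file"], entry["line"], entry["msg"])
```

On the excerpt above this points at blockdev.c:915, the "Handling /dev/md0p1 as MD device" message, which is the last thing hald logged before the segfault.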

Revision history for this message

In freedesktop.org Bugzilla #21603, Chris Coulson (chrisccoulson) wrote on 2009-05-06:

#28

Created an attachment (id=25568)
Patch which fixes the issue (don't assume that the parent has storage.drive_type property)
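The patch summary can be illustrated with a toy model. The Python sketch below is not HAL's code (HAL is written in C); the `Device` class and both helper functions are hypothetical, and only the property name `storage.drive_type` comes from the patch description. It shows why using a property lookup unchecked fails when the parent lacks the property, and how a guard avoids that.

```python
class Device:
    """Toy stand-in for a HAL device object: a bag of string properties."""

    def __init__(self, udi, **props):
        self.udi = udi
        self.props = dict(props)

    def get(self, key):
        # Returns None when the property is absent (the C analogue is NULL).
        return self.props.get(key)


def drive_type_unchecked(parent):
    # Crashing pattern: assumes the parent always carries the property.
    return parent.get("storage.drive_type").lower()


def drive_type_checked(parent):
    # Guarded pattern: tolerate parents without the property.
    value = parent.get("storage.drive_type")
    return value.lower() if value is not None else None


# A parent with no storage properties, like the root computer object
# that MD partitions end up attached to in this bug.
root = Device("computer")
disk = Device("disk", **{"storage.drive_type": "Disk"})

print(drive_type_checked(disk))  # disk
print(drive_type_checked(root))  # None
```

In Python the unchecked variant raises `AttributeError` on `root`; in C the equivalent unchecked use of a NULL string is the kind of access that ends in SIGSEGV.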

Revision history for this message

mobrien118 (mobrien118) wrote on 2009-05-06: Re: hald segfault when using a raid volume

#29

I think I can get you started.

I mean, the first step to creating a RAID volume is to think it out. Where do you need redundancy? Where do you need speed? Then map out your partitions (especially if you have different sized disks).

Remember that RAID will cause a slight (in the case of RAID0 or RAID1) to slightly greater (RAID5 or RAID6) processor and I/O load. That is the trade-off for getting better overall disk performance.

Although it is supposedly not needed to RAID0 your swap partitions across disks (supposedly the swap daemon manages multiple disks very well) it doesn't hurt to do so and is an easy and safe way to get started with RAID. You might consider making this your test case.

This page lays out mdadm and Linux RAID pretty well: http://ubuntuforums.org/showthread.php?t=408461

Basically:
1. format disks you want to use as "RAID" partitions
2. create a RAID array using "mdadm --create /dev/md0 --level=[level] --raid-devices=[number of devices] [device1] [device2]...[deviceN]"
3. assemble the array
4. format the array in the filesystem of your choice (like any other partition)
5. mount it like you would any other disk partition

Pretty simple, eh?
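To make step 2 concrete, here is a small Python sketch that only builds and prints the mdadm command line from the template above, without running anything. The helper name `mdadm_create_argv` and the device names `/dev/sdb1` and `/dev/sdc1` are placeholders; substitute your own partitions and double-check them, since creating an array destroys existing data on its members.

```python
import shlex


def mdadm_create_argv(array, level, devices):
    """Build the argument list for the 'mdadm --create' template above."""
    if not devices:
        raise ValueError("at least one member device is required")
    return [
        "mdadm",
        "--create",
        array,
        f"--level={level}",
        f"--raid-devices={len(devices)}",
        *devices,
    ]


# Hypothetical two-partition RAID1 array.
argv = mdadm_create_argv("/dev/md0", 1, ["/dev/sdb1", "/dev/sdc1"])
command = " ".join(shlex.quote(arg) for arg in argv)
print(command)
# mdadm --create /dev/md0 --level=1 --raid-devices=2 /dev/sdb1 /dev/sdc1
```

Deriving `--raid-devices` from the device list keeps the count and the list from drifting apart, which is an easy mistake when typing the command by hand.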

Revision history for this message

Chris Coulson (chrisccoulson) wrote on 2009-05-06:

#30

Thanks for your help mobrien118 and Russell (who also contacted me privately with some help). I've managed to recreate the crash now.

Changed in hal (Ubuntu):
assignee:	nobody → Chris Coulson (chrisccoulson)
status:	Confirmed → In Progress

Bug Watch Updater (bug-watch-updater) on 2009-05-06

Changed in hal:
status:	Unknown → Confirmed

Revision history for this message

Chris Coulson (chrisccoulson) wrote on 2009-05-06:

#31

Now I understand it properly, I've re-written the patch, tested it and sent upstream.

Changed in hal (Ubuntu):
status:	In Progress → Triaged

Revision history for this message

mobrien118 (mobrien118) wrote on 2009-05-06:

#32

To the main Ubuntu repos or to your PPA?

Revision history for this message

Chris Coulson (chrisccoulson) wrote on 2009-05-06:

#33

The patch is in my bzr branch, waiting to be merged in to the ubuntu-core-dev branch. I've also sent the patch to the upstream freedesktop bug tracker: https://bugs.freedesktop.org/show_bug.cgi?id=21603

Chris Coulson (chrisccoulson) on 2009-05-07

summary:

- hald segfault when using a raid volume
+ hald crashed with SIGSEGV in hotplug_event_begin_add_blockdev when
+ assembling mdraid devices

Revision history for this message

René Diepstraten (rene-renediepstraten) wrote on 2009-05-14:

#34

lshal.log (183.6 KiB, text/plain)

Also confirmed, problem appeared after creating mdadm raidset and reboot.

Chris' build fixed my problem as well
Thanks!

Revision history for this message

mgcsinc (mgcsinc) wrote on 2009-05-14:

#35

Adding my voice to the chorus - same problem.

Haven't tried the patch yet (I'm away from the box right now), but will ASAP. I agree that there should be consideration of increasing importance if that's still appropriate.

Revision history for this message

In freedesktop.org Bugzilla #21603, Martin Pitt (pitti) wrote on 2009-05-15:

#36

Thank you! Committed in b35bf1f

Revision history for this message

Martin Pitt (pitti) wrote on 2009-05-15:

#37

I committed the fix upstream, thanks Chris!

Revision history for this message

Launchpad Janitor (janitor) wrote on 2009-05-15:

#38

This bug was fixed in the package hal - 0.5.12+git20090512-0ubuntu2

---------------
hal (0.5.12+git20090512-0ubuntu2) karmic; urgency=low

  * debian/patches/50_no_crash_on_md_blockdev.patch:
    - When adding a block device, don't assume that the parent
      has storage capability. This fixes a crash where the device
      is re-parented to the root computer device object (such as
      with mdraid devices). LP: #361689.

-- Chris Coulson <email address hidden> Fri, 15 May 2009 18:34:58 +0200

Changed in hal (Ubuntu):
status:	Triaged → Fix Released

Bug Watch Updater (bug-watch-updater) on 2009-05-16

Changed in hal:
status:	Confirmed → Fix Released

Revision history for this message

Davíð Steinn Geirsson (david-dsg) wrote on 2009-05-18:

#39

So... may I suggest the new hald build be uploaded to the jaunty repository?

I've spent the last 3 hours debugging what seemed to be a DBus problem, but turned out to be this. Then I needed to pull the fix from karmic because the fix is not available in the jaunty repo.

Revision history for this message

René Diepstraten (rene-renediepstraten) wrote on 2009-05-19:

#40

Update seems to be in repository already, waiting for the index to be updated.
These builds can be downloaded @
http://nl3.archive.ubuntu.com/ubuntu/pool/main/h/hal/hal_0.5.12+git20090512-0ubuntu2_i386.deb
http://nl3.archive.ubuntu.com/ubuntu/pool/main/h/hal/hal_0.5.12+git20090512-0ubuntu2_amd64.deb

Revision history for this message

Eric D (ericdeshayes) wrote on 2009-05-25:

#41

excuse my ignorance, but do we have any idea when that fix will be available when I update my system?
shouldn't the severity be high as it breaks any installation that is using raid afaik? shouldn't that issue be listed in the release note (i would not have upgraded if I had known..).

I've updated on Saturday from 8.10 and now nothing is working and I am not too keen on applying a temporary fix, knowing that it has a few dependencies (libblkid1). the alternative would be to re-install 8.10 unless I am told a fix would be available in the next few days.

many thanks for your work and for your answer.
eric

Revision history for this message

ded (ded-launchpad) wrote on 2009-05-25:

#42

Same deal here, except that the fixed hal .deb depends on version 2.15 of libblkid1 which is not available, at least on amd64, in the repositories either.

My system is now working (after some painful googling with links) after applying the following:

http://launchpadlibrarian.net/26631965/libblkid1_2.15-1ubuntu2_amd64.deb

Then applying Chris's patch from above.

Gigthanks, Chris. And like everyone else, I think the raid-running world ought to be warned off of Jaunty until this is fixed in the distribution.

Regards,
ded

Revision history for this message

ded (ded-launchpad) wrote on 2009-05-27:

#43

Spoke too soon. After installing both Chris's fixed hal and the libblkid1 update above, my AMD64 system failed to boot, pausing about 1/5 of the way through. If I hit Alt-Ctl-Del, I could get the boot to resume but with my / mounted read-only. I suspect something is wrong with the libblkid1 that is causing the problem.

Anyone else seeing this?

Revision history for this message

Eric D (ericdeshayes) wrote on 2009-05-27:

#44

Yes, I have the same problem.
From my quick investigation, the boot hangs when the findfs binary is called; execution gets stuck in that binary.

Revision history for this message

ded (ded-launchpad) wrote on 2009-05-27:

#45

Eric, thanks for confirming. Is your system AMD64 or i386?

Revision history for this message

Chris Coulson (chrisccoulson) wrote on 2009-05-27:

#46

Please don't do silly things like install libblkid from karmic - that's totally unsupported and is likely to break your machine.

Everyone is at UDS at the moment but I'll see if this could be considered for a SRU when everyone gets back.

Revision history for this message

Eric D (ericdeshayes) wrote on 2009-05-27:

#47

my system is AMD64.

Revision history for this message

ded (ded-launchpad) wrote on 2009-05-27:

#48

It's not something I would have thought of on my own, but it appears to be a dependency in the amd .deb hal package you posted above:

root@saturn:/home/ded/Downloads# dpkg -i hal_0.5.12+git20090512-0ubuntu2_amd64.deb
(Reading database ... 94344 files and directories currently installed.)
Preparing to replace hal 0.5.12~rc1+git20090403-0ubuntu1 (using hal_0.5.12+git20090512-0ubuntu2_amd64.deb) ...
* Stopping Hardware abstraction layer hald [ OK ]
Unpacking replacement hal ...
dpkg: dependency problems prevent configuration of hal:
hal depends on libblkid1 (>= 2.15~rc2-1ubuntu1); however:
Version of libblkid1 on system is 1.41.4-1ubuntu1.
dpkg: error processing hal (--install):
dependency problems - leaving unconfigured
Processing triggers for man-db ...
Errors were encountered while processing:
hal
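For reference, the unmet dependency can be pulled out of dpkg output like the above with a short script. This is only a sketch in Python, assuming the two message shapes seen in this paste ("X depends on Y (op version); however:" and "Version of Y on system is V."); the regex names and `unmet_dependencies` are made up for illustration, and other dpkg messages are not handled.

```python
import re

DEPENDS = re.compile(
    r"^\s*(?P<pkg>\S+) depends on (?P<dep>\S+) "
    r"\((?P<op>[<>=]+) (?P<wanted>[^)]+)\); however:$"
)
INSTALLED = re.compile(
    r"^\s*Version of (?P<dep>\S+) on system is (?P<installed>\S+?)\.$"
)


def unmet_dependencies(dpkg_output):
    """Pair each 'depends on' line with the installed version that follows it."""
    results = []
    pending = None
    for line in dpkg_output.splitlines():
        match = DEPENDS.match(line)
        if match:
            pending = dict(match.groupdict(), installed=None)
            results.append(pending)
            continue
        match = INSTALLED.match(line)
        if match and pending and match.group("dep") == pending["dep"]:
            pending["installed"] = match.group("installed")
    return results


SAMPLE_DPKG = """\
dpkg: dependency problems prevent configuration of hal:
hal depends on libblkid1 (>= 2.15~rc2-1ubuntu1); however:
Version of libblkid1 on system is 1.41.4-1ubuntu1.
"""

for dep in unmet_dependencies(SAMPLE_DPKG):
    print(dep["pkg"], "needs", dep["dep"], dep["op"], dep["wanted"],
          "but installed is", dep["installed"])
```

The script only extracts the two version strings; it does not compare them, since Debian version ordering has its own rules.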

Is karmic poison? Why the warning?

Thanks.

Revision history for this message

ded (ded-launchpad) wrote on 2009-05-28:

#49

Chris et al.,

OK, I get it now. karmic is the next release---my bad, just an end user here.

Still, is there some repository that has version 2.15 of libblkid1 for jaunty? Would someone please post a link or the sources entry for such a repository? Looks like Chris's fix to hal needs it from somewhere at least on the 64-bit systems.

Thanks.

Revision history for this message

ded (ded-launchpad) wrote on 2009-06-04:

#50

All,

Has everyone else been able to work around this issue, or is it just me? Since my jaunty update, I can't get mouse or keyboard with mdadm installed.

I was hoping someone would tell me what to do about the libblkid1 dependency in Chris's hal update---at least on amd64---but no traffic here for several days.

Chris, any help? Any one else? Does it work on an i386?

Regards,

Martin Pitt (pitti) on 2009-06-04

tags:

added: regression-release

Chris Coulson (chrisccoulson) on 2009-06-04

Changed in hal (Ubuntu Jaunty):
assignee:	nobody → Chris Coulson (chrisccoulson)
importance:	Undecided → Medium
status:	New → In Progress

Revision history for this message

Chris Coulson (chrisccoulson) wrote on 2009-06-04:

#51

hal_0.5.12~rc1+git20090403-0ubuntu2.debdiff (3.2 KiB, text/plain)

Here's a debdiff for the Jaunty update

Changed in hal (Ubuntu Jaunty):
status:	In Progress → Triaged

Revision history for this message

Martin Pitt (pitti) wrote on 2009-06-05:

#52

Accepted hal into jaunty-proposed, the package will build now and be available in a few hours. Please test and give feedback here. See https://wiki.ubuntu.com/Testing/EnableProposed for documentation on how to enable and use -proposed. Thank you in advance!

Changed in hal (Ubuntu Jaunty):
status:	Triaged → Fix Committed
tags:	added: verification-needed

Revision history for this message

Patryk Bajer (bayger) wrote on 2009-06-09:

#53

The patch from jaunty-proposed WORKS for me! Thank you!

Revision history for this message

ded (ded-launchpad) wrote on 2009-06-09:

#54

Patch also worked for me on AMD64. Thanks, Chris.

Martin Pitt (pitti) on 2009-06-09

tags:

added: verification-done
removed: verification-needed

Revision history for this message

Launchpad Janitor (janitor) wrote on 2009-06-11:

#55

This bug was fixed in the package hal - 0.5.12~rc1+git20090403-0ubuntu2

---------------
hal (0.5.12~rc1+git20090403-0ubuntu2) jaunty-proposed; urgency=low

  * debian/patches/50_no_crash_on_md_blockdev.patch:
    - When adding a block device, don't assume that the parent
      has storage capability. This fixes a crash where the device
      is re-parented to the root computer device object (such as
      with mdraid devices). LP: #361689.

-- Chris Coulson <email address hidden> Fri, 05 Jun 2009 12:25:50 +0200

Changed in hal (Ubuntu Jaunty):
status:	Fix Committed → Fix Released

Revision history for this message

Oli Wade (olithered) wrote on 2009-06-17:

#56

After applying this update I am having trouble with my X server. There are the following errors in the log:

====
(EE) config/hal: NewInputDeviceRequest failed (8)
(EE) config/hal: NewInputDeviceRequest failed (8)
(EE) config/hal: NewInputDeviceRequest failed (8)
(EE) config/hal: NewInputDeviceRequest failed (8)
(EE) config/hal: NewInputDeviceRequest failed (8)
(EE) config/hal: NewInputDeviceRequest failed (8)
(EE) config/hal: NewInputDeviceRequest failed (8)
(EE) config/hal: NewInputDeviceRequest failed (8)
(EE) config/hal: NewInputDeviceRequest failed (8)
(EE) config/hal: NewInputDeviceRequest failed (8)
(EE) config/hal: NewInputDeviceRequest failed (8)
(EE) config/hal: NewInputDeviceRequest failed (8)
(EE) config/hal: NewInputDeviceRequest failed (8)
====

Do you think it could be a side effect?

Revision history for this message

Martin Pitt (pitti) wrote on 2009-06-18: Re: [Bug 361689] Re: hald crashed with SIGSEGV in hotplug_event_begin_add_blockdev when assembling mdraid devices

#57

Oli Wade [2009-06-17 9:41 -0000]:
> After applying this update I am having trouble with my X server. There
> are the following errors in the log:
>
> ====
> (EE) config/hal: NewInputDeviceRequest failed (8)
>
> Do you think it could be a side effect?

The hal update didn't change anything wrt. input devices. If you
downgrade to the previous hal again [1], does it work again?

Does that only happen right after the package upgrade, or also after a
restart of the machine?

[1] sudo apt-get install hal/jaunty-updates
--
Martin Pitt | http://www.piware.de
Ubuntu Developer (www.ubuntu.com) | Debian Developer (www.debian.org)

Revision history for this message

Oli Wade (olithered) wrote on 2009-06-18:

#58

It happened after a reboot - I blamed this update due to the "hal" in the error message.

I've downgraded ("sudo apt-get install hal/jaunty libhal1/jaunty libhal-storage1/jaunty") but the problem remained through several reboots until I did a shutdown and then poweron.

Therefore I suspect some part(s) of the hardware might have been in a weird state and there is nothing wrong with the update.

Revision history for this message

Martin Pitt (pitti) wrote on 2009-06-18:

#59

Oli Wade [2009-06-18 8:56 -0000]:
> Therefore I suspect some part(s) of the hardware might have been in a
> weird state and there is nothing wrong with the update.

OK, thanks for checking!

Revision history for this message

mobrien118 (mobrien118) wrote on 2009-07-07:

#60

Ahhh! Is this bug back?

The server I was having a problem with is 1000 miles away from me now and I rebooted it and it didn't come back up. Thinking back a few hours, I remember that "update-manager" installed a HAL update.

Noooooooooooo! I need my server and this is going to force me into weeks of downtime! How is this not a "Critical" bug, and how did this update cause a regression?

This is absolutely horrible. The first time I experienced this bug, it cost me hours/possibly days of troubleshooting, now I have indefinite unscheduled downtime. Seriously CRITICAL!

Anyone have any suggestions?

Revision history for this message

mobrien118 (mobrien118) wrote on 2009-07-07:

#61

Didn't mean to sound upset at anyone in my previous post. I know that Chris did an awesome job with the first patch. I'm just upset and looking for a support group :-)

The sooner we can get a permanent fix for this issue, the sooner I can get a good night's sleep.

Please, anyone who is capable, help out with this!

Also, can we change the status back to "confirmed" or "incomplete" so it will bubble back up and get noticed?

Revision history for this message

Chris Coulson (chrisccoulson) wrote on 2009-07-07:

#62

This bug hasn't regressed, as the recent HAL update was completely unrelated, and didn't even touch any code AFAICT. If you're experiencing any issues, it's definitely not related to this bug, even if you are experiencing a HAL crash.

You should open a new bug report, preferably by submitting a crash report using Apport. You might need to enable apport in /etc/default/apport and restart though.

Bug Watch Updater (bug-watch-updater) on 2010-09-13

Changed in hal:
importance:	Unknown → Critical

Bug Watch Updater (bug-watch-updater) on 2011-01-25

Changed in hal:
importance:	Critical → Unknown

Bug Watch Updater (bug-watch-updater) on 2011-02-03

Changed in hal:
importance:	Unknown → Critical


Affects	Status	Importance	Assigned to
HAL	Fix Released	Critical	freedesktop-bugs #21603
hal (Ubuntu)	Fix Released	Medium	Chris Coulson
hal (Ubuntu Jaunty)	Fix Released	Medium	Chris Coulson
