after upgrade to lucid, xorg frequently fails to start properly using nvidia-current

Bug #561049 reported by bcrowell
38
This bug affects 6 people
Affects Status Importance Assigned to Milestone
nvidia-graphics-drivers (Ubuntu)
Invalid
Undecided
Unassigned

Bug Description

After upgrading to lucid, I find that xorg only starts successfully about 50% of the time. I am using nvidia-current. (Tried switching to nv and nouveau, but neither one worked properly.) I am attaching two xorg log files, one from a successful startup and one from an unsuccessful one.

ProblemType: Bug
DistroRelease: Ubuntu 10.04
Package: nvidia-current 195.36.15-0ubuntu2
ProcVersionSignature: Ubuntu 2.6.32-19.28-generic 2.6.32.10+drm33.1
Uname: Linux 2.6.32-19-generic x86_64
NonfreeKernelModules: nvidia
Architecture: amd64
Date: Sun Apr 11 15:36:29 2010
DkmsStatus:
 nvidia-current, 195.36.15, 2.6.32-19-generic, x86_64: installed
 nvidia-current, 195.36.15, 2.6.31-14-generic, x86_64: installed
InstallationMedia: Ubuntu 9.10 "Karmic Koala" - Release amd64 (20091027)
Lsusb:
 Bus 002 Device 001: ID 1d6b:0001 Linux Foundation 1.1 root hub
 Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
ProcCmdLine: BOOT_IMAGE=/boot/vmlinuz-2.6.32-19-generic root=UUID=e71e649e-6128-4ae9-816a-850fd3f1f48b ro quiet splash
ProcEnviron:
 PATH=(custom, user)
 LANG=en_US.utf8
 SHELL=/bin/bash
SourcePackage: nvidia-graphics-drivers
dmi.bios.date: 12/26/2006
dmi.bios.vendor: Award Software International, Inc.
dmi.bios.version: F2
dmi.board.name: M61P-S3
dmi.board.vendor: Gigabyte Technology Co., Ltd.
dmi.board.version: x.x
dmi.chassis.type: 3
dmi.modalias: dmi:bvnAwardSoftwareInternational,Inc.:bvrF2:bd12/26/2006:svn:pn:pvr:rvnGigabyteTechnologyCo.,Ltd.:rnM61P-S3:rvrx.x:cvn:ct3:cvr:
system:
 distro: Ubuntu
 codename: lucid
 architecture: x86_64
 kernel: 2.6.32-19-generic
---
Architecture: amd64
DistroRelease: Ubuntu 10.04
DkmsStatus:
 nvidia-current, 195.36.15, 2.6.32-19-generic, x86_64: installed
 nvidia-current, 195.36.15, 2.6.31-14-generic, x86_64: installed
InstallationMedia: Ubuntu 9.10 "Karmic Koala" - Release amd64 (20091027)
Lsusb:
 Bus 002 Device 001: ID 1d6b:0001 Linux Foundation 1.1 root hub
 Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
NonfreeKernelModules: nvidia
Package: nvidia-graphics-drivers (not installed)
ProcCmdLine: BOOT_IMAGE=/boot/vmlinuz-2.6.32-19-generic root=UUID=e71e649e-6128-4ae9-816a-850fd3f1f48b ro quiet splash
ProcEnviron:
 LANG=en_US.UTF-8
 SHELL=/bin/bash
ProcVersionSignature: Ubuntu 2.6.32-19.28-generic 2.6.32.10+drm33.1
Tags: lucid lucid
Uname: Linux 2.6.32-19-generic x86_64
UserGroups: adm admin cdrom dialout lpadmin plugdev sambashare
dmi.bios.date: 12/26/2006
dmi.bios.vendor: Award Software International, Inc.
dmi.bios.version: F2
dmi.board.name: M61P-S3
dmi.board.vendor: Gigabyte Technology Co., Ltd.
dmi.board.version: x.x
dmi.chassis.type: 3
dmi.modalias: dmi:bvnAwardSoftwareInternational,Inc.:bvrF2:bd12/26/2006:svn:pn:pvr:rvnGigabyteTechnologyCo.,Ltd.:rnM61P-S3:rvrx.x:cvn:ct3:cvr:
system:
 distro: Ubuntu
 codename: lucid
 architecture: x86_64
 kernel: 2.6.32-19-generic

---
Architecture: amd64
DistroRelease: Ubuntu 10.04
DkmsStatus:
 nvidia-current, 195.36.15, 2.6.32-19-generic, x86_64: installed
 nvidia-current, 195.36.15, 2.6.31-14-generic, x86_64: installed
InstallationMedia: Ubuntu 9.10 "Karmic Koala" - Release amd64 (20091027)
Lsusb:
 Bus 002 Device 001: ID 1d6b:0001 Linux Foundation 1.1 root hub
 Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
NonfreeKernelModules: nvidia
Package: nvidia-graphics-drivers (not installed)
ProcCmdLine: BOOT_IMAGE=/boot/vmlinuz-2.6.32-19-generic root=UUID=e71e649e-6128-4ae9-816a-850fd3f1f48b ro quiet splash
ProcEnviron:
 LANG=en_US.UTF-8
 SHELL=/bin/bash
ProcVersionSignature: Ubuntu 2.6.32-19.28-generic 2.6.32.10+drm33.1
Tags: lucid lucid
Uname: Linux 2.6.32-19-generic x86_64
UserGroups: adm admin cdrom dialout lpadmin plugdev sambashare
dmi.bios.date: 12/26/2006
dmi.bios.vendor: Award Software International, Inc.
dmi.bios.version: F2
dmi.board.name: M61P-S3
dmi.board.vendor: Gigabyte Technology Co., Ltd.
dmi.board.version: x.x
dmi.chassis.type: 3
dmi.modalias: dmi:bvnAwardSoftwareInternational,Inc.:bvrF2:bd12/26/2006:svn:pn:pvr:rvnGigabyteTechnologyCo.,Ltd.:rnM61P-S3:rvrx.x:cvn:ct3:cvr:
system:
 distro: Ubuntu
 codename: lucid
 architecture: x86_64
 kernel: 2.6.32-19-generic

Revision history for this message
bcrowell (launchpadcrowell07) wrote :
Revision history for this message
bcrowell (launchpadcrowell07) wrote :
Revision history for this message
bcrowell (launchpadcrowell07) wrote :
Revision history for this message
Fabián Rodríguez (magicfab) wrote :

We're now at 2.6.32-21, have you tried this again ? I am running the same driver -using a C51 though- on 64-bit and can' t reproduce your problem..
When/if it fails again, it woul dbe important to have complete information, by running:
apport-collect 561049

Changed in nvidia-graphics-drivers (Ubuntu):
status: New → Incomplete
Revision history for this message
TJ (tj) wrote :

Are you still experiencing this issue?

If so, can you copy /var/log/kern.log to a safe location immediately after a failure occurs and then upload it to this bug once you have a good session?

E.g.

cp /var/log/kern.log $HOME/

Revision history for this message
bcrowell (launchpadcrowell07) wrote : BootDmesg.txt

apport information

tags: added: apport-collected
description: updated
Revision history for this message
bcrowell (launchpadcrowell07) wrote : CurrentDmesg.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : GdmLog.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : GdmLog1.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : GdmLog2.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : Lspci.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : PciDisplay.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : ProcCpuinfo.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : ProcInterrupts.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : ProcModules.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : RelatedPackageVersions.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : UdevDb.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : UdevLog.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : XorgConf.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : XorgLog.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : XorgLogOld.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote :

Hi, TJ and Fabián Rodríguez -- Thanks for your replies. Yes, I am still having the problem. I just did an apport-collect (see above), and am attaching kern.log. This was with kernel 2.6.32-19, however. I will update to 2.6.32-21 and see if I can reproduce it again.

Revision history for this message
bcrowell (launchpadcrowell07) wrote :

After an apt-get update and an apt-get upgrade, I'm still at kernel 2.6.32-19. Do I need to recompile my kernel by hand or something grungy like that in order to get to -21?

Revision history for this message
bcrowell (launchpadcrowell07) wrote :

After an apt-get update and an apt-get upgrade, I'm still experiencing intermittent failures to boot properly. Out of 9 boots, I had one failure, which was different than what I was experiencing before. I get a black screen with nothing in it but a blinking _ cursor. If I do a ctl-alt-f7, I see this message: "fsck from util-linux-ng 2.17.2 ..." I don't know if this is the same bug or a different one. I've seen reports that there is a problem with Plymouth that occurs when there is an fsck. This might be that problem. I'm unable to collect apprt info or kern.log, because I don't get a login prompt. I'll try some more and see if I can get the same behavior I saw before.

Revision history for this message
bcrowell (launchpadcrowell07) wrote : BootDmesg.txt

apport information

description: updated
Revision history for this message
bcrowell (launchpadcrowell07) wrote : CurrentDmesg.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : GdmLog.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : GdmLog1.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : GdmLog2.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : Lspci.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : PciDisplay.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : ProcCpuinfo.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : ProcInterrupts.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : ProcModules.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : RelatedPackageVersions.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : UdevDb.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : UdevLog.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : XorgConf.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : XorgLog.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote : XorgLogOld.txt

apport information

Revision history for this message
bcrowell (launchpadcrowell07) wrote :

Okay, I got the original bug to happen again, with my freshly updated system. The above apport info is from this occurrence of the bug, and I'm attaching the kern.log file. Since this is occurring with a freshly updated system, I'm going to update the status of the bug from incomplete.

Changed in nvidia-graphics-drivers (Ubuntu):
status: Incomplete → In Progress
Revision history for this message
bcrowell (launchpadcrowell07) wrote :

I've updated the status from "incomplete" to "in progress." Don't know if that was the right thing to do...?

Revision history for this message
TJ (tj) wrote :

There is nothing obvious in the log files to show what the cause of the issue is.

Next time it occurs can you check of the device nodes exist? Here's an example of what you should expect to see:

ls -l /dev/nvidia*
crw-rw-rw- 1 root root 195, 0 2010-04-19 09:36 /dev/nvidia0
crw-rw-rw- 1 root root 195, 255 2010-04-19 09:36 /dev/nvidiactl

Revision history for this message
bcrowell (launchpadcrowell07) wrote :

Okay, here is the result of ls -l /dev/nvidia* from a session that succeeded:

crw-rw-rw- 1 root root 195, 255 2010-04-21 20:32 /dev/nvidiactl

Here it is from a session that was successful:

crw-rw-rw- 1 root root 195, 0 2010-04-21 20:34 /dev/nvidia0
crw-rw-rw- 1 root root 195, 255 2010-04-21 20:34 /dev/nvidiactl

Revision history for this message
bcrowell (launchpadcrowell07) wrote :

Oops, #43 reads "succeeded" on the second line where it should read "failed." The corrected version is this:

Okay, here is the result of ls -l /dev/nvidia* from a session that failed:

crw-rw-rw- 1 root root 195, 255 2010-04-21 20:32 /dev/nvidiactl

Here it is from a session that was successful:

crw-rw-rw- 1 root root 195, 0 2010-04-21 20:34 /dev/nvidia0
crw-rw-rw- 1 root root 195, 255 2010-04-21 20:34 /dev/nvidiactl

Revision history for this message
TJ (tj) wrote : Re: [Bug 561049] Re: after upgrade to lucid, xorg frequently fails to start properly using nvidia-current
Download full text (3.8 KiB)

On Thu, 2010-04-22 at 03:39 +0000, bcrowell wrote:
> Oops, #43 reads "succeeded" on the second line where it should read
> "failed." The corrected version is this:
>
>
> Okay, here is the result of ls -l /dev/nvidia* from a session that
> failed:
>
> crw-rw-rw- 1 root root 195, 255 2010-04-21 20:32 /dev/nvidiactl
>
> Here it is from a session that was successful:
>
> crw-rw-rw- 1 root root 195, 0 2010-04-21 20:34 /dev/nvidia0
> crw-rw-rw- 1 root root 195, 255 2010-04-21 20:34 /dev/nvidiactl
>

So there is a missing device node. This could be because the nvidia
(nvidia-current) kernel module failed to load successfully or some
problem with X driver.

The documentation says (/usr/share/doc/nvidia-current/README.txt.gz):

Q. How and when are the the NVIDIA device files created?

A. Depending on the target system's configuration, the NVIDIA device files
   used to be created in one of three different ways:

      o at installation time, using mknod

      o at module load time, via devfs (Linux device file system)

      o at module load time, via hotplug/udev

   With current NVIDIA driver releases, device files are created or modified
   by the X driver when the X server is started.

   By default, the NVIDIA driver will attempt to create device files with the
   following attributes:

         UID: 0 - 'root'
         GID: 0 - 'root'
         Mode: 0666 - 'rw-rw-rw-'

   Existing device files are changed if their attributes don't match these
   defaults. If you want the NVIDIA driver to create the device files with
   different attributes, you can specify them with the "NVreg_DeviceFileUID"
   (user), "NVreg_DeviceFileGID" (group) and "NVreg_DeviceFileMode" NVIDIA
   Linux kernel module parameters.

   For example, the NVIDIA driver can be instructed to create device files
   with UID=0 (root), GID=44 (video) and Mode=0660 by passing the following
   module parameters to the NVIDIA Linux kernel module:

         NVreg_DeviceFileUID=0
         NVreg_DeviceFileGID=44
         NVreg_DeviceFileMode=0660

I don't see, in your logs or mine, any reference to creating the device
nodes in the udev logs so I'm not clear what is responsible for creating
them at this point.

As a test could you reboot into Recovery Mode which is single-user and
won't start X and, in theory, will not load the nvidia driver.

At the recovery menu choose to start a shell then:

1. Check whether the module is currently loaded:

lsmod | grep nvidia

2. Check for device nodes:

ls -l /dev/nvidia*

3. If nvidia module is loaded, unload it:

modprobe -vr --first-time -o nvidia nvidia-current

3a. If you see:

FATAL: Module nvidia is in use.

then something is already using the module and we have to think some
more on how to proceed. Move on to step 5.

3b. Otherwise, continue onwards...

4. Load the nvidia module manually with verbose reporting enabled:

modprobe -v --first-time -o nvidia nvidia-current

5. Try manually starting X with verbose logging:

startx -- -verbose 5 -logverbose 5

6. Check if the device node has been created:

ls -l /dev/nvidia*

7. If the device node /dev/nvidia0 is not present or the X server fails
to start ...

Read more...

Revision history for this message
bcrowell (launchpadcrowell07) wrote :

How do I get into recovery mode? Googling tells me to hit escape when prompted to do so by GRUB. I don't get any messages from GRUB displayed on my screen. I get BIOS messages, then the screen stays black for a long time, and then I get the purple gdm screen.

Revision history for this message
TJ (tj) wrote :

On Fri, 2010-04-23 at 21:24 +0000, bcrowell wrote:
> How do I get into recovery mode? Googling tells me to hit escape when
> prompted to do so by GRUB. I don't get any messages from GRUB displayed
> on my screen. I get BIOS messages, then the screen stays black for a
> long time, and then I get the purple gdm screen.

Hold down the Shift key when BIOS loads GRUB. GRUB will detect the shift
key and display the menu.

Revision history for this message
bcrowell (launchpadcrowell07) wrote :

Okay, here's what I found.

When I boot into recovery mode, the nvidia module is already loaded, but the device nodes don't exist. I then unload and reload the nvidia module. Nothing special happens, except for the warning "WARNING: Could not find old name in /lib/modules/2.6.32-19-generic/updates/dkms/nvidia-current.ko to replace!" Then I start x. After I start x, the device nodes have been created. I used the attached shell script to automate the process. Out of 36 boots, the results were:

31 successful starts of X
2 cases where I never got a shell prompt
2 cases where X failed to start, but I wasn't set up to do the logging
1 case where I think I got the necessary information. Not completely sure about this one, however, because it was one of my first attempts. I was still messing around with the perl script, and I wasn't paying close enough attention to what was happening.

In all cases, I was doing warm reboots.

I will post a log from a case where X worked, and the one where it failed.

Revision history for this message
bcrowell (launchpadcrowell07) wrote :

This is a log from a case where X failed to start properly.

Revision history for this message
bcrowell (launchpadcrowell07) wrote :

This is a log from a case where X started up normally.

Revision history for this message
Robert Hooker (sarvatt) wrote :

bcrowell: You must be using a apt mirror that has not been updated in quite some time, please change it to one of the primary mirrors (like http://archive.ubuntu.com/ubuntu) and update with sudo apt-get update and then sudo apt-get dist-upgrade. Do you have /usr mounted on a seperate partition? This sounds like a bug that has been fixed quite some time ago where the blacklist wouldn't get loaded if the seperate partition it was on was being fscked and you just have not received the updates yet because of the mirror problem.

Changed in nvidia-graphics-drivers (Ubuntu):
status: In Progress → Incomplete
Revision history for this message
TJ (tj) wrote :

Well it gets more confusing! Both logs show the device nodes created. Are you *sure* the failed_log isn't accidentally a successful run?

After sed-ing out the differences in dates and times a diff doesn't show much difference aside from echoing of the commands in the successful_log.

sed -i -e 's/..:..:../ : : /g' -e 's/..:../ : /g' -e 's/2010-04-24/ - - /g' succeeded_log
sed -i -e 's/..:..:../ : : /g' -e 's/..:../ : /g' -e 's/2010-04-24/ - - /g' failed_log

diff -Nu succeeded_log failed_log
--- succeeded_log 2010-04-24 20:12:38.000000000 +0100
+++ failed_log 2010-04-24 20:12:43.000000000 +0100
@@ -1,14 +1,8 @@
-lsmod | grep nvidia
 nvidia 10799466 0
-ls -l /dev/nvidia*
   : annot access /dev/nvidi : o such file or directory
-modprobe -vr --first-time -o nvidia nvidia-current
 rmmod /lib/modules/2.6.32-19-generic/updates/dkms/nvidia-current.ko
-modprobe -v --first-time -o nvidia nvidia-current
 WARNI : ould not find old name in /lib/modules/2.6.32-19-generic/updates/dkms/nvidia-current.ko to replace!
 insmod /lib/modules/2.6.32-19-generic/updates/dkms/nvidia-current.ko
-startx -- -verbose 5 -logverbose 5
-echo "forked, x started"
 forked, x started

@@ -497,7 +491,6 @@
 (**) AT Translated Set 2 keyboa : pplying InputClass "evdev keyboard catchall"
 (**) AT Translated Set 2 keyboa : lways reports core events
 (**) AT Translated Set 2 keyboa : evi : /dev/input/event3"
-sleep 35
 (II) AT Translated Set 2 keyboa : ound keys
 (II) AT Translated Set 2 keyboa : onfiguring as keyboard
 (II) XINP : dding extended input device "AT Translated Set 2 keyboard" (ty : EYBOARD)
@@ -535,10 +528,8 @@
 (II) config/ud : dding input device Macintosh mouse button emulation (/dev/input/mouse0)
 (II) No input driver/identifier specified (ignoring)
 (II) X : euse xkmfile /var/lib/xkb/server-F8D9B4EE1D9075AF4B1C23C75362EE93E14954A0.xkm
-ls -l /dev/nvidia*
 crw-rw-rw- 1 root root 195, 0 - - : /dev/nvidia0
 crw-rw-rw- 1 root root 195, 255 - - : /dev/nvidiactl
-shutdown -r now

 waiting for X server to shut down (II) Power Butt : lose
 (II) UnloadModu : evdev"
--------

On a related note there's confirmation that nvidia_drv.so creates the nodes:

strings /usr/lib/nvidia-current/xorg/nvidia_drv.so | grep dev/nvidia
/dev/nvidiactl
/dev/nvidia%d

TJ (tj)
Changed in nvidia-graphics-drivers (Ubuntu):
assignee: nobody → TJ (intuitivenipple)
Revision history for this message
bcrowell (launchpadcrowell07) wrote :

Thanks, Robert Hooker, for the suggestion in #52. But could you explain what makes you think that I have a stale version? My sources.list file points to us.archive.ubuntu.com/ubuntu, which is identical to what you suggested except for the "us." on the front. Is there something specific in the log files I've posted that convinces you conclusively that my system is in a stale state, or are you just guessing based on the symptoms of the problem? I updated from us.archive.ubuntu.com/ubuntu this morning, before my most recent reports.

Revision history for this message
bcrowell (launchpadcrowell07) wrote :

TJ wrote:
"Well it gets more confusing! Both logs show the device nodes created. Are you *sure* the failed_log isn't accidentally a successful run?"

No, I'm not sure. That's what I was saying in #49. After spending several hours doing reboot cycles, I got a a few failures to boot properly, but most of them were cases where I didn't even get a login prompt after my system was done booting. As I said in #49, the case I posted was one of the ones from early on, where I wasn't paying proper attention and wasn't sure I noticed correctly what was going on. That was the only one where it failed to start X, but also gave me a login prompt.

Recently when I've rebooted normally (not in recovery mode), the failure to start x had been happening fairly often. Here are some statistics from a few days ago:
34 successful boots
2 boots that didn't result in a login prompt
6 boots that gave a login prompt but didn't start X

I see several possibilities here:
(a) There are two separate bugs, one of which causes X to fail, and one which causes me not even to get a login prompt.
(b) Same as a, but the X bug doesn't occur in recovery mode.
(c) There is only one bug, and I was just "unlucky" in not being able to reproduce it today after several hours of rebooting.

It takes less time for me to simply reboot that it does to go through the whole procedure involving recovery mode and logging. I'm going to try doing 30 cycles the quick way, and see if the X problem is still occurring or not.

Revision history for this message
bcrowell (launchpadcrowell07) wrote :

Okay, I just did two reboots the normal way (not using recovery mode). The statistics were as follows:
1 boot that didn't result in a login prompt
1 boot that gave a login prompt but didn't start X

So to summarize what I've found so far:
(1) The bug is still present in a system freshly updated this morning from us.archive.ubuntu.com.
(2) The bug does not seem to occur when starting up in recovery mode. (In the session described in #39, I saw it either 0 or 1 time out of 36 attempts in recovery mode. In the session described in this post, I saw it on 1 out of 2 attempts when not using recovery mode.

I'm going to switch the status back from Incomplete to In progress, since I think Robert Hooker may have been making an incorrect inference in #54. Robert Hooker, maybe you could clarify: do you believe that us.archive.ubuntu.com is out of date compared to archive.ubuntu.com? This would surprise me (a lot), but if it is the case, i can certainly point my sources.list to archive.ubuntu.com and do another update.

Changed in nvidia-graphics-drivers (Ubuntu):
status: Incomplete → In Progress
Revision history for this message
bcrowell (launchpadcrowell07) wrote :

As a test to see whether my apt server might be out of date, I tried deleting the "us." from all the URLs in my sources.list. After an apt-get update, an apt-get upgrade resulted in "0 upgraded, 0 newly installed, 0 to remove and 9 not upgraded." So it looks to me like there was no problem with my apt server being stale.

Revision history for this message
Robert Hooker (sarvatt) wrote :

bcrowell:

you posted on april 21st using -

[ 0.000000] Linux version 2.6.32-19-generic (buildd@yellow) (gcc version 4.4.3 (Ubuntu 4.4.3-4ubuntu5) ) #28-Ubuntu SMP Thu Apr 1 10:39:41 UTC 2010 (Ubuntu 2.6.32-19.28-generic 2.6.32.10+drm33.1)

and saying you were not receiving a kernel upgrade when the current current was 2.6.32-21. 2.6.32-20 was published 10 days before you said that so there is something wrong elsewhere if you are not receiving updates and an out of date mirror was the most likely culprit.

However your comment just now sounds like there is another issue since you are getting the "9 not upgraded" part. Can you please do a sudo apt-get dist-upgrade (not just sudo apt-get upgrade) and see what's offered? Do you have the package linux-generic installed?

Revision history for this message
bcrowell (launchpadcrowell07) wrote :

Aha! An apt-get dist-upgrade brought me to kernel 2.6.32-21. Yeesh, I'm a little grumpy that it was so hard to find out about this. Maybe this was in a README that I didn't read? I had no idea that a dist-upgrade would do anything more than an upgrade, once you were already on the current version of the OS (letter of the alphabet in the ubuntu naming scheme).

I just did 30 reboots with no problems, so it looks like I'm all set now. Thanks, Robert Hooker, for the help!

Revision history for this message
bcrowell (launchpadcrowell07) wrote :

I'm going to change the status of this one to Fix Released. If the problem recurs, I'll put it back.

Changed in nvidia-graphics-drivers (Ubuntu):
status: In Progress → Fix Released
TJ (tj)
Changed in nvidia-graphics-drivers (Ubuntu):
assignee: TJ (intuitivenipple) → nobody
Revision history for this message
ISV_Damocles (isv-damocles) wrote :

This bug is not fixed, and I get the failure far more often than he reports (most of the time it fails to load the nvidia module, and only occasionally can I get it to boot properly).

Ubuntu 10.04 is the first version of Ubuntu that seems to support my nvidia graphics card (GeForce GT 320M) (at least I couldn't get it to use the nvidia drivers in 9.10).

Attached are Xorg.0.log (a successful nvidia start up), Xorg.1.log (Xorg catching the failure and switching to VESA), and Xorg.2.log (a failed log).

Xorg.0.log is in one system boot while Xorg.1.log and Xorg.2.log are in the previous system boot.

Also attached is the messages file, which the nvidia driver suggests reading for an error message, but I can't find one that doesn't show up under a successful boot up, as well.

I did find in comparing the dmesg and dmesg.0 (the current and previous boot, respectively) that when the failure occurs, it seems the nForce2 driver is failing ("nForce2_smbus 0000:00:03.2: Error probing SMB2.") in dmesg.0 while in dmesg no such line can be found. Do you think it could be some sort of race condition on communicating with the nForce2 chipset between the nForce2 driver and nvidia's proprietary graphics driver? (It is a black box, it could be touching who knows what.)

Revision history for this message
bcrowell (launchpadcrowell07) wrote :

Like ISV_Damocles, I'm finding that the problem wasn't actually fixed at the end of April. I've experienced it as recently as May 7, although it does seem much less frequent than it was in April. I'm going to change the status of the bug bag from Fix Released to In Progress.

Changed in nvidia-graphics-drivers (Ubuntu):
status: Fix Released → In Progress
Revision history for this message
OliverSchmid (oliver-schmd) wrote :

Hi all, I ran across a workaround in the forums. When it starts up into low resolution mode, log into the terminal and run: "sudo start gdm". GDM then pops up with full resolution.

Revision history for this message
Martin Wildam (mwildam) wrote :

I also noticed resolution problems when using my Dell Notebook with a port replicator / docking station.

I managed to get it right changing the gdm default start script - see http://it-tactics.blogspot.com/2010/08/ubuntu-1004-with-docking-station.html - maybe you can get a permanently working workaround using the information accordingly.

Revision history for this message
dino99 (9d9) wrote :

That version is no more supported; please open a new bug report if the actual archive found version also has the same issue.

Changed in nvidia-graphics-drivers (Ubuntu):
status: In Progress → Invalid
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.