CUDA locking up my computer

Questions and Answers : GPU applications : CUDA locking up my computer
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile Borgholio
Avatar

Send message
Joined: 2 Aug 99
Posts: 654
Credit: 18,623,738
RAC: 45
United States
Message 923127 - Posted: 2 Aug 2009, 9:23:01 UTC

Occasionally, when starting a new Seti@home CUDA task, my computer will completely lock up. No blue screen or other error message, everything just stops. This does not appear to happen during crunching, only when crunching first begins. This does not happen constantly, it varies between once every couple weeks and a couple times a day.

The GPU temperatures were always in the green, as were CPU temperatures. Upgrading drivers and CUDA version from 1 to 2.2 didn't seem to work at the time. Since I mainly run Seti@home, I don't know if the problem occurs with other CUDA projects or only Seti. I'm running a few GPUGrid workunits now so we'll see.

Originally I had a GeForce 8400 GS that never locked up. Then I upgraded to a GeForce 9600 GT card. Unfortunately, I had to send it back to the manufacturer 3 times under warranty for various issues. During that period I ran on a GeForce 8600 with no problems.

The fourth 9600 I received worked fine...except for CUDA. That's when the locking up first began. I was running 186.18 drivers and Cuda 2.2. A couple days ago I upgraded to a GeForce 9800 GT (same driver version), and the locking up continued.

I have just now updated to the latest drivers (190.38), and Cuda 2.3. I will have to wait and see if this has any effect. I have also put my old 9600 in another computer to see if it has issues there as well.

While I wait to see what happens, does anybody have any suggestions as to what else I should look at? Thanks!
You will be assimilated...bunghole!

ID: 923127 · Report as offensive
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 923141 - Posted: 2 Aug 2009, 11:29:53 UTC - in response to Message 923127.  

Process of elimination. Put all cards in other machines. If they don't lock up there, try running with the CPU only on the affected machine.. or try running something else that stresses the videocard, like a heavy OpenGL game such as Crysis.

If that locks up the machine as well, you could be looking at a damaged motherboard.
ID: 923141 · Report as offensive
Joseph Monk

Send message
Joined: 31 Mar 07
Posts: 150
Credit: 1,181,197
RAC: 0
Korea, South
Message 923143 - Posted: 2 Aug 2009, 11:53:26 UTC - in response to Message 923127.  

Sure they aren't VLARs, my system became very unresponsive when CUDA ran VLAR WU. I'm on 190.18/CUDA 2.3 on two GTX 260-216 (HEAVILY OC) and the system is pretty slow when crunching any WU, haven't tested yet to see if it's my OC or the CUDA doing it, but didn't do it before (I set the OC shortly after upgrading).
ID: 923143 · Report as offensive
Profile Borgholio
Avatar

Send message
Joined: 2 Aug 99
Posts: 654
Credit: 18,623,738
RAC: 45
United States
Message 923212 - Posted: 2 Aug 2009, 18:22:52 UTC - in response to Message 923141.  

Process of elimination. Put all cards in other machines. If they don't lock up there, try running with the CPU only on the affected machine.. or try running something else that stresses the videocard, like a heavy OpenGL game such as Crysis.

If that locks up the machine as well, you could be looking at a damaged motherboard.


I'll try running 3dmark, as well as Ntune's stability test and Sandra's burn in. Don't have Crysis, unfortunately.
You will be assimilated...bunghole!

ID: 923212 · Report as offensive
Profile popandbob
Volunteer tester

Send message
Joined: 19 Mar 05
Posts: 551
Credit: 4,673,015
RAC: 0
Canada
Message 923263 - Posted: 3 Aug 2009, 0:48:50 UTC

I've had this error in the past but upgrading to cuda 2.3 seems to have stopped the problem.
Bob


Do you Good Search for Seti@Home? http://www.goodsearch.com/?charityid=888957
Or Good Shop? http://www.goodshop.com/?charityid=888957
ID: 923263 · Report as offensive
Profile Borgholio
Avatar

Send message
Joined: 2 Aug 99
Posts: 654
Credit: 18,623,738
RAC: 45
United States
Message 923309 - Posted: 3 Aug 2009, 5:02:55 UTC - in response to Message 923263.  

I've had this error in the past but upgrading to cuda 2.3 seems to have stopped the problem.
Bob


Very good news, I anxiously await the results on this end. Thanks!
You will be assimilated...bunghole!

ID: 923309 · Report as offensive
Profile Borgholio
Avatar

Send message
Joined: 2 Aug 99
Posts: 654
Credit: 18,623,738
RAC: 45
United States
Message 923835 - Posted: 5 Aug 2009, 20:07:11 UTC

So far after upgrading to Cuda 2.3, my 9800 system is running fine (knock on wood). However my 9600 system running 2.2 just locked up. Thus I can conclude it's not something unique to my system, as the lockups occurred with both computers when they ran the 9600. I will try upgrading the other computer to Cuda 2.3 and see if that fixes it.

Interestingly though, it seems that the lockup did NOT occur at the start of a workunit. It happened about 89% of the way through. After hitting the reset button, the same workunit picked up where it left off.
You will be assimilated...bunghole!

ID: 923835 · Report as offensive
Profile Borgholio
Avatar

Send message
Joined: 2 Aug 99
Posts: 654
Credit: 18,623,738
RAC: 45
United States
Message 924216 - Posted: 7 Aug 2009, 0:12:35 UTC
Last modified: 7 Aug 2009, 0:32:23 UTC

The 9600 locked up again, even with the latest drivers and CUDA 2.3. 9800 still running fine. I will try to downgrade the drivers to the earliest ones that support CUDA and go back to 1.0.

Edit - Before downgrading, I decided to try to underclock the card and see if that helped. If not, I'm curious if possibly Raistmer's V12 CUDA app could be the culprit...perhaps I should go back to the standard Seti CUDA app?

Any other thoughts?
You will be assimilated...bunghole!

ID: 924216 · Report as offensive
Profile Borgholio
Avatar

Send message
Joined: 2 Aug 99
Posts: 654
Credit: 18,623,738
RAC: 45
United States
Message 925688 - Posted: 12 Aug 2009, 22:15:26 UTC

Well a week after I downgraded from Raistmer's V12 CUDA app to the stock Seti 6.08 app, the 9600 machine is still locking up occasionally...only this time it seems to un-lock after 10 - 20 seconds and runs fine afterwards. I don't recall the computer recovering on it's own with the V12 app. The 9800 machine isn't locking up at all. This is most curious...

It appears upon reflection to lock up mainly when I have a game running in the background, so it could be competition for resources that's causing the issue. Since the computer appears to recover on it's own without need for a reboot, it's pretty much moot at this point.
You will be assimilated...bunghole!

ID: 925688 · Report as offensive

Questions and Answers : GPU applications : CUDA locking up my computer


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.