CUDA and the BLUE SCREEN OF DEATH

Message boards : Number crunching : CUDA and the BLUE SCREEN OF DEATH
Message board moderation

To post messages, you must log in.

1 · 2 · 3 · Next

AuthorMessage
Steve Robertson

Send message
Joined: 14 May 99
Posts: 38
Credit: 2,643,210
RAC: 0
United States
Message 851456 - Posted: 9 Jan 2009, 23:15:12 UTC
Last modified: 9 Jan 2009, 23:16:01 UTC

CUDA has been crashing my GPU (I assume due to overheating) so I de-selected it as an option. Does the use x% of processor setting affect only the CPUs or the GPU too? I don't have any intentions of adding fans etc to my system to otherwise rectify this.
ID: 851456 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51507
Credit: 1,018,363,574
RAC: 1,004
United States
Message 851533 - Posted: 10 Jan 2009, 2:24:35 UTC - in response to Message 851456.  

CUDA has been crashing my GPU (I assume due to overheating) so I de-selected it as an option. Does the use x% of processor setting affect only the CPUs or the GPU too? I don't have any intentions of adding fans etc to my system to otherwise rectify this.

Another good Cuda query......can Cuda usage be throttled?
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 851533 · Report as offensive
.clair.

Send message
Joined: 4 Nov 04
Posts: 1300
Credit: 55,390,408
RAC: 69
United Kingdom
Message 851546 - Posted: 10 Jan 2009, 3:23:40 UTC

Can CUDA be throttled ? ..........
Hmmm, brings `something` to mind.........
.
Curiously Undeveloped Dubious Application :)
ID: 851546 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51507
Credit: 1,018,363,574
RAC: 1,004
United States
Message 851573 - Posted: 10 Jan 2009, 4:53:49 UTC - in response to Message 851546.  
Last modified: 10 Jan 2009, 4:55:41 UTC

Can CUDA be throttled ? ..........
Hmmm, brings `something` to mind.........
.
Curiously Undeveloped Dubious Application :)

I was talking about the app, not the devs...........although...never mind.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 851573 · Report as offensive
Gregory A Pharr

Send message
Joined: 27 May 01
Posts: 17
Credit: 281,581
RAC: 0
United States
Message 851579 - Posted: 10 Jan 2009, 5:09:53 UTC

I too started processing CUDA units and began getting BSOD. I've never had those before, even in hard gaming. In analyzing the dump files and checking with nVidia, it does indeed look like my card too was overheating. Looks like CUDA was driving my card harder than any game ever has. But, my card is also factory 'superclocked', so it probably ran hotter anyway. Oh well, another excuse to get me one of the new GTX 295 cards. For those having the same problem, I've seen in other forums that you can tweak down the speed of the GPU to reduce heat, and of course, make sure your case is well ventilated as well. Also, if you're running any of the newer GTX 200 series, you need to be sure to have not only enough wattage on your power supply, but also enough amps available on the +12v rails. Many power supplies have lots of watts, but can't supply the ampage on the rails, and that is very often the source of stability issues when driving the nVidia processors hard. For example, the GTX 295 requires that the power supply be able to provide at least 46amps +12v, and few power supplies can do that. If you run high end cards with wimpy power supplies, you'll have problems.
ID: 851579 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51507
Credit: 1,018,363,574
RAC: 1,004
United States
Message 851580 - Posted: 10 Jan 2009, 5:12:43 UTC - in response to Message 851579.  

I too started processing CUDA units and began getting BSOD. I've never had those before, even in hard gaming. In analyzing the dump files and checking with nVidia, it does indeed look like my card too was overheating. Looks like CUDA was driving my card harder than any game ever has. But, my card is also factory 'superclocked', so it probably ran hotter anyway. Oh well, another excuse to get me one of the new GTX 295 cards. For those having the same problem, I've seen in other forums that you can tweak down the speed of the GPU to reduce heat, and of course, make sure your case is well ventilated as well. Also, if you're running any of the newer GTX 200 series, you need to be sure to have not only enough wattage on your power supply, but also enough amps available on the +12v rails. Many power supplies have lots of watts, but can't supply the ampage on the rails, and that is very often the source of stability issues when driving the nVidia processors hard. For example, the GTX 295 requires that the power supply be able to provide at least 46amps +12v, and few power supplies can do that. If you run high end cards with wimpy power supplies, you'll have problems.

Stock setups do not consider 100% load 100% of the time as 'normal'.....
They are not engineered for that..........

So continues the sad saga of the lonely Cuda......
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 851580 · Report as offensive
MarkJ Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 08
Posts: 1139
Credit: 80,854,192
RAC: 5
Australia
Message 851627 - Posted: 10 Jan 2009, 10:29:54 UTC - in response to Message 851533.  
Last modified: 10 Jan 2009, 10:32:18 UTC

CUDA has been crashing my GPU (I assume due to overheating) so I de-selected it as an option. Does the use x% of processor setting affect only the CPUs or the GPU too? I don't have any intentions of adding fans etc to my system to otherwise rectify this.

Another good Cuda query......can Cuda usage be throttled?


The short answer is No. Not in the current version of BOINC. In fact it won't even task switch them, they run until they finish (one way or the other) and then it will release the gpu for another task.

It probably won't be in the next version of BOINC either, but if you don't ask you don't get. Its a good idea so i'll put the suggestion in and we will see.
BOINC blog
ID: 851627 · Report as offensive
MarkJ Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 08
Posts: 1139
Credit: 80,854,192
RAC: 5
Australia
Message 851632 - Posted: 10 Jan 2009, 10:48:44 UTC - in response to Message 851627.  

but if you don't ask you don't get. Its a good idea so i'll put the suggestion in and we will see.


Added trak ticket 822 with the suggestion.
BOINC blog
ID: 851632 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 851637 - Posted: 10 Jan 2009, 11:21:47 UTC - in response to Message 851456.  
Last modified: 10 Jan 2009, 11:25:45 UTC

CUDA has been crashing my GPU (I assume due to overheating) so I de-selected it as an option. Does the use x% of processor setting affect only the CPUs or the GPU too? I don't have any intentions of adding fans etc to my system to otherwise rectify this.


Just try to underclock GPU if you think overheating is the reason (I doubt in that )

For now common reason for driver restart/BSoD on Vista and BSoD on XP is VLAR task. And probably it caused BSOD by kernel that occupies GPU more than few seconds - known CUDA/videodriver issue - check nVidia CUDA page for this limitation.
ID: 851637 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51507
Credit: 1,018,363,574
RAC: 1,004
United States
Message 851685 - Posted: 10 Jan 2009, 14:29:21 UTC - in response to Message 851637.  

CUDA has been crashing my GPU (I assume due to overheating) so I de-selected it as an option. Does the use x% of processor setting affect only the CPUs or the GPU too? I don't have any intentions of adding fans etc to my system to otherwise rectify this.


Just try to underclock GPU if you think overheating is the reason (I doubt in that )

For now common reason for driver restart/BSoD on Vista and BSoD on XP is VLAR task. And probably it caused BSOD by kernel that occupies GPU more than few seconds - known CUDA/videodriver issue - check nVidia CUDA page for this limitation.

Rather than make the statement thet 'Cuda is Death'......I suppose I should wish you all well........tune your Cuda, LOL...
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 851685 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 851707 - Posted: 10 Jan 2009, 15:36:05 UTC - in response to Message 851685.  
Last modified: 10 Jan 2009, 15:47:04 UTC

CUDA has been crashing my GPU (I assume due to overheating) so I de-selected it as an option. Does the use x% of processor setting affect only the CPUs or the GPU too? I don't have any intentions of adding fans etc to my system to otherwise rectify this.


Just try to underclock GPU if you think overheating is the reason (I doubt in that )

For now common reason for driver restart/BSoD on Vista and BSoD on XP is VLAR task. And probably it caused BSOD by kernel that occupies GPU more than few seconds - known CUDA/videodriver issue - check nVidia CUDA page for this limitation.

Rather than make the statement thet 'Cuda is Death'......I suppose I should wish you all well........tune your Cuda, LOL...

WTF, Mark? CUDA is just new technology and sooner or later it will be well used in SETI and not only in SETI as any another computational technology. Why so negative ???
Maybe it's time to stop to blame and to try to use it ?
Debugging in progress indeed. One of overflow bugs already fixed.
ID: 851707 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51507
Credit: 1,018,363,574
RAC: 1,004
United States
Message 851718 - Posted: 10 Jan 2009, 15:54:18 UTC - in response to Message 851707.  
Last modified: 10 Jan 2009, 15:58:43 UTC

CUDA has been crashing my GPU (I assume due to overheating) so I de-selected it as an option. Does the use x% of processor setting affect only the CPUs or the GPU too? I don't have any intentions of adding fans etc to my system to otherwise rectify this.


Just try to underclock GPU if you think overheating is the reason (I doubt in that )

For now common reason for driver restart/BSoD on Vista and BSoD on XP is VLAR task. And probably it caused BSOD by kernel that occupies GPU more than few seconds - known CUDA/videodriver issue - check nVidia CUDA page for this limitation.

Rather than make the statement thet 'Cuda is Death'......I suppose I should wish you all well........tune your Cuda, LOL...

WTF, Mark? CUDA is just new technology and sooner or later it will be well used in SETI and not only in SETI as any another computational technology. Why so negative ???
Maybe it's time to stop to blame and to try to use it ?
Debugging in progress indeed. One of overflow bugs already fixed.

Oh, you detected my negativity? No problem with you, my friend. The biggest bitch I have is that this crap still belongs in BETA.....NOT HERE IN MAIN!!!!!!!!! AM I CLEAR ON MY POINT?????? Rant off........ It never should have seen the f-in light of day on main.........period.....exclamation point....end of line. Have fun with it......but don't waste my time or the project's time until it is producing valid science.........until then, it is a toy.......not a tool. Advances in technology are all well and good.......if the f'in wheels don't fall off of it when ya roll it out..........seems like Toy Story Bloopers......
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 851718 · Report as offensive
Steve Robertson

Send message
Joined: 14 May 99
Posts: 38
Credit: 2,643,210
RAC: 0
United States
Message 851944 - Posted: 11 Jan 2009, 0:10:19 UTC - in response to Message 851637.  
Last modified: 11 Jan 2009, 0:11:28 UTC

"For now common reason for driver restart/BSoD on Vista and BSoD on XP is VLAR task. And probably it caused BSOD by kernel that occupies GPU more than few seconds - known CUDA/videodriver issue - check nVidia CUDA page for this limitation."

I think you're right on this one, because it hasn't been running that hot (that I can tell). I noticed the newest drivers don't address this- does anybody have a fix?
ID: 851944 · Report as offensive
Beau

Send message
Joined: 24 Feb 08
Posts: 50
Credit: 129,080
RAC: 0
United States
Message 851946 - Posted: 11 Jan 2009, 0:19:32 UTC - in response to Message 851944.  

Since your taking requests dealing with cuda here, I would like to request a setting/checkbox or something that I can click on so I wont get paired up with a wingman running cuda. I think people runnig cuda should get paired up together, and those of us running cpu's be paired up with no intermingeling between the 2.
ID: 851946 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 851950 - Posted: 11 Jan 2009, 0:26:35 UTC - in response to Message 851946.  

Since your taking requests dealing with cuda here, I would like to request a setting/checkbox or something that I can click on so I wont get paired up with a wingman running cuda. I think people runnig cuda should get paired up together, and those of us running cpu's be paired up with no intermingeling between the 2.

Why do you think so? Do you have any idea why pairing is needed at all?
To ensure result validity. If you propose to pair buggy app with buggy app - you want to get database corruption? Think twice again.
ID: 851950 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 851951 - Posted: 11 Jan 2009, 0:27:04 UTC - in response to Message 851944.  

"For now common reason for driver restart/BSoD on Vista and BSoD on XP is VLAR task. And probably it caused BSOD by kernel that occupies GPU more than few seconds - known CUDA/videodriver issue - check nVidia CUDA page for this limitation."

I think you're right on this one, because it hasn't been running that hot (that I can tell). I noticed the newest drivers don't address this- does anybody have a fix?


Fixing in progress.
ID: 851951 · Report as offensive
Profile perryjay
Volunteer tester
Avatar

Send message
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 851954 - Posted: 11 Jan 2009, 0:32:17 UTC - in response to Message 851951.  

V4c Raistmer? :) V4b is running good for me now. I'm going to give it a couple of days before I do anything else.


PROUD MEMBER OF Team Starfire World BOINC
ID: 851954 · Report as offensive
Josef W. Segur
Volunteer developer
Volunteer tester

Send message
Joined: 30 Oct 99
Posts: 4504
Credit: 1,414,761
RAC: 0
United States
Message 852005 - Posted: 11 Jan 2009, 2:54:39 UTC - in response to Message 851946.  

Since your taking requests dealing with cuda here, I would like to request a setting/checkbox or something that I can click on so I wont get paired up with a wingman running cuda. I think people runnig cuda should get paired up together, and those of us running cpu's be paired up with no intermingeling between the 2.

For now, the preference setting to only accept Astropulse work would suit you.

I distinctly disagree with any suggestion that CUDA should be paired with CUDA now, that would put too many doubtful results in the master science database. Once the bugs are fixed I doubt many will care what pairings happen, but more flexibility would at least make sense.
                                                                  Joe
ID: 852005 · Report as offensive
Beau

Send message
Joined: 24 Feb 08
Posts: 50
Credit: 129,080
RAC: 0
United States
Message 852019 - Posted: 11 Jan 2009, 3:38:50 UTC - in response to Message 852005.  


I distinctly disagree with any suggestion that CUDA should be paired with CUDA now, that would put too many doubtful results in the master science database. Once the bugs are fixed I doubt many will care what pairings happen, but more flexibility would at least make sense.
                                                                  Joe
[/quote]

Another point, if cuda is giving so many "doubtful" results as you had said, why is it being used in the mainstream crunching and not back in beta testing where it should remain until it works? Why cant I also crunch seti on my TI-84 calculator as well; seeing as the quality of results obviously dont matter to seti.
ID: 852019 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 852054 - Posted: 11 Jan 2009, 4:55:26 UTC - in response to Message 852019.  

...
Why cant I also crunch seti on my TI-84 calculator as well; seeing as the quality of results obviously dont matter to seti.

Hey, if somebody wants to re-write the MB code into BASIC so that it can be loaded as an app, go for it. Though I don't know what kind of instruction set you would be stuck with. That Zilog Z80 is a relic. The 83+SE is clocked to 15MHz (I think the 84 is the same). The 83 and 83+ are only 6MHz.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 852054 · Report as offensive
1 · 2 · 3 · Next

Message boards : Number crunching : CUDA and the BLUE SCREEN OF DEATH


 
©2025 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.