CUDA and no SETI

Message boards : Number crunching : CUDA and no SETI
Message board moderation

To post messages, you must log in.

1 · 2 · 3 · Next

AuthorMessage
Profile Paul D Harris
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 1122
Credit: 33,600,005
RAC: 0
United States
Message 852541 - Posted: 12 Jan 2009, 2:12:42 UTC
Last modified: 12 Jan 2009, 2:13:58 UTC

I just started to do CUDA since I got my computer fixed and I have a CUDA card and now my WU's has all CUDA and no SETI? What gives?
ID: 852541 · Report as offensive
Profile perryjay
Volunteer tester
Avatar

Send message
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 852543 - Posted: 12 Jan 2009, 2:23:49 UTC - in response to Message 852541.  

It saw you had a CUDA card so it sent you CUDA WUs by default. You should also get some astropulse WUs.


PROUD MEMBER OF Team Starfire World BOINC
ID: 852543 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51531
Credit: 1,018,363,574
RAC: 1,004
United States
Message 852544 - Posted: 12 Jan 2009, 2:29:57 UTC - in response to Message 852541.  
Last modified: 12 Jan 2009, 2:30:47 UTC

I just started to do CUDA since I got my computer fixed and I have a CUDA card and now my WU's has all CUDA and no SETI? What gives?

You could shoulda had no Cuda as you shouldna.......but you Cuda had a cuda so ya just been Cuda'd..........


Enjoy....
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 852544 · Report as offensive
Profile Paul D Harris
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 1122
Credit: 33,600,005
RAC: 0
United States
Message 852550 - Posted: 12 Jan 2009, 2:58:29 UTC - in response to Message 852544.  

I just started to do CUDA since I got my computer fixed and I have a CUDA card and now my WU's has all CUDA and no SETI? What gives?

You could shoulda had no Cuda as you shouldna.......but you Cuda had a cuda so ya just been Cuda'd..........


Enjoy....


That's good!
ID: 852550 · Report as offensive
Profile Paul D Harris
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 1122
Credit: 33,600,005
RAC: 0
United States
Message 852565 - Posted: 12 Jan 2009, 4:12:25 UTC
Last modified: 12 Jan 2009, 4:17:10 UTC

My CUDA WU's run faster than my SETI WU's for the same credit in other words more work done with CUDA equals more credit with CUDA. A 14 credit WU with CUDA equals about 50 seconds CPU time. I think if I understand my results page.

My computer number is 37936
See http://setiweb.ssl.berkeley.edu/beta/workunit.php?wuid=1444420
And http://setiweb.ssl.berkeley.edu/beta/workunit.php?wuid=1602276
ID: 852565 · Report as offensive
Josef W. Segur
Volunteer developer
Volunteer tester

Send message
Joined: 30 Oct 99
Posts: 4504
Credit: 1,414,761
RAC: 0
United States
Message 852567 - Posted: 12 Jan 2009, 4:25:47 UTC - in response to Message 852565.  

My CUDA WU's run faster than my SETI WU's for the same credit in other words more work done with CUDA equals more credit with CUDA. A 14 credit WU with CUDA equals about 50 seconds CPU time. I think if I understand my results page.
...

You need to look at elapsed time, the 50 seconds is the small amount of CPU time needed to feed the GPU. There are plans for BOINC to gather and report actual time used IIRC. That's not yet available, so the only way to tell is to look in the logged or BOINC Manager messages for the start time and finish time of a task.
                                                              Joe
ID: 852567 · Report as offensive
Profile Paul D Harris
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 1122
Credit: 33,600,005
RAC: 0
United States
Message 852568 - Posted: 12 Jan 2009, 4:30:09 UTC - in response to Message 852567.  

My CUDA WU's run faster than my SETI WU's for the same credit in other words more work done with CUDA equals more credit with CUDA. A 14 credit WU with CUDA equals about 50 seconds CPU time. I think if I understand my results page.
...

You need to look at elapsed time, the 50 seconds is the small amount of CPU time needed to feed the GPU. There are plans for BOINC to gather and report actual time used IIRC. That's not yet available, so the only way to tell is to look in the logged or BOINC Manager messages for the start time and finish time of a task.
                                                              Joe


OK thanks that makes more sense now.
ID: 852568 · Report as offensive
John Turner

Send message
Joined: 16 Mar 02
Posts: 1
Credit: 82,311
RAC: 0
United States
Message 852581 - Posted: 12 Jan 2009, 5:36:21 UTC - in response to Message 852568.  

Yeah, well here's the deal. Other threads got it right. Cuda isn't ready for prime time. Every one of my tasks errored out with not enough memory, and that = no credi-to. Worse, instead of the elegent sharing that has allowed me to continue whatever work I have with S@H I now have sparklies after a CUDA computational error. I opted out and will stay out until one of you brave souls logs about 50 hours with no problems. If there's change review comittee in charge they need to reconsider their choices.
ID: 852581 · Report as offensive
Profile Byron S Goodgame
Volunteer tester
Avatar

Send message
Joined: 16 Jan 06
Posts: 1145
Credit: 3,936,993
RAC: 0
United States
Message 852588 - Posted: 12 Jan 2009, 6:27:30 UTC - in response to Message 852581.  
Last modified: 12 Jan 2009, 6:40:35 UTC

Yeah, well here's the deal. Other threads got it right. Cuda isn't ready for prime time. Every one of my tasks errored out with not enough memory, and that = no credi-to. Worse, instead of the elegent sharing that has allowed me to continue whatever work I have with S@H I now have sparklies after a CUDA computational error. I opted out and will stay out until one of you brave souls logs about 50 hours with no problems. If there's change review comittee in charge they need to reconsider their choices.

I see many -9 overflows but my guess is they were caused by This task. This is a Very Low Angle Range task, or VLAR (0.1 angle range and below tasks) that if you didn't reboot after this task, can have the effect of causing what the other tasks that followed had, "SETI@Home Informational message -9 result_overflow
NOTE: The number of results detected exceeds the storage space allocated." which is a false result in most cases under these circumstances.

It's been suggested that you use the batch file descrebed in several of the threads here like this one in an attempt to spot these VLAR tasks before they run so they can be aborted.
ID: 852588 · Report as offensive
Riil
Volunteer tester

Send message
Joined: 9 Mar 04
Posts: 9
Credit: 327,611
RAC: 9
Poland
Message 852594 - Posted: 12 Jan 2009, 8:15:31 UTC

Got same problem in here. My screen gets black from time to time (couple of times a day atm). I've got this information in the right corner "display driver nvlddmkm stopped response, but it works again now". In many cases i have to reeboot my machine. Its because of thaousands of small pixels moving on my screen. This kind of WUs need less than 30 sec of time to be finished (all with errors, all with Spike count: 30, all with same message 'SETI@Home Informational message -9 result_overflow') Any clue how to fix it ???
ID: 852594 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 852599 - Posted: 12 Jan 2009, 9:04:22 UTC - in response to Message 852594.  

Got same problem in here. My screen gets black from time to time (couple of times a day atm). I've got this information in the right corner "display driver nvlddmkm stopped response, but it works again now". In many cases i have to reeboot my machine. Its because of thaousands of small pixels moving on my screen. This kind of WUs need less than 30 sec of time to be finished (all with errors, all with Spike count: 30, all with same message 'SETI@Home Informational message -9 result_overflow') Any clue how to fix it ???


If you have troubles with CUDA, it's worth to read other threads where all currently know bugs described and solutions are proposed.
Black screen, driver restart, BSoD, overflows - all these symptoms belongs to VLAR/VHAR processing bugs in current CUDA MB version. The solution is simple you just need to run script sorting tasks by AR, abort all tasks with AR known in "bug area" and do happy CUDA crunching for few days. Then refill cache and repeat procedure.
That's all. And it takes much less time to complete than to write endless moanings here on forums.
Look here too:
http://setiathome.berkeley.edu/forum_thread.php?id=51175
ID: 852599 · Report as offensive
Roberto Patriarca

Send message
Joined: 17 May 99
Posts: 22
Credit: 1,967,389
RAC: 0
Italy
Message 852707 - Posted: 12 Jan 2009, 17:08:02 UTC - in response to Message 852599.  

The solution is simple you just need to run script sorting tasks by AR, abort all tasks with AR known in "bug area" and do happy CUDA crunching for few days. Then refill cache and repeat procedure.


Where can I find such a script? I have grown tired of aborting frozen tasks, coping with unnecessary overflows, rebooting, etc. I had a look around and found references to that script but not the script itself.

Thanks in advance.

Roberto who got a 9800 GT just to speed up crunching
ID: 852707 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 852717 - Posted: 12 Jan 2009, 18:07:14 UTC - in response to Message 852707.  

ID: 852717 · Report as offensive
Roberto Patriarca

Send message
Joined: 17 May 99
Posts: 22
Credit: 1,967,389
RAC: 0
Italy
Message 852726 - Posted: 12 Jan 2009, 19:02:11 UTC - in response to Message 852717.  

here: http://setiathome.berkeley.edu/beta/forum_thread.php?id=1443&nowrap=true#36281


Thank you so much. I've worked out my own script, but without that reference it would have been impossible.
Now let's see if my computer can run unattended for more than a few hours.
ID: 852726 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 852731 - Posted: 12 Jan 2009, 19:22:08 UTC - in response to Message 852726.  

here: http://setiathome.berkeley.edu/beta/forum_thread.php?id=1443&nowrap=true#36281


Thank you so much. I've worked out my own script, but without that reference it would have been impossible.
Now let's see if my computer can run unattended for more than a few hours.

If you made some useful improvements it's worth to publish your own script too - maybe it will help someone... ;)
ID: 852731 · Report as offensive
Profile Paul D Harris
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 1122
Credit: 33,600,005
RAC: 0
United States
Message 852757 - Posted: 12 Jan 2009, 21:21:45 UTC

I have been reading about the CUDA and how fast it is or isn't.
I am undecided but here is a jpg of my task folder

http://seti.pdharris.com/task/task_2009.12.01.jpg

ID: 852757 · Report as offensive
Riil
Volunteer tester

Send message
Joined: 9 Mar 04
Posts: 9
Credit: 327,611
RAC: 9
Poland
Message 852758 - Posted: 12 Jan 2009, 21:24:45 UTC - in response to Message 852599.  

Got same problem in here. My screen gets black from time to time (couple of times a day atm). I've got this information in the right corner "display driver nvlddmkm stopped response, but it works again now". In many cases i have to reeboot my machine. Its because of thaousands of small pixels moving on my screen. This kind of WUs need less than 30 sec of time to be finished (all with errors, all with Spike count: 30, all with same message 'SETI@Home Informational message -9 result_overflow') Any clue how to fix it ???


If you have troubles with CUDA, it's worth to read other threads where all currently know bugs described and solutions are proposed.
Black screen, driver restart, BSoD, overflows - all these symptoms belongs to VLAR/VHAR processing bugs in current CUDA MB version. The solution is simple you just need to run script sorting tasks by AR, abort all tasks with AR known in "bug area" and do happy CUDA crunching for few days. Then refill cache and repeat procedure.
That's all. And it takes much less time to complete than to write endless moanings here on forums.
Look here too:
http://setiathome.berkeley.edu/forum_thread.php?id=51175


THX for Your tips Raistmer. I'm gonna try to sort this out tomorrow. I'll check forum twice before my next question.
ID: 852758 · Report as offensive
Maik

Send message
Joined: 15 May 99
Posts: 163
Credit: 9,208,555
RAC: 0
Germany
Message 852765 - Posted: 12 Jan 2009, 21:59:19 UTC - in response to Message 852757.  
Last modified: 12 Jan 2009, 22:10:43 UTC

I have been reading about the CUDA and how fast it is or isn't.
I am undecided ...

There are two kind of WU's sent out.
1st one needs around 9min
2nd needs around 27min

reference: the cpu on my host needs at least twice
ID: 852765 · Report as offensive
Roberto Patriarca

Send message
Joined: 17 May 99
Posts: 22
Credit: 1,967,389
RAC: 0
Italy
Message 852979 - Posted: 13 Jan 2009, 8:28:36 UTC - in response to Message 852731.  


If you made some useful improvements it's worth to publish your own script too - maybe it will help someone... ;)


Oh, it just sorts the WUs by true_angle and prints out the results. The only difference is that it is written using Unix shell commands (Windows + Cygwin) because I like bash better than DOS.

Now I was wondering whether I can delete the WU files my CUDA devices are not able to process instead of aborting them manually. I don't know how BoincManager behaves in case of missing files.

By the way, I needed to abort all task with angle < 0.012, because CUDA got struck on one unit with angle = 0.012. That made some 30% of all WUs in the last batch I downloaded. I hope I will not be marked as a "bad" user of sort in S&H databases. My computer has been running fine for 18 hours now, thought.

Thanks again for your help!

Roberto
ID: 852979 · Report as offensive
Profile Byron S Goodgame
Volunteer tester
Avatar

Send message
Joined: 16 Jan 06
Posts: 1145
Credit: 3,936,993
RAC: 0
United States
Message 852980 - Posted: 13 Jan 2009, 8:35:37 UTC - in response to Message 852979.  
Last modified: 13 Jan 2009, 8:45:03 UTC

Now I was wondering whether I can delete the WU files my CUDA devices are not able to process instead of aborting them manually. I don't know how BoincManager behaves in case of missing files.

It's better that you abort tasks instead of deleting them unless you following the instructions in this post.

By the way, I needed to abort all task with angle < 0.012, because CUDA got struck on one unit with angle = 0.012. That made some 30% of all WUs in the last batch I downloaded. I hope I will not be marked as a "bad" user of sort in S&H databases. My computer has been running fine for 18 hours now, thought.


Those are Very Low Angle Range or VLAR tasks (0.1 angle range and below) and it's understood those of us using CUDA will be aborting them for now. No one will hold it against you.
ID: 852980 · Report as offensive
1 · 2 · 3 · Next

Message boards : Number crunching : CUDA and no SETI


 
©2025 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.