V10 of modified SETI MB CUDA + opt AP package for full multi-GPU+CPU use

Message boards : Number crunching : V10 of modified SETI MB CUDA + opt AP package for full multi-GPU+CPU use
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 4 · 5 · 6 · 7 · 8 · 9 · 10 . . . 15 · Next

AuthorMessage
Profile Bob Mahoney Design
Avatar

Send message
Joined: 4 Apr 04
Posts: 178
Credit: 9,205,632
RAC: 0
United States
Message 882118 - Posted: 3 Apr 2009, 23:24:29 UTC
Last modified: 3 Apr 2009, 23:26:01 UTC

Raistmer, it was a special moment...

I just installed BOINC 6.6.20.
Then I installed your new v10/11 with no affinity and no CPU->GPU.
Then I ran it for 30 minutes.

Then I got this validated task result:

http://setiathome.berkeley.edu/workunit.php?wuid=429443399

It was a WU that validated with your machine.

Isn't that an amazing service Raistmer provides? He works on Opti Apps, and he validates your WU's for you, too!

The new software is running perfect. Thanks for all the work you did on it.

Bob
Opinion stated as fact? Who, me?
ID: 882118 · Report as offensive
Profile -=SuperG=-
Avatar

Send message
Joined: 3 Apr 99
Posts: 63
Credit: 89,161,651
RAC: 23
Canada
Message 882201 - Posted: 4 Apr 2009, 8:10:21 UTC - in response to Message 882118.  
Last modified: 4 Apr 2009, 8:12:50 UTC

Raistmer, it was a special moment...

I just installed BOINC 6.6.20.
Then I installed your new v10/11 with no affinity and no CPU->GPU.
Then I ran it for 30 minutes.

Then I got this validated task result:

http://setiathome.berkeley.edu/workunit.php?wuid=429443399

It was a WU that validated with your machine.

Isn't that an amazing service Raistmer provides? He works on Opti Apps, and he validates your WU's for you, too!

The new software is running perfect. Thanks for all the work you did on it.

Bob


That's hilarious.. Bob.. would you mind handing over some of that RAW power... woot... :P It still takes me 40 minutes to complete a similar task.. Giggle..:P

And Raistmer.. it's running perfect for me too.. hee hee..

"still giggling"...

G
Boinc Wiki




"Great spirits have always encountered violent opposition from mediocre minds." -Albert Einstein
ID: 882201 · Report as offensive
Profile Dirk Sadowski
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 882742 - Posted: 6 Apr 2009, 14:03:40 UTC
Last modified: 6 Apr 2009, 14:04:52 UTC


What could be here the prob?

With Raistmer's V7 kill mod I didn't had this prob.

Now with Raistmer's V7 .dll's and V10 CUDA mod app.

The 2. time I got this error:
http://setiathome.berkeley.edu/result.php?resultid=1196613052

'Out Of Memory' ?

My CPU/RAM/mobo isn't OCed.
My GPUs are OCed from manufacturer.

ID: 882742 · Report as offensive
samuel7
Volunteer tester

Send message
Joined: 2 Jan 00
Posts: 47
Credit: 2,194,240
RAC: 0
Finland
Message 882765 - Posted: 6 Apr 2009, 16:07:29 UTC - in response to Message 882084.  

Please, keep posting stderr output about "-12 triplet error", it contains some useful info along with error message.

Here's one from my rig with V10 (task #1197925306):
Exception detected inside cudaAcc_find_triplets, dumping client state
icfft=73222, PoT_activity=0, PoT_freq_bin=-1SETI@home error -12 Unknown error
cudaAcc_find_triplets erroneously found a triplet twice in find_triplets_kernel
File: ..\analyzePoT.cpp
Line: 348

ID: 882765 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 882779 - Posted: 6 Apr 2009, 17:09:45 UTC - in response to Message 882742.  


What could be here the prob?

With Raistmer's V7 kill mod I didn't had this prob.

Now with Raistmer's V7 .dll's and V10 CUDA mod app.

The 2. time I got this error:
http://setiathome.berkeley.edu/result.php?resultid=1196613052

'Out Of Memory' ?

My CPU/RAM/mobo isn't OCed.
My GPUs are OCed from manufacturer.


Try the .dll's from Raistmer's original V10 download,
I'm using them with no problem.

Claggy
ID: 882779 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 882863 - Posted: 6 Apr 2009, 21:18:49 UTC - in response to Message 882742.  


What could be here the prob?

With Raistmer's V7 kill mod I didn't had this prob.

Now with Raistmer's V7 .dll's and V10 CUDA mod app.

The 2. time I got this error:
http://setiathome.berkeley.edu/result.php?resultid=1196613052

'Out Of Memory' ?

My CPU/RAM/mobo isn't OCed.
My GPUs are OCed from manufacturer.


CPUID: AMD Processor model unknown

Cache: L1=64K L2=512K
strange, unknown CPU....
ID: 882863 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 882865 - Posted: 6 Apr 2009, 21:19:27 UTC - in response to Message 882765.  

Please, keep posting stderr output about "-12 triplet error", it contains some useful info along with error message.

Here's one from my rig with V10 (task #1197925306):
Exception detected inside cudaAcc_find_triplets, dumping client state
icfft=73222, PoT_activity=0, PoT_freq_bin=-1SETI@home error -12 Unknown error
cudaAcc_find_triplets erroneously found a triplet twice in find_triplets_kernel
File: ..\analyzePoT.cpp
Line: 348

thanks
ID: 882865 · Report as offensive
Profile Dirk Sadowski
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 882869 - Posted: 6 Apr 2009, 21:27:03 UTC - in response to Message 882863.  
Last modified: 6 Apr 2009, 21:29:30 UTC


What could be here the prob?

With Raistmer's V7 kill mod I didn't had this prob.

Now with Raistmer's V7 .dll's and V10 CUDA mod app.

The 2. time I got this error:
http://setiathome.berkeley.edu/result.php?resultid=1196613052

'Out Of Memory' ?

My CPU/RAM/mobo isn't OCed.
My GPUs are OCed from manufacturer.


CPUID: AMD Processor model unknown

Cache: L1=64K L2=512K
strange, unknown CPU....


Nothing specially.. or secret.. ;-D

I have the 'old' BIOS (V1.5) on my Mobo. [MSI K9A2 Platinum]
In BIOS V1.7 I should/would have the new CPU Name..
It's the new AMD Phenom II X4 940 BE.. :-)

But.. ;-) ..you could guess/know why I got this error [to now 2. times] ?


..with your V7 mod I got not this error..

ID: 882869 · Report as offensive
Profile Voyager
Volunteer tester
Avatar

Send message
Joined: 2 Nov 99
Posts: 602
Credit: 3,264,813
RAC: 0
United States
Message 882927 - Posted: 6 Apr 2009, 23:47:51 UTC
Last modified: 6 Apr 2009, 23:52:23 UTC

Iv'e got these lately, they both completed after falling back to host cpu.
with run time error not enough memory
using v10
this one
and this one
ID: 882927 · Report as offensive
Profile Dirk Sadowski
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 882988 - Posted: 7 Apr 2009, 3:50:50 UTC


Also to my upper question.. :-)

..

Since it's not allowed to ask CUDA related questions in the NC forum..

Profis - please, have a look in the Questions & answers : CUDA :

> BOINC V6.6.11 and 1/2 CUDA performance - thread.


Thanks a lot!

ID: 882988 · Report as offensive
Profile Dirk Sadowski
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 882998 - Posted: 7 Apr 2009, 5:15:34 UTC


This are the two 'Out Of Memory' errors:
http://setiathome.berkeley.edu/result.php?resultid=1195475990


[This error I posted already in this thread]
http://setiathome.berkeley.edu/result.php?resultid=1196613052


-----------------------------------------------------------------


http://setiathome.berkeley.edu/result.php?resultid=1195495666
Exception detected inside cudaAcc_find_triplets, dumping client state
icfft=86040, PoT_activity=0, PoT_freq_bin=-1SETI@home error -12 Unknown error
cudaAcc_find_triplets erroneously found a triplet twice in find_triplets_kernel
File: ..\analyzePoT.cpp
Line: 348



http://setiathome.berkeley.edu/result.php?resultid=1190297258
Exception detected inside cudaAcc_find_triplets, dumping client state
icfft=94365, PoT_activity=0, PoT_freq_bin=-1SETI@home error -12 Unknown error
cudaAcc_find_triplets erroneously found a triplet twice in find_triplets_kernel
File: ..\analyzePoT.cpp
Line: 348



[This error I posted already in this thread]
http://setiathome.berkeley.edu/result.php?resultid=1195160891
Exception detected inside cudaAcc_find_triplets, dumping client state
icfft=86665, PoT_activity=0, PoT_freq_bin=-1SETI@home error -12 Unknown error
cudaAcc_find_triplets erroneously found a triplet twice in find_triplets_kernel
File: ..\analyzePoT.cpp
Line: 348

ID: 882998 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 883305 - Posted: 8 Apr 2009, 6:16:26 UTC - in response to Message 882927.  

Iv'e got these lately, they both completed after falling back to host cpu.
with run time error not enough memory
using v10
this one
and this one

Total GPU memory 335216640 free GPU memory 64638976
nothing unusual - your host had too small free memory amount indeed.
True question is not why it fallback to CPU but why on starting time it had so low amount of free GPU memory available?
ID: 883305 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 883306 - Posted: 8 Apr 2009, 6:21:57 UTC - in response to Message 882988.  


Also to my upper question.. :-)

..

Since it's not allowed to ask CUDA related questions in the NC forum..

Profis - please, have a look in the Questions & answers : CUDA :

> BOINC V6.6.11 and 1/2 CUDA performance - thread.


Thanks a lot!

NC == Number crunching. Who thinks that GPU doesn't do number crunching ?? GPU performance topics look absolute relevant to this forum as much as Cell blade performance, CPU performance and motherboard/RAM performance....
Monitoring many different places is boring actually...
ID: 883306 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 883308 - Posted: 8 Apr 2009, 6:26:32 UTC - in response to Message 882998.  



[This error I posted already in this thread]


Thanks.
It seems this error occurs in different places. More statistic will be good.
I start to put such errors in table.
Only line with state dump (like icfft=86665, PoT_activity=0, PoT_freq_bin=-1SETI@home error -12 Unknown error) worth to post.
Thanks again to all who participate in collection of these bug reports.
ID: 883308 · Report as offensive
Profile Misfit
Volunteer tester
Avatar

Send message
Joined: 21 Jun 01
Posts: 21804
Credit: 2,815,091
RAC: 0
United States
Message 883586 - Posted: 9 Apr 2009, 0:29:16 UTC - in response to Message 883306.  

NC == Number crunching. Who thinks that GPU doesn't do number crunching ??

Admin/Mods apparently.
me@rescam.org
ID: 883586 · Report as offensive
OzzFan Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Apr 02
Posts: 15691
Credit: 84,761,841
RAC: 28
United States
Message 883595 - Posted: 9 Apr 2009, 1:11:57 UTC - in response to Message 883586.  

NC == Number crunching. Who thinks that GPU doesn't do number crunching ??

Admin/Mods apparently.


Not me.

But the reason for the CUDA forum is permissions. People who may have just installed BOINC and have no RAC can't post over here. The entire Q&A section of the message boards allows people with no RAC to post if they have a problem.
ID: 883595 · Report as offensive
Profile Dirk Sadowski
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 883628 - Posted: 9 Apr 2009, 3:52:17 UTC
Last modified: 9 Apr 2009, 3:55:55 UTC


O.K., but there in the 'Questions & answers : CUDA :' - area are only 3 or 4 people around which could have knowledge - ..or not.

Here in the NC are more people around which could have knowledge because of special topics or problems.

..therefore it would be better to allow CUDA related topics here in the NC.


..or make a new NC-CUDA under-forum..

ID: 883628 · Report as offensive
OzzFan Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Apr 02
Posts: 15691
Credit: 84,761,841
RAC: 28
United States
Message 883671 - Posted: 9 Apr 2009, 11:28:11 UTC - in response to Message 883628.  


O.K., but there in the 'Questions & answers : CUDA :' - area are only 3 or 4 people around which could have knowledge - ..or not.

Here in the NC are more people around which could have knowledge because of special topics or problems.

..therefore it would be better to allow CUDA related topics here in the NC.


..or make a new NC-CUDA under-forum..


It depends on the nature of the CUDA question. If we don't allow CUDA questions here, as you suggest, then why are Raistmer's threads here? The more technically advanced posts stay here in Number Crunching. The simple questions go into the CUDA forum. There is no need for an NC-CUDA forum.
ID: 883671 · Report as offensive
Profile Misfit
Volunteer tester
Avatar

Send message
Joined: 21 Jun 01
Posts: 21804
Credit: 2,815,091
RAC: 0
United States
Message 883870 - Posted: 10 Apr 2009, 5:18:46 UTC - in response to Message 883595.  
Last modified: 10 Apr 2009, 5:22:21 UTC

NC == Number crunching. Who thinks that GPU doesn't do number crunching ??

Admin/Mods apparently.

Not me.

But the reason for the CUDA forum is permissions. People who may have just installed BOINC and have no RAC can't post over here. The entire Q&A section of the message boards allows people with no RAC to post if they have a problem.

The post permissions were only supposed to be imposed on the Cafe. That's where the problems were when it was implemented.

It depends on the nature of the CUDA question. If we don't allow CUDA questions here, as you suggest, then why are Raistmer's threads here? The more technically advanced posts stay here in Number Crunching. The simple questions go into the CUDA forum. There is no need for an NC-CUDA forum.

That's all relative. Who's to say what is simple and what is advanced? Most likely the poster. So you're right back to the only criteria being the post permission.
me@rescam.org
ID: 883870 · Report as offensive
OzzFan Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Apr 02
Posts: 15691
Credit: 84,761,841
RAC: 28
United States
Message 883929 - Posted: 10 Apr 2009, 12:50:09 UTC - in response to Message 883870.  

NC == Number crunching. Who thinks that GPU doesn't do number crunching ??

Admin/Mods apparently.

Not me.

But the reason for the CUDA forum is permissions. People who may have just installed BOINC and have no RAC can't post over here. The entire Q&A section of the message boards allows people with no RAC to post if they have a problem.

The post permissions were only supposed to be imposed on the Cafe. That's where the problems were when it was implemented.


Its not up to me on the post permissions. I do not have control over that. All I know is that the entire board except all of Q&A need to have a RAC of 1 to create a post.

It depends on the nature of the CUDA question. If we don't allow CUDA questions here, as you suggest, then why are Raistmer's threads here? The more technically advanced posts stay here in Number Crunching. The simple questions go into the CUDA forum. There is no need for an NC-CUDA forum.

That's all relative. Who's to say what is simple and what is advanced? Most likely the poster. So you're right back to the only criteria being the post permission.


Which is why I mod based on personal judgement. So far I've had no complaints. Then again, this is all of topic in Raistmer's thread, and I doubt he'd take kindly to us hijacking it for this purpose.
ID: 883929 · Report as offensive
Previous · 1 . . . 4 · 5 · 6 · 7 · 8 · 9 · 10 . . . 15 · Next

Message boards : Number crunching : V10 of modified SETI MB CUDA + opt AP package for full multi-GPU+CPU use


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.