V10 of modified SETI MB CUDA + opt AP package for full multi-GPU+CPU use

Message boards : Number crunching : V10 of modified SETI MB CUDA + opt AP package for full multi-GPU+CPU use
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 6 · 7 · 8 · 9 · 10 · 11 · 12 . . . 15 · Next

AuthorMessage
FiveHamlet
Avatar

Send message
Joined: 5 Oct 99
Posts: 783
Credit: 32,638,578
RAC: 0
United Kingdom
Message 884396 - Posted: 11 Apr 2009, 22:06:34 UTC - in response to Message 884389.  
Last modified: 11 Apr 2009, 22:07:42 UTC

What makes the difference the opp app you are using or the machine?
Or the o/clocking?
ID: 884396 · Report as offensive
OzzFan Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Apr 02
Posts: 15691
Credit: 84,761,841
RAC: 28
United States
Message 884405 - Posted: 11 Apr 2009, 22:29:49 UTC - in response to Message 884396.  

What makes the difference the opp app you are using or the machine?
Or the o/clocking?


All three. A machine overclocked well enough running on decent CPU architecture (or GPU) using an optimized app. If overclocking is not your thing, then a fast CPU using an optimized app is your next best option.
ID: 884405 · Report as offensive
FiveHamlet
Avatar

Send message
Joined: 5 Oct 99
Posts: 783
Credit: 32,638,578
RAC: 0
United Kingdom
Message 884409 - Posted: 11 Apr 2009, 22:36:06 UTC - in response to Message 884405.  

I understand the i7 920 that I have running can be o/clocked but I
Have to get to grips with the opp app's first LOL.
To many things for this brain to take in at the moment.
Dave
ID: 884409 · Report as offensive
Profile Dirk Sadowski
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 884686 - Posted: 12 Apr 2009, 18:55:26 UTC - in response to Message 882998.  
Last modified: 12 Apr 2009, 18:59:02 UTC


This are the two 'Out Of Memory' errors:
http://setiathome.berkeley.edu/result.php?resultid=1195475990


[This error I posted already in this thread]
http://setiathome.berkeley.edu/result.php?resultid=1196613052
...


I give your V10 CUDA app again and the opt. AK V8.0 a try.. :-)
..I'm curious and confused that I have now well CUDA performance with BOINC V6.6.20.. ;-)

Hmm.. I don't know why..

But.. I got now 3 more errors.. 'out of memory':
http://setiathome.berkeley.edu/result.php?resultid=1203126308
http://setiathome.berkeley.edu/result.php?resultid=1203126304
http://setiathome.berkeley.edu/result.php?resultid=1203125400

My rig isn't OCed.. only the GPUs from the manufacturer.

'out of memory' is meant the GPU- or the mobo- RAM ?

What could be the prob?


EDIT:
BTW.
Which .dll's are the best choice?
From the V10 CUDA app?
I read that some had probs with newer .dll's.

ID: 884686 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 884706 - Posted: 12 Apr 2009, 19:55:52 UTC - in response to Message 884686.  

Looks like system RAM, not GPU's.
Try both DLL sets and use what best suited to your system.
If this is new RAM check it with some memory test utility...
ID: 884706 · Report as offensive
Profile Dirk Sadowski
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 884987 - Posted: 13 Apr 2009, 17:05:12 UTC
Last modified: 13 Apr 2009, 18:02:29 UTC


Are you sure? ;-D

..kidding..


The 800 MHz RAM ran @ 944 MHz @ 5-5-5-18 @ 2.1 V - in the QX6700 rig for 1 1/2 years. [with active cooling]
Now in the AMD 940 BE rig @ 800 MHz @ 5-5-5-18 @ 1.8 V* [all AUTO]
This are Corsair xms2 6400C4 sticks. 5-5-5-18 @ 1.8 V stock or 4-4-4-12 @ 2.1 V .
With memtest [@ QX6700 rig in past] I had some [~ 3 - 5] errors only @ 4-4-4-12, 5-5-5-18 worked fine. [800 MHz]
O.K., a well reason for warranty change in past.. but the Core2 was better/faster with 944/5-5-5-18 as 788/4-4-4-12, so it was O.K. ..

If I remember correct, since your 'GPU WU start opt.' I have this errors.

I'm the only one..? If yes - O.K., then it's my RAM.. ;-)

[* I guess the voltage]
Is there a prog with which I can see the RAM voltage in Windows?
Current at AUTO, so maybe not enough or to low voltage..


Since BOINC V6.6.20 your nice team mod isn't needed anymore.
Maybe you can open a new thread with the latest available CUDA versions? [V10/11]
[incl. dll's and needed app_info.xml]

Or maybe a hint to the app_info.xml thread?
app_info for AP500, AP503, MB603 and MB608


Which/where are the .dll-versions from you?
Please could you post where to find?
Or this are the .dll's from your V7 and V10 mod. This are the only different .dll-versions from you?
[in the 1st posts of the threads @ lunatics.kwsn.net?]

I could use the stock Berkeley .dll's also with your mod?
Or what's different?
I have no knowledge for what the .dll's are needed.. I'm not the profi.. I'm an 'amateur'.. ;-)

ID: 884987 · Report as offensive
Profile Dirk Sadowski
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 884989 - Posted: 13 Apr 2009, 17:19:55 UTC - in response to Message 882998.  
Last modified: 13 Apr 2009, 17:22:46 UTC


http://setiathome.berkeley.edu/result.php?resultid=1195495666
icfft=86040, PoT_activity=0, PoT_freq_bin=-1SETI@home error -12 Unknown error

http://setiathome.berkeley.edu/result.php?resultid=1190297258
icfft=94365, PoT_activity=0, PoT_freq_bin=-1SETI@home error -12 Unknown error

[This error I posted already in this thread]
http://setiathome.berkeley.edu/result.php?resultid=1195160891
icfft=86665, PoT_activity=0, PoT_freq_bin=-1SETI@home error -12 Unknown error


Now my 4th..
http://setiathome.berkeley.edu/result.php?resultid=1202303914
icfft=84509, PoT_activity=0, PoT_freq_bin=-1SETI@home error -12 Unknown error


[@ all - I'm the only one which see this errors? *confused*]

Or I'm the only one which see/look to the error column in the overview. ;-D [Now it's easy to find because of the choice of the look]
But need also some time, because of the 'bad WU header's '.. ;-)

ID: 884989 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 66431
Credit: 55,293,173
RAC: 49
United States
Message 884997 - Posted: 13 Apr 2009, 17:29:34 UTC - in response to Message 884989.  


http://setiathome.berkeley.edu/result.php?resultid=1195495666
icfft=86040, PoT_activity=0, PoT_freq_bin=-1SETI@home error -12 Unknown error

http://setiathome.berkeley.edu/result.php?resultid=1190297258
icfft=94365, PoT_activity=0, PoT_freq_bin=-1SETI@home error -12 Unknown error

[This error I posted already in this thread]
http://setiathome.berkeley.edu/result.php?resultid=1195160891
icfft=86665, PoT_activity=0, PoT_freq_bin=-1SETI@home error -12 Unknown error


Now my 4th..
http://setiathome.berkeley.edu/result.php?resultid=1202303914
icfft=84509, PoT_activity=0, PoT_freq_bin=-1SETI@home error -12 Unknown error


[@ all - I'm the only one which see this errors? *confused*]

Or I'm the only one which see/look to the error column in the overview. ;-D [Now it's easy to find because of the choice of the look]
But need also some time, because of the 'bad WU header's '.. ;-)

I've only seen Error -6 when I ran two gpus of My GTX295 video card and processed 2D video at the same time and yes this was at the 295's stock speed too, So I went back to using 1 gpu instead and have overclocked the GTX295 to 641MHz(briefly tried 648MHz but got an error -6 according to BV 1.4.2), Fan is on 100% and ram(1101MHz) and shaders(1382MHz) are overclocked and I'm seeing no problems.
Savoir-Faire is everywhere!
The T1 Trust, T1 Class 4-4-4-4 #5550, America's First HST

ID: 884997 · Report as offensive
samuel7
Volunteer tester

Send message
Joined: 2 Jan 00
Posts: 47
Credit: 2,194,240
RAC: 0
Finland
Message 885008 - Posted: 13 Apr 2009, 17:48:05 UTC - in response to Message 884989.  


[@ all - I'm the only one which see this errors? *confused*]

Or I'm the only one which see/look to the error column in the overview. ;-D [Now it's easy to find because of the choice of the look]
But need also some time, because of the 'bad WU header's '.. ;-)


You're not quite alone, I reported one here.

They certainly are easier to find with the task list filter. Also, I don't even look at ones with <1 sec CPU time anymore, since they all seem to be VLAR kills.

ID: 885008 · Report as offensive
Profile Dirk Sadowski
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 885009 - Posted: 13 Apr 2009, 17:48:20 UTC - in response to Message 884997.  

I've only seen Error -6 when I ran two gpus of My GTX295 video card and processed 2D video at the same time and yes this was at the 295's stock speed too, So I went back to using 1 gpu instead and have overclocked the GTX295 to 641MHz(briefly tried 648MHz but got an error -6 according to BV 1.4.2), Fan is on 100% and ram(1101MHz) and shaders(1382MHz) are overclocked and I'm seeing no problems.


For Raistmer it's for interest the -12 error.
Because it's a BUG in the current CUDA code. [stock - and because of this also in his app]

You let run the fan at 100 % ??
Woohoo.. I hope the rig is very far from the place you live..
Because I read the fan is veeery loud..

Because of this I bought 2 x GTX260 Core216 and not a GTX295 - the GPUs are much hotter and the fan must turn much faster for 2 GPUs as only 1 GPU in the same GPU case.

ID: 885009 · Report as offensive
Profile Dirk Sadowski
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 885010 - Posted: 13 Apr 2009, 17:55:39 UTC - in response to Message 885008.  

You're not quite alone, I reported one here.

They certainly are easier to find with the task list filter. Also, I don't even look at ones with <1 sec CPU time anymore, since they all seem to be VLAR kills.


Ops.. I didn't saw..

You are right.. the ~ 1 sec. errors are only from the 'VLAR kill'..
The -12 error occurs in the task process.. so you see some seconds of processing..

ID: 885010 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 66431
Credit: 55,293,173
RAC: 49
United States
Message 885023 - Posted: 13 Apr 2009, 18:24:53 UTC - in response to Message 885009.  

I've only seen Error -6 when I ran two gpus of My GTX295 video card and processed 2D video at the same time and yes this was at the 295's stock speed too, So I went back to using 1 gpu instead and have overclocked the GTX295 to 641MHz(briefly tried 648MHz but got an error -6 according to BV 1.4.2), Fan is on 100% and ram(1101MHz) and shaders(1382MHz) are overclocked and I'm seeing no problems.


For Raistmer it's for interest the -12 error.
Because it's a BUG in the current CUDA code. [stock - and because of this also in his app]

You let run the fan at 100 % ??
Woohoo.. I hope the rig is very far from the place you live..
Because I read the fan is veeery loud..

Because of this I bought 2 x GTX260 Core216 and not a GTX295 - the GPUs are much hotter and the fan must turn much faster for 2 GPUs as only 1 GPU in the same GPU case.

Oh contraire, It's not that loud, The ATi cooling fan is much louder and so My Delta 120mm 150cfm cpu fan is slightly louder than the 295's fan and yet less loud than the ATi fan, So It's not too bad as I can hear the hdd when the head is moving around inside the hdd.
Savoir-Faire is everywhere!
The T1 Trust, T1 Class 4-4-4-4 #5550, America's First HST

ID: 885023 · Report as offensive
Profile Dirk Sadowski
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 885030 - Posted: 13 Apr 2009, 18:44:22 UTC - in response to Message 885023.  
Last modified: 13 Apr 2009, 18:45:19 UTC

Oh contraire, It's not that loud, The ATi cooling fan is much louder and so My Delta 120mm 150cfm cpu fan is slightly louder than the 295's fan and yet less loud than the ATi fan, So It's not too bad as I can hear the hdd when the head is moving around inside the hdd.


Hmm.. it depend what you feel as loud.. ;-)

If your Delta with 150 cfm - this must be 255 m³/h [for the europeans/germans around here.. ;-)]..
..then I guess it must be ~ 45 dBA.. huuhh.. too loud for me.. ;-D

Everything > 25 dBA is toooo loud for me.. :-)

ID: 885030 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 66431
Credit: 55,293,173
RAC: 49
United States
Message 885035 - Posted: 13 Apr 2009, 18:56:00 UTC - in response to Message 885030.  

Oh contraire, It's not that loud, The ATi cooling fan is much louder and so My Delta 120mm 150cfm cpu fan is slightly louder than the 295's fan and yet less loud than the ATi fan, So It's not too bad as I can hear the hdd when the head is moving around inside the hdd.


Hmm.. it depend what you feel as loud.. ;-)

If your Delta with 150 cfm - this must be 255 m³/h [for the Europeans/Germans around here.. ;-)]..
..then I guess it must be ~ 45 dBA.. huuhh.. too loud for me.. ;-D

Everything > 25 dBA is too loud for me.. :-)

You get used to It, It used to be much louder in the old case which used high speed fans to get around the bottlenecks in the case design, the New case flows more air and so the stock fans(all 4) in the Coolermaster HAF 932 case are quieter and when I get enough money together for an Thermalright IFX-14 heatsink I'm going to try dual 120mm 100cfm fans to make the PC quieter still and yet still cool the cpu effectively.
Savoir-Faire is everywhere!
The T1 Trust, T1 Class 4-4-4-4 #5550, America's First HST

ID: 885035 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 885076 - Posted: 13 Apr 2009, 20:20:42 UTC - in response to Message 884987.  




Since BOINC V6.6.20 your nice team mod isn't needed anymore.
Maybe you can open a new thread with the latest available CUDA versions? [V10/11]
[incl. dll's and needed app_info.xml]

Or maybe a hint to the app_info.xml thread?
app_info for AP500, AP503, MB603 and MB608


Which/where are the .dll-versions from you?
Please could you post where to find?
Or this are the .dll's from your V7 and V10 mod. This are the only different .dll-versions from you?
[in the 1st posts of the threads @ lunatics.kwsn.net?]

I could use the stock Berkeley .dll's also with your mod?
Or what's different?
I have no knowledge for what the .dll's are needed.. I'm not the profi.. I'm an 'amateur'.. ;-)


Yes, a new thread is really required for Raistmer's V10/V11 app in Boinc 6.6.20 standalone mode,
and a new download with the required .dll's so at least you get everything in one package. (If you really need them, or if you can make do if stock .dll's)

Claggy
ID: 885076 · Report as offensive
FiveHamlet
Avatar

Send message
Joined: 5 Oct 99
Posts: 783
Credit: 32,638,578
RAC: 0
United Kingdom
Message 885093 - Posted: 13 Apr 2009, 20:47:30 UTC

Question
I seem to recall seeing a 64bit Cuda opp app but cant find the link,is there any advantage to running this app?
I have v10 running at the moment and all is well.
I have the app for vlar kill but I get errors when BM starts.
What am I doing wrong?
Sorry if this is the wrong place to post this.

ID: 885093 · Report as offensive
Profile Dirk Sadowski
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 885106 - Posted: 13 Apr 2009, 21:20:29 UTC
Last modified: 13 Apr 2009, 21:31:06 UTC


Raistmer.. sorry, I hope you don't feel spammed from my posts.. [many in a row]
But maybe I found something interesting?



Hmm.. what happened here.. same MB AR WU but 618.8* (CPU 100.625) sec. and 301.8* (CPU 66.21875) sec.
[* Wall-clock time elapsed since last restart]

http://setiathome.berkeley.edu/result.php?resultid=1203882723

http://setiathome.berkeley.edu/result.php?resultid=1203882724



Also..
Normally a MB AR WU 0.44x - ~ 480 sec. - ~ 8 min. [~ 52 credits**]

0.547505 - 868.6 sec. - ~ 14.5 min. [45 credits**]
http://setiathome.berkeley.edu/result.php?resultid=1203924655

1.066794 - 1042.6 sec. - ~ 17.5 min. [30 credits**]
http://setiathome.berkeley.edu/result.php?resultid=1204001983

[** incl. CUDA overclaim!]

Is this normal?
AFAIK, < AR means > calculation time .
This is now exactly the contrary.. *confused*


Finally..
Or is the CPU/real wall clock time counter broken in BOINC V6.6.20 ?
I wasn't in front of the PC.. so I have only this counter times.


EDIT:
Or is this again my in past seen 1/2 CUDA performance lost with > BOINC V6.6.11 and your V7 and V10 CUDA app?
Current I use BOINC V6.6.20 .

ID: 885106 · Report as offensive
Profile Dirk Sadowski
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 885108 - Posted: 13 Apr 2009, 21:36:32 UTC - in response to Message 885093.  

Question
I seem to recall seeing a 64bit Cuda opp app but cant find the link,is there any advantage to running this app?
I have v10 running at the moment and all is well.
I have the app for vlar kill but I get errors when BM starts.
What am I doing wrong?
Sorry if this is the wrong place to post this.


Only CUDA 32 bit available.
On GPU you don't have 32 or 64 bit.

Or you mean the applications for the CPU?


If you have a bunch of VLARs, they will be 'killed' immediately..

Or all ARs will be errored?

ID: 885108 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 885115 - Posted: 13 Apr 2009, 22:17:13 UTC - in response to Message 885108.  
Last modified: 13 Apr 2009, 22:19:00 UTC

I did x64 CUDA MB build but there was no speed increase so not worth to support it for now. Maybe later it will show better results.

About CUDA performance - if the core where CUDA MB executed is occupied with some higher priority process (with priority higher than CUDA MB worker thread one) the app can experience substantional performance degradation. That's why CPU affinity mod was created (to give user with multicore CPUs ability to bound other processes on higher core numbers while leaving first core mostly for CUDA MB.) Other BOINC apps executed on even more low priority and should not interfere with CUDA MB.
ID: 885115 · Report as offensive
FiveHamlet
Avatar

Send message
Joined: 5 Oct 99
Posts: 783
Credit: 32,638,578
RAC: 0
United Kingdom
Message 885127 - Posted: 13 Apr 2009, 22:33:26 UTC - in response to Message 885115.  

Thanks for the info.
Can you tell me politly where MB_6.08_mod_VLAR_kill_CUDA.exe is put in the app_info.xml?
Here is my info file.I have input the flops do I have to put the numbers after the decimal point in? Does it look ok to run?

<app_info>
<app>
<name>astropulse</name>
</app>
<file_info>
<name>ap_5.00r103_SSE3.exe</name>
<executable/>
</file_info>
<app_version>
<app_name>astropulse</app_name>
<version_num>500</version_num>
<flops>5233741404.0081975</flops>
<file_ref>
<file_name>ap_5.00r103_SSE3.exe</file_name>
<main_program/>
</file_ref>
</app_version>
<app>
<name>astropulse_v5</name>
</app>
<file_info>
<name>ap_5.03r112_SSE3.exe</name>
<executable/>
</file_info>
<app_version>
<app_name>astropulse_v5</app_name>
<version_num>503</version_num>
<flops>6047878955.742806</flops>
<file_ref>
<file_name>ap_5.03r112_SSE3.exe</file_name>
<main_program/>
</file_ref>
</app_version>
<app>
<name>setiathome_enhanced</name>
</app>
<file_info>
<name>AK_v8b_win_SSE3_AMD_GPU_CPU_team_V10.exe</name>
<executable/>
</file_info>
<file_info>
<name>cudart.dll</name>
<executable/>
</file_info>
<file_info>
<name>cufft.dll</name>
<executable/>
</file_info>
<file_info>
<name>libfftw3f-3-1-1a_upx.dll</name>
<executable/>
</file_info>
<file_info>
<name>MB_6.08_mod_CUDA_V10.exe</name>
<executable/>
</file_info>

<app_version>
<app_name>setiathome_enhanced</app_name>
<version_num>528</version_num>
<file_ref>
<file_name>AK_v8b_win_SSE3_AMD_GPU_CPU_team_V10.exe</file_name>
<main_program/>
</file_ref>
<file_ref>
<file_name>cudart.dll</file_name>
</file_ref>
<file_ref>
<file_name>cufft.dll</file_name>
</file_ref>
<file_ref>
<file_name>libfftw3f-3-1-1a_upx.dll</file_name>
</file_ref>
<file_ref>
<file_name>MB_6.08_mod_CUDA_V10.exe</file_name>
</file_ref>
</app_version>

<app_version>
<app_name>setiathome_enhanced</app_name>
<version_num>603</version_num>
<flops>4070687758.6730425</flops>
<file_ref>
<file_name>AK_v8b_win_SSE3_AMD_GPU_CPU_team_V10.exe</file_name>
<main_program/>
</file_ref>
<file_ref>
<file_name>cudart.dll</file_name>
</file_ref>
<file_ref>
<file_name>cufft.dll</file_name>
</file_ref>
<file_ref>
<file_name>libfftw3f-3-1-1a_upx.dll</file_name>
</file_ref>
<file_ref>
<file_name>MB_6.08_mod_CUDA_V10.exe</file_name>
</file_ref>
</app_version>


<app_version>
<app_name>setiathome_enhanced</app_name>
<version_num>607</version_num>
<file_ref>
<file_name>AK_v8b_win_SSE3_AMD_GPU_CPU_team_V10.exe</file_name>
<main_program/>
</file_ref>
<file_ref>
<file_name>cudart.dll</file_name>
</file_ref>
<file_ref>
<file_name>cufft.dll</file_name>
</file_ref>
<file_ref>
<file_name>libfftw3f-3-1-1a_upx.dll</file_name>
</file_ref>
<file_ref>
<file_name>MB_6.08_mod_CUDA_V10.exe</file_name>
</file_ref>
</app_version>

<app_version>
<app_name>setiathome_enhanced</app_name>
<version_num>608</version_num>
<flops>7400000000</flops>
<file_ref>
<file_name>AK_v8b_win_SSE3_AMD_GPU_CPU_team_V10.exe</file_name>
<main_program/>
</file_ref>
<file_ref>



Your advice is appreciated.

Dave
ID: 885127 · Report as offensive
Previous · 1 . . . 6 · 7 · 8 · 9 · 10 · 11 · 12 . . . 15 · Next

Message boards : Number crunching : V10 of modified SETI MB CUDA + opt AP package for full multi-GPU+CPU use


 
©2025 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.