New AstroPulse for GPU ( ATi & NV) released (r1316) |
![]() |
| log in |
Message boards : Number crunching : New AstroPulse for GPU ( ATi & NV) released (r1316)
Previous · 1 · 2 · 3 · 4 · 5 . . . 11 · Next
| Author | Message |
|---|---|
As menioned in release notes, yes, you can, listed params work as before. | |
| ID: 1256931 · | |
Completed my first Task. It ran very fast - was on track for about an hour running with 2 MB 6.10 tasks at the same time, but exited early at the 35 minute mark with "Found 30 single pulses and 30 repeating pulses, exiting." Please keep looking at outcome of this task. 30/30 exits, especially if they come first, are always suspicious it could be just noisy task, or some app failure. | |
| ID: 1256932 · | |
Completed my first Task. It ran very fast - was on track for about an hour running with 2 MB 6.10 tasks at the same time, but exited early at the 35 minute mark with "Found 30 single pulses and 30 repeating pulses, exiting." Will do that. I started another task and will advise if anything is unusual. Glad we're starting to get some AP work. Thanks for the responses. ____________ Another Fred Support SETI@home when you search the Web or shop online with GoodSearch and GoodShop | |
| ID: 1256935 · | |
Please keep looking at outcome of this task. 30/30 exits, especially if they come first, are always suspicious it could be just noisy task, or some app failure. That first task, workunit 10249935008 has gone inconclusive. Another 30/30 exit, workunit 1024964421 has also gone inconclusive. Both are against the stock cpu app. Edit - both found 30/30 end edit. I've had 2 other 30/30 results, one of those validated against a stock cpu app. All 4 of the 30/30 exits came from ap_01jl10_aa_B5, so maybe its just a noisy file. Anyone else seeing this? Have 4 results from 01jl11 that ran full length and are pending validation. ____________ Another Fred Support SETI@home when you search the Web or shop online with GoodSearch and GoodShop | |
| ID: 1257512 · | |
|
So long your wingman returns a 30/30 also your result should get validated at the end. | |
| ID: 1257527 · | |
|
Mike, | |
| ID: 1257621 · | |
|
Raistmer wrote: New AstroPulse binaries are available here: http://lunatics.kwsn.net/12-gpu-crunching/ap6-for-nv-ati-gpus-r1316-released.msg49053.html#msg49053 Raistmer, a big *Thank you!* to you for your time and hard work - and the same to the alpha and beta tester! - - - - - - - - - - What are the default settings (values) for/of the application if nothing is in the <cmdline> in the app_info.xml file for... -unroll -ffa_block -ffa_block_fetch ? - - - - - - - - - - How I could find/know which settings (& values) are the best for the own system for max. performance? Thanks * Best regards! :-) * Sutaru Tsureku, team seti.international founder. * Optimize your PC for higher RAC. * SETI@home needs your help. * ____________ >Das Deutsche Cafe. The German Cafe.< | |
| ID: 1257623 · | |
Mike, I left mine the same and it is working just fine on my GTX-560. <app_info> <app> <name>setiathome_enhanced</name> </app> <file_info> <name>AK_v8b2_win_x64_SSE3.exe</name> <executable/> </file_info> <app_version> <app_name>setiathome_enhanced</app_name> <version_num>603</version_num> <platform>windows_intelx86</platform> <file_ref> <file_name>AK_v8b2_win_x64_SSE3.exe</file_name> <main_program/> </file_ref> </app_version> <app_version> <app_name>setiathome_enhanced</app_name> <version_num>603</version_num> <platform>windows_x86_64</platform> <file_ref> <file_name>AK_v8b2_win_x64_SSE3.exe</file_name> <main_program/> </file_ref> </app_version> <app> <name>astropulse_v6</name> </app> <file_info> <name>AP6_win_x86_SSE2_OpenCL_NV_r1316.exe</name> <executable/> </file_info> <app_version> <app_name>astropulse_v6</app_name> <version_num>601</version_num> <avg_ncpus>0.04</avg_ncpus> <max_ncpus>0.2</max_ncpus> <platform>windows_intelx86</platform> <plan_class>cuda_fermi</plan_class> <cmdline>-instances_per_device 1 -unroll 10 -ffa_block 6144 -ffa_block_fetch 1536 -sbs 256</cmdline> <coproc> <type>CUDA</type> <count>0.51</count> </coproc> <file_ref> <file_name>AP6_win_x86_SSE2_OpenCL_NV_r1316.exe</file_name> <main_program/> </file_ref> </app_version> <app_version> <app_name>astropulse_v6</app_name> <version_num>601</version_num> <avg_ncpus>0.04</avg_ncpus> <max_ncpus>0.2</max_ncpus> <platform>windows_x86_64</platform> <plan_class>cuda_fermi</plan_class> <cmdline>-instances_per_device 1 -unroll 10 -ffa_block 6144 -ffa_block_fetch 1536 -sbs 256</cmdline> <coproc> <type>CUDA</type> <count>0.51</count> </coproc> <file_ref> <file_name>AP6_win_x86_SSE2_OpenCL_NV_r1316.exe</file_name> <main_program/> </file_ref> </app_version> <app> <name>setiathome_enhanced</name> </app> <file_info> <name>Lunatics_x41y_win32_cuda50.exe</name> <executable/> </file_info> <file_info> <name>cudart32_50_7.dll</name> <executable/> </file_info> <file_info> <name>cufft32_50_7.dll</name> <executable/> </file_info> <app_version> <app_name>setiathome_enhanced</app_name> <version_num>610</version_num> <platform>windows_intelx86</platform> <plan_class>cuda_fermi</plan_class> <avg_ncpus>0.040000</avg_ncpus> <max_ncpus>0.040000</max_ncpus> <coproc> <type>CUDA</type> <count>0.49</count> </coproc> <file_ref> <file_name>Lunatics_x41y_win32_cuda50.exe</file_name> <main_program/> </file_ref> <file_ref> <file_name>cudart32_50_7.dll</file_name> </file_ref> <file_ref> <file_name>cufft32_50_7.dll</file_name> </file_ref> </app_version> <app_version> <app_name>setiathome_enhanced</app_name> <version_num>609</version_num> <platform>windows_intelx86</platform> <plan_class>cuda23</plan_class> <avg_ncpus>0.040000</avg_ncpus> <max_ncpus>0.040000</max_ncpus> <coproc> <type>CUDA</type> <count>0.49</count> </coproc> <file_ref> <file_name>Lunatics_x41y_win32_cuda50.exe</file_name> <main_program/> </file_ref> <file_ref> <file_name>cudart32_50_7.dll</file_name> </file_ref> <file_ref> <file_name>cufft32_50_7.dll</file_name> </file_ref> </app_version> <app_version> <app_name>setiathome_enhanced</app_name> <version_num>608</version_num> <platform>windows_intelx86</platform> <plan_class>cuda</plan_class> <avg_ncpus>0.040000</avg_ncpus> <max_ncpus>0.040000</max_ncpus> <coproc> <type>CUDA</type> <count>0.49</count> </coproc> <file_ref> <file_name>Lunatics_x41y_win32_cuda50.exe</file_name> <main_program/> </file_ref> <file_ref> <file_name>cudart32_50_7.dll</file_name> </file_ref> <file_ref> <file_name>cufft32_50_7.dll</file_name> </file_ref> </app_version> <app_version> <app_name>setiathome_enhanced</app_name> <version_num>610</version_num> <platform>windows_x86_64</platform> <plan_class>cuda_fermi</plan_class> <avg_ncpus>0.040000</avg_ncpus> <max_ncpus>0.040000</max_ncpus> <coproc> <type>CUDA</type> <count>0.49</count> </coproc> <file_ref> <file_name>Lunatics_x41y_win32_cuda50.exe</file_name> <main_program/> </file_ref> <file_ref> <file_name>cudart32_50_7.dll</file_name> </file_ref> <file_ref> <file_name>cufft32_50_7.dll</file_name> </file_ref> </app_version> <app_version> <app_name>setiathome_enhanced</app_name> <version_num>609</version_num> <platform>windows_x86_64</platform> <plan_class>cuda23</plan_class> <avg_ncpus>0.040000</avg_ncpus> <max_ncpus>0.040000</max_ncpus> <coproc> <type>CUDA</type> <count>0.49</count> </coproc> <file_ref> <file_name>Lunatics_x41y_win32_cuda50.exe</file_name> <main_program/> </file_ref> <file_ref> <file_name>cudart32_50_7.dll</file_name> </file_ref> <file_ref> <file_name>cufft32_50_7.dll</file_name> </file_ref> </app_version> <app_version> <app_name>setiathome_enhanced</app_name> <version_num>608</version_num> <platform>windows_x86_64</platform> <plan_class>cuda</plan_class> <avg_ncpus>0.040000</avg_ncpus> <max_ncpus>0.040000</max_ncpus> <coproc> <type>CUDA</type> <count>0.49</count> </coproc> <file_ref> <file_name>Lunatics_x41y_win32_cuda50.exe</file_name> <main_program/> </file_ref> <file_ref> <file_name>cudart32_50_7.dll</file_name> </file_ref> <file_ref> <file_name>cufft32_50_7.dll</file_name> </file_ref> </app_version> </app_info> ____________ | |
| ID: 1257630 · | |
|
How does this look it does not mention (it would be better to include libfftw3f-3.dll in app_info) and I changed <max_ncpus>0.2</max_ncpus> to <max_ncpus>0.04</max_ncpus> and
<app>
<name>astropulse_v6</name>
</app>
<file_info>
<name>AP6_win_x86_SSE2_OpenCL_NV_r1316.exe</name>
<executable/>
</file_info>
<app_version>
<app_name>astropulse_v6</app_name>
<version_num>601</version_num>
<avg_ncpus>0.04</avg_ncpus>
<max_ncpus>0.04</max_ncpus>
<platform>windows_intelx86</platform>
<plan_class>cuda_fermi</plan_class>
<cmdline>-instances_per_device 1 -unroll 10 -ffa_block 6144 -ffa_block_fetch 1536 -sbs 256</cmdline>
<coproc>
<type>CUDA</type>
<count>0.5</count>
</coproc>
<file_ref>
<file_name>AP6_win_x86_SSE2_OpenCL_NV_r1316.exe</file_name>
<main_program/>
</file_ref>
</app_version>
<app_version>
<app_name>astropulse_v6</app_name>
<version_num>601</version_num>
<avg_ncpus>0.04</avg_ncpus>
<max_ncpus>0.04</max_ncpus>
<platform>windows_x86_64</platform>
<plan_class>cuda_fermi</plan_class>
<cmdline>-instances_per_device 1 -unroll 10 -ffa_block 6144 -ffa_block_fetch 1536 -sbs 256</cmdline>
<coproc>
<type>CUDA</type>
<count>0.5</count>
</coproc>
<file_ref>
<file_name>AP6_win_x86_SSE2_OpenCL_NV_r1316.exe</file_name>
<main_program/>
</file_ref>
</app_version>
____________ | |
| ID: 1257636 · | |
|
Okay, changed mine too and left it at 601. Also noticed that the old 560 was sse3 and the 1316 is sse2. Changed that also. Left everything else (unroll, ffa, ect) the same as I had it. Suspended a bunch of 603s to let a 601 start. Seemed to be running okay so I opened up the 603s again. It will be awhile before it restarts the 601 but it should be alright. I also changed my instances per to 2 and all my counts back to .5. My little GTS 450 should be able to handle it. | |
| ID: 1257637 · | |
It's not nessasary to ask for 6.04 though it's real version of that binary. Also, you need to remove </copy> flag from app_info regarding to CL file. Or even better - remove mention of CL file at all. | |
| ID: 1257659 · | |
unroll=2, ffa_block=1024, ffa_block_fetch=512 are defaults. Usual method can be used - to try and to see. There are some heuristics though I and others used before. They still work for this new build. Can't mention all of those here but understanding what these params change should give insights how to use them. So: -unroll - changes number of data chnks processed at once in main loop. To loow number will leave some of GPU's compute units idle, too big can cause issues with stride of memory access (depends of GPU memory controller and number of channels it uses). -ffa_block - how many different periods calculated on GPU at once. Each such period corresponds to OpenCL workitem. So again, if this value too low this can result in part of GPU being idle. Too big values have other issue - more and more unneeded work will be done cause data arrays for different periods have different sizes but GPU will process rectangular area corresponding to longest of arrays in block. Also, r1316 includes FFA pre-compute. It means that whole block will be redone if GPU found some signals inside (signals are rare so in most cases this result in faster operation than checking if there was signal or not after each small step). Too big block will reduce performance in case of signal (and usually each task has at least 2-3 signals). | |
| ID: 1257663 · | |
|
</copy> flag? I don't see it. Are you talking to me? Only OpenCL reference I see is in the name. | |
| ID: 1257670 · | |
|
None of the app_info mention libfftw3f-3.dll in app_info | |
| ID: 1257672 · | |
None of the app_info mention libfftw3f-3.dll in app_info Because we are using it for long time already. For those running these apps for the fisrt time it avoids that boinc deletes it. ____________ | |
| ID: 1257685 · | |
|
So I added llibfftw3f-3.dll 3 times. Is this OK also I added the default <cmdline>unroll=2, ffa_block=1024, ffa_block_fetch=512</cmdline> what else I need to add?
<app>
<name>astropulse_v6</name>
</app>
<file_info>
<name>AP6_win_x86_SSE2_OpenCL_NV_r1316.exe</name>
<executable/>
</file_info>
<file_info>
<name>llibfftw3f-3.dll</name>
<executable/>
</file_info>
<app_version>
<app_name>astropulse_v6</app_name>
<version_num>601</version_num>
<avg_ncpus>0.04</avg_ncpus>
<max_ncpus>0.04</max_ncpus>
<platform>windows_intelx86</platform>
<plan_class>cuda_fermi</plan_class>
<cmdline>unroll=2, ffa_block=1024, ffa_block_fetch=512</cmdline>
<coproc>
<type>CUDA</type>
<count>0.5</count>
</coproc>
<file_ref>
<file_name>AP6_win_x86_SSE2_OpenCL_NV_r1316.exe</file_name>
<main_program/>
</file_ref>
<file_info>
<name>llibfftw3f-3.dll</name>
<executable/>
</file_info>
</app_version>
<app_version>
<app_name>astropulse_v6</app_name>
<version_num>601</version_num>
<avg_ncpus>0.04</avg_ncpus>
<max_ncpus>0.04</max_ncpus>
<platform>windows_x86_64</platform>
<plan_class>cuda_fermi</plan_class>
<cmdline>-instances_per_device 1 -unroll 10 -ffa_block 6144 -ffa_block_fetch 1536 -sbs 256</cmdline>
<coproc>
<type>CUDA</type>
<count>0.5</count>
</coproc>
<file_ref>
<file_name>AP6_win_x86_SSE2_OpenCL_NV_r1316.exe</file_name>
<main_program/>
</file_ref>
<file_info>
<name>llibfftw3f-3.dll</name>
<executable/>
</file_info>
</app_version>
____________ | |
| ID: 1257705 · | |
|
Paul, the dll file goes in the SETI at Home folder. It doesn't need to be mentioned in the app_info. | |
| ID: 1257707 · | |
Paul, the dll file goes in the SETI at Home folder. It doesn't need to be mentioned in the app_info. Wrong Perryjay. Regarding what to mention in app_info file: for stock users upgrading to this release it would be better to include libfftw3f-3.dll in app_info. ____________ | |
| ID: 1257710 · | |
|
Oops, sorry, I missed that since I was coming from the old 560. | |
| ID: 1257714 · | |
</copy> flag? I don't see it. Are you talking to me? Only OpenCL reference I see is in the name. Then you already removed it before, no additional changes required. | |
| ID: 1257715 · | |
Message boards : Number crunching : New AstroPulse for GPU ( ATi & NV) released (r1316)
| Copyright © 2013 University of California |