Message boards :
Number crunching :
OpenCL NV MultiBeam v8 SoG edition for Windows
Message board moderation
Author | Message |
---|---|
Raistmer Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
Here http://lunatics.kwsn.info/index.php?action=downloads;sa=view;down=487 new MultiBeam build for nVidia GPUs available as standalone pack. Feel free to use it and post comments here. |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14672 Credit: 200,643,578 RAC: 874 |
Please note that this pack uses values which do not conform to the likely pattern of future stock deployment: <version_num>802</version_num> <plan_class>cuda</plan_class> At Beta, Eric used "8.07 (opencl_nvidia_SoG)" At Main, I would expect the version number to be 800, and the plan class to be the same as Beta. If you deploy the pack in its current form, be ready to make adjustments when the next installer comes out. |
Raistmer Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
Please note that this pack uses values which do not conform to the likely pattern of future stock deployment: It was in our inner RC more then week already to note this before release and not after it. Well, till next release. |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14672 Credit: 200,643,578 RAC: 874 |
This is what happens when the developer makes private releases without thinking about the context within which the developed application will be used. |
Raistmer Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
This is what happens when the developer makes private releases without thinking about the context within which the developed application will be used. Provided app_info snippet allows copy/paste w/o any change. But if I would receive adequate testing/feedback before release I could make it easier for next version of automatization tools like Lunatics installer to accommodate to it. Well, let it be the biggest issue of all :) |
Raistmer Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
Thanks, I'll update aistub in next commit. |
Zalster Send message Joined: 27 May 99 Posts: 5517 Credit: 528,817,460 RAC: 242 |
I look forward to it as well. 500 validated on 1 machine, 2400 validated on another on beta. Any timeline as to when these will be released on Main? (one can hope) |
Raistmer Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
Release as stock will require unattended deployment and stealth execution (versus "user knows what he's doing" on anonymous platform. So before such release issues with low-end GPUs should be solved (at least driver restarts). So, if you have low-end GPU and experience lags/driver restart with this build in its default config, and if you was able to find tuning that allow to solve this issues please post your command line here or in corresponding thread on beta: http://setiweb.ssl.berkeley.edu/beta/forum_thread.php?id=2288 |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14672 Credit: 200,643,578 RAC: 874 |
@ Tutankhamon, Thanks for posting that. We can't be certain that 800 / opencl_nvidia_SoG will be the ultimate deployment values - they depend on Eric, and how much water has flowed under the bridge before the scheduler issues have been resolved to his satisfaction, for official Berkeley release. For full public releases, we normally prefer to add the .cl file info and ref lines into app_info as well: in this case, they would be <file_info> <name>MultiBeam_Kernels_r3366.cl</name> </file_info> <file_ref> <file_name>MultiBeam_Kernels_r3366.cl</file_name> </file_ref> Before the generation of aistub flies was automated, that was one additional step to forget during early-stage testing, and so they were often omitted during pre-release testing. @ Raistmer, It's a good thing you put that disclaimer about 'experienced users only' and 'may cause driver restarts on low-end cards' into your initial release post. It's because I knew that you were still working with Eric to work round such deployment issues that I allowed myself a week's holiday from development matters after the installer release. |
Raistmer Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
'may cause driver restarts on low-end cards' into your initial release post. http://lunatics.kwsn.info/index.php?action=downloads;sa=view;down=487 347+ drivers required. |
Raistmer Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
Did you reconsider omitting this one? |
Raistmer Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
yes, cause the influence of this option is AR-dependent. |
Mike Send message Joined: 17 Feb 01 Posts: 34346 Credit: 79,922,639 RAC: 80 |
Also try -sbs 192 and 384. Should be faster on your GPU. With each crime and every kindness we birth our future. |
Zalster Send message Joined: 27 May 99 Posts: 5517 Credit: 528,817,460 RAC: 242 |
Tut are you getting work for that OpenCL SOG? |
Mike Send message Joined: 17 Feb 01 Posts: 34346 Credit: 79,922,639 RAC: 80 |
Tut are you getting work for that OpenCL SOG? Maybe you need to change plan claas to opencl_nvidia_sah until Eric has released plan class for SoG version. With each crime and every kindness we birth our future. |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14672 Credit: 200,643,578 RAC: 874 |
Tut are you getting work for that OpenCL SOG? Plan Class names used under Anonymous Platform don't have to match the plan classes used for stock distributions - I've made up plan classes including my initials and the word 'test' before now, and they worked just fine. But they should include the keyword for the type of scheduling anticipated - OpenCL in this case (for BOINC versions >= 7.0.40). All mine did, so I can't speak for what happens if you leave it out. It'll be in a (debug) log if you fall foul of something, and need to look it up. This is the other way round, but error messages might look something like this: 11/15/2012 8:53:52 AM | | App version needs opencl but GPU doesn't support it |
Raistmer Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
Very high CPU usage for WU's other than High AR's. Almost a full core, for AR's other than VHAR's where the CPU usage is 8-10% only. ATi OpenCL build handles VLAR quite easely. Worth to try with OpenCL NV also. That's the disadvantage of beta - subset of ARs, subset of devices... Pulses and Triplets still processed by old way - and synhing uses lot of CPU as before (again, NV-specific). |
Mike Send message Joined: 17 Feb 01 Posts: 34346 Credit: 79,922,639 RAC: 80 |
Very high CPU usage for WU's other than High AR's. Almost a full core, for AR's other than VHAR's where the CPU usage is 8-10% only. You can try _use_sleep or -use_sleep_ex 5 to reduce CPU usage. But i suggst to use this only running multiple instances. With each crime and every kindness we birth our future. |
Raistmer Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
because by using -use_sleep, this app will not be any faster than CUDA50. Would be interesting to check this BTW. Sleep() implemented mostly in PulseFind area. And VHAR has small amount of PulseFind so -use_sleep impact there would be quite small and CPU savings with midrange AR could be substantional. From other side, balancing overall host performance depends on GPU vs CPU work share. For fast GPUs most of host RAC should come from GPU part and CPU part could be neglectible. |
Mike Send message Joined: 17 Feb 01 Posts: 34346 Credit: 79,922,639 RAC: 80 |
Very high CPU usage for WU's other than High AR's. Almost a full core, for AR's other than VHAR's where the CPU usage is 8-10% only. Thats why i suggested -use_sleep_ex 5. Shouldn`t be much slower running 3 instances but reduces CPU usage at least a little bit. Running benches atm. With each crime and every kindness we birth our future. |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.