Message boards :
Number crunching :
No more guppi's=vlars on the gpu please
Message board moderation
Previous · 1 · 2 · 3 · 4 · 5 · Next
Author | Message |
---|---|
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13745 Credit: 208,696,464 RAC: 304 |
Any speed advantage gain of SOG is futile, if a whole core has to support it. Is it? On my GTX 750Tis it was worth allocating 4 CPU cores to my 2 GTX 750Tis running 2 WUs at a time as the improved GPU performance offset the loss of CPU output. Maybe it's the same for S0G, maybe it's not. Only one way to find out- try it. With sleep, without sleep, settings tweaked, settings not tweaked, cores reserved, cores not reserved. Grant Darwin NT |
Rasputin42 Send message Joined: 25 Jul 08 Posts: 412 Credit: 5,834,661 RAC: 0 |
Compared to 4 CUDA tasks + 4 cpu tasks(in your case) ? |
Stephen "Heretic" Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628 |
But it seems strange to me, that the one application behaves so contrarily in dealing with the two different types of WU. With nonVLAR it combines them and fully utilises the GPU, but with Guppis it does almost the opposite. . . I have a screen shot of present performance and I will happily rerun the triple Guppis trials to get some for that condition. But I need to be informed on how to paste them into this message base. I have tried twice before but failed both times. . . Likewise I am ignorant on how to paste links in here. |
Raistmer Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
Is there a setting that I can tweak to persuade the Guppi WU's to truly run concurrently and behave as the nonVLAR WUs do? From tests I saw so far (mostly for AMD actually, much bigger NV community as whole seems more stronger in whine skill than in precise benchmarking and results sharing and I have no compatible NV hardware at all (!) :/ ) -sbs 512 gives little to no additional advantage over -sbs 256. But decrease number of iterations from 50 to 10 for example will give roughly 5-times bigger kernel launch that could keep GPU busy while app's process sleeping. |
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13745 Credit: 208,696,464 RAC: 304 |
Compared to 4 CUDA tasks + 4 cpu tasks(in your case) ? Running 2 CUDA tasks on 2 GPUs with the -poll option and 1 core reserved for each WU and 4 CPU WUs produces more work per hour than just running 2 WUs on 2 GPUs without the poll option & 8 CPU WUs. Not a lot, about an extra 0.5 WUs per hour. But it adds up. Grant Darwin NT |
Rasputin42 Send message Joined: 25 Jul 08 Posts: 412 Credit: 5,834,661 RAC: 0 |
-poll option What does that do? |
Raistmer Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
Compared to 4 CUDA tasks + 4 cpu tasks(in your case) ? 8 real cores or hyperthreaded? |
Raistmer Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
-poll option that changes CUDA runtime sync behavior to spin-wait on CPU as OpenCL runtime does |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14653 Credit: 200,643,578 RAC: 874 |
. . Likewise I am ignorant on how to paste links in here. Every time you post to this message board, there's a link http://setiathome.berkeley.edu/bbcode.php above and to the left of your text entry. If you click that, it opens in a new window/tab, so you don't lose your place. It involves putting tags in [square brackets] around your text - many of the common ones can be applied by using the buttons above the text entry area. |
Stephen "Heretic" Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628 |
. . Not being a programmer I am having trouble following some of that so let's sneak up on this thing one step at a time. First lets see what effect setting the -sbs value has. . . As I understand it after trying different -sbs values, I should try combining a larger -sbs N value such as 512 with a lower -period_iterations_num N value such 1. Is this while still running 3 simultaneous WUs? . . When it come to the effect of FFT size I am lost, so let's deal with that further down the track. |
Stephen "Heretic" Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628 |
Each major iteration in the MB application has seen an increase in the amount of calculation performed. With that an the increased complexity of the calculations required for the guppi data it is hardly surprising that these take longer to run. . . But I suspect they both take a lot longer to process than "normal" Arecibo WUs. I think that is the point. And CUDA, well .... |
Raistmer Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
yes, still multitasking.
to understand that one should read original processing algorithm perhaps. |
Stephen "Heretic" Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628 |
Here http://lunatics.kwsn.info/index.php/topic,1806.0.html will be pictures of v8 performance. For now one can refresh memories about how it was with older apps. . . Too much information on that graph my head is spinning :) . . But the best I can make out from it is that the relationship between VHARs, normal WUs and VLARs has been pretty constant over the different incarnations of BOINC, and across hardware platforms. |
Stephen "Heretic" Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628 |
Any speed advantage gain of SOG is futile, if a whole core has to support it. . . Are you a fortune teller? :) . . Have you installed 0.45 Beta yet? . . It definitely needs the use of the CPU cores. |
Mike Send message Joined: 17 Feb 01 Posts: 34258 Credit: 79,922,639 RAC: 80 |
Is there a setting that I can tweak to persuade the Guppi WU's to truly run concurrently and behave as the nonVLAR WUs do? -sbs 384 gives best result on my R9 380. With each crime and every kindness we birth our future. |
Stephen "Heretic" Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628 |
Is there a setting that I can tweak to persuade the Guppi WU's to truly run concurrently and behave as the nonVLAR WUs do? . . So first step then is -sbs 256 -period_iterations_num 1 |
Stephen "Heretic" Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628 |
. . Likewise I am ignorant on how to paste links in here. . . I have mastered the use of the common functions like quoting and bold text but when I use the "['img']" option and try to paste an image in there I get nothing. And where do I find the URL for my Host details? . . The only URL I see from this page is in the browser address window. . . Hold everything! http://setiathome.berkeley.edu/show_host_detail.php?hostid=8012534 . . Now did it work ? :) |
Stephen "Heretic" Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628 |
. . Yes :) |
Stephen "Heretic" Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628 |
. . OK I am learning .... http://setiathome.berkeley.edu/show_host_detail.php?hostid=8012534 . . Now I only have to work out how to insert a graphic/image |
Stephen "Heretic" Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628 |
. . I suspect I would not understand it, I am not a mathematics professor :) |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.