CUDA Versions
Author | Message |
---|---|
Mike Send message Joined: 17 Feb 01 Posts: 34258 Credit: 79,922,639 RAC: 80 |
Which host are you talking about? With each crime and every kindness we birth our future. |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14650 Credit: 200,643,578 RAC: 874 |
"Additionally, I'm unsure how the command switch construct above as inserted in [...]" Neither is necessary. Simply place the command line text you require into the supplied (empty) .txt file, and the application will read and act on it when the next task starts running in the normal course of events. If you are running multiple copies of the application, each will read its own copy of the parameters as each starts its own next task, so don't draw any conclusions from performance measurements until *every* task instance has completed at least one task starting with the new settings. |
juan BFP Send message Joined: 16 Mar 07 Posts: 9786 Credit: 572,710,851 RAC: 3,799 |
"Well, have taken several actions. Moved nv driver to 337.88 (working with drivers always has to be a hassle), now working on Lunatics 0.42, and have dropped the number of wu's per GPU to 4 (vs 5). Per your advice to others, been reading up on parameters as described in ReadMe_AstroPulse_OpenCL_NV.txt and decided, again per your advice, to use your set [...]" You just edit the file (with Notepad only) with a simple copy & paste and close it; each time a new AP WU starts, the file is read automatically and the command line is applied. To be sure, I prefer to restart BOINC itself, but that is not really necessary. Today I followed one of Mike's suggestions and changed the parameters on my 780 hosts a little, to: -use_sleep -unroll 12 -ffa_block 16384 -ffa_block_fetch 8192 -tune 1 64 4 1 So far everything is working fine and I got a small gain in crunching times & video performance; I imagine you will see similar improvements on your 780Ti. An explanation: I don't use the -hp switch since my host is not a dedicated cruncher and I use it for other tasks that don't allow me to run BOINC at high priority, but you could add it at the end of the command line and gain a little more. "Finally, I had been advised elsewhere to use EVGA PrecisionX 15 to monitor GPU performance but it has been withdrawn due to some plagiarism issues." Yes, you need a program to increase your GPU fan speed in order to keep the GPU cooler, or its thermal protection will slow down the GPU clock. Most of us use EVGA Precision to do that; if you can't find it, PM me your e-mail (never put your e-mail on the open forums) and I will send it to you. There is another similar program on the MSI site called Afterburner, which works in a similar way. "How do you measure GPU and CPU performance changes?" You could use EVGA Precision to do that, or if you wish you could download a program called GPU-Z; it's free too and allows you to monitor all the characteristics and sensors of practically all available GPUs (not just NV). This is the link: http://www.techpowerup.com/gpuz/ "I assume that wu feeds to CPU must be turned off in order to see changes from the -use_sleep command otherwise the CPU will be running full bore (100%) executing wu's. Alternatively, I suppose I could wait for an average of CPU times once the -use_sleep command is turned on." Don't worry about that for now; first be sure the switch (and the command line) is working. To see that, go to the Stderr output of a crunched AP WU; at the beginning of the file you will see something like this (it depends only on the parameters you use): <core_client_version>7.2.42</core_client_version> <![CDATA[ <stderr_txt> Running on device number: 0 Sleep() & wait for event loops will be used in some places DATA_CHUNK_UNROLL set to:12 FFA thread block override value:16384 FFA thread fetchblock override value:8192 TUNE: kernel 1 now has workgroup size of (64,4,1) If that doesn't appear, your command line is not working. For now, check that everything is working; when you are sure, we will continue to the next phase: finding the optimal number of WUs crunched at a time for your host. Will wait for your return. <edit> Mike, he is talking about his 2x780Ti host. |
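The stderr check described above can be automated. The following is a minimal sketch, assuming the confirmation lines look exactly like the ones quoted in the post; the mapping `EXPECTED_MARKERS` and the function `missing_switches` are hypothetical names introduced here for illustration, not part of BOINC or the AstroPulse application.

```python
# Hypothetical mapping: each switch -> the stderr fragment quoted in the post above.
# Other application versions may print different text (assumption).
EXPECTED_MARKERS = {
    "-use_sleep": "Sleep() & wait for event loops will be used",
    "-unroll": "DATA_CHUNK_UNROLL set to:",
    "-ffa_block": "FFA thread block override value:",
    "-ffa_block_fetch": "FFA thread fetchblock override value:",
    "-tune": "TUNE: kernel",
}


def missing_switches(stderr_text, switches):
    """Return the switches whose confirmation line is absent from stderr_text."""
    return [s for s in switches if EXPECTED_MARKERS[s] not in stderr_text]


# Sample text copied from the stderr excerpt quoted in the post.
sample = (
    "Running on device number: 0\n"
    "Sleep() & wait for event loops will be used in some places\n"
    "DATA_CHUNK_UNROLL set to:12\n"
    "FFA thread block override value:16384\n"
    "FFA thread fetchblock override value:8192\n"
    "TUNE: kernel 1 now has workgroup size of (64,4,1)\n"
)

print(missing_switches(sample, list(EXPECTED_MARKERS)))
```

An empty list means every switch left its trace in the task's stderr; any switch that is returned is one whose line did not appear, which per the post suggests the command line was not picked up.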
Zalster Send message Joined: 27 May 99 Posts: 5517 Credit: 528,817,460 RAC: 242 |
The latest version of Precision X was having problems engaging the auto regulation of the fans. I pulled it off my machines after temps were spiking and the fans weren't speeding up. Fortunately I had an old installer on one computer and reinstalled it on all of mine. |
qbit Send message Joined: 19 Sep 04 Posts: 630 Credit: 6,868,528 RAC: 0 |
Can anybody pls explain to me what the use sleep option does? I thought I'd give it a try today and it seems to decrease the crunching speed on my system. APs seem to take about 15% longer while the GPU temperature is around 5 degrees lower. My CPU is rather old and slow, so maybe this option is more useful on faster CPUs? |
Mike Send message Joined: 17 Feb 01 Posts: 34258 Credit: 79,922,639 RAC: 80 |
"Can anybody pls explain to me what the use sleep option does? I thought I'd give it a try today and it seems to decrease the crunching speed on my system. APs seem to take about 15% longer while the GPU temperature is around 5 degrees lower." Exactly vice versa. Use_sleep reduces CPU consumption for Nvidia GPUs. It just needs to be set up correctly. I can see you only use ffa_fetch without ffa_fetch_block. Also, using -tune speeds things up. Try -unroll 6 -ffa_block 6144 -ffa_block_fetch 1536 -tune 1 64 4 1 -use_sleep. With each crime and every kindness we birth our future. |
qbit Send message Joined: 19 Sep 04 Posts: 630 Credit: 6,868,528 RAC: 0 |
I will try this, thx Mike! |
FalconFly Send message Joined: 5 Oct 99 Posts: 394 Credit: 18,053,892 RAC: 0 |
Hmm, I stumbled over a nasty surprise this morning: on this host I found that two AP workunits had got stuck overnight and had been wasting GPU time on the GTX750ti for many hours without making any further progress. Is that a known contingency requiring frequent monitoring/caretaking? Upon quitting and restarting BOINC, they finished normally and with expected performance. Parameters for the Lun v0.42 app I'm using on that host: -unroll 12 -ffa_block 8192 -ffa_block_fetch -hp -tune 1 64 4 1 -use_sleep 2 AP tasks were running in parallel, one using >40% CPU and one using only 0.5% CPU, but both definitely stuck. Anything I could do to prevent that in the future? The GPUs are very well cooled and hardly exceed 50 deg C. For now, I've set the Windows power management to the high performance profile (it basically already was; I ran on the balanced profile with every power-saving feature turned off, with the exception of the CPU being allowed to clock down if able to). |
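As quoted, the parameter line above shows -ffa_block_fetch with no value after it; whether that is a transcription slip or what the file really contains is not known from the thread. A small sanity check can catch that kind of slip before a task starts. This is only a sketch: the value counts in `SWITCH_ARITY` are inferred from the examples posted in this thread (an assumption, not taken from the application's ReadMe in full or its source), and `check_cmdline` is a hypothetical helper.

```python
# Number of values each switch takes, inferred only from examples in this thread
# (assumption; consult the application's ReadMe for the authoritative list).
SWITCH_ARITY = {
    "-unroll": 1,
    "-ffa_block": 1,
    "-ffa_block_fetch": 1,
    "-tune": 4,
    "-initial_ffa_sleep": 2,
    "-use_sleep": 0,
    "-hp": 0,
    "-cpu_lock": 0,
}


def check_cmdline(line):
    """Return a list of human-readable problems found in a command line string."""
    tokens = line.split()
    problems = []
    i = 0
    while i < len(tokens):
        tok = tokens[i]
        if tok not in SWITCH_ARITY:
            problems.append(f"unexpected token: {tok}")
            i += 1
            continue
        arity = SWITCH_ARITY[tok]
        values = tokens[i + 1 : i + 1 + arity]
        # A value is missing if we ran out of tokens or hit the next switch early.
        if len(values) < arity or any(v.startswith("-") for v in values):
            problems.append(f"{tok} expects {arity} value(s)")
            i += 1
            continue
        i += 1 + arity
    return problems


print(check_cmdline("-unroll 12 -ffa_block 8192 -ffa_block_fetch -hp -tune 1 64 4 1 -use_sleep"))
print(check_cmdline("-unroll 6 -ffa_block 6144 -ffa_block_fetch 1536 -tune 1 64 4 1 -use_sleep"))
```

The first call reports the switch that lacks its value; the second, using the line Mike suggested earlier, reports nothing. The check treats any token starting with "-" as a switch, so it would not suit options that accept negative numbers.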
Mike Send message Joined: 17 Feb 01 Posts: 34258 Credit: 79,922,639 RAC: 80 |
"Hmm, I stumbled over a nasty surprise this morning : [...]" That's the point. An AMD CPU should never downclock whilst running AstroPulse tasks on the GPU. With each crime and every kindness we birth our future. |
Bill Greene Send message Joined: 3 Jul 99 Posts: 80 Credit: 116,047,529 RAC: 61 |
"Additionally, I'm unsure how the command switch construct above as inserted in [...]" That shortens the change process somewhat, which makes for more efficient experimentation. Yes, it seems each change deserves an overnight average, given the variability in results I've noted. Fortunately, and thus far, I've been able to keep to a single instance on each machine, but I have some oddball GPUs lying around that I would like to put to work. Your input will be especially useful for that purpose. Thanks for the response. |
Bill Greene Send message Joined: 3 Jul 99 Posts: 80 Credit: 116,047,529 RAC: 61 |
Juan - just in case you did not get my reply, Precision executable received and now operating on all nv machines. It revealed the reason that one of the 480s in the dual 480 machine was throttling back: one of the GPUs was hot ... over 100 C. Must get some additional cooling on it somehow. I have also updated most of my machines to Lun. 0.42 along with updates to 337.88 on nv machines. With minor changes where appropriate, I've installed the command line switches you provided, including -use_sleep, on all nv equipped machines. With your help, I feel like I've come a long way in just a few days and am now looking forward to evaluating effects. I no longer see display stalls or driver failures on the 780 machine and see on Precision that GPU1 is running at about 88% (4 wu's). Not sure how to show GPU2. Always open to more suggestions, but I have learned a lot and my confidence is up that I'm not going to trash my system with these changes. Thanks again for your perseverance in getting me up to speed. |
Bill Greene Send message Joined: 3 Jul 99 Posts: 80 Credit: 116,047,529 RAC: 61 |
Anyone know of a way to put on-board GPUs to work when there are also bus-installed GPUs? Eligible on-board GPUs are recognized and it seems a waste that they are sitting there idle. |
Zalster Send message Joined: 27 May 99 Posts: 5517 Credit: 528,817,460 RAC: 242 |
At the bottom of Precision X you will see the Precision log with 3 displays. If you double click anywhere on it, it will open up a new window with multiple stats to view. You can run the built-in GPU on your chip, but from what I've heard it's not worth it if you have GPUs on PCIe. Using it tends to slow down the dedicated GPUs, from what I've read. Most people who have dedicated GPUs don't tend to use it. If you search the threads you will find what I am talking about. If that's the only GPU you have, then it's fine to use it; otherwise I'd suggest you concentrate on the dedicated GPUs. My 2 cents. Zalster |
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13736 Credit: 208,696,464 RAC: 304 |
"one of the GPUs was hot ... over 100 C. Must get some additional cooling on it somehow." At that temperature I'd say it's not getting any cooling. It's probably like my old GTX 560Ti, whose fans ended up seizing up. Sleeve bearings are rubbish. Grant Darwin NT |
juan BFP Send message Joined: 16 Mar 07 Posts: 9786 Credit: 572,710,851 RAC: 3,799 |
"Anyone know of a way to put on-board GPUs to work when there are also bus-installed GPUs? Eligible on-board GPUs are recognized and it seems a waste that they are sitting there idle." Hi. Sorry for the delay, I was sleeping; you know, time zones and shifts are our enemy. About Precision, I got the msg, and I'm happy I was able to give you a hand with that. Just one tip: be sure to enable its automatic fan control and try to run your GPU fans in the range of 80%. That will make the GPUs last a long time. Each GPU has its own max temp, but 100C is not a safe temp for sure; I set my max temp at 75C in the case of the 780, for example. I have never tried to use iGPUs, but I imagine it is not a good idea. Why? iGPUs normally make the CPU produce a lot of heat, and a lot of heat is bad: it could throttle your CPU clock or force your CPU fan to the limit. IMHO the troubles are bigger than the possible gain. Your 780Ti has a much larger crunching capacity than the iGPU, so focus on optimizing it and leave the iGPU aside, at least for now. I looked at your already crunched AP WUs (on the 780Ti host at least) and it seems your configuration is OK and -use_sleep is working fine; look at the difference in the CPU times. I believe you are ready for the next phase: finding the best WU/time. Could we continue? |
Raistmer Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
"Can anybody pls explain to me what the use sleep option does?" [Translated from Russian] After queuing several GPU kernel calls, the application calls Sleep(), handing the CPU back to the system for other needs. -use_sleep is the simplest way to free up the processor a little. For those who like to experiment, I have added other options. From the ReadMe: -initial_ffa_sleep N M: In PC-FFA will sleep N ms for short and M ms for large one before looking for results. Can decrease CPU usage. These options let you tune the length of time for which the program gives up the CPU. That time should be roughly equal to the time the GPU needs to finish its work. Then freeing the CPU will not lead to a significant drop in overall performance. This is especially relevant for the fastest devices (where the GPU kernels need to be made larger by means of additional options) and for the slowest ones, where the GPU cannot finish processing within the time interval set by -use_sleep. P.S. The options under discussion apply to AstroPulse, which is written in OpenCL, not CUDA. So the thread title may be somewhat confusing to readers in this respect. |
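The advice that the sleep interval should roughly match the time the GPU needs can be illustrated with a toy model. This is a sketch under stated assumptions, not the application's actual wait loop: it assumes the host sleeps in fixed steps and only looks for results after each step, and the millisecond figures are made up for illustration.

```python
import math


def sleep_wait(gpu_ms, sleep_ms):
    """Toy model of a host that sleeps in fixed steps until the GPU work is done.

    Returns (elapsed_ms, wakeups). Assumes results are only checked after each
    full sleep step; the real application may behave differently.
    """
    wakeups = max(1, math.ceil(gpu_ms / sleep_ms))
    return wakeups * sleep_ms, wakeups


# Hypothetical kernel durations and sleep steps, in milliseconds.
for gpu_ms, sleep_ms in [(0.4, 1.0), (5.0, 1.0), (5.0, 5.0)]:
    elapsed, wakeups = sleep_wait(gpu_ms, sleep_ms)
    print(f"GPU {gpu_ms} ms, sleep {sleep_ms} ms: waited {elapsed} ms, {wakeups} wakeup(s)")
```

In this model a kernel much shorter than the sleep step leaves the GPU idle for the rest of the step, which may be one reason a task can run slower with -use_sleep; a kernel much longer than the step costs extra wakeups; a step close to the kernel time costs neither.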
Raistmer Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
[Translated from Russian] Another way to avoid leaving the CPU under-loaded is to use the -cpu_lock option. For ATi, this option allows other applications to load the CPU to 100% while causing only a small loss of GPU performance (at least in some configurations). For nVidia (as far as I know) this option has not been fully tested. Nevertheless, it is available in all OpenCL AstroPulse builds, including the iGPU one. For fast devices it should be supplemented with an indication of how many instances of the program run simultaneously (so that the processes are distributed correctly across the logical processors available in the system). From the ReadMe: -cpu_lock : Enables CPUlock feature. Results in CPUs number limitation for particular app instance. Also attempt to bind different instances to different CPU cores will be made. |
qbit Send message Joined: 19 Sep 04 Posts: 630 Credit: 6,868,528 RAC: 0 |
"Can anybody pls explain to me what the use sleep option does? I thought I'd give it a try today and it seems to decrease the crunching speed on my system. APs seem to take about 15% longer while the GPU temperature is around 5 degrees lower." OK, I gave those settings a try, but crunching is still slower with the "use sleep" option than without. Here's an example: This is with "use sleep": http://setiathome.berkeley.edu/result.php?resultid=3678613272 This is without "use sleep": http://setiathome.berkeley.edu/result.php?resultid=3678619850 But, as I said, the computer runs quieter and cooler when using the "use sleep" option. GPU temperature is around 57 degrees; without the option it is around 62 degrees. "Can anybody pls explain to me what the use sleep option does?" Thx Raistmer, but I'm afraid I don't understand this language. I tried to translate with Google and it looks like you recommend that I test those other options instead of "use sleep"? |
Raistmer Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
"Thx Raistmer, but I'm afraid I don't understand this language. I tried to translate with Google and it looks like you recommend that I test those other options instead of "use sleep"?" "This language" is Russian. It is worth at least getting acquainted with, if not learning. And Sheldon could ask Wolowitz - he knows this language well ;) :D Yep, those options allow finer tuning of the sleep time. |
qbit Send message Joined: 19 Sep 04 Posts: 630 Credit: 6,868,528 RAC: 0 |
Ah, ok. I'm sure Russian is a nice language but I'm also sure it's rather hard to learn. I can't even read the characters. And Howard is busy with Bernadette once again ;-) Could you recommend a command line with those options for my card? I'm new to this whole CUDA/OpenCL thing so it's not easy for me to figure out the best values on my own right now. |