Modified SETI MB CUDA + opt AP package for full GPU utilization

Message boards : Number crunching : Modified SETI MB CUDA + opt AP package for full GPU utilization
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 . . . 25 · Next

AuthorMessage
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 843046 - Posted: 21 Dec 2008, 14:15:36 UTC

So, you both see different temperature regime between 3+1 and 4+1 modes while using my mod? Interesting, will check it on my host too.
ID: 843046 · Report as offensive
George
Volunteer tester

Send message
Joined: 14 Oct 08
Posts: 100
Credit: 435,680
RAC: 0
United States
Message 843049 - Posted: 21 Dec 2008, 14:23:12 UTC

after more testing i found what is the source of the temp dif there this line in the file app_info.xml
<avg_ncpus>0.040000</avg_ncpus>
<max_ncpus>0.040000</max_ncpus>
meaning the gpu feding at any one time can can use 4% cpu time and over all can only use 4% the cpus time so by changing this
<max_ncpus>0.040000</max_ncpus>
to this
<max_ncpus>0.080000</max_ncpus>
it allows the gpu feeding to bust the data when needed the same cpu time is used just all at once and not broken. this speeds up the gpu task's that is all that is need to be done
ID: 843049 · Report as offensive
Profile Byron S Goodgame
Volunteer tester
Avatar

Send message
Joined: 16 Jan 06
Posts: 1145
Credit: 3,936,993
RAC: 0
United States
Message 843052 - Posted: 21 Dec 2008, 14:29:50 UTC - in response to Message 843049.  

<max_ncpus>0.080000</max_ncpus>
it allows the gpu feeding to bust the data when needed the same cpu time is used just all at once and not broken. this speeds up the gpu task's that is all that is need to be done


Done,thanks for the tip.

Just in time too, now running AP and CUDA MB on the AMD, with AP switching with Einstein.

Going well so far.
ID: 843052 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 843053 - Posted: 21 Dec 2008, 14:30:33 UTC - in response to Message 843049.  
Last modified: 21 Dec 2008, 14:32:03 UTC

yes, this option defines CPU share but I thought it's only for BOINC scheduling mechanism, not for OS that really does process scheduling (grant CPU cycles to app).
Will do more tests.
For now I discover that at least on my card with big cooler temperature regime is NOT stationary.
Host doing shorty tasks now ~6'50" on each task.
When it finishes one task and starts another temp drops to 51C (44 in idle - for reference).
Then temp increases to 57C and variates in 1-2 C range while task is processed. Then next drop to 51-51C and so on.
So, speaking anout GPU temp lets speak about its peak temperature and ensure that it's peak indeed.

Will try other configs and app_info.xml changes now.
ID: 843053 · Report as offensive
Grey Shadow
Volunteer tester
Avatar

Send message
Joined: 26 Nov 08
Posts: 41
Credit: 139,654
RAC: 0
Russia
Message 843058 - Posted: 21 Dec 2008, 15:10:10 UTC

Just installed your mod and now enjoy 10x increase of speed of computing. Thanks a lot %)

But it looks like now I still don't use all cores at 100% efficacy.

PC has 2 cores + GPU (8800).
I have AP and MB units running together. Judging by Task manager AP creates about 45-50% of CPU workload and MB (crunched mostly at GPU) - about 5-10%. So total CPU workload is about 60%. Are there any ways to utilize 100% by parallel computing of one more WU?
ID: 843058 · Report as offensive
Profile Byron S Goodgame
Volunteer tester
Avatar

Send message
Joined: 16 Jan 06
Posts: 1145
Credit: 3,936,993
RAC: 0
United States
Message 843063 - Posted: 21 Dec 2008, 15:19:37 UTC - in response to Message 843058.  

Just installed your mod and now enjoy 10x increase of speed of computing. Thanks a lot %)

But it looks like now I still don't use all cores at 100% efficacy.

PC has 2 cores + GPU (8800).
I have AP and MB units running together. Judging by Task manager AP creates about 45-50% of CPU workload and MB (crunched mostly at GPU) - about 5-10%. So total CPU workload is about 60%. Are there any ways to utilize 100% by parallel computing of one more WU?

Try this
ID: 843063 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 843068 - Posted: 21 Dec 2008, 15:32:02 UTC - in response to Message 843058.  

Just installed your mod and now enjoy 10x increase of speed of computing. Thanks a lot %)

But it looks like now I still don't use all cores at 100% efficacy.

PC has 2 cores + GPU (8800).
I have AP and MB units running together. Judging by Task manager AP creates about 45-50% of CPU workload and MB (crunched mostly at GPU) - about 5-10%. So total CPU workload is about 60%. Are there any ways to utilize 100% by parallel computing of one more WU?


You need to read my first message completely.
Statement "7)"
7) For best CPU and GPU usage I recommend to set number of processors available for BOINC to real_number_of_cores+1. This will mitigate current BOINC bug with CPU+CUDA scheduling and will allow fully load CPU and GPU.
Did you make it?
ID: 843068 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 843069 - Posted: 21 Dec 2008, 15:34:02 UTC - in response to Message 843068.  

And about 3+1 config or app_info.xml changes. I tried both, but no GPU peak temp increase was detected. So it seems it doesn't work for my host.
But changeing app_info.xml could still be useful for another hosts.
On my host CUDA app CPU % is about 3% so it fits in original 4% constrains well.
ID: 843069 · Report as offensive
Grey Shadow
Volunteer tester
Avatar

Send message
Joined: 26 Nov 08
Posts: 41
Credit: 139,654
RAC: 0
Russia
Message 843070 - Posted: 21 Dec 2008, 15:37:24 UTC

2 Byron and Raistmer:
Yes, I set ncpus to 3 in cc_config.
ID: 843070 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 843073 - Posted: 21 Dec 2008, 15:58:27 UTC - in response to Message 843070.  

2 Byron and Raistmer:
Yes, I set ncpus to 3 in cc_config.

So now you should see 2 AP instead of 1 AP (each ~50% CPU) and CUDA MB (some small CPU % )
ID: 843073 · Report as offensive
Profile popandbob
Volunteer tester

Send message
Joined: 19 Mar 05
Posts: 551
Credit: 4,673,015
RAC: 0
Canada
Message 843100 - Posted: 21 Dec 2008, 17:34:15 UTC - in response to Message 843022.  

Ya once i get two 8600gts back from a rma i will have 4gpu's in this computer. Well that be two much for the cpus cores to handle. where as I should put them in a different computer.


The 8600gts's take about 1-3% of the cpu while a higher end card like a gtx 260 uses from 3-9%. So yes once core should be enough.

Also for FYI... I have not done any modifications and boinc is running 4+2.
It just takes it some time to realize its not as high priority as it thinks :)

~Bob


Do you Good Search for Seti@Home? http://www.goodsearch.com/?charityid=888957
Or Good Shop? http://www.goodshop.com/?charityid=888957
ID: 843100 · Report as offensive
Grey Shadow
Volunteer tester
Avatar

Send message
Joined: 26 Nov 08
Posts: 41
Credit: 139,654
RAC: 0
Russia
Message 843166 - Posted: 21 Dec 2008, 20:05:05 UTC
Last modified: 21 Dec 2008, 20:06:22 UTC

So now you should see 2 AP instead of 1 AP (each ~50% CPU) and CUDA MB (some small CPU % )

No, I still see only one AP unit and one cuda-MB. Also I see "Your video driver failed and was restarted successfully" each time when I try to open new window while computation is going - and this error leads to failure of computation of current MB unit.

Let's wait till this AP unit is completed... it was started before applications upgrade, maybe that is the reason...
ID: 843166 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 843197 - Posted: 21 Dec 2008, 20:32:26 UTC - in response to Message 843100.  

Ya once i get two 8600gts back from a rma i will have 4gpu's in this computer. Well that be two much for the cpus cores to handle. where as I should put them in a different computer.


The 8600gts's take about 1-3% of the cpu while a higher end card like a gtx 260 uses from 3-9%. So yes once core should be enough.

Also for FYI... I have not done any modifications and boinc is running 4+2.
It just takes it some time to realize its not as high priority as it thinks :)

~Bob

Yes, but you still priority mod to feed GPU fully in 4+2 config.
ID: 843197 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 843198 - Posted: 21 Dec 2008, 20:33:42 UTC - in response to Message 843166.  

So now you should see 2 AP instead of 1 AP (each ~50% CPU) and CUDA MB (some small CPU % )

No, I still see only one AP unit and one cuda-MB. Also I see "Your video driver failed and was restarted successfully" each time when I try to open new window while computation is going - and this error leads to failure of computation of current MB unit.

Let's wait till this AP unit is completed... it was started before applications upgrade, maybe that is the reason...


Strange... with ncpus==3 you should see 3 tasks running no matter CPU or GPU but at least 3 tasks (maybe even 4 - 3 CPU +1 GPU)....
ID: 843198 · Report as offensive
Profile popandbob
Volunteer tester

Send message
Joined: 19 Mar 05
Posts: 551
Credit: 4,673,015
RAC: 0
Canada
Message 843203 - Posted: 21 Dec 2008, 20:36:34 UTC - in response to Message 843197.  


Yes, but you still priority mod to feed GPU fully in 4+2 config.


Yes I am running priority mod but I was referring to no mods to Boinc for running 4+2.

~Bob


Do you Good Search for Seti@Home? http://www.goodsearch.com/?charityid=888957
Or Good Shop? http://www.goodshop.com/?charityid=888957
ID: 843203 · Report as offensive
(retired account)
Volunteer tester

Send message
Joined: 5 May 99
Posts: 30
Credit: 91,116
RAC: 0
Message 843235 - Posted: 21 Dec 2008, 21:28:19 UTC
Last modified: 21 Dec 2008, 21:35:09 UTC

Hello Raistmer,

thank you for the modified application.

My first four workunits, which were still in the cache when I did the change to your application, were all failing with calculation error and video driver crash (blank screen and driver restartet by OS).

But after checking the results I guess it's the VLAR issue and not related to your modified app, right?

WU true angle range is : 0.009700
WU true angle range is : 0.011063
WU true angle range is : 0.009700
WU true angle range is : 0.010032

I'll try to get some new workunits, hopefully non-VLAR ones.

BTW, please allow me a silly question, just for clarification: your modified application is based on version 6.05 from the main project, not on version 6.06 from Beta, right?

Best regards
Alex

EDIT (some info 'bout the system):
CPU AMD Athlon 64 X2 6000+ dualcore (non-OC)
GPU GTX260-216 ~1.400 MHz (factory-OC)
driver: Nvidia Forceware 180.48 64bit (stock driver)
BOINC: 6.4.5 64bit w/ cc_config tweak (ncpus=3)
OS: Win Vista Home Premium 64bit
ID: 843235 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 843242 - Posted: 21 Dec 2008, 21:37:52 UTC - in response to Message 843235.  
Last modified: 21 Dec 2008, 21:39:02 UTC

But after checking the results I guess it's the VLAR issue and not related to your modified app, right?

Right :)


BTW, please allow me a silly question, just for clarification: your modified application is based on version 6.05 from the main project, not on version 6.06 from Beta, right?

Best regards
Alex

No, not 6.06 or 6.05. I recived sources from Eric and after compilation got exe file with 6.04 in name.
Does it mean that sources slightly outdated or does it mean it's just cosmetic flaw (I also very often forget to increase build number by hands) - I don't know. Some sources available now, I will compare that codebase with my own. If more recent source is available I'm surely rebuild and repack. It will be announced here and on Lunatics of course.
ID: 843242 · Report as offensive
Grey Shadow
Volunteer tester
Avatar

Send message
Joined: 26 Nov 08
Posts: 41
Credit: 139,654
RAC: 0
Russia
Message 843252 - Posted: 21 Dec 2008, 22:06:27 UTC

OK, at last I fixed issue with Nr of CPUs... it was my mistake - I put cc_config.xml into BOINC directory, but it should be located in BOINC Data...

But almost gone crazy with video drivers failures. At least half of WUs were spoiled. Also I have a feeling that now these errors happen even when BOINC is not launched or computation is suspended. What is this damned "VLAR issue" and are there any ways to avoid it?
ID: 843252 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 843253 - Posted: 21 Dec 2008, 22:12:39 UTC - in response to Message 843252.  

Hm, it's some method but I don't think you like it. You need to check all incoming tasks headers and manually abort VeryLowAngleRange ones....
Too much manual work to be effective unless you write some script for it.
If you familiar with Perl it will be not so hard IMO.

Try to install another driver instead....
ID: 843253 · Report as offensive
Grey Shadow
Volunteer tester
Avatar

Send message
Joined: 26 Nov 08
Posts: 41
Credit: 139,654
RAC: 0
Russia
Message 843259 - Posted: 21 Dec 2008, 22:27:51 UTC

Hmm... I'm just a physician interested in PCs and programming %) I'm afraid my knowledge of Perl is not enough to deal with such task %)

What other drivers can you recommend? Currently I ran standard 180.48 package. It looks like this is the only version that really works with my card - I also tried CUDA 2.1 and CUDA 2.0 downloaded from CUDA's page but with them BOINC doesn't identify my card as CUDA-capable.
ID: 843259 · Report as offensive
Previous · 1 · 2 · 3 · 4 · 5 . . . 25 · Next

Message boards : Number crunching : Modified SETI MB CUDA + opt AP package for full GPU utilization


 
©2025 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.