To Many ERRORS
Author | Message |
---|---|
Fred J. Verster Send message Joined: 21 Apr 04 Posts: 3252 Credit: 31,903,643 RAC: 0 |
I was thinking the same, but could not find any 'real evidence' while the errors, Exit status 197 (0xc5) EXIT_TIME_LIMIT_EXCEEDED, kept happening, although to a lesser extent since I used 2 threads to feed the GPUs. A few minutes ago I changed period_iterations from 20 to 10, which decreased runtime, increased GPU load, and also decreased lag. I will let it run with this setting. By the way, I'm running 2 instances_per_device for MB work. (Win 7 64-bit, BOINC 7.0.28 64-bit, Lunatics rev. 390 app for MB, CPU = i7-2600, GPUs = 2x AMD/ATI EAH5870.) All stock settings. |
Mike Send message Joined: 17 Feb 01 Posts: 34249 Credit: 79,922,639 RAC: 80 |
That's what I expected. I had exactly the same issue last week on my 5850. I already said how to fix it. With each crime and every kindness we birth our future. |
LadyL Send message Joined: 14 Sep 11 Posts: 1679 Credit: 5,230,097 RAC: 0 |
to digress: it's worth a try at least if it's something app related. If you still get errors with something like 100 you know it's not that... I'm not the Pope. I don't speak Ex Cathedra! |
Mike Send message Joined: 17 Feb 01 Posts: 34249 Credit: 79,922,639 RAC: 80 |
to digress: No, it won't change anything. With each crime and every kindness we birth our future. |
LadyL Send message Joined: 14 Sep 11 Posts: 1679 Credit: 5,230,097 RAC: 0 |
That's what I expected. Yes, I was wrong; it happens. I'll leave this in your capable hands then. If your suggestions don't help, we can go back to the drawing board. I'm not the Pope. I don't speak Ex Cathedra! |
LadyL Send message Joined: 14 Sep 11 Posts: 1679 Credit: 5,230,097 RAC: 0 |
I had exactly the same issue last week on my 5850. Probably best if you repeat how to fix it, Mike. I find it's rather hidden and skildude may have missed it. I'm not the Pope. I don't speak Ex Cathedra! |
skildude Send message Joined: 4 Oct 00 Posts: 9541 Credit: 50,759,529 RAC: 60 |
On a FX you need to free 2 cores to get full GPU utilisation. Is that the fix? I'll free another CPU core and see what happens. I've reduced my usage to 6 cores. I'm now wondering if I could up my instances to 4 on the GPU if this actually works. In a rich man's house there is no place to spit but his face. Diogenes Of Sinope |
BilBg Send message Joined: 27 May 07 Posts: 3720 Credit: 9,385,827 RAC: 0 |
I changed, a few minutes ago, period_iterations from 20 to 10, which Are you sure about the lag? You can feel the lag most with VLARs; if you now run non-VLARs you will feel less lag. I run with -period_iterations_num 80 and even with this higher value I feel a small lag (especially when scrolling) if a VLAR is running (with -period_iterations_num 10 the lag is very big). This makes me ask Raistmer: is it possible to have some option that sets -period_iterations_num to different values depending on AR? E.g.: -period_iterations_num 20 -period_iterations_num_VLAR 100 - ALF - "Find out what you don't do well ..... then don't do it!" :) |
Mike Send message Joined: 17 Feb 01 Posts: 34249 Credit: 79,922,639 RAC: 80 |
On a FX you need to free 2 cores to get full GPU utilisation. Yes. And watch your DCF. Tell me your GPU usage please. With each crime and every kindness we birth our future. |
Fred J. Verster Send message Joined: 21 Apr 04 Posts: 3252 Credit: 31,903,643 RAC: 0 |
I changed, a few minutes ago, period_iterations from 20 to 10, which I'm a little confused too; I expected to see an increase in lag with a lower period_iterations_for_pulsefind. Probably each card/GPU has its own 'best' setting for period_iterations_for_pulsefind. The difference in runtime is small compared to 20, but I'll keep 10. The biggest difference came from freeing up 2 threads instead of 1, that's 1 i7-2600 core. That's what Mike suggested in the first place, too. |
.clair. Send message Joined: 4 Nov 04 Posts: 1300 Credit: 55,390,408 RAC: 69 |
I seem to be a bit late getting to the party. I have set 'period_iterations 2' and the lag is only a problem if a workunit is starting and being loaded into the GPU. This thing is a crunch box, so I am willing to tolerate quite a bit of lag. The CPU is only a P4 3.6 GHz (Prescott 660) with HT. The CPU only crunches one FreeHAL NCI task so as to keep its load down. I find that the P4 is often overwhelmed by the demands of two 7970s, and during a shorty storm it cannot cope with servicing the GPUs and stays at 100% load for several minutes at a time, which makes the computer unusable for me. Though if it is 'busy', it is up to me to leave it alone to get on with it and go play with one of the other computers. I did 'borrow' my Q6600 from another rig to see how it fared, and in that short test found that I had to keep one core free to feed each GPU, though I was not using -pi2 or -hp at that time. If crunching on all four CPU cores I was getting Maximum_Time_Exceeded errors; these stopped with two cores free for the GPUs to use. I am only running two WUs per card because the PSU can't cope with any more; it is only a Corsair HX620W and this box is eating about 500 W. I have to get another PSU before the third card. Edit: OS Win7 Home 64, BM 7.0.28, CCC 12.4. |
skildude Send message Joined: 4 Oct 00 Posts: 9541 Credit: 50,759,529 RAC: 60 |
On a FX you need to free 2 cores to get full GPU utilisation. 95-100% In a rich man's house there is no place to spit but his face. Diogenes Of Sinope |
Mike Send message Joined: 17 Feb 01 Posts: 34249 Credit: 79,922,639 RAC: 80 |
I see no more errors. Your times have stabilized as well. Nice card IMHO. With each crime and every kindness we birth our future. |
skildude Send message Joined: 4 Oct 00 Posts: 9541 Credit: 50,759,529 RAC: 60 |
yet I have a 5850 that isn't having this problem. In a rich man's house there is no place to spit but his face. Diogenes Of Sinope |
Mike Send message Joined: 17 Feb 01 Posts: 34249 Credit: 79,922,639 RAC: 80 |
yet I have a 5850 that isn't having this problem. I don't see a problem anymore on your 7970. With each crime and every kindness we birth our future. |
Fred J. Verster Send message Joined: 21 Apr 04 Posts: 3252 Credit: 31,903,643 RAC: 0 |
yet I have a 5850 that isn't having this problem. Well, the Exit status 197 (0xc5) EXIT_TIME_LIMIT_EXCEEDED errors have surfaced again. I tried doing 1 WU per GPU to see if that helps, but it gives a very low load, so I'm back to 2 per GPU with period_iterations 40 instead of 10. Still using 1 i7-2600 thread of 8 for each GPU (ATI HD5870). CPU load during the first 10 seconds is 100% per (idle) thread. Errors appear on both the 1st and 2nd GPU, which have about the same load, 85% on average. |
Mike Send message Joined: 17 Feb 01 Posts: 34249 Credit: 79,922,639 RAC: 80 |
Don't confuse me please, Fred. What's your DCF? Do you have flops included in your app_info.xml? What are the estimated times on the GPUs? How many CPU cores are in use? With each crime and every kindness we birth our future. |
skildude Send message Joined: 4 Oct 00 Posts: 9541 Credit: 50,759,529 RAC: 60 |
I still don't get why it needs 2 cores to load. In a rich man's house there is no place to spit but his face. Diogenes Of Sinope |
Fred J. Verster Send message Joined: 21 Apr 04 Posts: 3252 Credit: 31,903,643 RAC: 0 |
Dont confuse me please Fred. Why should I confuse you? Task duration correction factor: 3.61424. No FLOPS included (never had on this rig). 3 cores, 6 threads are in use; 1 core, or 2 threads (HT = ON), feeds the GPUs. Estimated times are, of course, too high: 1.5 x runtime. |
Mike Send message Joined: 17 Feb 01 Posts: 34249 Credit: 79,922,639 RAC: 80 |
First of all, you quoted my reply to skildude, so I got confused. Anyway, I fear you need to free 1 physical core per GPU, not one thread. Please try it to see if this helps. It certainly should. With each crime and every kindness we birth our future. |
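
The "one physical core per GPU, not one thread" advice above comes down to simple arithmetic once Hyper-Threading is taken into account. Here is a minimal sketch, assuming two logical threads per physical core as on Fred's i7-2600 (4 cores, 8 threads, HT on). The function name and its parameters are made up for illustration; they are not part of BOINC or the Lunatics app.

```python
def cpu_threads_for_crunching(physical_cores, threads_per_core, gpus, cores_per_gpu=1):
    """Return (logical threads left for CPU tasks, percent of all logical CPUs).

    Hypothetical helper: reserves whole physical cores for feeding GPUs,
    so with Hyper-Threading each reserved core removes two logical threads.
    """
    total_threads = physical_cores * threads_per_core
    reserved_threads = gpus * cores_per_gpu * threads_per_core
    if reserved_threads > total_threads:
        raise ValueError("cannot reserve more cores than the CPU has")
    usable = total_threads - reserved_threads
    percent = 100.0 * usable / total_threads
    return usable, percent


# Fred's rig as described in the thread: i7-2600 (4 cores, HT on), 2 GPUs.
usable, percent = cpu_threads_for_crunching(physical_cores=4, threads_per_core=2, gpus=2)
print(f"{usable} threads left for CPU work ({percent:.0f}% of processors)")
```

Under these assumptions, freeing one physical core per GPU leaves 4 of the 8 threads for CPU tasks, compared with the 6 threads Fred reports using now. The percentage is the kind of value one would enter in BOINC's "use at most N% of the processors" computing preference, if that is how the cores are being freed on the rig in question.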
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.