Message boards :
Number crunching :
The 400 and 50 WU limits are way too small
Message board moderation
Previous · 1 · 2 · 3 · 4 · Next
Author | Message |
---|---|
Cosmic_Ocean Send message Joined: 23 Dec 00 Posts: 3027 Credit: 13,516,867 RAC: 13 |
Actually, the AP creation rate is about what it has been for many months. Anywhere between 0.25-1.00 is "normal." It has spiked over 1.00 a few times, but not very often. Linux laptop: record uptime: 1511d 20h 19m (ended due to the power brick giving-up) |
red-ray Send message Joined: 24 Jun 99 Posts: 308 Credit: 9,029,848 RAC: 0 |
How can a system reach the limit when it's just returned 3 tasks? 21/02/2012 11:47:52 | SETI@home | Reporting 3 completed tasks, requesting new tasks for CPU I assume it's a race condition and the test is being done before the 3 just returned have been processed. The effect of this is to cause the CPU work fetch deferred to back off and starve the system of WUs. This is my slow (18K per day) system, so it's not a big issue. It just amused me. |
LadyL Send message Joined: 14 Sep 11 Posts: 1679 Credit: 5,230,097 RAC: 0 |
How can a system reach the limit when it's just returned 3 tasks? Good one :D Maybe it reported GPU tasks - it was asking for CPU. |
Belthazor Send message Joined: 6 Apr 00 Posts: 219 Credit: 10,373,795 RAC: 13 |
How can a system reach the limit when it's just returned 3 tasks? It's easy - scheduler just made re-count neÑessary time to crunch received WUs and on some reasons this time is increased, for example if you're used BOINC 24\7 but now it was shutted down for a while. |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14650 Credit: 200,643,578 RAC: 874 |
How can a system reach the limit when it's just returned 3 tasks? No, the limit is by quantity, not by time. |
W-K 666 Send message Joined: 18 May 99 Posts: 19064 Credit: 40,757,560 RAC: 67 |
"The 400 and 50 WU limits are way too small" for some computers, way too big for most. We need BOINC to provide a time-based limits option, restricting each host to about 2 days of work "in progress" would be appropriate for this project currently.Joe But DCF is controlling the hosts requests for work. MB cpu tasks on my computers reset the DCF to 1.00 +/- 0.1 but the MB GPU or AP tsks drive it low. If I load up the cpu with AP work then the GPU tsks drive the DCF down to below 0.2 regulary. Of course when the AP tasks complete and a MB task completes then the computer enters panic mode. |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14650 Credit: 200,643,578 RAC: 874 |
But DCF is controlling the hosts requests for work. It may be controlling the request, but it isn't controlling the reply any more. With my 0.0691, it's a case of "request 15 hours work, allocated 1 hour" - even if there's an infinite amount of work ready to send in the feeder queue. |
W-K 666 Send message Joined: 18 May 99 Posts: 19064 Credit: 40,757,560 RAC: 67 |
But DCF is controlling the hosts requests for work. Then how is it I have to drive the DCF down to get even 1 day of work for the GPU. At one stage I had about 8 days of work for the cpu thanks to ~20 ap's and only about 50 tasks total for the gpu. |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14650 Credit: 200,643,578 RAC: 874 |
But DCF is controlling the hosts requests for work. I can't answer that, but try enabling <sched_op_debug> logging so you get the 'estimated total task duration' after a work allocation. If DCF is low, the (client estimate of the) work allocation will be low in the same proportion, at best. Edit - and if DCF is high, you probably won't have a resouce shortfall to trigger the work request in the first place. |
W-K 666 Send message Joined: 18 May 99 Posts: 19064 Credit: 40,757,560 RAC: 67 |
Edit - and if DCF is high, you probably won't have a resouce shortfall to trigger the work request in the first place. Thats the problem, with only a project DCF, the GPU tasks are estimated, when DCF=1, at 4 or 5 times actual. |
red-ray Send message Joined: 24 Jun 99 Posts: 308 Credit: 9,029,848 RAC: 0 |
I keep wondering why the Average turnaround time can't be used to decide how many WUs you are allowed. Currently for my 980X it is 0.38 days and it has 731 pending so it would need to be allowed about 6000 rather than 2200 so it could last 3 days. If each host had a quota that got adjusted to meet a target turn-a-round time all the hosts would get a fair share of WUs and overall there would be far less results in progress. Ideally you would have separate GPU and CPU quotas and turn-a-round times. |
HAL9000 Send message Joined: 11 Sep 99 Posts: 6534 Credit: 196,805,888 RAC: 57 |
I keep wondering why the Average turnaround time can't be used to decide how many WUs you are allowed. Currently for my 980X it is 0.38 days and it has 731 pending so it would need to be allowed about 6000 rather than 2200 so it could last 3 days. If each host had a quota that got adjusted to meet a target turn-a-round time all the hosts would get a fair share of WUs and overall there would be far less results in progress. Ideally you would have separate GPU and CPU quotas and turn-a-round times. It might be possible, or they could already be doing it, to include that into the DCF calculations. Not that DCF isn't already messed up enough as it is. SETI@home classic workunits: 93,865 CPU time: 863,447 hours Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[ |
W-K 666 Send message Joined: 18 May 99 Posts: 19064 Credit: 40,757,560 RAC: 67 |
I keep wondering why the Average turnaround time can't be used to decide how many WUs you are allowed. Currently for my 980X it is 0.38 days and it has 731 pending so it would need to be allowed about 6000 rather than 2200 so it could last 3 days. If each host had a quota that got adjusted to meet a target turn-a-round time all the hosts would get a fair share of WUs and overall there would be far less results in progress. Ideally you would have separate GPU and CPU quotas and turn-a-round times. Obviously it doesn't work for new or re-attached computers but why not work out downloads on RAC. By definition that says everything about what the computer can crunch in an average day. edit] Silly me, I just remembered why it cannot happen, although Eric K did a pretty good job at it, the BOINC people cannot work out what the credits/time are. |
tbret Send message Joined: 28 May 99 Posts: 3380 Credit: 296,162,071 RAC: 40 |
Just get them to send you a thumb-drive full of WUs like I did. |
red-ray Send message Joined: 24 Jun 99 Posts: 308 Credit: 9,029,848 RAC: 0 |
Obviously it doesn't work for new or re-attached computers but why not work out downloads on RAC. There would be a minimum quota to deal with new and returning systems. No using the RAC is not ideal. You could have an high RAC and be turning WUs round in 10 days. The regime should favour systems that return WUs before the turn-a-round target time to try and keep the number of WUs in progess to a minimum. |
W-K 666 Send message Joined: 18 May 99 Posts: 19064 Credit: 40,757,560 RAC: 67 |
Obviously it doesn't work for new or re-attached computers but why not work out downloads on RAC. But they could still use the RAC * "turn-a-round target" where RAC would be translated into flops/time in the formula to calculate max work downloaded. |
red-ray Send message Joined: 24 Jun 99 Posts: 308 Credit: 9,029,848 RAC: 0 |
I just got: 28/02/2012 01:27:34 | SETI@home | Reporting 2 completed tasks, requesting new tasks for NVIDIA GPU 28/02/2012 01:27:46 | SETI@home | Scheduler request completed: got 0 new tasks 28/02/2012 01:27:46 | SETI@home | No tasks sent 28/02/2012 01:27:46 | SETI@home | No tasks are available for Astropulse v5 28/02/2012 01:27:46 | SETI@home | No tasks are available for Astropulse v505 28/02/2012 01:27:46 | SETI@home | This computer has reached a limit on tasks in progress But only have 1,437 WUs. Given I have 4 GPUs I would be allowed 1,600 for the GPUs alone. Is there yet another limit? |
HAL9000 Send message Joined: 11 Sep 99 Posts: 6534 Credit: 196,805,888 RAC: 57 |
I just got: Probably ghost tasks but that can't be checked right now with the results disabled. SETI@home classic workunits: 93,865 CPU time: 863,447 hours Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[ |
red-ray Send message Joined: 24 Jun 99 Posts: 308 Credit: 9,029,848 RAC: 0 |
OK. Is there a way to expunge any ghost tasks that do exist pelase? |
Lionel Send message Joined: 25 Mar 00 Posts: 680 Credit: 563,640,304 RAC: 597 |
I hope Eric can get that assimilator sorted out for when the new download server arrives - after all, we'll want massive download congestion so we can see what it can do under real load (and give them a chance to fine tune the software). I think you should do the honourable thing ... and move a bit further away ... :) |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.