Message boards :
Number crunching :
Impossible deadlines
Message board moderation
Author | Message |
---|---|
![]() ![]() Send message Joined: 15 Mar 10 Posts: 54 Credit: 4,819,910 RAC: 7,376 ![]() ![]() |
I currently have 53 errors and they are all tasks that had deadlines within just a few minutes of the 'Sent' time. My machine is not up there on crunch time ranking like many of you, but even if it were it seems a big ask. This seems to be something that is increasing rather than diminishing. It does not particularly affect my stats, and frankly I wouldn't really be bothered if it did, as all I am doing is releasing idle time to S@H. But it seems like an awful waste of valuable S@H Server time and bandwidth to me. It has been happening occasionally for months, but just seems to be getting worse. I don't nurse maid S@H, so I only ever see these as errors. Therefore I can't tell if it's true, but it appears that if the 'sent' timestamp came when the download started it might even have expired by the time it was on my machine. Even if my machine started it immediately it could not possibly finish by the deadline. Len I think I am. Therefore I am. I think. ![]() |
Richard Haselgrove ![]() Volunteer tester Send message Joined: 4 Jul 99 Posts: 11892 Credit: 115,439,375 RAC: 68,885 ![]() ![]() |
Don't worry about it. The explanation is well known, and has been given on these boards many times - but it's only really of interest to people who do monitor their machines obsessively. The important thing is that no bandwidth is wasted - these tasks never get anywhere near your computer. |
.clair. Send message Joined: 4 Nov 04 Posts: 1300 Credit: 42,154,685 RAC: 1,274 ![]() |
I take it as an indication of how stresed the download system is, When downloads are stuck i have had some that expire before i compleat the download, Just another day at the office. edit - Richard types quicker than i do :¬) |
![]() ![]() Send message Joined: 15 Mar 10 Posts: 54 Credit: 4,819,910 RAC: 7,376 ![]() ![]() |
Thanks Richard & Clive. I shall relax and simply consider it a measure of how hard S@H as a whole is stressed. - And the speedy replies as a measure of the enthusiastic community we have. ;) Len I think I am. Therefore I am. I think. ![]() |
![]() ![]() Send message Joined: 18 Sep 03 Posts: 805 Credit: 1,697,160 RAC: 73 ![]() |
The explanation is well known, and has been given on these boards many times I think we should have a sticky thread about it, this question seems to come back at least 1-2 times a week (including posts in the panic thread). . ![]() |
![]() Volunteer tester ![]() Send message Joined: 11 Sep 99 Posts: 6529 Credit: 182,836,182 RAC: 49,331 ![]() ![]() |
The explanation is well known, and has been given on these boards many times People actually read those? Ideally I think the solution would be to change the results messages from "Outcome No reply" & "Timed out - no response" to something like "Outcome server canceled" & "Timed out - server canceled" Something more descriptive would be better, but I haven't had coffee yet this morning. SETI@home classic workunits: 93,865 CPU time: 863,447 hours |
![]() ![]() Send message Joined: 18 Sep 03 Posts: 805 Credit: 1,697,160 RAC: 73 ![]() |
That would be even better. There is already the status "Canceled by server", it's used for tasks cancelled by server in case of results returned after deadline, i.e. the replacement task is canceled after the late task has been returned and maybe validated (on projects which use this feature, for example Collatz). That status could be used here too, the current messages are completely wrong. . ![]() |
![]() Volunteer developer Volunteer tester ![]() Send message Joined: 16 Jun 01 Posts: 5920 Credit: 80,449,374 RAC: 29,722 ![]() ![]() |
Unfortunatelly, these more destructive... 2609402650 1067642640 17 Sep 2012 | 15:27:44 UTC 17 Sep 2012 | 15:33:55 UTC Время истекло - нет ответа 0.00 0.00 --- SETI@home Enhanced Анонимная платформа (Тип ЦП) 2609402648 1067642634 17 Sep 2012 | 15:27:44 UTC 17 Sep 2012 | 15:33:55 UTC Время истекло - нет ответа 0.00 0.00 --- SETI@home Enhanced Анонимная платформа (Тип ЦП) 2609402645 1067642628 17 Sep 2012 | 15:27:44 UTC 17 Sep 2012 | 15:33:55 UTC Время истекло - нет ответа 0.00 0.00 --- SETI@home Enhanced Анонимная платформа (Тип ЦП) 2609402643 1067642622 17 Sep 2012 | 15:27:44 UTC 17 Sep 2012 | 15:33:55 UTC Время истекло - нет ответа 0.00 0.00 --- SETI@home Enhanced Анонимная платформа (Тип ЦП) 2609402641 1067642616 17 Sep 2012 | 15:27:44 UTC 17 Sep 2012 | 15:33:55 UTC Время истекло - нет ответа 0.00 0.00 --- SETI@home Enhanced Анонимная платформа (Тип ЦП) 2609402639 1067642610 17 Sep 2012 | 15:27:44 UTC 17 Sep 2012 | 15:33:55 UTC Время истекло - нет ответа 0.00 0.00 --- SETI@home Enhanced Анонимная платформа (Тип ЦП) 2609402637 1067642604 17 Sep 2012 | 15:27:44 UTC 17 Sep 2012 | 15:33:55 UTC Время истекло - нет ответа 0.00 0.00 --- SETI@home Enhanced Анонимная платформа (Тип ЦП) 2609402635 1067642598 17 Sep 2012 | 15:27:44 UTC 17 Sep 2012 | 15:33:55 UTC Время истекло - нет ответа 0.00 0.00 --- SETI@home Enhanced Анонимная платформа (Тип ЦП) 2609402631 1067642586 17 Sep 2012 | 15:27:44 UTC 17 Sep 2012 | 15:33:55 UTC Время истекло - нет ответа 0.00 0.00 --- SETI@home Enhanced Анонимная платформа (Тип ЦП) And now my main cruncher sits w/o CPU tasks at all - quota was stretched to 6 tasks per day... because of server errors. IMHO it's very time to get separate treatment for client side and server side errors. Last should not affect user quota ! |
tbret Volunteer tester ![]() Send message Joined: 28 May 99 Posts: 3377 Credit: 262,478,676 RAC: 106,702 ![]() ![]() |
+1 |
![]() Volunteer tester Send message Joined: 22 Jul 99 Posts: 768 Credit: 24,140,697 RAC: 0 ![]() |
Isn't this new wording? Change today? 9/17/2012 5:09:59 PM | SETI@home | Didn't resend lost task 13my12aa.28462.19290.12.10.168.vlar_0 (expired). At least it gives a clue. Agree that these should not impact daily quotas and s/b called server aborted or something like that. Another Fred Support SETI@home when you search the Web with GoodSearch or shop online with GoodShop. ![]() |
![]() Volunteer tester ![]() Send message Joined: 11 Sep 99 Posts: 6529 Credit: 182,836,182 RAC: 49,331 ![]() ![]() |
Isn't this new wording? Change today? The client has had that message for quite some time. SETI@home classic workunits: 93,865 CPU time: 863,447 hours |
JohnDK ![]() ![]() Volunteer tester ![]() Send message Joined: 28 May 00 Posts: 1019 Credit: 142,389,037 RAC: 158,496 ![]() ![]() |
|
![]() Send message Joined: 18 Nov 08 Posts: 12 Credit: 800,330 RAC: 0 ![]() |
Now we're at it... How about the BOINC manager working on tasks in order of deadline? I have enough time for all tasks on my system (most of the time...), but I always notice that tasks with a deadline of Nov 3 are already processed while deadlines of Sept 30 are still waiting... ![]() ![]() |
Horacio Send message Joined: 14 Jan 00 Posts: 536 Credit: 75,967,266 RAC: 0 ![]() |
Now we're at it... How about the BOINC manager working on tasks in order of deadline? I have enough time for all tasks on my system (most of the time...), but I always notice that tasks with a deadline of Nov 3 are already processed while deadlines of Sept 30 are still waiting... AFAIK, BOINC works in strict FIFO order unless it thinks that one WU will miss a deadline. If it were working by dealine order, then long tasks with long deadlines will be allways delayed and suspended when new work with short deadlines appears. And then projects with long tasks (and long deadlines) will have serious issues to get their work done (not to mention that then some projects may try to reduce the deadlines to put themselves very high in the order lists). If the client scheduller notices that some WUs will miss the deadline then it enters in "panic mode" (aka High priority) and then it changes from the FIFO order to the deadline order and crunches first those tasks in danger... ![]() |
![]() ![]() ![]() Send message Joined: 1 Jun 01 Posts: 727 Credit: 122,552,961 RAC: 101,007 ![]() ![]() |
I do wonder why this AP timed out. My guess would be that it was stuck downloading and did not finish the download in time. I am seeing that every now and then now that I am having problems downloading. (This is especially more the case when it is a large file as APs are) ![]() SETI@home classic workunits 4,019 SETI@home classic CPU time 34,348 hours |
Horacio Send message Joined: 14 Jan 00 Posts: 536 Credit: 75,967,266 RAC: 0 ![]() |
I do wonder why this AP timed out. My guess is different...and I think its a bug (or feature?) in the resent code that aborts the APs that were sent to a CPU if they get resent to an Nvidia GPU that is not OpenCl "capable" (i.e. its not on a host running Boinc 7.xx)... ![]() |
JohnDK ![]() ![]() Volunteer tester ![]() Send message Joined: 28 May 00 Posts: 1019 Credit: 142,389,037 RAC: 158,496 ![]() ![]() |
I do wonder why this AP timed out. I have disabled AP's for GPU for now, so it can't be that. |
Horacio Send message Joined: 14 Jan 00 Posts: 536 Credit: 75,967,266 RAC: 0 ![]() |
I do wonder why this AP timed out. At the contrary, I think it's more likely then... If the scheduller was not able to sent them to your GPU then thats why they were "aborted"... The resent code doesnt put WUs on hold waiting for a request to a more suitable device... if the scheduller were doing that then instead of the usual short deadline on vlars, they will be skipped until you need a CPU task... (I have several APs "aborted" through the deadline in all my hosts. All them use BOINC 6.10.60 but only one doesnt have the apps to crunch them on GPU...) Edit: What I think, is that with the adition of the new OpenCl stock apps for ATI and Nvidia, and trying to make them compatible with older clients using the optimized apps, and all the workarounds made to avoid vlars on GPU and whatnot... something is not working as intended with the resend of APs... ![]() |
![]() ![]() Send message Joined: 24 Jan 00 Posts: 13711 Credit: 179,944,503 RAC: 77,590 ![]() ![]() |
I do wonder why this AP timed out. Well then that answers it, that AP just suffered the same fate that VLAR's do. Cheers. |
JohnDK ![]() ![]() Volunteer tester ![]() Send message Joined: 28 May 00 Posts: 1019 Credit: 142,389,037 RAC: 158,496 ![]() ![]() |
I do wonder why this AP timed out. I disabled, removed the app_info section, for GPU APs days ago, so that AP could only be sent/resent to CPU. |
©2018 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.