False "abandoned" tasks |
Message boards : Number crunching : False "abandoned" tasks
| Author | Message |
|---|---|
|
I have an issue with one computer for which 55 tasks were wrongly reported as "abandoned" on 18 Feb 2013, 15:06:59 UTC, even though the completed ones were reported normally during the blackout period. The others are still being executed. One example is: | |
| ID: 1339405 · | |
|
Not your computer's fault, it's seti@home. | |
| ID: 1339421 · | |
|
Thanks Khangollo. | |
| ID: 1339626 · | |
|
This happened to one of my machines two days ago, but I only noticed it today. Since I access it via remote desktop, it is really painful to check individual tasks to see which have been abandoned but are still being processed by the machine. Is resetting a good option to clear this issue? And will the tasks be re-downloaded after the reset? | |
| ID: 1339695 · | |
> This happened to one of my machines two days ago, but I only noticed it today. Since I access it via remote desktop, it is really painful to check individual tasks to see which have been abandoned but are still being processed by the machine. Is resetting a good option to clear this issue? And will the tasks be re-downloaded after the reset?

Yes, reset the project. Only the tasks that haven't been abandoned will be resent, 20 at a time.
Claggy | |
| ID: 1339753 · | |
> Yes, reset the project. Only the tasks that haven't been abandoned will be resent, 20 at a time.

Thanks for clarifying, Claggy. The machine is now downloading all the lost tasks except for those that have expired. I also noticed that all the MB tasks are now just labelled seti_enhanced 6.03 and there is no identification for the CUDA tasks. But the system does seem to recognize them, and the WUs are being properly processed by the CPUs and GPUs. | |
| ID: 1339758 · | |
|
Anybody worried whether they have such 'abandoned' tasks can check by clicking on the 'tasks' link on your main account page. | |
| ID: 1339760 · | |
|
All my hosts have been "abandoning" all their tasks in the last few days... one of them did it twice just today... | |
| ID: 1339793 · | |
|
When a task is marked 'abandoned', there's a server timestamp for the event on the website task list. | |
| ID: 1339805 · | |
|
On the host that did it twice today, the last batch of abandoned tasks was marked at 19:37:04 UTC. I'm at UTC-3, so that was 16:37:04 my time. This time matches (more or less) an RPC, and there is nothing unusual in the log. (There are a lot of "project communication failed" entries, but those are related to the downloads and are a very common, frequent and consistent issue when downloading SETI WUs.)

20/02/2013 16:33:50 | | Internet access OK - project servers may be temporarily down.
20/02/2013 16:36:08 | | Project communication failed: attempting access to reference site
20/02/2013 16:36:08 | SETI@home | Backing off 1 min 0 sec on download of 02dc12ad.12608.4157.9.10.248
20/02/2013 16:36:10 | | Internet access OK - project servers may be temporarily down.
20/02/2013 16:37:37 | SETI@home | Sending scheduler request: To fetch work.
20/02/2013 16:37:37 | SETI@home | Reporting 1 completed tasks, requesting new tasks for CPU and GPU
20/02/2013 16:37:49 | SETI@home | Scheduler request completed: got 1 new tasks
20/02/2013 16:37:49 | SETI@home | Message from server: Resent lost task 02dc12ad.12608.366236.9.10.164_1
20/02/2013 16:40:43 | | Project communication failed: attempting access to reference site
20/02/2013 16:40:43 | SETI@home | Backing off 1 min 0 sec on download of 29dc12ae.9337.6611.13.10.144
20/02/2013 16:40:52 | | Internet access OK - project servers may be temporarily down.

Indeed, there is something weird: the server says it is resending a ghost, but the previous RPCs didn't fail... | |
| ID: 1339815 · | |
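For anyone who wants to repeat the comparison in the post above, here is a minimal Python sketch that converts the client log timestamps (local time, UTC-3 as the poster states) to UTC and lists the entries close to the server's "abandoned" timestamp. The log excerpt is copied from the post; the helper names `parse_line` and `events_near` and the 60-second window are assumptions made for illustration, not part of BOINC.

```python
from datetime import datetime, timedelta, timezone

# Poster's stated offset from UTC (assumption: no DST change on that day).
LOCAL_TZ = timezone(timedelta(hours=-3))

# Excerpt of the client log quoted in the post (client local time).
LOG = """\
20/02/2013 16:36:10 | | Internet access OK - project servers may be temporarily down.
20/02/2013 16:37:37 | SETI@home | Sending scheduler request: To fetch work.
20/02/2013 16:37:37 | SETI@home | Reporting 1 completed tasks, requesting new tasks for CPU and GPU
20/02/2013 16:37:49 | SETI@home | Scheduler request completed: got 1 new tasks
20/02/2013 16:37:49 | SETI@home | Message from server: Resent lost task 02dc12ad.12608.366236.9.10.164_1
20/02/2013 16:40:43 | | Project communication failed: attempting access to reference site
"""

# Server-side timestamp of the "abandoned" event, as reported in the post.
ABANDONED_AT = datetime(2013, 2, 20, 19, 37, 4, tzinfo=timezone.utc)


def parse_line(line):
    """Split 'dd/mm/YYYY HH:MM:SS | project | message' into (utc_time, project, message)."""
    stamp, project, message = (part.strip() for part in line.split("|", 2))
    local = datetime.strptime(stamp, "%d/%m/%Y %H:%M:%S").replace(tzinfo=LOCAL_TZ)
    return local.astimezone(timezone.utc), project, message


def events_near(log_text, server_time_utc, window_seconds=60):
    """Return log entries within window_seconds of the server timestamp."""
    hits = []
    for line in log_text.splitlines():
        if not line.strip():
            continue
        when, project, message = parse_line(line)
        if abs((when - server_time_utc).total_seconds()) <= window_seconds:
            hits.append((when, project, message))
    return hits


if __name__ == "__main__":
    for when, project, message in events_near(LOG, ABANDONED_AT):
        print(when.strftime("%H:%M:%S UTC"), project or "-", message)
```

Widening `window_seconds` helps when the client clock and the server clock drift apart.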
|
It happened again, in another host, for the second time today. | |
| ID: 1339856 · | |
|
OMG, I got 150+ abandoned AP tasks, all but two on my machine... That's 1.2 GB worth of downloads. | |
| ID: 1339861 · | |
|
That could conceivably happen if two different machines were reporting in and claiming to have the same HostID. | |
| ID: 1339862 · | |
|
Negative. At least not this particular BOINC installation. It has resided here, on this machine, for years. | |
| ID: 1339863 · | |
> That could conceivably happen if two different machines were reporting in and claiming to have the same HostID.

Nope, never... my current hosts have not "suffered" any upgrade recently. The last upgrade was the addition of a new GPU about 4 months ago. Besides that, all of them have been working as they are now for almost a year, and all my hosts were installed the long way (installers, manual attach to projects, etc.). By the way, I think that if two computers really had the same ID, one of them would have a wrong number in the RPC sequence, which should make BOINC assign a new ID... and that is not happening on my hosts... | |
| ID: 1339864 · | |
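The sequence-number argument in the post above can be illustrated with a toy model. This is only a sketch of the behaviour the poster describes, assuming the server remembers the last RPC sequence number per host ID and hands out a new ID when a request arrives with a lower number; `ToyScheduler`, `handle_request` and the ID values are hypothetical and are not BOINC's actual server code.

```python
from itertools import count


class ToyScheduler:
    """Hypothetical bookkeeping: last RPC sequence number seen per host ID."""

    def __init__(self, first_new_id=1000):
        self.last_seqno = {}
        self._new_ids = count(first_new_id)

    def handle_request(self, host_id, seqno):
        """Return the host ID the requesting client should use from now on."""
        previous = self.last_seqno.get(host_id)
        if previous is not None and seqno < previous:
            # An older sequence number than already seen suggests that a
            # second machine is sharing this host ID, so give it a fresh one.
            new_id = next(self._new_ids)
            self.last_seqno[new_id] = seqno
            return new_id
        self.last_seqno[host_id] = seqno
        return host_id


if __name__ == "__main__":
    scheduler = ToyScheduler()
    print(scheduler.handle_request(42, 1))  # original machine
    print(scheduler.handle_request(42, 5))  # original machine, later request
    print(scheduler.handle_request(42, 3))  # cloned machine with a stale number
```

In this model a cloned installation gets split off under a new ID on its first stale request, which is why the poster argues that duplicate host IDs do not explain hosts that keep their ID.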
> When a task is marked 'abandoned', there's a server timestamp for the event on the website task list.

It happens during scheduler requests that time out (when the S@H scheduler is misbehaving). The server accepts the request and sends a response, but the response gets lost and never reaches the client. Then the next request attempt has a chance of triggering this task abandonment bug. At least, this is how I observed it happening to me. | |
| ID: 1339881 · | |
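The lost-reply situation described above can be sketched as a toy model. It covers only the part that is visible in the thread: after a reply is lost, the server's list of in-progress tasks and the client's list diverge, and the difference is what a "Resent lost task" message would refer to. What actually triggers the abandonment is not established in this thread, so it is not modelled. `ToyServer`, `lost_tasks` and the task names are hypothetical.

```python
def lost_tasks(server_in_progress, client_reported):
    """Tasks the server believes the host holds but the client did not list."""
    return sorted(set(server_in_progress) - set(client_reported))


class ToyServer:
    """Hypothetical server that tracks which tasks it has handed to one host."""

    def __init__(self, in_progress=()):
        self.in_progress = set(in_progress)

    def schedule(self, client_tasks, new_tasks):
        """Handle one scheduler request and build the reply."""
        resent = lost_tasks(self.in_progress, client_tasks)
        self.in_progress |= set(new_tasks)
        return {"resent": resent, "new": list(new_tasks)}


if __name__ == "__main__":
    server = ToyServer(in_progress={"task_a"})
    client_tasks = {"task_a"}

    # Request 1: the server assigns task_b, but the reply never arrives,
    # so the client's task list stays unchanged.
    server.schedule(client_tasks, new_tasks=["task_b"])

    # Request 2: the server now sees that task_b is missing on the client.
    reply = server.schedule(client_tasks, new_tasks=[])
    print(reply["resent"])
```

The point of the sketch is that a single lost reply is enough to make the two task lists disagree, even though the client log shows no failed request of its own.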
| Copyright © 2013 University of California |