Whole cache Error 144 unrecoverable ?

Message boards : Number crunching : Whole cache Error 144 unrecoverable ?
mlcudd
Volunteer tester
Joined: 11 Apr 03
Posts: 782
Credit: 63,647
RAC: 0
United States
Message 25741 - Posted: 13 Sep 2004, 2:31:12 UTC
Last modified: 13 Sep 2004, 2:33:52 UTC

Hi All,
I guess the "little problem" someone mentioned in another thread is a "big" problem. Out of 77 pending WU's waiting to be crunched, all of them errored out in a matter of 3 minutes. I now have 0 (zero) running. I am running XP SP1, and as soon as I attempted to transfer completed WU's, all the "uncrunched" ones methodically errored out. I quickly disabled network access, but as soon as I connected again the rest errored out. It clearly says "Deferring communication 1 minute" and right underneath it says "Unrecoverable error Unable to start app".
This is listed 31 times. When I re-enabled network access, the exact same thing happened: "deferring communication for 1 minute", and then immediately all the rest of the WU's errored out. On the work page they all still show as ready to run, but none will start.

** I spoke too soon, they are all now showing "Ready to Report"

Regards,

Rocky

ID: 25741
JAF
Joined: 9 Aug 00
Posts: 289
Credit: 168,721
RAC: 0
United States
Message 25752 - Posted: 13 Sep 2004, 2:52:20 UTC - in response to Message 25741.  

Rocky,

Do all the "Ready to Report" WU's have CPU time? If not, I'm afraid you will lose them all. I had over 30 WU's ready to report on one of my computers and one WU with zero CPU time. I just couldn't do a successful update, even on Friday when things started flowing.

Eventually I had to reset the project because I was going to run out of work. After the reset, I received a bunch of WU's, it started crunching them, and they reported just fine.
ID: 25752
mlcudd
Volunteer tester
Joined: 11 Apr 03
Posts: 782
Credit: 63,647
RAC: 0
United States
Message 25798 - Posted: 13 Sep 2004, 4:45:29 UTC
Last modified: 13 Sep 2004, 4:46:03 UTC

Hey JAF,
No, unfortunately they all have zero time. I would really like to know why this happens, as I thought this was a "fixed" issue before. What it seems like is this: let's say, for instance, we have a power outage and the computer restarts. The network connection is enabled in BOINC by default (whether it was disabled before or not), and unless you are right on top of the box when the power comes back on, you're screwed, because if a WU finishes and can't upload, some kind of error occurs that makes all the other WU's invalid. It goes through them one at a time trying to "open the app", and then errors them out as unrecoverable.
That is about all I have come up with.

All I can ask for is what I would think is a "simple fix", which is to have network access disabled by default. Or at least ask whether you want it disabled by default, for those who are on dial-up.
Again, I am no programmer or coder, I don't even use a computer that well, but it seems like an easy fix to me.
I was going to send an email to Rom. Do you think that would be the right approach? I know I have not seen him on the boards today, and with the present problems, I don't think he will be tomorrow either.

Thanks for the response.

Warm Regards,

Rocky
ID: 25798
