BOINC lost attachment to S@H project

Message boards : Number crunching : BOINC lost attachment to S@H project
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile Jeff Buck Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 11 Feb 00
Posts: 1441
Credit: 148,764,870
RAC: 0
United States
Message 1483759 - Posted: 2 Mar 2014, 19:54:04 UTC

Friday evening, I discovered that BOINC had lost track of S@H on my host 7057115 following a restart. It was suddenly telling me that "This computer is not attached to any projects". The short version is that I eventually ended up having to add the S@H project back to BOINC, but in the process wound up with 146 Abandoned tasks. The crisis has passed, for now, but I'd like to understand whether there's some other approach I could take to avoid the Abandoned tasks situation if this sort of thing happens again. (I suspect the root cause is an incomplete system shutdown leading to startup anomalies, so unless I can track down that gremlin, BOINC / S@H will continue to be at risk.)

The first thing I did when I saw the "not attached" message was to confirm that the S@H project directory was still present and populated. It was, and as far as I could tell still had all required files present. The Event Log, though, had numerous error messages, starting with "Couldn't parse account file account_setiathome.berkeley.edu.xml", "Couldn't parse statistics_setiathome.berkeley.edu.xml", and "Project SETI@home is in state file but no account file found". I won't try to post everything here unless somebody thinks it necessary, but basically there are a lot of "... outside project in state file" messages for every WU and task (the ones that ultimately got abandoned).

I initially tried just shutting down the BOINC Manager and client, but still got the same "not attached" message, although this time the Event Log only showed the "Couldn't parse ..." messages without the myriad of additional error messages. At this point I finally just decided to go ahead and add S@H back to BOINC, hoping that it would recognize the existing project directory and tasks, but of course it didn't. It immediately tried to download two new tasks and all the associated application files. However, those downloads were failing with a series of "Can't create HTTP response output file ..." messages, one for each task and app file.

So then I just threw up my hands and rebooted the machine. When BOINC came back up this time, the new downloads proceeded without any problems and I eventually got all the application files back (well, the MB ones, anyway) and a whole queue full of new tasks, but of course by now the original 146 tasks had been abandoned. I guess BOINC just overwrote the entire S@H project directory, since I also lost my app_config.xml and mbcuda.cfg files, which I had to recreate.

As I say, the crisis has passed and BOINC seems to be running smoothly again, but I'd like to know if there's a way to reconnect BOINC to the project without such complete carnage, should it every lose its memory like this again. Any suggestions?
ID: 1483759 · Report as offensive
Profile Zombu2
Volunteer tester

Send message
Joined: 24 Feb 01
Posts: 1615
Credit: 49,315,423
RAC: 0
United States
Message 1483861 - Posted: 3 Mar 2014, 0:40:58 UTC

i think those lost tasks get resend if you reattach it in a reasonable ammount of time , i think it was a fluke

i think i have seen another post somewhere here that mentioned the same thing
I came down with a bad case of i don't give a crap
ID: 1483861 · Report as offensive
Profile Bernie Vine
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 26 May 99
Posts: 9954
Credit: 103,452,613
RAC: 328
United Kingdom
Message 1483863 - Posted: 3 Mar 2014, 0:50:21 UTC

Last Monday the 24th Feb the HDD in this machine 4241618 failed.

I replaced it on Tuesday and re-installed the OS, as I had to start with Vista then "upgrade" to Win 7 it took till Wednesday afternoon before it was ready. I re-installed Boinc and connected, and I have lost no tasks and had no errors.

So it would seem as long as the machine is the same there should not be a problem.
ID: 1483863 · Report as offensive
Profile Jeff Buck Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 11 Feb 00
Posts: 1441
Credit: 148,764,870
RAC: 0
United States
Message 1483868 - Posted: 3 Mar 2014, 1:23:15 UTC - in response to Message 1483863.  

I re-installed Boinc and connected, and I have lost no tasks and had no errors.

When you reinstalled BOINC, did you have to reattach S@H or did it find your projects and project directories automatically?
ID: 1483868 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1483869 - Posted: 3 Mar 2014, 1:31:43 UTC - in response to Message 1483868.  
Last modified: 3 Mar 2014, 1:35:45 UTC

I re-installed Boinc and connected, and I have lost no tasks and had no errors.

When you reinstalled BOINC, did you have to reattach S@H or did it find your projects and project directories automatically?

He would have had to reattach because he had lost the following file:

starting with "Couldn't parse account file account_setiathome.berkeley.edu.xml", "Couldn't parse statistics_setiathome.berkeley.edu.xml", and "Project SETI@home is in state file but no account file found".

Had he restored them from another host he may not have had to do that, whether he would have lost his Wu's is another matter.

Claggy
ID: 1483869 · Report as offensive
Profile Jeff Buck Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 11 Feb 00
Posts: 1441
Credit: 148,764,870
RAC: 0
United States
Message 1483870 - Posted: 3 Mar 2014, 1:45:49 UTC - in response to Message 1483869.  

I re-installed Boinc and connected, and I have lost no tasks and had no errors.

When you reinstalled BOINC, did you have to reattach S@H or did it find your projects and project directories automatically?

He would have had to reattach because he had lost the following file:

starting with "Couldn't parse account file account_setiathome.berkeley.edu.xml", "Couldn't parse statistics_setiathome.berkeley.edu.xml", and "Project SETI@home is in state file but no account file found".

Had he restored them from another host he may not have had to do that, whether he would have lost his Wu's is another matter.

Claggy

Well, I guess the loss of that account file is the reason that I had to reattach, but I was wondering whether Bernie had to do the same thing when he reinstalled BOINC.

What could have caused that account file to disappear, anyway? (I'm guessing that it was a shutdown problem, but I'm not really sure.)
ID: 1483870 · Report as offensive
Profile Bernie Vine
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 26 May 99
Posts: 9954
Credit: 103,452,613
RAC: 328
United Kingdom
Message 1483959 - Posted: 3 Mar 2014, 8:10:19 UTC - in response to Message 1483870.  

I re-installed Boinc and connected, and I have lost no tasks and had no errors.

When you reinstalled BOINC, did you have to reattach S@H or did it find your projects and project directories automatically?

He would have had to reattach because he had lost the following file:

starting with "Couldn't parse account file account_setiathome.berkeley.edu.xml", "Couldn't parse statistics_setiathome.berkeley.edu.xml", and "Project SETI@home is in state file but no account file found".

Had he restored them from another host he may not have had to do that, whether he would have lost his Wu's is another matter.

Claggy

Well, I guess the loss of that account file is the reason that I had to reattach, but I was wondering whether Bernie had to do the same thing when he reinstalled BOINC.

What could have caused that account file to disappear, anyway? (I'm guessing that it was a shutdown problem, but I'm not really sure.)


I just installed Boinc and reattached, I didn't copy any files form any other machines, just started from new. The machine reattached as it's old ID as I had kept everything the same, PC name, ip address and OS.
ID: 1483959 · Report as offensive
Profile Zombu2
Volunteer tester

Send message
Joined: 24 Feb 01
Posts: 1615
Credit: 49,315,423
RAC: 0
United States
Message 1484001 - Posted: 3 Mar 2014, 12:45:53 UTC

well i just had a hickup with one of my machines that came online yesterday night when it shredded the boinc folder i purged the boinc install and reinstalled it and now all my wu's are marked as abandoned
I came down with a bad case of i don't give a crap
ID: 1484001 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1484005 - Posted: 3 Mar 2014, 13:18:37 UTC - in response to Message 1484001.  
Last modified: 3 Mar 2014, 13:19:27 UTC

well i just had a hickup with one of my machines that came online yesterday night when it shredded the boinc folder i purged the boinc install and reinstalled it and now all my wu's are marked as abandoned

Which Boinc directory did it shred? There are two, on Windows they are Boinc program files directory, and the Boinc Data directory,

Claggy
ID: 1484005 · Report as offensive
Profile Zombu2
Volunteer tester

Send message
Joined: 24 Feb 01
Posts: 1615
Credit: 49,315,423
RAC: 0
United States
Message 1484168 - Posted: 3 Mar 2014, 20:23:59 UTC - in response to Message 1484005.  
Last modified: 3 Mar 2014, 20:25:04 UTC

well i just had a hickup with one of my machines that came online yesterday night when it shredded the boinc folder i purged the boinc install and reinstalled it and now all my wu's are marked as abandoned

Which Boinc directory did it shred? There are two, on Windows they are Boinc program files directory, and the Boinc Data directory,

Claggy


it was on my linux box intel i3 and /var/lib/boinc-client/projects/setiathome.berkeley.edu/

the files where not readable anymore and just showed $#%JG%$(^@^ or #_________$##$@

dunno what happened

that was about 2 hours or so after the client went online for the first time so i assume it was a glitch
I came down with a bad case of i don't give a crap
ID: 1484168 · Report as offensive

Message boards : Number crunching : BOINC lost attachment to S@H project


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.