Reseting BOINC

Message boards : Number crunching : Reseting BOINC
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile Bill G Special Project $75 donor
Avatar

Send message
Joined: 1 Jun 01
Posts: 1282
Credit: 187,688,550
RAC: 182
United States
Message 1331107 - Posted: 25 Jan 2013, 11:26:42 UTC
Last modified: 25 Jan 2013, 11:28:57 UTC

I have started getting a lot of "exited with zero status but no "finished" file" messages and it keeps telling me to Reset BOINC.
What does that mean exactly? I have closed it and restarted it, and also rebooted the computer that is happening on but it is still happening. I had a total power failure in the computer room yesterday, a UPS was found to be tripping the GFI outlet in the room.
The interesting thing is that they do not seem to log an errored WU.

SETI@home classic workunits 4,019
SETI@home classic CPU time 34,348 hours
ID: 1331107 · Report as offensive
Profile BilBg
Volunteer tester
Avatar

Send message
Joined: 27 May 07
Posts: 3720
Credit: 9,385,827
RAC: 0
Bulgaria
Message 1331223 - Posted: 25 Jan 2013, 18:16:33 UTC - in response to Message 1331107.  
Last modified: 25 Jan 2013, 18:20:38 UTC

I have started getting a lot of "exited with zero status but no "finished" file" messages and it keeps telling me to Reset BOINC.

The message is:
"If this happens repeatedly you may need to reset the project."

Don't do that (the reset) - it will not help.
This message is intended for stock users in case they received broken .exe file (e.g. partial download)

"exited with zero status but no 'finished' file" may be caused by:
- heavy load on the CPU/HDD by some other program (e.g. antivirus scheduled scan)
- realtime antivirus protection locking files for too long (while checking them 'on-the-fly')
- computer clock fluctuating back/forth in time (by more than 30 s)

("exited with zero status but no 'finished' file" does not cause the tasks to error, they restart from the checkpoint)


 


- ALF - "Find out what you don't do well ..... then don't do it!" :)
 
ID: 1331223 · Report as offensive
Profile Bill G Special Project $75 donor
Avatar

Send message
Joined: 1 Jun 01
Posts: 1282
Credit: 187,688,550
RAC: 182
United States
Message 1331605 - Posted: 26 Jan 2013, 16:50:44 UTC - in response to Message 1331223.  
Last modified: 26 Jan 2013, 16:52:12 UTC

I have started getting a lot of "exited with zero status but no "finished" file" messages and it keeps telling me to Reset BOINC.

The message is:
"If this happens repeatedly you may need to reset the project."

Don't do that (the reset) - it will not help.
This message is intended for stock users in case they received broken .exe file (e.g. partial download)

"exited with zero status but no 'finished' file" may be caused by:
- heavy load on the CPU/HDD by some other program (e.g. antivirus scheduled scan)
- realtime antivirus protection locking files for too long (while checking them 'on-the-fly')
- computer clock fluctuating back/forth in time (by more than 30 s)

("exited with zero status but no 'finished' file" does not cause the tasks to error, they restart from the checkpoint)



Thanks, and that is exactly what is happeinging. Something is going wrong with my WHS Connector software on this computer and it causes the normally 100% use on each core to drop to near zero. I did not associate the two happenings.

As for the misquote, that is what happens when you can not cut and paste between computers (unless I were to pay for LogMeIn).

I am going to have to try a complete system restore as the restore of the Connector software fails with a missing file message. As long as I do not try to backup this computer to the server, all runs as it should. Again, right on and Thanks.

SETI@home classic workunits 4,019
SETI@home classic CPU time 34,348 hours
ID: 1331605 · Report as offensive
Profile BilBg
Volunteer tester
Avatar

Send message
Joined: 27 May 07
Posts: 3720
Credit: 9,385,827
RAC: 0
Bulgaria
Message 1331716 - Posted: 26 Jan 2013, 21:57:36 UTC - in response to Message 1331605.  
Last modified: 26 Jan 2013, 22:19:30 UTC

Something is going wrong with my WHS Connector software ...

If that causes lost net connection - yes, I think I remember something about 'bug' in BOINC (or in libcurl.dll ?)
- boinc.exe uses synchronous DNS and waits 'forever' for response from DNS server so boinc.exe do not communicate with the apps for too long and they exit.


can not cut and paste between computers (unless I were to pay for LogMeIn)

TeamViewer allow this in free version - there is option (Advanced - Clipboard synchronization)
http://www.teamviewer.com/


 


- ALF - "Find out what you don't do well ..... then don't do it!" :)
 
ID: 1331716 · Report as offensive
Profile Bill G Special Project $75 donor
Avatar

Send message
Joined: 1 Jun 01
Posts: 1282
Credit: 187,688,550
RAC: 182
United States
Message 1331863 - Posted: 27 Jan 2013, 3:09:08 UTC - in response to Message 1331716.  

I will keep that program in mind but so far I am happy with what I have now.

Just finished running a ckdisk and there were a number of corrections. Everything seems to be running so far.


SETI@home classic workunits 4,019
SETI@home classic CPU time 34,348 hours
ID: 1331863 · Report as offensive
parl

Send message
Joined: 22 May 04
Posts: 95
Credit: 4,476,976
RAC: 0
United States
Message 1332435 - Posted: 29 Jan 2013, 6:32:06 UTC

I'm currently having the task restart problem as well. I think what I'll do for a start is (a) suspend computation when the virus scanning is taking place (around 4 AM) and (b) also tell my backup program (Carbonite) not to back up any of my SETI files.

BTW, the zero exit is always preceded by:

1/28/2013 12:00:12 PM | | System clock was turned backwards; clearing timeouts
1/28/2013 12:00:13 PM | SETI@home | Task 29my12ad.7687.13731.6.10.154_0 exited with zero status but no 'finished' file
1/28/2013 12:00:13 PM | SETI@home | If this happens repeatedly you may need to reset the project.
1/28/2013 12:00:13 PM | SETI@home | Task 29my12ad.7687.13731.6.10.159_0 exited with zero status but no 'finished' file
1/28/2013 12:00:13 PM | SETI@home | If this happens repeatedly you may need to reset the project.
1/28/2013 12:00:13 PM | SETI@home | Restarting task 29my12ad.7687.13731.6.10.154_0 using setiathome_enhanced version 603 in slot 1
1/28/2013 12:00:13 PM | SETI@home | Restarting task 29my12ad.7687.13731.6.10.159_0 using setiathome_enhanced version 603 in slot 0

I'm not sure if that is significant or not. Anyway, my other machine is running Ubuntu 12.04 LTS, so it has no virus checking nor does it have automated backup (Carbonite does not work on Linux).

I though only Superman could turn time back. And apparently not even Cher can do it:

http://www.youtube.com/watch?v=mEszTzdUMcY

Ross
ID: 1332435 · Report as offensive
Profile ivan
Volunteer tester
Avatar

Send message
Joined: 5 Mar 01
Posts: 783
Credit: 348,560,338
RAC: 223
United Kingdom
Message 1332456 - Posted: 29 Jan 2013, 8:37:00 UTC - in response to Message 1332435.  

I'm currently having the task restart problem as well. I think what I'll do for a start is (a) suspend computation when the virus scanning is taking place (around 4 AM) and (b) also tell my backup program (Carbonite) not to back up any of my SETI files.

BTW, the zero exit is always preceded by:

1/28/2013 12:00:12 PM | | System clock was turned backwards; clearing timeouts

I presume this is your Windows system. This can happen when your on-board clock doesn't keep very close time and is adjusted from the network infrequently enough that the correction becomes large. First check in Control Panel that your clock is set to synchronise to "internet time", and set your preferred server to an NTP server close to you. Then check and optionally change the Registry entry HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Services\W32Time\TimeProviders\NtpClient\SpecialPollInterval using regedit -- the default is 604800 (seconds, i.e. a week), I have mine set to 14400 (four hours).
ID: 1332456 · Report as offensive
parl

Send message
Joined: 22 May 04
Posts: 95
Credit: 4,476,976
RAC: 0
United States
Message 1333303 - Posted: 31 Jan 2013, 21:52:26 UTC

OK. I've done that. I'll check back in a couple of days to see how it's gone. BTW, my Ubuntu 12.04 LTS box is having similar problems. How do I check on doing NTP there?
ID: 1333303 · Report as offensive
Profile ivan
Volunteer tester
Avatar

Send message
Joined: 5 Mar 01
Posts: 783
Credit: 348,560,338
RAC: 223
United Kingdom
Message 1333316 - Posted: 31 Jan 2013, 22:28:45 UTC - in response to Message 1333303.  

OK. I've done that. I'll check back in a couple of days to see how it's gone. BTW, my Ubuntu 12.04 LTS box is having similar problems. How do I check on doing NTP there?

Look up ntp and ntpd (e.g.here) -- for me it usually "just works". On my Ubuntu server there's a control programme in /etc/init.d/ntp which has a note that it runs by default in run-levels 2, 3, 4, and 5. Information on what servers to poll is held in /etc/ntp.conf. The command 'ntpq -p' will show what servers you are currently synchronised with (if any). The default polling interval appears to be 1024 seconds, but can be controlled by minpoll and maxpoll parameters in /etc/ntp.conf.
ID: 1333316 · Report as offensive
SockGap

Send message
Joined: 16 Apr 07
Posts: 14
Credit: 7,700,416
RAC: 0
Australia
Message 1333403 - Posted: 1 Feb 2013, 3:34:50 UTC - in response to Message 1333303.  

You may also need to allow the connection on your firewall. NTP uses UDP port 123. So you would have to allow an outbound connection from your LAN to "any" address on the WAN/internet on UDP port 123.

Cheers
Jeff
ID: 1333403 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1333554 - Posted: 1 Feb 2013, 15:43:30 UTC - in response to Message 1333403.  
Last modified: 1 Feb 2013, 15:44:13 UTC

You may also need to allow the connection on your firewall. NTP uses UDP port 123. So you would have to allow an outbound connection from your LAN to "any" address on the WAN/internet on UDP port 123.

Cheers
Jeff

Another option may be to have the machine send its NTP request to the router. Which would be configured to sync its time form the internet. This is what I do at home.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1333554 · Report as offensive
parl

Send message
Joined: 22 May 04
Posts: 95
Credit: 4,476,976
RAC: 0
United States
Message 1333745 - Posted: 2 Feb 2013, 1:42:31 UTC

I have a 2-Wire DSL "modem" from AT&T. I'm not sure it has all of those sophisticated features. I'll see if I can find a manual for it on the 'net.
ID: 1333745 · Report as offensive

Message boards : Number crunching : Reseting BOINC


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.