Tasks hanging

Questions and Answers : Getting started : Tasks hanging
Message board moderation

To post messages, you must log in.

AuthorMessage
Archer

Send message
Joined: 30 May 99
Posts: 20
Credit: 1,106,422
RAC: 0
United States
Message 827931 - Posted: 7 Nov 2008, 19:19:43 UTC

Over the last couple of weeks I've had big problems with tasks not getting anywhere, and eventually timing out. About three-fourths of the tasks that BOINC tries to process (SETI@home_Enhanced 6.03) run, using a full CPU, but never show any progress percentage moving. The other quarter of the tasks progress as expected, and finish in a few hours. The ones which hang end up running about 4 days, and then abort themselves on a work unit timeout. I've started aborting the hung ones manually if they still show 0% progress after 30 minutes or so of CPU time. The ones that progress normally will be 10% or so after 30 minutes.

Running on WinXP SP3.

Any suggestions?

D

ID: 827931 · Report as offensive
Profile Gundolf Jahn

Send message
Joined: 19 Sep 00
Posts: 3184
Credit: 446,358
RAC: 0
Germany
Message 827975 - Posted: 7 Nov 2008, 20:56:25 UTC - in response to Message 827931.  

In your preferences, do you have set xx in Use at most xx percent of CPU time to less than 100%? If so, reset it to 100%. The "CPU throttling" is known to cause problems on some machines. If you get heat or response time problems when running on full load, set in your preferences On multiprocessors, use at most 1 processors (Enforced by version 5.10 and earlier) or On multiprocessors, use at most 50 % of the processors (Enforced by version 6.1+). That results in an over all usage of 50% for BOINC tasks and should prevent your computer from overheating.

Gruß,
Gundolf

Computer sind nicht alles im Leben. (Kleiner Scherz)

SETI@home classic workunits 3,758
SETI@home classic CPU time 66,520 hours
ID: 827975 · Report as offensive
Steve Zurlo

Send message
Joined: 14 Apr 00
Posts: 3
Credit: 2,033,162
RAC: 0
United States
Message 828909 - Posted: 10 Nov 2008, 13:31:45 UTC
Last modified: 10 Nov 2008, 13:32:01 UTC

I'm having the same issue one of my 2 XP Pro SP2 machines. Astropulse work units are not affected by this, however. They always progress correclty. But a couple of times a week when I check on this one XP machine I will see it working on a couple of SETI@home work units that have accumulated a ton of CPU time but are still at 00.000% progress. I have found that if I just close BOINC and restart it, that fixes it for the affected workunits.
All of my machines are set to use 100% of CPU time and 100% of CPUs.
The affected machine is an Athlon 64 X2 3800+ with 2 GB of RAM running the 32 bit version of XP Pro SP2.

Steve
ID: 828909 · Report as offensive
Archer

Send message
Joined: 30 May 99
Posts: 20
Credit: 1,106,422
RAC: 0
United States
Message 828995 - Posted: 10 Nov 2008, 17:54:44 UTC - in response to Message 827975.  

That's how I have it set: 100% of one cpu, out of the two on my machine.

Steve's suggestion of starting and restarting the boinc service seems to do the trick!!

D
ID: 828995 · Report as offensive
Profile Byron S Goodgame
Volunteer tester
Avatar

Send message
Joined: 16 Jan 06
Posts: 1145
Credit: 3,936,993
RAC: 0
United States
Message 829015 - Posted: 10 Nov 2008, 18:27:59 UTC - in response to Message 828995.  
Last modified: 10 Nov 2008, 18:43:27 UTC

That's how I have it set: 100% of one cpu, out of the two on my machine.

Steve's suggestion of starting and restarting the boinc service seems to do the trick!!

D

For you it could be the version of BOINC that you're using. 6.2.19 is the recomended version, but since Steve is using that version and his dual core is having the same problem, could be both those versions are a problem and I'm thinking maybe going back to 5.10.45 might be a better suggestion for you both.

I can't remember anyone with that version having this problem (could be wrong and maybe you've already tried 5.10.45 or someone has a heads up about this version regarding this problem) so it might be worth a try to finish up the WU's in your cache, set No New Tasks and give the 5.10.45 a shot. I've seen others that were able to restart them the way Steve suggests, but I've seen quite a few where that has become less and less effective as time goes on. Haven't seen any success stories from a detach or reset of project either, but I guess that could be an option too, unless you're ok with just stoping and restart BOINC when needed.
ID: 829015 · Report as offensive
Steve Zurlo

Send message
Joined: 14 Apr 00
Posts: 3
Credit: 2,033,162
RAC: 0
United States
Message 829859 - Posted: 13 Nov 2008, 15:23:30 UTC - in response to Message 829015.  
Last modified: 13 Nov 2008, 15:25:44 UTC


For you it could be the version of BOINC that you're using. 6.2.19 is the recomended version, but since Steve is using that version and his dual core is having the same problem, could be both those versions are a problem and I'm thinking maybe going back to 5.10.45 might be a better suggestion for you both.


It shows 6.2.19 in Help>About BOINC Manager, but on each line in the task view under "Application" it says "Seti@home Enhanced 6.03".
IOW, I guess the BOINC version is 6.2.19, but the seti@home WU processor is 6.03.
All of my computers are like this. Is this not normal?
ID: 829859 · Report as offensive
Profile Leaps-from-Shadows
Volunteer tester
Avatar

Send message
Joined: 11 Aug 08
Posts: 323
Credit: 259,220
RAC: 0
United States
Message 829861 - Posted: 13 Nov 2008, 15:37:49 UTC - in response to Message 829859.  

It shows 6.2.19 in Help>About BOINC Manager, but on each line in the task view under "Application" it says "Seti@home Enhanced 6.03".
IOW, I guess the BOINC version is 6.2.19, but the seti@home WU processor is 6.03.
All of my computers are like this. Is this not normal?

Yes, it is normal. The BOINC manager and SETI@home are two separate applications, with different version numbers. The Astropulse application is also separate.
Cruiser
Gateway GT5692 L-f-S Edition
-Phenom X4 9650 CPU
-4GB 667MHz DDR2 RAM
-500GB SATA HD
-Vista x64 SP1
-BOINC 6.2.19 32-bit client
-SSE3 optimized 32-bit apps
ID: 829861 · Report as offensive
Steve Zurlo

Send message
Joined: 14 Apr 00
Posts: 3
Credit: 2,033,162
RAC: 0
United States
Message 830937 - Posted: 15 Nov 2008, 21:05:40 UTC

Well, as suggested, I let the pending work units finish and then down-graded to 5.10.45. Under that version I was not able to get any work unit to advance. All WU just accumulated CPU time with no advancement in percentage complete. I tried starting and stopping the BOIC UI, the BOINC service, and rebooting the machine. No help. I went back to 6.2.19 and now I am back to the original issue. I have to start and stop BOINC frequently to get a new WU to register progress. Something else I have just had happen that makes this even worse, is that now I have an issue where one WU is advancing correctly and one is not, and restarting BOINC simply switches which one is working correctly. The one that was hung will start advancing and the one that was advacning will just accumulate CPU time with no change in progress %.
ID: 830937 · Report as offensive
Profile Byron S Goodgame
Volunteer tester
Avatar

Send message
Joined: 16 Jan 06
Posts: 1145
Credit: 3,936,993
RAC: 0
United States
Message 830961 - Posted: 15 Nov 2008, 23:23:48 UTC - in response to Message 830937.  
Last modified: 15 Nov 2008, 23:25:30 UTC

Well, as suggested, I let the pending work units finish and then down-graded to 5.10.45. Under that version I was not able to get any work unit to advance. All WU just accumulated CPU time with no advancement in percentage complete. I tried starting and stopping the BOIC UI, the BOINC service, and rebooting the machine. No help. I went back to 6.2.19 and now I am back to the original issue. I have to start and stop BOINC frequently to get a new WU to register progress. Something else I have just had happen that makes this even worse, is that now I have an issue where one WU is advancing correctly and one is not, and restarting BOINC simply switches which one is working correctly. The one that was hung will start advancing and the one that was advacning will just accumulate CPU time with no change in progress %.

Hi Steve, is it possible you could list your settings for processor usage regarding when computing is allowed, and disk and memory usage. Also, I'm curious what result you'd get if you set your cpu usage to one cpu
ID: 830961 · Report as offensive
Profile Byron S Goodgame
Volunteer tester
Avatar

Send message
Joined: 16 Jan 06
Posts: 1145
Credit: 3,936,993
RAC: 0
United States
Message 830997 - Posted: 16 Nov 2008, 1:34:09 UTC - in response to Message 830937.  

Well, as suggested, I let the pending work units finish and then down-graded to 5.10.45. Under that version I was not able to get any work unit to advance. All WU just accumulated CPU time with no advancement in percentage complete. I tried starting and stopping the BOIC UI, the BOINC service, and rebooting the machine. No help. I went back to 6.2.19 and now I am back to the original issue. I have to start and stop BOINC frequently to get a new WU to register progress. Something else I have just had happen that makes this even worse, is that now I have an issue where one WU is advancing correctly and one is not, and restarting BOINC simply switches which one is working correctly. The one that was hung will start advancing and the one that was advacning will just accumulate CPU time with no change in progress %.

Should also mention that in some other threads I've read, other people have had some success with similar problems by using opti apps. If it's something you'd be interested in trying for MB or AP, you should check out this message.
ID: 830997 · Report as offensive

Questions and Answers : Getting started : Tasks hanging


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.