Time to Completion steadily increasing

Message boards : Number crunching : Time to Completion steadily increasing
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
Profile Bill Walker
Avatar

Send message
Joined: 4 Sep 99
Posts: 3868
Credit: 2,697,267
RAC: 0
Canada
Message 604001 - Posted: 15 Jul 2007, 13:53:01 UTC

My most recent SETI@Home task, work unit 140472475, seems to be behaving a little strangely. Since it started, the "time to completion" has gone up by roughly one second for every second of CPU used. It now sits at 29:51:02 CPU time used, 30:02:01 time to completion. It has been stuck at 24.668% complete for some hours now.

I also noticed this task ran continuously for at least 6 hours (maybe longer, I don't check my BOINC statistics regularly). My machine normally switches between SETI and Climateprediction at roughly one hour intervals. I rebooted my machine, and it came back with Climateprediction running.

This isn't very scientific, but based on viewing the graphics for this task, it appeared to be proceeding painfully sloooooowly.

Should I just kill this task, or can it be salvaged?

ID: 604001 · Report as offensive
Profile RandyC
Avatar

Send message
Joined: 20 Oct 99
Posts: 714
Credit: 1,704,345
RAC: 0
United States
Message 604003 - Posted: 15 Jul 2007, 13:57:32 UTC - in response to Message 604001.  

My most recent SETI@Home task, work unit 140472475, seems to be behaving a little strangely. Since it started, the "time to completion" has gone up by roughly one second for every second of CPU used. It now sits at 29:51:02 CPU time used, 30:02:01 time to completion. It has been stuck at 24.668% complete for some hours now.

I also noticed this task ran continuously for at least 6 hours (maybe longer, I don't check my BOINC statistics regularly). My machine normally switches between SETI and Climateprediction at roughly one hour intervals. I rebooted my machine, and it came back with Climateprediction running.

This isn't very scientific, but based on viewing the graphics for this task, it appeared to be proceeding painfully sloooooowly.

Should I just kill this task, or can it be salvaged?


Try restarting BOINC (reboot if necessary). If it doesn't make any progress after trying that, then abort it.
ID: 604003 · Report as offensive
Astro
Volunteer tester
Avatar

Send message
Joined: 16 Apr 02
Posts: 8026
Credit: 600,015
RAC: 0
Message 604005 - Posted: 15 Jul 2007, 13:59:29 UTC

You say it's still progressing, so it's not stuck, but something seems amiss. perhaps something is stopping boinc from running. Something like a preference setting (do work while in use NO, comes to mind). Have you tried a restart of the puter to free resources? Have you tried suspending and resuming the task? If else fails, just abort it and move on.
ID: 604005 · Report as offensive
Profile Bill Walker
Avatar

Send message
Joined: 4 Sep 99
Posts: 3868
Credit: 2,697,267
RAC: 0
Canada
Message 604020 - Posted: 15 Jul 2007, 14:17:06 UTC - in response to Message 604005.  

You say it's still progressing, so it's not stuck, but something seems amiss. perhaps something is stopping boinc from running. Something like a preference setting (do work while in use NO, comes to mind). Have you tried a restart of the puter to free resources? Have you tried suspending and resuming the task? If else fails, just abort it and move on.


Thanks all for the suggestions. I did reboot this morning, and Climatepredication has been running for about 3 hours straight now. Presumably this is to make up for the time SETI was "stuck". (both SETI and Climateprediction have equal resource shares on my machine). I'll let you know what happens when SETI restarts.
ID: 604020 · Report as offensive
Franz Bauer

Send message
Joined: 8 Feb 01
Posts: 127
Credit: 9,690,361
RAC: 0
Canada
Message 604025 - Posted: 15 Jul 2007, 14:26:07 UTC - in response to Message 604001.  

This isn't very scientific, but based on viewing the graphics for this task, it appeared to be proceeding painfully sloooooowly.

Should I just kill this task, or can it be salvaged?


Bill:

I noticed that you are running the standard Seti application. You may want to download and install one of Simon's Seti apps from the Chicken Coop. It should double your speed.

Franz
ID: 604025 · Report as offensive
Josef W. Segur
Volunteer developer
Volunteer tester

Send message
Joined: 30 Oct 99
Posts: 4504
Credit: 1,414,761
RAC: 0
United States
Message 604068 - Posted: 15 Jul 2007, 16:30:24 UTC - in response to Message 604001.  

...
It has been stuck at 24.668% complete for some hours now.
...

It was truly stuck and making no progress. The application recalculates its progress very frequently when running normally, even on a very slow system the fractional part should show changes each time BOINC Manager updates its display.

If ever I saw unchanging progress for 30 seconds I'd take action to kill the application and restart it from the last checkpoint. The advice to reboot of course accomplishes that for any combination of preferences and any platform.
                                                                Joe
ID: 604068 · Report as offensive
Profile Bill Walker
Avatar

Send message
Joined: 4 Sep 99
Posts: 3868
Credit: 2,697,267
RAC: 0
Canada
Message 604109 - Posted: 15 Jul 2007, 18:17:00 UTC - in response to Message 604068.  

[quoteIt was truly stuck and making no progress. The application recalculates its progress very frequently when running normally, even on a very slow system the fractional part should show changes each time BOINC Manager updates its display.

If ever I saw unchanging progress for 30 seconds I'd take action to kill the application and restart it from the last checkpoint. The advice to reboot of course accomplishes that for any combination of preferences and any platform.
                                                                Joe
[/quote]

I finally suspended Climateprediction, just to see what would happen. SETI restarted, ran normally for a few seconds (time to complete going down as CPU time went up), but then exited with an error:

7/15/2007 2:10:07 PM|SETI@home|Reason: Unrecoverable error for result 07my00aa.14716.3026.461070.3.229_2 ( - exit code -1073741819 (0xc0000005))

So, was the file I received corrupt in some way, or is there anything I need to do to keep SETI running on my machine?
ID: 604109 · Report as offensive
Profile Jim-R.
Volunteer tester
Avatar

Send message
Joined: 7 Feb 06
Posts: 1494
Credit: 194,148
RAC: 0
United States
Message 604159 - Posted: 15 Jul 2007, 19:51:36 UTC - in response to Message 604109.  

[quoteIt was truly stuck and making no progress. The application recalculates its progress very frequently when running normally, even on a very slow system the fractional part should show changes each time BOINC Manager updates its display.

If ever I saw unchanging progress for 30 seconds I'd take action to kill the application and restart it from the last checkpoint. The advice to reboot of course accomplishes that for any combination of preferences and any platform.
                                                                Joe


I finally suspended Climateprediction, just to see what would happen. SETI restarted, ran normally for a few seconds (time to complete going down as CPU time went up), but then exited with an error:

7/15/2007 2:10:07 PM|SETI@home|Reason: Unrecoverable error for result 07my00aa.14716.3026.461070.3.229_2 ( - exit code -1073741819 (0xc0000005))

So, was the file I received corrupt in some way, or is there anything I need to do to keep SETI running on my machine?
[/quote]
Don't know whether it applies in this case, but I think I remember seeing somewhere that this is related to Windows handling of the -9 overflow error. Many times (if I am correct) Windows hangs until you restart and then gives this error when you get a noisy work unit instead of exiting cleanly and giving the -9 error.
Jim

Some people plan their life out and look back at the wealth they've had.
Others live life day by day and look back at the wealth of experiences and enjoyment they've had.
ID: 604159 · Report as offensive
Astro
Volunteer tester
Avatar

Send message
Joined: 16 Apr 02
Posts: 8026
Credit: 600,015
RAC: 0
Message 604182 - Posted: 15 Jul 2007, 20:23:15 UTC

Yes, Jim, I looks like one of those 1 in 10,000 wus bugs where it goofs up and then on restart goes to error. Not much to do, but move on. Thankfully they aren't very frequent.

ID: 604182 · Report as offensive
QSilver

Send message
Joined: 26 May 99
Posts: 232
Credit: 6,452,764
RAC: 0
United States
Message 606465 - Posted: 20 Jul 2007, 13:46:24 UTC

What do we do when all cached units are showing excessive times to complete?

Starting about 3 days ago, the times to complete for the WUs in my cache went from 2-6 hours to 47-160 hours. The actual time to complete has gone up but not that much (2-4 hours went to 2.5-5.0 hours). The main problem with this situation is that my cache will drain down to 1-2 WUs (only one WU if it has an ETC of 160 hours).

This is ocurring on a Pentium M 1.8GHz with WinXP that gets re-booted daily.

TIA
QS
ID: 606465 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 606474 - Posted: 20 Jul 2007, 14:08:26 UTC - in response to Message 606465.  
Last modified: 20 Jul 2007, 14:11:15 UTC

What do we do when all cached units are showing excessive times to complete?

Starting about 3 days ago, the times to complete for the WUs in my cache went from 2-6 hours to 47-160 hours. The actual time to complete has gone up but not that much (2-4 hours went to 2.5-5.0 hours). The main problem with this situation is that my cache will drain down to 1-2 WUs (only one WU if it has an ETC of 160 hours).

This is ocurring on a Pentium M 1.8GHz with WinXP that gets re-booted daily.

TIA
QS

Stop BOINC and - with care and a text editor - find this line in your 'client_state.xml' file:
<project>
    <master_url>http://setiathome.berkeley.edu/</master_url>
...
    [b]<duration_correction_factor>[color=red]0.483604[/color]</duration_correction_factor>[/b]

You'll probably find the figure in red is in the tens, or even hundreds.

If so, change it. The nominal value is 1.000000 (six decimal places): it will often be lower, particularly if you run an optimised application. If you do, somewhere round 0.400000 or even 0.300000 will be appropriate.

Re-start BOINC. You may over- or under-fetch the first time, but the <duration_correction_factor> is self-correcting over time.
ID: 606474 · Report as offensive
QSilver

Send message
Joined: 26 May 99
Posts: 232
Credit: 6,452,764
RAC: 0
United States
Message 606477 - Posted: 20 Jul 2007, 14:22:18 UTC - in response to Message 606474.  

You'll probably find the figure in red is in the tens, or even hundreds.

If so, change it. The nominal value is 1.000000 (six decimal places): it will often be lower, particularly if you run an optimised application. If you do, somewhere round 0.400000 or even 0.300000 will be appropriate.

Re-start BOINC. You may over- or under-fetch the first time, but the <duration_correction_factor> is self-correcting over time.


My DCF seems about right for using Chicken Soup 2.2B:

    <duration_correction_factor>0.251810</duration_correction_factor>


QS
ID: 606477 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 606482 - Posted: 20 Jul 2007, 14:29:27 UTC

Then I'm stumped. Very odd.
ID: 606482 · Report as offensive
Astro
Volunteer tester
Avatar

Send message
Joined: 16 Apr 02
Posts: 8026
Credit: 600,015
RAC: 0
Message 606497 - Posted: 20 Jul 2007, 14:50:11 UTC
Last modified: 20 Jul 2007, 15:13:23 UTC

QSilver, what are your current benchmarks, and what did they used to be.

[edit]Wholey Crimini batman, look at these benchmarks from the website::::

Measured floating point speed 47.48 million ops/sec
Measured integer speed 92.71 million ops/sec


somethings gone whacky, Try manually rerunning benchmarks. your benchmarks should be roughly 1550/3200

First, check temps/cooling
then check ram

[second edit] the only time I've seen lower benchmarks is with my Pentium 60, heck my Celeron 500 is 409/673

ID: 606497 · Report as offensive
QSilver

Send message
Joined: 26 May 99
Posts: 232
Credit: 6,452,764
RAC: 0
United States
Message 606525 - Posted: 20 Jul 2007, 15:34:01 UTC

Just ran benchmarks, and they came in at 702/1849.

To be honest, I've never really paid attention to the results when benchmarks run on their own. However, my actual times to complete (while a little longer lately than historical) are definitely in what I consider to be their typical range. The Pentium-M's RAC is also at a typical value (i.e., has not taken a hit over the last 3-4 days since these long ETCs started showing up).

QS
ID: 606525 · Report as offensive
Astro
Volunteer tester
Avatar

Send message
Joined: 16 Apr 02
Posts: 8026
Credit: 600,015
RAC: 0
Message 606529 - Posted: 20 Jul 2007, 15:43:09 UTC

Still might wanna check temps. Here's some benchmarks from other Intel Pentium M 1.8Ghz machines:

1564.84 3204.84
1607.44 3298.38
1510.68 3160.84
1533.10 3139.66
1551.44 3217.64
1571.77 3238.79
1586.59 3242.01
1586.59 3242.01
1579.53 3292.57

Traditionally, when a thermal diode (or coolnquiet, etc) limits the freq to 1/2 it's normal value, you end up with benchmarks that are nearly 1/2 that of others.

tony


ID: 606529 · Report as offensive
Profile Jim-R.
Volunteer tester
Avatar

Send message
Joined: 7 Feb 06
Posts: 1494
Credit: 194,148
RAC: 0
United States
Message 606530 - Posted: 20 Jul 2007, 15:44:36 UTC - in response to Message 606525.  
Last modified: 20 Jul 2007, 15:45:23 UTC

Just ran benchmarks, and they came in at 702/1849.

To be honest, I've never really paid attention to the results when benchmarks run on their own. However, my actual times to complete (while a little longer lately than historical) are definitely in what I consider to be their typical range. The Pentium-M's RAC is also at a typical value (i.e., has not taken a hit over the last 3-4 days since these long ETCs started showing up).

QS

Yes, with the new apps and clients using FPOPS for establishing credit you won't notice any variation regardless of the benchmarks unless you happen to get paired with someone using an old client with low benchmarks! Now the benchmarks are only used to estimate the time to completion and thus the amount of work you can download at a time. It doesn't affect your RAC and credit.
Jim

Some people plan their life out and look back at the wealth they've had.
Others live life day by day and look back at the wealth of experiences and enjoyment they've had.
ID: 606530 · Report as offensive
Astro
Volunteer tester
Avatar

Send message
Joined: 16 Apr 02
Posts: 8026
Credit: 600,015
RAC: 0
Message 606533 - Posted: 20 Jul 2007, 15:47:39 UTC

just noticed it was a pentium M 1.86. I have no samples for that processor.
ID: 606533 · Report as offensive
QSilver

Send message
Joined: 26 May 99
Posts: 232
Credit: 6,452,764
RAC: 0
United States
Message 606549 - Posted: 20 Jul 2007, 16:55:29 UTC
Last modified: 20 Jul 2007, 16:56:16 UTC

Checking temps...

I installed SpeedFan 4.32. In the first 15 minutes, it showed the CPU temp stepping slowing up to 91C from 68, stepping slowly down to 69. It now heading back up again (currently at 84).

Again, performance doesn't appear to be impacted; it's only the ETCs that show excessive times.

QS

[edited to add missing word]
ID: 606549 · Report as offensive
Astro
Volunteer tester
Avatar

Send message
Joined: 16 Apr 02
Posts: 8026
Credit: 600,015
RAC: 0
Message 606550 - Posted: 20 Jul 2007, 16:59:06 UTC

My AMD's go into alarm at like 69C and Failure at 124C. I'm not sure what Pent M 1.86's are, or where to look. Anyone got a link?
ID: 606550 · Report as offensive
1 · 2 · Next

Message boards : Number crunching : Time to Completion steadily increasing


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.