Shorties estimate up from three minutes to six hours after today's outage!

Message boards : Number crunching : Shorties estimate up from three minutes to six hours after today's outage!
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 6 · 7 · 8 · 9

AuthorMessage
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1153847 - Posted: 19 Sep 2011, 12:07:31 UTC - in response to Message 1153843.  

And I wonder what this is going to do to all of the 'set and forget' hosts...
Folks that attached to the project on a whim and have long since forgotten it is running in the background. There are probably many thousands of them.

Might the project have to issue some kind of DCF reset command to coax them back into action? Is that even possible?

If they simply attached, they don't have a problem - none of this, thankfully, is affecting users of the stock application, since stage 2 of the bungle - fortunately - wasn't applied as scheduled on Friday.

But there are some 'set and forget' types who got as far as installing an optimised application (or having one installed for them) during the 'set' phase, and have now proceeded to 'forget' - we know that from the V12/Fermi hall of shame. Those are the ones that will be caught, unless the project itself manages the situation carefully.
ID: 1153847 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1153848 - Posted: 19 Sep 2011, 12:09:20 UTC - in response to Message 1153847.  
Last modified: 19 Sep 2011, 12:10:18 UTC

And I wonder what this is going to do to all of the 'set and forget' hosts...
Folks that attached to the project on a whim and have long since forgotten it is running in the background. There are probably many thousands of them.

Might the project have to issue some kind of DCF reset command to coax them back into action? Is that even possible?

If they simply attached, they don't have a problem - none of this, thankfully, is affecting users of the stock application, since stage 2 of the bungle - fortunately - wasn't applied as scheduled on Friday.

But there are some 'set and forget' types who got as far as installing an optimised application (or having one installed for them) during the 'set' phase, and have now proceeded to 'forget' - we know that from the V12/Fermi hall of shame. Those are the ones that will be caught, unless the project itself manages the situation carefully.

Of course, no matter how carefully the PROJECT may proceed, they are at the mercy and foibles of the Boinc code supplied.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1153848 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1153849 - Posted: 19 Sep 2011, 12:12:14 UTC - in response to Message 1153846.  

It's been such a long time.......
I don't recall if a reset will chuck out the baby with the bathwater...IE, the optimized apps and app_info file. A detach certainly does.

Ditto. I've got a couple of boxes with nothing cached and only a few tasks waiting to upload - I've set NNT on them (they're happy enough crunching Einstein and GPUGrid), so I don't get any surprises before I get a chance to experiment once the upload server is back.
ID: 1153849 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1153853 - Posted: 19 Sep 2011, 12:16:24 UTC - in response to Message 1153848.  

And I wonder what this is going to do to all of the 'set and forget' hosts...
Folks that attached to the project on a whim and have long since forgotten it is running in the background. There are probably many thousands of them.

Might the project have to issue some kind of DCF reset command to coax them back into action? Is that even possible?

If they simply attached, they don't have a problem - none of this, thankfully, is affecting users of the stock application, since stage 2 of the bungle - fortunately - wasn't applied as scheduled on Friday.

But there are some 'set and forget' types who got as far as installing an optimised application (or having one installed for them) during the 'set' phase, and have now proceeded to 'forget' - we know that from the V12/Fermi hall of shame. Those are the ones that will be caught, unless the project itself manages the situation carefully.

Of course, no matter how carefully the PROJECT may proceed, they are at the mercy and foibles of the Boinc code supplied.

Feel free to remind David that he is still listed as Director of SETI. For the purposes of the current discussion, the project and BOINC have to be thought of as (and actually behave as) members of the same team.
ID: 1153853 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1153854 - Posted: 19 Sep 2011, 12:22:55 UTC - in response to Message 1153853.  

And I wonder what this is going to do to all of the 'set and forget' hosts...
Folks that attached to the project on a whim and have long since forgotten it is running in the background. There are probably many thousands of them.

Might the project have to issue some kind of DCF reset command to coax them back into action? Is that even possible?

If they simply attached, they don't have a problem - none of this, thankfully, is affecting users of the stock application, since stage 2 of the bungle - fortunately - wasn't applied as scheduled on Friday.

But there are some 'set and forget' types who got as far as installing an optimised application (or having one installed for them) during the 'set' phase, and have now proceeded to 'forget' - we know that from the V12/Fermi hall of shame. Those are the ones that will be caught, unless the project itself manages the situation carefully.

Of course, no matter how carefully the PROJECT may proceed, they are at the mercy and foibles of the Boinc code supplied.

Feel free to remind David that he is still listed as Director of SETI. For the purposes of the current discussion, the project and BOINC have to be thought of as (and actually behave as) members of the same team.

Well, the team is waiting for the manager to quit telling them to walk or bunt and start hitting home runs again.

"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1153854 · Report as offensive
Profile James Sotherden
Avatar

Send message
Joined: 16 May 99
Posts: 10436
Credit: 110,373,059
RAC: 54
United States
Message 1153870 - Posted: 19 Sep 2011, 12:49:19 UTC

I run with a one day cache,Im also having trouble with shortys palying havoc on my GPU. Im going to drop to a 0.25 Cache and just ride out the storm.

If worst comes to worst I will set NNT and take a deep breath.
[/quote]

Old James
ID: 1153870 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1153873 - Posted: 19 Sep 2011, 12:55:05 UTC - in response to Message 1153845.  

Detaching/reattaching to the project physically deletes the setiathome project folder - so all application files and configuration setting are lost. But you usually get the same HostID, and host APR etc., back again. That's the worst possible outcome, even if you remembered to back up your applications so you can put them back again.

Resetting the project does set DCF to the default 1.0000 (and, I think, in general preserves optimised applications) - but I think the behaviour varies between different versions of BOINC - best to let someone else test it first.

The other thing that resetting the project certainly does it to throw away, without reporting, all current tasks - including those completed but not yet uploaded.

The best sequence, if you're adventurous enough to try, is:

NNT - crunch all cached tasks - upload - report - backup folder contents - reset - check folder contents are still intact - allow new work.

One little Extra to that, when you reset the project, if you had any Wu's left, copy the backed up Wu files back to your project folder before re-enabling work fetch, it'll save you having to download the Wu's again when they get resent,

Claggy
ID: 1153873 · Report as offensive
Terror Australis
Volunteer tester

Send message
Joined: 14 Feb 04
Posts: 1817
Credit: 262,693,308
RAC: 44
Australia
Message 1153887 - Posted: 19 Sep 2011, 13:59:12 UTC

Is this the record ?
A predicted time of 591 hours for a VLAR CPU unit (normal crunching time around 100 minutes). :-S

T.A.
ID: 1153887 · Report as offensive
Previous · 1 . . . 6 · 7 · 8 · 9

Message boards : Number crunching : Shorties estimate up from three minutes to six hours after today's outage!


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.