**SETI MB OUTAGE-4/3/09**-CLOSED


Message boards : Number crunching : **SETI MB OUTAGE-4/3/09**-CLOSED

Blurf
Volunteer tester
Joined: 2 Sep 06
Posts: 7562
Credit: 6,898,270
RAC: 7,883
United States
Message 878924 - Posted: 24 Mar 2009, 21:39:18 UTC
Last modified: 24 Mar 2009, 21:39:41 UTC

This sticky is to inform users of an outage, starting 3/24/09, affecting the SETI MB workunits.

There is a problem with the Thumper machine and, according to Matt's post, "...don't expect SETI@home to be generating any new work or assimilating anything for a week. We'll at least try to keep Astropulse working during this time, so computers that can run Astropulse will be kept busy."

We'll post updates here whenever possible.
____________


Blurf
Volunteer tester
Joined: 2 Sep 06
Posts: 7562
Credit: 6,898,270
RAC: 7,883
United States
Message 878957 - Posted: 24 Mar 2009, 23:01:34 UTC

Update from Matt:

To help alleviate panic, here's the gist of our general plan:

1. get thumper back up and running with a three-way root mirror. If all goes well, this will be done enough sometime tomorrow (Wednesday), i.e. we'll have a two-way root mirror and let the third one sync up in the background while we bring the system up, then during next week's outage do more drive swapping to install grub/finish the resync on this third drive.

Splitting/assimilating will be completely off for all projects until thumper is back up.

2. as soon as thumper is back up (tomorrow?) we can turn splitting/assimilating on for AP and get to work on the pulse table reconfiguration (which we can only do if the system/database is up). The plan (in simplest terms) is: create new database chunks, copy the current pulse table to these new chunks, then drop the old table and rename the new one. We estimate at least 24 hours for that.

So if we time things right we may be fully functional before the end of Thursday, maybe Friday. However, considering the lost time this morning and the usual unexpected hurdles that crop up... that's why I give it a week, if only to keep expectations realistic, yet leave room for potential pleasant surprises.

- Matt
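
For readers curious what the "copy, drop, rename" step in Matt's plan looks like in practice, here is a minimal sketch. It is not the project's actual procedure: it uses Python's built-in `sqlite3` purely as a stand-in for the real database, and the table name `pulse`, the `_new` suffix, and the function `rebuild_table` are all hypothetical names chosen for illustration.

```python
import sqlite3


def rebuild_table(conn: sqlite3.Connection, table: str = "pulse") -> int:
    """Copy `table` into a fresh table, then drop the old one and rename.

    Illustrative only: `table` must be a trusted identifier, since it is
    interpolated directly into the SQL text.
    """
    new = f"{table}_new"  # hypothetical name for the replacement table
    cur = conn.cursor()

    # 1. Create an empty table with the same columns as the original.
    cur.execute(f"CREATE TABLE {new} AS SELECT * FROM {table} WHERE 0")

    # 2. Copy every row from the current table into the new one.
    cur.execute(f"INSERT INTO {new} SELECT * FROM {table}")

    # 3. Check that the copy is complete before destroying anything.
    old_count = cur.execute(f"SELECT COUNT(*) FROM {table}").fetchone()[0]
    new_count = cur.execute(f"SELECT COUNT(*) FROM {new}").fetchone()[0]
    if old_count != new_count:
        conn.rollback()  # leave the original table untouched
        raise RuntimeError(f"copy incomplete: {new_count} of {old_count} rows")

    # 4. Drop the old table and give the new one the original name.
    cur.execute(f"DROP TABLE {table}")
    cur.execute(f"ALTER TABLE {new} RENAME TO {table}")
    conn.commit()
    return new_count
```

On a table of real size the copy in step 2 is the slow part, which is presumably where most of the estimated 24 hours would go.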

____________


Bill Walker
Joined: 4 Sep 99
Posts: 3374
Credit: 2,104,857
RAC: 2,532
Canada
Message 878991 - Posted: 25 Mar 2009, 1:17:26 UTC - in response to Message 878957.

Update from Matt:

.. that's why I give it a week if only to keep expectations realistic, yet leave room for potential pleasant surprises.

- Matt


A guiding premise from my days in flight test: plan for the worst, and then all your surprises are pleasant ones.

Got my fingers crossed, will be watching and hoping.
____________

zpm
Volunteer tester
Joined: 25 Apr 08
Posts: 284
Credit: 1,598,656
RAC: 950
United States
Message 879013 - Posted: 25 Mar 2009, 2:34:41 UTC - in response to Message 878991.
Last modified: 25 Mar 2009, 2:35:04 UTC

My system got over 100 seti cuda today before it went nuts...

Borgholio
Joined: 2 Aug 99
Posts: 651
Credit: 12,063,390
RAC: 3,321
United States
Message 879017 - Posted: 25 Mar 2009, 3:04:07 UTC - in response to Message 879013.

My system got over 100 seti cuda today before it went nuts...



I'm gonna find out where you live and hijack your hard drive. All your CUDA are belong to us. Muwahahaha™
____________


You will be assimilated...bunghole!

Ianab
Volunteer tester
Joined: 11 Jun 08
Posts: 667
Credit: 12,508,883
RAC: 9,052
New Zealand
Message 879041 - Posted: 25 Mar 2009, 7:43:16 UTC

Thanks for the heads up.

Posting the info / explanation for the outage will hopefully avoid a lot of the stress, pulling of hair and gnashing of teeth that sometimes accompanies these outages.

Hopefully the rebuilds go well and things come back online quickly... weather is getting colder here and I need the warmth of the CPUs running at 100% ;-)

Ian

zpm
Volunteer tester
Joined: 25 Apr 08
Posts: 284
Credit: 1,598,656
RAC: 950
United States
Message 879100 - Posted: 25 Mar 2009, 15:06:36 UTC - in response to Message 879041.
Last modified: 25 Mar 2009, 15:09:32 UTC

Amazing, I hit update and I got 2 more this morning. Get a CUDA device, as they pump out more heat than my quad-core does.

Greg Hogan
Joined: 3 Mar 04
Posts: 33
Credit: 8,486,346
RAC: 0
New Zealand
Message 879346 - Posted: 26 Mar 2009, 7:59:56 UTC

Oh dear.. This is serious !!
With my Q6600, 8800GTS and the 750W PSU all sitting at 99% idle now, they have indeed dropped the ambient temp, and it has noticeably slowed down the fermentation rate of my homebrew.

I think I'll name this latest batch 'Thumper' in honour of the ensuing headache.
Cheers and Good luck on the fix guys.
____________

Virtual Boss*
Volunteer tester
Joined: 4 May 08
Posts: 417
Credit: 6,189,307
RAC: 703
Australia
Message 879353 - Posted: 26 Mar 2009, 9:33:45 UTC - in response to Message 879346.

Oh dear.. This is serious !!
With my Q6600, 8800GTS and the 750W PSU all sitting at 99% idle now, they have indeed dropped the ambient temp, and it has noticeably slowed down the fermentation rate of my homebrew.

I think I'll name this latest batch 'Thumper' in honour of the ensuing headache.
....


ROFLMAO !!!!!!!!

zoom314
Project donor
Joined: 30 Nov 03
Posts: 46376
Credit: 36,738,865
RAC: 5,106
United States
Message 879436 - Posted: 26 Mar 2009, 15:59:48 UTC - in response to Message 879353.

Oh dear.. This is serious !!
With my Q6600, 8800GTS and the 750W PSU all sitting at 99% idle now, they have indeed dropped the ambient temp, and it has noticeably slowed down the fermentation rate of my homebrew.

I think I'll name this latest batch 'Thumper' in honour of the ensuing headache.
....


ROFLMAO !!!!!!!!

I'm still in favor of Wabbit Stew.
____________
My Facebook, War Commander, 2015

Virtual Boss*
Volunteer tester
Joined: 4 May 08
Posts: 417
Credit: 6,189,307
RAC: 703
Australia
Message 879440 - Posted: 26 Mar 2009, 16:02:32 UTC - in response to Message 879436.

I'm still in favor of Wabbit Stew.


But only a Wascally Wabbit ...HeHeHeHe

Borgholio
Joined: 2 Aug 99
Posts: 651
Credit: 12,063,390
RAC: 3,321
United States
Message 879450 - Posted: 26 Mar 2009, 16:22:49 UTC - in response to Message 879436.

I'm still in favor of Wabbit Stew.


But it's not Wabbit Season, it's Duck Season!
____________


You will be assimilated...bunghole!

Blurf
Volunteer tester
Joined: 2 Sep 06
Posts: 7562
Credit: 6,898,270
RAC: 7,883
United States
Message 879452 - Posted: 26 Mar 2009, 16:26:56 UTC

Please keep this thread related to the outage...thanks.
____________


Swibby Bear
Joined: 1 Aug 01
Posts: 236
Credit: 7,276,504
RAC: 48
United States
Message 879532 - Posted: 26 Mar 2009, 19:13:38 UTC

It appears that Matt or someone carelessly started the MB splitter on Lando, rather than the AP splitter. Hope it doesn't foul up the database while it is resyncing.

Claggy
Project donor
Volunteer tester
Joined: 5 Jul 99
Posts: 4101
Credit: 33,140,210
RAC: 8,797
United Kingdom
Message 879535 - Posted: 26 Mar 2009, 19:21:33 UTC - in response to Message 879532.

It appears that Matt or someone carelessly started the MB splitter on Lando, rather than the AP splitter. Hope it doesn't foul up the database while it is resyncing.


See Matt's post here:

3/26/09-No new work thread

Claggy

Fred J. Verster
Volunteer tester
Joined: 21 Apr 04
Posts: 3246
Credit: 31,802,079
RAC: 3,468
Netherlands
Message 879727 - Posted: 27 Mar 2009, 14:00:08 UTC - in response to Message 879535.

This morning I received new MB WUs, together with 34 AP WUs; I won't run dry for a while ;)

____________

zoom314
Project donor
Joined: 30 Nov 03
Posts: 46376
Credit: 36,738,865
RAC: 5,106
United States
Message 879742 - Posted: 27 Mar 2009, 15:16:49 UTC - in response to Message 879727.

This morning I received new MB WUs, together with 34 AP WUs; I won't run dry for a while ;)

I managed to snag a dozen of the little things last night, but so far no luck today.
____________
My Facebook, War Commander, 2015

ajinbc
Joined: 15 Mar 06
Posts: 484
Credit: 318,444
RAC: 0
Canada
Message 879827 - Posted: 27 Mar 2009, 19:48:07 UTC

One of my crunchers has no work, the other has about 40 MB WU's..... go figure
____________

CORE2QUAD Q6600
Joined: 10 Jun 06
Posts: 33
Credit: 65,892
RAC: 0
Australia
Message 879922 - Posted: 28 Mar 2009, 2:28:58 UTC

I'm still not getting any WUs. I guess the server is still broken. :(
____________

Andy Williams
Volunteer tester
Joined: 11 May 01
Posts: 187
Credit: 112,464,820
RAC: 0
United States
Message 879925 - Posted: 28 Mar 2009, 2:34:00 UTC - in response to Message 879827.

One of my crunchers has no work, the other has about 40 MB WU's..... go figure


It's downright weird. None of my CUDA machines have received MB WUs, but several of my non-CUDA machines have. Just luck? I guess so.
____________
--
Classic 82353 WU / 400979 h


Copyright © 2014 University of California