**SETI MB OUTAGE-4/3/09**-CLOSED

Message boards : Number crunching : **SETI MB OUTAGE-4/3/09**-CLOSED

Blurf
Project Donor
Volunteer tester
Joined: 2 Sep 06
Posts: 8817
Credit: 9,641,609
RAC: 2,876
United States
Message 878924 - Posted: 24 Mar 2009, 21:39:18 UTC
Last modified: 24 Mar 2009, 21:39:41 UTC

This sticky is to inform users of an outage, starting 3/24/09, affecting the SETI MB workunits.

There is a problem with the Thumper machine, and according to Matt's post: "...don't expect SETI@home to be generating any new work or assimilating anything for a week. We'll at least try to keep Astropulse working during this time, so computers that can run Astropulse will be kept busy."

We'll post updates here whenever possible.




ID: 878924
Blurf
Project Donor
Volunteer tester
Joined: 2 Sep 06
Posts: 8817
Credit: 9,641,609
RAC: 2,876
United States
Message 878957 - Posted: 24 Mar 2009, 23:01:34 UTC

Update from Matt:

To help alleviate panic, here's the gist of our general plan:

1. get thumper back up and running with a three-way root mirror. If all goes well, this will be done enough sometime tomorrow (Wednesday), i.e. we'll have a two-way root mirror and let the third one sync up in the background while we bring the system up, then during next week's outage do more drive swapping to install grub/finish the resync on this third drive.

Splitting/assimilating will be completely off for all projects until thumper is back up.

2. as soon as thumper is back up (tomorrow?) we can turn splitting/assimilating on for AP and get to work on the pulse table reconfiguration (which we can only do if the system/database is up). The plan (in simplest terms) is: create new database chunks, copy the current pulse table to these new chunks, then drop the old table and rename the new one. We estimate at least 24 hours for that.
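
For readers unfamiliar with the copy, drop and rename pattern described in step 2, here is a minimal sketch. It is only an illustration, not the project's actual procedure: it uses Python's built-in `sqlite3` as a stand-in database, the table name `pulse` and its columns are hypothetical, and it does not model the "database chunks" (storage allocation) part of the plan at all.

```python
import sqlite3


def rebuild_table(conn, table, columns_sql):
    """Copy `table` into a fresh table, then swap it in under the old name.

    Returns the number of rows found in the new table after the copy.
    """
    new_table = table + "_new"  # hypothetical naming scheme
    cur = conn.cursor()
    # 1. Create the new, empty table (stand-in for the new storage).
    cur.execute(f"CREATE TABLE {new_table} ({columns_sql})")
    # 2. Copy every row from the current table into the new one.
    cur.execute(f"INSERT INTO {new_table} SELECT * FROM {table}")
    copied = cur.execute(f"SELECT COUNT(*) FROM {new_table}").fetchone()[0]
    # 3. Drop the old table and rename the new one into its place.
    cur.execute(f"DROP TABLE {table}")
    cur.execute(f"ALTER TABLE {new_table} RENAME TO {table}")
    conn.commit()
    return copied


def demo():
    """Build a tiny in-memory table, rebuild it, and report the row counts."""
    conn = sqlite3.connect(":memory:")
    schema = "id INTEGER PRIMARY KEY, power REAL"  # hypothetical columns
    conn.execute(f"CREATE TABLE pulse ({schema})")
    conn.executemany("INSERT INTO pulse (power) VALUES (?)",
                     [(1.5,), (2.5,), (4.0,)])
    conn.commit()
    copied = rebuild_table(conn, "pulse", schema)
    remaining = conn.execute("SELECT COUNT(*) FROM pulse").fetchone()[0]
    conn.close()
    return copied, remaining
```

On a very large table the copy in step 2 is where nearly all the time goes, which fits the estimate of at least 24 hours quoted above.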

So if we time things right we may be fully functional before the end of Thursday, maybe Friday. However, considering the lost time this morning and the usual unexpected hurdles that crop up... that's why I give it a week, if only to keep expectations realistic, yet leave room for potential pleasant surprises.

- Matt




ID: 878957
Bill Walker
Joined: 4 Sep 99
Posts: 3868
Credit: 2,697,159
RAC: 4
Canada
Message 878991 - Posted: 25 Mar 2009, 1:17:26 UTC - in response to Message 878957.  

Update from Matt:

.. that's why I give it a week if only to keep expectations realistic, yet leave room for potential pleasant surprises.

- Matt


A guiding premise from my days in flight test: plan for the worst, and then all your surprises are pleasant ones.

Got my fingers crossed, will be watching and hoping.

ID: 878991
zpm
Volunteer tester
Joined: 25 Apr 08
Posts: 284
Credit: 1,659,024
RAC: 0
United States
Message 879013 - Posted: 25 Mar 2009, 2:34:41 UTC - in response to Message 878991.  
Last modified: 25 Mar 2009, 2:35:04 UTC

My system got over 100 seti cuda today before it went nuts...

ID: 879013
Borgholio
Joined: 2 Aug 99
Posts: 653
Credit: 13,181,071
RAC: 3,805
United States
Message 879017 - Posted: 25 Mar 2009, 3:04:07 UTC - in response to Message 879013.  

My system got over 100 seti cuda today before it went nuts...



I'm gonna find out where you live and hijack your hard drive. All your CUDA are belong to us. Muwahahaha™


You will be assimilated...bunghole!

ID: 879017
Ianab
Volunteer tester
Joined: 11 Jun 08
Posts: 690
Credit: 16,372,103
RAC: 7,842
New Zealand
Message 879041 - Posted: 25 Mar 2009, 7:43:16 UTC

Thanks for the heads up.

Posting the info / explanation for the outage will hopefully avoid a lot of the stress, pulling of hair and gnashing of teeth that sometimes accompanies these outages.

Hopefully the rebuilds go well and things come back online quickly... weather is getting colder here and I need the warmth of the CPUs running at 100% ;-)

Ian

ID: 879041
zpm
Volunteer tester
Joined: 25 Apr 08
Posts: 284
Credit: 1,659,024
RAC: 0
United States
Message 879100 - Posted: 25 Mar 2009, 15:06:36 UTC - in response to Message 879041.  
Last modified: 25 Mar 2009, 15:09:32 UTC

Amazing, I hit update and got 2 more this morning. Get a CUDA device, as they pump out more heat than my quad-core does.

ID: 879100
Greg Hogan
Joined: 3 Mar 04
Posts: 33
Credit: 9,235,626
RAC: 0
New Zealand
Message 879346 - Posted: 26 Mar 2009, 7:59:56 UTC

Oh dear.. This is serious !!
With my Q6600, 8800GTS and the 750W PSU all sitting at 99% idle now, they have indeed dropped the ambient temp, and it has noticeably slowed down the fermentation rate of my homebrew.

I think I'll name this latest batch 'Thumper' in honour of the ensuing headache.
Cheers and Good luck on the fix guys.


ID: 879346
Virtual Boss*
Volunteer tester
Joined: 4 May 08
Posts: 417
Credit: 6,358,008
RAC: 216
Australia
Message 879353 - Posted: 26 Mar 2009, 9:33:45 UTC - in response to Message 879346.  

Oh dear.. This is serious !!
With my Q6600, 8800GTS and the 750W PSU all sitting at 99% idle now, they have indeed dropped the ambient temp, and it has noticeably slowed down the fermentation rate of my homebrew.

I think I'll name this latest batch 'Thumper' in honour of the ensuing headache.
....


ROFLMAO !!!!!!!!

ID: 879353
zoom314
Volunteer tester
Joined: 30 Nov 03
Posts: 56765
Credit: 40,748,586
RAC: 4,995
United States
Message 879436 - Posted: 26 Mar 2009, 15:59:48 UTC - in response to Message 879353.  

Oh dear.. This is serious !!
With my Q6600, 8800GTS and the 750W PSU all sitting at 99% idle now, they have indeed dropped the ambient temp, and it has noticeably slowed down the fermentation rate of my homebrew.

I think I'll name this latest batch 'Thumper' in honour of the ensuing headache.
....


ROFLMAO !!!!!!!!

I'm still in favor of Wabbit Stew.
Pluto is still a planet

Beep! Beep!

ID: 879436
Virtual Boss*
Volunteer tester
Joined: 4 May 08
Posts: 417
Credit: 6,358,008
RAC: 216
Australia
Message 879440 - Posted: 26 Mar 2009, 16:02:32 UTC - in response to Message 879436.  

I'm still in favor of Wabbit Stew.


But only a Wascally Wabbit ...HeHeHeHe

ID: 879440
Borgholio
Joined: 2 Aug 99
Posts: 653
Credit: 13,181,071
RAC: 3,805
United States
Message 879450 - Posted: 26 Mar 2009, 16:22:49 UTC - in response to Message 879436.  

I'm still in favor of Wabbit Stew.


But it's not Wabbit Season, it's Duck Season!




You will be assimilated...bunghole!

ID: 879450
Blurf
Project Donor
Volunteer tester
Joined: 2 Sep 06
Posts: 8817
Credit: 9,641,609
RAC: 2,876
United States
Message 879452 - Posted: 26 Mar 2009, 16:26:56 UTC

Please keep this thread related to the outage...thanks.




ID: 879452
Swibby Bear
Joined: 1 Aug 01
Posts: 246
Credit: 7,918,039
RAC: 866
United States
Message 879532 - Posted: 26 Mar 2009, 19:13:38 UTC

It appears that Matt or someone carelessly started the MB splitter on Lando, rather than the AP splitter. Hope it doesn't foul up the database while it is resyncing.

ID: 879532
Claggy
Project Donor
Volunteer tester
Joined: 5 Jul 99
Posts: 4623
Credit: 46,350,155
RAC: 2,946
United Kingdom
Message 879535 - Posted: 26 Mar 2009, 19:21:33 UTC - in response to Message 879532.  

It appears that Matt or someone carelessly started the MB splitter on Lando, rather than the AP splitter. Hope it doesn't foul up the database while it is resyncing.


See Matt's post here:

3/26/09-No new work thread

Claggy

ID: 879535
Fred J. Verster
Volunteer tester
Joined: 21 Apr 04
Posts: 3252
Credit: 31,903,643
RAC: 0
Netherlands
Message 879727 - Posted: 27 Mar 2009, 14:00:08 UTC - in response to Message 879535.  

This morning I received new MB WUs, together with 34 AP WUs. I won't run dry for a while ;)


ID: 879727
zoom314
Volunteer tester
Joined: 30 Nov 03
Posts: 56765
Credit: 40,748,586
RAC: 4,995
United States
Message 879742 - Posted: 27 Mar 2009, 15:16:49 UTC - in response to Message 879727.  

This morning I received new MB WUs, together with 34 AP WUs. I won't run dry for a while ;)

I managed to snag a dozen of the little things last night, but so far no luck today.
Pluto is still a planet

Beep! Beep!

ID: 879742
ajinbc
Joined: 15 Mar 06
Posts: 484
Credit: 318,444
RAC: 0
Canada
Message 879827 - Posted: 27 Mar 2009, 19:48:07 UTC

One of my crunchers has no work, the other has about 40 MB WU's..... go figure


ID: 879827
Bruce Streeter
Joined: 10 Jun 06
Posts: 33
Credit: 106,564
RAC: 0
Australia
Message 879922 - Posted: 28 Mar 2009, 2:28:58 UTC

I'm still not getting any WUs. I guess the server is still broken. :(


ID: 879922
Andy Williams
Volunteer tester
Joined: 11 May 01
Posts: 187
Credit: 112,464,820
RAC: 0
United States
Message 879925 - Posted: 28 Mar 2009, 2:34:00 UTC - in response to Message 879827.  

One of my crunchers has no work, the other has about 40 MB WU's..... go figure


It's downright weird. None of my CUDA machines have received MB WUs, but several of my non-CUDA machines have. Just luck? I guess so.
--
Classic 82353 WU / 400979 h

ID: 879925


 