More Spring Cleaning (May 31 2007)

Message boards : Technical News : More Spring Cleaning (May 31 2007)
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 1 Mar 99
Posts: 1444
Credit: 957,058
RAC: 0
United States
Message 579316 - Posted: 31 May 2007, 20:26:21 UTC

Wow. No real major crises today. Time was, about 10 years ago, when it was just me and Jeff and Dan crammed in a tiny office working on SERENDIP, dealing with server problems occupied about 5% of my time. The last 8 years it has been more like 99%.

So I got to catch up on some nagging tasks today. Worked on ravamping my stripchart code (which takesvarious system readings and alerts us when things are amiss) to ease the process of incorpating new servers. Cleaned up the lab in 329 - we have literal piles of retired/dead machines now. When Sun recently donated that new thumper server they also gave us a "parts" machine to upgrade jocelyn so I finally started looking into that. Worked a bit on a script to automate the new multibeam splitter process (whenever we're ready to start that up). Patched isaac's RAID firmware on the off chance that might fix its recent penchant for crashing - it didn't help, but running in a non-xen kernel seems to be a functional workaround. Fixed some broken web pages (donation page, the connecting client types page..). Discussed the next step in server closet upgrades with Jeff - he reminded me there's going to be a lab-wide power outage on Saturday, June 9th lasting all night. How convenient.

Oh.. I see the UOTD updates stopped working, too. Stuff breaks unexpectedly when you hastily retire servers like we've been doing recently...

- Matt

-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude
ID: 579316 · Report as offensive
W-K 666 Project Donor
Volunteer tester

Send message
Joined: 18 May 99
Posts: 19062
Credit: 40,757,560
RAC: 67
United Kingdom
Message 579348 - Posted: 31 May 2007, 21:27:02 UTC
Last modified: 31 May 2007, 21:27:35 UTC

Who is going to draw the short straw to close down gracefully on the Saturday and then boot up on the Sunday.
ID: 579348 · Report as offensive
Wander Saito
Volunteer tester

Send message
Joined: 7 Jul 03
Posts: 555
Credit: 2,136,061
RAC: 0
Brazil
Message 579495 - Posted: 1 Jun 2007, 1:53:12 UTC

Wow, looks like somebody had one very productive day :)

Btw, WinterKnight, nothing persnal, but I think can go over a weekend w/o WU and let the guys at UCB to have the weekend. Whats two days when we just braved almost 2 weeks of outage? ;)
I agree that recovering is always a bit messy, but c'mon, give these guys a break :)

Regards,
Wander

ID: 579495 · Report as offensive
Brian Silvers

Send message
Joined: 11 Jun 99
Posts: 1681
Credit: 492,052
RAC: 0
United States
Message 579531 - Posted: 1 Jun 2007, 2:48:52 UTC - in response to Message 579495.  


I agree that recovering is always a bit messy, but c'mon, give these guys a break :)


I think you misunderstood the intent of the question...

Shutting down "gracefully" as Andy mentioned is much better than just having the power go out in the midst of who knows how many disk/nfs operations... Of course there should be battery backups that handle shutdowns gracefully, but anyway... :)
ID: 579531 · Report as offensive
Bounce

Send message
Joined: 3 Apr 99
Posts: 66
Credit: 5,604,569
RAC: 0
United States
Message 579544 - Posted: 1 Jun 2007, 3:33:24 UTC

I move that they shut down gracefully at COB on Friday and restart on Monday; taking the weekend to drink excessively.

Do I hear a second?
ID: 579544 · Report as offensive
Profile popandbob
Volunteer tester

Send message
Joined: 19 Mar 05
Posts: 551
Credit: 4,673,015
RAC: 0
Canada
Message 579545 - Posted: 1 Jun 2007, 3:35:59 UTC - in response to Message 579544.  

I move that they shut down gracefully at COB on Friday and restart on Monday; taking the weekend to drink excessively.

Do I hear a second?


A second?

TICK TOCK TICK (there's 3 seconds!) jk

Give everything a rest over the weekend. It'll do you all good.

~BoB


Do you Good Search for Seti@Home? http://www.goodsearch.com/?charityid=888957
Or Good Shop? http://www.goodshop.com/?charityid=888957
ID: 579545 · Report as offensive
seti@elrcastor.com
Volunteer tester

Send message
Joined: 30 Jan 00
Posts: 35
Credit: 4,879,559
RAC: 0
United States
Message 579562 - Posted: 1 Jun 2007, 4:18:27 UTC - in response to Message 579545.  

I move that they shut down gracefully at COB on Friday and restart on Monday; taking the weekend to drink excessively.

Do I hear a second?


A second?

TICK TOCK TICK (there's 3 seconds!) jk

Give everything a rest over the weekend. It'll do you all good.

~BoB


and a third
ID: 579562 · Report as offensive
Profile KWSN - Chicken of Angnor
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 9 Jul 99
Posts: 1199
Credit: 6,615,780
RAC: 0
Austria
Message 579567 - Posted: 1 Jun 2007, 4:27:43 UTC
Last modified: 1 Jun 2007, 4:28:07 UTC

While the staff do deserve a break,

turning the servers off for 3 days will bring on another lengthy period of catch-up like the one that we just got over. So that may bring even more work in the future.

We could live without the servers for a few days - but could they live with us afterwards? ;o)

Regards,
Simon.
Donate to SETI@Home via PayPal!

Optimized SETI@Home apps + Information
ID: 579567 · Report as offensive
Profile [BOINCstats] Garindan

Send message
Joined: 19 Oct 99
Posts: 49
Credit: 335,515
RAC: 0
Netherlands
Message 579575 - Posted: 1 Jun 2007, 4:37:35 UTC

well, they wont be getting a hard time from me. Seeing the effort of the last few weeks.... errrrr months..... errr years... I think they deserve the rest and let them have the weekend.

As BOINC will be off line then on monday morning, I do advise them to do their backup and compression thing on monday morning before brining the project back up in stead of on monday evening. This saves one more recovery period....
Bas van Zuilen

ID: 579575 · Report as offensive
Profile Dr. C.E.T.I.
Avatar

Send message
Joined: 29 Feb 00
Posts: 16019
Credit: 794,685
RAC: 0
United States
Message 579589 - Posted: 1 Jun 2007, 4:58:16 UTC

ID: 579589 · Report as offensive
Profile KWSN THE Holy Hand Grenade!
Volunteer tester
Avatar

Send message
Joined: 20 Dec 05
Posts: 3187
Credit: 57,163,290
RAC: 0
United States
Message 579609 - Posted: 1 Jun 2007, 5:47:08 UTC - in response to Message 579567.  
Last modified: 1 Jun 2007, 5:48:07 UTC

While the staff do deserve a break,

turning the servers off for 3 days will bring on another lengthy period of catch-up like the one that we just got over. So that may bring even more work in the future.

We could live without the servers for a few days - but could they live with us afterwards? ;o)

Regards,
Simon.


... and possibly overload something else, causing it to break?!
.

Hello, from Albany, CA!...
ID: 579609 · Report as offensive
KB7RZF
Volunteer tester
Avatar

Send message
Joined: 15 Aug 99
Posts: 9549
Credit: 3,308,926
RAC: 2
United States
Message 579623 - Posted: 1 Jun 2007, 6:18:16 UTC

Thanks for the update again Matt. It is very much appriciated.

Jeremy
ID: 579623 · Report as offensive
Wander Saito
Volunteer tester

Send message
Joined: 7 Jul 03
Posts: 555
Credit: 2,136,061
RAC: 0
Brazil
Message 579736 - Posted: 1 Jun 2007, 13:03:04 UTC - in response to Message 579531.  


I agree that recovering is always a bit messy, but c'mon, give these guys a break :)


I think you misunderstood the intent of the question...

Shutting down "gracefully" as Andy mentioned is much better than just having the power go out in the midst of who knows how many disk/nfs operations... Of course there should be battery backups that handle shutdowns gracefully, but anyway... :)


Hi Brian,

I'm sorry, but I think I didn't expressed myself correctly. I was just suggesting that we shouldn't ask them to work over the weekend just to keep the servers up. As other people here also suggested, they could shutdown on Friday EOD and come back up on Monday. I also pointed out that recovering from any outage always put some strain when thousands of machines try to get some work.

What about a compromise? Shutdown on Friday, but come in on Sunday to put the system back up after the power outage. Just a suggestion.

Again sorry for the misunderstanding :)

Regards,
Wander
ID: 579736 · Report as offensive
Cristalfungus

Send message
Joined: 11 Apr 07
Posts: 3
Credit: 730
RAC: 0
United Kingdom
Message 579741 - Posted: 1 Jun 2007, 13:26:32 UTC

Seti scientists actually getting work done.

Wonders will never cease ;)

Glad you had a day to actually get things done, rather than fix serious problems.

Here's to more days like it.
ID: 579741 · Report as offensive
Profile Clyde C. Phillips, III

Send message
Joined: 2 Aug 00
Posts: 1851
Credit: 5,955,047
RAC: 0
United States
Message 579802 - Posted: 1 Jun 2007, 16:38:10 UTC

Yes, the staff needs a well-deserved break. It's been working awfully hard. I've increased the size of my cache to four days so that should bridge a three-day break.
ID: 579802 · Report as offensive

Message boards : Technical News : More Spring Cleaning (May 31 2007)


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.