Up (Dec 07 2010)


log in

Advanced search

Message boards : Technical News : Up (Dec 07 2010)

1 · 2 · 3 · 4 . . . 5 · Next
Author Message
Profile Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar
Send message
Joined: 1 Mar 99
Posts: 1389
Credit: 74,079
RAC: 0
United States
Message 1053670 - Posted: 7 Dec 2010, 23:57:54 UTC

Today was a "normal" Tuesday outage to back up the mysql database. You may have noted the result table sizes have dropped considerably since we turned on the "resend-lost-results." Hopefully this solved a lot of the ghost workunit problems people have been wondering about forever. If the database can handle it, no reason to leave that setting as is. A lot of people also noticed the server status page line "Results returned and awaiting validation" should really read "Results returned and awaiting validation as long as all the other back-end queues are zero." So most of the time this reads correctly, but if there's a large backlog somewhere this can be quite misleading. It's a painful query to get exactly what we want all the time, so fixing this is low priority.

Meanwhile, after the outage we started the splitters up (though there were some initial configuration snags that required a quick shut down and restart). Actual new work is being generated and sent.

So here we are.



Well, not so fast. I'd say we're "at the light at the end of the tunnel" as far as the public side is concerned, but there is still major cleanup on the inside before we're fully out of the tunnel. Some agenda items include:

1. Getting oscar up to speed: Right now it's operating pretty much as fast as thumper (which seems disappointing at first), though without using any CPU or disk i/o (which means it's able to do a LOT MORE if we tell it to). That's because informix is configured exactly as it was on thumper, so there are some artificial bottlenecks in place. We're collecting stats to understand what knobs to turn, and then we'll really crank them up.

2. Converting thumper to it's new role as internal file server: Remember that our main internal file server (which houses a bunch of important, heavy-random-access data and accounts) is as much of a crashy liability as mork was. So this conversion still needs to take place, but can happen over time while we're live.

3. Basic electrical stuff: Jeff and I tried to move as much around as possible, but there's still some server closet power issues to address.

4. All the tiny specks of sysadmin revolving around replacing old servers with new ones (dangling mounts, dead entries in /etc/hosts, zillions of scripts referring to now-defunct paths, etc.).

I'm also busy revving up the engine to start sending out the annual end-of-the-year news/funding drive mass e-mail. I know many of you already donated in some form or another (thank you!) but this sort of thing needs to happen. I apologize for any redundancy on this front.

- Matt

____________
-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude

Profile Anthony ColemanProject donor
Volunteer tester
Avatar
Send message
Joined: 16 May 99
Posts: 10
Credit: 2,007,731
RAC: 593
United Kingdom
Message 1053671 - Posted: 8 Dec 2010, 0:02:33 UTC

Thanks for the update. Nice to see things going again :-)
____________

ClaggyProject donor
Volunteer tester
Send message
Joined: 5 Jul 99
Posts: 4140
Credit: 33,525,968
RAC: 24,365
United Kingdom
Message 1053673 - Posted: 8 Dec 2010, 0:06:36 UTC - in response to Message 1053670.

Thanks for the update Matt, good to have thing running again properly,

Claggy

Profile soft^spirit
Avatar
Send message
Joined: 18 May 99
Posts: 6374
Credit: 28,631,148
RAC: 2
United States
Message 1053675 - Posted: 8 Dec 2010, 0:09:15 UTC - in response to Message 1053670.

Cut over complete, you earned a cookie. Of course there is clean up and spit polishing. But YAY!!!
____________

Janice

Profile Chance
Send message
Joined: 13 Dec 07
Posts: 2
Credit: 43,608
RAC: 0
United States
Message 1053678 - Posted: 8 Dec 2010, 0:18:33 UTC

Thats' great to hear. Got a new work unit today when I just decided to check to see if things were up and running. Now that they are I am up in the air riht now if I should contune to run Enstein@home and Seti@home at the same time. I think Enstien slows my computer down alot much more then Seti ever did. Must have bigger files over at Eistien.

Swibby Bear
Send message
Joined: 1 Aug 01
Posts: 236
Credit: 7,276,504
RAC: 3
United States
Message 1053680 - Posted: 8 Dec 2010, 0:20:34 UTC
Last modified: 8 Dec 2010, 0:21:58 UTC

Congratulations and thanks for all your (plural) hard work.

Thanks for reminding us about the work still needed to convert Thumper. It was easy to think you could all now go to Hawaii for vacation.

I guess this means that Ptolemy is still likely to crap out at some random moment, effectively halting the whole shebang, at least for a little while.

Good luck with that Ptolemy conversion to Thumper.

When you can, could you please speak to the new Raid that was installed on Oscar and Carolyn? I recall long, long ago that you were eager to change Raid-5 to Raid-10, or something like that. How did that go for you?

Thanks for everything. Swibby Bear (Whit)

bill
Send message
Joined: 16 Jun 99
Posts: 861
Credit: 23,960,653
RAC: 13,987
United States
Message 1053683 - Posted: 8 Dec 2010, 0:38:46 UTC - in response to Message 1053670.

Does this mean I won the pool? (Practicing his happy feet dance.)

Profile Robert M Swift
Send message
Joined: 6 Jul 05
Posts: 4
Credit: 1,294,837
RAC: 413
Spain
Message 1053699 - Posted: 8 Dec 2010, 1:17:28 UTC

Congratulations to you all. I hope you all have a chance to to relax a little.
Thank you for all your hard work and TIME.
Best Wishes to all.
____________
Bob Swift

Profile W4NY ADMINISTRATOR
Avatar
Send message
Joined: 15 May 99
Posts: 3
Credit: 49,550,538
RAC: 33,271
United States
Message 1053712 - Posted: 8 Dec 2010, 1:56:34 UTC - in response to Message 1053670.

Good job All, thanks for the update Matt. Lou W4NY...

Profile Gary CharpentierProject donor
Volunteer tester
Avatar
Send message
Joined: 25 Dec 00
Posts: 12721
Credit: 7,226,863
RAC: 16,175
United States
Message 1053724 - Posted: 8 Dec 2010, 2:33:32 UTC

Thanks for the update. Great work to everyone at the lab. Thanks to everyone who donated.


____________

Profile Francis Noel
Avatar
Send message
Joined: 30 Aug 05
Posts: 417
Credit: 58,863,568
RAC: 65,539
Canada
Message 1053733 - Posted: 8 Dec 2010, 2:57:51 UTC

"clap! clap! clap!"
____________
mambo

Profile Scarecrow
Avatar
Send message
Joined: 15 Jul 00
Posts: 4385
Credit: 459,193
RAC: 1
United States
Message 1053741 - Posted: 8 Dec 2010, 3:15:25 UTC - in response to Message 1053733.

"clap! clap! clap!"


"penicillin! penicillin! penicillin!"

Great job SETI gang! Not bad for a bunch of college boys!

Profile soft^spirit
Avatar
Send message
Joined: 18 May 99
Posts: 6374
Credit: 28,631,148
RAC: 2
United States
Message 1053754 - Posted: 8 Dec 2010, 3:58:05 UTC

wow.. finally got a batch, and they downloaded in record time. WTG!!!!!
____________

Janice

Blake Bonkofsky
Volunteer tester
Avatar
Send message
Joined: 29 Dec 99
Posts: 617
Credit: 46,332,781
RAC: 0
United States
Message 1053784 - Posted: 8 Dec 2010, 5:26:29 UTC

All 4 of my machines have varying amounts of work, slowly building. Only the GPU's have been able to return the results faster than they are coming in. It looks as if the splitters are *almost* able to max the link out, which might be a good thing. With 0 WU's waiting to be sent when the project went live again, the link didn't completely peg at ~90mb immediately. Sooo, that's good news for possible ghost avoidance and download speed :)
____________

Wedge009
Volunteer tester
Avatar
Send message
Joined: 3 Apr 99
Posts: 339
Credit: 148,671,365
RAC: 189,698
Australia
Message 1053787 - Posted: 8 Dec 2010, 5:53:59 UTC - in response to Message 1053670.
Last modified: 8 Dec 2010, 5:54:41 UTC

If the database can handle it, no reason to leave that setting as is.

Just to clarify... I hope you meant 'no reason to not leave [resend-lost-WUs] setting as is (switched on)'?

Congratulations to everyone involved for getting to this point. I'm not one of those lucky enough to get a steady flow of WUs, but this is great news all the same.
____________
Soli Deo Gloria

tbretProject donor
Volunteer tester
Avatar
Send message
Joined: 28 May 99
Posts: 2860
Credit: 215,138,743
RAC: 177,278
United States
Message 1053791 - Posted: 8 Dec 2010, 6:02:55 UTC - in response to Message 1053670.

Hip! Hip!
____________

Profile AngelaProject donor
Volunteer tester
Avatar
Send message
Joined: 16 Oct 07
Posts: 9107
Credit: 3,529,196
RAC: 6,724
United States
Message 1053794 - Posted: 8 Dec 2010, 6:09:00 UTC

Hooray!

Profile Merlin SyStems
Send message
Joined: 2 Oct 08
Posts: 8
Credit: 13,639,383
RAC: 0
Netherlands
Message 1053847 - Posted: 8 Dec 2010, 9:07:36 UTC - in response to Message 1053794.

thanks for all your hard work the couple of last weeks
lets crunch again


gr Jeroen

sleepy
Avatar
Send message
Joined: 21 May 99
Posts: 79
Credit: 23,194,242
RAC: 18,239
Italy
Message 1053851 - Posted: 8 Dec 2010, 9:55:14 UTC - in response to Message 1053670.

Thank you for the strong and successful commitment!

This is a day we will remember!

Now let's hope we are lucky and that the next WU we crunch will be carrying Christmas greetings from Aldebaran! ;-)

Sleepy
____________

Profile Konata Izumi
Volunteer tester
Avatar
Send message
Joined: 2 Jul 08
Posts: 162
Credit: 30,090,114
RAC: 707
United States
Message 1053869 - Posted: 8 Dec 2010, 11:35:49 UTC

I gots lots!!!

I can keep my apartment warm again!!

Thanks for all the hard work!!

Merry Crunching!
____________

1 · 2 · 3 · 4 . . . 5 · Next

Message boards : Technical News : Up (Dec 07 2010)

Copyright © 2014 University of California