Hemiola (Aug 12 2010)

Message boards : Technical News : Hemiola (Aug 12 2010)
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
Profile Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 1 Mar 99
Posts: 1444
Credit: 957,058
RAC: 0
United States
Message 1024344 - Posted: 12 Aug 2010, 20:58:48 UTC

Wrapping up the weekly "extended outage." Jeff's actually out today, but will be back to turn the servers on tomorrow (i.e. Friday, when I'm usually out).

I finally got around to testing a drive on mork (the mysql server) that the RAID card deemed "failed" at some point, but maybe that was a transient problem as it seems fine now. Nevertheless I went through the rigamarole of pulling that drive, putting a new on in, testing it, making it a new hot spare, etc.

That's all good, but the week in general has been tainted by mork issues in general. It had one of its regular mystery crashes on Tuesday (followed by a long recovery). Then last night, and again this morning, the RAID mirror of two solid state drives (where we keep the innodb logs) started going flakey on us. The partition would just disappear, sending mysql into fits. We were able to quickly recover, but we're abandoning the solid state drives for now. Honestly, they weren't adding all that much to the i/o picture because we were cautious about how we were implementing them. Now I'm glad we were cautious. The upshot of all the above meant that we had to recovery the replica as many as four times so far from the weekly backup. What a pain. The latest replica recovery is happening as I type this. All I hope is that all systems are normal and stable by tomorrow.

Everything else is fine. In fact, more than fine as a set of very generous participants donated $6000 towards a new server that will become the new science database server. THANK YOU!! We're still spec'ing out said server, but will go ahead sooner than later now that we don't have to set up a funding drive!

Meanwhile I'm still chipping away at various data analysis projects, Jeff's been fighting with data syncronization issues that have been creeping in more and more lately. We also had a "design meeting" regarding where to go with the public involvement of candidate selection. I'm finding some plug-n-play visualization utilities on line, but pretty much I'm finding (like always) it might just be easier and better if I do it all myself with tools I already know. However, some improvements go beyond that scope, so I'm digging into AJAX which is good stuff to know, I guess.

- Matt

-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude
ID: 1024344 · Report as offensive
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 1024346 - Posted: 12 Aug 2010, 21:07:11 UTC - in response to Message 1024344.  

Shouldn't you name that new server after the benefactors? Or is MRJHJT too difficult to pronounce in the office? ;-)
ID: 1024346 · Report as offensive
Scarecrow

Send message
Joined: 15 Jul 00
Posts: 4520
Credit: 486,601
RAC: 0
United States
Message 1024348 - Posted: 12 Aug 2010, 21:09:07 UTC - in response to Message 1024346.  

MRJHJT

Oddly enough, that's the noise a solid state drive makes when it augers in.
ID: 1024348 · Report as offensive
DJStarfox

Send message
Joined: 23 May 01
Posts: 1066
Credit: 1,226,053
RAC: 2
United States
Message 1024351 - Posted: 12 Aug 2010, 21:15:53 UTC - in response to Message 1024344.  

With $6k, would be nice to squeeze a real RAID card out for the new server.
ID: 1024351 · Report as offensive
Profile Dirk Sadowski
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1024352 - Posted: 12 Aug 2010, 21:19:19 UTC - in response to Message 1024344.  

Matt, thanks for the news!

ID: 1024352 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1024353 - Posted: 12 Aug 2010, 21:20:17 UTC - in response to Message 1024344.  

Thanks for the update Matt,

Claggy
ID: 1024353 · Report as offensive
Profile Bill Walker
Avatar

Send message
Joined: 4 Sep 99
Posts: 3868
Credit: 2,697,267
RAC: 0
Canada
Message 1024354 - Posted: 12 Aug 2010, 21:20:44 UTC

So, if Hocket referred to the way you guys share work at Berkely, does Hemiola refer to the days between server failures lately? (1 2 3, 1 2 3, 1 2, 1 2, 1 2)

Oh, and thanks for the update.

ID: 1024354 · Report as offensive
Profile Gary Charpentier Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 25 Dec 00
Posts: 30919
Credit: 53,134,872
RAC: 32
United States
Message 1024362 - Posted: 12 Aug 2010, 21:38:06 UTC

Thanks for the update.

ID: 1024362 · Report as offensive
Profile soft^spirit
Avatar

Send message
Joined: 18 May 99
Posts: 6497
Credit: 34,134,168
RAC: 0
United States
Message 1024369 - Posted: 12 Aug 2010, 22:15:21 UTC

As one of the donations (yet to be delivered.. plans are in progress)..
I would vote for a name like "Planters" Cause its from a bunch of nuts.
Janice
ID: 1024369 · Report as offensive
Profile soft^spirit
Avatar

Send message
Joined: 18 May 99
Posts: 6497
Credit: 34,134,168
RAC: 0
United States
Message 1024377 - Posted: 12 Aug 2010, 22:45:47 UTC - in response to Message 1024344.  

Matt.. if I might share.. from my experiences of keeping together antiques that were often poorly "refurbished"..

many problems clear permanently upon "re-seating" unplugging, and plugging back in. Other times taking things out, some surprise drops loose(seen or unseen 50/50).. and are then magically "fixed". Whether they were dirty connections, a bit of dust, someones raisinette.. does not really matter as long as they clear. a bad connection invisible to the eye might nearly need "bumped".. and could be gone forever.

We came up with things such as "pencil test".. where while monitoring the signal we tapped the outside case and see if it had effects. And some of the equipment was old enough to even contain mercury relays, where the mercury would vaporize, re-solidify in obscure pieces, and refuse to work until we "bounced" (hold edge of component 3-4" above anti-static surface, drop and catch on first bounce, re-insert) to clear.

These are also good reasons why "fault tolerance" is a good(although expensive) principle.

On the reports going back.. all of these were jotted down as "re-seat to clear."

Because if we told the truth, the whole truth, and nothing but the truth... it would have been the Salem Witch trials all over again.
Janice
ID: 1024377 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 66199
Credit: 55,293,173
RAC: 49
United States
Message 1024378 - Posted: 12 Aug 2010, 22:47:29 UTC - in response to Message 1024369.  

As one of the donations (yet to be delivered.. plans are in progress)..
I would vote for a name like "Planters" Cause its from a bunch of nuts.

Planters sound good to Me too.

@ Matt: Thanks for the update on Morks Odyssey.
Savoir-Faire is everywhere!
The T1 Trust, T1 Class 4-4-4-4 #5550, America's First HST

ID: 1024378 · Report as offensive
John McLeod VII
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 15 Jul 99
Posts: 24806
Credit: 790,712
RAC: 0
United States
Message 1024427 - Posted: 13 Aug 2010, 1:17:46 UTC - in response to Message 1024378.  

As one of the donations (yet to be delivered.. plans are in progress)..
I would vote for a name like "Planters" Cause its from a bunch of nuts.

Planters sound good to Me too.

@ Matt: Thanks for the update on Morks Odyssey.

How about Bedlam? As in a house full of fruits, nuts and flakes.


BOINC WIKI
ID: 1024427 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 66199
Credit: 55,293,173
RAC: 49
United States
Message 1024449 - Posted: 13 Aug 2010, 3:03:51 UTC - in response to Message 1024427.  

As one of the donations (yet to be delivered.. plans are in progress)..
I would vote for a name like "Planters" Cause its from a bunch of nuts.

Planters sound good to Me too.

@ Matt: Thanks for the update on Morks Odyssey.

How about Bedlam? As in a house full of fruits, nuts and flakes.

I'm sure someone will find a name somewhere.
Savoir-Faire is everywhere!
The T1 Trust, T1 Class 4-4-4-4 #5550, America's First HST

ID: 1024449 · Report as offensive
Profile Jack Zhang
Volunteer tester
Avatar

Send message
Joined: 2 Jul 06
Posts: 206
Credit: 6,142,449
RAC: 0
Canada
Message 1024492 - Posted: 13 Aug 2010, 8:58:57 UTC

I hear SSD talk in this news post...

Avoid Kingston and consumer OCZ products when it comes to SSDs. Intel is only good if it's SLC memory and if there was an SSD move made, that SSD must have a supercapacitor to handle Server IOs per second. Pretty much the only choice when it comes to Server SSDs is the Sandforce SF-1500 controller chips with supercapacitor.
What if Fiction was Fact and Fact was Fiction and vice versa?
ID: 1024492 · Report as offensive
Profile Helli_retiered
Volunteer tester
Avatar

Send message
Joined: 15 Dec 99
Posts: 707
Credit: 108,785,585
RAC: 0
Germany
Message 1024514 - Posted: 13 Aug 2010, 12:34:53 UTC - in response to Message 1024346.  

Shouldn't you name that new server after the benefactors? Or is MRJHJT too difficult to pronounce in the office? ;-)


Well i don't believe that we would find a word that's representing the six Sponsors.

But - i would love to see a Sticker on the Server with written on it like "Mainly sponsored by Mark, Richard, Josef, Helli, John and T.A." ;-)
A Picture in the SETI@home Photo Album would also be fine so we can say years later: "Hey, look, a 1/6 of this Rig was sponsored by me". :-)

Only my 2c. :-)

Helli
A loooong time ago: First Credits after SETI@home Restart
ID: 1024514 · Report as offensive
Josef W. Segur
Volunteer developer
Volunteer tester

Send message
Joined: 30 Oct 99
Posts: 4504
Credit: 1,414,761
RAC: 0
United States
Message 1024557 - Posted: 13 Aug 2010, 14:42:37 UTC - in response to Message 1024369.  

soft^spirit wrote:
As one of the donations (yet to be delivered.. plans are in progress)..
I would vote for a name like "Planters" Cause its from a bunch of nuts.

I never doubted your pledge for August 28, and believe the project should be considering $7000 as donated to the cause.

Stretching the allusion to peanuts a bit further, perhaps Carver would be an intersting name possibility.
                                                                Joe
ID: 1024557 · Report as offensive
Profile soft^spirit
Avatar

Send message
Joined: 18 May 99
Posts: 6497
Credit: 34,134,168
RAC: 0
United States
Message 1024563 - Posted: 13 Aug 2010, 15:02:37 UTC - in response to Message 1024557.  

honestly until a couple of posts ago, it never occured to me that it was not considered part of the 6K. There was an after the goal reached announcement donation of 1K..ahh well.

In any case.. add it however they want. "Hardware" is the only stipulation to it.
Janice
ID: 1024563 · Report as offensive
Profile KWSN THE Holy Hand Grenade!
Volunteer tester
Avatar

Send message
Joined: 20 Dec 05
Posts: 3187
Credit: 57,163,290
RAC: 0
United States
Message 1024565 - Posted: 13 Aug 2010, 15:04:24 UTC - in response to Message 1024377.  

Matt.. if I might share.. from my experiences of keeping together antiques that were often poorly "refurbished"..

many problems clear permanently upon "re-seating" unplugging, and plugging back in. Other times taking things out, some surprise drops loose(seen or unseen 50/50).. and are then magically "fixed". Whether they were dirty connections, a bit of dust, someones raisinette.. does not really matter as long as they clear. a bad connection invisible to the eye might nearly need "bumped".. and could be gone forever.

We came up with things such as "pencil test".. where while monitoring the signal we tapped the outside case and see if it had effects. And some of the equipment was old enough to even contain mercury relays, where the mercury would vaporize, re-solidify in obscure pieces, and refuse to work until we "bounced" (hold edge of component 3-4" above anti-static surface, drop and catch on first bounce, re-insert) to clear.

These are also good reasons why "fault tolerance" is a good(although expensive) principle.

On the reports going back.. all of these were jotted down as "re-seat to clear."

Because if we told the truth, the whole truth, and nothing but the truth... it would have been the Salem Witch trials all over again.


One thing that used to work on CRT terminals, back in the '80s, was to give them a "slap upside the screen". Some terminals would come back to life for a time after the slap. Location (and force) was brand-dependent, and with one of the brands, there were two methods that worked, depending on symptom: the slap, directed at the upper right of the CRT, and lifting the front of the CRT about an inch, and dropping. IBM 3278's were pretty reliable, but when they went, they could (sometimes...) be brought back by slapping the back right corner, and picking up the back about .5 inch, and dropping...

.

Hello, from Albany, CA!...
ID: 1024565 · Report as offensive
Profile Bill Walker
Avatar

Send message
Joined: 4 Sep 99
Posts: 3868
Credit: 2,697,267
RAC: 0
Canada
Message 1024574 - Posted: 13 Aug 2010, 15:24:57 UTC - in response to Message 1024565.  

Matt.. if I might share.. from my experiences of keeping together antiques that were often poorly "refurbished"..

many problems clear permanently upon "re-seating" unplugging, and plugging back in. Other times taking things out, some surprise drops loose(seen or unseen 50/50).. and are then magically "fixed". Whether they were dirty connections, a bit of dust, someones raisinette.. does not really matter as long as they clear. a bad connection invisible to the eye might nearly need "bumped".. and could be gone forever.

We came up with things such as "pencil test".. where while monitoring the signal we tapped the outside case and see if it had effects. And some of the equipment was old enough to even contain mercury relays, where the mercury would vaporize, re-solidify in obscure pieces, and refuse to work until we "bounced" (hold edge of component 3-4" above anti-static surface, drop and catch on first bounce, re-insert) to clear.

These are also good reasons why "fault tolerance" is a good(although expensive) principle.

On the reports going back.. all of these were jotted down as "re-seat to clear."

Because if we told the truth, the whole truth, and nothing but the truth... it would have been the Salem Witch trials all over again.


One thing that used to work on CRT terminals, back in the '80s, was to give them a "slap upside the screen". Some terminals would come back to life for a time after the slap. Location (and force) was brand-dependent, and with one of the brands, there were two methods that worked, depending on symptom: the slap, directed at the upper right of the CRT, and lifting the front of the CRT about an inch, and dropping. IBM 3278's were pretty reliable, but when they went, they could (sometimes...) be brought back by slapping the back right corner, and picking up the back about .5 inch, and dropping...


AS a (mostly) mechanical engineer, it does my heart good to see my electronic colleagues adapting the time honoured and tested ways of the mech eng.

ID: 1024574 · Report as offensive
Speedy
Volunteer tester
Avatar

Send message
Joined: 26 Jun 04
Posts: 1643
Credit: 12,921,799
RAC: 89
New Zealand
Message 1024911 - Posted: 14 Aug 2010, 5:33:56 UTC
Last modified: 14 Aug 2010, 5:34:46 UTC

What great news re the $6k donation. From Staycation (Jul 01 2010)
Data wise, we were able to get back to merging our various spike tables together full bore
How far through merging the spike tables are you now?

BOINC replica database saying running on the left hand side of the Server Status page yet beside Replica seconds behind master it says Offline. Is it still recovering after it's various crashes throughout the week?

Thanks so much for the update
ID: 1024911 · Report as offensive
1 · 2 · Next

Message boards : Technical News : Hemiola (Aug 12 2010)


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.