Suspended in Gaffa (Jan 24 2011)

Message boards : Technical News : Suspended in Gaffa (Jan 24 2011)
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
Profile Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 1 Mar 99
Posts: 1444
Credit: 957,058
RAC: 0
United States
Message 1070270 - Posted: 24 Jan 2011, 18:37:38 UTC

The problems last week with bruno (which continue, and I'll address below) completely overshadowed problems with our radar blanking suite which suddenly was unable to convert raw data into clean data which can then be split into workunits. So we ran out of work to send out over the weekend, and I was personally unable to do anything to help the effort in figuring out why. However, immediately this morning I spotted the problem. Long story short, this was one of those cases where the wild error messages with impossible number values were obscuring the less obvious real problem, which was simply a configuration file had gone missing. I replaced this file, and new work should be coming down the pike shortly.

Back to bruno: the woes continue with this system regarding its drives, though I am trying a few more things out before I throw my hands up in complete frustration. It would indeed be a shame to simply abandon this server as it has a lot to offer if it works. We'll have our server meeting later today to discuss where to go next on this front - I just wanted to give y'all an earlier than normal "heads up" today given the loss of work, etc.

- Matt

-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude
ID: 1070270 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51478
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1070271 - Posted: 24 Jan 2011, 18:39:45 UTC

Thanks for the news, Matt.

Glad you got the blanking problems sorted.....I am sure many folks will be happy to see new working coming down the pike soon.

And good luck with Bruno, my friend.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1070271 · Report as offensive
Profile SciManStev Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Jun 99
Posts: 6658
Credit: 121,090,076
RAC: 0
United States
Message 1070273 - Posted: 24 Jan 2011, 18:43:38 UTC

Thank you Matt! These little bits of information are invaluable to us out here in crunch land. We really do enjoy knowing what's happening, and appreciate your taking the time to tell us.

Steve!
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website
ID: 1070273 · Report as offensive
Profile skildude
Avatar

Send message
Joined: 4 Oct 00
Posts: 9541
Credit: 50,759,529
RAC: 60
Yemen
Message 1070274 - Posted: 24 Jan 2011, 18:44:41 UTC - in response to Message 1070273.  

woohoo let the good times roll


In a rich man's house there is no place to spit but his face.
Diogenes Of Sinope
ID: 1070274 · Report as offensive
Robert Ribbeck
Avatar

Send message
Joined: 7 Jun 02
Posts: 644
Credit: 5,283,174
RAC: 0
United States
Message 1070280 - Posted: 24 Jan 2011, 19:01:46 UTC

Three cheers for Matt

Hail to our chief
ID: 1070280 · Report as offensive
Profile Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 1 Mar 99
Posts: 1444
Credit: 957,058
RAC: 0
United States
Message 1070281 - Posted: 24 Jan 2011, 19:06:00 UTC

Good news - I was just now able to get an OS on the old bruno and reboot the system and it worked! There must have been some data gunk on the old root drives messing up the FC14 install. The installer kept crashing until I finally shredded these three drives (putting all zeros on them).

Of course, we still have to figure out the RAID situation for the remaining 21 drives. We currently have no working controller in the system, so we need to decide what to do next.

- Matt
-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude
ID: 1070281 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51478
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1070282 - Posted: 24 Jan 2011, 19:08:27 UTC - in response to Message 1070281.  

Wow!!!
2 victories in 1 day....
Well done, Sir.

If a new raid controller is all it will take to fix Bruno, that is good news.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1070282 · Report as offensive
KB7RZF
Volunteer tester
Avatar

Send message
Joined: 15 Aug 99
Posts: 9549
Credit: 3,308,926
RAC: 2
United States
Message 1070283 - Posted: 24 Jan 2011, 19:09:38 UTC

Awesome new's Matt. Thanks for the quick update!
ID: 1070283 · Report as offensive
Robert Ribbeck
Avatar

Send message
Joined: 7 Jun 02
Posts: 644
Credit: 5,283,174
RAC: 0
United States
Message 1070284 - Posted: 24 Jan 2011, 19:16:19 UTC

more cheers & another Hail to the chief

Matt you da man
ID: 1070284 · Report as offensive
Peter

Send message
Joined: 6 Nov 09
Posts: 40
Credit: 7,805,244
RAC: 0
Netherlands
Message 1070286 - Posted: 24 Jan 2011, 19:22:01 UTC

Great news matt, keep up the good work!!!
ID: 1070286 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1070290 - Posted: 24 Jan 2011, 19:35:45 UTC - in response to Message 1070281.  

Thanks for the updates Matt, glad you're making progress,

Claggy
ID: 1070290 · Report as offensive
bill

Send message
Joined: 16 Jun 99
Posts: 861
Credit: 29,352,955
RAC: 0
United States
Message 1070296 - Posted: 24 Jan 2011, 20:10:58 UTC - in response to Message 1070281.  

Excellent Matt. You've certainly earned your keep this month.

I suspect a new controller is just a short fund raiser away.

Just say the word.
ID: 1070296 · Report as offensive
Profile Gary Charpentier Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 25 Dec 00
Posts: 31006
Credit: 53,134,872
RAC: 32
United States
Message 1070304 - Posted: 24 Jan 2011, 20:23:28 UTC

Two down, one to go. Great work.

ID: 1070304 · Report as offensive
Robert Ribbeck
Avatar

Send message
Joined: 7 Jun 02
Posts: 644
Credit: 5,283,174
RAC: 0
United States
Message 1070307 - Posted: 24 Jan 2011, 20:37:32 UTC - in response to Message 1070296.  

Excellent Matt. You've certainly earned your keep this month.

I suspect a new controller is just a short fund raiser away.

Just say the word.


if its just a controller Todd already has offered one

http://setiathome.berkeley.edu/forum_thread.php?id=62871&nowrap=true#1069042
ID: 1070307 · Report as offensive
Speedy
Volunteer tester
Avatar

Send message
Joined: 26 Jun 04
Posts: 1643
Credit: 12,921,799
RAC: 89
New Zealand
Message 1070315 - Posted: 24 Jan 2011, 20:49:45 UTC

Great news all round. Thanks for all the hard work. On a side note Tech news page hasn't been updated since 13/01/11
ID: 1070315 · Report as offensive
Profile Dirk Sadowski
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1070330 - Posted: 24 Jan 2011, 21:41:13 UTC - in response to Message 1070281.  

Matt, thanks for the news!

ID: 1070330 · Report as offensive
Profile [AF>EDLS] Polynesia
Volunteer tester
Avatar

Send message
Joined: 1 Apr 09
Posts: 54
Credit: 5,361,172
RAC: 0
France
Message 1070338 - Posted: 24 Jan 2011, 22:27:11 UTC

I hope you do not lose data work?


Alliance Francophone
ID: 1070338 · Report as offensive
Doug vE
Avatar

Send message
Joined: 4 Sep 04
Posts: 47
Credit: 12,262,253
RAC: 0
United States
Message 1070347 - Posted: 24 Jan 2011, 23:29:12 UTC

Great news Matt, now I don't have to turn Einstein back on. I shut it down when the new servers got going so Seti could catch up after being down all that time and only Einstein was running. Hooray for Seti@Home and keep up the great work.

Seti@Home Classic Work Units 413
Seti@Home Classic CPU time 3,869 hours

Doug
ID: 1070347 · Report as offensive
Profile perryjay
Volunteer tester
Avatar

Send message
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 1070352 - Posted: 24 Jan 2011, 23:45:51 UTC

Guess it wasn't as simple as Matt made it sound. They must have run into some more problems as there is still no new work. :-( Hope it's not too serious and they are able to get back up soon.


PROUD MEMBER OF Team Starfire World BOINC
ID: 1070352 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14679
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1070361 - Posted: 25 Jan 2011, 0:28:17 UTC - in response to Message 1070352.  

Guess it wasn't as simple as Matt made it sound. They must have run into some more problems as there is still no new work. :-( Hope it's not too serious and they are able to get back up soon.

Ain't necessarily so. Doing anything at all with a 50 GB file takes time - even if he squirted one into the upstream end of the pipe just as he posted, we don't really know how long it takes before the complete thing emerges out of the other end of the sausage machine, ready to be mounted and offered up to the splitters.

For a continuous process like this, steady-state throughput gives no indication of how long it takes to recharge the (? many parallel) pipes from a drought.
ID: 1070361 · Report as offensive
1 · 2 · Next

Message boards : Technical News : Suspended in Gaffa (Jan 24 2011)


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.