Weirderer (Sep 07 2007)

Message boards : Technical News : Weirderer (Sep 07 2007)
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3

AuthorMessage
BarryAZ

Send message
Joined: 1 Apr 01
Posts: 2580
Credit: 16,982,517
RAC: 0
United States
Message 636892 - Posted: 9 Sep 2007, 0:16:17 UTC

By the way, it looks like the Rosetta folks are close to returning:


Rosetta@home has experienced a large hardware failure - well, firmware failure to be exact - which resulted in a failure of our SAN. We've been recovering everything since the outage began Sept 4th and are nearly ready to bring things back online.... -KEL
ID: 636892 · Report as offensive
Lois Petrolito

Send message
Joined: 9 Nov 03
Posts: 10
Credit: 68,170
RAC: 0
United States
Message 636975 - Posted: 9 Sep 2007, 2:29:16 UTC - in response to Message 635635.  

Last night the assimilators stopped inserting work into the science database. We discovered that one of the indexes on the result table was corrupt - whether or not this was caused by the recent drive failures, or if this had anything to do with the assimilator problem was anybody's guess.

I started off the result index checker last night and quickly after that a THIRD drive failed on thumper in as many days. This is getting ridiculous, especially as there are no apparent signs why the drives are failing, and we're running low on spares.

This morning Bob started rebuilding the corrupt index and once that is finish I'll start the assimilators (hopefully they will be happy) and catch up on the major backlog. Maybe then I'll start the splitters, but given how our science database might tank any second we might hold off on that. In short: there may be no new work until Monday.

- Matt


No problem-I upped my cache to 3 days work, and I'm only running SETI@Home 1/3 of the time(also running Einstein and those WU are HUGE!) BTW, the download speed problem I was having a few days ago seemed to have straightened itself out(before the work unit thingie went down), and I got three work units at about the same rate I had been getting them before.

Good luck on the repairs-I can relate to reindexing database files-I had a server crash on me this past week(power failure)and spent most of the following day reindexing and doing all sorts of other stuff to get the work out to the people who needed it(and this is just a small LAN I have to deal with).
ID: 636975 · Report as offensive
Profile DarkStar
Volunteer tester
Avatar

Send message
Joined: 9 May 07
Posts: 11
Credit: 328,498
RAC: 0
Belgium
Message 637090 - Posted: 9 Sep 2007, 9:15:29 UTC

I've got about 15 WU's too on one of my machines, but now everything is just as dead as it was yesterday :)
I noticed on the server stats page that 3 splitters are brought online, so I guess more WU's will be created, but right now the demand is just too high for them to keep up! At least I'm back to crunching!

You're all doing an amazing job, hang in there!
ID: 637090 · Report as offensive
Lois Petrolito

Send message
Joined: 9 Nov 03
Posts: 10
Credit: 68,170
RAC: 0
United States
Message 637146 - Posted: 9 Sep 2007, 13:24:58 UTC
Last modified: 9 Sep 2007, 13:46:40 UTC

Hallelujah!

I received a new WU.

Here is the copy of part of the messages sitting in the Messages tab when I opened the BOINC manager this AM. The times are Eastern Daylight.

2007-09-09 05:48:16 [SETI@home] Fetching scheduler list
2007-09-09 05:48:22 [SETI@home] Master file download succeeded
2007-09-09 05:48:27 [SETI@home] Sending scheduler request: To fetch work
2007-09-09 05:48:27 [SETI@home] Requesting 15683 seconds of new work
2007-09-09 05:48:33 [SETI@home] Scheduler RPC succeeded [server version 511]
2007-09-09 05:48:33 [SETI@home] Deferring communication for 1 min 0 sec
2007-09-09 05:48:33 [SETI@home] Reason: no work from project
2007-09-09 05:49:38 [SETI@home] Sending scheduler request: To fetch work
2007-09-09 05:49:38 [SETI@home] Requesting 15700 seconds of new work
2007-09-09 05:49:44 [SETI@home] Scheduler RPC succeeded [server version 511]
2007-09-09 05:49:44 [SETI@home] Deferring communication for 1 min 0 sec
2007-09-09 05:49:44 [SETI@home] Reason: no work from project
2007-09-09 05:50:50 [SETI@home] Sending scheduler request: To fetch work
2007-09-09 05:50:50 [SETI@home] Requesting 15708 seconds of new work
2007-09-09 05:50:56 [SETI@home] Scheduler RPC succeeded [server version 511]
2007-09-09 05:50:56 [SETI@home] Deferring communication for 11 sec
2007-09-09 05:50:56 [SETI@home] Reason: requested by project
2007-09-09 05:50:58 [SETI@home] [file_xfer] Started download of file 12mr07ac.8727.1299.8.5.191
2007-09-09 05:51:02 [SETI@home] [file_xfer] Finished download of file 12mr07ac.8727.1299.8.5.191
2007-09-09 05:51:02 [SETI@home] [file_xfer] Throughput 89947 bytes/sec
2007-09-09 07:49:10 [SETI@home] Restarting task 12mr07ab.28553.18068.6.5.147_0 using setiathome_enhanced version 527

I' m no real techie-although I do run the computers at work-I'm just your prime example of an average Josephine running BOINC to keep my computer happy. The projects are running on a two year old Windows XP Home Edition PC with less than a gig of RAM and a VERY small(40GIG)hard drive(since have added two external USB drives to hold music and pictures which my son installed), which replaced a 5 year old 98 machine that I started Classic on.

Good work! And, as mentioned in an earlier post, the download speeds are getting back up there.
ID: 637146 · Report as offensive
Profile Munchkin

Send message
Joined: 19 Mar 02
Posts: 25
Credit: 577,375
RAC: 0
Sweden
Message 637409 - Posted: 9 Sep 2007, 19:48:10 UTC

Strangely enough I've been getting WU's to one of my machines so it hasn't had one second without something to do. At the same time my other one hasn't recieved anything at all. I guess it's just a matter of good timing for one and bad for the other!
ID: 637409 · Report as offensive
Profile Peter Söderlund
Avatar

Send message
Joined: 31 May 99
Posts: 33
Credit: 1,744,426
RAC: 0
Sweden
Message 637559 - Posted: 9 Sep 2007, 23:08:10 UTC - in response to Message 637409.  

Strangely enough I've been getting WU's to one of my machines so it hasn't had one second without something to do. At the same time my other one hasn't recieved anything at all. I guess it's just a matter of good timing for one and bad for the other!


Well, I have four computers, and all of them have recieved new WU's. New one, not recends.

/Peter
ID: 637559 · Report as offensive
MEEK

Send message
Joined: 20 Dec 06
Posts: 1
Credit: 189,199
RAC: 0
United Kingdom
Message 637653 - Posted: 10 Sep 2007, 0:00:50 UTC

cant go rong with some solid state hard drives no moving parts bit pricy but worth it in the long run
ID: 637653 · Report as offensive
Profile Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 1 Mar 99
Posts: 1444
Credit: 957,058
RAC: 0
United States
Message 637672 - Posted: 10 Sep 2007, 0:15:11 UTC

For the record I was so *not* able to do much work yesterday. Jeff was the one who made the observations and determined it was okay to start the splitters "ahead of schedule." So far so good. No drive "failures."

- Matt
-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude
ID: 637672 · Report as offensive
Profile Labbie
Avatar

Send message
Joined: 19 Jun 06
Posts: 4083
Credit: 5,930,102
RAC: 0
United States
Message 637716 - Posted: 10 Sep 2007, 0:43:28 UTC

Well, then we have a big Thank You for Jeff.

And to everyone else on the staff.


Calm Chaos Forum...Join Calm Chaos Now
ID: 637716 · Report as offensive
OzzFan Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Apr 02
Posts: 15691
Credit: 84,761,841
RAC: 28
United States
Message 637793 - Posted: 10 Sep 2007, 1:46:15 UTC - in response to Message 637672.  

For the record I was so *not* able to do much work yesterday. Jeff was the one who made the observations and determined it was okay to start the splitters "ahead of schedule." So far so good. No drive "failures."


Now we know who to blame if the hard drive fails this time! LOL Just kidding Jeff.
ID: 637793 · Report as offensive
DJStarfox

Send message
Joined: 23 May 01
Posts: 1066
Credit: 1,226,053
RAC: 2
United States
Message 637803 - Posted: 10 Sep 2007, 2:22:13 UTC - in response to Message 637672.  

For the record I was so *not* able to do much work yesterday. Jeff was the one who made the observations and determined it was okay to start the splitters "ahead of schedule." So far so good. No drive "failures."

- Matt


Thanks, Matt and Jeff for the work. I would be happy to have a slow project than one that crashes. So, if we have to live with one splitter to keep the system from freaking out, then that's OK with me. I'm thankful that despite the hardware problems, the SETI science is still moving forward (slowly).
ID: 637803 · Report as offensive
John

Send message
Joined: 24 Jul 07
Posts: 1
Credit: 144,425
RAC: 0
Australia
Message 638108 - Posted: 10 Sep 2007, 15:29:53 UTC

Hi,

First time ever posting here. Been involved with this project for some years now.

It's the nature of I.T, things break, i'm sure you guys/girls will find the problem and have it sorted :)

Keep up the good work!
ID: 638108 · Report as offensive
Previous · 1 · 2 · 3

Message boards : Technical News : Weirderer (Sep 07 2007)


 
©2025 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.