status : October 21, 2010

Message boards : Technical News : status : October 21, 2010

Jeff Cobb (Project Donor)
Volunteer moderator
Project administrator
Project developer
Project scientist

Joined: 1 Mar 99
Posts: 122
Credit: 40,367
RAC: 0
United States
Message 1043464 - Posted: 22 Oct 2010, 3:25:35 UTC

Well, bummer. The BOINC database on jocelyn crashed last night. The MySQL error message mentioned that the crash could be due to file system cache corruption, so I rebooted jocelyn in hopes of clearing it. I then ran checks on all of the tables and took a backup in case we need it to get mork going again as the replica.

I will attempt to start the project tomorrow morning, Pacific time.

ID: 1043464
kittyman (Crowdfunding Project Donor, Special Project $75 donor, Special Project $250 donor)
Volunteer tester
Joined: 9 Jul 00
Posts: 51478
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1043469 - Posted: 22 Oct 2010, 3:36:24 UTC - in response to Message 1043464.  

Have you considered the suggestion Eric and I made to reload MySQL in case of possible corruption?
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1043469
Gary Charpentier (Crowdfunding Project Donor, Special Project $75 donor, Special Project $250 donor)
Volunteer tester
Joined: 25 Dec 00
Posts: 31012
Credit: 53,134,872
RAC: 32
United States
Message 1043470 - Posted: 22 Oct 2010, 3:36:51 UTC - in response to Message 1043464.  

Thanks for the info. Always appreciated.

ID: 1043470
Doug vE
Joined: 4 Sep 04
Posts: 47
Credit: 12,262,253
RAC: 0
United States
Message 1043472 - Posted: 22 Oct 2010, 3:40:13 UTC

Thanks for the update, Jeff. I have been trying most of the day to post a suggestion. Yesterday people were throwing out names for Mork's replacement.
I would like to add my two cents' worth, and then I am going to add another contribution to the fund, if I can get the card away from my wife. Anyway, how about the name ALF, from the TV show? After all, that is what we are looking for: Alien Life Form(s).

I hope the kittens aren't too upset with that name; after all, he never did catch the family cat.

Keep up the excellent work.



Seti@Home Classic Work Units 413
Seti@Home Classic CPU time 3,869 hours

Doug
ID: 1043472
Dirk Sadowski
Volunteer tester
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1043519 - Posted: 22 Oct 2010, 10:36:46 UTC - in response to Message 1043464.  

Jeff, thanks for the news!

ID: 1043519
DJStarfox
Joined: 23 May 01
Posts: 1066
Credit: 1,226,053
RAC: 2
United States
Message 1043595 - Posted: 22 Oct 2010, 15:22:43 UTC - in response to Message 1043464.  

I had a "torn page detection" error on a production server many years back. It corrupted the entire DB before we found the cause: the hardware RAID controller had a firmware bug that caused corruption under high I/O load. What RAID controller and configuration are you using?
ID: 1043595
RandyF
Volunteer tester
Joined: 8 Jan 07
Posts: 15
Credit: 12,296,855
RAC: 1
United States
Message 1043659 - Posted: 22 Oct 2010, 18:08:06 UTC

I truly feel for you, Jeff. Are you getting any help from anyone else, or is it just you trying to keep all the servers up and running? I have 550 tasks to report, and am almost out of work... :(
ID: 1043659
Sleepy
Volunteer tester
Joined: 21 May 99
Posts: 219
Credit: 98,947,784
RAC: 28,360
Italy
Message 1043687 - Posted: 22 Oct 2010, 18:56:25 UTC

Thank you for the news, and keep up the good work!
We will crunch what we still have in store and then wait patiently until everything settles down.

It is a long-term project. The question has been around for 14 billion years, or 100,000, depending on how you look at it. We can wait a few weeks.

Take a breath, untangle the cables, and prepare a good environment for the next servers.

We will still be here when you are ready.

All the best to everybody, especially to the generous ones who made it possible that, although we are down, new servers are already on their way to SETI.

Sleepy
ID: 1043687
RottenMutt
Joined: 15 Mar 01
Posts: 1011
Credit: 230,314,058
RAC: 0
United States
Message 1043947 - Posted: 23 Oct 2010, 13:21:41 UTC - in response to Message 1043464.  

Please load new work to be split.
ID: 1043947
platium
Joined: 5 Jul 10
Posts: 212
Credit: 262,426
RAC: 0
United Kingdom
Message 1043951 - Posted: 23 Oct 2010, 13:45:48 UTC

If Jeff does his best, that's all we can ask. He's limited by budget, and they seem to be doing the best they can with less-than-perfect equipment. I also heard that students run the project most of the time; if that's true, they are going to get better as time goes on. Hopefully better equipment will help them a lot.

bill from uk
ID: 1043951
SockGap
Joined: 16 Apr 07
Posts: 14
Credit: 7,700,416
RAC: 0
Australia
Message 1044113 - Posted: 23 Oct 2010, 22:09:58 UTC - in response to Message 1043951.  

I also heard that they are students that run the project most of the time if that true they are going to get better as time go on, hopefully better equipment will help them alot.


Yes they are students - and most of them have been running the project since it started 11 years ago. I've worked in IT for 15 years and could only name 5 or 6 system admins anywhere near as good as these guys.

They do a phenomenal job with the equipment they've got - but it's finally given up. Hence why I've donated. Hopefully the new servers will allow the project to support the huge number of clients and answer the ever-increasing number of requests for work.

In the meantime I've set Seti to No New Work. I'll crunch my other projects and reduce the load on Jocelyn...
ID: 1044113

©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.