status : October 21, 2010


log in

Advanced search

Message boards : Technical News : status : October 21, 2010

Author Message
Jeff Cobb
Volunteer moderator
Project administrator
Project developer
Project scientist
Send message
Joined: 1 Mar 99
Posts: 110
Credit: 40,367
RAC: 0
United States
Message 1043464 - Posted: 22 Oct 2010, 3:25:35 UTC

Well, bummer. The boinc db on jocelyn crashed last night. The mysql message made mention that the crash could be due to file system cache corruption. So I rebooted jocelyn in hopes of clearing this. I then ran checks on all of the tables and did a backup in case we need it to get mork going again as the replica.

I will attempt to start the project tomorrow morning, pacific time.

____________

msattler
Volunteer tester
Avatar
Send message
Joined: 9 Jul 00
Posts: 37427
Credit: 500,481,630
RAC: 534,276
United States
Message 1043469 - Posted: 22 Oct 2010, 3:36:24 UTC - in response to Message 1043464.

Well, bummer. The boinc db on jocelyn crashed last night. The mysql message made mention that the crash could be due to file system cache corruption. So I rebooted jocelyn in hopes of clearing this. I then ran checks on all of the tables and did a backup in case we need it to get mork going again as the replica.

I will attempt to start the project tomorrow morning, pacific time.

Have you considered my and Eric's suggestion to reload mysql due to possible corruption?
____________
******************
Seti whacko, resident evil, and town clown...

Crunching Seti, loving all of God's kitties.

I have met a few friends in my life.
Most were cats.

Profile Gary Charpentier
Volunteer tester
Avatar
Send message
Joined: 25 Dec 00
Posts: 11732
Credit: 5,969,877
RAC: 0
United States
Message 1043470 - Posted: 22 Oct 2010, 3:36:51 UTC - in response to Message 1043464.

Well, bummer. The boinc db on jocelyn crashed last night. The mysql message made mention that the crash could be due to file system cache corruption. So I rebooted jocelyn in hopes of clearing this. I then ran checks on all of the tables and did a backup in case we need it to get mork going again as the replica.

I will attempt to start the project tomorrow morning, pacific time.

THanks for the info. Always appreciated.

____________

Doug vE
Avatar
Send message
Joined: 4 Sep 04
Posts: 44
Credit: 4,640,742
RAC: 3,167
United States
Message 1043472 - Posted: 22 Oct 2010, 3:40:13 UTC

Thanks for the update Jeff, I have been trying most of the day to post a suggestion. Yesterday people were throwing out names for Mork's replacement.
I would like to add my two cents worth and then I am going to add another contribution to the fund, if I can get the card away from my wife. Anyway how about the name ALF from the TV show. After all that is what we are looking for, Alien Life Form(s).

I hope the kittens aren't to upset with that name, after all he never did catch the family cat.

Keep up the excellent work.


____________

Seti@Home Classic Work Units 413
Seti@Home Classic CPU time 3,869 hours

Doug

Profile [seti.international] Dirk Sadowski
Volunteer tester
Avatar
Send message
Joined: 6 Apr 07
Posts: 6970
Credit: 57,124,415
RAC: 21,900
Germany
Message 1043519 - Posted: 22 Oct 2010, 10:36:46 UTC - in response to Message 1043464.

Jeff, thanks for the news!

____________
BR



>Das Deutsche Cafe. The German Cafe.<

DJStarfox
Send message
Joined: 23 May 01
Posts: 1040
Credit: 527,839
RAC: 70
United States
Message 1043595 - Posted: 22 Oct 2010, 15:22:43 UTC - in response to Message 1043464.

Well, bummer. The boinc db on jocelyn crashed last night. The mysql message made mention that the crash could be due to file system cache corruption. So I rebooted jocelyn in hopes of clearing this. I then ran checks on all of the tables and did a backup in case we need it to get mork going again as the replica.

I will attempt to start the project tomorrow morning, pacific time.


I had a "Torn Page Detection" happen on a production server many years back. Corrupted the entire DB before we found the cause. Turns out the hardware raid controller had a firmware bug that caused corruption under high I/O load. What's the RAID controller & configuration you're using?

Profile 335deezl
Volunteer tester
Avatar
Send message
Joined: 8 Jan 07
Posts: 15
Credit: 12,207,926
RAC: 0
United States
Message 1043659 - Posted: 22 Oct 2010, 18:08:06 UTC

I truly feel for you Jeff. Are you getting any help from anyone else, or is it just you trying to keep all the servers up and running? I have 550 tasks to report, and am almost out of work... :(
____________

sleepy
Avatar
Send message
Joined: 21 May 99
Posts: 67
Credit: 19,283,540
RAC: 23,832
Italy
Message 1043687 - Posted: 22 Oct 2010, 18:56:25 UTC

Thank you for the news and up with the good work!
We will crunch what we have still got in store and then wait patiently until everything settles down.

It is a long run project. Question has been around for 14 billion years, or 100.000, it depends on how you put it. We can wait some weeks.

Take a breath, untangle the cables, prepare a good environment for next servers.

We will be still here when you are ready.

All the best to everybody, especially to the generous ones who made it possible that we are down, but new servers are already coming to SETI.

Sleepy
____________

Profile RottenMutt
Avatar
Send message
Joined: 15 Mar 01
Posts: 992
Credit: 207,640,152
RAC: 9,518
United States
Message 1043947 - Posted: 23 Oct 2010, 13:21:41 UTC - in response to Message 1043464.

Please load new work to be split.
____________

Profile platium
Avatar
Send message
Joined: 5 Jul 10
Posts: 212
Credit: 262,426
RAC: 0
United Kingdom
Message 1043951 - Posted: 23 Oct 2010, 13:45:48 UTC

if Jeff does his best thats all we can have, he's limited by budget they seem to be doing the best they can with less that perfect equipment, I also heard that they are students that run the project most of the time if that true they are going to get better as time go on, hopefully better equipment will help them alot.

bill from uk

SockGap
Send message
Joined: 16 Apr 07
Posts: 13
Credit: 5,389,175
RAC: 2,308
Australia
Message 1044113 - Posted: 23 Oct 2010, 22:09:58 UTC - in response to Message 1043951.

I also heard that they are students that run the project most of the time if that true they are going to get better as time go on, hopefully better equipment will help them alot.


Yes they are students - and most of them have been running the project since it started 11 years ago. I've worked in IT for 15 years and could only name 5 or 6 system admins anywhere near as good as these guys.

They do a phenomenal job with the equipment they've got - but it's finally given up. Hence why I've donated. Hopefully the new servers will allow the project to support the huge number of clients and answer the ever-increasing number of requests for work.

In the mean time I've set Seti to No New Work. I'll crunch my other projects and reduce the load on Jocelyn...
____________

Message boards : Technical News : status : October 21, 2010

Copyright © 2014 University of California