Corrupt (Nov 05 2009)


Message boards : Technical News : Corrupt (Nov 05 2009)

Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Joined: 1 Mar 99
Posts: 1389
Credit: 74,079
RAC: 0
United States
Message 945312 - Posted: 5 Nov 2009, 22:53:58 UTC

Eeeeoooo. Looks like this minor corruption in the science database is really snagging us, at least right now. We're talking one or two rows of the zillions in the astropulse signal table - but Informix isn't being very informative about which row or two, or what to do about it. Meanwhile, this broke the replication of astropulse - or at least we think it broke replication. It may very well have failed for some other reason.

This hasn't been a public data flow issue - we can still split/assimilate multibeam and astropulse work for the most part. Still, it's been preventing us from doing any science for a while now. So it's roll-up-our-sleeves time. We're doing a more robust table check (and hopefully repair) overnight tonight, and had to shut off astropulse splitting for now. Which means only multibeam workunits for the near term.

Meanwhile we filled up the raw data drive during all this software blanking analysis. I forgot to carry the one or something. Anyway, no big deal, some minor cleanup this morning, and we're back on track with that.

- Matt

____________
-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude

Claggy
Project donor
Volunteer tester
Joined: 5 Jul 99
Posts: 4149
Credit: 33,824,273
RAC: 32,736
United Kingdom
Message 945314 - Posted: 5 Nov 2009, 22:58:04 UTC - in response to Message 945312.

Thanks for the update, Matt,

Claggy
____________


DJStarfox
Joined: 23 May 01
Posts: 1045
Credit: 561,441
RAC: 371
United States
Message 945321 - Posted: 5 Nov 2009, 23:18:28 UTC - in response to Message 945312.

Yeah, databases generally can only fix pages, not rows. Let's hope your repair operation reports success in the morning.

ront
Joined: 25 Aug 01
Posts: 77
Credit: 386,336
RAC: 0
United States
Message 945406 - Posted: 6 Nov 2009, 9:10:14 UTC - in response to Message 945312.

Morning Matt,

Thanks for keeping us posted on the difficulties.

I may not understand some (all?) of the technical language, but I do get the "flavor."

Does any of this (and this may demonstrate my relative ignorance) affect tabulation of results? My little laptop is working silly churning out tasks as they are received. However, the "pending" table is growing ever larger (up to 37 now - going back to early September).

Please advise.

Thanks,

ront
____________

1mp0£173
Volunteer tester
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 945526 - Posted: 7 Nov 2009, 0:16:33 UTC - in response to Message 945406.

Does any of this (and this may demonstrate my relative ignorance) affect tabulation of results? My little laptop is working silly churning out tasks as they are received. However, the "pending" table is growing ever larger (up to 37 now - going back to early September).

It's best to post these questions in Number Crunching or the "help" forums, not Technical News.

SETI@Home sends each work unit to two different machines to process, and I looked through your pendings, and all of the ones I saw were waiting for the other machine (your "wingman") to report results.

Sometimes that takes a while, and other times crunchers just quit, or the machine breaks, or the boss finds out and orders work deleted, or a half-dozen other issues I haven't noted cause work to be lost.

That's why we have deadlines, and why those will be reassigned to new wingmen just as soon as the deadlines pass.

It looks to me like it's just normal pendings....
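
For anyone who wants to see the pending/wingman logic above spelled out, here is a rough Python sketch. It is a toy model only, not BOINC's or SETI@home's actual scheduler code: the names `Task`, `WorkUnit`, `QUORUM`, `reissue_expired` and the host names are all made up for illustration, and the 30-day deadline is an assumed value.

```python
from dataclasses import dataclass, field
from datetime import datetime, timedelta

QUORUM = 2  # assumption: two reports are needed, per the description above


@dataclass
class Task:
    host: str            # hypothetical host name
    deadline: datetime
    reported: bool = False


@dataclass
class WorkUnit:
    name: str
    tasks: list = field(default_factory=list)

    def status(self) -> str:
        # A work unit stays "pending" until QUORUM hosts have reported.
        reported = sum(1 for t in self.tasks if t.reported)
        return "ready to validate" if reported >= QUORUM else "pending"

    def reissue_expired(self, now: datetime, new_host: str,
                        turnaround: timedelta) -> int:
        # Hand each overdue, unreported task to a new wingman.
        # Simplification: every reissued copy goes to the same new_host.
        expired = [t for t in self.tasks
                   if not t.reported and t.deadline < now]
        for t in expired:
            self.tasks.remove(t)
            self.tasks.append(Task(new_host, now + turnaround))
        return len(expired)


start = datetime(2009, 9, 1)
wu = WorkUnit("example_wu", [
    Task("host_a", start + timedelta(days=30), reported=True),
    Task("wingman_a", start + timedelta(days=30)),  # never reports
])
print(wu.status())  # pending: still waiting on the wingman

now = start + timedelta(days=31)  # the deadline has passed
wu.reissue_expired(now, "wingman_b", timedelta(days=30))
wu.tasks[-1].reported = True      # the new wingman returns a result
print(wu.status())  # ready to validate
```

The point of the sketch is just that a result sits in "pending" for as long as its partner task is outstanding, and that a missed deadline replaces the partner rather than discarding the work already reported.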
____________

Luke
Volunteer developer
Joined: 31 Dec 06
Posts: 2546
Credit: 817,560
RAC: 0
New Zealand
Message 946062 - Posted: 9 Nov 2009, 5:14:31 UTC
Last modified: 9 Nov 2009, 5:14:45 UTC

What happened to the database servers, Matt? Any specifics about why they went down, or no?

- Luke.
____________
- Luke.

Gary Charpentier
Project donor
Volunteer tester
Joined: 25 Dec 00
Posts: 12805
Credit: 7,378,744
RAC: 18,085
United States
Message 946075 - Posted: 9 Nov 2009, 6:06:57 UTC

To whoever spent time fixing it on Sunday, THANKS.

____________

Copyright © 2014 University of California