Corrupt (Nov 05 2009)

Message boards : Technical News : Corrupt (Nov 05 2009)
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 1 Mar 99
Posts: 1444
Credit: 957,058
RAC: 0
United States
Message 945312 - Posted: 5 Nov 2009, 22:53:58 UTC

Eeeeoooo. Looks like this minor corruption in the science database is really snagging us, at least right now. We're talking one or two rows of the zillions in the astropulse signal table - but informix isn't being very informative about which row or two, nor what to do about it. Meanwhile, this broke the replication of astropulse - or at least we think it broke replication. This may very well have failed for some other reason.

This hasn't been a public data flow issue - we can still split/assimilate multibeam and astropulse work for the most part. Still, it's been preventing us from doing any science for a while now. So it's roll-up-our-sleeves time. We're doing a more robust table check (and hopefully repair) overnight tonight, and had to shut off astropulse splitting for now. Which means only multibeam workunits for the near term.

Meanwhile we filled up the raw data drive during all this software blanking analysis. I forgot to carry the one or something. Anyway, no big deal, some minor cleanup this morning, and we're back on track with that.

- Matt

-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude
ID: 945312 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 945314 - Posted: 5 Nov 2009, 22:58:04 UTC - in response to Message 945312.  

Thanks for the update Matt,

Claggy
ID: 945314 · Report as offensive
DJStarfox

Send message
Joined: 23 May 01
Posts: 1066
Credit: 1,226,053
RAC: 2
United States
Message 945321 - Posted: 5 Nov 2009, 23:18:28 UTC - in response to Message 945312.  

Yeah, databases generally can only fix pages, not rows. Let's hope your repair operation reports success in the morning.
ID: 945321 · Report as offensive
ront

Send message
Joined: 25 Aug 01
Posts: 77
Credit: 386,336
RAC: 0
United States
Message 945406 - Posted: 6 Nov 2009, 9:10:14 UTC - in response to Message 945312.  

Morning Matt,

Thanks for keeping us posted on the difficulties.

I may not understand some (all?) of the technical language, but I do get the "flavor."

Does any of this (and this may demonstrate my relative ignorance)affect tabulation of results? My little laptop is working silly churning out tasks as they are received. However, the "pending" table is growing ever larger (up to 37 now - going back to early September).

Please advise.

Thanks,

ront
ID: 945406 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 945526 - Posted: 7 Nov 2009, 0:16:33 UTC - in response to Message 945406.  

Does any of this (and this may demonstrate my relative ignorance)affect tabulation of results? My little laptop is working silly churning out tasks as they are received. However, the "pending" table is growing ever larger (up to 37 now - going back to early September).

It's best to post these questions in Number Crunching or the "help" forums, not Technical News.

SETI@Home sends each work unit to two different machines to process, and I looked through your pendings, and all of the ones I saw were waiting for the other machine (your "wingman") to report results.

Sometimes, that takes a while, and other times, crunchers just quit, or the machine breaks, or the boss finds out and orders work deleted, or a half-dozen other issues I haven't noted causes work to be lost.

That's why we have deadlines, and why those will be reassigned to new wingmen just as soon as the deadlines pass.

It looks to me like it's just normal pendings....
ID: 945526 · Report as offensive
Luke
Volunteer developer
Avatar

Send message
Joined: 31 Dec 06
Posts: 2546
Credit: 817,560
RAC: 0
New Zealand
Message 946062 - Posted: 9 Nov 2009, 5:14:31 UTC
Last modified: 9 Nov 2009, 5:14:45 UTC

What happened to the database servers Matt? Any specifics about why they went down, or no?

- Luke.
- Luke.
ID: 946062 · Report as offensive
Profile Gary Charpentier Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 25 Dec 00
Posts: 30593
Credit: 53,134,872
RAC: 32
United States
Message 946075 - Posted: 9 Nov 2009, 6:06:57 UTC

To whomever spent time fixing it on Sunday, THANKS.

ID: 946075 · Report as offensive

Message boards : Technical News : Corrupt (Nov 05 2009)


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.