Return (Feb 16 2012)

Message boards : Technical News : Return (Feb 16 2012)

Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Joined: 1 Mar 99
Posts: 1444
Credit: 957,058
RAC: 0
United States
Message 1196110 - Posted: 16 Feb 2012, 21:04:27 UTC

Hello gang. I'm back from the latest bout of alternative career maintenance. Seems like I didn't miss too much, and unlike normal the server problems waited until *after* I returned. My next disappearance (only about 10 days) will be in mid-April (touring in Argentina, Chile, and Brazil).

Before the usual Tuesday server outage, Jeff noticed the splitters having trouble inserting new work into the science database. After some detective work and tests, we found we had hit one of several possible Informix logical limits: we ran out of extents in the workunit table.

Not a big deal, and we've hit this limit with other tables several times before. But the fix is a bit of a hassle. Basically you have to create a whole new table from scratch with more extents and repopulate it with all the data from the "full" table. We have a billion workunits in that table, so to speed this process up we only moved over workunits from the last 90 days before turning the projects on again. The assimilators only need the last 90 days of workunits to work, but to get the NTPCkrs rolling again we need to repopulate the whole thing, which we'll do more casually.
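
The "recent rows first, backfill later" approach described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the project's actual procedure: it uses Python's built-in sqlite3 module instead of Informix (SQLite has no extent settings, so that part is not modeled), and the table names, the `created_day` column, and both function names are hypothetical.

```python
import sqlite3

RECENT_DAYS = 90  # the window the post says the assimilators need


def rebuild_recent_first(conn, now_day):
    """Create a replacement table, copy only recent rows, then swap it in.

    Hypothetical schema: workunit(id, created_day). In Informix the new
    table would be created with larger extent sizes; SQLite has no such
    option, so that step is omitted here.
    """
    cutoff = now_day - RECENT_DAYS
    cur = conn.cursor()
    cur.execute(
        "CREATE TABLE workunit_new (id INTEGER PRIMARY KEY, created_day INTEGER)"
    )
    cur.execute(
        "INSERT INTO workunit_new "
        "SELECT id, created_day FROM workunit WHERE created_day >= ?",
        (cutoff,),
    )
    # Keep the full table around under another name for the later backfill.
    cur.execute("ALTER TABLE workunit RENAME TO workunit_old")
    cur.execute("ALTER TABLE workunit_new RENAME TO workunit")
    conn.commit()


def backfill_older(conn, batch_size):
    """Copy one batch of not-yet-moved rows; return how many were copied."""
    cur = conn.cursor()
    rows = cur.execute(
        "SELECT id, created_day FROM workunit_old "
        "WHERE id NOT IN (SELECT id FROM workunit) ORDER BY id LIMIT ?",
        (batch_size,),
    ).fetchall()
    cur.executemany("INSERT INTO workunit (id, created_day) VALUES (?, ?)", rows)
    conn.commit()
    return len(rows)


def demo():
    """Build a toy 200-row table, rebuild it, and backfill in batches."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE workunit (id INTEGER PRIMARY KEY, created_day INTEGER)"
    )
    conn.executemany(
        "INSERT INTO workunit VALUES (?, ?)", [(i, i) for i in range(1, 201)]
    )
    rebuild_recent_first(conn, now_day=200)
    recent = conn.execute("SELECT COUNT(*) FROM workunit").fetchone()[0]
    while backfill_older(conn, batch_size=50):
        pass  # in practice this would run slowly in the background
    total = conn.execute("SELECT COUNT(*) FROM workunit").fetchone()[0]
    return recent, total
```

Calling `demo()` returns the row count right after the swap and the count once the backfill loop has finished. The point of the split is that the service can come back as soon as the first count is in place, while the second is reached at leisure.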

Not sure if anybody noticed, but I got the "connecting client types" page working again (for the umpteenth time). Let's see how long before it breaks again for some inexplicable reason: http://setiathome.berkeley.edu/client_types.php

Okay. I'm sure there's lots more to report but I'm going back to beating down my e-mail spool.

- Matt

-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude
ID: 1196110
perryjay
Volunteer tester
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 1196112 - Posted: 16 Feb 2012, 21:17:17 UTC - in response to Message 1196110.  

Welcome back Matt, hope you had a good time.


PROUD MEMBER OF Team Starfire World BOINC
ID: 1196112
Claggy
Volunteer tester

Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1196113 - Posted: 16 Feb 2012, 21:21:28 UTC - in response to Message 1196110.  

Thanks for the update Matt, welcome back,

Claggy
ID: 1196113
QSilver

Joined: 26 May 99
Posts: 232
Credit: 6,452,764
RAC: 0
United States
Message 1196122 - Posted: 16 Feb 2012, 22:20:13 UTC

Welcome back, Matt, and thanks for the quick update.
ID: 1196122
Cosmic_Ocean
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1196137 - Posted: 16 Feb 2012, 23:45:21 UTC

Welcome back. It's nice to hear what's going on behind the scenes.

Since we're talking about database stuff... is there anything that can be done for "stuck" WUs that have been pending for several months or, in some cases, years? They are ones where _0 and _1 got credit granted before _2 returned its result, and therefore _2 is stuck waiting.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1196137
kittyman
Volunteer tester
Joined: 9 Jul 00
Posts: 51478
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1196252 - Posted: 17 Feb 2012, 7:11:54 UTC

Welcome back, Matt. Glad you got things sorted and running again.

And I shall only say again how much your technical tidbits are missed when you are on the road making music. But I am sure the change of pace is good for you.

Meow!
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1196252
Josef W. Segur
Volunteer developer
Volunteer tester

Joined: 30 Oct 99
Posts: 4504
Credit: 1,414,761
RAC: 0
United States
Message 1196397 - Posted: 17 Feb 2012, 17:47:58 UTC - in response to Message 1196137.  

Welcome back. It's nice to hear what's going on behind the scenes.

Since we're talking about database stuff... is there anything that can be done for "stuck" WUs that have been pending for several months or, in some cases, years? They are ones where _0 and _1 got credit granted before _2 returned its result, and therefore _2 is stuck waiting.

Examples found at the end of the pending lists of the current top 20 hosts, WUs 764386014, 783672952, 785186126, 785467923, 785746766, 798674557, 802307404, 805724125, 806011986, 811044806, and 836743548.

As the last activity on all of those is more than 90 days ago, doing something now might not be sensible.
Joe
ID: 1196397
KWSN THE Holy Hand Grenade!
Volunteer tester
Joined: 20 Dec 05
Posts: 3187
Credit: 57,163,290
RAC: 0
United States
Message 1196429 - Posted: 17 Feb 2012, 19:46:09 UTC
Last modified: 17 Feb 2012, 19:50:21 UTC

Thanks for getting that page back online... the data in it were really old (around October 2010, IIRC) and didn't include a lot of the more modern versions of the BOINC client...

Next low-priority thing to work on: getting the telescope pointing data on the "Science Status" page working again.

Hello, from Albany, CA!...
ID: 1196429
