Can't talk.. Debugging.. (May 15 2007)
Author | Message |
---|---|
Matt Lebofsky Joined: 1 Mar 99 Posts: 1444 Credit: 957,058 RAC: 0 |
We had the usual outage today which was mostly a success. The database compressed and was backed up in just over an hour. Normally this takes almost twice as long but the result table has significantly shrunk over the past two weeks (wonder why?). After that we put the new thumper in the closet (we being me, Eric, Jeff, and Kevin - it's a heavy machine). We also rebooted bruno to cleanly pick up a new disk (replacing a failed disk from yesterday). And I rebooted penguin to attach koloth's old tape drive to it (so it could read the classic data tapes for splitting). That all went well. We also updated all the BOINC-side code to bring the SETI@home project in line with the current BOINC source tree and a few things broke, namely our validators and assimilators. These aren't project critical for the time being, so we're postponing dealing with these until we deal with the real problem at hand: getting people to connect to our data servers. I think this is the longest outage we've ever had (even though it wasn't a "complete" outage - just no work was available) and we're in a whole new network configuration since the last major outage (new OS, new servers, new ISP, new switches, new router). In short, we're being clobbered by the returning flood of work requests. The major bottleneck is somewhere in the direction of our Hurricane router or bruno. Or at least that's the way it seems right now and there's no guarantee that when we break that dam a new bottleneck won't arise. I don't have the time to spell out what is broken and what we tried and what failed and what yielded unexpected results. Just know we're working on it and we understand most connections are being dropped. - Matt -- BOINC/SETI@home network/web/science/development person -- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude |
SATAN Joined: 27 Aug 06 Posts: 835 Credit: 2,129,006 RAC: 0 |
Cheers guys, we know you're doing the best you can. |
JerWA Joined: 3 Apr 99 Posts: 13 Credit: 4,262,442 RAC: 0 |
|
TimeLord04 Joined: 9 Mar 06 Posts: 21140 Credit: 33,933,039 RAC: 23 |
Thanks for the update, Matt; good luck with everything. 8-D TimeLord04 Have TARDIS, will travel... Come along K-9! Join Calm Chaos |
Dingo Joined: 28 Jun 99 Posts: 104 Credit: 16,364,896 RAC: 1 |
Thanks for the update, I think everyone realised that there would be a bottleneck somewhere. Proud Founder and member of Have a look at my WebCam |
Daniel Michel Joined: 2 Feb 04 Posts: 14925 Credit: 1,378,607 RAC: 6 |
I remember that back in the early days of SETI/BOINC there was no weekly scheduled outage... With all the new equipment coming online, does that mean that backup day may become a thing of the past? PROUD TO BE TFFE! |
Jose Montesinos Joined: 15 Apr 07 Posts: 1 Credit: 422,046 RAC: 0 |
Is there a way to send the results? I don't care about the credits, but some of the results will expire tomorrow. |
KZ3AB Joined: 1 Mar 00 Posts: 6 Credit: 4,084,338 RAC: 0 |
Z-Z-Z-Z Waiting. |
Gavin Shaw Joined: 8 Aug 00 Posts: 1116 Credit: 1,304,337 RAC: 0 |
Is there a way to send the results? I don't care about the credits, but some of the results will expire tomorrow. Same here. I've got results that haven't uploaded since this all started. Except I got some that expire today. Never surrender and never give up. In the darkest hour there is always hope. |
Ned Slider Joined: 12 Oct 01 Posts: 668 Credit: 4,375,315 RAC: 0 |
Totally understandable Matt. We know it will take a week or two for things to settle down again. We also know you guys will be doing all you can to ease the situation in the meantime, but you must be fighting a very uphill battle! I bet no one ever envisaged these levels of network traffic when they dreamed up SETI ;) *** My Guide to Compiling Optimised BOINC and SETI Clients *** *** Download Optimised BOINC and SETI Clients for Linux Here *** |
paul Joined: 29 Jul 01 Posts: 42 Credit: 23,126,185 RAC: 0 |
Boincers massing at the southern wall, sir. ;-) I've suspended Seti since the outage occurred; the fleet picked up on backup projects, at least until the logjam breaks. Our Team certainly ensured that backup projects were added, and gave everyone a quick lesson on how to ensure that they don't run out of work. I suspect many other BOINC projects benefited from increased resources the past week or two. Kudos to your team for getting the project back online; the efforts are appreciated. Team Starfire World BOINC IRC- irc//irc.teamstarfire.net:6667/team_starfire |
PUCE II Joined: 12 Oct 02 Posts: 3 Credit: 175,156 RAC: 0 |
Don't worry about rushing, guys. She'll be up when she's up, and we'll be here then. |
[SETI.USA]Tank_Master Joined: 1 Jan 01 Posts: 24 Credit: 2,194,285 RAC: 0 |
Does this mean the 64-bit clients will now be supported? |
littlegreenmanfrommars Joined: 28 Jan 06 Posts: 1410 Credit: 934,158 RAC: 0 |
I already have WUs that are behind deadline, and they look like they were downloaded after deadline. Of course, the outage is also affecting Beta. *sigh* Keep up the good work lads, we appreciate it! |
1mp0£173 Joined: 3 Apr 99 Posts: 8423 Credit: 356,897 RAC: 0 |
I already have WUs that are behind deadline, and they look like they were downloaded after deadline. As Odysseus pointed out, the validators are off..... |
Brian Silvers Joined: 11 Jun 99 Posts: 1681 Credit: 492,052 RAC: 0 |
I already have WUs that are behind deadline, and they look like they were downloaded after deadline. Is my understanding correct though that units that exceed deadline will still be reissued, thus creating more download traffic (reissue) and more upload attempts once completed? If so, this is a snowball. |
Grant (SSSF) Joined: 19 Aug 99 Posts: 13854 Credit: 208,696,464 RAC: 304 |
Is my understanding correct though that units that exceed deadline will still be reissued, thus creating more download traffic (reissue) and more upload attempts once completed? Nope. It won't add or subtract from the network traffic. It will just be one more Work Unit available to crunch amongst all the others that haven't yet been downloaded at all. Grant Darwin NT |
Aragon Speed Joined: 1 Apr 07 Posts: 3 Credit: 140,717 RAC: 0 |
It's a shame there isn't a smilie for pulling your hair out in frustration. ;) Aragon Speed XTM Team Member X-Tended Mod Website |
HachPi Joined: 2 Aug 99 Posts: 481 Credit: 21,807,425 RAC: 21 |
Keep on smiling... We will overcome some day. Grtz HP |
Mephist0 Joined: 4 Dec 99 Posts: 12 Credit: 1,401,540 RAC: 0 |
Isn't it possible to turn off the "due date" of the results until the connection problems are resolved? That way one result doesn't have to be sent out to more computers than necessary. It's just a waste of computing power in my eyes... |