Off the Beach (Feb 16 2010)

Message boards : Technical News : Off the Beach (Feb 16 2010)

Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Joined: 1 Mar 99
Posts: 1444
Credit: 957,058
RAC: 0
United States
Message 970937 - Posted: 16 Feb 2010, 23:38:24 UTC

Hello again. Happy President's Day - we had the Monday off, plus I took the whole previous week off to go hang out in Kauai. First real vacation in a while, and last for the foreseeable future.

So what did I miss? Looks like the upload/scheduling servers have been clogged for a while due to a swarm of short-runners (workunits that complete quickly due to excessive noise). This should simmer down in due time. Plus we're having the usual outage today, so there will be a painful recovery from that as well. And things were running a little late today because a permissions problem held up the start of the outage. Patience.

While we did finally get the science database back in working order, we were finding the server still didn't have enough resources to meet our demands. So a new plan is being put into action over the coming weeks: instead of having both SETI@home and Astropulse reside on one server (thumper) and both replicated to another (bambi) - we're going to have SETI@home live on thumper and Astropulse live on bambi, both without replication. This will keep painfully long Astropulse analysis queries from clobbering the SETI@home project (which has been happening a lot lately). We may implement some form of our own replication, but we do back up the database regularly (and store those backups off site), so the replica doesn't buy us that much, especially considering we could double our database power by converting it to another primary server.

- Matt

-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude
ID: 970937
Sutaru Tsureku
Volunteer tester
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 970943 - Posted: 17 Feb 2010, 0:10:41 UTC


Thanks for the update!


Normally, during the weekly server maintenance, my PCs can still upload (UL) without trouble.
Not today.

Even now, uploading is not possible.

Do you think this problem will be solved soon?


____________
[Optimized project applications to increase your PC performance (double RAC)!] [Overview of abbreviations often used in the forum and their meanings.]
ID: 970943
Grant (SSSF)
Volunteer tester
Joined: 19 Aug 99
Posts: 13727
Credit: 208,696,464
RAC: 304
Australia
Message 970944 - Posted: 17 Feb 2010, 0:11:29 UTC - in response to Message 970937.  

So what did I miss? Looks like the upload/scheduling servers have been clogged for a while due to a swarm of short-runners (workunits that complete quickly due to excessive noise). This should simmer down in due time.

I think it may be more than that: upload and download traffic was very light before the outage, and even when traffic is extremely heavy before an outage, it's still possible to upload results during the outage.
From the looks of things, I've been unable to upload for about 24 hours, and still no joy.
BOINC 6.6.41
Grant
Darwin NT
ID: 970944
zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65736
Credit: 55,293,173
RAC: 49
United States
Message 970948 - Posted: 17 Feb 2010, 0:22:02 UTC - in response to Message 970944.  
Last modified: 17 Feb 2010, 0:24:29 UTC

So what did I miss? Looks like the upload/scheduling servers have been clogged for a while due to a swarm of short-runners (workunits that complete quickly due to excessive noise). This should simmer down in due time.

I think it may be more than that: upload and download traffic was very light before the outage, and even when traffic is extremely heavy before an outage, it's still possible to upload results during the outage.
From the looks of things, I've been unable to upload for about 24 hours, and still no joy.
BOINC 6.6.41

Same here on BOINC 6.10.32: no joy on uploads or reporting.
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 970948
Grant (SSSF)
Volunteer tester
Joined: 19 Aug 99
Posts: 13727
Credit: 208,696,464
RAC: 304
Australia
Message 970951 - Posted: 17 Feb 2010, 0:40:57 UTC - in response to Message 970948.  

Same here on BOINC 6.10.32: no joy on uploads or reporting.

Just had a look in my message log:
17/02/2010 9:10:27 SETI@home Scheduler request failed: Couldn't connect to server

Cricket graphs are still showing low traffic volumes. Although after the last couple of outages it took a couple of hours for things to pick up fully once they came back online, so I'll see how it is in 4-6 hours.
If they're still down, then there's more of a problem than just the uploads.
Grant
Darwin NT
ID: 970951
the3dge
Joined: 16 May 99
Posts: 19
Credit: 248,813,983
RAC: 0
United States
Message 970956 - Posted: 17 Feb 2010, 0:58:05 UTC - in response to Message 970951.  

I've been getting "Temporarily failed upload of (wu#): HTTP Error or connect() failed."

and
"Project communication failed: attempting access to reference site. Internet access OK - project servers may temporarily be down."

It's been doing this since the 14th for me. I had no check ins all of yesterday. I've rebooted all the systems several times, no luck.

I, personally, think there's something else at play. Others were reporting the same thing in a panic thread yesterday.
ID: 970956
zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65736
Credit: 55,293,173
RAC: 49
United States
Message 970966 - Posted: 17 Feb 2010, 1:42:45 UTC - in response to Message 970951.  

Same here on BOINC 6.10.32: no joy on uploads or reporting.

Just had a look in my message log:
17/02/2010 9:10:27 SETI@home Scheduler request failed: Couldn't connect to server

Cricket graphs are still showing low traffic volumes. Although after the last couple of outages it took a couple of hours for things to pick up fully once they came back online, so I'll see how it is in 4-6 hours.
If they're still down, then there's more of a problem than just the uploads.

Grant, this was happening before the outage too.
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 970966
Mike O
Joined: 1 Sep 07
Posts: 428
Credit: 6,670,998
RAC: 0
United States
Message 971018 - Posted: 18 Feb 2010, 0:42:30 UTC

Every client I have running has stuck WUs waiting for upload.
Normally I don't worry about it, but I can't seem to get a single thing uploaded now.
The servers mostly look OK (a few are down). Thumper and Bambi are running OK, though, and yet nothing is moving.

Not Ready Reading BRAIN. Abort/Retry/Fail?
ID: 971018
Doug Bannister
Joined: 9 Oct 02
Posts: 1
Credit: 2,811,009
RAC: 0
Canada
Message 971287 - Posted: 18 Feb 2010, 18:47:12 UTC - in response to Message 970956.  

I'm seeing the same thing, with that error message, on a couple of PCs I am running SETI on.
ID: 971287
cougar9t9
Joined: 11 Jun 04
Posts: 1
Credit: 775,062
RAC: 0
United Kingdom
Message 971606 - Posted: 19 Feb 2010, 10:38:30 UTC - in response to Message 971287.  

I too am having the same communication problems. I have so many completed work units building up that cannot be uploaded back to the server (project backoff, server may be down). Hope the problem is sorted soon.
ID: 971606
Jim H
Joined: 28 Nov 06
Posts: 12
Credit: 2,186,439
RAC: 0
United States
Message 971702 - Posted: 19 Feb 2010, 16:50:52 UTC

I'm in the same boat... no uploads for several days now.

Hope they get it going soon.

Jim
Clear Skies to all amateur Astronomers out there...
ID: 971702
Bill Walker
Joined: 4 Sep 99
Posts: 3868
Credit: 2,697,267
RAC: 0
Canada
Message 971720 - Posted: 19 Feb 2010, 17:16:55 UTC

Hey guys, there is more information on this in the Number Crunching forum. They had an A/C breakdown early in the week, and we still seem to be coping with the results of that.

ID: 971720

©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.