Panic Mode On (13) Server problems

Profile al

Send message
Joined: 30 Nov 02
Posts: 3
Credit: 3,500,759
RAC: 0
United Kingdom
Message 868042 - Posted: 22 Feb 2009, 13:11:13 UTC - in response to Message 868006.  

Just so you know, other projects work fine.
ID: 868042 · Report as offensive
Profile Joe

Send message
Joined: 2 Sep 02
Posts: 2
Credit: 125,460
RAC: 0
United States
Message 868069 - Posted: 22 Feb 2009, 15:24:27 UTC

For the last 2 days I have been able to receive work, but completed tasks won't clear. I keep getting the message that internet access is OK but the servers might be temporarily down. If the site were down, I believe I would not receive any new work. Does anyone have any insight? I have about 10 tasks per computer waiting to upload, on 5 computers. Need answers. Thank you.
ID: 868069 · Report as offensive
Aurora Borealis
Volunteer tester
Avatar

Send message
Joined: 14 Jan 01
Posts: 3075
Credit: 5,631,463
RAC: 0
Canada
Message 868071 - Posted: 22 Feb 2009, 15:30:38 UTC - in response to Message 868069.  
Last modified: 22 Feb 2009, 15:32:22 UTC

For the last 2 days I have been able to receive work, but completed tasks won't clear. I keep getting the message that internet access is OK but the servers might be temporarily down. If the site were down, I believe I would not receive any new work. Does anyone have any insight? I have about 10 tasks per computer waiting to upload, on 5 computers. Need answers. Thank you.

Look for an explanation in the Number Crunching forum, in the latest posts of the Panic thread. The servers are overloaded.
ID: 868071 · Report as offensive
Profile arkayn
Volunteer tester
Avatar

Send message
Joined: 14 May 99
Posts: 4438
Credit: 55,006,323
RAC: 0
United States
Message 868104 - Posted: 22 Feb 2009, 16:58:40 UTC

12 is getting long in the tooth, time for a new one.

ID: 868104 · Report as offensive
Profile arkayn
Volunteer tester
Avatar

Send message
Joined: 14 May 99
Posts: 4438
Credit: 55,006,323
RAC: 0
United States
Message 868105 - Posted: 22 Feb 2009, 17:00:03 UTC

... a temporary stop to AP downloads, until the cause of the anomaly can be investigated and corrected. Once the runaway download train is brought under control, uploads will look after themselves.

After having a few more looks at Scarecrow's AP graphs throughout the day, this measure gets my vote. It's all those AP units being downloaded that's clogging up the pipe.

OK, had a night's sleep and I think I've found the problem - well, the next stage in the chain.

Have a look at WU 417685549. Downloaded seven times, mine is the only one which is running - every other copy failed because they couldn't download the executable file. All my recent AP allocations look like that, though this is the most extreme.

Eric needs to turn on the 'proxy server' distribution channel used when new MB executables threaten to clog the pipes - or AP distribution needs to be restricted to those who have manually downloaded and installed the new Lunatics r112 optimisation for Astropulse_v5 (plug!).


As far as I can tell, they have Coral Cache turned on for the AP v5 as this link still works. The problem is a lot of the antivirus apps mark the redirect as suspect and do not allow the download to happen.

Of course, then we get the task errors because there is no app to process the work.

ID: 868105 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 868109 - Posted: 22 Feb 2009, 17:21:52 UTC - in response to Message 868104.  

12 is getting long in the tooth, time for a new one.



..but.. it would be better if we did not need a thread like this.. ;-D


..it would be nice if everything ran well all the time..


But then every day would be the same and we would have no 'fun' with our beloved project. ;-D

ID: 868109 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 65690
Credit: 55,293,173
RAC: 49
United States
Message 868111 - Posted: 22 Feb 2009, 17:30:32 UTC

I'm still having problems with uploads. Someone suggested in the last thread (#12) that a donation of $5 or so (whatever one can afford right now) from everyone here could go towards fixing the bandwidth bottleneck we have here. In 3 days and 3:55 (hours:minutes) I will be running on either fumes or empty here. I can take $10 from my savings account on the 27th and earmark it for fixing the SETI bandwidth problems.

2/22/2009 9:23:17 AM||Resuming network activity
2/22/2009 9:23:17 AM|SETI@home|[file_xfer] Started upload of file 17ja09aa.6195.388982.6.8.107_1_0
2/22/2009 9:23:17 AM|SETI@home|[file_xfer] Started upload of file 17ja09aa.6195.388982.6.8.109_1_0
2/22/2009 9:23:39 AM||Project communication failed: attempting access to reference site
2/22/2009 9:23:39 AM|SETI@home|[file_xfer] Temporarily failed upload of 17ja09aa.6195.388982.6.8.109_1_0: HTTP error
2/22/2009 9:23:39 AM|SETI@home|Backing off 1 hr 42 min 19 sec on upload of file 17ja09aa.6195.388982.6.8.109_1_0
2/22/2009 9:23:39 AM|SETI@home|[file_xfer] Started upload of file 17ja09aa.6195.388982.6.8.160_1_0
2/22/2009 9:23:40 AM||Access to reference site succeeded - project servers may be temporarily down.
2/22/2009 9:24:00 AM||Project communication failed: attempting access to reference site
2/22/2009 9:24:00 AM|SETI@home|[file_xfer] Temporarily failed upload of 17ja09aa.6195.388982.6.8.160_1_0: HTTP error
2/22/2009 9:24:00 AM|SETI@home|Backing off 3 hr 19 min 8 sec on upload of file 17ja09aa.6195.388982.6.8.160_1_0
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 868111 · Report as offensive
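For anyone who wants to total up the retry delays in a client log like the one above, here is a minimal sketch in Python. It assumes only the "Backing off ... on upload of" wording seen in the messages quoted in this thread; the function name `parse_backoffs` and the regular expression are illustrative and are not part of BOINC.

```python
import re

# Matches e.g. "Backing off 1 hr 42 min 19 sec on upload of file NAME".
# The "hr", "min", "sec" parts and the word "file" are all treated as optional.
BACKOFF_RE = re.compile(
    r"Backing off (?:(\d+) hr )?(?:(\d+) min )?(?:(\d+) sec )?"
    r"on upload of (?:file )?(\S+)"
)


def parse_backoffs(log_text):
    """Return {file_name: backoff_seconds} for every 'Backing off' line."""
    result = {}
    for line in log_text.splitlines():
        match = BACKOFF_RE.search(line)
        if match:
            hours, minutes, seconds, name = match.groups()
            result[name] = (
                int(hours or 0) * 3600 + int(minutes or 0) * 60 + int(seconds or 0)
            )
    return result


# Two lines copied from the log posted above.
SAMPLE = """\
2/22/2009 9:23:39 AM|SETI@home|Backing off 1 hr 42 min 19 sec on upload of file 17ja09aa.6195.388982.6.8.109_1_0
2/22/2009 9:24:00 AM|SETI@home|Backing off 3 hr 19 min 8 sec on upload of file 17ja09aa.6195.388982.6.8.160_1_0
"""

if __name__ == "__main__":
    for name, secs in parse_backoffs(SAMPLE).items():
        print(f"{name}: {secs} s")
```

The same pattern should also accept lines that omit the word "file" before the name, as in the other log quoted in this thread.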
Profile Fred J. Verster
Volunteer tester
Avatar

Send message
Joined: 21 Apr 04
Posts: 3252
Credit: 31,903,643
RAC: 0
Netherlands
Message 868115 - Posted: 22 Feb 2009, 17:41:35 UTC - in response to Message 868109.  
Last modified: 22 Feb 2009, 17:47:10 UTC

12 is getting long in the tooth, time for a new one.


..but.. it would be better if we did not need a thread like this.. ;-D


..it would be nice if everything ran well all the time..


But then every day would be the same and we would have no 'fun' with our beloved project. ;-D


That's also true, but I'd rather be able to upload and download, since more than 200 WUs are trying to get 'IN', and some new WUs would be nice, too.
This is too much (excitement), for me ;^)

22-2-2009 18:26:26|SETI@home|Backing off 2 hr 53 min 9 sec on upload of 15ja09aa.4494.11524.7.8.149_0_0
22-2-2009 18:26:26|SETI@home|Temporarily failed upload of 15ja09aa.4494.11524.7.8.141_1_0: connect() failed
22-2-2009 18:26:26|SETI@home|Backing off 3 hr 28 min 32 sec on upload of 15ja09aa.4494.11524.7.8.141_1_0
22-2-2009 18:26:26|SETI@home|Started upload of 15ja09aa.4494.11524.7.8.133_0_0
22-2-2009 18:26:26|SETI@home|Started upload of 15ja09aa.4494.11524.7.8.152_0_0
22-2-2009 18:26:27||Internet access OK - project servers may be temporarily down.
22-2-2009 18:26:37||Suspending network activity - user request

One host fewer out of the 1.5 million hosts doesn't make much difference . . .
ID: 868115 · Report as offensive
Profile Bruce
Volunteer tester
Avatar

Send message
Joined: 18 Sep 02
Posts: 92
Credit: 78,331
RAC: 0
Canada
Message 868157 - Posted: 22 Feb 2009, 19:07:45 UTC

Does this happen EVERY weekend or what?? Can't upload, can't download, what a farce! I thought this was an established project? Most alpha and beta projects are able to run without a weekly maintenance shutdown and a server outage every weekend. So what gives??
ID: 868157 · Report as offensive
Profile Hammeh
Volunteer tester
Avatar

Send message
Joined: 21 May 01
Posts: 135
Credit: 1,143,316
RAC: 0
United Kingdom
Message 868162 - Posted: 22 Feb 2009, 19:14:01 UTC

SETI@home is one of the oldest projects out there, and has a large user base.
Large number of users = high traffic, and at the moment the pipeline just does not have enough bandwidth.
ID: 868162 · Report as offensive
Tribble

Send message
Joined: 21 Feb 02
Posts: 65
Credit: 7,978,002
RAC: 0
Australia
Message 868166 - Posted: 22 Feb 2009, 19:17:36 UTC - in response to Message 868162.  

If I am understanding the problem, increasing the bandwidth won't do much.

The amount of data each user is asking for per day is a few GB due to the looping effect ... we are talking several tens of thousands of users here.
ID: 868166 · Report as offensive
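A rough back-of-the-envelope check of the argument above, as a small Python sketch. The inputs of 30,000 users and 3 GB per user per day are assumptions picked to match "several tens of thousands" and "a few GB"; they are not measurements, and the helper name `required_mbit_per_s` is made up for this illustration.

```python
def required_mbit_per_s(users, gb_per_user_per_day):
    """Sustained bandwidth (Mbit/s) needed to serve the given daily demand.

    Uses decimal units: 1 GB = 1e9 bytes, 1 Mbit = 1e6 bits.
    """
    bits_per_day = users * gb_per_user_per_day * 1e9 * 8
    return bits_per_day / 86_400 / 1e6


# Hypothetical inputs, not measured values.
demand = required_mbit_per_s(users=30_000, gb_per_user_per_day=3.0)
print(f"Sustained demand: {demand:,.0f} Mbit/s")
```

With these assumed inputs the demand comes out at several gigabits per second, which is consistent with the point that a modest bandwidth increase would not keep up while the looping effect persists.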
-Bert-

Send message
Joined: 23 Mar 02
Posts: 152
Credit: 412,754
RAC: 0
Netherlands
Message 868171 - Posted: 22 Feb 2009, 19:22:34 UTC - in response to Message 868157.  

Does this happen EVERY weekend or what??


Yes, almost every weekend, starting last Christmas and even before that :)

I thought this was an established project?


No, this is just a test lab :)

(Sorry, I couldn't resist).

BUT, I won't give up. I now have a lot of APs to upload, which will be credited at once when the upload problem is gone, because they're all re-issued ones :) Still waiting for a 5.03 WU though.
ID: 868171 · Report as offensive
Manny S. Jimenez

Send message
Joined: 24 Jan 09
Posts: 1
Credit: 241,086
RAC: 0
Philippines
Message 868179 - Posted: 22 Feb 2009, 19:40:14 UTC - in response to Message 867999.  

Might have to switch to Milky Way. This is the first project where I have encountered upload problems.
ID: 868179 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 65690
Credit: 55,293,173
RAC: 49
United States
Message 868181 - Posted: 22 Feb 2009, 19:41:27 UTC

Go here, read, and maybe donate to the cause. If you do donate, make it a specific donation.
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 868181 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 868187 - Posted: 22 Feb 2009, 19:51:36 UTC
Last modified: 22 Feb 2009, 19:52:31 UTC

I note it in nearly all 'panic threads'.. ;-)

Raise the number of WUs in your cache and you will 'nearly' never run out of work..
ID: 868187 · Report as offensive
Profile Hammeh
Volunteer tester
Avatar

Send message
Joined: 21 May 01
Posts: 135
Credit: 1,143,316
RAC: 0
United Kingdom
Message 868190 - Posted: 22 Feb 2009, 19:57:25 UTC

Very true. I keep 10 days' work on all my computers just in case something like this happens.
ID: 868190 · Report as offensive
Alinator
Volunteer tester

Send message
Joined: 19 Apr 05
Posts: 4178
Credit: 4,647,982
RAC: 0
United States
Message 868192 - Posted: 22 Feb 2009, 20:00:48 UTC - in response to Message 868187.  

I note it in nearly all 'panic threads'.. ;-)

Raise the number of WUs in your cache and you will 'nearly' never run out of work..


Of course, if you think that through, maxing out your cache settings is inherently self-defeating in the long run. ;-)

What if everybody did that? :-)

In any event, this is the weekend so I call it 'Par for the course'. If it was Wednesday OTOH, I might think about having a minor Panic. :-D

Alinator
ID: 868192 · Report as offensive
Dave Stegner
Volunteer tester
Avatar

Send message
Joined: 20 Oct 04
Posts: 540
Credit: 65,583,328
RAC: 27
United States
Message 868194 - Posted: 22 Feb 2009, 20:04:43 UTC

Have not been able to upload for about 16 hours now, more than 500 WUs to upload. 4-day cache, so I guess it is not an issue at the moment.

BUT:

Looking at the server status page and Scarecrow graphs makes me wonder.

Normally MB WUs are returned at a rate of 10 to 20 for every 1 AP WU.

Why does it suddenly appear easier to return an AP work unit than an MB WU??

Presently the ratio is approximately 2 to 1.


Dave

ID: 868194 · Report as offensive
FiveHamlet
Avatar

Send message
Joined: 5 Oct 99
Posts: 783
Credit: 32,638,578
RAC: 0
United Kingdom
Message 868195 - Posted: 22 Feb 2009, 20:06:05 UTC - in response to Message 868190.  
Last modified: 22 Feb 2009, 20:24:46 UTC

What sort of timescale are we looking at to clear the backlog?
I only keep about half a day's work, so I ran out of work at Saturday lunchtime.
I have 50 or so WUs to upload.
If there are 50,000 hosts, each with 50 WUs (on average, say) to upload, that is 2.5 million WUs.
It is like a lake full of water trying to go down a 1-inch pipe.
It could take days to clear, or am I on the wrong path?
ID: 868195 · Report as offensive
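The estimate above can be turned into a small calculation. This is only a sketch: the host and WU counts are the guesses from the post, while the rate of 20 accepted uploads per second is a purely hypothetical figure (nothing in this thread states the real server throughput), and `drain_time_days` is a name invented for this example.

```python
import math

SECONDS_PER_DAY = 86_400


def drain_time_days(hosts, wus_per_host, accepted_per_s, new_per_s=0.0):
    """Return (backlog, days) needed to clear the upload backlog.

    accepted_per_s: uploads the server can accept per second (assumed).
    new_per_s: newly finished results arriving per second (assumed).
    Returns math.inf days if the server cannot outpace new arrivals.
    """
    backlog = hosts * wus_per_host
    net_rate = accepted_per_s - new_per_s
    if net_rate <= 0:
        return backlog, math.inf
    return backlog, backlog / net_rate / SECONDS_PER_DAY


# 50,000 hosts with 50 WUs each, as guessed in the post above.
backlog, days = drain_time_days(hosts=50_000, wus_per_host=50, accepted_per_s=20.0)
print(f"Backlog: {backlog:,} WUs, about {days:.1f} days to clear")
```

At the assumed rate the backlog would take roughly a day and a half to clear, and longer once newly finished results are counted through `new_per_s`, so "days" is a plausible order of magnitude under these assumptions.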


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.