Panic Mode On (9) Server problems

Profile zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65734
Credit: 55,293,173
RAC: 49
United States
Message 815895 - Posted: 8 Oct 2008, 12:44:27 UTC

NASA, we have a problem with the uploads, and maybe with reporting too. I went to bed around midnight, about 5.5 hours ago. According to my logs, the last successful scheduler contact was at 1:53 AM PDT and uploads have been failing since 2:17 AM.

10/8/2008 1:53:35 AM|SETI@home|Scheduler RPC succeeded [server version 603]
10/8/2008 1:53:35 AM|SETI@home|Deferring communication for 11 sec
10/8/2008 1:53:35 AM|SETI@home|Reason: requested by project
10/8/2008 2:16:54 AM|SETI@home|Computation for task 17au08ae.13404.323159.13.8.69_1 finished
10/8/2008 2:16:54 AM|SETI@home|Restarting task 17au08ae.13404.323159.13.8.46_0 using setiathome_enhanced version 528
10/8/2008 2:16:58 AM|SETI@home|[file_xfer] Started upload of file 17au08ae.13404.323159.13.8.69_1_0
10/8/2008 2:17:00 AM||Project communication failed: attempting access to reference site
10/8/2008 2:17:00 AM|SETI@home|[file_xfer] Temporarily failed upload of 17au08ae.13404.323159.13.8.69_1_0: connect() failed
[snip]
10/8/2008 5:35:04 AM|SETI@home|[file_xfer] Started upload of file 22au08ac.19073.13160.3.8.75_0_0
10/8/2008 5:35:06 AM||Project communication failed: attempting access to reference site
10/8/2008 5:35:06 AM|SETI@home|[file_xfer] Temporarily failed upload of 22au08ac.19073.13160.3.8.75_0_0: connect() failed
10/8/2008 5:35:06 AM|SETI@home|Backing off 1 hr 25 min 0 sec on upload of file 22au08ac.19073.13160.3.8.75_0_0
10/8/2008 5:35:08 AM||Access to reference site succeeded - project servers may be temporarily down.
10/8/2008 5:35:14 AM|SETI@home|[file_xfer] Started upload of file 17au08ae.13404.323159.13.8.31_1_0
10/8/2008 5:35:16 AM||Project communication failed: attempting access to reference site
10/8/2008 5:35:17 AM||Access to reference site succeeded - project servers may be temporarily down.
10/8/2008 5:35:17 AM|SETI@home|[file_xfer] Temporarily failed upload of 17au08ae.13404.323159.13.8.31_1_0: connect() failed
10/8/2008 5:35:17 AM|SETI@home|Backing off 1 hr 36 min 12 sec on upload of file 17au08ae.13404.323159.13.8.31_1_0
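
For anyone who wants to total up the back-offs in a log like this, here is a rough Python sketch. It assumes the "timestamp|project|message" layout and the "Backing off ... on upload of file ..." wording exactly as they appear in the excerpt above; the function names are made up for illustration and are not part of BOINC.

```python
import re
from datetime import datetime

# Wording assumed from the log excerpt; the hr and min parts are treated as optional.
BACKOFF = re.compile(
    r"Backing off (?:(\d+) hr )?(?:(\d+) min )?(\d+) sec on upload of file (\S+)"
)


def parse_line(line):
    """Split one log line into (timestamp, project, message)."""
    stamp, project, message = line.rstrip("\n").split("|", 2)
    return datetime.strptime(stamp, "%m/%d/%Y %I:%M:%S %p"), project, message


def upload_backoffs(lines):
    """Return {file name: back-off in seconds} for every back-off message."""
    result = {}
    for line in lines:
        if line.count("|") < 2:
            continue  # skip "[snip]" and anything else that is not a log line
        _, _, message = parse_line(line)
        match = BACKOFF.search(message)
        if match:
            hours, minutes, seconds, name = match.groups()
            result[name] = (
                int(hours or 0) * 3600 + int(minutes or 0) * 60 + int(seconds)
            )
    return result
```

Feeding it the lines above gives the delay per file in seconds, which makes it easier to see when the client will next retry.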
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 815895 · Report as offensive
W-K 666 Project Donor
Volunteer tester

Joined: 18 May 99
Posts: 19045
Credit: 40,757,560
RAC: 67
United Kingdom
Message 815905 - Posted: 8 Oct 2008, 13:34:08 UTC - in response to Message 815895.  

See Server Status page.

upload server	bruno	Disabled

ID: 815905 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65734
Credit: 55,293,173
RAC: 49
United States
Message 815916 - Posted: 8 Oct 2008, 14:30:47 UTC - in response to Message 815905.  

See Server Status page.

upload server	bruno	Disabled


OK, that answers that. Now, as to the why?
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 815916 · Report as offensive
Profile Aristoteles Doukas
Joined: 11 Apr 08
Posts: 1091
Credit: 2,140,913
RAC: 0
Finland
Message 815919 - Posted: 8 Oct 2008, 14:36:20 UTC

Every time I get my RAC going almost to a new record, something drastic happens to the servers and down we go again. It has been like that for the last three months. Bummer.
ID: 815919 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 815929 - Posted: 8 Oct 2008, 15:05:24 UTC

Well, it's just now 8:00am in Berkeley....
Give Matt a little bit to do his stretching exercises before he limbers up his steel-toed boots to give bruno a kick.........
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 815929 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 815932 - Posted: 8 Oct 2008, 15:13:42 UTC

It appears that Matt has done his kick-start thing.........give it a bit for backed up traffic to clear a little and we should be OK again......
Uploads are working again, with some retries due to the traffic load.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 815932 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65734
Credit: 55,293,173
RAC: 49
United States
Message 815988 - Posted: 8 Oct 2008, 18:58:02 UTC - in response to Message 815929.  

Well, it's just now 8:00am in Berkeley....
Give Matt a little bit to do his stretching exercises before he limbers up his steel-toed boots to give bruno a kick.........

At least Matt didn't drop Half Dome onto the server (possibly the largest known single piece of granite in the world).

The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 815988 · Report as offensive
Swibby Bear

Joined: 1 Aug 01
Posts: 246
Credit: 7,945,093
RAC: 0
United States
Message 816066 - Posted: 9 Oct 2008, 0:28:57 UTC
Last modified: 9 Oct 2008, 1:07:12 UTC

Downloads not working!

(edit) Never mind! I rebooted and they downloaded fine. Sorry for the false alarm.
ID: 816066 · Report as offensive
DJStarfox

Joined: 23 May 01
Posts: 1066
Credit: 1,226,053
RAC: 2
United States
Message 816112 - Posted: 9 Oct 2008, 3:04:50 UTC - in response to Message 815988.  

At least Matt didn't drop Half Dome onto the server (possibly the largest known single piece of granite in the world).


Funny you should mention that. When I was in Yosemite, I took a picture of that mountain. Just then, my digital camera broke, and I haven't been able to take a picture since. :(
ID: 816112 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65734
Credit: 55,293,173
RAC: 49
United States
Message 816119 - Posted: 9 Oct 2008, 3:10:01 UTC - in response to Message 816112.  

At least Matt didn't drop Half Dome onto the server (possibly the largest known single piece of granite in the world).


Funny you should mention that. When I was in Yosemite, I took a picture of that mountain. Just then, my digital camera broke, and I haven't been able to take a picture since. :(

Half Dome probably used up any warranty or life the camera still had left. Poor, poor camera. It died at such a young age; it must have had a heart attack at seeing such sights. ;)
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 816119 · Report as offensive
Profile Misfit
Volunteer tester
Joined: 21 Jun 01
Posts: 21804
Credit: 2,815,091
RAC: 0
United States
Message 816161 - Posted: 9 Oct 2008, 4:46:22 UTC - in response to Message 815988.  

At least Matt didn't drop Half Dome on to the server

Are you sure? It used to be Full Dome.
me@rescam.org
ID: 816161 · Report as offensive
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13727
Credit: 208,696,464
RAC: 304
Australia
Message 816219 - Posted: 9 Oct 2008, 11:32:57 UTC


Anyone having download problems?
I've got a Work Unit that just won't download; it times out as soon as it starts to download.
Grant
Darwin NT
ID: 816219 · Report as offensive
Profile Jord
Volunteer tester
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 816221 - Posted: 9 Oct 2008, 11:35:56 UTC - in response to Message 816219.  

Try ipconfig /flushdns. That worked for me.
ID: 816221 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 816223 - Posted: 9 Oct 2008, 11:45:45 UTC - in response to Message 816219.  

Anyone having download problems?
I've got a Work Unit that just won't download; it times out as soon as it starts to download.

It looks as if download server 208.68.240.13 is misconfigured or has suffered an HTTP service failure, but download server 208.68.240.18 is working OK. See the "Users from Germany reporting bad route" thread.

So yes, flushing (or otherwise bypassing) DNS should do the trick.

Jord, the users in Germany are also reporting that they can't get HTTP access to BOINC resources either - would you happen to know whether 208.68.240.13 is one of the servers which SETI and BOINC share?
ID: 816223 · Report as offensive
Profile Jord
Volunteer tester
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 816225 - Posted: 9 Oct 2008, 11:52:06 UTC - in response to Message 816223.  

Jord, the users in Germany are also reporting that they can't get HTTP access to BOINC resources either - would you happen to know whether 208.68.240.13 is one of the servers which SETI and BOINC share?

No, I don't know any of the IP addresses of the servers by heart, but the BOINC server sits in the same cabinet as the Seti servers, so if they can't reach one, they won't reach the other either.

Doing a trace route to boinc.berkeley.edu and to setiathome.berkeley.edu gives me a timeout on the server after g6-1.inr-230-spr.Berkeley.EDU [128.32.255.110] as well. It does get there though and I can navigate to the sites (as you can see from my answer).
ID: 816225 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 816229 - Posted: 9 Oct 2008, 12:26:14 UTC - in response to Message 816225.  

Jord, the users in Germany are also reporting that they can't get HTTP access to BOINC resources either - would you happen to know whether 208.68.240.13 is one of the servers which SETI and BOINC share?

No, I don't know any of the IP addresses of the servers by heart, but the BOINC server sits in the same cabinet as the Seti servers, so if they can't reach one, they won't reach the other either.

Doing a trace route to boinc.berkeley.edu and to setiathome.berkeley.edu gives me a timeout on the server after g6-1.inr-230-spr.Berkeley.EDU [128.32.255.110] as well. It does get there though and I can navigate to the sites (as you can see from my answer).

The Germans now reckon that it's some kind of peering problem in Germany. People going via N-IX can see Berkeley resources; people going via DE-CIX can't.

But they confirm SETI currently has an HTTP service problem on 208.68.240.13
ID: 816229 · Report as offensive
Swibby Bear

Joined: 1 Aug 01
Posts: 246
Credit: 7,945,093
RAC: 0
United States
Message 816244 - Posted: 9 Oct 2008, 13:26:17 UTC
Last modified: 9 Oct 2008, 13:27:30 UTC

Same HTTP problem encountered from Eastern USA (Pennsylvania)
ID: 816244 · Report as offensive
QSilver

Joined: 26 May 99
Posts: 232
Credit: 6,452,764
RAC: 0
United States
Message 816248 - Posted: 9 Oct 2008, 13:34:55 UTC

Just reported about a dozen completed WUs and downloaded more than that in return. Everything went through at normal speeds.

[Chicago USA]

QS
ID: 816248 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 816249 - Posted: 9 Oct 2008, 13:47:51 UTC

There are two download servers, Bane and Vader.

One of them is working normally (208.68.240.18), the other one is snafu'd (208.68.240.13) - I don't know which is which.

It's the luck of the (DNS) draw which one you try to download from.
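
If you want to see which address your resolver hands out and whether each one answers, something along these lines might help. It is only a sketch: it checks that a TCP connection on port 80 opens, which does not prove the HTTP service behind it is healthy, and the function names are illustrative. The download host name is not given in this thread, so you would have to supply it yourself.

```python
import socket


def resolve_all(host, port=80):
    """Return every distinct IPv4 address the resolver gives for host."""
    infos = socket.getaddrinfo(host, port, socket.AF_INET, socket.SOCK_STREAM)
    return sorted({info[4][0] for info in infos})


def probe(addresses, port=80, timeout=5.0, connect=socket.create_connection):
    """Try a TCP connection to each address; map address -> True or False."""
    status = {}
    for address in addresses:
        try:
            connect((address, port), timeout).close()
            status[address] = True
        except OSError:
            # Refused, unreachable, or timed out: count it as not answering.
            status[address] = False
    return status
```

Calling probe(["208.68.240.13", "208.68.240.18"]) would test the two addresses mentioned above directly, bypassing DNS; probe(resolve_all(host)) tests whatever your resolver currently returns for a host name you supply.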
ID: 816249 · Report as offensive
Profile dancrista

Joined: 19 Dec 01
Posts: 3
Credit: 78,546
RAC: 0
Romania
Message 816255 - Posted: 9 Oct 2008, 14:14:04 UTC - in response to Message 816249.  
Last modified: 9 Oct 2008, 14:22:39 UTC

Edited: I'm getting no work from the server.

10/9/2008 5:18:30 PM|Einstein@Home|Message from server: No work sent
10/9/2008 5:18:30 PM|Einstein@Home|Message from server: Hierarchical all-sky pulsar search needs 11.52MB more disk space. You currently have 83.85 MB available and it needs 95.37 MB.
10/9/2008 5:18:30 PM|Einstein@Home|Message from server: Not enough disk space (only 87.9 MB free for BOINC). Review preferences for maximum disk space used.

I had an Astropulse task, and my preferences had a low disk usage setting.
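
The "needs 11.52MB more" figure in that message is just the difference between the two other numbers it quotes. A tiny Python check, using a made-up helper name and decimal arithmetic to avoid float rounding noise:

```python
from decimal import Decimal


def shortfall_mb(needed, available):
    """MB still missing before the task fits (0 if it already fits)."""
    return max(Decimal(needed) - Decimal(available), Decimal(0))


# Figures taken from the server message above.
print(shortfall_mb("95.37", "83.85"))
```

So raising the disk limit in preferences by at least that much should let the task through, assuming nothing else on the disk changes in the meantime.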
ID: 816255 · Report as offensive
 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.