Panic Mode On (9) Server problems


zoom314
Project donor
Joined: 30 Nov 03
Posts: 46122
Credit: 36,593,924
RAC: 5,358
Message 815895 - Posted: 8 Oct 2008, 12:44:27 UTC

NASA, we have a problem with the uploads, and maybe with reporting too. I went to bed around midnight, about 5.5 hrs ago, and according to my logs it's been happening since 1:53 AM PDT here.

10/8/2008 1:53:35 AM|SETI@home|Scheduler RPC succeeded [server version 603]
10/8/2008 1:53:35 AM|SETI@home|Deferring communication for 11 sec
10/8/2008 1:53:35 AM|SETI@home|Reason: requested by project
10/8/2008 2:16:54 AM|SETI@home|Computation for task 17au08ae.13404.323159.13.8.69_1 finished
10/8/2008 2:16:54 AM|SETI@home|Restarting task 17au08ae.13404.323159.13.8.46_0 using setiathome_enhanced version 528
10/8/2008 2:16:58 AM|SETI@home|[file_xfer] Started upload of file 17au08ae.13404.323159.13.8.69_1_0
10/8/2008 2:17:00 AM||Project communication failed: attempting access to reference site
10/8/2008 2:17:00 AM|SETI@home|[file_xfer] Temporarily failed upload of 17au08ae.13404.323159.13.8.69_1_0: connect() failed
[snip]
10/8/2008 5:35:04 AM|SETI@home|[file_xfer] Started upload of file 22au08ac.19073.13160.3.8.75_0_0
10/8/2008 5:35:06 AM||Project communication failed: attempting access to reference site
10/8/2008 5:35:06 AM|SETI@home|[file_xfer] Temporarily failed upload of 22au08ac.19073.13160.3.8.75_0_0: connect() failed
10/8/2008 5:35:06 AM|SETI@home|Backing off 1 hr 25 min 0 sec on upload of file 22au08ac.19073.13160.3.8.75_0_0
10/8/2008 5:35:08 AM||Access to reference site succeeded - project servers may be temporarily down.
10/8/2008 5:35:14 AM|SETI@home|[file_xfer] Started upload of file 17au08ae.13404.323159.13.8.31_1_0
10/8/2008 5:35:16 AM||Project communication failed: attempting access to reference site
10/8/2008 5:35:17 AM||Access to reference site succeeded - project servers may be temporarily down.
10/8/2008 5:35:17 AM|SETI@home|[file_xfer] Temporarily failed upload of 17au08ae.13404.323159.13.8.31_1_0: connect() failed
10/8/2008 5:35:17 AM|SETI@home|Backing off 1 hr 36 min 12 sec on upload of file 17au08ae.13404.323159.13.8.31_1_0
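
For anyone who wants to tally these failures rather than eyeball them, here is a rough sketch in Python. It assumes the message log is plain text in the `timestamp|project|message` layout shown above; the function names are made up for illustration and are not part of BOINC.

```python
from collections import Counter
from datetime import datetime

MARKER = "Temporarily failed upload of "

def parse_line(line):
    """Split one 'timestamp|project|message' line into its three fields."""
    stamp, project, message = line.split("|", 2)
    when = datetime.strptime(stamp, "%m/%d/%Y %I:%M:%S %p")
    return when, project, message

def failed_uploads(lines):
    """Count 'Temporarily failed upload' messages per file name."""
    counts = Counter()
    for line in lines:
        if line.count("|") < 2:
            continue  # not a log line, e.g. "[snip]"
        _, _, message = parse_line(line)
        if MARKER in message:
            # The file name sits between the marker and the next colon.
            name = message.split(MARKER, 1)[1].split(":", 1)[0]
            counts[name] += 1
    return counts

# A few lines copied from the log above.
SAMPLE = [
    "10/8/2008 2:16:58 AM|SETI@home|[file_xfer] Started upload of file 17au08ae.13404.323159.13.8.69_1_0",
    "10/8/2008 2:17:00 AM||Project communication failed: attempting access to reference site",
    "10/8/2008 2:17:00 AM|SETI@home|[file_xfer] Temporarily failed upload of 17au08ae.13404.323159.13.8.69_1_0: connect() failed",
    "[snip]",
    "10/8/2008 5:35:06 AM|SETI@home|[file_xfer] Temporarily failed upload of 22au08ac.19073.13160.3.8.75_0_0: connect() failed",
    "10/8/2008 5:35:17 AM|SETI@home|[file_xfer] Temporarily failed upload of 17au08ae.13404.323159.13.8.31_1_0: connect() failed",
]

if __name__ == "__main__":
    for name, n in failed_uploads(SAMPLE).items():
        print(name, n)
```

Feeding it the whole log instead of `SAMPLE` would show which files have been retried most often.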
____________
My Facebook, War Commander, 2015

WinterKnight
Volunteer tester
Joined: 18 May 99
Posts: 8630
Credit: 23,729,131
RAC: 19,326
United Kingdom
Message 815905 - Posted: 8 Oct 2008, 13:34:08 UTC - in response to Message 815895.

See Server Status page.

upload server bruno Disabled

zoom314
Project donor
Joined: 30 Nov 03
Posts: 46122
Credit: 36,593,924
RAC: 5,358
Message 815916 - Posted: 8 Oct 2008, 14:30:47 UTC - in response to Message 815905.

See Server Status page.

upload server bruno Disabled


OK, that answers that. Now, as to the why?
____________
My Facebook, War Commander, 2015

Aristoteles Doukas
Joined: 11 Apr 08
Posts: 1091
Credit: 2,140,913
RAC: 0
Finland
Message 815919 - Posted: 8 Oct 2008, 14:36:20 UTC

Every time I get the RAC going almost to a new record, something drastic happens
to the servers and down we go again. It has been like that for the last three months. Bummer.

msattler
Project donor
Volunteer tester
Joined: 9 Jul 00
Posts: 38923
Credit: 578,755,438
RAC: 514,543
United States
Message 815929 - Posted: 8 Oct 2008, 15:05:24 UTC

Well, it's just now 8:00am in Berkeley....
Give Matt a little bit to do his stretching exercises before he limbers up his steel-toed boots to give bruno a kick.........
____________
*********************************************
Embrace your inner kitty...ya know ya wanna!

I have met a few friends in my life.
Most were cats.

msattler
Project donor
Volunteer tester
Joined: 9 Jul 00
Posts: 38923
Credit: 578,755,438
RAC: 514,543
United States
Message 815932 - Posted: 8 Oct 2008, 15:13:42 UTC

It appears that Matt has done his kick-start thing.........give it a bit for backed up traffic to clear a little and we should be OK again......
Uploads are working again, with some retries due to the traffic load.
____________
*********************************************
Embrace your inner kitty...ya know ya wanna!

I have met a few friends in my life.
Most were cats.

zoom314
Project donor
Joined: 30 Nov 03
Posts: 46122
Credit: 36,593,924
RAC: 5,358
Message 815988 - Posted: 8 Oct 2008, 18:58:02 UTC - in response to Message 815929.

Well, it's just now 8:00am in Berkeley....
Give Matt a little bit to do his stretching exercises before he limbers up his steel-toed boots to give bruno a kick.........

At least Matt didn't drop Half Dome on to the server (possibly the largest known single piece of granite in the world).

____________
My Facebook, War Commander, 2015

Swibby Bear
Joined: 1 Aug 01
Posts: 236
Credit: 7,276,138
RAC: 415
United States
Message 816066 - Posted: 9 Oct 2008, 0:28:57 UTC
Last modified: 9 Oct 2008, 1:07:12 UTC

Downloads not working!

(edit) Never mind! I rebooted and they downloaded fine. Sorry for the false alarm.

DJStarfox
Joined: 23 May 01
Posts: 1040
Credit: 544,758
RAC: 267
United States
Message 816112 - Posted: 9 Oct 2008, 3:04:50 UTC - in response to Message 815988.

At least Matt didn't drop Half Dome on to the server (possibly the largest known single piece of granite in the world).


Funny you should mention that. When I was in Yosemite, I took a picture of that mountain. Just then, my digital camera broke, and I haven't been able to take a picture since. :(

zoom314
Project donor
Joined: 30 Nov 03
Posts: 46122
Credit: 36,593,924
RAC: 5,358
Message 816119 - Posted: 9 Oct 2008, 3:10:01 UTC - in response to Message 816112.

At least Matt didn't drop Half Dome on to the server (possibly the largest known single piece of granite in the world).


Funny you should mention that. When I was in Yosemite, I took a picture of that mountain. Just then, my digital camera broke, and I haven't been able to take a picture since. :(

Half Dome probably used up any warranty or life the camera still had left. Poor, poor camera; it died at such a young age. It must have had a heart attack at seeing such sights. ;)
____________
My Facebook, War Commander, 2015

Misfit
Volunteer tester
Joined: 21 Jun 01
Posts: 21790
Credit: 2,510,901
RAC: 0
United States
Message 816161 - Posted: 9 Oct 2008, 4:46:22 UTC - in response to Message 815988.

At least Matt didn't drop Half Dome on to the server

Are you sure? It used to be Full Dome.
____________

Grant (SSSF)
Joined: 19 Aug 99
Posts: 5791
Credit: 58,036,200
RAC: 48,260
Australia
Message 816219 - Posted: 9 Oct 2008, 11:32:57 UTC


Anyone having download problems?
I've got a Work Unit that just won't download- it times out as soon as it starts to download.
____________
Grant
Darwin NT.

Ageless
Joined: 9 Jun 99
Posts: 12284
Credit: 2,575,473
RAC: 772
Netherlands
Message 816221 - Posted: 9 Oct 2008, 11:35:56 UTC - in response to Message 816219.

Try an ipconfig /flushdns .. that worked for me.
____________
Jord

Fighting for the correct use of the apostrophe, together with Weird Al Yankovic

Richard Haselgrove
Project donor
Volunteer tester
Joined: 4 Jul 99
Posts: 8465
Credit: 48,952,021
RAC: 75,553
United Kingdom
Message 816223 - Posted: 9 Oct 2008, 11:45:45 UTC - in response to Message 816219.

Anyone having download problems?
I've got a Work Unit that just won't download- it times out as soon as it starts to download.

It looks as if download server 208.68.240.13 is misconfigured or has suffered an HTTP service failure, but download server 208.68.240.18 is working OK - see the Users from Germany reporting bad route thread.

So yes, flushing (or otherwise bypassing) DNS should do the trick.

Jord, the users in Germany are also reporting that they can't get HTTP access to BOINC resources either - would you happen to know whether 208.68.240.13 is one of the servers which SETI and BOINC share?

Ageless
Joined: 9 Jun 99
Posts: 12284
Credit: 2,575,473
RAC: 772
Netherlands
Message 816225 - Posted: 9 Oct 2008, 11:52:06 UTC - in response to Message 816223.

Jord, the users in Germany are also reporting that they can't get HTTP access to BOINC resources either - would you happen to know whether 208.68.240.13 is one of the servers which SETI and BOINC share?

No, I don't know any of the IP addresses of the servers by heart, but the BOINC server sits in the same cabinet as the Seti servers, so if they can't reach one, they won't reach the other either.

Doing a trace route to boinc.berkeley.edu and to setiathome.berkeley.edu gives me a timeout on the server after g6-1.inr-230-spr.Berkeley.EDU [128.32.255.110] as well. It does get there though and I can navigate to the sites (as you can see from my answer).
____________
Jord

Fighting for the correct use of the apostrophe, together with Weird Al Yankovic

Richard Haselgrove
Project donor
Volunteer tester
Joined: 4 Jul 99
Posts: 8465
Credit: 48,952,021
RAC: 75,553
United Kingdom
Message 816229 - Posted: 9 Oct 2008, 12:26:14 UTC - in response to Message 816225.

Jord, the users in Germany are also reporting that they can't get HTTP access to BOINC resources either - would you happen to know whether 208.68.240.13 is one of the servers which SETI and BOINC share?

No, I don't know any of the IP addresses of the servers by heart, but the BOINC server sits in the same cabinet as the Seti servers, so if they can't reach one, they won't reach the other either.

Doing a trace route to boinc.berkeley.edu and to setiathome.berkeley.edu gives me a timeout on the server after g6-1.inr-230-spr.Berkeley.EDU [128.32.255.110] as well. It does get there though and I can navigate to the sites (as you can see from my answer).

The Germans now reckon that it's some kind of peering problem in Germany. People going via N-IX can see Berkeley resources: people going via DE-CIX can't.

But they confirm SETI currently has an HTTP service problem on 208.68.240.13

Swibby Bear
Joined: 1 Aug 01
Posts: 236
Credit: 7,276,138
RAC: 415
United States
Message 816244 - Posted: 9 Oct 2008, 13:26:17 UTC
Last modified: 9 Oct 2008, 13:27:30 UTC

Same HTTP problem encountered from Eastern USA (Pennsylvania)

QSilver
Joined: 26 May 99
Posts: 228
Credit: 4,590,400
RAC: 3,033
United States
Message 816248 - Posted: 9 Oct 2008, 13:34:55 UTC

Just reported about a dozen completed WUs and downloaded more than that in return. Everything went through at normal speeds.

[Chicago USA]

QS

Richard Haselgrove
Project donor
Volunteer tester
Joined: 4 Jul 99
Posts: 8465
Credit: 48,952,021
RAC: 75,553
United Kingdom
Message 816249 - Posted: 9 Oct 2008, 13:47:51 UTC

There are two download servers, Bane and Vader.

One of them is working normally (208.68.240.18), the other one is snafu'd (208.68.240.13) - I don't know which is which.

It's the luck of the (DNS) draw which one you try to download from.
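
To see which address the DNS draw could hand you, and which of them actually answers, something like the sketch below might help. It is only an illustration: the host name is whatever you pass in (none is given in this thread), the function names are made up, and a plain TCP connect on port 80 is a cruder check than a real HTTP request, so a server with a broken HTTP service could still pass it.

```python
import socket

def resolve_all(hostname, port=80):
    """Return every distinct IPv4 address the resolver gives for hostname."""
    infos = socket.getaddrinfo(hostname, port, socket.AF_INET, socket.SOCK_STREAM)
    return sorted({info[4][0] for info in infos})

def tcp_probe(address, port=80, timeout=5.0):
    """True if a TCP connection to address:port can be opened in time."""
    try:
        with socket.create_connection((address, port), timeout=timeout):
            return True
    except OSError:
        return False

def split_by_health(addresses, probe=tcp_probe):
    """Sort addresses into (reachable, unreachable) lists using probe."""
    good, bad = [], []
    for address in addresses:
        (good if probe(address) else bad).append(address)
    return good, bad

if __name__ == "__main__":
    # The two download server addresses mentioned in this thread.
    servers = ["208.68.240.13", "208.68.240.18"]
    good, bad = split_by_health(servers)
    print("reachable:", good)
    print("unreachable:", bad)
```

The `probe` argument is there so the check can be swapped for a proper HTTP request, or for a stub when no network is available.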

dancrista
Joined: 19 Dec 01
Posts: 3
Credit: 78,546
RAC: 0
Romania
Message 816255 - Posted: 9 Oct 2008, 14:14:04 UTC - in response to Message 816249.
Last modified: 9 Oct 2008, 14:22:39 UTC

Edited:
Got the problem with no work from server.

10/9/2008 5:18:30 PM|Einstein@Home|Message from server: No work sent
10/9/2008 5:18:30 PM|Einstein@Home|Message from server: Hierarchical all-sky pulsar search needs 11.52MB more disk space. You currently have 83.85 MB available and it needs 95.37 MB.
10/9/2008 5:18:30 PM|Einstein@Home|Message from server: Not enough disk space (only 87.9 MB free for BOINC). Review preferences for maximum disk space used.

I had an Astropulse work unit, and my preferences had a low disk usage setting.
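
The first two figures in the server message are consistent with each other: the shortfall is just the space needed minus the space available. A tiny Python check, with a made-up function name:

```python
def disk_shortfall_mb(needed_mb, available_mb):
    """Extra disk space required, in MB; zero if there is already enough."""
    return max(0.0, needed_mb - available_mb)

if __name__ == "__main__":
    # Figures taken from the Einstein@Home message above.
    print(f"{disk_shortfall_mb(95.37, 83.85):.2f} MB more needed")
```

Raising the maximum disk space in the preferences by at least that much should let the request through, assuming nothing else fills the disk in the meantime.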
____________


Copyright © 2014 University of California