Panic Mode On (9) Server problems


PhonAcq
Joined: 14 Apr 01
Posts: 1622
Credit: 21,568,346
RAC: 2,623
United States
Message 816264 - Posted: 9 Oct 2008, 14:32:37 UTC

Not the end of the world, but I have about 80 wu's that won't download to one of my clients. Continually get errors like this.

10/9/2008 7:30:17 AM|SETI@home|Temporarily failed download of 19au08ac.10227.3344.6.8.24: http error


Yesterday, it took Matt coming in and giving something the steel-shank kick, or something, to remedy it. Not looking good for the weekend when he is not there??

msattler
Volunteer tester
Joined: 9 Jul 00
Posts: 37288
Credit: 498,727,876
RAC: 501,712
United States
Message 816266 - Posted: 9 Oct 2008, 14:58:47 UTC - in response to Message 816264.

Not the end of the world, but I have about 80 wu's that won't download to one of my clients. Continually get errors like this.

10/9/2008 7:30:17 AM|SETI@home|Temporarily failed download of 19au08ac.10227.3344.6.8.24: http error


Yesterday, it took Matt coming in and giving something the steel-shank kick, or something, to remedy it. Not looking good for the weekend when he is not there??

Looks like Matt's steel toe has done the job again......as for this weekend...(shudders).....
____________
******************
Crunching Seti, loving all of God's kitties.

I have met a few friends in my life.
Most were cats.

zoom314
Joined: 30 Nov 03
Posts: 44541
Credit: 35,405,303
RAC: 9,041
Message 816268 - Posted: 9 Oct 2008, 15:01:43 UTC - in response to Message 816255.

Edited:
Got the problem with no work from server.

10/9/2008 5:18:30 PM|Einstein@Home|Message from server: No work sent
10/9/2008 5:18:30 PM|Einstein@Home|Message from server: Hierarchical all-sky pulsar search needs 11.52MB more disk space. You currently have 83.85 MB available and it needs 95.37 MB.
10/9/2008 5:18:30 PM|Einstein@Home|Message from server: Not enough disk space (only 87.9 MB free for BOINC). Review preferences for maximum disk space used.

I had an Astropulse work unit, and in preferences I had a low disk usage setting.

Einstein? This is the Seti@Home forum, Not Einstein.
____________

Profile dancrista
Joined: 19 Dec 01
Posts: 3
Credit: 78,546
RAC: 0
Romania
Message 816309 - Posted: 9 Oct 2008, 16:59:13 UTC - in response to Message 816268.

Yes Joker, I know that. Seti was my main project, but I started Einstein to see if I had the same problem. It seems the Einstein project gave me the right error; Seti did not report any error, just this:
"10/9/2008 4:38:11 PM|SETI@home|Message from server: No work sent"

____________

zoom314
Joined: 30 Nov 03
Posts: 44541
Credit: 35,405,303
RAC: 9,041
Message 816313 - Posted: 9 Oct 2008, 17:04:25 UTC - in response to Message 816309.

Yes Joker, I know that. Seti was my main project, but I started Einstein to see if I had the same problem. It seems the Einstein project gave me the right error; Seti did not report any error, just this:
"10/9/2008 4:38:11 PM|SETI@home|Message from server: No work sent"

The fact that You're reporting about the servers at Einstein having trouble is interesting, but shouldn't You have mentioned It there and not here? It has nothing to do with the Seti servers to which this thread is dedicated, and mentioning It here is kinda useless to a staff that's only concerned with Seti and maybe Boinc.
____________

Profile dancrista
Joined: 19 Dec 01
Posts: 3
Credit: 78,546
RAC: 0
Romania
Message 816320 - Posted: 9 Oct 2008, 17:36:25 UTC - in response to Message 816313.
Last modified: 9 Oct 2008, 17:59:49 UTC

Dunno why, but I will say it one more time: I had a problem with seti@home!!! It did not give me a proper error to solve the problem, so as a test I started the Einstein project, which, lucky me, has proper error reporting.
So, problem solved for me. If you want to discuss more, drop me a private message.
Happy crunching !
____________

zoom314
Joined: 30 Nov 03
Posts: 44541
Credit: 35,405,303
RAC: 9,041
Message 816332 - Posted: 9 Oct 2008, 18:02:35 UTC

Is anybody else having a problem downloading WU's? I can upload and such just fine.

10/9/2008 10:48:00 AM|SETI@home|[file_xfer] Started download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:48:02 AM|SETI@home|[file_xfer] Temporarily failed download of 18au08ag.29343.8661.14.8.254: HTTP error
10/9/2008 10:48:02 AM|SETI@home|Backing off 1 min 0 sec on download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:49:03 AM|SETI@home|[file_xfer] Started download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:49:04 AM|SETI@home|[file_xfer] Temporarily failed download of 18au08ag.29343.8661.14.8.254: HTTP error
10/9/2008 10:49:04 AM|SETI@home|Backing off 1 min 0 sec on download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:50:04 AM|SETI@home|[file_xfer] Started download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:50:05 AM|SETI@home|[file_xfer] Temporarily failed download of 18au08ag.29343.8661.14.8.254: HTTP error
10/9/2008 10:50:05 AM|SETI@home|Backing off 1 min 0 sec on download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:51:06 AM|SETI@home|[file_xfer] Started download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:51:07 AM|SETI@home|[file_xfer] Temporarily failed download of 18au08ag.29343.8661.14.8.254: HTTP error
10/9/2008 10:51:07 AM|SETI@home|Backing off 1 min 0 sec on download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:52:08 AM|SETI@home|[file_xfer] Started download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:52:10 AM|SETI@home|[file_xfer] Temporarily failed download of 18au08ag.29343.8661.14.8.254: HTTP error
10/9/2008 10:52:10 AM|SETI@home|Backing off 1 min 2 sec on download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:53:12 AM|SETI@home|[file_xfer] Started download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:53:14 AM|SETI@home|[file_xfer] Temporarily failed download of 18au08ag.29343.8661.14.8.254: HTTP error
10/9/2008 10:53:14 AM|SETI@home|Backing off 2 min 17 sec on download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:55:31 AM|SETI@home|[file_xfer] Started download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:55:33 AM|SETI@home|[file_xfer] Temporarily failed download of 18au08ag.29343.8661.14.8.254: HTTP error
10/9/2008 10:55:33 AM|SETI@home|Backing off 16 min 47 sec on download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:56:17 AM|SETI@home|[file_xfer] Started download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:56:18 AM|SETI@home|[file_xfer] Temporarily failed download of 18au08ag.29343.8661.14.8.254: HTTP error
10/9/2008 10:56:18 AM|SETI@home|Backing off 22 min 15 sec on download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:58:18 AM|SETI@home|[file_xfer] Started download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:58:20 AM|SETI@home|[file_xfer] Temporarily failed download of 18au08ag.29343.8661.14.8.254: HTTP error
10/9/2008 10:58:20 AM|SETI@home|Backing off 1 hr 24 min 3 sec on download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:58:23 AM|SETI@home|[file_xfer] Started download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:58:24 AM|SETI@home|[file_xfer] Temporarily failed download of 18au08ag.29343.8661.14.8.254: HTTP error
10/9/2008 10:58:24 AM|SETI@home|Backing off 3 hr 2 min 53 sec on download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:58:26 AM|SETI@home|[file_xfer] Started download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:58:27 AM|SETI@home|[file_xfer] Temporarily failed download of 18au08ag.29343.8661.14.8.254: HTTP error
10/9/2008 10:58:27 AM|SETI@home|Backing off 3 hr 42 min 35 sec on download of file 18au08ag.29343.8661.14.8.254
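
For anyone who wants to see how quickly the retry delays in a log like this grow, here is a rough Python sketch that pulls the "Backing off ..." duration out of each line and converts it to seconds. The function name and the regular expression are my own, written only against the message wording shown above; other BOINC versions may phrase the line differently.

```python
import re

# Matches the duration in lines such as
# "...|Backing off 1 hr 24 min 3 sec on download of file ..."
# Assumption: the duration always ends in "N sec", with optional
# "N hr" and "N min" parts in front, as in the log above.
BACKOFF_RE = re.compile(
    r"Backing off (?:(\d+) hr )?(?:(\d+) min )?(\d+) sec on (?:download|upload)"
)


def backoff_seconds(line):
    """Return the back-off delay in seconds, or None if the line has none."""
    match = BACKOFF_RE.search(line)
    if match is None:
        return None
    hours, minutes, seconds = (int(g) if g else 0 for g in match.groups())
    return hours * 3600 + minutes * 60 + seconds


# Four lines copied from the log above.
sample = [
    "10/9/2008 10:52:10 AM|SETI@home|Backing off 1 min 2 sec on download of file 18au08ag.29343.8661.14.8.254",
    "10/9/2008 10:53:14 AM|SETI@home|Backing off 2 min 17 sec on download of file 18au08ag.29343.8661.14.8.254",
    "10/9/2008 10:58:20 AM|SETI@home|Backing off 1 hr 24 min 3 sec on download of file 18au08ag.29343.8661.14.8.254",
    "10/9/2008 10:58:27 AM|SETI@home|Backing off 3 hr 42 min 35 sec on download of file 18au08ag.29343.8661.14.8.254",
]

delays = [backoff_seconds(line) for line in sample]
print(delays)  # [62, 137, 5043, 13355]
```

Lines without a back-off message (the "Started download" and "Temporarily failed" ones) simply return None, so the whole log can be fed through it unfiltered.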
____________

Profile Ageless
Joined: 9 Jun 99
Posts: 12127
Credit: 2,519,625
RAC: 353
Netherlands
Message 816335 - Posted: 9 Oct 2008, 18:11:59 UTC - in response to Message 816313.

The fact that You're reporting about the servers at Einstein having trouble is interesting

The Einstein servers do not have problems with giving out work, but crome has problems getting it until he changes the amount of free disk space on the drive his BOINC is on. It says it all in his messages. :-)

10/9/2008 5:18:30 PM|Einstein@Home|Message from server: No work sent
10/9/2008 5:18:30 PM|Einstein@Home|Message from server: Hierarchical all-sky pulsar search needs 11.52MB more disk space. You currently have 83.85 MB available and it needs 95.37 MB.
10/9/2008 5:18:30 PM|Einstein@Home|Message from server: Not enough disk space (only 87.9 MB free for BOINC). Review preferences for maximum disk space used.
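
The numbers in the second message are consistent with each other: 95.37 MB needed minus 83.85 MB available is the 11.52 MB shortfall. A small Python sketch, assuming only the message wording quoted above (the function name and regular expression are mine, not anything from BOINC), that extracts the figures and rechecks the subtraction:

```python
import re

# Assumption: the message follows the exact wording quoted above.
SHORTFALL_RE = re.compile(
    r"needs ([\d.]+)\s*MB more disk space\. "
    r"You currently have ([\d.]+) MB available and it needs ([\d.]+) MB"
)


def disk_shortfall(message):
    """Return the reported and recomputed shortfall in MB, or None if no match."""
    match = SHORTFALL_RE.search(message)
    if match is None:
        return None
    reported, available, needed = (float(g) for g in match.groups())
    return {
        "available_mb": available,
        "needed_mb": needed,
        "reported_shortfall_mb": reported,
        # Round to 2 places to hide binary floating-point noise.
        "computed_shortfall_mb": round(needed - available, 2),
    }


msg = (
    "Hierarchical all-sky pulsar search needs 11.52MB more disk space. "
    "You currently have 83.85 MB available and it needs 95.37 MB."
)
info = disk_shortfall(msg)
print(info["computed_shortfall_mb"])  # 11.52
```

So freeing or allowing at least that much extra disk space in the preferences should be enough for that work unit to be sent.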

____________
Jord

Loving awareness is free.

Richard Haselgrove
Volunteer tester
Joined: 4 Jul 99
Posts: 8275
Credit: 44,935,241
RAC: 13,631
United Kingdom
Message 816339 - Posted: 9 Oct 2008, 18:24:03 UTC - in response to Message 816332.

Is anybody else having a problem downloading WU's? I can upload and such just fine.

Yes. The HTTP service on one of the download servers (208.68.240.13) has gone on strike. I thought it might be related to the access problems our German colleagues were having, so I reported it there.

Richard Haselgrove
Volunteer tester
Joined: 4 Jul 99
Posts: 8275
Credit: 44,935,241
RAC: 13,631
United Kingdom
Message 816344 - Posted: 9 Oct 2008, 18:44:15 UTC - in response to Message 816335.

The Einstein servers do not have problems with giving out work, ...

but the SETI servers have problems giving out error messages.

Since SETI is notoriously the testbed for new server versions, and Einstein is conservative by comparison, is this a sign of things to come in BOINC-land? Guess your own error message?

Profile Fred J. Verster
Volunteer tester
Joined: 21 Apr 04
Posts: 3232
Credit: 31,585,541
RAC: 0
Netherlands
Message 816351 - Posted: 9 Oct 2008, 19:01:08 UTC - in response to Message 816344.
Last modified: 9 Oct 2008, 19:01:48 UTC

Good afternoon, ladies and gentlemen,
Yes, EINSTEIN is fine. Concerning UPloading, it takes some time before ready-to-send WU's get UPloaded. No BIG deal.
Matt has explained that in his last post, or the one before, in the Technical Thread.
Sometimes I have hundreds of ready WU's waiting to UPLOAD, or they 'say' uploading, but nothing happens. When I looked @ the SERVER state, most of the time the download server (BRUNO) was DISABLED, meaning there's probably someone there who's going to kick it ;).
____________


Knight Who Says Ni N!, OUT numbered.................

Richard Haselgrove
Volunteer tester
Joined: 4 Jul 99
Posts: 8275
Credit: 44,935,241
RAC: 13,631
United Kingdom
Message 816355 - Posted: 9 Oct 2008, 19:31:17 UTC

Matt just PM'd me to say that he's just kicked the offending download server.

So now there's a massive spike in downloads, and I can't get my uploads through the noise....

Some days you just can't win. But thanks anyway, Matt.

zoom314
Joined: 30 Nov 03
Posts: 44541
Credit: 35,405,303
RAC: 9,041
Message 816357 - Posted: 9 Oct 2008, 19:43:41 UTC - in response to Message 816355.

Matt just PM'd me to say that he's just kicked the offending download server.

So now there's a massive spike in downloads, and I can't get my uploads through the noise....

Some days you just can't win. But thanks anyway, Matt.

Well He may as well give Bruno an enema as It's stuck too now.


10/9/2008 12:20:08 PM|SETI@home|[file_xfer] Temporarily failed upload of 18au08aa.17019.9070.8.8.218_1_0: connect() failed
10/9/2008 12:20:08 PM|SETI@home|Backing off 1 min 0 sec on upload of file 18au08aa.17019.9070.8.8.218_1_0
10/9/2008 12:20:10 PM||Access to reference site succeeded - project servers may be temporarily down.
10/9/2008 12:21:09 PM|SETI@home|[file_xfer] Started upload of file 18au08aa.17019.9070.8.8.218_1_0
10/9/2008 12:21:30 PM||Project communication failed: attempting access to reference site
10/9/2008 12:21:30 PM|SETI@home|[file_xfer] Temporarily failed upload of 18au08aa.17019.9070.8.8.218_1_0: connect() failed
10/9/2008 12:21:30 PM|SETI@home|Backing off 1 min 0 sec on upload of file 18au08aa.17019.9070.8.8.218_1_0
10/9/2008 12:21:31 PM||Access to reference site succeeded - project servers may be temporarily down.
10/9/2008 12:22:31 PM|SETI@home|[file_xfer] Started upload of file 18au08aa.17019.9070.8.8.218_1_0
10/9/2008 12:23:15 PM||Project communication failed: attempting access to reference site
10/9/2008 12:23:15 PM|SETI@home|[file_xfer] Temporarily failed upload of 18au08aa.17019.9070.8.8.218_1_0: connect() failed
10/9/2008 12:23:15 PM|SETI@home|Backing off 1 min 0 sec on upload of file 18au08aa.17019.9070.8.8.218_1_0
10/9/2008 12:23:16 PM||Access to reference site succeeded - project servers may be temporarily down.
10/9/2008 12:24:15 PM|SETI@home|[file_xfer] Started upload of file 18au08aa.17019.9070.8.8.218_1_0
10/9/2008 12:24:37 PM||Project communication failed: attempting access to reference site
10/9/2008 12:24:37 PM|SETI@home|[file_xfer] Temporarily failed upload of 18au08aa.17019.9070.8.8.218_1_0: connect() failed
10/9/2008 12:24:37 PM|SETI@home|Backing off 1 min 0 sec on upload of file 18au08aa.17019.9070.8.8.218_1_0
10/9/2008 12:24:40 PM||Access to reference site succeeded - project servers may be temporarily down.
10/9/2008 12:25:37 PM|SETI@home|[file_xfer] Started upload of file 18au08aa.17019.9070.8.8.218_1_0
10/9/2008 12:25:59 PM||Project communication failed: attempting access to reference site
10/9/2008 12:25:59 PM|SETI@home|[file_xfer] Temporarily failed upload of 18au08aa.17019.9070.8.8.218_1_0: connect() failed
10/9/2008 12:25:59 PM|SETI@home|Backing off 1 min 1 sec on upload of file 18au08aa.17019.9070.8.8.218_1_0
10/9/2008 12:26:01 PM||Access to reference site succeeded - project servers may be temporarily down.
10/9/2008 12:27:01 PM|SETI@home|[file_xfer] Started upload of file 18au08aa.17019.9070.8.8.218_1_0
10/9/2008 12:27:22 PM||Project communication failed: attempting access to reference site
10/9/2008 12:27:22 PM|SETI@home|[file_xfer] Temporarily failed upload of 18au08aa.17019.9070.8.8.218_1_0: connect() failed

____________

Profile Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Joined: 1 Mar 99
Posts: 1384
Credit: 74,079
RAC: 0
United States
Message 816370 - Posted: 9 Oct 2008, 20:19:00 UTC

Yeah - when one dam breaks, everything floods over. After kicking the one download server we hit our network bandwidth limit, so all servers are currently gasping for air (or bits, or whatever...).

- Matt
____________
-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude

zoom314
Joined: 30 Nov 03
Posts: 44541
Credit: 35,405,303
RAC: 9,041
Message 816385 - Posted: 9 Oct 2008, 20:40:00 UTC - in response to Message 816370.

Yeah - when one dam breaks, everything floods over. After kicking the one download server we hit our network bandwidth limit, so all servers are currently gasping for air (or bits, or whatever...).

- Matt

Not enough Lung power, Ok. I wish I could help with that. I truly do.
____________

Eric Findley
Joined: 28 Mar 03
Posts: 39
Credit: 2,884,749
RAC: 5,883
United States
Message 816392 - Posted: 9 Oct 2008, 20:50:20 UTC

10/9/2008 4:48:33 PM||Internet access OK - project servers may be temporarily down.
Not uploading at present here.
____________

Profile BroncoBob9
Joined: 29 May 03
Posts: 62
Credit: 2,443,241
RAC: 0
United States
Message 816407 - Posted: 9 Oct 2008, 21:19:37 UTC - in response to Message 816392.

10/9/2008 4:48:33 PM||Internet access OK - project servers may be temporarily down.
Not uploading at present here.


Keep trying. The bandwidth is just maxed out right now. It will ease up soon enough.
____________

Richard Haselgrove
Volunteer tester
Joined: 4 Jul 99
Posts: 8275
Credit: 44,935,241
RAC: 13,631
United Kingdom
Message 816479 - Posted: 10 Oct 2008, 0:19:53 UTC

Looks like the worst of the backlog is over now - all my downloads have cleared.

So could anyone with continuing stalled downloads - especially if you're running a recent BOINC v6.2.18 or v6.2.19 - help me with a bit of testing before you restart BOINC, please?

Could you make a cc_config.xml file to debug HTTP transfers - this should do it:

<cc_config>
  <log_flags>
    <task>1</task>
    <file_xfer>1</file_xfer>
    <sched_ops>1</sched_ops>
    <http_debug>1</http_debug>
  </log_flags>
</cc_config>

Put it in the BOINC (data) folder, and read it in by going to advanced view, advanced menu, 'read config file'. Then retry one of the stalled downloads, and have a look in the messages tab.
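
If typing the file by hand is a nuisance, something along these lines should produce the same cc_config.xml. It is only a sketch: the function name is made up, and the location of the BOINC data folder differs between installations, so the script takes the folder as an argument instead of guessing it. The demo writes to a throwaway temporary folder.

```python
import os
import tempfile
import xml.etree.ElementTree as ET

# The four log flags from the cc_config.xml shown above.
FLAGS = ["task", "file_xfer", "sched_ops", "http_debug"]


def write_cc_config(data_dir, flags=FLAGS):
    """Write cc_config.xml into data_dir with each listed flag set to 1."""
    root = ET.Element("cc_config")
    log_flags = ET.SubElement(root, "log_flags")
    for name in flags:
        ET.SubElement(log_flags, name).text = "1"
    path = os.path.join(data_dir, "cc_config.xml")
    ET.ElementTree(root).write(path)
    return path


# Demo in a temporary folder; point data_dir at your own BOINC data
# folder (wherever that is on your machine) for real use.
with tempfile.TemporaryDirectory() as demo_dir:
    written = write_cc_config(demo_dir)
    parsed_flags = {
        el.tag: el.text for el in ET.parse(written).getroot().find("log_flags")
    }

print(parsed_flags)
```

After writing the file, the 'read config file' step described above is still needed for a running client to pick it up.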

I'm interested in the IP address that BOINC tries to reach when it retries a failed download. For SETI, it should be 208.68.240.13 or 208.68.240.18: but I've found three machines today which were trying 13.240.68.208 or 18.240.68.208 - i.e. the IP numbers were in reverse order.
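
For anyone going through a lot of messages, here is a minimal Python sketch of the check: it picks out IPv4 addresses from any text and flags those that match one of the two expected addresses only when their four numbers are reversed. The function names are my own, and the sample text is made up for illustration; it is not real BOINC output.

```python
import re

# The two download-server addresses quoted in the post above.
EXPECTED = {"208.68.240.13", "208.68.240.18"}

# Loose IPv4 pattern: four dot-separated groups of 1 to 3 digits.
IPV4_RE = re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b")


def reverse_octets(ip):
    """Return the address with its four numbers in reverse order."""
    return ".".join(reversed(ip.split(".")))


def classify(ip, expected=EXPECTED):
    """Label an address as 'ok', 'reversed', or 'other'."""
    if ip in expected:
        return "ok"
    if reverse_octets(ip) in expected:
        return "reversed"
    return "other"


def scan(text):
    """Find every IPv4-looking string in text and classify it."""
    return [(ip, classify(ip)) for ip in IPV4_RE.findall(text)]


# Made-up text for illustration only; NOT real BOINC output.
sample = "tried 13.240.68.208, then 208.68.240.18, then 192.0.2.1"
results = scan(sample)
print(results)
# [('13.240.68.208', 'reversed'), ('208.68.240.18', 'ok'), ('192.0.2.1', 'other')]
```

Pasting the contents of the messages tab into `scan()` would list any address that shows up in reversed form.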

All of mine were on BOINC v5.10.13, which won't interest the developers - but we should try and find out if the bug is still present in current versions. Feel free to contribute in the DNS caching.... thread on the BOINC Development message board - there are examples there of the sort of output I'm looking for.

Thanks.

Grant (SSSF)
Joined: 19 Aug 99
Posts: 5562
Credit: 51,302,929
RAC: 39,695
Australia
Message 816572 - Posted: 10 Oct 2008, 6:11:52 UTC - in response to Message 816221.

Try an ipconfig /flushdns .. that worked for me.

Thanks.
I had to go to work, and by the time I'd got back Matt had sorted the server out & everything else had sorted itself out.
____________
Grant
Darwin NT.

Grant (SSSF)
Joined: 19 Aug 99
Posts: 5562
Credit: 51,302,929
RAC: 39,695
Australia
Message 816994 - Posted: 11 Oct 2008, 9:00:56 UTC - in response to Message 816572.


Disk space problems again?
The Ready to Send Queue dropped down to zero for quite some time before the splitters picked up the pace. And now there must be another load of short Work Units going out as even at more than 20 results per second the buffer isn't growing.
____________
Grant
Darwin NT.


Copyright © 2014 University of California