Panic Mode On (9) Server problems

PhonAcq

Joined: 14 Apr 01
Posts: 1656
Credit: 30,658,217
RAC: 1
United States
Message 816264 - Posted: 9 Oct 2008, 14:32:37 UTC

Not the end of the world, but I have about 80 WUs that won't download to one of my clients. I continually get errors like this:

10/9/2008 7:30:17 AM|SETI@home|Temporarily failed download of 19au08ac.10227.3344.6.8.24: http error


Yesterday it took Matt coming in and giving something a steel-shank kick, or something like that, to remedy it. Not looking good for the weekend, when he is not there?
ID: 816264 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 816266 - Posted: 9 Oct 2008, 14:58:47 UTC - in response to Message 816264.  

Not the end of the world, but I have about 80 WUs that won't download to one of my clients. I continually get errors like this:

10/9/2008 7:30:17 AM|SETI@home|Temporarily failed download of 19au08ac.10227.3344.6.8.24: http error


Yesterday it took Matt coming in and giving something a steel-shank kick, or something like that, to remedy it. Not looking good for the weekend, when he is not there?

Looks like Matt's steel toe has done the job again......as for this weekend...(shudders).....
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 816266 · Report as offensive
zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65737
Credit: 55,293,173
RAC: 49
United States
Message 816268 - Posted: 9 Oct 2008, 15:01:43 UTC - in response to Message 816255.  

Edited:
Got the problem with no work from server.

10/9/2008 5:18:30 PM|Einstein@Home|Message from server: No work sent
10/9/2008 5:18:30 PM|Einstein@Home|Message from server: Hierarchical all-sky pulsar search needs 11.52MB more disk space. You currently have 83.85 MB available and it needs 95.37 MB.
10/9/2008 5:18:30 PM|Einstein@Home|Message from server: Not enough disk space (only 87.9 MB free for BOINC). Review preferences for maximum disk space used.

I had an Astropulse work unit, and in my preferences I had a low disk usage setting.

Einstein? This is the SETI@home forum, not Einstein.
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 816268 · Report as offensive
dancrista
Joined: 19 Dec 01
Posts: 3
Credit: 78,546
RAC: 0
Romania
Message 816309 - Posted: 9 Oct 2008, 16:59:13 UTC - in response to Message 816268.  

Yes Joker, I know that. SETI was my main project, but I started Einstein to see if I had the same problem. It seems the Einstein project gives me a proper error, while SETI did not report any error, just this:
"10/9/2008 4:38:11 PM|SETI@home|Message from server: No work sent"

ID: 816309 · Report as offensive
zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65737
Credit: 55,293,173
RAC: 49
United States
Message 816313 - Posted: 9 Oct 2008, 17:04:25 UTC - in response to Message 816309.  

Yes Joker, I know that. SETI was my main project, but I started Einstein to see if I had the same problem. It seems the Einstein project gives me a proper error, while SETI did not report any error, just this:
"10/9/2008 4:38:11 PM|SETI@home|Message from server: No work sent"

The fact that you're reporting about the servers at Einstein having trouble is interesting, but shouldn't you have mentioned it there and not here? It has nothing to do with the SETI servers to which this thread is dedicated, so mentioning it here is kinda useless to a staff that's only concerned with SETI and maybe BOINC.
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 816313 · Report as offensive
dancrista
Joined: 19 Dec 01
Posts: 3
Credit: 78,546
RAC: 0
Romania
Message 816320 - Posted: 9 Oct 2008, 17:36:25 UTC - in response to Message 816313.  
Last modified: 9 Oct 2008, 17:59:49 UTC

Dunno why, but I will say it one more time: I had a problem with SETI@home, which did not give me a proper error to solve the problem. As a test I started the Einstein project which, lucky me, has proper error reporting.
So, problem solved for me. If you want to discuss more, drop me a private message.
Happy crunching!
ID: 816320 · Report as offensive
zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65737
Credit: 55,293,173
RAC: 49
United States
Message 816332 - Posted: 9 Oct 2008, 18:02:35 UTC

Is anybody else having a problem downloading WU's? I can upload and such just fine.

10/9/2008 10:48:00 AM|SETI@home|[file_xfer] Started download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:48:02 AM|SETI@home|[file_xfer] Temporarily failed download of 18au08ag.29343.8661.14.8.254: HTTP error
10/9/2008 10:48:02 AM|SETI@home|Backing off 1 min 0 sec on download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:49:03 AM|SETI@home|[file_xfer] Started download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:49:04 AM|SETI@home|[file_xfer] Temporarily failed download of 18au08ag.29343.8661.14.8.254: HTTP error
10/9/2008 10:49:04 AM|SETI@home|Backing off 1 min 0 sec on download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:50:04 AM|SETI@home|[file_xfer] Started download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:50:05 AM|SETI@home|[file_xfer] Temporarily failed download of 18au08ag.29343.8661.14.8.254: HTTP error
10/9/2008 10:50:05 AM|SETI@home|Backing off 1 min 0 sec on download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:51:06 AM|SETI@home|[file_xfer] Started download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:51:07 AM|SETI@home|[file_xfer] Temporarily failed download of 18au08ag.29343.8661.14.8.254: HTTP error
10/9/2008 10:51:07 AM|SETI@home|Backing off 1 min 0 sec on download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:52:08 AM|SETI@home|[file_xfer] Started download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:52:10 AM|SETI@home|[file_xfer] Temporarily failed download of 18au08ag.29343.8661.14.8.254: HTTP error
10/9/2008 10:52:10 AM|SETI@home|Backing off 1 min 2 sec on download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:53:12 AM|SETI@home|[file_xfer] Started download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:53:14 AM|SETI@home|[file_xfer] Temporarily failed download of 18au08ag.29343.8661.14.8.254: HTTP error
10/9/2008 10:53:14 AM|SETI@home|Backing off 2 min 17 sec on download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:55:31 AM|SETI@home|[file_xfer] Started download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:55:33 AM|SETI@home|[file_xfer] Temporarily failed download of 18au08ag.29343.8661.14.8.254: HTTP error
10/9/2008 10:55:33 AM|SETI@home|Backing off 16 min 47 sec on download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:56:17 AM|SETI@home|[file_xfer] Started download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:56:18 AM|SETI@home|[file_xfer] Temporarily failed download of 18au08ag.29343.8661.14.8.254: HTTP error
10/9/2008 10:56:18 AM|SETI@home|Backing off 22 min 15 sec on download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:58:18 AM|SETI@home|[file_xfer] Started download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:58:20 AM|SETI@home|[file_xfer] Temporarily failed download of 18au08ag.29343.8661.14.8.254: HTTP error
10/9/2008 10:58:20 AM|SETI@home|Backing off 1 hr 24 min 3 sec on download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:58:23 AM|SETI@home|[file_xfer] Started download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:58:24 AM|SETI@home|[file_xfer] Temporarily failed download of 18au08ag.29343.8661.14.8.254: HTTP error
10/9/2008 10:58:24 AM|SETI@home|Backing off 3 hr 2 min 53 sec on download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:58:26 AM|SETI@home|[file_xfer] Started download of file 18au08ag.29343.8661.14.8.254
10/9/2008 10:58:27 AM|SETI@home|[file_xfer] Temporarily failed download of 18au08ag.29343.8661.14.8.254: HTTP error
10/9/2008 10:58:27 AM|SETI@home|Backing off 3 hr 42 min 35 sec on download of file 18au08ag.29343.8661.14.8.254
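
The log above shows the client waiting for growing, irregular intervals after each failed download. As a rough illustration only (this is not part of BOINC; the function name and the regular expression are assumptions made for this sketch), the back-off durations can be pulled out of such lines like this:

```python
import re

# Matches the duration in lines such as
# "...|Backing off 1 hr 24 min 3 sec on download of file ..."
# The "hr" and "min" parts are optional, as in the log above.
BACKOFF_RE = re.compile(
    r"Backing off (?:(\d+) hr )?(?:(\d+) min )?(\d+) sec on (?:download|upload)"
)

def backoff_seconds(line):
    """Return the back-off duration in seconds, or None if the line has none."""
    m = BACKOFF_RE.search(line)
    if m is None:
        return None
    hours, minutes, seconds = (int(g) if g else 0 for g in m.groups())
    return hours * 3600 + minutes * 60 + seconds

# Lines copied from the log above.
sample = [
    "10/9/2008 10:48:02 AM|SETI@home|Backing off 1 min 0 sec on download of file 18au08ag.29343.8661.14.8.254",
    "10/9/2008 10:53:14 AM|SETI@home|Backing off 2 min 17 sec on download of file 18au08ag.29343.8661.14.8.254",
    "10/9/2008 10:58:20 AM|SETI@home|Backing off 1 hr 24 min 3 sec on download of file 18au08ag.29343.8661.14.8.254",
    "10/9/2008 10:58:27 AM|SETI@home|Backing off 3 hr 42 min 35 sec on download of file 18au08ag.29343.8661.14.8.254",
]
durations = [backoff_seconds(line) for line in sample]
print(durations)
```

Lines without a "Backing off" entry (the "Started download" and "Temporarily failed" lines) simply return None, so the whole message log can be fed through the function.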
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 816332 · Report as offensive
Jord
Volunteer tester
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 816335 - Posted: 9 Oct 2008, 18:11:59 UTC - in response to Message 816313.  

The fact that you're reporting about the servers at Einstein having trouble is interesting

The Einstein servers do not have problems with giving out work, but crome has problems getting it until he changes the amount of free disk space on the drive his BOINC is on. It says it all in his messages. :-)

10/9/2008 5:18:30 PM|Einstein@Home|Message from server: No work sent
10/9/2008 5:18:30 PM|Einstein@Home|Message from server: Hierarchical all-sky pulsar search needs 11.52MB more disk space. You currently have 83.85 MB available and it needs 95.37 MB.
10/9/2008 5:18:30 PM|Einstein@Home|Message from server: Not enough disk space (only 87.9 MB free for BOINC). Review preferences for maximum disk space used.
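
The figures in the server message can be checked by hand: the shortfall is the space the task needs minus the space available. A tiny sketch with the values copied from the log above (the variable names are just illustrative):

```python
# Values copied from the Einstein@Home server message above, in MB.
needed_mb = 95.37
available_mb = 83.85

# The shortfall the server reports as "needs 11.52MB more disk space".
# Rounded to two decimals to hide floating-point noise.
shortfall_mb = round(needed_mb - available_mb, 2)
print(shortfall_mb)
```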

ID: 816335 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 816339 - Posted: 9 Oct 2008, 18:24:03 UTC - in response to Message 816332.  

Is anybody else having a problem downloading WU's? I can upload and such just fine.

Yes. The HTTP service on one of the download servers (208.68.240.13) has gone on strike. I thought it might be related to the access problems our German colleagues were having, so I reported it there.
ID: 816339 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 816344 - Posted: 9 Oct 2008, 18:44:15 UTC - in response to Message 816335.  

The Einstein servers do not have problems with giving out work, ...

but the SETI servers have problems giving out error messages.

Since SETI is notoriously the testbed for new server versions, and Einstein is conservative by comparison, is this a sign of things to come in BOINC-land? Guess your own error message?
ID: 816344 · Report as offensive
Fred J. Verster
Volunteer tester
Joined: 21 Apr 04
Posts: 3252
Credit: 31,903,643
RAC: 0
Netherlands
Message 816351 - Posted: 9 Oct 2008, 19:01:08 UTC - in response to Message 816344.  
Last modified: 9 Oct 2008, 19:01:48 UTC

Good afternoon, ladies and gentlemen,
Yes, Einstein is fine. Concerning uploading, it takes some time before ready-to-send WUs get uploaded. No big deal.
Matt has explained that in his last post, or the one before, in the Technical thread.
Sometimes I have hundreds of finished WUs waiting to upload, or they 'say' uploading, but nothing happens. When I looked at the server status, most of the time the download server (Bruno) was disabled, meaning there is probably someone there who's going to kick it ;).
ID: 816351 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 816355 - Posted: 9 Oct 2008, 19:31:17 UTC

Matt just PM'd me to say that he's just kicked the offending download server.

So now there's a massive spike in downloads, and I can't get my uploads through the noise....

Some days you just can't win. But thanks anyway, Matt.
ID: 816355 · Report as offensive
zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65737
Credit: 55,293,173
RAC: 49
United States
Message 816357 - Posted: 9 Oct 2008, 19:43:41 UTC - in response to Message 816355.  

Matt just PM'd me to say that he's just kicked the offending download server.

So now there's a massive spike in downloads, and I can't get my uploads through the noise....

Some days you just can't win. But thanks anyway, Matt.

Well, he may as well give Bruno an enema, as it's stuck too now.


10/9/2008 12:20:08 PM|SETI@home|[file_xfer] Temporarily failed upload of 18au08aa.17019.9070.8.8.218_1_0: connect() failed
10/9/2008 12:20:08 PM|SETI@home|Backing off 1 min 0 sec on upload of file 18au08aa.17019.9070.8.8.218_1_0
10/9/2008 12:20:10 PM||Access to reference site succeeded - project servers may be temporarily down.
10/9/2008 12:21:09 PM|SETI@home|[file_xfer] Started upload of file 18au08aa.17019.9070.8.8.218_1_0
10/9/2008 12:21:30 PM||Project communication failed: attempting access to reference site
10/9/2008 12:21:30 PM|SETI@home|[file_xfer] Temporarily failed upload of 18au08aa.17019.9070.8.8.218_1_0: connect() failed
10/9/2008 12:21:30 PM|SETI@home|Backing off 1 min 0 sec on upload of file 18au08aa.17019.9070.8.8.218_1_0
10/9/2008 12:21:31 PM||Access to reference site succeeded - project servers may be temporarily down.
10/9/2008 12:22:31 PM|SETI@home|[file_xfer] Started upload of file 18au08aa.17019.9070.8.8.218_1_0
10/9/2008 12:23:15 PM||Project communication failed: attempting access to reference site
10/9/2008 12:23:15 PM|SETI@home|[file_xfer] Temporarily failed upload of 18au08aa.17019.9070.8.8.218_1_0: connect() failed
10/9/2008 12:23:15 PM|SETI@home|Backing off 1 min 0 sec on upload of file 18au08aa.17019.9070.8.8.218_1_0
10/9/2008 12:23:16 PM||Access to reference site succeeded - project servers may be temporarily down.
10/9/2008 12:24:15 PM|SETI@home|[file_xfer] Started upload of file 18au08aa.17019.9070.8.8.218_1_0
10/9/2008 12:24:37 PM||Project communication failed: attempting access to reference site
10/9/2008 12:24:37 PM|SETI@home|[file_xfer] Temporarily failed upload of 18au08aa.17019.9070.8.8.218_1_0: connect() failed
10/9/2008 12:24:37 PM|SETI@home|Backing off 1 min 0 sec on upload of file 18au08aa.17019.9070.8.8.218_1_0
10/9/2008 12:24:40 PM||Access to reference site succeeded - project servers may be temporarily down.
10/9/2008 12:25:37 PM|SETI@home|[file_xfer] Started upload of file 18au08aa.17019.9070.8.8.218_1_0
10/9/2008 12:25:59 PM||Project communication failed: attempting access to reference site
10/9/2008 12:25:59 PM|SETI@home|[file_xfer] Temporarily failed upload of 18au08aa.17019.9070.8.8.218_1_0: connect() failed
10/9/2008 12:25:59 PM|SETI@home|Backing off 1 min 1 sec on upload of file 18au08aa.17019.9070.8.8.218_1_0
10/9/2008 12:26:01 PM||Access to reference site succeeded - project servers may be temporarily down.
10/9/2008 12:27:01 PM|SETI@home|[file_xfer] Started upload of file 18au08aa.17019.9070.8.8.218_1_0
10/9/2008 12:27:22 PM||Project communication failed: attempting access to reference site
10/9/2008 12:27:22 PM|SETI@home|[file_xfer] Temporarily failed upload of 18au08aa.17019.9070.8.8.218_1_0: connect() failed

The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 816357 · Report as offensive
Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Joined: 1 Mar 99
Posts: 1444
Credit: 957,058
RAC: 0
United States
Message 816370 - Posted: 9 Oct 2008, 20:19:00 UTC

Yeah - when one dam breaks, everything floods over. After kicking the one download server we hit our network bandwidth limit, so all servers are currently gasping for air (or bits, or whatever...).

- Matt
-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude
ID: 816370 · Report as offensive
zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65737
Credit: 55,293,173
RAC: 49
United States
Message 816385 - Posted: 9 Oct 2008, 20:40:00 UTC - in response to Message 816370.  

Yeah - when one dam breaks, everything floods over. After kicking the one download server we hit our network bandwidth limit, so all servers are currently gasping for air (or bits, or whatever...).

- Matt

Not enough lung power, OK. I wish I could help with that. I truly do.
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 816385 · Report as offensive
Eric Findley
Joined: 28 Mar 03
Posts: 72
Credit: 8,674,945
RAC: 0
United States
Message 816392 - Posted: 9 Oct 2008, 20:50:20 UTC

10/9/2008 4:48:33 PM||Internet access OK - project servers may be temporarily down.
Not uploading at present here.
ID: 816392 · Report as offensive
BroncoBob9
Joined: 29 May 03
Posts: 62
Credit: 2,443,241
RAC: 0
United States
Message 816407 - Posted: 9 Oct 2008, 21:19:37 UTC - in response to Message 816392.  

10/9/2008 4:48:33 PM||Internet access OK - project servers may be temporarily down.
Not uploading at present here.


Keep trying. The bandwidth is just maxed out right now. It will ease up soon enough.
ID: 816407 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 816479 - Posted: 10 Oct 2008, 0:19:53 UTC

Looks like the worst of the backlog is over now - all my downloads have cleared.

So could anyone with continuing stalled downloads - especially if you're running a recent BOINC v6.2.18 or v6.2.19 - help me with a bit of testing before you restart BOINC, please?

Could you make a cc_config.xml file to debug HTTP transfers - this should do it:

<cc_config>
  <log_flags>
    <task>1</task>
    <file_xfer>1</file_xfer>
    <sched_ops>1</sched_ops>
    <http_debug>1</http_debug>
  </log_flags>
</cc_config>

Put it in the BOINC (data) folder, and read it in by going to advanced view, advanced menu, 'read config file'. Then retry one of the stalled downloads, and have a look in the messages tab.
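
If the client seems to ignore the file, it may be worth checking that it is well-formed XML before re-reading it. A minimal sketch, assuming nothing about BOINC beyond the file contents shown above (the helper name `enabled_log_flags` is made up for this example, and the text is pasted in as a string rather than read from your data folder):

```python
import xml.etree.ElementTree as ET

# The cc_config.xml contents from the post above.
CONFIG_TEXT = """<cc_config>
<log_flags>
<task>1</task>
<file_xfer>1</file_xfer>
<sched_ops>1</sched_ops>
<http_debug>1</http_debug>
</log_flags>
</cc_config>"""

def enabled_log_flags(xml_text):
    """Parse cc_config text and return the names of log flags set to 1."""
    # ET.fromstring raises xml.etree.ElementTree.ParseError on malformed XML,
    # e.g. a missing closing tag.
    root = ET.fromstring(xml_text)
    log_flags = root.find("log_flags")
    if log_flags is None:
        return []
    return [flag.tag for flag in log_flags if (flag.text or "").strip() == "1"]

print(enabled_log_flags(CONFIG_TEXT))
```

A typo such as a missing `</log_flags>` makes the parse step fail straight away, which is easier to spot than a config file that silently does nothing.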

I'm interested in the IP address that BOINC tries to reach when it retries a failed download. For SETI, it should be 208.68.240.13 or 208.68.240.18: but I've found three machines today which were trying 13.240.68.208 or 18.240.68.208 - i.e. the four octets of the IP address were in reverse order.
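
For anyone sifting through a lot of http_debug output, spotting a reversed address can be automated. A small sketch, not taken from BOINC itself: the two expected addresses come from this post, while the function names and the "other" test address are assumptions for illustration.

```python
import ipaddress

# Download server addresses named in the post above.
EXPECTED = {"208.68.240.13", "208.68.240.18"}

def reverse_octets(ip):
    """Return the dotted-quad address with its four octets in reverse order."""
    ipaddress.IPv4Address(ip)  # raises ValueError if ip is not valid IPv4
    return ".".join(reversed(ip.split(".")))

def classify(ip):
    """Label an address as 'expected', 'reversed', or 'other'."""
    if ip in EXPECTED:
        return "expected"
    if reverse_octets(ip) in EXPECTED:
        return "reversed"
    return "other"

for ip in ["208.68.240.13", "13.240.68.208", "18.240.68.208", "10.0.0.1"]:
    print(ip, classify(ip))
```

An address flagged as "reversed" is the symptom described here; "other" just means the client contacted something that is neither of the two download servers.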

All of mine were on BOINC v5.10.13, which won't interest the developers - but we should try and find out if the bug is still present in current versions. Feel free to contribute in the DNS caching.... thread on the BOINC Development message board - there are examples there of the sort of output I'm looking for.

Thanks.
ID: 816479 · Report as offensive
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13727
Credit: 208,696,464
RAC: 304
Australia
Message 816572 - Posted: 10 Oct 2008, 6:11:52 UTC - in response to Message 816221.  

Try an ipconfig /flushdns .. that worked for me.

Thanks.
I had to go to work, and by the time I'd got back Matt had sorted the server out & everything else had sorted itself out.
Grant
Darwin NT
ID: 816572 · Report as offensive
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13727
Credit: 208,696,464
RAC: 304
Australia
Message 816994 - Posted: 11 Oct 2008, 9:00:56 UTC - in response to Message 816572.  


Disk space problems again?
The Ready to Send Queue dropped down to zero for quite some time before the splitters picked up the pace. And now there must be another load of short Work Units going out as even at more than 20 results per second the buffer isn't growing.
Grant
Darwin NT
ID: 816994 · Report as offensive


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.