Panic Mode On (80) Server Problems?

Keith White
Joined: 29 May 99
Posts: 392
Credit: 13,035,233
RAC: 22
United States
Message 1329530 - Posted: 20 Jan 2013, 19:08:51 UTC

The cricket graph for packet count shows it's getting ... wonky again.
"Life is just nature's way of keeping meat fresh." - The Doctor
ID: 1329530 · Report as offensive
TBar
Volunteer tester

Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1329539 - Posted: 20 Jan 2013, 19:33:11 UTC
Last modified: 20 Jan 2013, 19:56:05 UTC

So, what's a "permanent HTTP error"? First time I've seen that error when trying to use a proxy. Now if you choose the wrong proxy your download is terminated and listed as an Error? Is this new with 7.0.44? It's always something...
1/20/2013 2:15:45 PM |  | Using proxy info from GUI
1/20/2013 2:15:45 PM |  | Using HTTP proxy 165.24.5.219:8080
1/20/2013 2:16:15 PM |  | Suspending network activity - user request
1/20/2013 2:16:20 PM |  | Resuming network activity
1/20/2013 2:16:20 PM | SETI@home | Started download of ap_25jn12ad_B4_P0_00126_20130120_04307.wu
1/20/2013 2:16:20 PM | SETI@home | Started download of ap_25jn12ad_B4_P1_00162_20130120_04699.wu
1/20/2013 2:16:20 PM | SETI@home | Started download of ap_25jn12ad_B4_P0_00160_20130120_04307.wu
1/20/2013 2:16:20 PM | SETI@home | Started download of ap_25jn12ad_B4_P0_00235_20130120_04307.wu
1/20/2013 2:16:21 PM | SETI@home | Giving up on download of ap_25jn12ad_B4_P0_00126_20130120_04307.wu: permanent HTTP error
1/20/2013 2:16:21 PM | SETI@home | Giving up on download of ap_25jn12ad_B4_P1_00162_20130120_04699.wu: permanent HTTP error
1/20/2013 2:16:21 PM | SETI@home | Giving up on download of ap_25jn12ad_B4_P0_00160_20130120_04307.wu: permanent HTTP error
1/20/2013 2:16:21 PM | SETI@home | Giving up on download of ap_25jn12ad_B4_P0_00235_20130120_04307.wu: permanent HTTP error
1/20/2013 2:16:21 PM | SETI@home | Started download of ap_25jn12ad_B3_P1_00315_20130120_30148.wu
1/20/2013 2:16:21 PM | SETI@home | Started download of ap_29no12ae_B0_P1_00146_20130120_12722.wu
1/20/2013 2:16:23 PM | SETI@home | Giving up on download of ap_25jn12ad_B3_P1_00315_20130120_30148.wu: permanent HTTP error
1/20/2013 2:16:23 PM | SETI@home | Giving up on download of ap_29no12ae_B0_P1_00146_20130120_12722.wu: permanent HTTP error
...


1/20/2013 2:21:11 PM |  | Using proxy info from GUI
1/20/2013 2:21:11 PM |  | Using HTTP proxy 178.18.17.250:3128
1/20/2013 2:21:22 PM |  | Suspending network activity - user request
1/20/2013 2:21:28 PM |  | Resuming network activity
1/20/2013 2:21:28 PM | SETI@home | Started download of ap_25jn12ad_B4_P0_00278_20130120_04307.wu
1/20/2013 2:21:28 PM | SETI@home | Started download of ap_25jn12ad_B4_P1_00282_20130120_04699.wu
1/20/2013 2:21:28 PM | SETI@home | Started download of ap_24my12ab_B4_P0_00202_20130119_05992.wu
1/20/2013 2:21:30 PM | SETI@home | Giving up on download of ap_25jn12ad_B4_P0_00278_20130120_04307.wu: permanent HTTP error
1/20/2013 2:21:30 PM | SETI@home | Giving up on download of ap_25jn12ad_B4_P1_00282_20130120_04699.wu: permanent HTTP error
1/20/2013 2:21:30 PM | SETI@home | Giving up on download of ap_24my12ab_B4_P0_00202_20130119_05992.wu: permanent HTTP error
...


And...I was just about to say how great things were working with 7.0.44. You can rack up a lot of Errors quickly this way.
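For anyone who wants to tally how many downloads a bad proxy cost them, here is a minimal Python sketch (not part of BOINC itself) that scans event-log text in the format pasted above and collects every "Giving up on download" line. The function names and the regular expression are illustrative assumptions based only on the log lines shown in this post; other client versions may word the messages differently.

```python
import re
from collections import Counter

# Matches event-log lines such as:
# 1/20/2013 2:16:21 PM | SETI@home | Giving up on download of <file>: permanent HTTP error
# ASSUMPTION: the "timestamp | project | message" layout seen in the pasted log.
GIVE_UP = re.compile(
    r"^(?P<stamp>[^|]+?)\s*\|\s*(?P<project>[^|]*?)\s*\|\s*"
    r"Giving up on download of (?P<file>\S+): (?P<reason>.+?)\s*$"
)


def failed_downloads(log_text):
    """Return (timestamp, project, file name, reason) for each abandoned download."""
    failures = []
    for line in log_text.splitlines():
        match = GIVE_UP.match(line.strip())
        if match:
            failures.append(
                (match["stamp"], match["project"], match["file"], match["reason"])
            )
    return failures


def failures_by_reason(log_text):
    """Count abandoned downloads per reason string."""
    return Counter(reason for _, _, _, reason in failed_downloads(log_text))


# Three lines copied from the log above.
sample = """\
1/20/2013 2:16:20 PM | SETI@home | Started download of ap_25jn12ad_B4_P0_00126_20130120_04307.wu
1/20/2013 2:16:21 PM | SETI@home | Giving up on download of ap_25jn12ad_B4_P0_00126_20130120_04307.wu: permanent HTTP error
1/20/2013 2:16:21 PM | SETI@home | Giving up on download of ap_25jn12ad_B4_P1_00162_20130120_04699.wu: permanent HTTP error
"""

for stamp, project, name, reason in failed_downloads(sample):
    print(stamp, project, name, reason, sep=" | ")
print(failures_by_reason(sample))
```

Saving the event log to a text file and passing its contents to `failed_downloads` should give the list of workunits that were dropped during each proxy attempt.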
ID: 1329539 · Report as offensive
Claggy
Volunteer tester

Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1329553 - Posted: 20 Jan 2013, 19:54:42 UTC - in response to Message 1329539.  
Last modified: 20 Jan 2013, 19:59:35 UTC

BOINC couldn't get the files because they didn't exist on that network connection, so it gave up on them completely. It isn't because of BOINC 7.0.44. The project has had workunits get lost before, but this time I'd blame the proxy.

I've had this before when I've connected via a wireless hotspot: once the login times out, the downloads all fail.
I get round it by manually downloading the WUs, overwriting the failed downloads, then editing my client_state.xml so that it looks as if the failed downloads never happened.

Claggy
ID: 1329553 · Report as offensive
TBar
Volunteer tester

Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1329563 - Posted: 20 Jan 2013, 20:16:41 UTC - in response to Message 1329553.  

BOINC couldn't get the files because they didn't exist on that network connection, so it gave up on them completely. It isn't because of BOINC 7.0.44. The project has had workunits get lost before, but this time I'd blame the proxy.

I've had this before when I've connected via a wireless hotspot: once the login times out, the downloads all fail.
I get round it by manually downloading the WUs, overwriting the failed downloads, then editing my client_state.xml so that it looks as if the failed downloads never happened.

Claggy

Can you point to a link that explains how to resume those downloads? I'd like to get them back. I'm not concerned about erasing the errors, download errors are almost a badge of honor at this point.
ID: 1329563 · Report as offensive
Claggy
Volunteer tester

Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1329575 - Posted: 20 Jan 2013, 21:00:25 UTC - in response to Message 1329563.  

BOINC couldn't get the files because they didn't exist on that network connection, so it gave up on them completely. It isn't because of BOINC 7.0.44. The project has had workunits get lost before, but this time I'd blame the proxy.

I've had this before when I've connected via a wireless hotspot: once the login times out, the downloads all fail.
I get round it by manually downloading the WUs, overwriting the failed downloads, then editing my client_state.xml so that it looks as if the failed downloads never happened.

Claggy

Can you point to a link that explains how to resume those downloads? I'd like to get them back. I'm not concerned about erasing the errors, download errors are almost a badge of honor at this point.

You can't resume the download, but if you haven't already reported the errored downloads, you can get the URLs from your client_state.xml,
download the WUs with a download manager, overwrite the failed downloads with your good downloads, and edit your client_state.xml (very carefully) like so:

http://setiathome.berkeley.edu/forum_thread.php?id=68768&postid=1260895

If you've reported them, then there's nothing you can do.
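
As a rough illustration of the "get the URLs" step only, here is a read-only Python sketch that lists the download URLs recorded for workunit files in a state file. The tag names (`file_info`/`file`, `url`/`download_url`) are assumptions and may not match every BOINC version, which is why they are parameters; the sample XML is made up for the example and is not copied from a real client_state.xml. The sketch never writes to the file, so the careful hand edit described in the linked post is still up to you.

```python
import xml.etree.ElementTree as ET


# ASSUMPTION: tag names differ between BOINC versions, so they are parameters
# here rather than facts about the real client_state.xml layout.
def download_urls(xml_text, file_tags=("file_info", "file"),
                  url_tags=("url", "download_url"), suffix=".wu"):
    """Map each file name ending in `suffix` to the URLs recorded for it."""
    root = ET.fromstring(xml_text)
    found = {}
    for element in root.iter():
        if element.tag not in file_tags:
            continue
        name = (element.findtext("name") or "").strip()
        if not name.endswith(suffix):
            continue
        urls = [child.text.strip() for child in element
                if child.tag in url_tags and child.text]
        if urls:
            found[name] = urls
    return found


# Hypothetical fragment for illustration only; not taken from a real state file.
sample = """<client_state>
  <file_info>
    <name>example_task.wu</name>
    <url>http://example.invalid/download/example_task.wu</url>
  </file_info>
  <file_info>
    <name>some_app.exe</name>
    <url>http://example.invalid/download/some_app.exe</url>
  </file_info>
</client_state>"""

for name, urls in download_urls(sample).items():
    print(name, urls)
```

Run it against a copy of the state file rather than the live one, and stop BOINC before touching the real file.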

Claggy
ID: 1329575 · Report as offensive
TBar
Volunteer tester

Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1329583 - Posted: 20 Jan 2013, 21:18:41 UTC - in response to Message 1329575.  

They were reported right after I discontinued the proxy. I seem to be having trouble receiving AP work for the ATI card, and while I wasn't paying attention, over half my ATI AP cache was replaced by Cuda Shorties. I'm down to about 10 hours work for the 6850.
ID: 1329583 · Report as offensive
Chris

Joined: 11 Apr 12
Posts: 9
Credit: 356,617
RAC: 0
United States
Message 1329689 - Posted: 21 Jan 2013, 2:04:16 UTC

Burning through the short work units before the APs download if I don't babysit.

Looks like the server won't send more MB if there are APs in the queue, but BOINC won't go to a backup project (zero resource share) while the downloads are in progress. I can suspend the APs so it goes to the backup at least.
ID: 1329689 · Report as offensive
Horacio

Joined: 14 Jan 00
Posts: 536
Credit: 75,967,266
RAC: 0
Argentina
Message 1329760 - Posted: 21 Jan 2013, 4:34:22 UTC - in response to Message 1329689.  

Burning through the short work units before the APs download if I don't babysit.

Looks like the server won't send more MB if there are APs in the queue, but BOINC won't go to a backup project (zero resource share) while the downloads are in progress. I can suspend the APs so it goes to the backup at least.

Exactly that is happening on my hosts...
If I don't look after them, they stop working altogether... the stalled downloads prevent the project from getting any additional work, and as there is work "pending" for the main project, the backup projects are not asked for work...

But if I set any other project with a resource share higher than zero then those projects become the only ones crunched because, no matter what the cache size is, as SETI is not able to give work at all, BOINC fills the cache with work for the other projects...

In the end... the only workaround I've found was to give up on SETI, because with the current limits there is no way to keep my hosts fed, not even with me fully dedicated to manually hitting the retry button and/or the suspend/resume network menu options... So right now, my "SETI Crunchers" are set to 99% for SETI and 1% for Einstein, and even with those settings they are crunching only for Einstein...
ID: 1329760 · Report as offensive
juan BFP
Volunteer tester
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1329807 - Posted: 21 Jan 2013, 12:31:15 UTC - in response to Message 1329760.  
Last modified: 21 Jan 2013, 12:32:39 UTC

The rule is still in place: the "farther" you are from the lab (and we are far, in internet terms), the worse the communication problem.

On my side I can't even feed the slowest hosts, never mind the 3x690 host (I stopped using it and moved the GPUs to 3 different hosts)... it's simply a hopeless task! It takes more time to download the WUs than to process them.

The only way to keep our hungry hosts fed is when the AP splitters stop and we can get a small part of the bandwidth without communication errors.
ID: 1329807 · Report as offensive
David S
Volunteer tester
Joined: 4 Oct 99
Posts: 18352
Credit: 27,761,924
RAC: 12
United States
Message 1329908 - Posted: 21 Jan 2013, 19:03:21 UTC

I just remoted into one of my hosts and found it in the middle of a download. It took a total of 4m59s. It's at the limit, so I'm satisfied with speed.

I'm not satisfied with the limit, though. I recently passed 3 million for Einstein and in a lot less time than it took to get from 1 million to 2. My resource shares remain at 110 Seti, 30 Einstein.

David
Sitting on my butt while others boldly go,
Waiting for a message from a small furry creature from Alpha Centauri.

ID: 1329908 · Report as offensive
KWSN Ekky Ekky Ekky
Joined: 25 May 99
Posts: 944
Credit: 52,956,491
RAC: 67
United Kingdom
Message 1329909 - Posted: 21 Jan 2013, 19:09:36 UTC
Last modified: 21 Jan 2013, 19:15:10 UTC

I should love to have some new work downloading, even if it takes all day for one. All I have had on my machine for the last 24 hours is:

"21/01/2013 19:03:15 | SETI@home | Reporting 2 completed tasks, not requesting new tasks"

I am now not even remotely near my customary cache but it still is "not requesting new tasks".

Is it me or is it something else?

[Edit]
Yes it was me. Somehow or other I had managed to set just one task to "Task suspended by user". Fixed. Doh.

ID: 1329909 · Report as offensive
David S
Volunteer tester
Joined: 4 Oct 99
Posts: 18352
Credit: 27,761,924
RAC: 12
United States
Message 1329911 - Posted: 21 Jan 2013, 19:15:38 UTC - in response to Message 1329909.  
Last modified: 21 Jan 2013, 19:17:05 UTC

I should love to have some new work downloading, even if it takes all day for one. All I have had on my machine for the last 24 hours is:

"21/01/2013 19:03:15 | SETI@home | Reporting 2 completed tasks, not requesting new tasks"

I am now not even remotely near my customary cache but it still is "not requesting new tasks".

Is it me or is it something else?

If your machine is not requesting new tasks, the issue is on it and not the servers. What is the total estimated time of all the work you have on hand? Do you have any work for other projects?

[edit]Saw your edit after I posted. Glad you figured it out.
David
Sitting on my butt while others boldly go,
Waiting for a message from a small furry creature from Alpha Centauri.

ID: 1329911 · Report as offensive
Claggy
Volunteer tester

Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1329912 - Posted: 21 Jan 2013, 19:16:26 UTC - in response to Message 1329909.  
Last modified: 21 Jan 2013, 19:17:59 UTC

I should love to have some new work downloading, even if it takes all day for one. All I have had on my machine for the last 24 hours is:

"21/01/2013 19:03:15 | SETI@home | Reporting 2 completed tasks, not requesting new tasks"

I am now not even remotely near my customary cache but it still is "not requesting new tasks".

Is it me or is it something else?

[Edit]
Yes it was me. Somehow or other I had managed to set just one task to "Task suspended by user". Fixed. Doh.

Make sure none of your downloads are Backed off, otherwise Boinc won't ask for work.

Edit: That'll stop BOINC asking for work too. Glad you've worked it out.

Claggy
ID: 1329912 · Report as offensive
TBar
Volunteer tester

Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1329979 - Posted: 21 Jan 2013, 23:26:38 UTC

It looks like the Shortie Storm is over, and my downloads are working much better now. I just downloaded 7 Cuda MBs and it took less than 2 minutes.
ID: 1329979 · Report as offensive
W-K 666 Project Donor
Volunteer tester

Joined: 18 May 99
Posts: 19013
Credit: 40,757,560
RAC: 67
United Kingdom
Message 1330439 - Posted: 23 Jan 2013, 18:19:39 UTC

Cannot see anything wrong on cricket or server status but last three report/requests have
23/01/2013 18:14:39 | SETI@home | Scheduler request failed: HTTP service unavailable

First was at 18:04:03
ID: 1330439 · Report as offensive
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13720
Credit: 208,696,464
RAC: 304
Australia
Message 1330441 - Posted: 23 Jan 2013, 18:23:05 UTC


Looks like we have Scheduler issues again: for the last 10 minutes every Scheduler request has resulted in "Server returned nothing (No headers, no data)". Network traffic shows downloads have dropped off, and there have been several glitches in inbound traffic. Going back through my log shows several periods of the Scheduler returning nothing (no headers, no data) over the last 10 hours or so.
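
For anyone wanting to pull those periods out of their own log without scrolling, here is a small Python sketch that groups "Scheduler request failed" lines into runs. It assumes the day-first, 24-hour timestamp layout seen in W-K 666's log line earlier in the thread (the format is a parameter), assumes the log is in chronological order, and groups purely by the time gap between failures, so a successful request in between does not split a run. The 15-minute gap and the function name are arbitrary choices for illustration.

```python
from datetime import datetime, timedelta


def failure_periods(log_text, stamp_format="%d/%m/%Y %H:%M:%S",
                    marker="Scheduler request failed",
                    max_gap=timedelta(minutes=15)):
    """Group failed scheduler requests into (first, last, count) periods."""
    periods = []
    for line in log_text.splitlines():
        # Expected layout: timestamp | project | message
        parts = [part.strip() for part in line.split("|", 2)]
        if len(parts) != 3 or not parts[2].startswith(marker):
            continue
        when = datetime.strptime(parts[0], stamp_format)
        if periods and when - periods[-1][1] <= max_gap:
            # Close enough to the previous failure: extend the current period.
            first, _, count = periods[-1]
            periods[-1] = (first, when, count + 1)
        else:
            periods.append((when, when, 1))
    return periods


# The 18:04:03 and 18:14:39 times come from the thread; the 18:09:21 and
# 21:30:00 lines are invented to show how two separate periods are reported.
sample = """\
23/01/2013 18:04:03 | SETI@home | Scheduler request failed: HTTP service unavailable
23/01/2013 18:09:21 | SETI@home | Scheduler request failed: HTTP service unavailable
23/01/2013 18:14:39 | SETI@home | Scheduler request failed: HTTP service unavailable
23/01/2013 21:30:00 | SETI@home | Scheduler request failed: HTTP service unavailable
"""

for first, last, count in failure_periods(sample):
    print(first, "to", last, "-", count, "failed request(s)")
```

Logs written with month-first, 12-hour timestamps (like TBar's) would need a different `stamp_format`, for example `"%m/%d/%Y %I:%M:%S %p"`.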
Grant
Darwin NT
ID: 1330441 · Report as offensive
KWSN Ekky Ekky Ekky
Joined: 25 May 99
Posts: 944
Credit: 52,956,491
RAC: 67
United Kingdom
Message 1330446 - Posted: 23 Jan 2013, 18:38:41 UTC - in response to Message 1330441.  

Someone using insecticide on crickets?

ID: 1330446 · Report as offensive
BarryAZ

Joined: 1 Apr 01
Posts: 2580
Credit: 16,982,517
RAC: 0
United States
Message 1330447 - Posted: 23 Jan 2013, 18:45:09 UTC - in response to Message 1330446.  

Well, at least this outage is during the day so someone can take a look onsite in a timely manner...
ID: 1330447 · Report as offensive
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13720
Credit: 208,696,464
RAC: 304
Australia
Message 1330451 - Posted: 23 Jan 2013, 18:50:53 UTC


Server status shows green, but inbound & outbound traffic have hit bottom.
Grant
Darwin NT
ID: 1330451 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14649
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1330458 - Posted: 23 Jan 2013, 19:19:37 UTC - in response to Message 1330451.  


Server status shows green, but inbound & outbound traffic have hit bottom.

And bounced....
ID: 1330458 · Report as offensive


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.