Incessant

Message boards : Number crunching : Incessant
Message board moderation

To post messages, you must log in.

1 · 2 · 3 · Next

AuthorMessage
Profile The Gas Giant
Volunteer tester
Avatar

Send message
Joined: 22 Nov 01
Posts: 1904
Credit: 2,646,654
RAC: 0
Australia
Message 167131 - Posted: 13 Sep 2005, 10:32:24 UTC

For the last day I have constantly getting...

|SETI@home|Scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed with a return value of 500

Same old problem again. Why is it that immediately after the outages I was able to upload and contact the schedular with no real problems, but now that the traffic has slowed down I'm seeing this failure again?

Live long and crunch.

Paul
(S@H1 8888)
And proud of it!
ID: 167131 · Report as offensive
Profile Kajunfisher
Volunteer tester
Avatar

Send message
Joined: 29 Mar 05
Posts: 1407
Credit: 126,476
RAC: 0
United States
Message 167141 - Posted: 13 Sep 2005, 11:03:23 UTC - in response to Message 167131.  

Scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed with a return value of 500


Paul, which one of your computers has that problem? Or is it all of them?

What version is giving that error?

James :-)
ID: 167141 · Report as offensive
Georges H
Volunteer tester
Avatar

Send message
Joined: 3 Apr 99
Posts: 26
Credit: 624,880
RAC: 8
Luxembourg
Message 167171 - Posted: 13 Sep 2005, 13:15:36 UTC - in response to Message 167131.  

For the last day I have constantly getting...

|SETI@home|Scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed with a return value of 500



I frequently get timeouts (on every 3rd, 4th or 5th request) where the scheduler doesn't respond but no error 500.
This happens to all my 3 computers, 2 using 4.45 and 1 using 4.19.

Does this error 500 also have to do sth with these timeouts?


Greetings from Luxembourg,

Georges
ID: 167171 · Report as offensive
Astro
Volunteer tester
Avatar

Send message
Joined: 16 Apr 02
Posts: 8026
Credit: 600,015
RAC: 0
Message 167183 - Posted: 13 Sep 2005, 14:00:50 UTC
Last modified: 13 Sep 2005, 14:03:53 UTC

Paul, could I ask if you are trying to force a connection using project update? I've seen this when I (dial up user) try to force two projects to connect at the same time, OR if seti had made a request earlier(that I knew nothing about) and then I tried to force a connection to seti or beta or ppah, etc.

tony

I have taken to only sending one project in at a time and then waiting until each one is completely finished before trying the next one. Also, I notice ppah will download some then wait, then download more, than wait and finish up the request.
ID: 167183 · Report as offensive
Profile Kajunfisher
Volunteer tester
Avatar

Send message
Joined: 29 Mar 05
Posts: 1407
Credit: 126,476
RAC: 0
United States
Message 167186 - Posted: 13 Sep 2005, 14:13:30 UTC

From the Wiki: 500 - The Server experienced an internal error and could not process the request.

James :-)
ID: 167186 · Report as offensive
Astro
Volunteer tester
Avatar

Send message
Joined: 16 Apr 02
Posts: 8026
Credit: 600,015
RAC: 0
Message 167187 - Posted: 13 Sep 2005, 14:18:48 UTC

James, Urs enternacht, Paul and a few others were having a problem before THE outage, and the outage made their problems irrelevent. Now they are having it yet again, it seems, We've already been through most the diagnostic procedures (firewall, ports, ipconfig, dns flush, and many more) and haven't found a solution to their issues. I wish I could remember the thread name, but it's a very interesting problem.

tony
ID: 167187 · Report as offensive
nairb

Send message
Joined: 18 Mar 03
Posts: 201
Credit: 5,447,501
RAC: 5
United Kingdom
Message 167249 - Posted: 13 Sep 2005, 18:14:51 UTC

Also on dial up. I have had this problem for months. When there wasent much work to send the server 500 error went away for a few days.
Now it may take 30 or 40 tries before I get an UPDATE thru.
Nothing I have done has worked for long. It can take days to get an Update thru.
Been getting server 504 error also - makes a change.

Nairb
ID: 167249 · Report as offensive
Profile Kajunfisher
Volunteer tester
Avatar

Send message
Joined: 29 Mar 05
Posts: 1407
Credit: 126,476
RAC: 0
United States
Message 167257 - Posted: 13 Sep 2005, 19:02:56 UTC - in response to Message 167187.  

James, Urs enternacht, Paul and a few others were having a problem before THE outage, and the outage made their problems irrelevent. Now they are having it yet again, it seems, We've already been through most the diagnostic procedures (firewall, ports, ipconfig, dns flush, and many more) and haven't found a solution to their issues. I wish I could remember the thread name, but it's a very interesting problem.

tony


I remember the thread Tony, is he still usig a proxy to connect? (looking for that thread now)

James :-)
ID: 167257 · Report as offensive
Profile Kajunfisher
Volunteer tester
Avatar

Send message
Joined: 29 Mar 05
Posts: 1407
Credit: 126,476
RAC: 0
United States
Message 167259 - Posted: 13 Sep 2005, 19:18:03 UTC

There was this thread where Galileo was being used as an option, not sure if it's still available as one now, I'm stil looking for the thread.


http://setiathome.berkeley.edu/forum_thread.php?id=19428
ID: 167259 · Report as offensive
Profile Kajunfisher
Volunteer tester
Avatar

Send message
Joined: 29 Mar 05
Posts: 1407
Credit: 126,476
RAC: 0
United States
Message 167265 - Posted: 13 Sep 2005, 19:33:28 UTC

I think I found it: http://setiathome.berkeley.edu/forum_thread.php?id=18705

I hope that's it.

James :-)
ID: 167265 · Report as offensive
Profile mikey
Volunteer tester
Avatar

Send message
Joined: 17 Dec 99
Posts: 4215
Credit: 3,474,603
RAC: 0
United States
Message 167483 - Posted: 14 Sep 2005, 2:57:24 UTC - in response to Message 167259.  
Last modified: 14 Sep 2005, 2:57:59 UTC

There was this thread where Galileo was being used as an option, not sure if it's still available as one now, I'm stil looking for the thread.
http://setiathome.berkeley.edu/forum_thread.php?id=19428

Matt closed it at the time and I checked the other day and it is still closed.
galileo.ssl.berkeley.edu

ID: 167483 · Report as offensive
Profile The Gas Giant
Volunteer tester
Avatar

Send message
Joined: 22 Nov 01
Posts: 1904
Credit: 2,646,654
RAC: 0
Australia
Message 167509 - Posted: 14 Sep 2005, 4:03:40 UTC

Thanks for the feedback.....from Tonys other thread....

Ping is OK
Trace route appears to time out on one jump within the US then works OK (maybe there is a fault between Australia and the US).
Another computer using the same adsl link (but individually connected since I don't have a network set up) mostly connects OK.
My laptop at work never gets this failure (I have not done a traceroute at work and compared it to the one done at home - I might try that).
I have never lost a wu due to deadline issues so this is just painful.
Uploads work OK, it's just connecting to the scheduler for reporting of wu's and getting new work.
It is only a problem with seti@home not LHC, Predictor or Climate (I haven't done Einstein in a while).
I do not use a proxy normally, but do get it to connect via a proxy sometimes (until that one stops working and I have to find another - but I only do that once the wu's start getting a little old).
After 10 retries (I think) it does a master_file fetch gets it, but I still get the 500 failure.
Firewall all checked OK (even turned it off for a check - it is back on).
Anti-virus turned off as well for a check (it is back on).
Hmmm...wonder what else I've checked?
--------

I have 3 hosts, 2 PCs at home and a work laptop that is off for about a total of 1.5hrs per day as I commute between work and home. The laptop does not have a problem connecting at work - as said above I should do a traceroute from home and work and compare. I do not normally connect my laptop at home, but have done when I've run our of LHC work so I can't really compare. I will connect my laptop at home and do some seti uploads etc and compare.

The 2 PCs at home use a single non-networked usb adsl connection. My 3.2GHz HT machine is normally the always on connection and I see the 500 failure on auto retries and manual retries. My 2.0GHz box is mostly only connected to upload, report and download. It has recently had the problem, but not as much as the 3.2GHz machine, but then it's not always connected so it's hard to gauge. I have previously compared traceroutes and they show the same hops. I used Galileo with great success for a few hrs after the big outage but am now back to my normal non-proxy connection. I can usually connect using a proxy for a while, then I need to find another one.

I use BOINC V4.72 (all on XP) and am about to start using 5.1.x series (I'll let you know if I still have a problem after that). I think I was having the same problem with 4.45, but it has been a long time since I have used 4.45. If nothing else works I might wind back a few revs (4.45, 4.25 then 4.19) and see what happens.

Live long and crunch.

Paul
(S@H1 8888)
And proud of it!
ID: 167509 · Report as offensive
Profile StokeyBob
Avatar

Send message
Joined: 31 Aug 03
Posts: 848
Credit: 2,218,691
RAC: 0
United States
Message 167519 - Posted: 14 Sep 2005, 4:51:17 UTC
Last modified: 14 Sep 2005, 4:52:49 UTC

I suppose this has already been mentioned.

Trying some of the tools at;

www.pcpitstop.com
or
www.dslreports.com/tools

Maybe some setting is off and one of their tools can help get it set right.
ID: 167519 · Report as offensive
Profile mikey
Volunteer tester
Avatar

Send message
Joined: 17 Dec 99
Posts: 4215
Credit: 3,474,603
RAC: 0
United States
Message 167853 - Posted: 15 Sep 2005, 2:08:23 UTC - in response to Message 167509.  

Thanks for the feedback.....from Tonys other thread....
Ping is OK
Trace route appears to time out on one jump within the US then works OK (maybe there is a fault between Australia and the US).

Just a thought then, try using the proxies in Europe and go the long way around.

ID: 167853 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 168064 - Posted: 15 Sep 2005, 16:00:56 UTC - in response to Message 167519.  

I suppose this has already been mentioned.

Trying some of the tools at;

www.pcpitstop.com
or
www.dslreports.com/tools

Maybe some setting is off and one of their tools can help get it set right.

This problem is not likely to show in a traceroute or ping.

Paul's machine sends a TCP "SYN" packet. It arrives at Berkeley and he gets back a SYN+ACK. His machine then sends an ACK, possibly along with his request, and the servers in Berkeley get the request, and the request has an error. Paul's machine gets back the response with the 500. That packet gets an ACK.

This is a minimum of four successful packets. The BOINC protocol is a little verbose so it's probably more than four.

The $64,000 question is: why doesn't the request get through to Berkeley undamaged.

This is why someone else suggested "smaller packets" -- in case something between here and there is just dropping something when a packet is exactly a certain size (unlikely but possible) and why I suggested "no proxies" because a proxy will receive and resend the request, not simply forward it.

Paul also reported that he's on a slow DSL connection. Shorter packets should make it through better if his line is poor.
ID: 168064 · Report as offensive
Profile Tigher
Volunteer tester

Send message
Joined: 18 Mar 04
Posts: 1547
Credit: 760,577
RAC: 0
United Kingdom
Message 168127 - Posted: 15 Sep 2005, 18:58:39 UTC - in response to Message 168064.  

I suppose this has already been mentioned.

Trying some of the tools at;

www.pcpitstop.com
or
www.dslreports.com/tools

Maybe some setting is off and one of their tools can help get it set right.

This problem is not likely to show in a traceroute or ping.

Paul's machine sends a TCP "SYN" packet. It arrives at Berkeley and he gets back a SYN+ACK. His machine then sends an ACK, possibly along with his request, and the servers in Berkeley get the request, and the request has an error. Paul's machine gets back the response with the 500. That packet gets an ACK.

This is a minimum of four successful packets. The BOINC protocol is a little verbose so it's probably more than four.

The $64,000 question is: why doesn't the request get through to Berkeley undamaged.

This is why someone else suggested "smaller packets" -- in case something between here and there is just dropping something when a packet is exactly a certain size (unlikely but possible) and why I suggested "no proxies" because a proxy will receive and resend the request, not simply forward it.

Paul also reported that he's on a slow DSL connection. Shorter packets should make it through better if his line is poor.


Ned Hi,
You have seen a trace now of the packets? Any dropped packets?

Paul has not come back in the other thread to confirm a trace nor a reduced MTU. Have you heard from him?

ID: 168127 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 168149 - Posted: 15 Sep 2005, 20:15:43 UTC - in response to Message 168127.  

Ned Hi,
You have seen a trace now of the packets? Any dropped packets?

Paul has not come back in the other thread to confirm a trace nor a reduced MTU. Have you heard from him?

I've not seen a traceroute and I haven't heard if he tried changing MTU from 1500 (the normal ethernet default) to something like 576 (the normal dialup default).

I'm pretty sure though that we aren't going to see much on the traceroute because he is connecting and he is getting a response. TCP is sufficiently robust that he shouldn't be having trouble if he can connect and get the "500" response, but there are 'broken' IP stacks out there.

I'd love to see what a smaller MTU does for him.
ID: 168149 · Report as offensive
Profile Tigher
Volunteer tester

Send message
Joined: 18 Mar 04
Posts: 1547
Credit: 760,577
RAC: 0
United Kingdom
Message 168153 - Posted: 15 Sep 2005, 20:26:51 UTC - in response to Message 168149.  

Ned Hi,
You have seen a trace now of the packets? Any dropped packets?

Paul has not come back in the other thread to confirm a trace nor a reduced MTU. Have you heard from him?

I've not seen a traceroute and I haven't heard if he tried changing MTU from 1500 (the normal ethernet default) to something like 576 (the normal dialup default).

I'm pretty sure though that we aren't going to see much on the traceroute because he is connecting and he is getting a response. TCP is sufficiently robust that he shouldn't be having trouble if he can connect and get the "500" response, but there are 'broken' IP stacks out there.

I'd love to see what a smaller MTU does for him.


OK. I was hoping for an ethereal trace to see what was sent and how the server was responding. I guess we wait for him to try that and to reduce his MTU size. Interesting one. I don't know of you can recall but is this the one where there is satellite involved?

ID: 168153 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 168159 - Posted: 15 Sep 2005, 20:32:34 UTC - in response to Message 168153.  


OK. I was hoping for an ethereal trace to see what was sent and how the server was responding. I guess we wait for him to try that and to reduce his MTU size. Interesting one. I don't know of you can recall but is this the one where there is satellite involved?

What I'd like to see is some sort of Ethereal trace or something like that from near the server. If it's close to Paul we'd know it's getting out of his machine okay, but have no idea how it gets molested by the time it reaches the scheduler.

Changing MTU may not help, but if it does, we're golden. If it doesn't, well, it isn't that hard to change it back.

I included a link to DrTCP on another message as that is an easy way to fiddle.
ID: 168159 · Report as offensive
Profile The Gas Giant
Volunteer tester
Avatar

Send message
Joined: 22 Nov 01
Posts: 1904
Credit: 2,646,654
RAC: 0
Australia
Message 168170 - Posted: 15 Sep 2005, 20:42:45 UTC - in response to Message 168149.  

Ned Hi,
You have seen a trace now of the packets? Any dropped packets?

Paul has not come back in the other thread to confirm a trace nor a reduced MTU. Have you heard from him?

I've not seen a traceroute and I haven't heard if he tried changing MTU from 1500 (the normal ethernet default) to something like 576 (the normal dialup default).

I'm pretty sure though that we aren't going to see much on the traceroute because he is connecting and he is getting a response. TCP is sufficiently robust that he shouldn't be having trouble if he can connect and get the "500" response, but there are 'broken' IP stacks out there.

I'd love to see what a smaller MTU does for him.

Sorry for the long delay between responses, it's been extrememly busy at work and home. And thanks for the repsonses. It really is appreciated.

I haven't dropped the MTU down to 500 or 576 yet. It is currently at 1492. The slow ADSL connection is just what I'm paying for. "Broadband" downunder is extremely expensive. What I pay $A49.90 per month for you'd get a 1.5Mbit to 5Mbit connection, but I get a 256kbit connection with unlimited downloads. My closeness to the telephone exchange means if I wanted an ADSL2 connection I could get one, but it ain't worth the money (and the Australian government wonders why the uptake of broadband is slow - but that's what happens with a pseudo monopoly - but I digress). As I have said previously I do not have a problem with other projects (LHC, CPDN, Predictor).

I'm currently hammering away on LHC trying to resist being overtaken for total credit by a teammate for as long as possible (his ARC is 1100, mine is 460 so it is just a matter of hrs now). Once I am overtaken I will be setting the resource share back to where it was and seti will get a few cycles on this machine and we will be able to see if things improve with a lower MTU, but I still don't understand why that would make a difference when I can connect and do other projects with no problems what-so-ever.

Live long and crunch.

Paul
(S@H1 8888)
And proud of it!
ID: 168170 · Report as offensive
1 · 2 · 3 · Next

Message boards : Number crunching : Incessant


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.