Fixed scheduler?

Message boards : Number crunching : Fixed scheduler?
Message board moderation

To post messages, you must log in.

1 · 2 · 3 · Next

AuthorMessage
Profile Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 1 Mar 99
Posts: 1444
Credit: 957,058
RAC: 0
United States
Message 170138 - Posted: 20 Sep 2005, 19:47:55 UTC

Hi, All -

A lot of people have been having trouble connecting to our scheduler lately. Well, I think I fixed the problem, or at least one problem. If you were having trouble connecting before, can you try again and post positive/negative results to this thread?

Thanks,

- Matt
-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude
ID: 170138 · Report as offensive
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 170166 - Posted: 20 Sep 2005, 21:15:27 UTC

Hi Matt, can you post a similar thread in one of the Q&A forums? A lot of people who try to attach, but cannot, don't have credit yet. So they can't post here.


ID: 170166 · Report as offensive
Profile Sir Ulli
Volunteer tester
Avatar

Send message
Joined: 21 Oct 99
Posts: 2246
Credit: 6,136,250
RAC: 0
Germany
Message 170179 - Posted: 20 Sep 2005, 21:33:41 UTC

no Probs here from Germany

SETI@home - 2005-09-20 23:30:10 - Sending request to scheduler: http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi
SETI@home - 2005-09-20 23:30:16 - Scheduler RPC to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi succeeded

all is working perfectly



Greetings from Germany NRW
Ulli



ID: 170179 · Report as offensive
Profile Raven
Volunteer tester
Avatar

Send message
Joined: 28 Aug 02
Posts: 373
Credit: 99,071
RAC: 0
Canada
Message 170188 - Posted: 20 Sep 2005, 22:26:33 UTC
Last modified: 20 Sep 2005, 22:57:01 UTC

I didn't have a problem connecting, but did get an odd message:

20/09/2005 2:38:00 PM|SETI@home|Sending scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi
20/09/2005 2:38:00 PM|SETI@home|Requesting 0 seconds of work, returning 0 results
20/09/2005 2:38:01 PM|SETI@home|Scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi succeeded
20/09/2005 2:38:02 PM|SETI@home|Deferring communication with project for 10 minutes and 4 seconds

I was refreshing my preferences at the time, and was set to no new work, hence the request for 0 seconds of work, but even after a successful connection I get the deferring communication message afterwards.

I don't know if this is a symptom of a bigger problem, so... now you know about it! There was no harm done on my end that I could see.

[EDIT] I should add, also, that BOINC did not connect again after the 10:04 was up.[/EDIT]
ID: 170188 · Report as offensive
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 170189 - Posted: 20 Sep 2005, 22:29:25 UTC - in response to Message 170188.  

20/09/2005 2:38:00 PM|SETI@home|Sending scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi
20/09/2005 2:38:00 PM|SETI@home|Requesting 0 seconds of work, returning 0 results
20/09/2005 2:38:01 PM|SETI@home|Scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi succeeded
20/09/2005 2:38:02 PM|SETI@home|Deferring communication with project for 10 minutes and 4 seconds

I am getting the exact same thing on a manual update to the project.
ID: 170189 · Report as offensive
Profile Tern
Volunteer tester
Avatar

Send message
Joined: 4 Dec 03
Posts: 1122
Credit: 13,376,822
RAC: 44
United States
Message 170195 - Posted: 20 Sep 2005, 22:53:13 UTC - in response to Message 170189.  

20/09/2005 2:38:02 PM|SETI@home|Deferring communication with project for 10 minutes and 4 seconds

I am getting the exact same thing on a manual update to the project.


It's spreading... sigh. Einstein started doing this a few days ago, people are complaining, they're trying to find a way to tell the client to hold off connecting WITHOUT sending this annoying message. And now SETI _adds_ it?
ID: 170195 · Report as offensive
Swibby Bear

Send message
Joined: 1 Aug 01
Posts: 246
Credit: 7,945,093
RAC: 0
United States
Message 170202 - Posted: 20 Sep 2005, 23:08:40 UTC
Last modified: 20 Sep 2005, 23:20:02 UTC

getting the same message in Pennsylvania - using S@H 4.18 and BOINC 4.19. But I am connecting okay.

SETI@home - 2005-09-20 15:51:03 - Deferring communication with project for 10 minutes and 6 seconds

(EDIT) I just updated manually okay, and did not get the "deferring" msg.
ID: 170202 · Report as offensive
Terry
Volunteer tester

Send message
Joined: 17 Sep 00
Posts: 153
Credit: 1,805,202
RAC: 0
United States
Message 170205 - Posted: 20 Sep 2005, 23:21:49 UTC

Seeing the same thing, but still getting work.

9/20/2005 5:27:35 PM|SETI@home|Sending scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi
9/20/2005 5:27:35 PM|SETI@home|Reason: To fetch work
9/20/2005 5:27:35 PM|SETI@home|Requesting 1 seconds of work, returning 0 results
9/20/2005 5:27:36 PM|SETI@home|Scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi succeeded
9/20/2005 5:27:37 PM|SETI@home|Deferring communication with project for 10 minutes and 4 seconds
9/20/2005 5:27:37 PM|SETI@home|Started download of 26ja04aa.9681.31665.504824.197
9/20/2005 5:27:41 PM|SETI@home|Finished download of 26ja04aa.9681.31665.504824.197
9/20/2005 5:27:41 PM|SETI@home|Throughput 90140 bytes/sec
ID: 170205 · Report as offensive
nairb

Send message
Joined: 18 Mar 03
Posts: 201
Credit: 5,447,501
RAC: 5
United Kingdom
Message 170212 - Posted: 20 Sep 2005, 23:59:17 UTC

Its a miracle. Not a single server 500 error. Not one. I had got to the point of having to leave the dial up on overnight just to get an update thru. It took 20 or 30 tries to get an update. At the weekend's it was almost impossible to get an update thru.
Today I have got all machines updated.

Just in case we have forgotten the problem:-
9/20/05 1:43:00 PM|SETI@home|Scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed with a return value of 500
9/20/05 1:43:00 PM|SETI@home|No schedulers responded

I dont give a toss about the 10 min wait that's appeared.

Give that man a nobel prize.

Nairb


ID: 170212 · Report as offensive
itenginerd
Avatar

Send message
Joined: 1 Aug 00
Posts: 37
Credit: 39,905
RAC: 0
United States
Message 170216 - Posted: 21 Sep 2005, 0:16:54 UTC - in response to Message 170195.  

20/09/2005 2:38:02 PM|SETI@home|Deferring communication with project for 10 minutes and 4 seconds

I am getting the exact same thing on a manual update to the project.


It's spreading... sigh. Einstein started doing this a few days ago, people are complaining, they're trying to find a way to tell the client to hold off connecting WITHOUT sending this annoying message. And now SETI _adds_ it?


I've gotten it from both LHC, E@H, and Seti in the last week or so. LHC likes to back off about 27 hours instead of 10 minutes, then chirp about it every hour after that. Good thing I log some face time with my BOINC Client...

I'm running BOINC 4.72 on XP SP2 and 4.45 on 2000 SP4.

(j)
James
ID: 170216 · Report as offensive
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 170228 - Posted: 21 Sep 2005, 0:39:29 UTC - in response to Message 170216.  
Last modified: 21 Sep 2005, 1:19:12 UTC

LHC likes to back off about 27 hours instead of 10 minutes

Sounds like they are using Mars times. Is LHC trying to build their large hadron collider on our neighbour planet? ;)

(I must edit and say: Someone has had fun minusing all our posts. Maybe someone who thought this wasn't the correct forum for these posts. ;))
ID: 170228 · Report as offensive
Profile [HWU] GHz & CO. - BOINC.Italy
Volunteer tester
Avatar

Send message
Joined: 1 Jul 02
Posts: 139
Credit: 1,466,611
RAC: 0
Italy
Message 170250 - Posted: 21 Sep 2005, 1:27:41 UTC
Last modified: 21 Sep 2005, 1:28:59 UTC

Hi Matt,
I've controlled now my 3 hosts and all have a long queue of wu in transfer with many message of deferring download, but that work units was not listed in Work tab. Restarting BOINC, the queue of WU disappeared and BOINC continue with the work in cache, but try to DONWLOAD new WU and the download queue has been reformed, with deferring and this error:

21/09/2005 3.23.34|SETI@home|Unrecoverable error for result 09no03aa.23825.18768.59646.132_0 (WU download error: couldn't get input files:<file_xfer_error> <file_name>09no03aa.23825.18768.59646.132</file_name> <error_code>-200</error_code> <error_message></error_message></file_xfer_error>)


I don't understand why the Wu in transfer do not compare in the Work tab.....what's happening at the scheduler and download server? :(
GHz
BOINC.Italy
ID: 170250 · Report as offensive
J D K
Volunteer tester
Avatar

Send message
Joined: 26 May 04
Posts: 1295
Credit: 311,371
RAC: 0
United States
Message 170271 - Posted: 21 Sep 2005, 2:55:59 UTC
Last modified: 21 Sep 2005, 2:56:35 UTC

2 hosts are getting this about 1 to 1hr 1/2, nothing else just this one line.....

SETI@home 9/20/2005 4:56:28 PM Deferring communication with project for 9 minutes and 31 seconds

SETI@home 9/20/2005 6:03:37 PM Deferring communication with project for 10 minutes and 5 seconds
And the beat goes on
Sonny and Cher

BOINC Wiki

ID: 170271 · Report as offensive
Profile [HWU] GHz & CO. - BOINC.Italy
Volunteer tester
Avatar

Send message
Joined: 1 Jul 02
Posts: 139
Credit: 1,466,611
RAC: 0
Italy
Message 170275 - Posted: 21 Sep 2005, 3:19:45 UTC - in response to Message 170250.  

Hi Matt,
I've controlled now my 3 hosts and all have a long queue of wu in transfer with many message of deferring download, but that work units was not listed in Work tab. Restarting BOINC, the queue of WU disappeared and BOINC continue with the work in cache, but try to DONWLOAD new WU and the download queue has been reformed, with deferring and this error:

21/09/2005 3.23.34|SETI@home|Unrecoverable error for result 09no03aa.23825.18768.59646.132_0 (WU download error: couldn't get input files:<file_xfer_error> <file_name>09no03aa.23825.18768.59646.132</file_name> <error_code>-200</error_code> <error_message></error_message></file_xfer_error>)


I don't understand why the Wu in transfer do not compare in the Work tab.....what's happening at the scheduler and download server? :(


OK, no allarm, problem solved! It was my router! All OK now! Fiuuu :P
GHz
BOINC.Italy
ID: 170275 · Report as offensive
Jim O'Dell

Send message
Joined: 27 May 99
Posts: 64
Credit: 6,320,552
RAC: 10
United States
Message 170279 - Posted: 21 Sep 2005, 3:48:17 UTC - in response to Message 170138.  

Hi, All -

A lot of people have been having trouble connecting to our scheduler lately. Well, I think I fixed the problem, or at least one problem. If you were having trouble connecting before, can you try again and post positive/negative results to this thread?

Thanks,

- Matt



I'vcew beeb having lots of trouble over the lastg couple of months. I just tried several times in a row after learning of your fix and connected successfully every time. I don't believe that has happened sincce I have been using the BOINC version.

- Jim
ID: 170279 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13727
Credit: 208,696,464
RAC: 304
Australia
Message 170299 - Posted: 21 Sep 2005, 5:31:34 UTC - in response to Message 170188.  
Last modified: 21 Sep 2005, 5:32:49 UTC

I didn't have a problem connecting, but did get an odd message: ....

Same here, contacts the scheduler OK, and then defers communication for 10 minutes or so.

Edit- just did a manaul update after all my results had been returned & new work downloaded, no deferral this time.
Grant
Darwin NT
ID: 170299 · Report as offensive
Profile The Gas Giant
Volunteer tester
Avatar

Send message
Joined: 22 Nov 01
Posts: 1904
Credit: 2,646,654
RAC: 0
Australia
Message 170347 - Posted: 21 Sep 2005, 10:49:33 UTC

Works a treat!

"Same here, contacts the scheduler OK, and then defers communication for 10 minutes or so." This is new behaviour.

Live long and crunch.

Paul
(S@H1 8888)
And proud of it!
ID: 170347 · Report as offensive
Karl H. Kruhoffer

Send message
Joined: 17 Apr 00
Posts: 32
Credit: 4,972,575
RAC: 0
Denmark
Message 170356 - Posted: 21 Sep 2005, 12:03:17 UTC


Worked nicely using a proxy (apparently a local router problem as an alternate router works without proxy) until some time within the last 12-24 hours. Now it doesn't work. So it would seem you've perhaps fixed one bug, but at the same time introduced another... wouldn't be the first time in the history of programming.. :o)
ID: 170356 · Report as offensive
Profile Landroval

Send message
Joined: 7 Oct 01
Posts: 188
Credit: 2,098,881
RAC: 1
United States
Message 170358 - Posted: 21 Sep 2005, 12:14:00 UTC

Just a "ditto" to what others have posted... Running BOINC 4.45, WinXP Home, DSL connection. Not causing problems at the moment (still work in the queue), but I'm seeing the same behavior. The deferral is always 10 minutes, 4 seconds.


9/21/2005 6:20:42 AM|SETI@home|Finished upload of 09no03aa.23825.7121.292318.226_1_0
9/21/2005 6:20:42 AM|SETI@home|Throughput 233040 bytes/sec
9/21/2005 6:20:43 AM|SETI@home|Sending scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi
9/21/2005 6:20:43 AM|SETI@home|Requesting 0 seconds of work, returning 1 results
9/21/2005 6:20:45 AM|SETI@home|Scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi succeeded
9/21/2005 6:20:46 AM|SETI@home|Deferring communication with project for 10 minutes and 4 seconds

If you think education is expensive, try ignorance.
ID: 170358 · Report as offensive
Karl H. Kruhoffer

Send message
Joined: 17 Apr 00
Posts: 32
Credit: 4,972,575
RAC: 0
Denmark
Message 170359 - Posted: 21 Sep 2005, 12:14:08 UTC


To be a bit more specific - when disabling the proxy server I can upload results, but when running 'update' manually the connection fails, with and without the use of proxy...
ID: 170359 · Report as offensive
1 · 2 · 3 · Next

Message boards : Number crunching : Fixed scheduler?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.