Message boards :
Number crunching :
No schedulars responded.
Message board moderation
Previous · 1 · 2 · 3 · 4 · 5 . . . 6 · Next
Author | Message |
---|---|
Pooh Bear 27 Send message Joined: 14 Jul 03 Posts: 3224 Credit: 4,603,826 RAC: 0 |
From the Technical News Page: June 16, 2005 - 19:00 UTC Since most (well over 99%) of scheduler accesses were now reaching the new scheduling server, we shut down the scheduler on the old server (which now only handles uploads/downloads). I am suspecting these people that are having problems are maybe still hitting the old server, and have not updated to hit the new server. I have had no problems, so far this weekend. Plenty of work to go around, it seems. My movie https://vimeo.com/manage/videos/502242 |
Steve Cressman Send message Joined: 6 Jun 02 Posts: 583 Credit: 65,644 RAC: 0 |
I know about the planned outages, but this has gone on for me for 12 hours now and others I have asked on some irc channels are not having a problem. So why would I consistently be getting the dreaded 'no schedular responded' message for the last 12 hrs. Web access is OK and the computer has been re-booted a few times today due to various updates. I was having the same problem, deferring for 58 sec everytime. After several hours of this I tried using a proxy to attach, had success and work dl/ul. Turned off the proxy and the deferring for 58 sec problem returned. Initiated use of the proxy again and now it works again. >I am suspecting these people that are having problems are maybe still hitting >the old server, and have not updated to hit the new server. This seems highly likely and was wondering what to do to have the dns update. (me thinks dumping the dns cache in my firewall should do it.) EDIT : Just removed the entries for berkley instead of deleting whole cache and will see what happens with next conection. CONT: That didn't work but using proxy does solve it for now. 98SE XP2500+ @ 2.1 GHz Boinc v5.8.8 And God said"Let there be light."But then the program crashed because he was trying to access the 'light' property of a NULL universe pointer. |
EclipseHA Send message Joined: 28 Jul 99 Posts: 1018 Credit: 530,719 RAC: 0 |
From the Technical News Page: Let's see 99% of 100000 means that 1000 people are still using the old server! Keeping the old server active until it was 99.9 or even 99.999% might have been the correct move. "hey, we get less than one scheduler hit an hour on the old server, ok to shut it down?" might have been a more valid guage. The better test would have been only a handful of hits to the old server in a day.... Some ISP's set their DNS/Bind to always cache for X amount of time, reguardless of what was specified in the original zone file... Ya got to give a DNS change like this about a week to fully ripple thru the internet... Trust me.. I've seen this many times before..... Saying that "99%" is the mark" just leads to frustation for, in this example, 1000 people. Rebooting their PC won't clear the problem if their DNS is back at their ISP (as it would be is the case for most users, or proxy sites) |
The Gas Giant Send message Joined: 22 Nov 01 Posts: 1904 Credit: 2,646,654 RAC: 0 |
I don't want to reset with 26 results ready to report. Well it updated ok while I was out and downloaded 14 wu's, but it's back to no schedulars responded now with 5 results ready to report. It has now stopped the 58 second deferral and is doing exponential backoffs, with it now at 1hr11min defferal. Odd really. Paul. |
Ingleside Send message Joined: 4 Feb 03 Posts: 1546 Credit: 15,832,022 RAC: 13 |
Let's see 99% of 100000 means that 1000 people are still using the old server! If a project uses 2 scheduling-servers, one of them is handing out even-result-id while the other is handing out odd result-id. In instances like now one of the scheduling-servers only gets 1% of the traffic, this means very few wu can be validated since for most wu only 2 results have been sent out. This also means very little credit is handed out, splitters must work much harder generating enough wu, and disk-usage will increase very fast since no old wu can be deleted. BTW, if you tries to connect to the old scheduling-server, you'll get this message: <scheduler_reply> |
Urs Echternacht Send message Joined: 15 May 99 Posts: 692 Credit: 135,197,781 RAC: 211 |
[qoute] I was having the same problem, deferring for 58 sec everytime. After several hours of this I tried using a proxy to attach, had success and work dl/ul. Turned off the proxy and the deferring for 58 sec problem returned. Initiated use of the proxy again and now it works again. ... EDIT : Just removed the entries for berkley instead of deleting whole cache and will see what happens with next conection. CONT: That didn't work but using proxy does solve it for now. [/quote] Hi Steve, which proxy did you use? I tried several around the world but these did not change my situation. Still no response from the scheduler. The last two wus of my 6 day cache are on now and the deadline is eight days away. So there is a chance to figure this out in time. 39 wus to report. _\|/_ U r s |
Thierry Van Driessche Send message Joined: 20 Aug 02 Posts: 3083 Credit: 150,096 RAC: 0 |
A list of proxies that worked some 2 months ago can be find here. |
Urs Echternacht Send message Joined: 15 May 99 Posts: 692 Credit: 135,197,781 RAC: 211 |
Thanks Thierry. This is the list i have tried today and two other sides lists of free proxies, too. Maybe a proxy will not solve this. My hope is, the machine will run out of work and then be able to connect. Approximately two hours till then. edit: Before i forgot that two machines are unable to report/download wus. So its 59 wus to report. My third machine just was able to connect via the same dial-up connection. So DNS seems not to be the problem. _\|/_ U r s |
Urs Echternacht Send message Joined: 15 May 99 Posts: 692 Credit: 135,197,781 RAC: 211 |
Something positiv: After the last wu was half through, the connection happend. Now it is downloading new work. I reduced the cache-size to 4 days. Hopefully this works better. _\|/_ U r s |
The Gas Giant Send message Joined: 22 Nov 01 Posts: 1904 Credit: 2,646,654 RAC: 0 |
I don't want to reset with 26 results ready to report. After hitting update a few times, BOINC went into the 58 second deferral and stayed that way for another 10hrs when it was able to report and get get new work again. Is there something that Berkeley have implemented that they haven't told us yet? I've never had this sort of behaviour from BOINC before. Interestinger and interestinger.... Paul (S@H1 8888) And proud of it! |
John McLeod VII Send message Joined: 15 Jul 99 Posts: 24806 Credit: 790,712 RAC: 0 |
The address of the Scheduler(s) are stored in the Master File. The master File is checked after 10 contact failures. It should be self correcting, and if you don't have the patience, you can just hit update enough times. BOINC WIKI |
The Gas Giant Send message Joined: 22 Nov 01 Posts: 1904 Credit: 2,646,654 RAC: 0 |
The address of the Scheduler(s) are stored in the Master File. The master File is checked after 10 contact failures. It should be self correcting, and if you don't have the patience, you can just hit update enough times. Let me see....10hrs of 20/06/2005 9:20:42 AM|SETI@home|Sending scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi 20/06/2005 9:20:42 AM|SETI@home|Requesting 0 seconds of work, returning 6 results 20/06/2005 9:20:45 AM|SETI@home|Scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed 20/06/2005 9:20:45 AM|SETI@home|No schedulers responded 20/06/2005 9:20:46 AM|SETI@home|Deferring communication with project for 58 seconds equals just a few more than 10 contact failures..... As it is said, "Patience is a virue". I am trying to be very virtuous when it comes to BOINC atm. What's interesting in all this is that BOINC will eventually make contact, report work and get new work then start repeating the above for another 10hrs. Paul. |
The Gas Giant Send message Joined: 22 Nov 01 Posts: 1904 Credit: 2,646,654 RAC: 0 |
Still at it. Other projects are working OK, so it's not my ISP, it's not my connection, it has to be something at Berkeley...anyone? Paul. |
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13736 Credit: 208,696,464 RAC: 304 |
Still at it. Other projects are working OK, so it's not my ISP, it's not my connection, it has to be something at Berkeley...anyone? No problem here- 3 seconds from request to response from the Scheduler at the moment. Grant Darwin NT |
The Gas Giant Send message Joined: 22 Nov 01 Posts: 1904 Credit: 2,646,654 RAC: 0 |
Still at it. Other projects are working OK, so it's not my ISP, it's not my connection, it has to be something at Berkeley...anyone? Still at it..about to run out of work. I am not having this problem on my laptop which also uses the same connection from home to get work on occasion. After hitting update many times I have seen a new message.... 22/06/2005 10:48:56 PM|SETI@home|Sending scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi 22/06/2005 10:48:56 PM|SETI@home|Requesting 518400 seconds of work, returning 13 results 22/06/2005 10:49:00 PM|SETI@home|Scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed with a return value of 500 22/06/2005 10:49:00 PM|SETI@home|No schedulers responded 22/06/2005 10:49:01 PM|SETI@home|Deferring communication with project for 59 seconds 22/06/2005 10:50:01 PM|SETI@home|Fetching master file 22/06/2005 10:50:05 PM|SETI@home|Master page download succeeded 22/06/2005 10:50:06 PM|SETI@home|Sending scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi 22/06/2005 10:50:06 PM|SETI@home|Requesting 518400 seconds of work, returning 13 results 22/06/2005 10:50:10 PM|SETI@home|Scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed with a return value of 500 22/06/2005 10:50:10 PM|SETI@home|No schedulers responded 22/06/2005 10:50:11 PM|SETI@home|Deferring communication with project for 59 seconds So a new master file has been retrieved which must have the required url in it. I am now out of seti work :( Why is it so.... Paul. |
Alite Send message Joined: 1 Sep 99 Posts: 3 Credit: 2,807,174 RAC: 0 |
On most of my machines I can contact the Scheduler but don't receive any new work. Just these messages... 6/22/2005 7:59:41 AM|SETI@home|Sending request to scheduler: http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi 6/22/2005 7:59:42 AM|SETI@home|Scheduler RPC to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi succeeded Any ideas? Arie |
Jord Send message Joined: 9 Jun 99 Posts: 15184 Credit: 4,362,181 RAC: 3 |
I've joined you into the deferal of communication, Paul. Even a project reset doesn't help. Only thing different so far is that I only had 4 of the 58 seconds deferals, before I jumped to 38 minutes. ;) 23/06/2005 02:13:58|SETI@home|Sending scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi 23/06/2005 02:13:58|SETI@home|Requesting 43200 seconds of work, returning 0 results 23/06/2005 02:13:59|SETI@home|Scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed 23/06/2005 02:13:59|SETI@home|No schedulers responded 23/06/2005 02:14:00|SETI@home|Deferring communication with project for 38 minutes and 20 seconds About to put the thing to NNW again and just leave it be. Beta can then do the work. :) |
mikey Send message Joined: 17 Dec 99 Posts: 4215 Credit: 3,474,603 RAC: 0 |
I've joined you into the deferal of communication, Paul. Even a project reset doesn't help. Only thing different so far is that I only had 4 of the 58 seconds deferals, before I jumped to 38 minutes. ;) If you look at the Cogent connection: http://fragment1.berkeley.edu/~cricket/inr-668-interfaces.html it shows Berkeley is busy doing something! |
Daniel Michel Send message Joined: 2 Feb 04 Posts: 14925 Credit: 1,378,607 RAC: 6 |
i think the traffic jam is being caused by the new seti 4.18 application being downloaded to seti users all over the planet...both of my machines have now recieved it...but it took some time. PROUD TO BE TFFE! |
Jord Send message Joined: 9 Jun 99 Posts: 15184 Credit: 4,362,181 RAC: 3 |
Yup, it was the traffic jam that caused it for me. ;) I've now got 4.18, one unit and hot damn, I got graphics. :) |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.