Message boards :
Number crunching :
Panic Mode On (115) Server Problems?
Message board moderation
Previous · 1 . . . 5 · 6 · 7 · 8 · 9 · 10 · 11 . . . 31 · Next
Author | Message |
---|---|
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
The way I understand the download servers to work from a long ago post was the client asks for work from boinc2.ssl.berkeley.edu. That gets resolved to both Georgem and vader through a round-robin load balancer mechanism. When one of the servers is disabled, the single surviving download server has to service the entire download workload. It can't support the total number of requests on its own and the download mechanism falls over. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
Tom M Send message Joined: 28 Nov 02 Posts: 5126 Credit: 276,046,078 RAC: 462 |
The way I understand the download servers to work from a long ago post was the client asks for work from boinc2.ssl.berkeley.edu. Is there any chance the server that is down can be rebooted remotely? Tom A proud member of the OFA (Old Farts Association). |
Ian&Steve C. Send message Joined: 28 Sep 99 Posts: 4267 Credit: 1,282,604,591 RAC: 6,640 |
im getting work, but every request i get some downloads that stick and i have to refresh them about 10 times before they finally kick through. sigh. Seti@Home classic workunits: 29,492 CPU time: 134,419 hours |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
I believe all the servers can be rebooted remotely. But that is not what happened today. They took the project down for maintenance earlier this morning and when they brought the project back up, the replica database was at first disabled, then enabled but left offline, then they put Georgem to disabled. And that is where it has stayed.For what reason, we can only guess as the staff is not forthcoming with any current technical news except on rare occasions. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
The inability to receive work because of stuck download is really dropping the return rate. Earlier this morning before the unexpected short maintenance event, we were returning tasks at about 150K/hr. Now we have fallen down to around 88K/hr. Sure hope they get Georgem back running before the weekend. Would be nice to hear the technical reason why Georgem is disabled also. We mushrooms crave information. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
Speedy Send message Joined: 26 Jun 04 Posts: 1643 Credit: 12,921,799 RAC: 89 |
Scroll to the bottom task in the list on the download page in the Manager. Select it with the mouse and click the Retry button. I can usually get a dozen or so tasks that way cleared from the list before the download server begins to ignore me and give me a increased backoff. Then I move to another host and try there until it too craps out. Then move to another host etc. If I try that will I get CUDA 9 on my machine? Tom I suggest having a look at my host before you answer |
Pierre A Renaud Send message Joined: 3 Apr 99 Posts: 998 Credit: 9,101,544 RAC: 65 |
Is it worth using the following IP for the email system (in cases of DNS service failure) ? Haven't needed to use it in ages but have kept it as a (now possibly obsolete or erroneous) reference... 208.68.240.110 setiathome.berkeley.edu # IP address for the messages/email system(s) If you want to modify your hosts list, then these are the current IP addresses.Well, they were current at the dates stated. My local reference set has the dates updated to August 2017, the last time we had to dust them off. Apr 3, 1999 - May 3, 2020 |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
If I try that will I get CUDA 9 on my machine? Tom I suggest having a look at my host before you answer Uh, no. You need to run Linux and the special app to get CUDA9 tasks. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
Is it worth using the following IP for the email system (in cases of DNS service failure) ? Haven't needed to use it in ages but have kept it as a (now possibly obsolete or erroneous) reference... Only if we have DNS issues and the web server is not being resolved. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
Stephen "Heretic" Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628 |
Scroll to the bottom task in the list on the download page in the Manager. Select it with the mouse and click the Retry button. I can usually get a dozen or so tasks that way cleared from the list before the download server begins to ignore me and give me a increased backoff. Then I move to another host and try there until it too craps out. Then move to another host etc. . . I can offer some guidance on changing over to Linux to run CUDA90 :) Stephen :) |
B. Ahmet KIRAN Send message Joined: 19 Oct 14 Posts: 77 Credit: 36,140,903 RAC: 140 |
Please, Please, Please, someone stop this torture of failed downloads... I have been trying on all my machines to get a decent download without any success for now around 8 hours... Why doesn't someone close down the downloads until the problem is resolved? It is better to read "project has no tasks available" than "download retry in xx:xx:xx" which never manages to succeed in the retry... |
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13841 Credit: 208,696,464 RAC: 304 |
Got home to a relatively cool house, due to 1 system being out of all work & the other out of CPU work. Found lots of downloads, all in excessive backoff mode. Tried "Retry Pending transfers" with no joy. From the looks of this thread, it's nice to know i'm not the only one. Grant Darwin NT |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
Downloads have been fubared most of the day ever since the mini maintenance outage this morning. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
Unixchick Send message Joined: 5 Mar 12 Posts: 815 Credit: 2,361,516 RAC: 22 |
After multiple (very frustrating) retries... I finally got some WUs to download to the faster machine. I sorted the files by size in the download tab and tried the bottom one as suggested ( and threw a penny in a fountain and made a wish on a star) and finally got things to move. I didn't get all the stuck files, but at least the machine is crunching again. good luck... as more machines go into lengthy time outs maybe the traffic jam won't be so bad and us crazy die-hards can get some WUs here and there. |
Speedy Send message Joined: 26 Jun 04 Posts: 1643 Credit: 12,921,799 RAC: 89 |
Scroll to the bottom task in the list on the download page in the Manager. Select it with the mouse and click the Retry button. I can usually get a dozen or so tasks that way cleared from the list before the download server begins to ignore me and give me a increased backoff. Then I move to another host and try there until it too craps out. Then move to another host etc. Thanks for the offer Stephen I will stick with Windows |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
After multiple (very frustrating) retries... I finally got some WUs to download to the faster machine. If you can get at least one stuck download to clear, then you can start clearing one at a time. The download server seems to only respond to a single request to the database at a time for the stuck downloads. It may take dozens of tries to get the first one to start, but once it does, don't stop until you clear all the stuck ones, one at a time. Then you will finally be able to get more work and start the whole stuck download process over again. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
Unixchick Send message Joined: 5 Mar 12 Posts: 815 Credit: 2,361,516 RAC: 22 |
They threw another Aricebo file on the splitter. AP files are being handed out to those lucky enough to get through. queries/second has spiked into the 5k range, so I don't think I'll get anymore WUs tonight. I got lucky and got a few when the queries were less than 2k. I hope tomorrow brings the Seti team some better luck and that they can solve this issue so we can all have a good weekend. |
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13841 Credit: 208,696,464 RAC: 304 |
If you can get at least one stuck download to clear, then you can start clearing one at a time. Not here. Regardless of what I try, it's 1 WU every 30-50 retries. I think i'll wait for the servers to get sorted out. Grant Darwin NT |
Wiggo Send message Joined: 24 Jan 00 Posts: 36584 Credit: 261,360,520 RAC: 489 |
Everything is ok here still and the AP's for today is into 3 figures. :-D Cheers. |
kittyman Send message Joined: 9 Jul 00 Posts: 51477 Credit: 1,018,363,574 RAC: 1,004 |
I found a little trick that seems to be working once you can get downloads started. In cc_config, set max files transfer to 1. <max_file_xfers>1</max_file_xfers> Have Boinc read the config file. Once it gets going, it will only download one task at a time. Seems to be working at the moment. Meow! "Time is simply the mechanism that keeps everything from happening all at once." |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.