Panic Mode On (26) Server problems
Author | Message |
---|---|
Joined: 23 May 99 Posts: 4292 Credit: 72,971,319 RAC: 0 |
I noticed something strange about this problem on my machines. All my machines with Crunch3r's 6.1.0 fixed themselves and are fine. Each of my machines with 6.10.18 needed a few restarts to finish its downloads. So the problem seems to be in the newer builds on my machines. Official Abuser of Boinc Buttons... And no good credit hound! |
Joined: 16 May 99 Posts: 10436 Credit: 110,373,059 RAC: 54 |
I had turned my i7 off for 3 hours when I had problems with the downloads and watched a movie with the grandkids. Lo and behold, when I turned it on, I had work units running. Maybe patience is what was called for. I only powered up to read the forums; I wasn't worried whether I had anything running. Old James |
1mp0£173 Joined: 3 Apr 99 Posts: 8423 Credit: 356,897 RAC: 0 |
The exception is the Maximum TTL settings in the Windows Registry. That is to correct a Windows BUG. It's all in how you conjugate the verb. If it's in your code, it's a bug. If it's in my code, it's a feature. :-) |
Joined: 14 Jul 04 Posts: 887 Credit: 176,479,616 RAC: 0 |
Well, my i7 running Windows 7 64 bit ultimately downloaded 1400 WU's. I say ultimately, since it took nearly 4 hours and more /flushdns's than I can count. What I saw was that after a /flush, Boinc would d/l WU's for five minutes (I put a stopwatch on it to be certain; it was always 5 minutes, give or take 10 seconds), then roll right back into the error state. I'd have to halt d/l's, stop Boinc, do another flush, and start over again. But, it DID work.............;) |
Joined: 23 May 99 Posts: 4292 Credit: 72,971,319 RAC: 0 |
I have Windows 7 64 bit ultimate also. Had to do a bunch of restarts. Official Abuser of Boinc Buttons... And no good credit hound! |
Joined: 9 Feb 04 Posts: 1175 Credit: 4,754,897 RAC: 0 |
|
FiveHamlet Joined: 5 Oct 99 Posts: 783 Credit: 32,638,578 RAC: 0 |
I found that using the ReSchedule tool for changing GPU>CPU worked for my Vista rigs. I had to do it a few times, though. Don't know why it works. Dave |
Richard Haselgrove Joined: 4 Jul 99 Posts: 14690 Credit: 200,643,578 RAC: 874 |
@ Ned, I've been watching this thread, still without access to my main machines, and you've almost convinced me that it's all Windoze' fault - but not quite. This story sounds like a very good description of the effects that one would expect to see with round-robin DNS and a TTL of 300 seconds: "What I saw was that after a /flush, Boinc would d/l WU's for five minutes (I put a stopwatch on it to be certain; it was always 5 minutes, give or take 10 seconds), then roll right back into the error state. I'd have to halt d/l's, stop Boinc, do another flush, and start over again." But if that was the end of the story, wouldn't we expect to see that five minutes later, the downloads restart again of their own accord? (Maybe only 50% of the time - sometimes it might be 10 minutes, sometimes 15, and so on - but it should have a good chance of restarting in due course.) I don't think anybody has posted in this thread to report a spontaneous resumption of downloads after hitting the problem: if anyone has experienced one, please tell us about it. It sounds the same as the discussion in Panic Mode On (18): there I reported a similar run of downloads followed by failure, but with no resumption after more than an hour. The odds against that are more than 4,000 to 1 if there's a true random re-resolve from DNS every 5 minutes. That's why I think there's a difference between BOINC's initial attempt to download a file (using DNS), and subsequent retries, using ????? |
Joined: 9 Jun 99 Posts: 15184 Credit: 4,362,181 RAC: 3 |
"Tried ipconfig /flushdns, got cannot flush the resolver cache, my ISP will not allow it." This has nothing to do with your ISP. You probably have the DNS Client service disabled or stopped. Without a DNS client, you're not caching any DNS entries. (You'll need administrator privileges for this.) Go to Start->Run. Type "services.msc" without the quotes and press Enter. Scroll to DNS Client. Double-click on DNS Client. Under the Startup type pulldown, set it to "Automatic". Then click the Start button. Click Apply. Click OK. You will now be able to flush your DNS using the ipconfig /flushdns command. |
Fred W Joined: 13 Jun 99 Posts: 2524 Credit: 11,954,210 RAC: 0 |
"That's why I think there's a difference between BOINC's initial attempt to download a file (using DNS), and subsequent retries, using ?????" I was trying to compose exactly the same comment, but Richard chose the words so much better. I have amended the registry entries to force the 5 minute switch and can watch it happening in a cmd box by pinging. But I have one download that has been stuck for over an hour ATM and, no matter how many retries I force (or wait for), it remains stuck. It would certainly appear that the IP address (rather than the URL) is being cached elsewhere within Boinc. F. |
Joined: 9 Feb 04 Posts: 1175 Credit: 4,754,897 RAC: 0 |
|
Joined: 23 May 99 Posts: 4292 Credit: 72,971,319 RAC: 0 |
I am curious why this only seems to affect my machines with 6.10.18 but none using 6.1.0? Official Abuser of Boinc Buttons... And no good credit hound! |
Joined: 28 Jan 06 Posts: 1410 Credit: 934,158 RAC: 0 |
Haven't been able to download all day. FLUSHDNS doesn't work. Reboot doesn't work. Upload works fine. Report works fine. Dust storm has made my nose run a continuous 4 minute mile all day. Damaged newly-painted wall trying to hang a picture. Motor mower on the fritz. I'm telling you, it's a conspiracy! |
Joined: 28 Jan 06 Posts: 1410 Credit: 934,158 RAC: 0 |
Now report doesn't work. And I appear to have been bitten by a spider. Definitely a conspiracy [panic on] |
Joined: 28 Jan 06 Posts: 1410 Credit: 934,158 RAC: 0 |
"Tried ipconfig /flushdns, got cannot flush the resolver cache, my ISP will not allow it." While you are correct in what you say, this is not curing the issue, which is a problem at the S@H end. We shall have to do the usual, and wait for the crew to hit the appropriate pipe with a hammer. |
Joined: 18 Sep 03 Posts: 834 Credit: 1,807,369 RAC: 0 |
OK... if this goes on, I can soon open my own "Technical News" thread. LOL. On my dedicated cruncher the hosts file entries stopped helping. So I deleted them, and although this machine still believed that anything ending in "berkeley.edu" lives on that Spanish book shop IP, the download was OK. Now, half an hour later, I finally get the right IPs for Berkeley. I also deleted the hosts file entries on my laptop. So now everything works here; hopefully it will last for a while. |
kittyman Joined: 9 Jul 00 Posts: 51540 Credit: 1,018,363,574 RAC: 1,004 |
"I found that using the ReSchedule tool for changing GPU>CPU worked for my..." Probably because the rescheduler shuts down Boinc and then restarts it when it has completed the rescheduling. "Time is simply the mechanism that keeps everything from happening all at once." |
Joined: 20 Aug 02 Posts: 3377 Credit: 20,676,751 RAC: 0 |
I had six or seven stuck in download. I just shut down the connected client and closed BOINC Manager for about ten minutes. When I started it back up, they all downloaded with no problem. Maybe I just got lucky, but it worked once, so maybe it will again. Haven't played with the rescheduler; I seem to have a pretty good mix of CPU/GPU tasks, but I think the kittyman is probably right on that one. PROUD MEMBER OF Team Starfire World BOINC |
1mp0£173 Joined: 3 Apr 99 Posts: 8423 Credit: 356,897 RAC: 0 |
"@ Ned," @Richard, what I think is happening is that Windows will keep both IP addresses cached and in the same order for the default Maximum TTL, even if the record has a shorter TTL. Since the default for the Maximum TTL is 86400 seconds (1 day), your formula is right, but it'd take a day to time out and re-randomize the lookup. ... and as you pointed out, there is a 50% chance of getting the same order on a random lookup. I suspect that this kind of issue never lasts for more than a couple of days, so on that time scale it's hard to be sure what happened. -- Ned |
1mp0£173 Joined: 3 Apr 99 Posts: 8423 Credit: 356,897 RAC: 0 |
"I am curious why this only seems to affect my machines with 6.10.18 but none using 6.1.0?" One of my grumbles over 6.1.0 is that it has a BOINC-style version number, but the modifications are based on a 5.x client. The Lunatics gang carefully maintained the original SETI application versions and added additional version/build info to make it absolutely clear that it's not from Berkeley. As I remember, 6.1.0 is based on a mid-5.x.x version, probably before they started using libcurl. Why the switch to libcurl? Because it handles a variety of proxies and situations that the earlier code did not handle well. Richard may be right in that it's a libcurl bug, but I have `<http_debug>1</http_debug>` in my cc_config.xml, and it seems to say that it's doing the lookup, and getting the (wrong) answer from the underlying OS. I could be wrong. Edit: 6.1.0 is definitely older than it appears. |
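
For anyone who wants to check the arithmetic in the exchange between Richard and Ned above, here is a rough Python sketch. It assumes two addresses in the round-robin, one of them bad, and an independent 50% chance of landing on the bad one at every re-resolve. The function names are made up for illustration and are not part of BOINC or Windows.

```python
def stuck_probability(stuck_seconds, ttl_seconds=300, p_bad=0.5):
    """Chance that every re-resolve during the stuck period picks the bad server.

    Assumes one independent lookup per elapsed TTL, each hitting the
    bad address with probability p_bad (an assumption, not a measurement).
    """
    lookups = stuck_seconds // ttl_seconds  # whole TTL periods that have expired
    return p_bad ** lookups


def odds_against(probability):
    """Convert a probability into 'N to 1 against' odds."""
    return (1 - probability) / probability


# Richard's case: stuck for an hour with a 300 second TTL.
p = stuck_probability(3600)
print(f"{3600 // 300} lookups, odds against about {odds_against(p):,.0f} to 1")

# Ned's case: if the cache really holds the entry for 86400 seconds,
# no re-resolve happens within the hour, so staying stuck is certain.
print(stuck_probability(3600, ttl_seconds=86400))
```

With a 300 second TTL there are 12 lookups in an hour, which is where the "more than 4,000 to 1" figure comes from. With a day-long effective TTL there are none, so an hour of failures would not be surprising at all. The sketch cannot say which of the two is actually happening on a given machine.
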
©2025 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.