Message boards :
Number crunching :
Current download problem prohibits also other projects downloads
Message board moderation
Previous · 1 · 2
Author | Message |
---|---|
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14650 Credit: 200,643,578 RAC: 874 |
From the front page: Actually, they fixed it about three hours ago (before 21:00 UTC - beating their own estimate!). Might be useful to bookmark this page. Still, we definitely have a long catch-up period this time, and things will be slow for a while yet. |
Nathan Send message Joined: 11 Apr 01 Posts: 10 Credit: 9,996,407 RAC: 0 |
Actually, they fixed it about three hours ago (before 21:00 UTC - beating their own estimate!). Sure doesn't look like it based on the machines here. |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14650 Credit: 200,643,578 RAC: 874 |
Hmmm, I don't know. The INR-688 interface to Cogent "flatlined" yesterday around 3 PM Berkeley time, so it would seem they knew they were in trouble before they left yesterday (Also an afternoon time frame was mentioned in the news item). Sure, it flatlined, but can you infer that they knew ;-) ? a) Were they there? I don't know if it was a holiday / team meeting / awayday / schmooze the sponsors / fit a new receiver at Arecibo / anything else sort of day. All goood reasons for not fixing it. b) Did they notice? Do they have a flat-line alarm? Not a good reason for keeping us in the dark. Even while all the new development is taking place, they should nominate someone to mind the shop during working hours. |
dragon1 Send message Joined: 17 Sep 05 Posts: 33 Credit: 4,438,013 RAC: 0 |
Likely a big backlog....I have JUST received 7 downloads...AND SETI also now sees my preferences ie 1.5 days...something it hasn't done in about a week. Maybe (likely) our good friends at Berk. have been working on that earlier reported issue too. Horray.... |
John McLeod VII Send message Joined: 15 Jul 99 Posts: 24806 Credit: 790,712 RAC: 0 |
It's my guess that his client has requested X seconds of work from seti and the order was filled, but just hasn't reached him yet, so the puter thinks it has X seconds on hand when infact it doesn't. The scheduler has already taken that into account and isn't requesting other work. Atleast, I think this is correct. Actually, there is a problem with the current code as well. If the only contactable project with a higher LTD than -task switch interval is acknowledging work requests by granting work, but the download server is off line, then the BOINC client believes that it has enough work on the way and should not request any more (when it actually does not have work on the way). I believe I have found a fix that will go into 5.7.4 or so. BOINC WIKI |
Pappa Send message Joined: 9 Jan 00 Posts: 2562 Credit: 12,301,681 RAC: 0 |
If I were to think that traffic In/OUT is no longer happening because the server has issues, then the higher volume of traffic will appear to flatline. So the NFS issue prevents the traffic from happening... the lower volume of traffic is workstations trying to connect... Hmmm, I don't know. The INR-688 interface to Cogent "flatlined" yesterday around 3 PM Berkeley time, so it would seem they knew they were in trouble before they left yesterday (Also an afternoon time frame was mentioned in the news item). Please consider a Donation to the Seti Project. |
Nathan Send message Joined: 11 Apr 01 Posts: 10 Credit: 9,996,407 RAC: 0 |
All the machines I can see say "Activities Suspended" No processing--things are just stopped. Restarted boinc, updated project, etc. Nothing. |
Alinator Send message Joined: 19 Apr 05 Posts: 4178 Credit: 4,647,982 RAC: 0 |
This is a known issue in the current release, where it does CPU scheduling system-wide. The current beta does scheduling on a per-core basis. John, just to make sure I'm reading this right. Let's say say Project A has the highest LTD, Project B the next highest, Project C next, and so forth. Project A is returning a NNW on requests, Project B is sending work but DL's are failing. You're saying that when the work for Project A runs out and BOINC discovers the result for Project B is unrunable it won't fall back and DL a result from Project C? Alinator |
Alinator Send message Joined: 19 Apr 05 Posts: 4178 Credit: 4,647,982 RAC: 0 |
Hmmm, I don't know. The INR-688 interface to Cogent "flatlined" yesterday around 3 PM Berkeley time, so it would seem they knew they were in trouble before they left yesterday (Also an afternoon time frame was mentioned in the news item). Apparently according to the Tech News they were busy enough worrying about other issues they *didn't* realize new work wasn't going out the door. ;-) Oh well, things like that happen from time to time (even if alarms go off, just ask the guys at TMI). :-) Alinator |
zombie67 [MM] Send message Joined: 22 Apr 04 Posts: 758 Credit: 27,771,894 RAC: 0 |
Well, I have been (and still am) away on travel during this, and these are my observations. I usually get 17k per day. about 10k of that are machines that run SETI exclusively. The rest (11 machines) run SETI/Rosetta/WCG at 100/50/50. During the outage, I made 10.8. Now, the SETI-exclusive machines were running on a setting of 1.3 (or greater) days of work. This tells me that one of two things happened: 1) I lost almost all the points on the mixed project machines...they sat idle, when they could have just worked on the other two projects instead. or 2) The "Connect to network about every..." setting does not do what I have been told it does. And no, this is not attributable to the recent bug. These machines have been on the 1.3 days for (forgot to write) over a month now. I'm guessing #1 based on other posts in this thread. So....we have uncovered a real bug, IMO. And a serious one at that. Dublin, California Team: SETI.USA |
John McLeod VII Send message Joined: 15 Jul 99 Posts: 24806 Credit: 790,712 RAC: 0 |
This is a known issue in the current release, where it does CPU scheduling system-wide. The current beta does scheduling on a per-core basis. Depends on the LTD of C and whether the system is otherwise in EDF. If the system is not in EDF and the LTD of C is above the cutoff, then work would be fetched. Otherwise not. This is a bug for which I have submitted a fix. It did not make it into 5.7.4 as that came out immediately after I submitted the fix, but before anyone had a chance to look at it and check it in. BOINC WIKI |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.