Message boards :
Number crunching :
Panic Mode On (24) Server problems
Message board moderation
Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 . . . 11 · Next
Author | Message |
---|---|
Sutaru Tsureku Send message Joined: 6 Apr 07 Posts: 7105 Credit: 147,663,825 RAC: 5 |
More like Catch22 Same here and I guess everywhere.. UL/DL slowly/not possible.. errors.. Also not possible to reach the scheduler.. |
52 Aces Send message Joined: 7 Jan 02 Posts: 497 Credit: 14,261,068 RAC: 67 |
More like Catch22 A Catch two-twenty-two, as all your wingmen are also in the same situation, so even stuff you had pending can't be credited as they can't upload either ;-) I do wish the User total chart inside Boinc graphed based on the timestamp I completed the WU. I consistenly crunch the same small amount of work everyday. But my chart looks like a I go dark for days on end as my silly wingmen seem to use their systems for other stuff, thus I wait and my graph goes flat ;-) |
Samdani Send message Joined: 21 Oct 00 Posts: 85 Credit: 13,480,553 RAC: 0 |
Finally started to receive work. Actually, at the exact moment when last GPU unit was being crunched. Am I lucky or what :) |
Jet Send message Joined: 25 Sep 07 Posts: 12 Credit: 1,586,013 RAC: 0 |
Yes, you are right, exact description of the problem. upload pending :-( |
1mp0£173 Send message Joined: 3 Apr 99 Posts: 8423 Credit: 356,897 RAC: 0 |
As others have noted already, bandwidth is pretty much pegged: http://fragment1.berkeley.edu/newcricket/grapher.cgi?target=/router-interfaces/inr-250/gigabitethernet2_3&ranges=d%3Aw&view=Octets It's a 100 megabit link at 92 megabits. When you start loading above 80% or so, there are lots of packets that get dropped due to congestion. It would be nice if BOINC could "flow control" uploads: slowing down all of the attempts would reduce collisions and increase throughput. In the meantime, hang on and enjoy the ride. It always recovers eventually. |
Vistro Send message Joined: 6 Aug 08 Posts: 233 Credit: 316,549 RAC: 0 |
Now my "kill network for a week then upload everything, rinse lather repeat" plan isn't so stupid, lol. God I wish I knew what it is like to worry about being on your last 1,000 GPU units. 30+ Computers heading our way! Currently at the "Zomg we need to talk to our tech expert at the co-op about this first!!!" stage. 16 Lab machines and 14+ Staff machines each with 2.2Ghz CPUs and 256MB ram. Think they balance? The RAM certainly is bad |
PhonAcq Send message Joined: 14 Apr 01 Posts: 1656 Credit: 30,658,217 RAC: 1 |
Can we learn anything about these outages, where the bandwidth gets pegged and we all "suffer"? Like, what is the maximum time Matt can leave Berkeley before all hell breaks loose? Or, is there something more interesting? |
hiamps Send message Joined: 23 May 99 Posts: 4292 Credit: 72,971,319 RAC: 0 |
Uploads keep hitting 100% then just go back to retry in....At least they are making progress. Official Abuser of Boinc Buttons... And no good credit hound! |
Vistro Send message Joined: 6 Aug 08 Posts: 233 Credit: 316,549 RAC: 0 |
The problem at this moment (in about 3.5 moments there will be a new problem :p lol jk), is that everybody's computers are able to get work now, so they use massive amounts of bandwidth. Because of this, nobody can upload their tasks to get more. It's a vicious cycle that only a new internet connection can fix, but right now SETI just doesn't have the cash for it. 30+ Computers heading our way! Currently at the "Zomg we need to talk to our tech expert at the co-op about this first!!!" stage. 16 Lab machines and 14+ Staff machines each with 2.2Ghz CPUs and 256MB ram. Think they balance? The RAM certainly is bad |
Labbie Send message Joined: 19 Jun 06 Posts: 4083 Credit: 5,930,102 RAC: 0 |
Think it's bad here? Pigeon transfers data faster than South Africa's Telkom Calm Chaos Forum...Join Calm Chaos Now |
1mp0£173 Send message Joined: 3 Apr 99 Posts: 8423 Credit: 356,897 RAC: 0 |
Can we learn anything about these outages, where the bandwidth gets pegged and we all "suffer"? Like, what is the maximum time Matt can leave Berkeley before all hell breaks loose? Or, is there something more interesting? If by "we" you mean forum members, I think there are a couple of interesting lessons. Probably the most interesting is: "when you crush the network, it does eventually recover." If by "we" you mean SETI@Home, I think there are some experiments that could be run, like setting the "back-off" time to more than 11 seconds. If by "we" you mean the BOINC developers, I think we have a testbed for some kind of flow control, but someone has to write the code. |
1mp0£173 Send message Joined: 3 Apr 99 Posts: 8423 Credit: 356,897 RAC: 0 |
Think it's bad here? Which makes me wonder why they haven't implemented IPoAC in South Africa. |
1mp0£173 Send message Joined: 3 Apr 99 Posts: 8423 Credit: 356,897 RAC: 0 |
Think it's bad here? Thinking about it a little more, I wonder if IPoAC would be possible to improve the bandwidth situation on campus.... |
Labbie Send message Joined: 19 Jun 06 Posts: 4083 Credit: 5,930,102 RAC: 0 |
Think it's bad here? LOL - but it would take an entire flock!!!! Calm Chaos Forum...Join Calm Chaos Now |
PhonAcq Send message Joined: 14 Apr 01 Posts: 1656 Credit: 30,658,217 RAC: 1 |
Can we learn anything about these outages, where the bandwidth gets pegged and we all "suffer"? Like, what is the maximum time Matt can leave Berkeley before all hell breaks loose? Or, is there something more interesting? Ok, you have some good thoughts. I would love to know if anyone relevant has read them. |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14654 Credit: 200,643,578 RAC: 874 |
Telkom could not immediately be reached for comment. I wonder why not? |
Fred W Send message Joined: 13 Jun 99 Posts: 2524 Credit: 11,954,210 RAC: 0 |
Telkom could not immediately be reached for comment. Lack of bandwidth... F. |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14654 Credit: 200,643,578 RAC: 874 |
Telkom could not immediately be reached for comment. Or lack of pigeons... |
1mp0£173 Send message Joined: 3 Apr 99 Posts: 8423 Credit: 356,897 RAC: 0 |
Can we learn anything about these outages, where the bandwidth gets pegged and we all "suffer"? Like, what is the maximum time Matt can leave Berkeley before all hell breaks loose? Or, is there something more interesting? I assume you aren't saying that those of use here are not relevant. :-) Since every time this happens it seems to be a minor tempest in the forums, I would say that some forum members still haven't figured out that it isn't as big a problem as it might. For the folks at SETI@Home, I'm sure they've experienced what happens when you try to speed up the recovery process -- my experience is that it generally makes recovery slower. For the BOINC developers, one of my suggestions is in 6.6.38 and later. |
1mp0£173 Send message Joined: 3 Apr 99 Posts: 8423 Credit: 356,897 RAC: 0 |
Telkom could not immediately be reached for comment. Same thing. |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.