School (Feb 22 2011) |
![]() |
| log in |
Message boards : Technical News : School (Feb 22 2011)
Previous · 1 · 2
| Author | Message |
|---|---|
|
Uh Oh.... I think that I started something here ! :> | |
| ID: 1081667 · | |
|
Umm, isn't it time to consider cutting down the mandatory wait time after a scheduler update? (to something less than 5 minutes...) | |
| ID: 1081708 · | |
|
Hmm, someone must be calling into the lab over the weekend and loading new tapes as there are both MB and AP units available, and a few more of each to be split from loaded tapes. May I thank that person for going "above and beyond". | |
| ID: 1081962 · | |
Umm, isn't it time to consider cutting down the mandatory wait time after a scheduler update? (to something less than 5 minutes...) I'm not sure, but I suspect the mandatory wait time is there for limiting bandwidth issues. Regardless, it's something I can live with. (Don't want to hammer the servers too much while things are running as smoothly as they are now, after all.) ____________ | |
| ID: 1081980 · | |
Umm, isn't it time to consider cutting down the mandatory wait time after a scheduler update? (to something less than 5 minutes...) Well you just hit the nail on the head, changing the 11 seconds(?), in 5 minutes,I think, was one reason to decrease the IN and OUTput of the Up- & DownLoad SERVERS! A 100,000 or more hosts hammering every few seconds on these servers, was of the reasons to change this, these short 11 seconds, also changed to report direct in some older BOINC versions, is somekind of DDOS-Attack, IMHO! And with the, still growing CUDA/CAL/OpenCL GPU processing, this certainly will be reviewed, sometime in the (near) future.... I'm not a ICT specialist, but with the ever increasing (Moore's Law), demand for faster and more efficient hard and software, it's matter of time when the new SERVERS can't keep up, with the demand for new work, anymore and another expansion, is needed! ____________ Knight Who Says Ni N!, OUT numbered................. | |
| ID: 1082332 · | |
|
I was not suggesting a return to the 11 second delay - just somethin' shorter than 5 minutes! (like 2-3 minutes...) | |
| ID: 1082387 · | |
|
The procedure to increase the delay was kind of successful in that it decreased the entropy on the machinery as well as allowed for bandwidth to be opened up. Newer machines have been added to the mix in the lab, but the bandwidth is, if I'm not mistaken, the same. The planned three day outages may be a thing of the past, but occasional outages will still occur for various reasons. Don't forget that not all the servers were changed out, and plenty of stuff is happening in the background. It's not a good idea to fix something that isn't broken anyway. I consider the delay a real non-issue. If you really want to connect sooner you can always force the issue manually with the update button. | |
| ID: 1082412 · | |
|
The 5-minute delay was to reduce the load on jocelyn when we were waiting for the new servers to be spec'ed, ordered, and installed. | |
| ID: 1082427 · | |
The 5-minute delay was to reduce the load on jocelyn when we were waiting for the new servers to be spec'ed, ordered, and installed. But given the load we're putting in the system at the moment - with the cricket graph maxxed out for seven of the last eight hours - I'd suggest that it would be wise to keep things tamped down for the time being. Remember that the problem was scheduler request files fighting their way through the upload and download traffic, and the result - at that time - was ghost WUs, which certainly caused more problems than they were worth. Now, with the more powerful servers proving themselves capable of handling the limited number of 'resend lost results', things are running a lot smoother - but I see no benefit in increasing the number of lost results needing to be resent. And I haven't seen any sign myself, or heard any complaints from the boards, suggesting that a five-minute delay is too long. Most regular posters will be running caches measured in days, not even hours - having to re-request two or three times isn't going to make any noticable difference to them. | |
| ID: 1082458 · | |
|
No complaints from me about a lousy 5mins. | |
| ID: 1082460 · | |
|
I say leave be on the 5 Min delay for another 3-4 weeks to get a better feel on how things work when we have no problems. We have no need to rush into things. I say we take it slow before making changes. | |
| ID: 1082474 · | |
The 5-minute delay was to reduce the load on jocelyn when we were waiting for the new servers to be spec'ed, ordered, and installed. This is my feeling as well. The 5 minute delays certainly haven't hurt me. My 5 machines have a combined RAC of over 80,000, putting me at over 1000 WU's a day on average. I'm having no problem at all keeping their caches full. Even Todd with his 600,000+ RAC (Easily over 7500 WU's a day) never has complained about keeping the cache topped off, even with the 5 min delays. The only thing limited by that 5 minutes is requesting new work. ____________ | |
| ID: 1082494 · | |
I say leave be on the 5 Min delay for another 3-4 weeks to get a better feel on how things work when we have no problems. We have no need to rush into things. I say we take it slow before making changes. I agree. I don't have any problems, however, I'm only running one little computer. I don't know about the people that are running several machines. I get enough work to keep me going for about 5 days, and so far, it's running nice and smooth. | |
| ID: 1082495 · | |
|
Another here running only a little computer and things couldn't be better. Before boinc would try to connect to the server what seemed like once a minute for HOURS at a time with no success, now if it can't connect to the server it will try again five minutes later and I get another task. Woo and yay. | |
| ID: 1082525 · | |
|
Kibble (KB7TIB) wrote: ... Yes, but a request for work forced that way will get a "Not sending work - last request too recent: xxx sec" message if there is work available. A few of the top computers are able to do several tasks in 5 minutes, but not more than they are likely to get from successful requests at that interval. As hardware improves there will indeed come a time when top computers won't be able to be fully productive without a reduction of that setting. But IMO the time to consider reducing it will be after the available download bandwidth is increased, even with the 5 minute interval the 100 Mbps download link we have now is often saturated. Joe | |
| ID: 1082550 · | |
My 5 machines have a combined RAC of over 80,000, putting me at over 1000 WU's a day on average. I'm having no problem at all keeping their caches full. Even Todd with his 600,000+ RAC (Easily over 7500 WU's a day) never has complained about keeping the cache topped off, even with the 5 min delays. The only thing limited by that 5 minutes is requesting new work. I notice that you are running a number of GTX 460s. Are you happy with them, generally? Looks like you are clocking-in at about 20k RAC, which is higher than I thought that card's production would be. ____________ | |
| ID: 1082904 · | |
|
I'm very happy with the 470 & 480 FERMIs, you can run more then 1 WU at a time, | |
| ID: 1082914 · | |
|
Here's my situation, and why I'd like that 5 min delay reduced to 2 (or less ;-) ) | |
| ID: 1082987 · | |
My 5 machines have a combined RAC of over 80,000, putting me at over 1000 WU's a day on average. I'm having no problem at all keeping their caches full. Even Todd with his 600,000+ RAC (Easily over 7500 WU's a day) never has complained about keeping the cache topped off, even with the 5 min delays. The only thing limited by that 5 minutes is requesting new work. For the money, I don't think they can be beat. Maybe once we see FERMI specific opt apps come out it'll will spread the field, but for now I'm very please. The cards seem to be good for about 16-20k each. My triple machine was up over 50k and climbing before the outage a few weeks ago, still trying to get back to that point. My single machine (Q8300) has been stable around 22k for awhile, I believe about 4k of that coming from the CPU. I'd expect my triple machine to peak around the upper 50k's. ____________ | |
| ID: 1083027 · | |
Here's my situation, and why I'd like that 5 min delay reduced to 2 (or less ;-) ) Surely you're not still on dial-up, are you??? Do you have some reason, other than habit, for not allowing your computers to connect to SETI whenever they feel the need? (I know some people don't like to leave their internet connection on when they're not actively using it. My modem and router are in the basement, stuck up in the floor joists, because it's the most convenient place for connections to power, the outside feed, and internal network cables, but it means they're on 24/7... which is fine for the machine running BOINC and my Radio Reference feed.) My only suggestion, and I'm sure I don't need to make it to you, would be to increase your cache size so that if you miss a day, you'll still have enough work to carry you through for another day. David ____________ David Sitting on my butt while others boldly go, Waiting for a message from a small furry creature from Alpha Centauri. | |
| ID: 1083216 · | |
Message boards : Technical News : School (Feb 22 2011)
| Copyright © 2013 University of California |