Message boards :
Number crunching :
Panic Mode On (20) Server problems
Message board moderation
Previous · 1 . . . 6 · 7 · 8 · 9 · 10 · 11 · 12 . . . 15 · Next
| Author | Message |
|---|---|
arkayn Send message Joined: 14 May 99 Posts: 4438 Credit: 55,006,323 RAC: 0
|
|
Bill Walker Send message Joined: 4 Sep 99 Posts: 3868 Credit: 2,697,267 RAC: 0
|
I wish I could turn off networking, but I have another project that needs networking active. I'll second that. I didn't want to interupt my other projects, especially when it looked like I might run out of SETI. When I had the time over the last few days, I would turn off network activity until one of my non-SETI jobs had some uploading or downloading to do, and then turn it back on long enough for the other project to get new work. It would be nicer if BOINC could handle this without my attention. Note to those who run BOINC - this may look like a SETI specific request today, but the other projects could get into the same situation as SETI someday, if they are lucky ;).
|
|
BarryAZ Send message Joined: 1 Apr 01 Posts: 2580 Credit: 16,982,517 RAC: 0
|
Agreed -- SETI isn't the only project that could use a project specific network off switch. My own read with the upload server (opinion only) is that with the large backlog of upload requests from the past 48 to 72 hours, even if it is running well at the server level, it is simply getting swamped big time. Sort of like the typical Tuesday outage congestion (with a typical server outage of 4 to 6 hours, I expect a congestion problem of 8 to 12 hours), only since the upload outage was so long, recovery will be proportional to that. Perhaps it will clear by the weekend, perhaps by Monday in time for the next outage. I wish I could turn off networking, but I have another project that needs networking active.
|
rebest ![]() Send message Joined: 16 Apr 00 Posts: 1296 Credit: 45,357,093 RAC: 0
|
Now that's the smartest idea I've seen all day! :) Join the PACK! |
|
BarryAZ Send message Joined: 1 Apr 01 Posts: 2580 Credit: 16,982,517 RAC: 0
|
Actually, other projects have run into problems for which a project specific network activity off switch would have reduced connect error messages at the user level and reduced network traffic to the specific project. I've seen outages over the years at Climate, Einstein, Rosetta, Spinhenge and others. These are outages where completed work (or in Climate's case trickles), is stuck in transfer mode and suspending the project won't stop the client from trying to connect. Further there are cases where you want to continue processing for a project (you still have a queue) but don't want to generate traffic to a specific project. Note to those who run BOINC - this may look like a SETI specific request today, but the other projects could get into the same situation as SETI someday, if they are lucky ;).
|
|
1mp0£173 Send message Joined: 3 Apr 99 Posts: 8423 Credit: 356,897 RAC: 0
|
Note to those who run BOINC - this may look like a SETI specific request today, but the other projects could get into the same situation as SETI someday, if they are lucky ;). One of the incredibly cool things about BOINC is that it costs almost nothing to do distributed computing. So it is entirely possible for an individual to run a fair sized project, funded out of their own pocket, on a home internet connection. If you do that, on small servers with a cheap DSL line, it is very possible to run into exactly the same corner, just on a dramatically smaller scale. |
[AF>Libristes] erik Send message Joined: 30 Jul 07 Posts: 19 Credit: 4,016,114 RAC: 0
|
I think I understood SETI uploads problems. We must let time for time. Basically how can I interrupt SETI network communications without interrupt all BOINC network communications. I make that manually but now I want to go to bed. just a poet |
Richard Haselgrove ![]() Send message Joined: 4 Jul 99 Posts: 14690 Credit: 200,643,578 RAC: 874
|
Actually, other projects have run into problems for which a project specific network activity off switch would have reduced connect error messages at the user level and reduced network traffic to the specific project. I've seen outages over the years at Climate, Einstein, Rosetta, Spinhenge and others. These are outages where completed work (or in Climate's case trickles), is stuck in transfer mode and suspending the project won't stop the client from trying to connect. Further there are cases where you want to continue processing for a project (you still have a queue) but don't want to generate traffic to a specific project. Hic! Being a little dis-inhibited, I'm going to reveal that when I turned off networking before going to the pub, I turned off networking for this project only: the code exists (no, I didn't write it) and I'm testing it. I found another (small, cosmetic-only) bug this week, which I haven't reported to the author yet, but apart from that I believe it's nearly ready to submit to BOINC as a ready-made patch. When that time comes, I hope you'll all lobby for trac [trac]#139[/trac] to be actioned. It really helps. |
Bill Walker Send message Joined: 4 Sep 99 Posts: 3868 Credit: 2,697,267 RAC: 0
|
I would like to hereby officially lobby for Trac 139 to be actioned.
|
|
Josef W. Segur Send message Joined: 30 Oct 99 Posts: 4504 Credit: 1,414,761 RAC: 0
|
Samdani 15 Jul 2009 16:52:34 UTC It certainly makes sense but how about relaxing the linking rule under special circumstances like this. Would that be a feasible option? Samdani 15 Jul 2009 17:14:34 UTC Ok. I give up. There seems no room for any creative ideas :) Don't give up so quickly or easily. The responses you got all assumed by "relaxing" you meant "rescinding". There are certainly reasons to think about that 2X CPUs uploading threshold which was hard coded into BOINC 5.8.6 and hasn't changed since. Now BOINC supports CUDA and multithreaded processing, 5.8.6 didn't. Even without that, 2X is a compromise measure which probably doesn't fit any project particularly well. For S@H, upload bandwidth needed is about 5% of download bandwidth (but work requests also use the pipe into SSL). Since I've never seen more than 30 MBits/sec used going to SSL, any problems on that side have to be ascribed to transaction rates rather than bandwidth. Making the 2X adjustable per project might be useful, though it could also allow a user willing to edit client_state.xml a way around any restriction. Using a more complex formula which counts GPUs as well as CPUs, perhaps even checks the result sizes against the WU sizes in the project folder, might be another way to improve. Any way it is modified it will still impact hosts which normally are most productive before the older, slower hosts. Joe |
Pappa Send message Joined: 9 Jan 00 Posts: 2562 Credit: 12,301,681 RAC: 0
|
erik Just let it run (if you normally do overnight). It will recover with time. I think I understood SETI uploads problems. We must let time for time. Regards Please consider a Donation to the Seti Project. |
|
# Bob Ahlers # Send message Joined: 30 Mar 01 Posts: 18 Credit: 10,209,954 RAC: 0
|
I would suggest that Seti contacts Pacific Internet Exchange: Pacific Internet Exchange 200 Paul Ave. Ste. M-200 San Francisco, CA 94124 It's a 35 mins drive from Seti. Arrange hosting and data at that location (could even be free) and put their name on the Seti site. Setup an VPN or VLAN between Seti and PIE and host the big data eating servers their. This way allot of the probs disappear. They could even drive over to that data enter and bring project on a external HD for 150$ or so. Off course i don't know the details about the seti system but this can not continue as it is now, for Seti and the Crunchers. Just an idea. |
Blurf Send message Joined: 2 Sep 06 Posts: 8964 Credit: 12,678,685 RAC: 0
|
I would suggest that Seti contacts Pacific Internet Exchange: Thank you for the idea. I've brought this to the attention of Matt and asked him to comment directly when he's able. |
Westsail and *Pyxey* Send message Joined: 26 Jul 99 Posts: 338 Credit: 20,544,999 RAC: 0
|
Did my part to ease upload traffic late last night... ;) NOTE: Hopefully someone can learn from my mistake.. I've heard it said a smart man can learn from his mistakes while a genius learns from others.. Anyways, so was trying to get some video stuff working right on my Tesla workstation for someone else; was late and in a hurry.. I shutdown the boinc core client to install different driver version. Nvidia tells me drivers are older than currently installed do I want to reboot..yes yes whatever, hurry up..etc etc So machine reboots, (I can hear folks groaning knowing whats coming) drivers install all is fine. About this time I load up BM only to all at once realize I needed to disable the service from restarting. When the puter rebooted with no vid drivers I lost ~600 completed WU'S instantly. This machine shares a single ethernet cable with another machine that is always plugged in, for now. So I have to manually plug it in and up/report/download 1-2 times a day. With recent difficulties It had been accumulating WU's for a number of days. Was my whole cache that were pending upload with network suspended. Currently the CPU is still working on it's ~100 MB's. lol Opps!?!....Kida forgot as I wasn't really thinking about boinc, I was trying to make a rendering program work correctly. Oh well, operator error, we live to crunch another day. Speedy and pleasant recovery to everyone. Keep crunching! edit to add: Just catching up on the thread...Here Here Richard good show; best idea I have heard yet! Chalk us up! "The most exciting phrase to hear in science, the one that heralds new discoveries, is not Eureka! (I found it!) but rather, 'hmm... that's funny...'" -- Isaac Asimov
|
|
B-Man Send message Joined: 11 Feb 01 Posts: 253 Credit: 147,366 RAC: 0
|
Actually, other projects have run into problems for which a project specific network activity off switch would have reduced connect error messages at the user level and reduced network traffic to the specific project. I've seen outages over the years at Climate, Einstein, Rosetta, Spinhenge and others. These are outages where completed work (or in Climate's case trickles), is stuck in transfer mode and suspending the project won't stop the client from trying to connect. Further there are cases where you want to continue processing for a project (you still have a queue) but don't want to generate traffic to a specific project. I agree that is a great thing. 2nd topic. Just found out that I will be going away for the weekend tomorrow. I had not planed to be away and now I will have stacked up downloads from this week that I can only hope to get in before next tuesdays outage. I have some shorties that time out on the 21st. oh well. I will be shutting down at 7AM EDT I have about 12h to upload them. |
perryjay Send message Joined: 20 Aug 02 Posts: 3377 Credit: 20,676,751 RAC: 0
|
Hey, has anyone else noticed uploads are really slow to go through? :) Only joking although I did just notice it myself. Haven't had BM open in awhile. All my MBs are done and I am holding on by crunching an AP I managed to get a couple of days ago. Looks like two of my uploads are now ready to report so maybe more of them will get through too. PROUD MEMBER OF Team Starfire World BOINC |
[AF>Libristes] erik Send message Joined: 30 Jul 07 Posts: 19 Credit: 4,016,114 RAC: 0
|
Ok I will let my computer to manage itself. But isn't it a problem if it contact server all the time for upload? Not for me, it's just generate lots of error message pages. but for SETI server? Regards. just a poet |
Pappa Send message Joined: 9 Jan 00 Posts: 2562 Credit: 12,301,681 RAC: 0
|
Everyones else machines are trying to contact the Servers and that is what Boinc was designed to do. If you let it do it in its own way things should be fine. Ok Please consider a Donation to the Seti Project. |
|
Dave Stegner Send message Joined: 20 Oct 04 Posts: 540 Credit: 65,583,328 RAC: 27
|
1050 wu to upload for my 21 machines. A number of them are due tomorrow and many more due the next day. Does anyone know if Admins can suspend due dates or will the work go overdue and effort be lost?? Dave |
|
Fred W Send message Joined: 13 Jun 99 Posts: 2524 Credit: 11,954,210 RAC: 0
|
1050 wu to upload for my 21 machines. A number of them are due tomorrow and many more due the next day. Effort will be lost only if someone can return a resend before your original gets there. Fat chance at the moment. F.
|
©2026 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.