Message boards :
Technical News :
Into the Long Weekend (Aug 31 2007)
Message board moderation
Previous · 1 · 2 · 3 · 4 · Next
Author | Message |
---|---|
n7rfa Send message Joined: 13 Apr 04 Posts: 370 Credit: 9,058,599 RAC: 0 |
Your not the only one, I have 2 system on 24/7 just to crunch but over the last week they have not been able to get work ether! So elect, hard drives and everything that wears out when you leave it on is just setting there doing nothing... I wrote about this problem the other day, it seems like any time theirs a long week end we get no work from Site to crunch.. When they should have everything running at peak and giving everyone something to do we get ZIP, notha, nutin.. At first I thought it was me or my system so I formated the drives and reinstalled everything!! 3 days later and still no work from site!! Or it could be that he has his Cahce is set too low. In all of the cases, it's not the server side that has the "problem". |
Pierre Grogan Send message Joined: 2 May 00 Posts: 19 Credit: 166,455 RAC: 0 |
I refer back to my original message 631012 1 Sep to Matt where for sometime now I have problems downloading WUs; the problem remains, today I note that it took all day to try and download one WU having been allocated its number but to no avail. It was only this evening after manually instructing it to retry communications some 4 or 5 times (I don't believe that I 'hammered') did it download that work unit and is now happily crunching away. Since I am not an expert in this and am not really overworked about it, I wouldn't mind an explanation in layman's language without too much emotion. Whilst it may be long again here is todays message string just in case you can see a common thread if you will pardon the pun. Hopefully I am still in the right message area if not please tell me which one to use. Thankyou Pierre 03/09/2007 08:15:00||Starting BOINC client version 5.10.13 for windows_intelx86 03/09/2007 08:15:00||log flags: task, file_xfer, sched_ops 03/09/2007 08:15:00||Libraries: libcurl/7.16.1 OpenSSL/0.9.8e zlib/1.2.3 03/09/2007 08:15:00||Data directory: C:\\Program Files\\BOINC 03/09/2007 08:15:02||Processor: 1 AuthenticAMD AMD Athlon(tm) XP 3000+ [x86 Family 6 Model 10 Stepping 0] 03/09/2007 08:15:02||Processor features: fpu tsc sse 3dnow mmx 03/09/2007 08:15:02||Memory: 1023.48 MB physical, 1.88 GB virtual 03/09/2007 08:15:02||Disk: 39.06 GB total, 15.54 GB free 03/09/2007 08:15:02|SETI@home|URL: http://setiathome.berkeley.edu/; Computer ID: 2968625; location: home; project prefs: default 03/09/2007 08:15:04||General prefs: from SETI@home (last modified 2007-06-21 10:51:29) 03/09/2007 08:15:04||Host location: home 03/09/2007 08:15:04||General prefs: no separate prefs for home; using your defaults 03/09/2007 08:15:04||Reading preferences override file 03/09/2007 08:15:04||Preferences limit memory usage when active to 511.74MB 03/09/2007 08:15:04||Preferences limit memory usage when idle to 921.14MB 03/09/2007 08:15:04||Preferences limit disk usage to 15.44GB 03/09/2007 08:15:06|SETI@home|[file_xfer] Started download of file 11mr07ae.20928.195622.5.5.235 03/09/2007 08:15:28|SETI@home|[file_xfer] Temporarily failed download of 11mr07ae.20928.195622.5.5.235: system connect 03/09/2007 08:15:28|SETI@home|Backing off 1 hr 11 min 47 sec on download of file 11mr07ae.20928.195622.5.5.235 03/09/2007 09:27:16|SETI@home|[file_xfer] Started download of file 11mr07ae.20928.195622.5.5.235 03/09/2007 09:27:38||Project communication failed: attempting access to reference site 03/09/2007 09:27:38|SETI@home|[file_xfer] Temporarily failed download of 11mr07ae.20928.195622.5.5.235: system connect 03/09/2007 09:27:38|SETI@home|Backing off 24 min 22 sec on download of file 11mr07ae.20928.195622.5.5.235 03/09/2007 09:27:40||Access to reference site succeeded - project servers may be temporarily down. 03/09/2007 09:52:01|SETI@home|[file_xfer] Started download of file 11mr07ae.20928.195622.5.5.235 03/09/2007 09:52:23||Project communication failed: attempting access to reference site 03/09/2007 09:52:23|SETI@home|[file_xfer] Temporarily failed download of 11mr07ae.20928.195622.5.5.235: system connect 03/09/2007 09:52:23|SETI@home|Backing off 1 hr 20 min 35 sec on download of file 11mr07ae.20928.195622.5.5.235 03/09/2007 09:52:24||Access to reference site succeeded - project servers may be temporarily down. 03/09/2007 11:12:59|SETI@home|[file_xfer] Started download of file 11mr07ae.20928.195622.5.5.235 03/09/2007 11:13:21||Project communication failed: attempting access to reference site 03/09/2007 11:13:21|SETI@home|[file_xfer] Temporarily failed download of 11mr07ae.20928.195622.5.5.235: system connect 03/09/2007 11:13:21|SETI@home|Backing off 40 min 34 sec on download of file 11mr07ae.20928.195622.5.5.235 03/09/2007 11:13:23||Access to reference site succeeded - project servers may be temporarily down. 03/09/2007 11:53:56|SETI@home|[file_xfer] Started download of file 11mr07ae.20928.195622.5.5.235 03/09/2007 11:54:18||Project communication failed: attempting access to reference site 03/09/2007 11:54:18|SETI@home|[file_xfer] Temporarily failed download of 11mr07ae.20928.195622.5.5.235: system connect 03/09/2007 11:54:18|SETI@home|Backing off 2 hr 57 min 19 sec on download of file 11mr07ae.20928.195622.5.5.235 03/09/2007 11:54:20||Access to reference site succeeded - project servers may be temporarily down. 03/09/2007 14:51:38|SETI@home|[file_xfer] Started download of file 11mr07ae.20928.195622.5.5.235 03/09/2007 14:52:00||Project communication failed: attempting access to reference site 03/09/2007 14:52:00|SETI@home|[file_xfer] Temporarily failed download of 11mr07ae.20928.195622.5.5.235: system connect 03/09/2007 14:52:00|SETI@home|Backing off 2 hr 20 min 1 sec on download of file 11mr07ae.20928.195622.5.5.235 03/09/2007 14:52:02||Access to reference site succeeded - project servers may be temporarily down. 03/09/2007 17:12:02|SETI@home|[file_xfer] Started download of file 11mr07ae.20928.195622.5.5.235 03/09/2007 17:12:25||Project communication failed: attempting access to reference site 03/09/2007 17:12:25|SETI@home|[file_xfer] Temporarily failed download of 11mr07ae.20928.195622.5.5.235: system connect 03/09/2007 17:12:25|SETI@home|Backing off 2 hr 58 min 16 sec on download of file 11mr07ae.20928.195622.5.5.235 03/09/2007 17:12:26||Access to reference site succeeded - project servers may be temporarily down. 03/09/2007 18:20:21|SETI@home|[file_xfer] Started download of file 11mr07ae.20928.195622.5.5.235 03/09/2007 18:20:43||Project communication failed: attempting access to reference site 03/09/2007 18:20:43|SETI@home|[file_xfer] Temporarily failed download of 11mr07ae.20928.195622.5.5.235: system connect 03/09/2007 18:20:43|SETI@home|Backing off 1 hr 22 min 56 sec on download of file 11mr07ae.20928.195622.5.5.235 03/09/2007 18:20:45||Access to reference site succeeded - project servers may be temporarily down. 03/09/2007 18:20:45|SETI@home|[file_xfer] Started download of file 11mr07ae.20928.195622.5.5.235 03/09/2007 18:22:08|SETI@home|[file_xfer] Finished download of file 11mr07ae.20928.195622.5.5.235 03/09/2007 18:22:08|SETI@home|[file_xfer] Throughput 4591 bytes/sec 03/09/2007 18:22:09|SETI@home|Starting 11mr07ae.20928.195622.5.5.235_0 03/09/2007 18:22:10|SETI@home|Starting task 11mr07ae.20928.195622.5.5.235_0 using setiathome_enhanced version 527 |
1mp0£173 Send message Joined: 3 Apr 99 Posts: 8423 Credit: 356,897 RAC: 0 |
I refer back to my original message 631012 1 Sep to Matt where for sometime now I have problems downloading WUs; the problem remains, today I note that it took all day to try and download one WU having been allocated its number but to no avail. It was only this evening after manually instructing it to retry communications some 4 or 5 times (I don't believe that I 'hammered') did it download that work unit and is now happily crunching away. Since I am not an expert in this and am not really overworked about it, I wouldn't mind an explanation in layman's language without too much emotion. Whilst it may be long again here is todays message string just in case you can see a common thread if you will pardon the pun. Hopefully I am still in the right message area if not please tell me which one to use. Layman's explanation: Picture a road that is 100 lanes wide. There is an intersection where fifty roads come together, each road is ten lanes wide. If the fifty roads aren't very congested, everyone can merge into the big road easily. When the fifty roads are bumper-to-bumper, then it takes a long time for everyone to get on the big road. The fifty roads are the internet, cars on those fifty roads are the SETI users, and the 100 lane road is the SETI server farm. If you just wait patiently in the traffic jam, you'll eventually get on the 100 lane road. |
KWSN THE Holy Hand Grenade! Send message Joined: 20 Dec 05 Posts: 3187 Credit: 57,163,290 RAC: 0 |
Not sure this is the right forum, but the "client connection statistics" script is broken, AGAIN... . Hello, from Albany, CA!... |
Contact Send message Joined: 16 Jan 00 Posts: 198 Credit: 2,249,004 RAC: 0 |
|
Kenn Benoît-Hutchins Send message Joined: 24 Aug 99 Posts: 46 Credit: 18,091,320 RAC: 31 |
Both computers are using the lastest apps. The ever increasing pending credit is on the newest computer - Core 2 Duo. The G4 is happily munching away, albeit at a slower rate. KENN Has anyone else had there pending credit increase dramatically. Kenn What is left unsaid is neither heard, nor heeded. Ce qui est laissé inexprimé ni n'est entendu, ni est observé. |
anders n Send message Joined: 26 Jul 99 Posts: 69 Credit: 916,751 RAC: 0 |
Both computers are using the lastest apps. The ever increasing pending credit is on the Should be better now. Wu-s validate as they should when looking at your latest results. Happy cruching Anders n |
Pierre Grogan Send message Joined: 2 May 00 Posts: 19 Credit: 166,455 RAC: 0 |
[/quote] Layman's explanation: Picture a road that is 100 lanes wide. There is an intersection where fifty roads come together, each road is ten lanes wide. If the fifty roads aren't very congested, everyone can merge into the big road easily. When the fifty roads are bumper-to-bumper, then it takes a long time for everyone to get on the big road. The fifty roads are the internet, cars on those fifty roads are the SETI users, and the 100 lane road is the SETI server farm. If you just wait patiently in the traffic jam, you'll eventually get on the 100 lane road.[/quote] Thank you Ned Ludd, were the symptoms similar, I might accept your explanation but it does not explain why I have been travelling down this 100 lane road quite successfully since 1999 and have, apart from the various notified outages, quite happily downloaded workunits immediately at the point that my preferences schedule a download and those downloads have not stopped partway. However, since the 23 Aug this year each time a scheduled download should have happened, the computer, using BOINC, goes to get a workunit, gets its title and starts the download then fails and has to wait to reconnect. This is not a symptom of overcrowding by the merging of many roads because I have already got to my destination this is a symptom of a failure at the destination whereby an operation is started then terminated; consequently, I have to go back down the road again. The net effect is that seti@home is losing much of my computing power that I am happily donating which strikes me as being a waste for them. I don't have any worries about that I am quite happily building a swimming pool in the interim. When I said, I am not an expert in this, I mean't that I am not an expert in BOINC, SETI and its network connections etc. However, I did start my now retired life as a systems analyst designing company intranets in 1976 when telephone lines carried 2400bps. But I do appreciate your explanation thank you. Pierre |
Juha Send message Joined: 7 Mar 04 Posts: 388 Credit: 1,857,738 RAC: 0 |
However, since the 23 Aug this year each time a scheduled download should have happened, the computer, using BOINC, goes to get a workunit, gets its title and starts the download then fails and has to wait to reconnect. This is not a symptom of overcrowding by the merging of many roads because I have already got to my destination this is a symptom of a failure at the destination whereby an operation is started then terminated; consequently, I have to go back down the road again. Ned's explanation was pretty good, but, in order to keep it simple I guess, he left a couple of important details out. Every time BOINC needs to do anything with the servers it has to do the trip from the beginning. BOINC can combine multiple scheduler operations, eg. asking for more work and reporting finished results, into one trip. But this does not apply to downloads and uploads. That is, it takes one trip to have the server assign work to your computer and then it takes second trip to retrieve that work to your computer. Why is this way? The staff has decided to divide tasks to multiple servers in order to cut down the load each server has to bear. There is a server for scheduler, a server for downloads, a server for uploads. (Well, actually, for some tasks they have multiple servers.) Why it is easier to have work assigned to computer than it is to download it? Most likely the server in which the scheduler is located is a bit less busy. Since we're talking about roads you could think that to carry what ever scheduler says to you you need only a small car. Workunits on the other hand are so much larger that in order to carry those you'll need a truck. Trucks take much more space in roads than small passanger cars. Adding more details, each of those trips can carry only one workunit and to one direction only. This also applies to the science app and any files it needs. (Since there might be less computer savvy people reading this I kept the road trip metaphor.) (PS. Your avatar is quote cool.) -Juha |
Rev. Tim Olivera Send message Joined: 15 Jan 06 Posts: 20 Credit: 1,717,714 RAC: 0 |
Thu no work, Fri no work, Sat no work, Sun no work, Mon at 0900 it download some work, When we went home the computer was crunching! When we came in Tus morning no work. So we uninstalled BOINC and removed all trace from the reg, we downloaded the newest ver and installed on a 250Gig drive. one houre later this is what we are getting. 9/4/2007 9:06:58 AM||Benchmark results: 9/4/2007 9:06:58 AM|| Number of CPUs: 2 9/4/2007 9:06:58 AM|| 1853 floating point MIPS (Whetstone) per CPU 9/4/2007 9:06:58 AM|| 2807 integer MIPS (Dhrystone) per CPU 9/4/2007 9:10:40 AM||Resuming computation 9/4/2007 9:10:40 AM|SETI@home|Sending scheduler request: To fetch work 9/4/2007 9:10:40 AM|SETI@home|Requesting 44928 seconds of new work 9/4/2007 9:10:45 AM|SETI@home|Scheduler RPC succeeded [server version 511] 9/4/2007 9:10:45 AM|SETI@home|Deferring communication for 1 min 0 sec 9/4/2007 9:10:45 AM|SETI@home|Reason: no work from project 9/4/2007 9:11:45 AM|SETI@home|Sending scheduler request: To fetch work 9/4/2007 9:11:45 AM|SETI@home|Requesting 44928 seconds of new work 9/4/2007 9:11:50 AM|SETI@home|Scheduler RPC succeeded [server version 511] 9/4/2007 9:11:50 AM|SETI@home|Deferring communication for 1 min 0 sec 9/4/2007 9:11:50 AM|SETI@home|Reason: no work from project 9/4/2007 9:12:50 AM|SETI@home|Sending scheduler request: To fetch work 9/4/2007 9:12:50 AM|SETI@home|Requesting 44928 seconds of new work 9/4/2007 9:12:55 AM|SETI@home|Scheduler RPC succeeded [server version 511] 9/4/2007 9:12:55 AM|SETI@home|Deferring communication for 1 min 0 sec 9/4/2007 9:12:55 AM|SETI@home|Reason: no work from project So in the last 6 days we got maybe 10hr worth of work.. And if it sounds like i'm bitching I AM.. When folks volunteer you shouldn't have them standing around with there thumbs up there butt. An yes after 24 years in the US Army I get just a little pissed off when I see 12hr work down in 6 days... So I will continue to bitch and complain as long as a dual 3.6GIG machine is doing nothing but sucking up power. Tim Olivera http://mysite.verizon.net/moment_1/ |
KWSN THE Holy Hand Grenade! Send message Joined: 20 Dec 05 Posts: 3187 Credit: 57,163,290 RAC: 0 |
Thu no work, Fri no work, Sat no work, Sun no work, Mon at 0900 it downloaded some work, When we went home the computer was crunching! When we came in Tus morning no work. So we uninstalled BOINC and removed all trace from the reg, we downloaded the newest ver(sion) and installed on a 250 Gb drive. One hour later this is what we are getting. 1) I'm not sure why you didn't get work from S@H on Saturday or Sunday... work was there to be gotten... check your BOINC setup. 2) Keep your log extracts to the minimum necessary to illustrate your point...space on S@H's hard drives is not unlimited! 3) When you see the line (in your log extract, in bold, "no work from project", check the "server status" page of S@H's web site - if "Results ready to send" is zero; then there is no work... (this regularly happens on the Tuesday of a three day weekend [USA holiday on a Monday...] due to the splitters running out of work!) 4) As others may point out, there is no guaruntee that there will be work from any BOINC project - if ya wanna keep your "dual 3.6GIG machine" warm, join more than one project... which spreads the load. Also up your cache to two days minimum - that should keep ya crunchin' through minor periods of "no work from project"... . Hello, from Albany, CA!... |
Dr. C.E.T.I. Send message Joined: 29 Feb 00 Posts: 16019 Credit: 794,685 RAC: 0 |
|
John McLeod VII Send message Joined: 15 Jul 99 Posts: 24806 Credit: 790,712 RAC: 0 |
Requesting work and reporting work require database access. Each DB access is expensive in some server resources which is why they are batched together. File upload and downloads do not need database access and there is no benefit to batching them together. Also, reporting work and requesting more is one file sent up to the server. The assignment of new work is one file sent back to the client. BOINC WIKI |
1mp0£173 Send message Joined: 3 Apr 99 Posts: 8423 Credit: 356,897 RAC: 0 |
At the same time, under the same conditions, I've not run short of work. Why? You do the math. |
n7rfa Send message Joined: 13 Apr 04 Posts: 370 Credit: 9,058,599 RAC: 0 |
So in the last 6 days we got maybe 10hr worth of work.. And if it sounds like i'm bitching I AM.. When folks volunteer you shouldn't have them standing around with there thumbs up there butt. An yes after 24 years in the US Army I get just a little pissed off when I see 12hr work down in 6 days... So I will continue to bitch and complain as long as a dual 3.6GIG machine is doing nothing but sucking up power. That's why we have alternate projects to keep us busy. When I start to run out of work with no prospect for more work in the near future, I activate other projects. Look into it. |
Kenn Benoît-Hutchins Send message Joined: 24 Aug 99 Posts: 46 Credit: 18,091,320 RAC: 31 |
Ned: Nor did I run out of work either over the past long weekend. My pending credit also shrank today by a considerable margin. Kenn
Kenn What is left unsaid is neither heard, nor heeded. Ce qui est laissé inexprimé ni n'est entendu, ni est observé. |
Orlando Send message Joined: 24 Aug 07 Posts: 1 Credit: 1,948,882 RAC: 0 |
Ned: Quite True Tim. I have been waiting for something to come through but .... yep, that's right, nothing. I have been wondering why there are so may problems with SETI. I mean, I work for a large organization that processes millions of transaction everyday. And I mean millions literally and in 17 years I have never seen a single day when the system has been down other than for upgrades and the normal cleaning. And it only last a couple of hours every now and then. I joined SETI some 10 years ago and left it because of all the problems it had. Now I am back thinking that all should be OK but I find that nothing has really changed. The same problems of a decade ago are still there. Isn't it time they got a real computer guy? so that tese problems are fixed?. GUYS GUYS OF TEAM SETI, do something about all this crap. I dedicate 2 machines to this project and it upsets me to have them doing abosluty nothing with their time. Orlando |
Henk Haneveld Send message Joined: 16 May 99 Posts: 154 Credit: 1,577,293 RAC: 1 |
I have been waiting for something to come through but .... yep, that's right, nothing. I have been wondering why there are so may problems with SETI. I mean, I work for a large organization that processes millions of transaction everyday. And I mean millions literally and in 17 years I have never seen a single day when the system has been down other than for upgrades and the normal cleaning. And it only last a couple of hours every now and then. I joined SETI some 10 years ago and left it because of all the problems it had. Now I am back thinking that all should be OK but I find that nothing has really changed. The same problems of a decade ago are still there. Isn't it time they got a real computer guy? so that tese problems are fixed?. GUYS GUYS OF TEAM SETI, do something about all this crap. I dedicate 2 machines to this project and it upsets me to have them doing abosluty nothing with their time. Large organization have large budgets. Seti lives on donations. How much have you donated to keep it going? |
Keith T. Send message Joined: 23 Aug 99 Posts: 962 Credit: 537,293 RAC: 9 |
... SETI@home is a Science project not a business. The servers are donated by various companies and individuals. The bandwidth is paid for by donations. There are plenty of other BOINC projects to run when SETI is unavailable. [edit]SETI@home has not been open to the public for 10 years yet. Registration started on March 1 1999. http://seticlassic.ssl.berkeley.edu/classpages/[/edit] Sir Arthur C Clarke 1917-2008 |
KWSN - Sir William The Flagrantly Verbose Send message Joined: 28 Apr 00 Posts: 829 Credit: 11,757,678 RAC: 0 |
Ned: As others have stated, SETI@Home is a scientific endeavor run by an education institution. UC Berkeley may be more well endowed than some Universities, but even that is far short of the capital budgets available to corporate IT departments whose executive Managers DEMAND 99.999% uptime and reliability. With all of that in mind, this is a real labor of love for the SETI@Home staff and volunteers, and it offends me to hear someone denigrating them as not "real computer guys". As a "real computer guy" I know what it's like to work with legacy and hand-me-down hardware in a production environment, and that's without having to support an evolving scientific research environment where your next project might literally never have been done before! Stop griping and whip out the checkbook if you are seriously committed to helping this project, or take your CPU cycles and find another playground! Bill Finch, MCSE, MCDBA, MCT, CCNA Senior Database Administrator LCG Technologies, Inc. |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.