Message boards :
Number crunching :
Why classic SETI@home is closing down and other facts of life.
Message board moderation
Previous · 1 . . . 3 · 4 · 5 · 6 · 7 · 8 · 9 . . . 15 · Next
Author | Message |
---|---|
Lost_But_Maken_Headway Send message Joined: 27 Nov 05 Posts: 7 Credit: 9,945 RAC: 0 ![]() |
.........Computer is fine Andy, Software or install of that package, as this appeared during off hours, idle. Notebook P25-509 running virtually 24/7 for two years last month. No known BIOS updates no problems on other software in use. Appreciate the response Andy: Video-NVIDIA, GeForce FX Go5200 has no known issues to me but will do some checking. Hardware runs AutoCad, Development software and anything I have thrown at it to date. XP Home SP2 + latest updates and is current. Anti-Virus; Normans, CA Pest Patrol and Microsoft AntiSpyware Version: 1.0.701. Only today did this error appear. Will check updates. |
![]() ![]() Send message Joined: 9 Nov 04 Posts: 44 Credit: 503,405 RAC: 0 ![]() |
...If there is one thing in a message remotely critical of BOINC, some butt heads over here give the message a negative rating. Really nice and friendly over here. Yup, real tolerant, too. I've noticed that the people that scream the loudest for tolerance throw it out the window a whole lot faster when someone might even appear to even think something counter to their view. So, go ahead, give me a minus, hell, give 20 or 30. The forum I hang out on doesn't have stupid little plus or minus thingies, if someone there disagrees with you they'll just come right out and say it and get it over with so both can just go on with life. If that sounds like a place you'd rather be, here's the link: http://p101.ezboard.com/bsetiathomeclassic BTW, we don't only talk about Classic, that's just a name. After all, "Whats in a name? A rose by any other name still smells as sweet". ![]() |
![]() ![]() Send message Joined: 1 Mar 99 Posts: 1444 Credit: 957,058 RAC: 0 ![]() |
Some random clarifications: I. The faulty exponential backoff is a well known problem and is being worked on, especially in light of last week's issues. II. Clarification about funding: There is some money for SETI@home hardware/data recording, mostly because this technology can be used (and is used) for various non-SETI projects. But there is no money for SETI@home Classic data analysis or day-to-day operations. III. Regarding our "poor planning," some days around here are like working on the bridge of the Enterprise during the last ten minutes of any given episode where the ship is under attack or about to self destruct and there's the stock tension music and everything. It's not exactly fun. There is, on average, about 1.5 people in the lab at any given time. If something breaks, you don't think, "Hmm. Let me load up Xfig and draw a diagram then call for a 2 o'clock meeting where we can discuss this matter." It's more like: "Hmm. Every second we're pissing off 10 people. What's the quickest way I can fix this damn thing so I can get back to writing my grant proposal." IV. We wrote about the outages less and less in the old SETI@home Classic tech news as time wore on. Same thing every week. Online cience database crashed. Server down for 12-24 hours. Everything's back up now. Should have wrote a script that put such a message up on the front page every five days or so. I don't know if anybody's noticed but the Classic servers have already crashed three times already this month. Had to kick 'em back to life every time. In fact, I think they're dropping connections right now... - Matt -- BOINC/SETI@home network/web/science/development person -- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude |
![]() Send message Joined: 3 Sep 04 Posts: 248 Credit: 183,721 RAC: 0 ![]() |
In fact, I think they're dropping connections right now... Just to reassure you that I never saw S@H Classic down! I think that downtime is all part of the global IT conundrum. Live with it! There's not much else you can do! ![]() |
![]() ![]() Send message Joined: 28 Apr 00 Posts: 579 Credit: 130,733 RAC: 0 ![]() |
If the fault of write address is correct at "0x00000010" that is interrupt table of the X86 chip design. Program should never access this area by Intel original design spec. from memory of design in the mid-70's. Actually on NT Based platforms, the first 64k(?) of a processes address space is nulled out and both read and write access is denied. Which OS are you using? ----- Rom BOINC Development Team, U.C. Berkeley My Blog |
DomainMonkee.com Send message Joined: 10 Feb 02 Posts: 1 Credit: 5,364,842 RAC: 25 ![]() ![]() |
Props to Matt. I don't know why people think technology (especially computers)is perfect right out of the box. Guess they never had to return anything. I'm just glad that some of the Seti tech payatention to these boards so the "Dee dee dee" can get some explination to help them untangle their panties!! |
rsisto Send message Joined: 30 Jul 03 Posts: 135 Credit: 729,936 RAC: 0 ![]() |
Man, oh man. Am I ever tired of reading about this bitching. It started as an interesting topic then digressed. Let me re-group for myself: Here you can donate ![]() |
![]() ![]() Send message Joined: 3 Apr 99 Posts: 9659 Credit: 251,998 RAC: 0 |
Matt, I took myself the liberty to spawn your thread in the Cafe, so there's some comments to you also there. And a comment to the assertion made earlier in this thread, that classic stats can't be shown anywhere, it's not true! Sigs showing Classic stats can be made here among other places! Else, thanks to you all, some of us actually never doubted you all, that you again would make this ship sail! .oO(Maybe time to make a new money donation with the expressed command that you use the money for beers and a night out to celebrate your good work! :-D Just as a pat on your shoulders!!!) EDIT: And, Matt, I hope you had a nice weekend in Los Angeles, and that you, Rom, had a nice trip to Australia! :-) Good to see you both back though! ;-) "I'm trying to maintain a shred of dignity in this world." - Me ![]() |
![]() Send message Joined: 11 Oct 00 Posts: 21 Credit: 10,388,338 RAC: 8 ![]() |
mmciastro & ML1, Thanks for your kind welcome. The reason I never went to the seti home page over the last two years was because I expected anything I needed to know about (ET found, new client, alteration in DC functioning, etc.) would be emailed to me as I had allowed contacting me via email. With SetiDriver, I chose to 100 WUs in stock in case of outage. SetiSpy was added to my computer's startup menu and ran 100% of the time. Because of these two programs, any question as to my team progress or WUs crunched was immediately accessable. As such I never felt the need to check the home page and knew the project was functioning. That's prologue now & BOINC does seem to be doing its job nicely and the temporary hiccup seti@home was having is over. I will... look at the home page... more frequently from now on. :) I do have to admit that I liked the simplicity of the old system but the new one is obviously working and offers many nice options. Kudos to Matt et al, Gary |
Terry Send message Joined: 17 Sep 00 Posts: 153 Credit: 1,805,202 RAC: 0 ![]() |
mmciastro & ML1, welcome to a fellow ham operator. Terry k4vh |
![]() ![]() Send message Joined: 14 Jul 03 Posts: 3224 Credit: 4,603,826 RAC: 0 ![]() |
mmciastro & ML1, There are a few of us out there. 73's N9ICD ![]() My movie https://vimeo.com/manage/videos/502242 |
KB7RZF ![]() Send message Joined: 15 Aug 99 Posts: 9549 Credit: 3,308,926 RAC: 2 ![]() |
mmciastro & ML1, Yes there is. :-) Jeremy KB7RZF ![]() |
![]() ![]() Send message Joined: 31 May 02 Posts: 6 Credit: 652,414 RAC: 1 ![]() |
I personally have over 100,000 hours in computer time being donated under the seti classic project and after reading this set of “Here are some reasons off the top of my head that SETI@home Classic is shutting down†I feel a response is necessary to some of your points of perpetual misinformation, this all too typical of the computing environment. Go suspend Windows (I did so). I am happy crunching with MacOSX now with REAL 64bit power in hardware AND OS ! ![]() |
Lost_But_Maken_Headway Send message Joined: 27 Nov 05 Posts: 7 Credit: 9,945 RAC: 0 ![]() |
If the fault of write address is correct at "0x00000010" that is interrupt table of the X86 chip design. Program should never access this area by Intel original design spec. from memory of design in the mid-70's. OS Name Microsoft Windows XP Home Edition Version 5.1.2600 Service Pack 2 Build 2600 OS Manufacturer Microsoft Corporation System Name PORTABLE_2 System Manufacturer TOSHIBA System Model Satellite P25 System Type X86-based PC Processor x86 Family 15 Model 2 Stepping 9 GenuineIntel ~2793 Mhz Processor x86 Family 15 Model 2 Stepping 9 GenuineIntel ~2793 Mhz BIOS Version/Date TOSHIBA V1.80, 3/9/2004 SMBIOS Version 2.31 Windows Directory C:\\WINDOWS System Directory C:\\WINDOWS\\system32 Boot Device \\Device\\HarddiskVolume1 Locale United States Hardware Abstraction Layer Version = "5.1.2600.2180 (xpsp_sp2_rtm.040803-2158)" User Name PORTABLE_2\\xxxxxxx Time Zone Eastern Standard Time Total Physical Memory 512.00 MB Available Physical Memory 105.03 MB Total Virtual Memory 2.00 GB Available Virtual Memory 1.96 GB Page File Space 1.22 GB Page File C:\\pagefile.sys |
Jack Gulley Send message Joined: 4 Mar 03 Posts: 423 Credit: 526,566 RAC: 0 ![]() |
Actually, Boinc is a part of the problem! The communication methodology that Boinc uses is definitely contributing to the bandwidth issues. I looked into the libcurl that BOINC uses to manage its Internet communications between the client and servers. And a lot of Ethereal traces. It is quite efficient and there is not a bandwidth problem with the physical communications link. The "bandwidth" problem is inside the servers (as we now know from the solution found to this recent outage.) I found the Ethernet side of BOINC's communications methodology to be quite suitable for large scale projects like this. Having a huge number of clients pounding on it should not really be a problem at all, as it can and does make an efficient connections with each of them, and just lets most of them wait and timeout (instead of coming right back at it with another request). Dropping excess connections is one of the subtle features of the communications methodology used by libcurl and BOINC. It allows them to be dropped and after timing out or backed off, retried later. This seems to have been working very well during the outage, except for some minor back-off issues. The fact that next to nothing was getting transfered had nothing to do with the communications methodology used by the BOINC system and the BOINC Manager client software. When the bottleneck was "fixed" in the server, uploads proceeded at a rate and efficiency never before seen in the BOINC/Seti project, in spite of the very large number of systems trying to upload. Sure, there was a big bump in network traffic, but the data rate no where near stressed the 100Mbps Ethernet link into the lab. (15Mbps in, 40Mbps out and that includes the background Seti Classic traffic that was going on {2M/12M}) So, "The communications methodology that BOINC uses is definitely" NOT "contributing to the bandwidth issues." Can't say the same for for the Seti server(s) and their programming. Let's slow down a minute. There are some of us out here that are somewhat crusty systems people. There is obviously an architecture problem that does not seem to be addressed. If anything, the combination of Seti and Boinc is clearly showing some of Boinc's limitations. It's a volume thing that the other projects have yet to experience. No. It does not show any limitations of BOINC. And yes, It's a volume thing that the other projects have yet to experience. It does however, show up very clearly the problems with some of the architecture implementation decisions made by Seti in how their servers are programmed, and that not only have they not been addressed, but prior suggestions on how to fix them have been rejected. Trying to run a server with extensive CGI scripts is fine for a lightly access web server, even one running as a file download/upload server. CGI script overhead and performance is not suitable for trying to manage processes in a high transaction rate data base server. Note that when the upload CGI script was converted to FastCGI, the upload server went from being a 35 per second upload transaction processor that failed under the stress of a heavy load, to one that adeptly handled 90 or more transactions per second. BOINC in this process did its part well. With some additional performance tweaking of the code and CGI scripts converted to FastCGI, there is no reason that the server could not do 100 transactions per second or more until it becomes limited by the file system it is accessing. The fact that the server failed so badly under load does suggest that there is still some sort of logic problem with its handling of transactions and connections, but as this mornings live test worked so well, it may never show up again. At least during uploads. No bets on what happens when everything else is running on kryten. The only real performance issue here is, will Seti address their architecture implementation problems? Maybe with Computer Engineers as programmers who understand the limitations of hardware, instead of Computer Science majors. At this point one on the outside could start asking questions like: Are slow CGI scripts being used to access the data base? Are slow CGI scripts being used to handle communications between processes? Are slow CGI scripts being used to handle communications between servers? Has a performance analysis been done of the parts of the existing software? Do you know where each of the bottle necks are and a what load they can handle? Do you know what happens when each of the bottle necks are stressed to limits? While using things like FastCGI require extra work for any simple maintenance, it sure does work a lot faster and better in high transaction rate servers. CGI scripts are fine for slow background processes, but not for any of the front end handling of uploads, downloads, data base access and communications with clients. |
![]() Send message Joined: 7 Aug 99 Posts: 13 Credit: 734 RAC: 0 ![]() |
I keep seeing in these forum threads various valid complaints. Here are some reasons off the top of my head that SETI@home Classic is shutting down, and other facts of life that may very well address your particular concern: |
![]() Send message Joined: 7 Aug 99 Posts: 13 Credit: 734 RAC: 0 ![]() |
To simply freeze everyone's credit to the classic SAH project is completely unfair to those who have been diligently processing WU's. Why would it not be possible to take the average credit received after processing say 100 or 1000 BOINC SAH WU's and simply multiply that by the classic number of WU's completed. I started out like everyone else in Classic crunching a single work unit. Over 6 years, I have added computers to my home network solely for the purpose of crunching numbers for SAH and to try to get as high in the ratings as possible. Now, all that work goes by the wayside and I have to restart from zero? If that's going to be the case, I'll be shutting down all computers as of 12/15. If that's all you care about the people who have made this project what it is, I'm finished with any distributed processing project ever offered by Berkeley. Goodbye SAH Classic, and goodbye to BOINC! |
W-K 666 ![]() Send message Joined: 18 May 99 Posts: 19593 Credit: 40,757,560 RAC: 67 ![]() ![]() |
To simply freeze everyone's credit to the classic SAH project is completely unfair to those who have been diligently processing WU's. Why would it not be possible to take the average credit received after processing say 100 or 1000 BOINC SAH WU's and simply multiply that by the classic number of WU's completed. I started out like everyone else in Classic crunching a single work unit. Over 6 years, I have added computers to my home network solely for the purpose of crunching numbers for SAH and to try to get as high in the ratings as possible. Now, all that work goes by the wayside and I have to restart from zero? If that's going to be the case, I'll be shutting down all computers as of 12/15. If that's all you care about the people who have made this project what it is, I'm finished with any distributed processing project ever offered by Berkeley. Anybody who wishes to know what contribution you made to Classic can be easily read by clicking on your name, alongside the post, which also shows your start date. This was way before BOINC started so everybody knows your a long time cruncher. I read; SETI@home member since 7 Aug 1999 Country United States Total credit 709.09 Recent average credit 7.39 SETI@home classic workunits as of 15 March 2005 56,047 SETI@home classic CPU time as of 15 March 2005 278,277 hours Team May the Schwartz be with you The total credits will be updated shortly after classic closes. |
Astro ![]() Send message Joined: 16 Apr 02 Posts: 8026 Credit: 600,015 RAC: 0 |
To simply freeze everyone's credit to the classic SAH project is completely unfair to those who have been diligently processing WU's. Why would it not be possible to take the average credit received after processing say 100 or 1000 BOINC SAH WU's and simply multiply that by the classic number of WU's completed. I started out like everyone else in Classic crunching a single work unit. Over 6 years, I have added computers to my home network solely for the purpose of crunching numbers for SAH and to try to get as high in the ratings as possible. Now, all that work goes by the wayside and I have to restart from zero? If that's going to be the case, I'll be shutting down all computers as of 12/15. If that's all you care about the people who have made this project what it is, I'm finished with any distributed processing project ever offered by Berkeley. There is NO fair way to convert them. I crunched since Jul 6 1999, and felt similar a year and a half ago when I came over here. There are many reasons why it can't be converted. The biggest for me is that Seti under boinc was engineered to help prevent the cheating that occured in classic. I know I wouldn't want a cheater to get the conversion. Also, earlier classic work was completed quicker than the 3.08 app, and how would you adjust for that? Then there's the ever increasing speed of processors. Seti under boinc makes calculations based upon benchmarks and actual CPU time used. THen there is the WUs that contained too much noise, should you get full credit for those? In Boinc the application aborts the noisey wus so you don't waste time crunching them to completion and you get partial credit for them. In Boinc you get credit ONLY for valid work, junk files just don't cut it here. If you must leave then, thanks for your participation to this point. tony ![]() |
Astro ![]() Send message Joined: 16 Apr 02 Posts: 8026 Credit: 600,015 RAC: 0 |
Oh yeah, Look at the bottom line of my Signature for my up to date Classic stats. You're credits aren't lost |
©2025 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.