Message boards :
News :
Continued server problems.
Message board moderation
Author | Message |
---|---|
Eric Korpela Send message Joined: 3 Apr 99 Posts: 1382 Credit: 54,506,847 RAC: 60 |
We're continuing to have issues due to a database problem early last week and a botched attempt to fix it. The problem is that the result and host tables in the database have grown large enough, and hosts have gotten fast enough that the lookup of result in process for a host and the enumeration of new results to send don't finish before the web connection times out either on the server or the client side. This resulted in hosts being assigned large number or results to compute without the transaction that tells them about these results being completed. The host. think it received no results would then contact the server for more results, which it would again not receive. This isn't a hardware problem. The database currently fits in memory and the processors are fast. We've just crossed a threshold where each host computes fast enough that host queues and the result table have become large enough to cause this problem. To solve it, we've put per host limits on results in process back in place. But hosts that are having this problem will probably continue to have it until the average number of results per host has fallen to a workable level. That could take weeks. For a more permanent fix, we plan do more work in each result by quadrupling the size of the workunits. But that fix will probably take months to implement and test. @SETIEric@qoto.org (Mastodon) |
W-K 666 Send message Joined: 18 May 99 Posts: 19401 Credit: 40,757,560 RAC: 67 |
Thanks for the news, we will struggle through. AS this problem only seems to occur when AP tasks are available, might it be possible to balance the rates of splitting. At the moment the AP tasks are being split much faster than the normal MB tasks. |
Dirk Sadowski Send message Joined: 6 Apr 07 Posts: 7105 Credit: 147,663,825 RAC: 5 |
Eric, thanks for the news! By the way .. In the *panic thread* in the NC subforum, some members report that they have *better* server contact via PROXY usage. Maybe a S@h router need a reboot? * Best regards! :-) * Sutaru Tsureku, team seti.international founder. * Optimize your PC for higher RAC. * SETI@home needs your help. * |
tbret Send message Joined: 28 May 99 Posts: 3380 Credit: 296,162,071 RAC: 40 |
Thanks Eric. Knowing that you are looking into it, on a Sunday night no less, is gratifying. |
KWSN THE Holy Hand Grenade! Send message Joined: 20 Dec 05 Posts: 3187 Credit: 57,163,290 RAC: 0 |
Does that explain why the Schedulers are currently (2115, 9/18/12 Berkeley time) down? . Hello, from Albany, CA!... |
Astro-AL Send message Joined: 31 Mar 00 Posts: 18 Credit: 95,868,034 RAC: 80 |
I have 3 machines with more than 600 completed WU's that I can't report. Does this latest problem mean that these finished WU's will be considered as expired since they will not accept them at the server by expected due date? Should I just turn off my machines that have completed WU and wait for news as to when they will be accepted? I am wasting a lot of CPU power and electric trying to get more work and report finished WU's. I have been solely interested in ET research since 1999 and do not wish to do other Boinc projects. I wish there was more explanations, like this latest post. If there were, it would take a lot of the frustrations out, and crunchers would stop B*tch**g so much. I rarely have anything to say but I read the blogs. I would like to have a more firm date as to when the project is expected to run as efficiently as it did before Boinc software. |
Vicki Send message Joined: 30 Nov 01 Posts: 65 Credit: 1,640,576 RAC: 46 |
Ah! so that's whats going on. Yesterday after reinstalling bonic, it took me 3 hrs to get any work units to download, the last of which completed downloading almost 24 hrs later... At the moment my laptop has 5 results ready to report & my desktop has a few waiting as well. Looking forward to testing chromes remote desktop while on holiday next week to check on my desktop's Bonic progress. A city destroyed by an earthquake is an opportunity to Rebuild, redeign & make it a better place to be. Better, stronger, faster like the 6 Million Dollar Man |
Sp@ceNv@der Send message Joined: 10 Jul 05 Posts: 41 Credit: 117,366,167 RAC: 152 |
THX Eric, for your time to post & your time that will bring a fix eventually. Using proxies doesn't change a lot over here either, the two main crunchers have run dry, one will save on electricity, the best one has been unleashed upon WCG, a very fine project also. I'll leave S@H on autopilot for now, checking the homepage of the project and the messageboards once a day will do for now. Crunching is a passtime, not a basic necessity. Kind Belgian Regards ;) To boldly crunch ... |
Thomas Send message Joined: 9 Dec 11 Posts: 1499 Credit: 1,345,576 RAC: 0 |
Thanks Eric for this news Good luck to all the technical staff of Berkeley from the french team "BRIGADE DU COSMOS" Hopefully there will not be too disgruntled We must never forget that the SETI@home is a scientific non-profit project So be indulgent with this kind of technical risks Hope everything returns to normal fairly quickly |
Draconian Send message Joined: 16 Mar 03 Posts: 21 Credit: 1,809,058 RAC: 0 |
Confirms a theory I had as to why proxy servers work - they send the data stream slower as they are sending to multiple other systems as well. Should be able to configure my router QOS to send, say, 30K / sec max for seti program. For now at least though, finally found a good proxy. Good luck with the fix - you are burdened by your success! |
Bernie Vine Send message Joined: 26 May 99 Posts: 9958 Credit: 103,452,613 RAC: 328 |
Thank you Eric. Can we all now just wait and not start filling these boards with "When is this going to be fixed threads" If it takes months it takes months I for one will be happy to wait. |
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13854 Credit: 208,696,464 RAC: 304 |
We're continuing to have issues due to a database problem early last week and a botched attempt to fix it. My concern is that the Scheduler timeouts started 4 weeks ago, and became a major issue about 3 weeks ago. And that if you can find a good proxy, you don't get Scheduler timeouts. Grant Darwin NT |
.clair. Send message Joined: 4 Nov 04 Posts: 1300 Credit: 55,390,408 RAC: 69 |
Can we have a new switch in `SETI@home preferences` for - VLAR on GPU. In the `Run only the selected applications` section, that would overide the default seting and so reduce the calls to the servers, for those like me that can crunch them on ATI (or any other GPU that can handle them) |
Michael W.F. Miles Send message Joined: 24 Mar 07 Posts: 268 Credit: 34,410,870 RAC: 0 |
So we are going to be having this same problem for months eh!!!. Wow, at least we have an explanation but I am not sure I like the answers. Couple months of this an I will be ready for the hospital... LOONEY BIN Michael Miles |
Tcarey Send message Joined: 20 Aug 99 Posts: 30 Credit: 70,655,757 RAC: 24 |
Eric, Thanks for the update and explanation of the problem. Understanding the issues reduces my frustration considerably. I do hope that it won't be months before my fast machine gets any more work units. |
dancer42 Send message Joined: 2 Jun 02 Posts: 455 Credit: 2,422,890 RAC: 1 |
So we are going to be having this same problem for months eh!!!. The bottom line is while we in our tens of thousands have built a new system gotten a new video card or just found time to tweak our systems,seti can not afford to hire a full time programmer to fix things and for the most part are running the same equipment they had last year. The new equipment is not nearly enough and no amount of tweaking is going to fix it for long. With green bank coming on line soon it will become thousands of times more likely for seti to find a signal. Yet the funding through donations and endowments wouldn't pay the salary's for a good sized McDonald's. For toughs complaining donate a dollar a month cheap to support any hobby,the minimum donation at the seti sight is $10 save up if you have to, it is going to stay broke until seti can catch up with new equipment. |
Ronald R CODNEY Send message Joined: 19 Nov 11 Posts: 87 Credit: 420,920 RAC: 0 |
I agree with Dancer. If you havent put up, then hush up and wait. Eric and the rest of the frustrated Berkeley Vols: Thanks for the info and praying your expertise wins out. |
S@NL Etienne Dokkum Send message Joined: 11 Jun 99 Posts: 212 Credit: 43,822,095 RAC: 0 |
So we are going to be having this same problem for months eh!!!. If you want to help Seti along for the time being why don't you run Beta ? You'll still be crunching signals and it's for the good of the future of this system... I do the same, it's no use complaining about things out of our (and the boys at the lab) control. On topic : thanks Eric for the info, we'll wait it out and see what happens. Set my rigs to NNT and if it reports and I see the servers up I will surely be back ! |
Razorface Send message Joined: 6 Aug 01 Posts: 16 Credit: 217,293,419 RAC: 0 |
[/quote]If you want to help Seti along for the time being why don't you run Beta ? You'll still be crunching signals and it's for the good of the future of this system... Where does one find this Beta? |
John Neale Send message Joined: 16 Mar 00 Posts: 634 Credit: 7,246,513 RAC: 9 |
Where does one find this Beta? Here: SETI@home/AstroPulse Beta You'll have to add this project using BOINC Manager. <Tools> <Add project or account manager...> |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.