Message boards :
Technical News :
Get Out of My House (Jan 18 2011)
Message board moderation
Previous · 1 · 2
Author | Message |
---|---|
morpheus Send message Joined: 5 Jun 99 Posts: 71 Credit: 52,480,762 RAC: 33 |
Thanks for the update, Matt. And good luck with Bruno. .:morpheus:. |
DJStarfox Send message Joined: 23 May 01 Posts: 1066 Credit: 1,226,053 RAC: 2 |
...we'll still be here looking forward to business as normal. :-) This is business as usual/normal. LOL |
Westsail and *Pyxey* Send message Joined: 26 Jul 99 Posts: 338 Credit: 20,544,999 RAC: 0 |
Good luck!!! Whether with Synergy or another machine, hope things go relative smoothly. I have a question...Is it possible to separate uploads/downloads? If the uploads could also be routed through the new lab Gbit line. This could leave the Hurricane Elec. line purely for downloads... ?? "The most exciting phrase to hear in science, the one that heralds new discoveries, is not Eureka! (I found it!) but rather, 'hmm... that's funny...'" -- Isaac Asimov |
Donald L. Johnson Send message Joined: 5 Aug 02 Posts: 8240 Credit: 14,654,533 RAC: 20 |
Sounds like Synergy got there not a moment too soon!) Or bruno, knowing synergy was on the way, held out as long as he could before crashing. Either way, bruno has been a good soldier, hope you can get him back in business again. Donald Infernal Optimist / Submariner, retired |
Saaby900T Send message Joined: 24 Dec 10 Posts: 76 Credit: 4,971,171 RAC: 0 |
When is looking like this is going to get back online? |
Geoff Gong Send message Joined: 11 Dec 99 Posts: 53 Credit: 1,543,379 RAC: 0 |
Hi Server status shows ONLY AP Splitters Lando and Vader not running ,both are doing other tasks Is the Server Status page affected ? |
edwartr Send message Joined: 2 May 00 Posts: 31 Credit: 79,402,615 RAC: 14 |
Make sure and check the date/time on the Server Status page: [As of 18 Jan 2011 17:10:05 UTC] It is 20 Jan 2011 06:11 UTC as I am posting this. I gotta fever and the only prescription is more cowbell. |
Adam Weichel Send message Joined: 30 Jul 02 Posts: 22 Credit: 25,877,509 RAC: 46 |
It's good to hear that everything's correctable, Matt. Is there an updated hardware requirement list that's available? Looking to donate some more parts in the spring. :) Computer nut, Distributed Computing freak, Jeeper and Dodge Ram driver. Life is worth living... and worth discovering. I run VMWare ESXi Free - why don't you? |
Jaye Ellen Send message Joined: 29 Nov 08 Posts: 26 Credit: 20,945,032 RAC: 45 |
Keep up the excellent work, Matt and let's try to revive Bruno before his untimely demise ??? All in fun though, I was just wondering why my uploads were just sitting here and now I know, and can stop worrying ... Jaye Ellen |
Todd Hebert Send message Joined: 16 Jun 00 Posts: 648 Credit: 228,292,957 RAC: 0 |
At least when Synergy (New Bruno) comes up it will be a true test to see if it can handle the load of the project with everyone uploading their completed WU's. I have over 4k to report alone. Todd |
ralphw Send message Joined: 7 May 99 Posts: 78 Credit: 18,032,718 RAC: 38 |
Bruno hardware failure - this suggests bruno (the upload server) is a single point of failure, is it feasible to have two systems performing the same function here? Perhaps bruno should have a companion, borat (I'm probably thinking of the wrong bruno here.) |
Gary Charpentier Send message Joined: 25 Dec 00 Posts: 30980 Credit: 53,134,872 RAC: 32 |
Bruno hardware failure - this suggests bruno (the upload server) is a single point of failure, is it feasible to have two systems performing the same function here? Everything on the BOINC side of the house is a single point of failure. The only system that has a hot backup is the science database. |
1mp0£173 Send message Joined: 3 Apr 99 Posts: 8423 Credit: 356,897 RAC: 0 |
Bruno hardware failure - this suggests bruno (the upload server) is a single point of failure, is it feasible to have two systems performing the same function here? BOINC is supposed to make it possible to do big science on a vanishingly small budget. That means redundant servers are often out of the question. So instead of redundant servers so that something can always take work, we have a client that handles outages gracefully. It has to be that way because the standard solution (throw money at the problem) is not available to BOINC projects. |
Todd Hebert Send message Joined: 16 Jun 00 Posts: 648 Credit: 228,292,957 RAC: 0 |
I would disagree with the statement that we can make do without redundant servers for this project. Considering the number of hosts are now over 2 million for just this project that is an incredible amount of computing power and really was the spirit of the architectural design. The work still needs to go somewhere. I don't think some people here realize the scope of the data returned and the impact of the loss of storage devices. To serve the project as has been demanded by the users, some are very vocal, there does need to be many factors considered. It isn't about throwing money at a problem in hopes of a resolution. Things break and need to be replaced over time - requiring money or donations to achieve the goal. Not much different that expecting to drive your new car with the same tires for 150k miles or not getting an oil change. When your dataset increases, your load and time between failures also increases. When trying to make do with piecemeal equipment it can be very challenging to make a go of it. Todd Bruno hardware failure - this suggests bruno (the upload server) is a single point of failure, is it feasible to have two systems performing the same function here? |
kittyman Send message Joined: 9 Jul 00 Posts: 51477 Credit: 1,018,363,574 RAC: 1,004 |
Aye, Capn' Todd.... I face challenges just keeping 8 crunching rigs online some days. Power supplies age, motherboard components age, things change. Adjustments to settings are required. Of course, the kitties push the rigs pretty hard, so any change in tolerances can sometimes throw things outta whack. I suspect that Eric, Matt, and crew sleep much better recently with the new servers at work..... I know that even with the latest short outage, we are enjoying the longest streak of uptime in Seti history for many moons. You and all other contributors have done well. "Time is simply the mechanism that keeps everything from happening all at once." |
John McLeod VII Send message Joined: 15 Jul 99 Posts: 24806 Credit: 790,712 RAC: 0 |
I would disagree with the statement that we can make do without redundant servers for this project. Considering the number of hosts are now over 2 million for just this project that is an incredible amount of computing power and really was the spirit of the architectural design. The work still needs to go somewhere. They do have some redundancy. The DB is mirrored in real time. They have raid for their drive arrays. But having completely redundant servers is too expensive. BOINC WIKI |
KWSN THE Holy Hand Grenade! Send message Joined: 20 Dec 05 Posts: 3187 Credit: 57,163,290 RAC: 0 |
[snip] It's been my experience that when the dataset increases, MTBF (Mean Time Between Failures) decreases... and your load increases by a factor of two or more (for a dataset double, your load quadruples... not saying that the increase is always exponential, though...) . Hello, from Albany, CA!... |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.