Preventive maintenance - how about that? |
Message boards : Number crunching : Preventive maintenance - how about that?
| Author | Message |
|---|---|
|
What I've seen during the last couple of years with S@H was always the same pattern: | |
| ID: 1076101 · | |
|
You do understand that the Tuesday outages are for that so-called routine maintenance. | |
| ID: 1076113 · | |
"you do understand that the Tuesday outages are for that so-called routine maintenance." Did you actually read my posting? I am talking about being one step ahead - not behind all the time. According to Wikipedia, preventive maintenance (PM) has the following meanings: 1) The care and servicing by personnel for the purpose of maintaining equipment and facilities in satisfactory operating condition by providing for systematic inspection, detection, and correction of incipient failures either before they occur or before they develop into major defects. 2) Maintenance, including tests, measurements, adjustments, and parts replacement, performed specifically to prevent faults from occurring. ____________ Petition against 1366x768 glare displays: http://www.facebook.com/home.php?sk=group_153240404724993 | |
| ID: 1076154 · | |
|
So what you're saying is that you have a crystal ball that will tell everyone what server will crash before it happens and how to fix it before it breaks? | |
| ID: 1076238 · | |
|
redundancy is the only option..... | |
| ID: 1076248 · | |
|
Thumper has received a great deal of PM. This was overdue, but it required benching it for an extended period, and time to work on it. Maybe I am a bit old-fashioned, but most system admins I know like to be paid for their work. | |
| ID: 1076249 · | |
"So what you're saying is that you have a crystal ball that will tell everyone what server will crash before it happens and how to fix it before it breaks?" The usual way to look at it is by the oldest machine or component. Older ones are usually more likely to fail than newer ones. Mechanical devices are always going to wear out, so hard disks are a likely case where the oldest are the ones you'd start with. They may also have some of a particular brand that are more prone to failure than the others. You could do a similar thing by looking at the oldest server in the closet and starting with it. Ideally they should all get replaced over some time-frame (5 years, 10 years or whatever). ____________ BOINC blog | |
| ID: 1076251 · | |
"you do understand that the Tuesday outages are for that so-called routine maintenance." Yep, I read your post. I assume you have a crystal ball you can look at to determine what parts are going bad on a system. I'd like you to look at my systems and tell me which parts I should buy to prevent them from dying. Maintenance is just that. It's looking at your stuff, keeping it clean, etc. Neither you nor anyone else can prevent a mainboard from dying or the OS from crashing unexpectedly. The admins, with the exception of the 2 very new servers, work with aging and sometimes obsolete equipment. Be grateful more things haven't gone down. ____________ Proud member of TSWB. End terrorism by building a school | |
| ID: 1076253 · | |
"you do understand that the Tuesday outages are for that so-called routine maintenance." Yep, dust and cat (dog?) hair does build up in PCs. When it does, I do as much as I can to clean it out. Heck, I even deploy filters on all my fan intakes that I exchange and then wash whenever they get dirty. ____________ BSG Anthem My Facebook page | |
| ID: 1076257 · | |
|
Google's "Failure Trends in a Large Disk Drive Population" PDF is well worth rereading when thinking about drive replacements as PM. They do indicate that a couple of the SMART measurements may actually be useful, for instance. Joe | |
| ID: 1076291 · | |
"redundancy is the only option....." Yeah, I agree, they should have 2 of everything. When one server fails they take it offline and put in the backup server. That way we are always online. | |
| ID: 1076342 · | |
"redundancy is the only option....." Well, I'm sure that no one will complain if you supply those extra servers. ;) Cheers. ____________ | |
| ID: 1076347 · | |
|
It seems to me that the crew at SETI do the best they can with what resources they have. Much of the work they are doing now seems tedious, and I am sure they wish for fewer problems like the rest of us do. Throughout the boards it seems many people are offering to help with funds for new equipment, but no one seems to have a good grasp of what is needed, or the costs associated with that. It would be great if someone at SETI could spend a little time (once they are able to solve the latest problems, of course) and put together their needs, or wishes for new equipment, and a best guess at the costs for each system. It is a lot easier to work toward a goal if we know what that goal might be. Just before I joined the GPU Users group they were able to put together the funds for a new unit in a short period of time, and from a relatively small group of people. This is something I would be willing to put more funds into, knowing the goal, than a general donation to SETI. I have read a few threads where others have suggested this same idea. I am newer to the boards, and am not sure of the best way to reach the staff at SETI, but I think there is plenty of interest here to help them with this, and ourselves in turn. | |
| ID: 1076356 · | |
|
Maybe check KIJIJI.COM for servers. I found 2 there that I have purchased. Brand new is not always needed, but appreciated. I'm sure that, since this is a university, they can find the space and the resources to help this project. | |
| ID: 1076364 · | |
|
http://toronto.kijiji.ca/c-buy-and-sell-computers-Twin-dual-core-Opteron-server-computer-6GB-RAM-1TB-HDD-W0QQAdIdZ259683269 | |
| ID: 1076368 · | |
|
There are lots of different methods to the madness when it comes to IT. In general, extra power supplies, extra fans, extra hard drives, and a few extra video cards are always lying around, because those are what go bad, sometimes without a hint. That Google PDF is a study of 100,000 hard disks of various sizes, manufacturers, and models. While they do say that certain SMART data does point at a possible failing drive, it also goes on to say that SMART-based models are not accurate enough to be used to assert the reliability of an individual drive. They also comment on how drives died without any SMART errors. | |
| ID: 1076400 · | |
|
Any talk of redundancy and backups in IT always comes down to $$$. | |
| ID: 1076414 · | |
|
If I knew what parts were needed I would help. But like others, I would like some clarity as to what they need. Yes, extra servers would be nice, but not practical. How about finding new CPUs for the servers they already have? From looking at the server stats, most of them look to be single or dual core servers. I mean, you can get quads for 300 US dollars nowadays. | |
| ID: 1076432 · | |
|
| |
| ID: 1076437 · | |
|
I agree with Frizz. | |
| ID: 1076450 · | |
| Copyright © 2013 University of California |