Message boards :
Number crunching :
Maintence Day
Message board moderation
Author | Message |
---|---|
bill Send message Joined: 16 Jun 99 Posts: 861 Credit: 29,352,955 RAC: 0 |
What would be the downside to moving the maintenance day from Tuesday to Monday? It seems that since the system (more often than not) develops problems over the weekend, why not move up the maintenance and possibly decrease the total amount of down time for the week. It might give one more day for any problems that pop up after the system is brought back online to appear while everybody is still in the office, thereby getting fixed during normal work hours (hopefully). |
Slavac Send message Joined: 27 Apr 11 Posts: 1932 Credit: 17,952,639 RAC: 0 |
|
.clair. Send message Joined: 4 Nov 04 Posts: 1300 Credit: 55,390,408 RAC: 69 |
Some time ago i read that doing it on a tuesday means it avoids public holidays when they happen and keeps the automated bits of the job nice and regular. |
soft^spirit Send message Joined: 18 May 99 Posts: 6497 Credit: 34,134,168 RAC: 0 |
My preference is.. any time they need to, for as long as they need to, just let us know before hand if possible. Janice |
Tom95134 Send message Joined: 27 Nov 01 Posts: 216 Credit: 3,790,200 RAC: 0 |
Starting the Maintenance Cycle on Tuesday also gives the guys a chance to try and figure out what little "gifts" we left in the system over the weekend and work out a sensible plan to try to sort them out during the maintenance cycle. Even if there are no surprises from the weekend it does give the guys a chance to evaluate what went right over the weekend and to do a reasonable plan for the Maintenance Cycle and then getting things running again and the pipeline filled. |
EdwardPF Send message Joined: 26 Jul 99 Posts: 389 Credit: 236,772,605 RAC: 374 |
My 2cents say ... it's not my project it belongs to the boys out west ... I'm just glad they let me play in the sand box!! Ed F |
bill Send message Joined: 16 Jun 99 Posts: 861 Credit: 29,352,955 RAC: 0 |
Ahh, the answer appears to be political. It would interfere with government mandated holidays. Personally, I think it would decrease downtime overall, but, we'll never know. |
rob smith Send message Joined: 7 Mar 03 Posts: 22286 Credit: 416,307,556 RAC: 380 |
Having once worked in server management I'd say Monday is the day that most problems are uncovered, so Tuesday is the day to start fixing them. If hardware is needed it gives you a fighting chance to get it ordered and delivered before you start the sweaty jobs in the server room. And as Slavac says who wants to go into the server room after a nice weekend in the sun? The posted three day maintenance "window of opportunity" is a very good idea, first day do the routine, the rest you either use, or don't use, but your user base is aware there may be other periods of non-availability. (On an 8/24-5/7 operation like S@H, with a complex, and ageing, server collection there is a well known fact of life - Servers will misbehave worst just everyone has had the "third glass of red", so can't drive back to work...) Bob Smith Member of Seti PIPPS (Pluto is a Planet Protest Society) Somewhere in the (un)known Universe? |
Bill Walker Send message Joined: 4 Sep 99 Posts: 3868 Credit: 2,697,267 RAC: 0 |
Ahh, the answer appears to be political. It would I think that is a bit of oversimplification. People expect holidays, mandated or not. You can ask them to give them up, but you will pay a long term price in staff performance, and eventually staff migration. Been there, both as a hard-assed manager and as a fed up employee. And what will we do when there is a Monday holiday? Skip a week's worth of maintenance? Sometimes have maintenance on Tuesdays? Prices to pay for both of those options as well. As often happens in life, you set out looking for a good solution, but may have to settle for the "least bad" solution. |
HAL9000 Send message Joined: 11 Sep 99 Posts: 6534 Credit: 196,805,888 RAC: 57 |
Monday is also often a "recovery day" for any problems that might have occurred over the weekend. I normally end up doing maintenance on our stuff at work on Wednesday or Thursday. SETI@home classic workunits: 93,865 CPU time: 863,447 hours Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[ |
kittyman Send message Joined: 9 Jul 00 Posts: 51469 Credit: 1,018,363,574 RAC: 1,004 |
Monday is also often a "recovery day" for any problems that might have occurred over the weekend. I normally end up doing maintenance on our stuff at work on Wednesday or Thursday. I have often seen a 'quick fix' put into place on Monday to get things running until the full sort on Tuesday. "Freedom is just Chaos, with better lighting." Alan Dean Foster |
PhonAcq Send message Joined: 14 Apr 01 Posts: 1656 Credit: 30,658,217 RAC: 1 |
How about segmenting the system architecture better so that different parts can be "maintained" on different days, deferring complete shutdowns to a more infrequent interval? |
HAL9000 Send message Joined: 11 Sep 99 Posts: 6534 Credit: 196,805,888 RAC: 57 |
How about segmenting the system architecture better so that different parts can be "maintained" on different days, deferring complete shutdowns to a more infrequent interval? I thought the primary reason for the weekly maintenance is to make a backup of the science databases. Which effectively takes everything down. SETI@home classic workunits: 93,865 CPU time: 863,447 hours Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[ |
musicplayer Send message Joined: 17 May 10 Posts: 2430 Credit: 926,046 RAC: 0 |
If I push the Update-button on PrimeGrid, I will be passing 8 million in credit there today. Even so, I was able to upload my results here a little earlier on. It shows that there are people who still believe in this project. Also I received a CUDA-task a little earlier on from someone who must have been a very early morning bird. Not bad! Isn't it really so that when downloads do not work, you need to have a cache? So if uploads break down on either a Saturday or Sunday, you soon run dry. If one or so day is "fix it" day and it works the rest of the week, Tuesdays seem to be the best day for backup of data to me. It really could soon be worse. Uploads are separate from downloads. Downloads are raw-data which are unprocessed and comes from Arecibo. Uploads are the finished tasks which are meant to be stored in the Master Database or Science Database as well. |
kittyman Send message Joined: 9 Jul 00 Posts: 51469 Credit: 1,018,363,574 RAC: 1,004 |
How about segmenting the system architecture better so that different parts can be "maintained" on different days, deferring complete shutdowns to a more infrequent interval? Posted about 4 days ago by Matt in Tech News........ "The most important thing that happens during the outage is that we compress the mysql databases. Since we are inserting/deleting millions of rows per day (all results and workunits) the database pages get ridiculously fragmented really fast, and after about a week can no longer fit in memory. The compression part is what takes about 2-3 hours, and you can't really do much with the database while that happens, which is why we stop the projects. The actual backup part takes about 45 minutes, and could actually happen live if we did it on the replica, but just to be safe we back up the master. We also take care of other odds and ends, like rotating the backend server logs, and replacing broken drives, etc. while everything is quiet." "Freedom is just Chaos, with better lighting." Alan Dean Foster |
HAL9000 Send message Joined: 11 Sep 99 Posts: 6534 Credit: 196,805,888 RAC: 57 |
How about segmenting the system architecture better so that different parts can be "maintained" on different days, deferring complete shutdowns to a more infrequent interval? Right! That's it. Thanks. Like I can remember 4 days ago... :) SETI@home classic workunits: 93,865 CPU time: 863,447 hours Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[ |
justsomeguy Send message Joined: 27 May 99 Posts: 84 Credit: 6,084,595 RAC: 11 |
Hear, hear! Heck all I remember from yesterday is blah blah blah your never listen or something like that anyway... :) "Two things are infinite: The universe and human stupidity; and I'm not sure about the universe." - Albert Einstein |
OzzFan Send message Joined: 9 Apr 02 Posts: 15691 Credit: 84,761,841 RAC: 28 |
If you want to know the findings of any result (which is far off topic for this thread), the Near Time Persistency Checker (NTPCKR, or "nit picker") is supposed to do just that. The problem is that due to SETI's over-worked and over-stressed server infrastructure, the NTPCKR functionality has been temporarily disabled. In the mean time, we're just collecting the results to be later analyzed. |
musicplayer Send message Joined: 17 May 10 Posts: 2430 Credit: 926,046 RAC: 0 |
Is there a difference between some blahh blahh blahh and the results that are being received by you from the users of Seti@home? |
OzzFan Send message Joined: 9 Apr 02 Posts: 15691 Credit: 84,761,841 RAC: 28 |
Is there a difference between some blahh blahh blahh and the results that are being received by you from the users of Seti@home? Being received by me? I am not receiving anything. I'm nowhere near Berkeley, California. I work in IT as a Server Administrator. If you want to talk numbers, create a new thread here in Number Crunching and I'm sure someone who understands the signal analysis would love to speak up and help you. |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.