Message boards :
Technical News :
Out of the Frying Pan (Feb 17 2010)
Message board moderation
Previous · 1 · 2 · 3 · 4 · 5 . . . 6 · Next
Author | Message |
---|---|
1mp0£173 Send message Joined: 3 Apr 99 Posts: 8423 Credit: 356,897 RAC: 0 |
Nice to hear everything is almost back to normal. Unfortunate that alot of work units were aborted while trying to upload them as their deadline had passed during the downtime. A have a feeling more will be aborted as they are still unable to be uploaded.. You should always let those ride -- you likely would still get credit. |
Jord Send message Joined: 9 Jun 99 Posts: 15184 Credit: 4,362,181 RAC: 3 |
The A/C died and it's too hot? It's winter, it's 25 degrees and snowing...open the windows. That'll cool you off. And assuming you want moist sea-air to enter your server room, wreaking havoc with all the electrics in there. ;-) |
Dena Wiltsie Send message Joined: 19 Apr 01 Posts: 1628 Credit: 24,230,968 RAC: 26 |
For a while my job depended on a system cooled by an air conditioner that I could not depend on. My solution was to get one of these and wire it into an extension cord so I could connect all the non-replaceable equipment to it. I then set it to about 80 F and had no worries about failed hardware. The catch is you must make sure your backups are up to date as the power down will be very hard and in my case the raid lost a drive often when it was powered down (very old drives). The P390 came with a latching power switch. The software was unable to cut the power and the only way the power could be turned off was to push the button or pull the power cord. I don't think Warp has power support in it and even if it did, VM/ESA didn't have that type of support in a P390. Running on real hardware VM/ESA might but the P390 was a strange animal for IBM. My job wasn't to spend a few month getting what you suggest to work, I needed something quick and dirty to protect the hardware because we couldn't afford to replace it. I have what you suggest all set up and functional on my MAC but the P390 is about 15 year old hardware pressed into service long after IBM considered it obsolete. |
Nate Itkin Send message Joined: 29 Jun 99 Posts: 4 Credit: 1,804,607 RAC: 3 |
I concur with Mr. Haselgrove. Something was wrong with the scheduler before the Tuesday shutdown. My crunchers (located in Texas, California, and Hawaii) all had entries like this in their logs: 15-Feb-2010 22:07:14 [SETI@home] Scheduler request failed: Timeout was reached This particular entry was GMT -10. |
KWSN THE Holy Hand Grenade! Send message Joined: 20 Dec 05 Posts: 3187 Credit: 57,163,290 RAC: 0 |
The smart-ass in me made me write this..... ...You forget that the project is in Berkeley... where (during the day at this time of year...) it is about 60-65ºF and only goes down to 50-55ºF at night... NTM that yesterday morning, and this morning, there was a heavy fog (at least in my location, 2½ miles away...) Besides, Matt always refers to it as the "server closet", which implies that it doesn't have a window... (I think I've read that it is a re-purposed janitor closet...) . Hello, from Albany, CA!... |
Gary Charpentier Send message Joined: 25 Dec 00 Posts: 30608 Credit: 53,134,872 RAC: 32 |
So how much is a second A/C unit installed? Perhaps time to add up the thermal load and retire some hot equipment for some cooler equipment. Yes, you need to get thermal cut out switches. As you have UPC's, that makes it much easier for a controlled shutdown. Now if you could automate the door opening and a couple of big fans coming on ... |
Cosmic_Ocean Send message Joined: 23 Dec 00 Posts: 3027 Credit: 13,516,867 RAC: 13 |
So how much is a second A/C unit installed? I don't think a second A/C system would be ideal. From what I remember hearing, power distribution/availability is already pretty much at maximum capacity as it is. Every time a new server is installed in the closet, it means one or two old ones being re-purposed elsewhere. Last I knew, there was still plenty of rack space, but it's a problem of power availability. Linux laptop: record uptime: 1511d 20h 19m (ended due to the power brick giving-up) |
kittyman Send message Joined: 9 Jul 00 Posts: 51468 Credit: 1,018,363,574 RAC: 1,004 |
So how much is a second A/C unit installed? If I remember correctly, the AC and electricity is part of what Berkeley supplies out of the 'cut' they take from donations...... So I don't think this cuts into the puny Seti budget. I don't think power availability ever came into the equation. "Freedom is just Chaos, with better lighting." Alan Dean Foster |
Ronal E. Zepeda Trujillo Send message Joined: 14 Jul 05 Posts: 9 Credit: 3,167,018 RAC: 0 |
At least, was a little disaster... could it be worst... Only a boy with responsabilities of an old man |
Bounce Send message Joined: 3 Apr 99 Posts: 66 Credit: 5,604,569 RAC: 0 |
>I started a chkdsk over 10 hours ago and it's less than halfway through! Try SpinRite (http://www.grc.com - just a satisfied customer). Much better at recovering data and grooming a hdd than what M$ includes for free. >So how much is a second A/C unit installed? At my last agency, a sub-unit (which was required due to how the building did its HVAC) was $10,000.00. These folks are begging for second hand servers to do their projects. I suspect that a real budget item like that is considered a little spendy. Even if UCB is taking a cut for basic facility management, extras like this are often done on the customer's dime. |
RottenMutt Send message Joined: 15 Mar 01 Posts: 1011 Credit: 230,314,058 RAC: 0 |
SETI STAF, WE HAVE BEEN DOWN SINCE SUNDAY! there has been no acknowledgment in the postings, other then BBQ servers. please fix the problem thank you |
FrostKing9 Send message Joined: 20 Oct 01 Posts: 39 Credit: 23,815,960 RAC: 0 |
Yep... the upload and report process is still malfunctioning. Can barely upload completed WU's... only by repeatedly hitting the RETRY NOW on the TRANSFERS window. Then it only UPLOADS from 1 to 3 WU's at a time. And reporting all of those WU's isn't working at all. Not even after over 100-clicks over 8-hours on the UPDATE button on the PROJECTS window. <sigh> I DONATE money to SETI@home.... DO YOU? I'm just slowly BOINC'ing along. Hey... ET... you have a sister who likes earthlings? |
Dave Send message Joined: 29 Mar 02 Posts: 778 Credit: 25,001,396 RAC: 0 |
Patience people... |
DJStarfox Send message Joined: 23 May 01 Posts: 1066 Credit: 1,226,053 RAC: 2 |
Matt, That is insane. You urgently need some kind of automated thermal shutdown or emergency ventilation for that closet. The Linux kernel will shutdown the system when the CPU overheats but not hard drives or other components. If there were to be some kind of fire or failure of most drives, the next failure could mean the end of SETI@Home. My brother configured a monitoring program called Nagios to sense his data center's temperature and email his cell phone above a certain temp. If you're interested, I could get more implementation details. |
Marc F. Send message Joined: 7 Apr 05 Posts: 4 Credit: 3,613,183 RAC: 0 |
Patience people... I agree -- when looking back at Matt's original update ("Off the Beach") after returning from vacation, I was reminded that he does acknowledge that there were some problems even before the A/C failure (e.g. the uploading issues we've all been facing). So there's no need to get riled up about that right now. The way I see it, I'm going to give SETI@home a full week to get back to normal before any of us is really justified in panicking. Actually, come to think of it, we might all do well to heed the wisdom of a certain "Guide" that proclaims in large, friendly letters: DON'T PANIC! By the way, SETI@home staff: I really like the plan to have SETI@home and Astropulse on separate servers. "That's no moon. It's a space station." -Obi-Wan Kenobi ...If there's a Galactic Empire out there with a Death Star that's about to destroy us all, SETI will find it. |
1mp0£173 Send message Joined: 3 Apr 99 Posts: 8423 Credit: 356,897 RAC: 0 |
So how much is a second A/C unit installed? .... and there is the issue of where do you dump the heat? A/C doesn't make cold, it absorbs heat on the cold side and dumps it into a heatsink someplace else. The easiest type of installation would be a "ductless split" but you still have to route some refrigerant tubing between the two units, and there is a distance limit. Campus provides the A/C, so they probably either take what Campus provides, or pay for the installation, and like the gigibit fiber up the hill, SETI@Home is perenially short on cash. Load shedding (automatically powering down the servers) based on temperature is probably more practical. |
1mp0£173 Send message Joined: 3 Apr 99 Posts: 8423 Credit: 356,897 RAC: 0 |
I don't think power availability ever came into the equation. Matt has said that there is a finite amount of power delivered to the closet. I don't know if the issue is the cost of a new branch circuit, or if there is some rule saying these closets come with a certain sized branch circuit..... ... but obviously, if they could pump more energy into the closet, at some point it'd be a fire hazard. |
1mp0£173 Send message Joined: 3 Apr 99 Posts: 8423 Credit: 356,897 RAC: 0 |
The A/C died and it's too hot? It's winter, it's 25 degrees and snowing...open the windows. That'll cool you off. Campus is much farther from the ocean than my server room, which is kept cool by keeping the windows open. This is much less expensive (and much greener) than A/C. |
Peter Moss Send message Joined: 15 Nov 99 Posts: 14 Credit: 3,434,017 RAC: 12 |
I have almost 50 stuck items with - Upload Pending. 18/02/2010 18:19:20 SETI@home Reporting 1 completed tasks, not requesting new tasks 18/02/2010 18:19:42 Project communication failed: attempting access to reference site 18/02/2010 18:19:43 Internet access OK - project servers may be temporarily down. These are UK times... Will they clear soon?? |
Rick Send message Joined: 3 Dec 99 Posts: 79 Credit: 11,486,227 RAC: 0 |
I have almost 50 stuck items with - Upload Pending. Hard to say. It could be soon or it could be a day or so. One of my systems got lucky about 30 minutes ago and was able to download a few tasks. My other system is still waiting for tasks. Best thing to do is just leave it alone and eventually things will get back to normal. |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.