Weekend Update (Mar 20 2015)
Author | Message |
---|---|
Matt Lebofsky Joined: 1 Mar 99 Posts: 1444 Credit: 957,058 RAC: 0 |
Another weekend approaches. Perfect time for an update. Master science database (informix/paddym): Due to transient disk issues last weekend the database crashed, but we quickly recovered. However, this caused a big assimilator queue backlog, which we are just now about to clear out. I'll wait till Monday before I start up the result table merges (we hit extent limits on the old table, so I'm shoveling all those into a new larger table with more extents). Random crashes: yesterday one of our file servers choked up for no obvious reason (it's a Lustre system, on which I'm not an expert, so nothing that happens on that is obvious to me). I don't think this had any public effects, but it was hanging a bunch of our servers up a bit. I actually spent all morning getting that in working order again. This morning synergy (the scheduling server) had some automounter freakout so I just rebooted it to clear some pipes. All is well now. Oh yeah, the results-to-send queue got kinda low as part of that synergy freakout, but also due to hitting a bunch of tapes with data the splitter is deeming unworthy of workunit creation. So some CPU and I/O time is being wasted as it goes through those. It's best to let this just push through on its own. I did just add a bunch more files to the blanking/splitter queue for processing over the weekend (they'll show up within the next 24 hours or so). Unless there's weirdness before the end of the day I won't do anything crazy that'll mess things up for the weekend. I've been otherwise working on some GBT (Green Bank Telescope) code in advance of getting SERENDIP VI hardware working there. - Matt -- BOINC/SETI@home network/web/science/development person -- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude |
Gary Charpentier Joined: 25 Dec 00 Posts: 31080 Credit: 53,134,872 RAC: 32 |
Updates are always appreciated. Thanks |
Mike Joined: 17 Feb 01 Posts: 34414 Credit: 79,922,639 RAC: 80 |
Thanks Matt for the update. With each crime and every kindness we birth our future. |
kittyman Joined: 9 Jul 00 Posts: 51495 Credit: 1,018,363,574 RAC: 1,004 |
Thank you for the much appreciated updates, Matt. It means a lot to so many to get your input from the lab. It really does. Meow! "Time is simply the mechanism that keeps everything from happening all at once." |
Cheopis Joined: 17 Sep 00 Posts: 156 Credit: 18,451,329 RAC: 0 |
Matt, You mentioned the other day that one of the power sources in the racking that the servers were located in was the actual cause of what you thought were power supply failures on a couple of machines. Where there is one power system failure, there are frequently others. Have you had the rest of your power sources (not power supplies, sources) tested out? Power fluctuations could be causing all sorts of weird issues. I don't think it would be too much to ask that the datacenter check the others, since there was already one problem found. Is there anything connected to power which then connects to the servers, but has no power-monitoring capacity? Perhaps an old router or switch or even a monitor? The fact that you had at least one bad power source makes me a bit suspicious that all the weird issues recently may have something to do with power. |
Uli Joined: 6 Feb 00 Posts: 10923 Credit: 5,996,015 RAC: 1 |
Thank you for the update, Matt. Pluto will always be a planet to me. Seti Ambassador Not too late to order an Anni Shirt |
Mithotar Joined: 11 Apr 01 Posts: 88 Credit: 66,037,385 RAC: 50 |
I'll add my thanks to this chain... it doesn't take a lot - even a once-a-week note like this would do wonders to keep the natives less restless. |
©2025 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.