Magic Dragon Theatre (Dec 21 2007)
Author | Message |
---|---|
Matt Lebofsky Joined: 1 Mar 99 Posts: 1444 Credit: 957,058 RAC: 0 |
Happy Holidays! As a present, thumper (our main science database) crashed for no reason this morning. Not even the service processor was responding. I wasn't planning on coming to the lab today, but here I am. Long story short, Jeff/Bob/I have no idea why it crashed - I found it powered down (but with standby power on). I powered it up with no problem. Some drives are resyncing, but there's no sign that any drives died. In fact, every service on it is coming up just fine, including Informix. There are also no signs of high temperatures or other hardware failures. Well, jeez. While the main disks are syncing up I'll leave the assimilators/splitters off. We may run out of work, but hopefully not for too long. - Matt -- BOINC/SETI@home network/web/science/development person -- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude |
DJStarfox Joined: 23 May 01 Posts: 1066 Credit: 1,226,053 RAC: 2 |
Hmmm... it's worth checking the logs on that server. Something must have shut it down... UPS monitoring, someone hitting the power button, a root login just before shutdown. Was it a clean shutdown? Or did the power fail and cause disk checks upon restart? I don't like it when my servers shut down without my approval. |
kittyman Joined: 9 Jul 00 Posts: 51468 Credit: 1,018,363,574 RAC: 1,004 |
DJStarfox wrote: "Hmmm... it's worth checking the logs on that server. Something must have shut it down... UPS monitoring, someone hitting the power button, a root login just before shutdown. Was it a clean shutdown? Or did the power fail and cause disk checks upon restart? I don't like it when my servers shut down without my approval." The janitor hit the power button with the butt of his broom....... "Freedom is just Chaos, with better lighting." Alan Dean Foster |
Scarecrow Joined: 15 Jul 00 Posts: 4520 Credit: 486,601 RAC: 0 |
kittyman wrote: "The janitor hit the power button with the butt of his broom......." Or, in the case of our shop, you could have left off "of his broom"... (We don't call Don 'old wide load' for nothing, ya know.) And this time of year, mysterious superfluous server reboots are usually caused by hordes of untamed underpants gnomes. |
Dr. C.E.T.I. Joined: 29 Feb 00 Posts: 16019 Credit: 794,685 RAC: 0 |
Thanks for the Postin' Matt - hope ya find the culprit ;) hmmm . . . Todd Rundgren eh - quite the 'eclectic' collector there Matt. Utopia: "Magic Dragon Theatre" - Album: Ra, 1977 |
Richard Haselgrove Joined: 4 Jul 99 Posts: 14649 Credit: 200,643,578 RAC: 874 |
Matt Lebofsky wrote: "We may run out of work, but hopefully not for too long." Pity the next tape in the queue came from an Arecibo ALFALFA observing day. |
Josef W. Segur Joined: 30 Oct 99 Posts: 4504 Credit: 1,414,761 RAC: 0 |
Matt Lebofsky wrote: "We may run out of work, but hopefully not for too long." ALFALFA is drift scan; the 100p2 scan done on Dec. 1 2006 was at 24.333366 degrees declination, so it should be around AR 0.40x. The pity is they only had a 3 hour observation period that day, and so far it appears the other observations were at high slew rates. Joe |
Richard Haselgrove Joined: 4 Jul 99 Posts: 14649 Credit: 200,643,578 RAC: 874 |
Matt Lebofsky wrote: "We may run out of work, but hopefully not for too long." OK, if it wasn't ALFALFA, which of the other projects in the schedule for 01 Dec 06 would involve basketweave recording? I've looked at: A1852 - To continue the monitoring of OH masers in high galactic latitude OH/IR stars (L-wide); A2010 - ALFALFA: The Arecibo Legacy Fast ALFA Survey (ALFA); A2060 - A GALFA Study of the Disk-Halo Interface (ALFA); A2172 - MAPPING HI IN A SPECTACULAR SHELL (ALFA); A2200 - HI Content of Distant Galaxy Clusters (ALFA); A2220 - Interfaces in the ISM: Mapping a Cold Cloud Boundary With GALFA (ALFA). ALFALFA still seems the best candidate. Maybe I'll have to correlate future outbreaks of VHAR WUs and see if I can identify the culprit...... (Edit - if anyone else wants to try and puzzle it out, you can reach that page, and others like it, from the 'Old Schedules' link here.) |
Josef W. Segur Joined: 30 Oct 99 Posts: 4504 Credit: 1,414,761 RAC: 0 |
Matt Lebofsky wrote: "We may run out of work, but hopefully not for too long." Let's see, the workunit header for 01dc06ag.18398.55642.16.6.147 says it was recorded Sat Dec 2 05:47:42 2006 and has a 1.3876 AR. The Arecibo schedules say AST on the left, and Atlantic Standard Time is 4 hours behind UTC, so we need to look at the schedule for Dec. 2 at 01:47. That makes it A2060, I think. WU 01dc06ag.16935.4162.10.6.171 was recorded Sat Dec 2 02:47:46 2006, which converts to Dec. 1 at 22:47 AST; that's A2010 (ALFALFA), and the WU has an AR of 0.408 as expected. I also got some 01dc06af work from the ALFALFA observations with 0.408 AR. I didn't get any from the middle of the 01dc06ag data, but considering how long the slewstorm has lasted I think the A1852 observations were probably high slew also. My guess for the reason there's Dec. 2 data in a chunk labelled 01dc06 is that the hard disk recording was started on the first. Joe |
Richard Haselgrove Joined: 4 Jul 99 Posts: 14649 Credit: 200,643,578 RAC: 874 |
I have to say I'm getting increasingly worried by this ongoing spate of apparent 'high slew' work. Here's a new version of an old graph: [graph: percentage of work received at each AR, 'historical' vs. 'current']. The 'historical' data is what I recorded from the start of MB up to the end of October 2007: it's been posted before, but I've rescaled the graph to show percentages rather than absolute numbers. The 'current' data comes from six of the machines I'm monitoring for Joe's deadline re-estimation. I've only shown data from machines which have a 12-hour turnaround or less at these ARs: the plot is again a percentage, but the number of data points is comparable (3,110 historical: 2,126 current). The significant new feature is the spike at AR=2.47, and a smaller spike between AR=7.5 and AR=8.5. Almost all of the work issued by SETI since about midday 22 December UTC has been at these extraordinarily high slew rates. Joe and I have both looked at the recording schedules for Arecibo for 01Dec06 and 02Dec06, and I've also looked at 01/02Jan07. Neither of us has seen anything unusual in the pattern of observations which would account for these ARs. Which leads me to wonder whether, just possibly, the ARs being inserted into the WU headers by the splitters could be corrupt? |
Brian Silvers Joined: 11 Jun 99 Posts: 1681 Credit: 492,052 RAC: 0 |
Richard Haselgrove wrote: "I have to say I'm getting increasingly worried by this ongoing spate of apparent 'high slew' work." It does have a negative impact on the web server's apparent performance... |
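
For anyone who wants to repeat Joe's schedule lookup earlier in the thread, the UTC-to-AST step can be sketched in Python. This is only an illustration under stated assumptions: it assumes the "recorded" time in the workunit header is UTC (as Joe's arithmetic implies) and that it is formatted like the strings quoted in his post. The function name `header_time_to_ast` is made up for this example and is not part of any SETI@home or BOINC tool.

```python
from datetime import datetime, timedelta, timezone

# Atlantic Standard Time as a fixed UTC-4 offset (Puerto Rico has no daylight saving).
AST = timezone(timedelta(hours=-4), "AST")


def header_time_to_ast(recorded: str) -> datetime:
    """Convert a workunit header timestamp (ASSUMED to be UTC) to AST.

    Hypothetical helper: the input format is taken from the two
    timestamps quoted in the thread, e.g. "Sat Dec 2 05:47:42 2006".
    """
    utc_time = datetime.strptime(recorded, "%a %b %d %H:%M:%S %Y")
    return utc_time.replace(tzinfo=timezone.utc).astimezone(AST)


if __name__ == "__main__":
    # The two recording times mentioned in the thread.
    for stamp in ("Sat Dec 2 05:47:42 2006", "Sat Dec 2 02:47:46 2006"):
        local = header_time_to_ast(stamp)
        print(stamp, "UTC ->", local.strftime("%b %d %H:%M"), "AST")
```

The converted date and time are what you would then look up in the left-hand (AST) column of the Arecibo schedule page.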