| Author |
Message |
Matt LebofskyVolunteer moderator Project administrator Project developer Project scientist
 Send message
Joined: 1 Mar 99 Posts: 1375 Credit: 74,079 RAC: 0

|
|
Mmm-kay. So where are we at with the science database...? The morning today was much like yesterday: me, Eric, and Jeff shouting over the deafening noise of the server closet, taking turns hunched over a monitor attached directly to thumper (the kvm monitor was having separate issues). Lots of reboots and unexpected (and unpleasant) results. Lots of thinking we found the problem only to reboot and (five minutes later) finding we were wrong, then having to reboot again off of DVD (taking another five minutes).
Basically our discussions were along the lines of: Why does the boot metadevice disappear when booting off of DVD? And why does the root metadevice disappear when coming up via grub? Didn't we resync these two drives yesterday? Oh look - the grub device map is referring to /dev/sdm, which was how the root drive was ennumerated when there were only 24 drives in the system - it should be referring to /dev/sdy now that we have 48 - so this must be at least one of our problems! Nope. Changing that did nothing. Etc. etc. etc. etc.
Well, whatever. It's been a two-day-long game like a demented version Towers-of-Hanoi - swapping drives, installing/reinstalling grub, resyncing devices, reconfiguring mdadm, then going back to step one and trying a different permutation. On hindsight it probably would have been easier to just install a new OS from scratch (though we would have had to recreate a web of informix configuration which also exists on the root drives). Right now the system is actually up (finally) and resyncing one mirror (again) and will have to sync another once that's finished. So we're offline for another day, and we haven't even gotten to the pulse table problems yet. I will stil try to get Astropulse running in some form later on today/tonight.
Funny thing: Oliver and Bernd of Einstein@home have been visiting from Germany, collaborating with Dave on some general BOINC stuff. They left just a couple hours ago, but we did discuss how when SETI@home is having issues such as this, Einstein@home certainly gets a huge "bump" from the suddenly influx of free CPU time. We joked how the these thumper issues strangely coincided with their arrival last week.
Meanwhile, I'm back on radar blanking detail. We're now trying cross-correlations to match radar patterns using fftw.
- Matt
____________
-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude |
|
|
TOMVolunteer tester
 Send message
Joined: 5 Apr 01 Posts: 58 Credit: 51,437,507 RAC: 92,701

|
|
Is there a difference on how the drives are enumerated when you boot from grub when you had 24 drives or 48 and between 24 and 48 when you boot from DVD?
- Tom
____________
|
|
|
|
|
Mmm-kay. So where are we at with the science database...? The morning today was much like yesterday: me, Eric, and Jeff shouting over the deafening noise of the server closet, taking turns hunched over a monitor attached directly to thumper (the kvm monitor was having separate issues). Lots of reboots and unexpected (and unpleasant) results. Lots of thinking we found the problem only to reboot and (five minutes later) finding we were wrong, then having to reboot again off of DVD (taking another five minutes).
Basically our discussions were along the lines of: Why does the boot metadevice disappear when booting off of DVD? And why does the root metadevice disappear when coming up via grub? Didn't we resync these two drives yesterday? Oh look - the grub device map is referring to /dev/sdm, which was how the root drive was enumerated when there were only 24 drives in the system - it should be referring to /dev/sdy now that we have 48 - so this must be at least one of our problems! Nope. Changing that did nothing. Etc. etc. etc. etc.
Well, whatever. It's been a two-day-long game like a demented version Towers-of-Hanoi - swapping drives, installing/reinstalling grub, resyncing devices, reconfiguring mdadm, then going back to step one and trying a different permutation. On hindsight it probably would have been easier to just install a new OS from scratch (though we would have had to recreate a web of informix configuration which also exists on the root drives). Right now the system is actually up (finally) and resyncing one mirror (again) and will have to sync another once that's finished. So we're offline for another day, and we haven't even gotten to the pulse table problems yet. I will still try to get Astropulse running in some form later on today/tonight.
Funny thing: Oliver and Bernd of Einstein@home have been visiting from Germany, collaborating with Dave on some general BOINC stuff. They left just a couple hours ago, but we did discuss how when SETI@home is having issues such as this, Einstein@home certainly gets a huge "bump" from the suddenly influx of free CPU time. We joked how the these thumper issues strangely coincided with their arrival last week.
Meanwhile, I'm back on radar blanking detail. We're now trying cross-correlations to match radar patterns using fftw.
- Matt
Are You sure they weren't Hunting for Hossenfeffer?
____________
BSG Anthem
My Facebook page
|
|
|
|
|
Are You sure they weren't Hunting for Hossenfeffer?
Who or what is Hossenfeffer?
Gruß,
Gundolf
[edit] Hasenpfeffer? [/edit] |
|
|
|
|
Are You sure they weren't Hunting for Hossenfeffer?
Who or what is Hossenfeffer?
Gruß,
Gundolf
[edit] Hasenpfeffer? [/edit]
I suspect a reference to an old episode of Bugs Bunny
"Bring me my Hossenfeffer!" says the King. Bugs is brought in on a lordly dish with a silver cover, "Mmm, what's up Doc?" etc. etc.
____________
|
|
|
|
|
Are You sure they weren't Hunting for Hossenfeffer?
Who or what is Hossenfeffer?
Gruß,
Gundolf
[edit] Hasenpfeffer? [/edit]
I suspect a reference to an old episode of Bugs Bunny
"Bring me my Hossenfeffer!" says the King. Bugs is brought in on a lordly dish with a silver cover, "Mmm, what's up Doc?" etc. etc.
Bingo as My Uncle would say, You win a cookie. Yeah, Rabbit Stew is what Google would find(the recipe, among others due to a misspelling).
____________
BSG Anthem
My Facebook page
|
|
|
|
|
Are You sure they weren't Hunting for Hossenfeffer?
Who or what is Hossenfeffer?
Gruß,
Gundolf
[edit] Hasenpfeffer? [/edit]
I suspect a reference to an old episode of Bugs Bunny
"Bring me my Hossenfeffer!" says the King. Bugs is brought in on a lordly dish with a silver cover, "Mmm, what's up Doc?" etc. etc.
Bingo as My Uncle would say, You win a cookie. Yeah, Rabbit Stew is what Google would find(the recipe, among others due to a misspelling).
Yup, one of my all time Bugs faves except for maybe "Hocus Pocus / Newport News"
|
|
|
|
|
Are You sure they weren't Hunting for Hossenfeffer?
Who or what is Hossenfeffer?
Gruß,
Gundolf
[edit] Hasenpfeffer? [/edit]
I suspect a reference to an old episode of Bugs Bunny
"Bring me my Hossenfeffer!" says the King. Bugs is brought in on a lordly dish with a silver cover, "Mmm, what's up Doc?" etc. etc.
Bingo as My Uncle would say, You win a cookie. Yeah, Rabbit Stew is what Google would find(the recipe, among others due to a misspelling).
Yup, one of my all time Bugs faves except for maybe "Hocus Pocus / Newport News"
Of course this is the One I was thinking of: The Big Snooze which until recently I didn't realize is older than Me.
____________
BSG Anthem
My Facebook page
|
|
|
|
|
Funny thing: Oliver and Bernd of Einstein@home have been visiting from Germany, collaborating with Dave on some general BOINC stuff. They left just a couple hours ago, but we did discuss how when SETI@home is having issues such as this, Einstein@home certainly gets a huge "bump" from the suddenly influx of free CPU time. We joked how the these thumper issues strangely coincided with their arrival last week.
Meanwhile, I'm back on radar blanking detail. We're now trying cross-correlations to match radar patterns using fftw.
- Matt
Best of luck getting it all sorted Matt.....I know from my own little crunching farm experience how frustrating it can be when things just don't work for unexplainable reasons....and the complexity of your systems there are multitudes greater.
Next time, keep those Einstein boyz outta the server closet, eh? I think they threw ya a mickey....LOL.
And nice to see you have a little time left over for some science again.
____________
******
"Ask not, what your kitty can do for you. Ask what you can do for your kitty."
As it is kitten, so shall it be done.
|
|
|
|
|
Funny thing: Oliver and Bernd of Einstein@home have been visiting from Germany, collaborating with Dave on some general BOINC stuff. They left just a couple hours ago, but we did discuss how when SETI@home is having issues such as this, Einstein@home certainly gets a huge "bump" from the suddenly influx of free CPU time. We joked how the these thumper issues strangely coincided with their arrival last week.
Meanwhile, I'm back on radar blanking detail. We're now trying cross-correlations to match radar patterns using fftw.
- Matt
Best of luck getting it all sorted Matt.....I know from my own little crunching farm experience how frustrating it can be when things just don't work for unexplainable reasons....and the complexity of your systems there are multitudes greater.
Next time, keep those Einstein boyz outta the server closet, eh? I think they threw ya a mickey....LOL.
And nice to see you have a little time left over for some science again.
Yeah I agree those Einstein guys like black holes too much, So keep'em away from the servers.
____________
BSG Anthem
My Facebook page
|
|
|
|
|
Funny thing: Oliver and Bernd of Einstein@home have been visiting from Germany, collaborating with Dave on some general BOINC stuff. They left just a couple hours ago, but we did discuss how when SETI@home is having issues such as this, Einstein@home certainly gets a huge "bump" from the suddenly influx of free CPU time. We joked how the these thumper issues strangely coincided with their arrival last week.
It looks like they were jealous.
All I am getting for Einstein is: "The server at einstein.phys.uwm.edu is taking too long to respond." and 27-Mar-09 19:17:47 Einstein@Home Scheduler request failed: HTTP file not found
____________
Jord
- BOINC FAQ Service
- BOINC User Wiki
Real is just a matter of perception. |
|
|
|
|
|
Yeah, but the SETI Server status page too hangs since 13:20:10 UTC ;-)
Gruß,
Gundolf
____________
Computer sind nicht alles im Leben. (Kleiner Scherz)
SETI@home classic workunits 3,758
SETI@home classic CPU time 66,520 hours |
|
|
|
|
|
Einstein is having its own problems, all right:
David Hammer wrote: We are having some filesystem problems.
I might have it running again in a few hours if all goes well but I don't know if a simple fsck is going to fix the problem yet.
No I was not jealous. ;)
David
____________
Jord
- BOINC FAQ Service
- BOINC User Wiki
Real is just a matter of perception. |
|
|
|
|
|
Must be a space-science-epidemic; Cosmo, Milkyway, Einstein and Seti all beeing down or at least having serious problems...
____________
mic.
|
|
|
|
|
Must be a space-science-epidemic; Cosmo, Milkyway, Einstein and Seti all being down or at least having serious problems...
What Folding doesn't have a problem?
____________
BSG Anthem
My Facebook page
|
|
|
|
|
Must be a space-science-epidemic; Cosmo, Milkyway, Einstein and Seti all beeing down or at least having serious problems...
The little green men have counted and we have too many humans in orbit right now. As soon as the shuttle lands everything will go back to normal. ;)
____________
|
|
|
|
|
|
News from Milkyway@Home:
Travis Desell wrote: We're trying to get the 2nd project set up for the GPUs, and it looks like there was some misunderstanding or miscommunication with labstaff in doing it. Hopefully things should be back up shortly.
____________
Jord
- BOINC FAQ Service
- BOINC User Wiki
Real is just a matter of perception. |
|
|
MarkJ Volunteer tester
 Send message
Joined: 17 Feb 08 Posts: 912 Credit: 14,583,555 RAC: 9,785

|
Funny thing: Oliver and Bernd of Einstein@home have been visiting from Germany, collaborating with Dave on some general BOINC stuff. They left just a couple hours ago, but we did discuss how when SETI@home is having issues such as this, Einstein@home certainly gets a huge "bump" from the suddenly influx of free CPU time. We joked how the these thumper issues strangely coincided with their arrival last week.
It must be sympathy pains, apparently their file server has crashed too and is being rebuilt. Sounds a lot like Thumper. When you try and logon to their website it comes up with this message (after about 3 mins):
Einstein@Home is down due to a fileserver crash. We are working to bring the project back online ASAP. UPDATE: Fri Mar 27 22:39:13 UTC 2009 The filesystem needs to be repaired so the project will likely be down for at least 12 more hours. Thank you for your patience.
____________
BOINC blog |
|
|
|
|
|
Ageless, thanks for the update from Travis.
____________
|
|
|
|
|
|
Since I seem to have all the news on the space themed projects, here's another one. ;-)
Cosmology@Home news update.
Scott Kruger wrote: Jord-
It's been incredibly frustrating. We've gotten a bunch of the hardware replaced, but we're still having a bunch of IO errors. I suspect that there are some bad sectors on one of the RAID drives, so I'm going to have to back up the entire array and reformat it.
If all goes well, it may be up tonight (at least the website).
-Scott
____________
Jord
- BOINC FAQ Service
- BOINC User Wiki
Real is just a matter of perception. |
|
|