Marvin crashed
Author | Message |
---|---|
Eric Korpela Joined: 3 Apr 99 Posts: 1382 Credit: 54,506,847 RAC: 60 |
It appears that the root partition filled on marvin (rapidly) while I was AFK, for no reason that I am aware of, which caused it to crash. Nobody is at the colocation facility right now, so the astropulse DB is down. I'll try to get remote access for a reboot, but chances are that marvin is down until Monday morning. @SETIEric@qoto.org (Mastodon) |
Zalster Joined: 27 May 99 Posts: 5517 Credit: 528,817,460 RAC: 242 |
Sorry to hear that. Thanks for the update |
Grant (SSSF) Joined: 19 Aug 99 Posts: 13854 Credit: 208,696,464 RAC: 304 |
Many AP channels being split are also erroring out now. EDIT- to add to that, MB splitter output has almost halved, so the ready-to-send buffer is falling & we should be out of work in the next 18-24 hours. Grant Darwin NT |
Dimly Lit Lightbulb 😀 Joined: 30 Aug 08 Posts: 15399 Credit: 7,423,413 RAC: 1 |
Blimey, you folks are certainly having a run of bad luck :( Thanks for the update, Eric, fingers crossed it's an easy thing to figure out and solve. Member of the People Encouraging Niceness In Society club. |
Eric Korpela Joined: 3 Apr 99 Posts: 1382 Credit: 54,506,847 RAC: 60 |
It's probably easy to fix. The likely problem is getting support from the colocation facility on the weekend outside of working hours. We don't pay the additional fees for 24/7 support (primarily because they aren't small). @SETIEric@qoto.org (Mastodon) |
Dimly Lit Lightbulb 😀 Joined: 30 Aug 08 Posts: 15399 Credit: 7,423,413 RAC: 1 |
I take it it's not OK to leave it as it is until Monday in its crashed state then? If none of the remote stuff works? Member of the People Encouraging Niceness In Society club. |
Brent Norman Joined: 1 Dec 99 Posts: 2786 Credit: 685,657,289 RAC: 835 |
Many AP channels being split are also erroring out now. That should be OK, most people have a 7 day buffer. |
Grant (SSSF) Joined: 19 Aug 99 Posts: 13854 Credit: 208,696,464 RAC: 304 |
That should be OK, most people have a 7 day buffer. Depends on what you mean by most people. Mine will last 8-12 hours, except for my slower machine, which will have about 4 days' work. Others will run out in a couple of hours, or less. Grant Darwin NT |
Gary Charpentier Joined: 25 Dec 00 Posts: 31008 Credit: 53,134,872 RAC: 32 |
It appears that the root partition filled on marvin (rapidly) while I was AFK, for no reason that I am aware of, which caused it to crash. Nobody is at the colocation facility right now, so the astropulse DB is down. I'll try to get remote access for a reboot, but chances are that marvin is down until Monday morning. Just be sure it didn't fill because error messages were flooding the log files to overflowing. |
Jord Joined: 9 Jun 99 Posts: 15184 Credit: 4,362,181 RAC: 3 |
while I was AFK Wait, why would you even be AK (at keyboard) on a Sunday? |
tullio Joined: 9 Apr 04 Posts: 8797 Credit: 2,930,782 RAC: 1 |
I am running 7 BOINC projects, plus one not BOINC (CernVM_WebAPI). Most of them go on vacation for Xmas because developers and admins enjoy their vacations, but some of them (like Einstein@home) still give me work. So Merry Christmas to everybody. Tullio |
bluestar Joined: 5 Sep 12 Posts: 7264 Credit: 2,084,789 RAC: 3 |
It is nice to see that someone is bothering about this project at all. Supposedly I too often end up here having other thoughts on my mind. I paid a visit to Lunatics yesterday evening. Only had a quick look at their page. Really I am under the impression that application development is a continuous process which never seems to end. One may perhaps be asking whether or not such applications (including the special or proprietary ones) are able to detect a signal if it should be present. For now we are only able to make assumptions on whether or not a signal ever was there by looking at the four result categories, as well as the processing times of a given task and autocorrelation, of course. Back in 1977, Jerry R. Ehman probably was able to detect the Wow signal because the area was already known to be rich in radio sources. Whether the source behind this signal was stationary probably never will be known for sure, since it was detected in only one of the two horns in use at that time on the radio telescope belonging to the Ohio State University. Sadly this facility is no more. The whole thing was eventually torn down and replaced by other facilities. One part of history gone, replaced by something else. |
Bernie Vine Joined: 26 May 99 Posts: 9958 Credit: 103,452,613 RAC: 328 |
It is nice to see that someone is bothering about this project at all. Over the last few weeks we have been kept up to date with all the problems. We also know that Matt and Jeff have been involved. So please be a little more respectful. |
Eric Korpela Joined: 3 Apr 99 Posts: 1382 Credit: 54,506,847 RAC: 60 |
I take it it's not OK to leave it as it is until Monday in its crashed state then? If none of the remote stuff works? It's mainly generation of astropulse work that will suffer. No permanent damage should result. But I'd like to have everything in working order before folks do leave for the holidays. @SETIEric@qoto.org (Mastodon) |
Richard Haselgrove Joined: 4 Jul 99 Posts: 14679 Credit: 200,643,578 RAC: 874 |
I take it it's not OK to leave it as it is until Monday in its crashed state then? If none of the remote stuff works? Could you ask them to take a look at Lando as well tomorrow, please? Lando's four MB splitters don't seem to have been pulling their weight since Marvin went down. |
bluestar Joined: 5 Sep 12 Posts: 7264 Credit: 2,084,789 RAC: 3 |
You are of course correct when saying so. I should not be using such language here. Rather I should also say "that" instead of "which" - again my poor language skills. I could also mention that I again may have made a new discovery regarding prime numbers, or at least factors. Eventually quite a number of these factors, 100 - 1000 digit ones, should become available in the near future as I get this new collection put together and later uploaded at the proper place (The Factor Database). Also there really is a marked contrast between those .vlar's and the CUDA-based SETI@home tasks when it comes to processing times. I do like those tasks that are carrying out the Gaussian search better than the .vlar tasks, but apparently there may be a new batch of tasks later for the CPU which may be doing exactly that, which should make it unnecessary for me to change the preferences back to include CUDA tasks as an option. The Genefer tasks are also a fascinating subject, but running these tasks by means of CUDA is demanding and is straining both input and output as well as visible graphics on the screen. Definitely there are both advantages and disadvantages in doing all of this. Sitting at home like you may also be doing, I probably forget it is a Sunday today. And in fact Christmas is coming up as well, only 3 or 4 days from now. I wish you good luck in fixing Marvin, Eric! |
Eric Korpela Joined: 3 Apr 99 Posts: 1382 Credit: 54,506,847 RAC: 60 |
Marvin isn't finding a boot device on reboot. I'm not even sure it's seeing the RAID card at all (these remote interfaces to the boot screen aren't good at capturing things that happen quickly). Matt and I are meeting at the co-lo first thing tomorrow morning. I'm bringing a boot CD and what I think is a matching RAID card. @SETIEric@qoto.org (Mastodon) |
Dad Joined: 21 May 99 Posts: 44 Credit: 35,266,844 RAC: 10 |
Thank you again for all the extra effort you make for us. |
Gary Charpentier Joined: 25 Dec 00 Posts: 31008 Credit: 53,134,872 RAC: 32 |
Marvin isn't finding a boot device on reboot. I'm not even sure it's seeing the RAID card at all (these remote interfaces to the boot screen aren't good at capturing things that happen quickly). Matt and I are meeting at the co-lo first thing tomorrow morning. I'm bringing a boot CD and what I think is a matching RAID card. Ouch. Good Luck. Thanks. |
Brent Norman Joined: 1 Dec 99 Posts: 2786 Credit: 685,657,289 RAC: 835 |
Marvin isn't finding a boot device on reboot. I'm not even sure it's seeing the RAID card at all (these remote interfaces to the boot screen aren't good at capturing things that happen quickly). Matt and I are meeting at the co-lo first thing tomorrow morning. I'm bringing a boot CD and what I think is a matching RAID card. You need a remote-controlled "AstroBot" with a wireless cam on it so you can run around and look at things when the doors are locked :D |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.