Moribund Monday (Apr 14 2008) |
Message boards : Technical News : Moribund Monday (Apr 14 2008)
| Author | Message |
|---|---|
|
Continuing problems with the workunit storage server... There were more resets over the weekend, ultimately resulting in one that caused the server to think enough drives have failed to call the entire RAID dead. We are confident we can trick the server into thinking otherwise - we actually have some helpful techs logged in doing that as I type. We still want to replace the whole box, which we'll hopefully do today, and then the drives will have to resync again. Chances are we'll be down until tomorrow (Tuesday). | |
| ID: 739049 · | |
|
Thanks for the update Matt, we know you do as much as you can. | |
| ID: 739055 · | |
|
hi | |
| ID: 739068 · | |
|
Hi Matt, just wonderin'. If you get it up and running by tomorrow AM, any chance of foregoing or delaying the dreaded maintenance day until Wednesday so we can all load up on WU's? At least we'd all be working and not sitting idle another whole day ;) | |
| ID: 739091 · | |
Hi Matt, just wonderin'. If you get it up and running by tomorrow AM, any chance of foregoing or delaying the dreaded maintenance day until Wednesday so we can all load up on WU's? At least we'd all be working and not sitting idle another whole day ;) Maybe a good time to check your hosts as well: defragmenting disks, cleaning the registry, removing never-used programs, virus/spyware scans, getting e-mail, etc. Vacuum cleaning your fans & coolers ;) Mylady says, get rid of the cables ?@#$% ____________ Knight Who Says Ni N!, OUT numbered................. | |
| ID: 739106 · | |
|
It's a constant battle isn't it?! | |
| ID: 739124 · | |
|
The Adaptec guys just left - the switchover to the new server looks like a complete success. Plus they coughed up an extra 2GB RAM for the new server while they were here - though that won't show up as a performance boost until the next rev of the OS. | |
| ID: 739127 · | |
The Adaptec guys just left - the switchover to the new server looks like a complete success. Plus they coughed up an extra 2GB RAM for the new server while they were here - though that won't show up as a performance boost until the next rev of the OS. Good news there. Thank you for the extra effort! | |
| ID: 739154 · | |
|
| |
| ID: 739170 · | |
Continuing problems with the workunit storage server... There were more resets over the weekend, ultimately resulting in one that caused the server to think enough drives have failed to call the entire RAID dead. We are confident we can trick the server into thinking otherwise - we actually have some helpful techs logged in doing that as I type. We still want to replace the whole box, which we'll hopefully do today, and then the drives will have to resync again. Chances are we'll be down until tomorrow (Tuesday). Just out of curiosity, is it wise to let clients get more work without being able to download the data files? What happens when the download server comes online and everybody tries to download the missing files (hours or days later)? Would it be better for the scheduler to respond "no work from project" until the download servers are back up? If not, then why not? | |
| ID: 739229 · | |
Continuing problems with the workunit storage server... There were more resets over the weekend, ultimately resulting in one that caused the server to think enough drives have failed to call the entire RAID dead. We are confident we can trick the server into thinking otherwise - we actually have some helpful techs logged in doing that as I type. We still want to replace the whole box, which we'll hopefully do today, and then the drives will have to resync again. Chances are we'll be down until tomorrow (Tuesday). While the database cleanup and backup is going on, the download and upload server is normally still running. This allows clients to download and upload files as needed, but uploaded results cannot be reported until the cleanup and backup completes. Therefore, if clients are assigned work units today, those files can be downloaded tomorrow while the database is down. The administrators once shut down the upload/download server during database cleanups and backups, hoping that the absence of upload/download activity would shorten the downtime. However, the post-downtime crunch was awful. Leaving the upload/download server active during the downtime caused only a slight slowdown, but it let the post-downtime crunch finish much more quickly: more of the packets going through the router afterwards were scheduler requests, their responses, and downloads instead of uploads, which took a sizable load off the then-overloaded router during crunch time. | |
| ID: 739240 · | |
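The reasoning in the post above can be put into rough numbers. What follows is only a toy sketch: the router capacity and the backlog sizes are invented for illustration and are not measurements from the project. It assumes a fixed-capacity router and compares the backlog to clear after a downtime when uploads were blocked during the outage versus when they were allowed to drain.

```python
def hours_to_clear(backlog_mb, router_mb_per_hour):
    """Hours needed to push a whole backlog through a fixed-capacity router."""
    return sum(backlog_mb.values()) / router_mb_per_hour


# Hypothetical router capacity; not a real figure from the project.
ROUTER_MB_PER_HOUR = 1000.0

# Case A (assumed numbers): upload/download server was shut off during the
# downtime, so finished results pile up on clients and arrive afterwards.
backlog_server_off = {"uploads": 3000.0, "scheduler": 500.0, "downloads": 4000.0}

# Case B (assumed numbers): the server stayed on, so uploads drained
# during the downtime and only scheduler traffic and downloads remain.
backlog_server_on = {"uploads": 0.0, "scheduler": 500.0, "downloads": 4000.0}

off_hours = hours_to_clear(backlog_server_off, ROUTER_MB_PER_HOUR)
on_hours = hours_to_clear(backlog_server_on, ROUTER_MB_PER_HOUR)

print(f"server off during downtime: {off_hours:.1f} h to clear")
print(f"server on during downtime:  {on_hours:.1f} h to clear")
```

With these made-up figures the crunch is shorter when uploads are not part of the post-downtime traffic, which is the effect described in the post. A real router would not behave this linearly under overload.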
|
Thanx again for the continued updates Matt. Sorry that you have had so many trials as of late.....hope the replacement download server solves that issue at least..... | |
| ID: 739259 · | |
|
It seems to have been a while since the last post, and I'm still having difficulty getting WUs. I was hoping that someone could post regarding their own situation, or on the success/failure/delay of the necessary upgrades/repairs. I just want to see if others are having any success, or if it's still a problem on my end. I know you guys are working hard so thank you all for allowing us to participate in SETI. | |
| ID: 739366 · | |
It seems to have been a while since the last post, and I'm still having difficulty getting WUs. I was hoping that someone could post regarding their own situation, or on the success/failure/delay of the necessary upgrades/repairs. I just want to see if others are having any success, or if it's still a problem on my end. I know you guys are working hard so thank you all for allowing us to participate in SETI. This page will tell you when the WU's start flowing again. As you can see from the graph there have been no WU's out for more than 24 hours. When the servers come back online, I expect there will be very heavy traffic for several hours, so if you run out of SETI work you may need a backup project at a small resource share. I ran out of SETI work last night (have 2 WU's stuck downloading) but my main PC still has work for 6 other projects. [edit]Other BOINC projects[/edit] ____________ Sir Arthur C Clarke 1917-2008 | |
| ID: 739385 · | |
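For anyone wondering what "a backup project at a small resource share" works out to, here is a minimal sketch. It assumes that each project's slice of computing time is its share divided by the sum of the shares of the projects that currently have work; the project names, the share values, and the helper `share_fractions` are hypothetical and are not part of BOINC itself.

```python
def share_fractions(shares, has_work):
    """Split computing time among projects that currently have work.

    shares:   dict of project name -> resource share (any positive number)
    has_work: dict of project name -> True if the project has work queued
    """
    active = {name: s for name, s in shares.items() if has_work.get(name)}
    total = sum(active.values())
    if total == 0:
        return {}
    return {name: s / total for name, s in active.items()}


# Hypothetical setup: SETI as the main project, one small backup project.
shares = {"SETI@home": 100, "backup": 10}

# Normal operation: both projects have work.
normal = share_fractions(shares, {"SETI@home": True, "backup": True})

# During an outage: SETI has no work, so the backup gets all the time.
outage = share_fractions(shares, {"SETI@home": False, "backup": True})

print(normal)
print(outage)
```

Under that assumption the backup project takes roughly a tenth of the time while SETI work is flowing and keeps the machine busy when it is not.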
It seems to have been a while since the last post, and I'm still having difficulty getting WUs. I was hoping that someone could post regarding their own situation, or on the success/failure/delay of the necessary upgrades/repairs. I just want to see if others are having any success, or if it's still a problem on my end. I know you guys are working hard so thank you all for allowing us to participate in SETI. Your post was just after 6:00am in Berkeley. Since Matt said the server was fixed but would need time to sync, I wouldn't expect it to be up until after they get in this morning and have a chance to check everything out.... ____________ | |
| ID: 739420 · | |
|
I finally got around to checking the computer. I just wanted to say that you guys are doing a fantastic job in spite of enormous difficulties. | |
| ID: 739421 · | |
|
I hope the DB backup goes well today...And that means No Nasty Surprises for you guys...Good luck! | |
| ID: 739446 · | |
|
I'm surprised to see that our friend, the "Reverend" hasn't been around to complain about this recent outage. He always insisted that it was his job to let the SETI team know when they aren't doing theirs. | |
| ID: 739504 · | |
|
I'm happy to say six work units have downloaded within | |
| ID: 739543 · | |
I'm happy to say six work units have downloaded within You can always increase your cache via your preferences in your account. ____________ | |
| ID: 739551 · | |
| Copyright © 2013 University of California |