Panic Mode On (34) Server Problems
Author | Message |
---|---|
Josef W. Segur Joined: 30 Oct 99 Posts: 4504 Credit: 1,414,761 RAC: 0 |
Total cash donations since 1 June 2008 have been $217,770.11, so about half that annually. It amounts to about 1/5 of a reasonable budget, so perhaps we should consider anything more than 20% uptime a testament to the skills and dedication of project staff. Joe |
Joined: 18 Sep 03 Posts: 834 Credit: 1,807,369 RAC: 0 |
"Upload server RAID failed. So much for RAID meaning redundant." That does not necessarily mean that data was lost. Some hardware other than the hard drives may have failed, or it may even be a software problem (IIRC they are using software RAID). |
Robert Ribbeck Joined: 7 Jun 02 Posts: 644 Credit: 5,283,174 RAC: 0 |
"Total cash donations since 1 June 2008 have been $217,770.11, so about half that annually. It amounts to about 1/5 of a reasonable budget, so perhaps we should consider anything more than 20% uptime a testament to the skills and dedication of project staff. Joe" Guess you can't count the continuing grant money they receive or hardware donations toward the contributions. |
Joined: 31 Jul 01 Posts: 2467 Credit: 86,146,931 RAC: 0 |
Robert... In case you don't know this, Josef W. Segur has been with SETI from its inception. He has made or has been responsible for countless bug squashes and enhancements to BOINC and the science apps. He has given countless hours to the project, and also from his checkbook. He should by all rights have the "project scientist" title, but he prefers to remain on the sidelines. Now, if the project is not up to your standards, I invite you to make a financial donation to the project so that perhaps the project can more closely identify with your standards. Boinc....Boinc....Boinc....Boinc.... |
Berserker Joined: 2 Jun 99 Posts: 105 Credit: 5,440,087 RAC: 0 |
"I hope everyone read the news on the home page today. Upload server RAID failed. So much for RAID meaning redundant." Only the disks are redundant, so there's no redundancy in the controller (hardware or software). Additionally, most common RAID levels only handle a single disk failure at a time. I've seen three disks fail near-simultaneously (out of a four-disk array). That's always fatal. Also, since recovery from a failure is an inherently intensive task, it will sometimes trigger a secondary disk failure. Hopefully, what the SETI staff are dealing with just requires a resync. It is often possible to do a resync with the server online, but it causes a significant performance loss - which is bad news if the servers are as heavily loaded as the SETI servers are. Better to do the recovery offline then. Stats site - http://www.teamocuk.co.uk - still alive and (just about) kicking. |
Robert Ribbeck Joined: 7 Jun 02 Posts: 644 Credit: 5,283,174 RAC: 0 |
"Robert..." Ya ya, blah blah blah. Good for him. That does not make him ALWAYS correct. In this case he made an incomplete analysis, and you took it on yourself to SLAM me and imply that I HAVE some problem for correcting him. SHAME ON YOU. |
Wandering Willie Joined: 19 Aug 99 Posts: 136 Credit: 2,127,073 RAC: 0 |
Away, guys, you have already had one thread locked today; don't continue on this one. |
Fayvitt Joined: 29 Nov 09 Posts: 217 Credit: 1,190,636 RAC: 0 |
Is there any chance we could get our resident "forum experts" into Berkeley to donate their time and expertise for the SETI@home cause? There seems to be a wealth of talent presenting itself. Wondering if there's some way we can tap into it and harness this plethora of knowledge. |
Fayvitt Joined: 29 Nov 09 Posts: 217 Credit: 1,190,636 RAC: 0 |
Sorry WW, saw your post after I posted the previous one. Everyone have a pleasant evening! |
Joined: 16 Mar 07 Posts: 3949 Credit: 1,604,668 RAC: 0 |
"Away, guys, you have already had one thread locked today; don't continue on this one." lol "Pure mathematics is, in its way, the poetry of logical ideas." Albert Einstein |
Joined: 18 May 99 Posts: 6497 Credit: 34,134,168 RAC: 0 |
Had about 2 pages of units "to do" on one machine. I was going to suspend additional ones when I got a window complaining that my login was unknown on the host(?). I had not clicked the button, by the way. It was a persistent error, so I rebooted. It came back and the SETI units had vanished. Buffer has been set at 4 days; now running on 1 machine, approx. 36 hours of work left. Janice |
Josef W. Segur Joined: 30 Oct 99 Posts: 4504 Credit: 1,414,761 RAC: 0 |
"Total cash donations since 1 June 2008 have been $217,770.11, so about half that annually. It amounts to about 1/5 of a reasonable budget, so perhaps we should consider anything more than 20% uptime a testament to the skills and dedication of project staff. Joe" That's exactly why I said "cash donations". Certainly the grant Eric managed to get after that budget changes the picture somewhat, though I don't know how much. I doubt there's any significant amount from the old Astropulse grant. The hardware donations by Overland Storage and Intel, and continuing support from Sun, are certainly important too. Will 1/10 of the 1 GBit line being run to SSL cover S@H needs? Perhaps so; the ALFA receiver system at Arecibo is being used less than they had hoped. But if enough had been donated to fund a S@H-specific 1 GBit line being run up the hill as well as the 1 GBit for the rest of the Space Sciences Lab, the capacity would be there if they can get enough funding to record more than 1/100 of the ALFA bandwidth. If they had been able to get another staff member, would that have improved the uptime? Maybe, and maybe it would have meant more posts initiated by staff members. Joe |
Berserker Joined: 2 Jun 99 Posts: 105 Credit: 5,440,087 RAC: 0 |
Careful! Some here might enjoy that sort of thing! :D "Had about 2 pages of units 'to do' on one machine. I was going to suspend..." Open BOINC Manager and check that it says 'Connected to localhost' at the bottom right of the window. If it doesn't, the problem is probably local. If it does, something got hosed - possibly client_state.xml. Stats site - http://www.teamocuk.co.uk - still alive and (just about) kicking. |
Scarecrow Joined: 15 Jul 00 Posts: 4520 Credit: 486,601 RAC: 0 |
Me! Me! Me! Me! |
Dave Joined: 29 Mar 02 Posts: 778 Credit: 25,001,396 RAC: 0 |
Forums are always down @ weekly outage. + think of all the server CPU-time wasted dealing with all these horrible posts... |
Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
IMO it's much more important than any bandwidth or any other hardware upgrades. It's first priority indeed... |
Joined: 18 May 99 Posts: 6497 Credit: 34,134,168 RAC: 0 |
It does since reboot. And nothing is going to stop the cheap shots. It will not stop me from sharing what could be useful information. Maybe not. The timing was suspicious. But that is a stretch. Depending on where the units landed, might be no cause for alarm. If others are having vanishing units, could be cause for concern. Janice |
Robert Ribbeck Joined: 7 Jun 02 Posts: 644 Credit: 5,283,174 RAC: 0 |
"Total cash donations since 1 June 2008 have been $217,770.11, so about half that annually. It amounts to about 1/5 of a reasonable budget, so perhaps we should consider anything more than 20% uptime a testament to the skills and dedication of project staff. Joe" I guess I missed the "cash" part. I meant no slight; I was just pointing out that it's not just the cash donations that keep this going. I've been here since the classic days, and nothing surprises me or troubles me. Sure, things could be better... where couldn't they be? It's just a shame we can't all respect each other's opinions and not resort to slamming people we disagree with. On another note, something I've thought about for a while: this is all about distributed computing. Could not something be done to distribute the load of handling the "seti@home" project, instead of it all being done in a server room at Berkeley: splitting, verifications & creating the master database over several volunteers' computers on the internet? |
Berserker Joined: 2 Jun 99 Posts: 105 Credit: 5,440,087 RAC: 0 |
"it does since reboot." Same. While I do get frustrated with people making demands and claiming rights they don't have, I don't pretend for one moment that there aren't issues to be resolved - and bugs too. Given that the scheduling server has been off for most of the day, it's fairly unlikely to be anything server-side. Unless of course it happened in the brief window when the project was up and you just didn't notice at the time. That's plausible (I've had crunchers sat idle for a week because I didn't notice stuff). Maybe the logs (see BOINC data directory) might reveal something useful. Stats site - http://www.teamocuk.co.uk - still alive and (just about) kicking. |
kittyman Joined: 9 Jul 00 Posts: 51519 Credit: 1,018,363,574 RAC: 1,004 |
It seems to me that this has been proposed before, but it was determined at the time that any benefit in doing so would be lost in the complexities of transferring data back and forth between a master database and external hosts, verifying integrity of data, maintaining scientific integrity of same, etc., etc., etc. "Time is simply the mechanism that keeps everything from happening all at once." |
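
Berserker's point earlier in the thread, that most common RAID levels only handle a single disk failure at a time, can be illustrated with a toy single-parity sketch. This is only an illustration under stated assumptions: it is not how the SETI@home upload server or any real RAID driver is implemented, and the helper names `xor_blocks` and `rebuild_missing`, as well as the 4-byte "disk" contents, are made up for the example.

```python
from functools import reduce


def xor_blocks(blocks):
    """XOR equal-length byte blocks together, column by column."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


def rebuild_missing(stripe, missing_index):
    """Rebuild one missing block from the survivors of a single-parity stripe.

    `stripe` holds the data blocks followed by the parity block; the entry at
    `missing_index` is ignored, as if that disk had failed.
    """
    survivors = [b for i, b in enumerate(stripe) if i != missing_index]
    return xor_blocks(survivors)


# Hypothetical toy stripe: three data "disks" plus one parity "disk".
data = [b"SETI", b"BOIN", b"RAID"]
parity = xor_blocks(data)
stripe = data + [parity]

# One failed disk: XOR of the three survivors reproduces the lost block.
recovered = rebuild_missing(stripe, 1)

# Two failed disks (1 and 2): the two survivors only give the XOR of the
# two lost blocks, which is not enough to recover either one.
two_lost = xor_blocks([stripe[0], stripe[3]])
```

With one block missing, the parity equation has a single unknown and can be solved; with two missing, there is one equation and two unknowns, which is why a second failure during a rebuild is fatal for a single-parity array.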
©2025 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.