Out of the fire and into the pit of sulfuric acid. (Feb 19, 2010) |
Message boards : Technical News : Out of the fire and into the pit of sulfuric acid. (Feb 19, 2010)
| Author | Message |
|---|---|
|
Gargh! The science database on thumper went down at 2am due to a filled root partition. One of the RAID arrays on thumper lost a drive at about the same time, and uploads are still too slow. | |
| ID: 971816 · | |
|
Thanks Eric, good to know. Hope everything turns out ok. | |
| ID: 971821 · | |
|
Thanks for the update. | |
| ID: 971828 · | |
|
Beta is still down. | |
| ID: 971829 · | |
|
thx Eric for your undying effort. sigh | |
| ID: 971836 · | |
|
Wehay I have work!!!!!!!!!!!!!!!! HAHA :)! | |
| ID: 971843 · | |
|
Still unable to connect to server :( | |
| ID: 971854 · | |
|
Thanks for the update :) | |
| ID: 971855 · | |
|
Unfortunately Bob and Jeff brought the splitters and assimilators down to allow the RAID array to rebuild at full speed. That'll delay more work by a couple hours. Hopefully uploads are back to full speed by then. | |
| ID: 971856 · | |
|
Mine came on for a short while; one machine got some more tasks at 19:13 GMT, but the main machine didn't even try at the same time. Hope the repairs are going well! | |
| ID: 971859 · | |
|
Thanks for the update and many thanks for all the efforts of the SETI@Home staff. | |
| ID: 971866 · | |
"Unfortunately Bob and Jeff brought the splitters and assimilators down to allow the RAID array to rebuild at full speed. That'll delay more work by a couple hours. Hopefully uploads are back to full speed by then."
Nice to hear, but I and others have been trying to upload for the last 7 days and all we get is project backoff. Yesterday I was able to upload 2 WUs, and only 2, as those were the only ones that were ever sent acks. Somewhere in the next 15.5 hours I'll run out of work and the PC here will still be trying to upload, but can't, thanks to the project backoff. People are not happy about the backoff at all, as there are threads about it. Yet people seem to think, "Oh, you're only unable to upload because of the outage." Richard Hasslegrove and I agree: this was happening before that and is not outage related. As he says, the traffic is just not reaching SETI. It's like we're getting bouncebacks saying, "Sorry, said server doesn't exist there, go away." | |
| ID: 971867 · | |
|
Superjoker: as explained elsewhere the backoffs are our friend as without them the servers would be flooded with requests & no-one would get anywhere. The longer the backoffs the better as it spreads the load more. | |
| ID: 971873 · | |
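To illustrate the point about backoffs spreading load, here is a minimal sketch of randomized exponential backoff. It is a generic illustration, not the actual BOINC client algorithm; the function name `backoff_delay` and the `base` and `cap` values are assumptions chosen for the example.

```python
import random

def backoff_delay(attempt, base=60.0, cap=4 * 3600.0, rng=random):
    """Return a randomized wait in seconds before retry number `attempt` (0-based).

    `base` and `cap` are hypothetical values, not BOINC's real settings.
    """
    # The upper bound doubles with every failed attempt, up to `cap`.
    ceiling = min(cap, base * (2 ** attempt))
    # Pick uniformly in [0, ceiling] so that clients which failed at the
    # same moment do not all come back at the same moment.
    return rng.uniform(0.0, ceiling)

# Example: waits for ten consecutive failed attempts, with a fixed seed.
rng = random.Random(0)
delays = [backoff_delay(attempt, rng=rng) for attempt in range(10)]
```

The longer the ceiling grows, the wider the window over which retries are scattered, which is the sense in which longer backoffs spread the load more.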
"Superjoker: as explained elsewhere the backoffs are our friend as without them the servers would be flooded with requests & no-one would get anywhere. The longer the backoffs the better as it spreads the load more."
Backoffs are a perfect technique for spreading the load when the complete system is capable of handling (with an adequate safety margin) the aggregate anticipated demand averaged over an extended period of time. That's how SETI normally runs, and a few backoffs to shave the peaks and fill the troughs are exactly what's needed. Backoffs do not help if the aggregate load exceeds - over an extended period - the system's capacity to absorb work. Then you have to take more drastic action, to reduce demand or increase supply. For the last 4.5 days (only), SETI's capacity to absorb work has been below demand. I see no sign that demand has increased: instead, it seems to me that capacity has decreased (hopefully, temporarily). No amount of smoothing (backoffs) will solve this. What is needed is to restore the status quo ante on the capacity side. | |
| ID: 971881 · | |
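The capacity argument in the post above can be sketched with a toy queue model. All numbers here are made up for illustration (they are not SETI@home measurements), and `simulate_backlog` is a hypothetical helper: deferring uploads only moves work to a later hour, so the backlog drains when average demand is below capacity and grows without limit when it is above.

```python
def simulate_backlog(demand_per_hour, capacity_per_hour):
    """Track queued uploads hour by hour; return the backlog after each hour."""
    backlog = 0
    history = []
    for demand in demand_per_hour:
        backlog += demand                            # new results waiting to upload
        backlog -= min(backlog, capacity_per_hour)   # server absorbs what it can
        history.append(backlog)
    return history

# Peaky demand averaging 100 uploads/hour over 24 hours (hypothetical numbers).
demand = [160, 40] * 12

healthy = simulate_backlog(demand, capacity_per_hour=110)   # capacity above average demand
degraded = simulate_backlog(demand, capacity_per_hour=80)   # capacity below average demand

print(healthy[-1], degraded[-1])   # prints: 0 480
```

In the healthy case the peaks are simply deferred into the troughs, which is what backoffs achieve. In the degraded case the backlog climbs by 40 every two hours no matter how the retries are scheduled.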
"Gargh! The science database on thumper went down at 2am due to a filled root partition."
I know this pain very well. Our main source code server (at my employer) died for exactly the same reason. Not quite so many users depending on it, but still painful - especially as I was sat not very far away from it but had no access to the server room (at the time).
____________ Stats site - http://www.teamocuk.co.uk - still alive and (just about) kicking. | |
| ID: 971883 · | |
|
This will take some time to recover from. I have dozens and dozens of WUs to upload and I'm a small-time farmer. Think of the ones that have 4 295s on an i7 and not just one Rig/Machine/System/MoneyPit/ElectricEater/ObjectOfAffection. | |
| ID: 971892 · | |
"Superjoker: as explained elsewhere the backoffs are our friend as without them the servers would be flooded with requests & no-one would get anywhere. The longer the backoffs the better as it spreads the load more."
That's all fine and good but some of us have been having upload/report problems for the last week. Scarecrow graphs show that returned units are still half of what they are normally. I'm still getting "project servers may be down" when it tries to connect. Done units keep failing to upload. The point is this isn't normal behavior and last time it was a faulty switch. It's aggravating that some people simply aren't seeing this problem while others have been seeing this even before the AC crash and the first group is saying "all is well".
____________ "Life is just nature's way of keeping meat fresh." - The Doctor | |
| ID: 971899 · | |
"It's aggravating ...."
Well, it was aggravating while the problem was unacknowledged. Now that Eric is onsite, and has uttered the magic words "uploads are still too slow", my aggravation levels have dropped considerably. Hi Eric! Sorry you've copped for a miserable Friday, but thanks for the post. If there's anything useful we can do by way of remote logging/diagnostics/testing, please ask. And please note Keith's point that scheduler updates are slow-to-nonexistent as well. | |
| ID: 971900 · | |
"It's aggravating that some people simply aren't seeing this problem while others have been seeing this even before the AC crash and the first group is saying 'all is well'."
There is a big difference between saying "all is well" and saying "I think it's fixed, let's see if it keeps getting better." I've been tracking a memory leak in one of my projects (not related to SETI or BOINC). It took about 20 minutes to fix the leak, but it took nearly a day for the results to show.
[edit] My biggest worry for the SETI gang is that something stressed during the overheat hasn't decided to fail enough to show. It may be running flawlessly at least until the last staff member to leave is 10 minutes down the road. Knock on wood. | |
| ID: 971904 · | |
|
There is always "MONDAY" for somebody. Thanks for the efforts and dedication of the staff. | |
| ID: 971905 · | |
| Copyright © 2013 University of California |