Extended outage Jul 20 2010 - problems |
![]() |
| log in |
Message boards : Number crunching : Extended outage Jul 20 2010 - problems
| Author | Message |
|---|---|
|
Hi | |
| ID: 1018032 · | |
|
My Observation: It's seti's system--they can start up/shut down whenever they choose--just appreciate the updates. | |
| ID: 1018042 · | |
|
We have a political forum but not a Sports forum?? | |
| ID: 1018048 · | |
|
Good thought Geek@Play | |
| ID: 1018051 · | |
|
I don't mind the early start to the outage but I am a little worried about the problems that popped up just before they shut down. By that I mean this message... 7/20/2010 10:50:01 AM SETI@home [error] Error reported by file upload server: can't open file /home/boincadm/projects/sah/upload/3e5/06jn10aa.23528.19699.12.10.7_1_0: Read-only file systemand the Validate errors that started to appear. I know they have a script they can run for the validate errors but will we have to list them all or will they be able to do them themselves? Another thing that would be nice if they could find some way to handle all the ghosts we are getting. It is a pain to have to run down our caches and detach to clear them but if we leave them they interfere with our getting new tasks when we come back from the outage. Also, do time outs count as errors for the daily quota? Ok, that's all I can think of to bi....uhh complain about. ____________ PROUD MEMBER OF Team Starfire World BOINC | |
| ID: 1018056 · | |
|
As I did get time to read a lot of the various threads. It would appear that a portion of the cause of all of the odd things that people saw including the early shutdown was the result of the Boinc Database crashing. | |
| ID: 1018073 · | |
As I did get time to read a lot of the various threads. It would appear that a portion of the cause of all of the odd things that people saw including the early shutdown was the result of the Boinc Database crashing. Pappa, Whatever it was, it didn't feel like a database crash from here. First there was a network outage, cutting off uploads and web server access ('page not found'), and from the Cricket graph downloads too, though I can't confirm that from personal observation. None of those has anything to do with BOINC, and a database crash causes different symptoms. Then there were file storage problems - the upload area reporting itself to be read-only,, and the validator failing to find previously-uploaded and previously-accessible result files (as Joe has pointed out). Again, nothing to do with database access there. The speculation I put in my PM to Jeff - and it is only speculation (no reply as yet), from 6,000 miles away, and to be taken with a pinch of salt - was a power surge or brownout which triggered some, but not all, of the mess of interconnected devices in the server closet to reboot themselves. So some machines carried on as normal - web server, upload server: they're on UPS, I think - but other devices, such as the big network storage unit, weren't ready for a while. That's what happens after a brownout on my little network at home: different devices have different susceptibilities. | |
| ID: 1018091 · | |
|
And I posted already in another thread, 40 MB tasks which have a Detached Label, they where UPloaded 13 july. (Previous outage) | |
| ID: 1018102 · | |
|
I set my account to download 10 days data to download and process and only 1 of my systems is rcving all the work units. Have not rcvd new WU's in several days. What is up? | |
| ID: 1018560 · | |
I set my account to download 10 days data to download and process and only 1 of my systems is rcving all the work units. Have not rcvd new WU's in several days. What is up? Every Week, we have extended outages. Tuesday to Friday. This is to work on server issues that can not be completed in about 7 hours and start doing some Science that has been put off for ages. Friday, tomorrow afternoon... Everyone should be at the point where things start to flow again. If I seem a bit "loopy," I have did about 700 miles driving in the last three days. None of it was "pretty." Don't ask... Regards ____________ Please consider a Donation to the Seti Project. | |
| ID: 1018591 · | |
|
Okay.. I suppose the outtage will start to come up soon, I have coffee brewing, and the mad rush to upload on the horizon... | |
| ID: 1018653 · | |
Okay.. I suppose the outtage will start to come up soon, I have coffee brewing, and the mad rush to upload on the horizon... None that I know of. Just watch out for new posts by Jeff Cobb or the team. Let's hope he's remembered to have a look at the upload server filesystem, and reset the validators, before going live. | |
| ID: 1018656 · | |
|
| |
| ID: 1018657 · | |
|
Place your bets. | |
| ID: 1018660 · | |
|
yeah, having 10 days' worth of data is highly recommended, especially for high-output systems. nothing is more frustrating than watching your i7 rig with a gtx 260 sitting idle for 3 days... | |
| ID: 1018669 · | |
So Jeff says. | |
| ID: 1018737 · | |
|
I have to say.. this has been one of the smoothest outtages I have seen over all. Pending numbers seem to be going way up, I am not sure why, But other than the crash going into the outtage, all seems to have brought few surprises. | |
| ID: 1018769 · | |
|
Well, the MDB Queries are relative high - near 1.200. And the Upload Rate is over 45 Mbits/sec - i haven't seen such a high value... | |
| ID: 1018821 · | |
Message boards : Number crunching : Extended outage Jul 20 2010 - problems
| Copyright © 2013 University of California |