Odd things after the big outage
Author | Message |
---|---|
dcappello Joined: 3 Apr 99 Posts: 261 Credit: 170,969,320 RAC: 0 |
Just looking over things after the long outage and I notice a few things - some of my rigs seem to be pulling a lot of compute errors on CUDA WUs - others seeing the same? Also I just noticed that looking over the stats I can't even find myself in the list - my RAC should be around 29K right now. |
Geek@Play Joined: 31 Jul 01 Posts: 2467 Credit: 86,146,931 RAC: 0 |
I don't know about your computer but I have been trying to locate myself on the statistics page also and cannot. Should be around RAC 22,500. Boinc....Boinc....Boinc....Boinc.... |
dcappello Joined: 3 Apr 99 Posts: 261 Credit: 170,969,320 RAC: 0 |
Geek - I found you at 30 just now.... perhaps some moths still left in the databases.... |
Terror Australis Joined: 14 Feb 04 Posts: 1817 Credit: 262,693,308 RAC: 44 |
"Just looking over things after the long outage and I notice..." No problems here. Fred Efmer has a thread about -177 errors due to "The Auto DCF" problem. Is this what you're getting? Re the stats problem, I was experiencing the same thing before the outage; it must be some sort of bug in the server software. It usually corrects itself on the next update. T.A. |
dcappello Joined: 3 Apr 99 Posts: 261 Credit: 170,969,320 RAC: 0 |
Humm not seeing those kinds of -177 errors... stuff like: Reason: Illegal Instruction (0xc000001d) at address 0x0000000140003488 Details at: http://setiathome.berkeley.edu/result.php?resultid=1738186251 |
Geek@Play Joined: 31 Jul 01 Posts: 2467 Credit: 86,146,931 RAC: 0 |
"Geek - I found you at 30 just now.... perhaps some moths still left in the databases...." Good grief... I knew I should look before posting. I looked several times yesterday and I was missing. Boinc....Boinc....Boinc....Boinc.... |
John McLeod VII Joined: 15 Jul 99 Posts: 24806 Credit: 790,712 RAC: 0 |
"Geek - I found you at 30 just now.... perhaps some moths still left in the databases...." It can happen. The leader stats pages each update on a different schedule (a page is regenerated only once a minimum time has passed since its last update, and only if someone looks at the page). That means that if you are on page 2 when page 1 updates, but on page 1 when page 2 updates, you disappear. BOINC WIKI |
soft^spirit Joined: 18 May 99 Posts: 6497 Credit: 34,134,168 RAC: 0 |
That... is a heck of an error. I'm no expert on hex addresses, but I am guessing a memory error? Hopefully someone more knowledgeable on that can chime in. Janice |
Bill Walker Joined: 4 Sep 99 Posts: 3868 Credit: 2,697,267 RAC: 0 |
I'm getting a few -9 errors on my new GPU machine, about 1 in 10. Being new to GPUs, is this typical? I'm just curious, I don't see this as a huge problem unless there is something I can correct. The WUs error out in a few minutes, I get a few credits, and a new WU right away. |
SciManStev Joined: 20 Jun 99 Posts: 6653 Credit: 121,090,076 RAC: 0 |
"I'm getting a few -9 errors on my new GPU machine, about 1 in 10. Being new to GPUs, is this typical?" What I saw was a bunch of -185 errors, all coming from your CPU. I don't have time to check further at the moment as I'm at work. Steve Warning, addicted to SETI crunching! Crunching as a member of GPU Users Group. GPUUG Website |
Bill Walker Joined: 4 Sep 99 Posts: 3868 Credit: 2,697,267 RAC: 0 |
"What I saw was a bunch of -185 errors, all coming from your CPU. I don't have time to check further at the moment as I'm at work." The CPU errors were all due to me installing the wrong optimized app on my newish machine. That appears to have been fixed. Looking closer, I see the very short GPU tasks were not -9 error messages but -9 "informational messages", like this task. I never noticed these before, and I was wondering how common they are. |
arkayn Joined: 14 May 99 Posts: 4438 Credit: 55,006,323 RAC: 0 |
|
perryjay Joined: 20 Aug 02 Posts: 3377 Credit: 20,676,751 RAC: 0 |
Arkayn is right as usual but I might add that if you are getting a lot of them while your wingman finishes clean it might be a problem with your setup. Usually, heat or the wrong opt package will cause a machine to generate -9s. PROUD MEMBER OF Team Starfire World BOINC |
Bill Walker Joined: 4 Sep 99 Posts: 3868 Credit: 2,697,267 RAC: 0 |
Thanks guys. A quick check shows wingmen having the same results in several cases, so I guess it is just luck of the draw. |
Josef W. Segur Joined: 30 Oct 99 Posts: 4504 Credit: 1,414,761 RAC: 0 |
"Thanks guys. A quick check shows wingmen having the same results in several cases, so I guess it is just luck of the draw." Yes, when you get credit there's generally nothing to worry about. The Science Status page shows how many overflows were seen in the last 10 minutes; that usually indicates between 5 and 10 percent. Joe |
Allie in Vancouver Joined: 16 Mar 07 Posts: 3949 Credit: 1,604,668 RAC: 0 |
You're no longer MIA. In 14th spot, atm. :) Pure mathematics is, in its way, the poetry of logical ideas. Albert Einstein |
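
John McLeod VII's point about the leader stats pages (each page refreshing on its own schedule, so a user who changes pages between refreshes can vanish) can be illustrated with a small simulation. This is only a rough sketch under assumptions: the page size of 2, the user names, the RAC values, and the function names are all made up for illustration, and it is not the actual BOINC server code.

```python
PAGE_SIZE = 2  # hypothetical; real leaderboard pages are much longer


def ranked(rac):
    """Return user names ordered by RAC, highest first."""
    return sorted(rac, key=rac.get, reverse=True)


def snapshot_page(rac, page):
    """Build one leaderboard page from the live ranking at this moment."""
    order = ranked(rac)
    start = page * PAGE_SIZE
    return order[start:start + PAGE_SIZE]


def visible_users(cached_pages):
    """All users that appear on at least one cached page."""
    return {user for page in cached_pages.values() for user in page}


def demo():
    # Hypothetical users and RAC values.
    rac = {"alice": 400, "bob": 300, "carol": 200, "dave": 100}
    cache = {}

    # First page is regenerated while carol still ranks on the second page.
    cache[0] = snapshot_page(rac, 0)

    # carol's RAC climbs past bob's, so she now belongs on the first page.
    rac["carol"] = 350

    # Second page is regenerated later; carol is no longer in its range.
    cache[1] = snapshot_page(rac, 1)
    return cache


if __name__ == "__main__":
    cache = demo()
    print(cache)
    print("carol listed:", "carol" in visible_users(cache))
```

In this toy run carol is on neither cached page while bob shows up on both, which is the "you disappear" case described in the thread; once the first page is regenerated she would show up again.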