Odd things after the big outage

Message boards : Number crunching : Odd things after the big outage
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile dcappello
Avatar

Send message
Joined: 3 Apr 99
Posts: 261
Credit: 170,969,320
RAC: 0
United States
Message 1054462 - Posted: 10 Dec 2010, 2:05:18 UTC

Just looking over things after the long outage and I notice
a few things - some of my rigs seem to be pulling a lot
of compute errors on CUDA WUs - others seeing the same?
Also I just noticed that looking over the stats I can't even find myself in the list - my RAC should be around 29K right now.

ID: 1054462 · Report as offensive
Profile Geek@Play
Volunteer tester
Avatar

Send message
Joined: 31 Jul 01
Posts: 2467
Credit: 86,146,931
RAC: 0
United States
Message 1054465 - Posted: 10 Dec 2010, 2:10:37 UTC

I don't know about your computer but I have been trying to locate myself on the statistics page also and cannot. Should be around RAC 22,500.
Boinc....Boinc....Boinc....Boinc....
ID: 1054465 · Report as offensive
Profile dcappello
Avatar

Send message
Joined: 3 Apr 99
Posts: 261
Credit: 170,969,320
RAC: 0
United States
Message 1054469 - Posted: 10 Dec 2010, 2:19:15 UTC - in response to Message 1054465.  

Geek - I found you at 30 just now.... perhaps some moths still left in the databases....
ID: 1054469 · Report as offensive
Terror Australis
Volunteer tester

Send message
Joined: 14 Feb 04
Posts: 1817
Credit: 262,693,308
RAC: 44
Australia
Message 1054470 - Posted: 10 Dec 2010, 2:19:30 UTC - in response to Message 1054462.  

Just looking over things after the long outage and I notice
a few things - some of my rigs seem to be pulling a lot
of compute errors on CUDA WUs - others seeing the same?
Also I just noticed that looking over the stats I can't even find myself in the list - my RAC should be around 29K right now.

No problems here, Fred Efmer has a thread about -177 errors due to "The Auto DCF" problem. Is this what you're getting ?

Re the stats problem, I was experiencing the same problem before the outage, must be some sort of bug in the server software. It usually corrects itself on the next update.

T.A.
ID: 1054470 · Report as offensive
Profile dcappello
Avatar

Send message
Joined: 3 Apr 99
Posts: 261
Credit: 170,969,320
RAC: 0
United States
Message 1054472 - Posted: 10 Dec 2010, 2:26:02 UTC - in response to Message 1054470.  

Humm not seeing those kinds of -177 errors... stuff like:

Reason: Illegal Instruction (0xc000001d) at address 0x0000000140003488

Details at:

http://setiathome.berkeley.edu/result.php?resultid=1738186251


ID: 1054472 · Report as offensive
Profile Geek@Play
Volunteer tester
Avatar

Send message
Joined: 31 Jul 01
Posts: 2467
Credit: 86,146,931
RAC: 0
United States
Message 1054489 - Posted: 10 Dec 2010, 3:21:22 UTC - in response to Message 1054469.  

Geek - I found you at 30 just now.... perhaps some moths still left in the databases....


Good grief.....I knew I should look before posting. I looked several times yesterday and was missing.

Boinc....Boinc....Boinc....Boinc....
ID: 1054489 · Report as offensive
John McLeod VII
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 15 Jul 99
Posts: 24806
Credit: 790,712
RAC: 0
United States
Message 1054493 - Posted: 10 Dec 2010, 3:27:34 UTC - in response to Message 1054489.  

Geek - I found you at 30 just now.... perhaps some moths still left in the databases....


Good grief.....I knew I should look before posting. I looked several times yesterday and was missing.

It can happen. The leaders stats pages each update on a different schedule (minimum time since the last update, but only if someone looks at the page). That means that if you are on page 2 when page 1 updates, but on page 1 when page 2 updates, you disappear.


BOINC WIKI
ID: 1054493 · Report as offensive
Profile soft^spirit
Avatar

Send message
Joined: 18 May 99
Posts: 6497
Credit: 34,134,168
RAC: 0
United States
Message 1054495 - Posted: 10 Dec 2010, 3:28:09 UTC - in response to Message 1054472.  

That.. is a heck of an error. Not being an expert at Hex addresses but I am guessing a memory error? Hopefully someone more knowledgeable on that can chime in.
Janice
ID: 1054495 · Report as offensive
Profile Bill Walker
Avatar

Send message
Joined: 4 Sep 99
Posts: 3868
Credit: 2,697,267
RAC: 0
Canada
Message 1054649 - Posted: 10 Dec 2010, 14:05:44 UTC

I'm getting a few -9 errors on my new GPU machine, about 1 in 10. Being new to GPUs, is this typical?

I'm just curious, I don't see this as a huge problem unless there is something I can correct. The WUs error out in a few minutes, I get a few credits, and a new WU right away.

ID: 1054649 · Report as offensive
Profile SciManStev Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Jun 99
Posts: 6653
Credit: 121,090,076
RAC: 0
United States
Message 1054666 - Posted: 10 Dec 2010, 14:42:25 UTC - in response to Message 1054649.  

I'm getting a few -9 errors on my new GPU machine, about 1 in 10. Being new to GPUs, is this typical?

I'm just curious, I don't see this as a huge problem unless there is something I can correct. The WUs error out in a few minutes, I get a few credits, and a new WU right away.


What I saw was a bunch of -185 errors, all coming from your CPU. I don't have time to check further at the moment as I'm at work.

Steve
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website
ID: 1054666 · Report as offensive
Profile Bill Walker
Avatar

Send message
Joined: 4 Sep 99
Posts: 3868
Credit: 2,697,267
RAC: 0
Canada
Message 1054678 - Posted: 10 Dec 2010, 15:30:02 UTC - in response to Message 1054666.  
Last modified: 10 Dec 2010, 15:34:23 UTC

What I saw was a bunch of -185 errors, all coming from your CPU. I don't have time to check further at the moment as I'm at work.

Steve


The CPU errors were all due to me installing the wrong optimized ap on my newish machine. That appears to have been fixed.

Looking closer I see the very short GPU tasks were not -9 error messages, but -9 "informational messages", like this task. Never noticed these before, and I was wondering how common they are.

ID: 1054678 · Report as offensive
Profile arkayn
Volunteer tester
Avatar

Send message
Joined: 14 May 99
Posts: 4438
Credit: 55,006,323
RAC: 0
United States
Message 1054690 - Posted: 10 Dec 2010, 15:53:19 UTC - in response to Message 1054678.  

Fairly common actually, it usually means that they captured a terrestrial signal in that sweep and it overwhelms anything else that might be in that area.

ID: 1054690 · Report as offensive
Profile perryjay
Volunteer tester
Avatar

Send message
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 1054692 - Posted: 10 Dec 2010, 15:57:36 UTC - in response to Message 1054678.  

Arkayn is right as usual but I might add that if you are getting a lot of them while your wingman finishes clean it might be a problem with your setup. Usually, heat or the wrong opt package will cause a machine to generate -9s.


PROUD MEMBER OF Team Starfire World BOINC
ID: 1054692 · Report as offensive
Profile Bill Walker
Avatar

Send message
Joined: 4 Sep 99
Posts: 3868
Credit: 2,697,267
RAC: 0
Canada
Message 1054695 - Posted: 10 Dec 2010, 16:04:20 UTC - in response to Message 1054692.  

Thanks guys. A quick check shows wingmen having the same results in several cases, so I guess it is just luck of the draw.

ID: 1054695 · Report as offensive
Josef W. Segur
Volunteer developer
Volunteer tester

Send message
Joined: 30 Oct 99
Posts: 4504
Credit: 1,414,761
RAC: 0
United States
Message 1054735 - Posted: 10 Dec 2010, 18:42:47 UTC - in response to Message 1054695.  

Thanks guys. A quick check shows wingmen having the same results in several cases, so I guess it is just luck of the draw.

Yes, when you get credit there's generally nothing to worry about. The Science Status page shows how many overflows were seen in the last 10 minutes, that usually indicates between 5 and 10 percent.
                                                                Joe
ID: 1054735 · Report as offensive
Profile Allie in Vancouver
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 3949
Credit: 1,604,668
RAC: 0
Canada
Message 1054736 - Posted: 10 Dec 2010, 18:44:07 UTC - in response to Message 1054462.  


Also I just noticed that looking over the stats I can't even find myself in the list - my RAC should be around 29K right now.


You're mo longer MIA. In 14th spot, atm. :)
Pure mathematics is, in its way, the poetry of logical ideas.

Albert Einstein
ID: 1054736 · Report as offensive

Message boards : Number crunching : Odd things after the big outage


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.