WTH is this?

Message boards : Number crunching : WTH is this?
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
Profile Zombu2
Volunteer tester

Send message
Joined: 24 Feb 01
Posts: 1615
Credit: 49,315,423
RAC: 0
United States
Message 1732665 - Posted: 7 Oct 2015, 22:27:15 UTC

http://setiathome.berkeley.edu/workunit.php?wuid=1923735966
I came down with a bad case of i don't give a crap
ID: 1732665 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1732673 - Posted: 7 Oct 2015, 22:40:24 UTC - in response to Message 1732665.  

http://setiathome.berkeley.edu/workunit.php?wuid=1923735966

It's an invalid result. Much like the other 146 on your host.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1732673 · Report as offensive
Profile Zombu2
Volunteer tester

Send message
Joined: 24 Feb 01
Posts: 1615
Credit: 49,315,423
RAC: 0
United States
Message 1732683 - Posted: 7 Oct 2015, 23:14:40 UTC
Last modified: 7 Oct 2015, 23:15:37 UTC

yes but does it not take 3 to find consensus

looking at that wu we both have the same outcome

3 spikes

the other invalids are the same odd thing
I came down with a bad case of i don't give a crap
ID: 1732683 · Report as offensive
Profile arkayn
Volunteer tester
Avatar

Send message
Joined: 14 May 99
Posts: 4438
Credit: 55,006,323
RAC: 0
United States
Message 1732695 - Posted: 8 Oct 2015, 0:41:22 UTC

Checking the first page of invalids, show that they all came from Device 1.

ID: 1732695 · Report as offensive
Profile Zombu2
Volunteer tester

Send message
Joined: 24 Feb 01
Posts: 1615
Credit: 49,315,423
RAC: 0
United States
Message 1732698 - Posted: 8 Oct 2015, 0:49:54 UTC
Last modified: 8 Oct 2015, 0:52:24 UTC

look at the results what i'm saying is exept from teh ones that ended in error because the card crashed there is a bunch of em that did finish correctly with same result as other machine and 3rd party not even started to crunch it


how can it be invalid with 2 computers having the same result

and how can it be marked invalid without a 3rd machine
I came down with a bad case of i don't give a crap
ID: 1732698 · Report as offensive
Profile Zombu2
Volunteer tester

Send message
Joined: 24 Feb 01
Posts: 1615
Credit: 49,315,423
RAC: 0
United States
Message 1732699 - Posted: 8 Oct 2015, 0:55:03 UTC

same here same result as other machine yet i'm invalid

http://setiathome.berkeley.edu/workunit.php?wuid=1923735966
I came down with a bad case of i don't give a crap
ID: 1732699 · Report as offensive
Profile arkayn
Volunteer tester
Avatar

Send message
Joined: 14 May 99
Posts: 4438
Credit: 55,006,323
RAC: 0
United States
Message 1732701 - Posted: 8 Oct 2015, 0:59:11 UTC

Have you restarted the machine since this started?

ID: 1732701 · Report as offensive
Profile Zombu2
Volunteer tester

Send message
Joined: 24 Feb 01
Posts: 1615
Credit: 49,315,423
RAC: 0
United States
Message 1732702 - Posted: 8 Oct 2015, 1:01:40 UTC

yes couple hours ago when i noticed it
I came down with a bad case of i don't give a crap
ID: 1732702 · Report as offensive
Profile Zombu2
Volunteer tester

Send message
Joined: 24 Feb 01
Posts: 1615
Credit: 49,315,423
RAC: 0
United States
Message 1732703 - Posted: 8 Oct 2015, 1:05:05 UTC
Last modified: 8 Oct 2015, 1:06:38 UTC

funny thing is it says device 1 which is the secondary card which i swapped as soon as i saw the errors now the new card is doing the same aperantly

which is pretty much impossible

EDIT

I stick teh card in question in a spare machine i got and see if it does the same crap there but i doubt 2 cards are bad
I came down with a bad case of i don't give a crap
ID: 1732703 · Report as offensive
Profile Zombu2
Volunteer tester

Send message
Joined: 24 Feb 01
Posts: 1615
Credit: 49,315,423
RAC: 0
United States
Message 1732708 - Posted: 8 Oct 2015, 1:30:33 UTC
Last modified: 8 Oct 2015, 1:32:32 UTC

hmm it seems to have stopped magically the invalid count is going down

this is just too weird

EDIT

Maybe i spoke too soon
I came down with a bad case of i don't give a crap
ID: 1732708 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22160
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1732737 - Posted: 8 Oct 2015, 4:57:25 UTC
Last modified: 8 Oct 2015, 5:00:04 UTC

You are still getting invalids off that PC, this is classic symptoms of a very heavily overclocked card that has been taken beyond its limits. The latest report shows it to be clocked at 1500MHz against the (Nvidia) stock of 1000/1075MHz.

Even with water cooling that card is going to be right at the limit for calculation - it may be "perfectly fine" for doing graphics where the odd wrong bit is undetectable, but calculations will pick that bit up and the result will be "wrong".
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1732737 · Report as offensive
Profile Zombu2
Volunteer tester

Send message
Joined: 24 Feb 01
Posts: 1615
Credit: 49,315,423
RAC: 0
United States
Message 1732750 - Posted: 8 Oct 2015, 5:38:59 UTC - in response to Message 1732737.  
Last modified: 8 Oct 2015, 5:40:28 UTC

You are still getting invalids off that PC, this is classic symptoms of a very heavily overclocked card that has been taken beyond its limits. The latest report shows it to be clocked at 1500MHz against the (Nvidia) stock of 1000/1075MHz.

Even with water cooling that card is going to be right at the limit for calculation - it may be "perfectly fine" for doing graphics where the odd wrong bit is undetectable, but calculations will pick that bit up and the result will be "wrong".


actually these are ti hybrids and stock clocked at almost 1300

EDIT
the stock clock is actually 1379 i just looked
I came down with a bad case of i don't give a crap
ID: 1732750 · Report as offensive
Profile Donald L. Johnson
Avatar

Send message
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1732752 - Posted: 8 Oct 2015, 5:48:53 UTC - in response to Message 1732698.  

look at the results what i'm saying is exept from teh ones that ended in error because the card crashed there is a bunch of em that did finish correctly with same result as other machine and 3rd party not even started to crunch it

how can it be invalid with 2 computers having the same result

and how can it be marked invalid without a 3rd machine

The std.err.txt file is only a summary, it is not the actual result file. Having the same number of each type of signal as your wingmate does NOT mean the results are the same, especially when comparing a stock app with an Anonymous Platform app, or a CPU vs a GPU. Differences in the individual apps, such as signal thresholds, can cause differences even if the number of signals found is the same. You have to look at the actual result file.
Donald
Infernal Optimist / Submariner, retired
ID: 1732752 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1732758 - Posted: 8 Oct 2015, 6:07:13 UTC - in response to Message 1732750.  
Last modified: 8 Oct 2015, 6:07:44 UTC

Geesh, just back off your clocking to 1000.

Overclocking works for video (where a few artifacts don't matter) but for science it doesn't work.

Just try it for 2 days.
ID: 1732758 · Report as offensive
woohoo
Volunteer tester

Send message
Joined: 30 Oct 13
Posts: 972
Credit: 165,671,404
RAC: 5
United States
Message 1732759 - Posted: 8 Oct 2015, 6:09:04 UTC

doesn't evga's webpage say 1140 base clock and 1228 boost clock?
ID: 1732759 · Report as offensive
Profile Zombu2
Volunteer tester

Send message
Joined: 24 Feb 01
Posts: 1615
Credit: 49,315,423
RAC: 0
United States
Message 1732766 - Posted: 8 Oct 2015, 6:32:12 UTC

the card is set to defaults it is what it is it was fine for days until earlyer today

only thing that changed is i installed latest boinc yesterday morning
I came down with a bad case of i don't give a crap
ID: 1732766 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22160
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1732787 - Posted: 8 Oct 2015, 8:53:03 UTC

It actually started to through invalids at 5:06 on the 5th of October.
The trouble is hardware faults do occur, and they do occur more frequently at higher clock speeds and temperatures - and you are running that card way beyond the chip designer's specification for that chip set.
EVGA have decided that a batch of chip sets are OK for these increased clock speeds (and thus thermal stresses) that they see during their screening process, which is probably representative of gaming, but not 24/7 computing. Expose these chip sets to 24/7, full speed & power computing then they will degrade much faster than they would at their designed power & speed.
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1732787 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1732788 - Posted: 8 Oct 2015, 8:58:18 UTC - in response to Message 1732766.  

the card is set to defaults it is what it is it was fine for days until earlyer today

only thing that changed is i installed latest boinc yesterday morning

At some point either yesterday or today, you (or Microsoft) installed driver: 358.50 - that was only released yesterday, so that's a change, too. Was it before or after the errors started?
ID: 1732788 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1732790 - Posted: 8 Oct 2015, 9:02:15 UTC - in response to Message 1732787.  

Expose these chip sets to 24/7, full speed & power computing then they will degrade much faster than they would at their designed power & speed.

With two cards in the same case, one usually runs hotter than the other, unless the card cooling system (especially fan exhaust) is very carefully designed. If the card suffering most errors is the hotter one, you've found another warning sign.
ID: 1732790 · Report as offensive
Profile BilBg
Volunteer tester
Avatar

Send message
Joined: 27 May 07
Posts: 3720
Credit: 9,385,827
RAC: 0
Bulgaria
Message 1732821 - Posted: 8 Oct 2015, 11:52:17 UTC - in response to Message 1732750.  
Last modified: 8 Oct 2015, 11:53:57 UTC

the stock clock is actually 1379 i just looked

Where did you "just looked"?

When others say to you "stock clock", defaults, "by spec" - they mean NVIDIA

When you say the same - you mean "by EVGA"
And EVGA (as most card manufacturers) do "factory Overclock" (to impress gamers, benchmarks and sell more)
 


- ALF - "Find out what you don't do well ..... then don't do it!" :)
 
ID: 1732821 · Report as offensive
1 · 2 · Next

Message boards : Number crunching : WTH is this?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.