High Percentage of Invalid Results

Goodeid

Joined: 22 Aug 99
Posts: 7
Credit: 2,416,564
RAC: 0
Canada
Message 1752483 - Posted: 30 Dec 2015, 16:58:24 UTC

I have recently returned to SETI@home with a new machine (7862674). I read a number of messages about the work unit problems in early November, but I understand that these should have mostly dissipated by now. However, I am getting a very high percentage of invalid results.

Is there a problem with my machine, software, or configuration that is causing this? If so, what do I need to do to correct it?

Or will it eventually work itself out?
ID: 1752483
Goodeid
Joined: 22 Aug 99
Posts: 7
Credit: 2,416,564
RAC: 0
Canada
Message 1752491 - Posted: 30 Dec 2015, 17:26:29 UTC - in response to Message 1752483.  

This should make it easier to find the machine.
https://setiathome.berkeley.edu/show_host_detail.php?hostid=7862674
ID: 1752491
rob smith
Volunteer moderator
Volunteer tester
Joined: 7 Mar 03
Posts: 22202
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1752554 - Posted: 30 Dec 2015, 21:23:17 UTC

Grabbing the stderr for one of these invalids before it vanishes.


Task 4637062679


Name: 20fe11ab.6260.16435.10.19.232_0
Workunit: 2020397652
Created: 30 Dec 2015, 0:05:08 UTC
Sent: 30 Dec 2015, 4:49:41 UTC
Report deadline: 29 Feb 2016, 5:10:46 UTC
Received: 30 Dec 2015, 11:35:32 UTC
Server state: Over
Outcome: Success
Client state: Done
Exit status: 0 (0x0)
Computer ID: 7862674
Run time: 4 min 19 sec
CPU time: 4 min 6 sec
Validate state: Invalid
Credit: 0.00
Device peak FLOPS: 3.24 GFLOPS
Application version: SETI@home v7 v7.01 i686-pc-linux-gnu
Stderr output

<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
setiathome_v7 7.00 Revision: 1782 g++ (GCC) 4.4.1 20090725 (Red Hat 4.4.1-2)
libboinc: BOINC 7.1.0

Work Unit Info:
...............
WU true angle range is : 0.307951
Optimal function choices:
--------------------------------------------------------
name timing error
--------------------------------------------------------
v_BaseLineSmooth (no other)
v_avxGetPowerSpectrum 0.000044 0.00000
avx_ChirpData_d 0.001967 0.00000
v_avxTranspose4x16ntw 0.000694 0.00000
JS AVX_a folding 0.000325 0.00000
SETI@Home Informational message -9 result_overflow
NOTE: The number of results detected equals the storage space allocated.

Flopcounter: 1479915313166.489746

Spike count: 0
Autocorr count: 30
Pulse count: 0
Triplet count: 0
Gaussian count: 0
06:33:46 (17945): called boinc_finish

</stderr_txt>
]]>



If this had been a task run on a GPU, my first thought would have been a thermal or overclocking issue. This is a CPU task, so that is less likely, but it is still worth checking.
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1752554
Goodeid
Joined: 22 Aug 99
Posts: 7
Credit: 2,416,564
RAC: 0
Canada
Message 1752575 - Posted: 30 Dec 2015, 23:23:13 UTC - in response to Message 1752554.  


Checking both the Asus BIOS and lm_sensors shows nothing even close to hot, and there have been no problems with any other applications that might indicate heat-related problems.
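For anyone who wants to repeat the lm_sensors check while tasks are running, here is a minimal Python sketch that pulls temperatures out of `sensors`-style text. The sample text and the 50 °C limit are made up for illustration (they are not output from this machine), and the line layout is assumed to follow the usual coretemp format, which can differ between chips and drivers.

```python
import re

# Matches lines such as "Core 0:   +55.0°C  (high = +80.0°C, ...)".
# Only the first temperature on each line (the current reading) is captured.
LINE_RE = re.compile(
    r"^(?P<label>[^:\n]+):\s+\+?(?P<temp>-?\d+(?:\.\d+)?)\s*°C",
    re.MULTILINE,
)

def parse_temps(sensors_text):
    """Return {sensor label: temperature in degrees C} from sensors-style text."""
    return {
        m.group("label").strip(): float(m.group("temp"))
        for m in LINE_RE.finditer(sensors_text)
    }

def hot_sensors(temps, limit_c):
    """Return the sorted labels whose reading is at or above limit_c."""
    return sorted(label for label, t in temps.items() if t >= limit_c)

# Hypothetical sample, not taken from the machine discussed in this thread.
SAMPLE = """\
coretemp-isa-0000
Adapter: ISA adapter
Physical id 0:  +45.0°C  (high = +80.0°C, crit = +100.0°C)
Core 0:         +55.0°C  (high = +80.0°C, crit = +100.0°C)
Core 1:         +54.0°C  (high = +80.0°C, crit = +100.0°C)
"""

if __name__ == "__main__":
    temps = parse_temps(SAMPLE)
    print(temps)
    print("At or above 50 C:", hot_sensors(temps, 50.0))
```

On a real system the text could come from `subprocess.run(["sensors"], capture_output=True, text=True).stdout` instead of `SAMPLE`, sampled every few seconds while all cores are loaded.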
ID: 1752575
petri33
Volunteer tester
Joined: 6 Jun 02
Posts: 1668
Credit: 623,086,772
RAC: 156
Finland
Message 1752580 - Posted: 31 Dec 2015, 0:40:12 UTC - in response to Message 1752575.  


I looked at the results, and every randomly chosen one had an issue with autocorrelations. It is a memory-intensive phase of the calculations. It uses a lot of CPU too.
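The same spot check can be done over many tasks at once. The following is only a sketch: it assumes each task's stderr text has already been saved as a string, and it relies on the `result_overflow` message and the `... count:` lines looking like the ones in the stderr posted earlier in this thread. The function names are my own.

```python
import re
from collections import Counter

# Signal-type counters as they appear in the stderr text quoted in this thread.
COUNT_RE = re.compile(
    r"^(Spike|Autocorr|Pulse|Triplet|Gaussian) count:\s*(\d+)",
    re.MULTILINE,
)

def summarize_stderr(text):
    """Return (overflowed, counts) for one task's stderr text."""
    counts = {name: int(value) for name, value in COUNT_RE.findall(text)}
    overflowed = "result_overflow" in text
    return overflowed, counts

def dominant_signal(counts):
    """Name of the signal type with the highest count, or None if all are zero."""
    if not counts or max(counts.values()) == 0:
        return None
    return max(counts, key=counts.get)

def tally_overflows(stderr_texts):
    """Count, per signal type, how many overflowed tasks it dominated."""
    tally = Counter()
    for text in stderr_texts:
        overflowed, counts = summarize_stderr(text)
        if overflowed:
            tally[dominant_signal(counts)] += 1
    return tally

# Excerpt of the stderr posted above for task 4637062679.
SAMPLE = """\
SETI@Home Informational message -9 result_overflow
NOTE: The number of results detected equals the storage space allocated.

Spike count: 0
Autocorr count: 30
Pulse count: 0
Triplet count: 0
Gaussian count: 0
"""

if __name__ == "__main__":
    print(summarize_stderr(SAMPLE))
    print(tally_overflows([SAMPLE]))
```

If nearly every overflowed task is dominated by `Autocorr`, that fits the pattern described above. A mix of signal types would point somewhere else.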

If the temps are OK, I'd remove and reinstall the memory sticks.

If that does not help, I'd check that you have not accidentally set the command rate to T1 (1T) instead of T2 (2T) in the BIOS memory settings.

If that is not the case, then I'd check the other values in the BIOS. I have tried to overclock my CPU by notching up a bus-related value (I cannot remember its name) from 100 to 103 or 125. It did not work. Changing the multiplier worked, though. I have an Asus mobo (Rampage IV Black) and a push button to overclock. It does not work reliably with SETI. I have to overclock it manually.

Make sure you have set the Vdroop compensation settings to mid or high. "Current capability" and "voltage compensation" or similar: 120-140% for both.
To overcome Heisenbergs:
"You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones
ID: 1752580
Goodeid
Joined: 22 Aug 99
Posts: 7
Credit: 2,416,564
RAC: 0
Canada
Message 1752611 - Posted: 31 Dec 2015, 5:40:16 UTC - in response to Message 1752580.  

I have re-seated the memory and checked all of the BIOS settings. There is no overclocking, and "Current capability" and "voltage compensation" are both set to 140% now. I also changed the fan control from standard to turbo.

The increased fan speed lowered temperatures a little, and for now I have limited SETI to 2 cores instead of 4, which has also dropped the CPU temperature.

The 2 active cores are running at around 55 °C and the overall CPU is at 45 °C. I will continue running this way to see if the invalid results situation improves.

Time will tell ...
ID: 1752611
Goodeid
Joined: 22 Aug 99
Posts: 7
Credit: 2,416,564
RAC: 0
Canada
Message 1752688 - Posted: 31 Dec 2015, 16:06:54 UTC - in response to Message 1752611.  

After running overnight at much lower temperatures, the results being generated still contain a very high percentage of failures. For now it is a waste of effort to run on this machine, so it will be shut down as soon as the remaining work units are completed.

I don't know what else to try ...
ID: 1752688
Bill G
Joined: 1 Jun 01
Posts: 1282
Credit: 187,688,550
RAC: 182
United States
Message 1752700 - Posted: 31 Dec 2015, 17:04:39 UTC - in response to Message 1752688.  

Just a thought, but do you have a spare power supply (PS) handy? I have found this to be a problem in the past, and a new one has cleared things up.

SETI@home classic workunits 4,019
SETI@home classic CPU time 34,348 hours
ID: 1752700
Goodeid
Joined: 22 Aug 99
Posts: 7
Credit: 2,416,564
RAC: 0
Canada
Message 1752710 - Posted: 31 Dec 2015, 18:13:41 UTC - in response to Message 1752700.  

That was the first thing I tried. No such luck.
ID: 1752710
Bill G
Joined: 1 Jun 01
Posts: 1282
Credit: 187,688,550
RAC: 182
United States
Message 1752715 - Posted: 31 Dec 2015, 18:30:47 UTC - in response to Message 1752710.  

Well, it was worth a guess. I think something is causing noise in the computer and giving you the results you are getting. Something is going bad.
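One rough way to test the "something going bad" idea outside SETI@home is to run the same deterministic calculation several times and compare the answers. The Python sketch below does that with a naive autocorrelation over fixed pseudo-random data. It is not the SETI@home autocorrelation code, the sizes and names are arbitrary choices of mine, and the load is light, so a clean result proves very little. A mismatch, however, would strongly suggest a hardware problem.

```python
import hashlib

def autocorr_digest(n=2048, lags=64, seed=12345):
    """Run a fixed floating-point workload and return a SHA-256 digest of its output."""
    # Deterministic pseudo-random input from a simple linear congruential generator.
    state = seed
    data = []
    for _ in range(n):
        state = (1103515245 * state + 12345) % 2**31
        data.append(state / 2**31 - 0.5)

    # Naive autocorrelation for the first `lags` lags; every sum feeds the digest.
    digest = hashlib.sha256()
    for lag in range(lags):
        acc = 0.0
        for i in range(n - lag):
            acc += data[i] * data[i + lag]
        digest.update(repr(acc).encode("ascii"))
    return digest.hexdigest()

def repeatability_check(runs=5):
    """Return True if every run of the workload produced an identical digest."""
    digests = {autocorr_digest() for _ in range(runs)}
    return len(digests) == 1

if __name__ == "__main__":
    if repeatability_check():
        print("All runs matched.")
    else:
        print("Runs disagreed: the same input gave different output.")
```

Running one copy per core for a long stretch makes a mismatch more likely to show up. A dedicated memory tester such as memtest86+ is a far more thorough check of the RAM itself.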

SETI@home classic workunits 4,019
SETI@home classic CPU time 34,348 hours
ID: 1752715