Inconclusive Work Units Running AP Ver 6

Author	Message
Josef W. Segur Volunteer developer Volunteer tester Send message Joined: 30 Oct 99 Posts: 4504 Credit: 1,414,761 RAC: 0	Message 1235464 - Posted: 23 May 2012, 15:41:49 UTC - in response to Message 1235457. Just got one, my computer reported minutes ago. http://setiathome.berkeley.edu/workunit.php?wuid=975800759 So yours is the one with Jason's r557, yes against stock? Need to keep an eye on the final outcome of that one. Will do. Don't think I am going anywhere in the next two weeks, after that not sure yet, waiting for confirmation of scheduled meeting down south, but don't think other party is ready yet. That task must have validated, as it no longer on the task list. It was there just before maintenance, but not there now. But I have zero invalid or errors. I did download and run an offline check on that, just one of the 30 repetitive pulses was questionable, so a weakly similar credit grant was certain. The Lunatics FFA code is vectorized while stock isn't, so there will always be some very small differences leading to occasional disagreement. Joe ID: 1235464 ·

JohnDK Volunteer tester Send message Joined: 28 May 00 Posts: 1222 Credit: 451,243,443 RAC: 1,127	Message 1235476 - Posted: 23 May 2012, 16:05:43 UTC I have one, Linux stock vs r1305 http://setiathome.berkeley.edu/workunit.php?wuid=986876189 ID: 1235476 ·

Urs Echternacht Volunteer tester Send message Joined: 15 May 99 Posts: 692 Credit: 135,197,781 RAC: 211	Message 1235757 - Posted: 24 May 2012, 9:22:02 UTC - in response to Message 1235476. I have one, Linux stock vs r1305 http://setiathome.berkeley.edu/workunit.php?wuid=986876189 Your link points at a wu done by two stock OSX and one NV opencl wingcrew. Is the link correct ? _\\|/_ U r s ID: 1235757 ·

JohnDK Volunteer tester Send message Joined: 28 May 00 Posts: 1222 Credit: 451,243,443 RAC: 1,127	Message 1235772 - Posted: 24 May 2012, 11:22:15 UTC - in response to Message 1235757. I have one, Linux stock vs r1305 http://setiathome.berkeley.edu/workunit.php?wuid=986876189 Your link points at a wu done by two stock OSX and one NV opencl wingcrew. Is the link correct ? Yes, thought it was Linux but it may be OSX. ID: 1235772 ·

Stick Volunteer tester Send message Joined: 26 Feb 00 Posts: 100 Credit: 5,283,449 RAC: 5	Message 1242123 - Posted: 6 Jun 2012, 11:26:53 UTC Another one: wuid=999173464 ID: 1242123 ·

Josef W. Segur Volunteer developer Volunteer tester Send message Joined: 30 Oct 99 Posts: 4504 Credit: 1,414,761 RAC: 0	Message 1242234 - Posted: 6 Jun 2012, 15:02:30 UTC - in response to Message 1242123. Another one: wuid=999173464 That's fairly unusual: single pulses: 30 repetitive pulses: 1 I'll run an offline test on it, most likely there really are many single pulses but even CPU processing can sometimes glitch and manufacture false pulses. Joe ID: 1242234 ·

Stick Volunteer tester Send message Joined: 26 Feb 00 Posts: 100 Credit: 5,283,449 RAC: 5	Message 1243116 - Posted: 7 Jun 2012, 23:40:08 UTC - in response to Message 1242234. Another one: wuid=999173464 That's fairly unusual: single pulses: 30 repetitive pulses: 1 I'll run an offline test on it, most likely there really are many single pulses but even CPU processing can sometimes glitch and manufacture false pulses. Joe The 3rd task has reported and, surprisingly, all three validated/received credit. ID: 1243116 ·

Josef W. Segur Volunteer developer Volunteer tester Send message Joined: 30 Oct 99 Posts: 4504 Credit: 1,414,761 RAC: 0	Message 1243131 - Posted: 8 Jun 2012, 0:49:13 UTC - in response to Message 1243116. Another one: wuid=999173464 That's fairly unusual: single pulses: 30 repetitive pulses: 1 I'll run an offline test on it, most likely there really are many single pulses but even CPU processing can sometimes glitch and manufacture false pulses. Joe The 3rd task has reported and, surprisingly, all three validated/received credit. Not really surprising, the more signals there are the more likely for a very small difference in calculations to make a difference. My offline run with ap_6.01r557_SSE2_331_AVX.exe finished and found the 30 single and 1 repetitive pulses, the run with stock 6.01 is at about 57% progress and has found 14 single and 1 repetitive all of which match very closely. It should finish about when I get up tomorrow, then I'll check in detail. Joe ID: 1243131 ·

Sirius B Volunteer tester Send message Joined: 26 Dec 00 Posts: 24879 Credit: 3,081,182 RAC: 7	Message 1243331 - Posted: 8 Jun 2012, 15:41:44 UTC Last modified: 8 Jun 2012, 15:42:28 UTC scrub this..looked at wrong list.. apologies. ID: 1243331 ·

Josef W. Segur Volunteer developer Volunteer tester Send message Joined: 30 Oct 99 Posts: 4504 Credit: 1,414,761 RAC: 0	Message 1243384 - Posted: 8 Jun 2012, 17:26:55 UTC - in response to Message 1243131. Another one: wuid=999173464 That's fairly unusual: single pulses: 30 repetitive pulses: 1 I'll run an offline test on it, most likely there really are many single pulses but even CPU processing can sometimes glitch and manufacture false pulses. Joe The 3rd task has reported and, surprisingly, all three validated/received credit. Not really surprising, the more signals there are the more likely for a very small difference in calculations to make a difference. My offline run with ap_6.01r557_SSE2_331_AVX.exe finished and found the 30 single and 1 repetitive pulses, the run with stock 6.01 is at about 57% progress and has found 14 single and 1 repetitive all of which match very closely. It should finish about when I get up tomorrow, then I'll check in detail. Joe As it turned out, the stock app's 26th single pulse was only 0.0036% above threshold, and r557 must have calculated it as just below threshold so didn't report it. That wouldn't directly affect validation because only single pulses which are at least 1% above threshold are considered, but indirectly it meant that the 30th single pulse from stock was only the 29th for r557. So r557 actually analyzed that pulse better, stock quit when it first got above threshold by 0.27% but r557 found it was stronger at a higher dispersion and reported it as about 3% above threshold. The Validator calls that a mismatch since it only considers the r557 version. There was another mismatch because r557's 30th single pulse was 3.25% above threshold so also included in the Validator's checking, so two of the pulses checked from the r557 result weren't found in the stock result. Still, in terms of how the Validator works, 27 of 29 checked signals did match so the r557 result was granted credit as being "weakly similar". Joe ID: 1243384 ·

Stick Volunteer tester Send message Joined: 26 Feb 00 Posts: 100 Credit: 5,283,449 RAC: 5	Message 1243460 - Posted: 8 Jun 2012, 19:22:01 UTC - in response to Message 1243384. Joe, Thank you for the explanation! Stick As it turned out, the stock app's 26th single pulse was only 0.0036% above threshold, and r557 must have calculated it as just below threshold so didn't report it. That wouldn't directly affect validation because only single pulses which are at least 1% above threshold are considered, but indirectly it meant that the 30th single pulse from stock was only the 29th for r557. So r557 actually analyzed that pulse better, stock quit when it first got above threshold by 0.27% but r557 found it was stronger at a higher dispersion and reported it as about 3% above threshold. The Validator calls that a mismatch since it only considers the r557 version. There was another mismatch because r557's 30th single pulse was 3.25% above threshold so also included in the Validator's checking, so two of the pulses checked from the r557 result weren't found in the stock result. Still, in terms of how the Validator works, 27 of 29 checked signals did match so the r557 result was granted credit as being "weakly similar". Joe ID: 1243460 ·

Fred E. Volunteer tester Send message Joined: 22 Jul 99 Posts: 768 Credit: 24,140,697 RAC: 0	Message 1243463 - Posted: 8 Jun 2012, 19:35:10 UTC Not sure if you're still looking for these, but I have one: workunit 986977542 ID: 1243463 ·

Josef W. Segur Volunteer developer Volunteer tester Send message Joined: 30 Oct 99 Posts: 4504 Credit: 1,414,761 RAC: 0	Message 1243622 - Posted: 9 Jun 2012, 2:40:06 UTC - in response to Message 1243463. Not sure if you're still looking for these, but I have one: workunit 986977542 If it were practical, we'd analyze all cases of inconclusive results from Lunatics apps. There's too little time and/or equipment for that, but I'll check this one too on general principles. Should be done sometime Sunday. The WU may well be resoved before then and I expect as usual all completed tasks will get credit, but there's always the possibility analysis will point to some lurking bug. Joe ID: 1243622 ·

perryjay Volunteer tester Send message Joined: 20 Aug 02 Posts: 3377 Credit: 20,676,751 RAC: 0	Message 1244135 - Posted: 9 Jun 2012, 22:11:12 UTC Here's one from me too... http://setiathome.berkeley.edu/workunit.php?wuid=997551999 PROUD MEMBER OF Team Starfire World BOINC ID: 1244135 ·

Josef W. Segur Volunteer developer Volunteer tester Send message Joined: 30 Oct 99 Posts: 4504 Credit: 1,414,761 RAC: 0	Message 1244861 - Posted: 12 Jun 2012, 2:56:37 UTC - in response to Message 1243622. Not sure if you're still looking for these, but I have one: workunit 986977542 If it were practical, we'd analyze all cases of inconclusive results from Lunatics apps. There's too little time and/or equipment for that, but I'll check this one too on general principles. Should be done sometime Sunday. The WU may well be resoved before then and I expect as usual all completed tasks will get credit, but there's always the possibility analysis will point to some lurking bug. Joe That one turned out to be simple to analyze, the stock app found one more repetitive pulse than optimized. The peak power was 5170.0068359375 and the threshold 5170 (FFA thresholds are truncated to integer values). So it's just one of the inevitable cases where a tiny difference in calculation is enough to cause an inconclusive validation. Joe ID: 1244861 ·

fataldog187 Send message Joined: 4 Nov 02 Posts: 42 Credit: 1,271,261 RAC: 0	Message 1244993 - Posted: 12 Jun 2012, 10:16:17 UTC I couldn't read the whole thread but... I have 3 APv6 with 30 repetitive pulses that are inconclusive http://setiathome.berkeley.edu/workunit.php?wuid=997100145 http://setiathome.berkeley.edu/workunit.php?wuid=1008642492 http://setiathome.berkeley.edu/workunit.php?wuid=1007059107 And one was marked invalid after being sent to a 3rd wingman http://setiathome.berkeley.edu/workunit.php?wuid=1005838187 There are also two others with 30 repetitive pulses that are awaiting validation, and these are all out of only 8 that are listed with AstroPulse v6 Anonymous platform, something seems a little glitchy ID: 1244993 ·

Josef W. Segur Volunteer developer Volunteer tester Send message Joined: 30 Oct 99 Posts: 4504 Credit: 1,414,761 RAC: 0	Message 1245088 - Posted: 12 Jun 2012, 15:39:24 UTC - in response to Message 1244993. I couldn't read the whole thread but... I have 3 APv6 with 30 repetitive pulses that are inconclusive http://setiathome.berkeley.edu/workunit.php?wuid=997100145 http://setiathome.berkeley.edu/workunit.php?wuid=1008642492 http://setiathome.berkeley.edu/workunit.php?wuid=1007059107 And one was marked invalid after being sent to a 3rd wingman http://setiathome.berkeley.edu/workunit.php?wuid=1005838187 There are also two others with 30 repetitive pulses that are awaiting validation, and these are all out of only 8 that are listed with AstroPulse v6 Anonymous platform, something seems a little glitchy Although the r555 ATi OpenCL application has been reliable on most systems, there are exceptions. You might be able to stabilize processing by tuning voltages, clock rates, or command line parameters. Joe ID: 1245088 ·

Fred E. Volunteer tester Send message Joined: 22 Jul 99 Posts: 768 Credit: 24,140,697 RAC: 0	Message 1245128 - Posted: 12 Jun 2012, 22:24:18 UTC That one turned out to be simple to analyze, the stock app found one more repetitive pulse than optimized. The peak power was 5170.0068359375 and the threshold 5170 (FFA thresholds are truncated to integer values). So it's just one of the inevitable cases where a tiny difference in calculation is enough to cause an inconclusive validation. Joe Thanks for the analysis - thought it might be something like that since I ususally don't lose when results go inconclusive. I probably won't have any more for a while. I've stopped fetching AP6 until I can figure out what's causing errors with zero runtime - have hit 4 in the past few days and I'm in troubleshoot mode. May open a thread if I don't find a problem Another Fred Support SETI@home when you search the Web with GoodSearch or shop online with GoodShop. ID: 1245128 ·

Wedge009 Volunteer tester Send message Joined: 3 Apr 99 Posts: 451 Credit: 431,396,357 RAC: 553	Message 1245159 - Posted: 12 Jun 2012, 22:53:57 UTC Over the past couple of months, I've also been trying to make sense out of the increasing occurrences of invalid results due to returning 30 repetitive pulses. In fact, aside from my main host (running AP/OpenCL on a HD 6950), I think all of my GPU-related AP work-units are returning 30 repetitive pulse results, which I find quite worrying. Especially since I generally don't over-clock and am pretty sure the hosts are otherwise stable. It seems very unlikely to me that so many hosts would start failing simultaneously. I tried doing a full shut-down, rest, and cold boot on some of the affected hosts, as suggested elsewhere for flushing the GPU RAM - there doesn't seem to be new AP work being generated at the moment, so will have to wait to see if that makes any difference. Host 1504137 Main machine, currently only host consistently returning valid AP results. Radeon HD 6950 running OpenCL r555, ironically the only AP-related GPU being over-clocked (albeit very modestly). Host 6077487 A machine I made for my brother, was previously running Brook r555 on a Radeon HD 4850 at factory clock rates. Seemed stable in APv5 era, but have now switched to AP SSE2 r557. Has been returning valid AP work-units since. Host 4462068 A 2008 laptop, was previously running Brook r555 on a Mobility Radeon HD 2600. Again, was fine in APv5 era, but even if it hadn't started generating lots of 30 repetitive pulse results, I would have switched to AP SSE2 r557 anyway because the heat pipe cooling the GPU no longer seems effective (CPU is cooled, but GPU overheats with anything more strenuous than processing a video and shuts everything down). Host 6649080 A fairly new host, albeit older hardware, running OpenCL r555 on HD 4670. Was returning good results initially, but now I notice it is returning mostly 30 repetitive pulses, which mostly seem to be marked as invalid. Host 6692553 Another new host running OpenCL r555 on a Fusion C-50. Before anyone questions whether it's worth the bother on such a low-power machine, I think 20 hours is a big improvement on 100+ hours on an Atom. Anyway, only the first AP work-unit was valid, subsequent results seem to exhibit 30 repetitive pulses. Has also been unlucky with regards to CPU time - a lot of high blanking percentage work-units there. So, basically, I'm not sure what's going on, so was trying to search the forums for any clue on why I'm getting so many 30 repetitive pulse results. Soli Deo Gloria ID: 1245159 ·

Raistmer Volunteer developer Volunteer tester Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121	Message 1246384 - Posted: 15 Jun 2012, 11:48:10 UTC There was bugfix in r1305 that potentially can improve app precision. All who participate please upgrade to "stock" 6.03 binary for AP (it's rev1305 binary). All who dont' run beta - await for binary update... ID: 1246384 ·

©2024 University of California

SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.