Posts by Karsten Vinding

1) Message boards : Number crunching : Vega Frontier Edition - MB Options Tuning (Message 1895841)
Posted 6 days ago by Karsten Vinding
Post:
I don't have any solid measurements, but the WUs do seem to be crunching faster on my RX480 with these settings, even though it's a less powerful processor with fewer CUs.

I'll have to find a way to measure the differences :)
2) Message boards : Number crunching : Vega Frontier Edition - MB Options Tuning (Message 1895712)
Posted 7 days ago by Karsten Vinding
Post:
I really appreciate you taking the time to run all of these tests.

I don't know if your results are the absolute optimum settings, but I'll try running them on my RX480.

There are so many different options that it's hard to make heads or tails of optimizing things.
3) Message boards : Number crunching : User achievements thread......... (Message 1886593)
Posted 28 Aug 2017 by Karsten Vinding
Post:
Even though I have been with the project almost since the very beginning, I have only made a little less than 19 million credits.

For some years I had all my computers crunching 24/7. I don't do that anymore.
There are many reasons.

Most of my computers are not that powerful and don't have a usable GPU, so it wouldn't be worth the effort.

Electricity is relatively expensive where I live, and my electric bills were simply too big.

And I got tired of having many computers standing around making noise and heat (especially during the summer time).

So I'm down to crunching on one not particularly powerful machine, and it's normally only on when I actually use it.

I'm hoping to get to 20 million credits, and perhaps even 50 million, someday, but it's not something I will put a lot of effort into.

On the other hand I have done, and still do, some testing for the developers. Not much anymore, but I am ready to help if they need my input.
This, I hope, helps the project by letting the big crunchers run more efficiently and make fewer errors.
So I hope to contribute in my own way. Perhaps indirectly, I'm helping it produce some extra science for the energy spent.
4) Message boards : Number crunching : To checkpoint or not -- the wear and tear of SSD drives (Message 1877947)
Posted 12 Jul 2017 by Karsten Vinding
Post:
I have been running SSDs for some (many) years now.

My very first was a 60GB OCZ Agility, bought at the time when the Intel 64/128/256GB drives were the best performing. I remember being stunned by the speedup it gave my system.
Nowadays I feel stunned when I work on a computer with a normal HDD, but not in a good way...

Since then I upgraded to a Samsung 840 EVO, and all my other PCs are running with various SSDs.

At first, when I only had the 60GB Agility, I was very cautious about writing to it. It more or less contained only the OS. Swap was moved to an HDD.
Later I moved swap onto it. It didn't show any problems.

Later on it became a secondary drive as the EVO moved in. It then held the BOINC data and some of the Steam games. Wear did go up, but not at an alarming rate.

It lived this way until about a year ago, when remaining lifetime showed 15%. It also seemed that it had started using its spare blocks, as the reallocation count was going up, so it was getting worn out.

I decided it should live out its last write cycles in my PS3. This killed it in a matter of 8 months, with the PS3 mainly used for streaming movies via Netflix / Plex.
As the PS3 does not have much memory, it caches streams to disk, so the drive probably saw a lot of writes during this last period. One day, without warning, the PS3 wouldn't boot; the Agility was dead.

All in all the Agility lasted me more than 7 (close to 8) years. I have had many hard drives that didn't last as long. And this is for a small 60GB drive, with less capacity to do its wear leveling. A 120 or 240GB drive would have lasted much longer under the same conditions.

I for one am not worried about wear. None of my current drives are below 95% wear leveling, despite being used without any special settings besides the ones Windows / CentOS / Ubuntu set themselves.

SSDs can fail prematurely, as can any piece of electronics, but I consider the technology mature and reliable. We will soon pass the 10-year mark for normal consumer availability of these drives.
My latest hardware failure was a good old-fashioned HDD, only 4 months old.
5) Message boards : Number crunching : Anything relating to AstroPulse tasks (Message 1876932)
Posted 5 Jul 2017 by Karsten Vinding
Post:
Luckily I haven't had all that many AP WUs, so I haven't produced all that many invalids.

I'm also looking forward to what Raistmer finds out. And I'm of course willing to do additional tests.

But I suspect this is hard/difficult work, so we will have to give him time to find out exactly what is wrong.

AMD is about to come out with Vega, and it would be bad (and sad) if these also produce invalids.
6) Message boards : Number crunching : Anything relating to AstroPulse tasks (Message 1875259)
Posted 26 Jun 2017 by Karsten Vinding
Post:
@Raistmer.

I will shortly send you a message with the links to the results files I have made.

Please tell me if you have any problems retrieving them.
7) Message boards : Number crunching : Anything relating to AstroPulse tasks (Message 1875251)
Posted 26 Jun 2017 by Karsten Vinding
Post:
Tried with both :)

Neither worked.

But I have it working now, thanks for your input :)
8) Message boards : Number crunching : Anything relating to AstroPulse tasks (Message 1875245)
Posted 26 Jun 2017 by Karsten Vinding
Post:
Sadly I don't seem to be able to get the "--device 1" command line switch to work. The work is still being done on device 0.

Edit: Figured it out. I set the command line in the AP211.cmd file instead, and now it works. I have saved the original .cmd file for later.
9) Message boards : Number crunching : Anything relating to AstroPulse tasks (Message 1875237)
Posted 26 Jun 2017 by Karsten Vinding
Post:
I'm already running the full test on my HD7770, after running it on the RX480 (more WUs than the first test).

The result files will be put on Dropbox, and I will provide a link.

Sadly, since I started the test on the HD7770, the content of the testdata dir is gone. I'll have to run it again on the RX480.

When this is done, the results from testdata for both the HD7770 and the RX480 will also be put on Dropbox.

I'll PM you the links.

When this is done, we will do single_pulses.wu

Will get back later when data is collected.
10) Message boards : Number crunching : Anything relating to AstroPulse tasks (Message 1875149)
Posted 26 Jun 2017 by Karsten Vinding
Post:
Continued:

Rep. pulse: num_std_devs=3.504 peak_power=589500.9 dm=-896 peak_bin=0 scale=4 ffa_scale=2 period=7.148409
Rep. pulse: num_std_devs=3.98 peak_power=594133.1 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.555221
Rep. pulse: num_std_devs=4.824 peak_power=298068.3 dm=-896 peak_bin=0 scale=4 ffa_scale=0 period=3.531917
Rep. pulse: num_std_devs=4.619 peak_power=297934.8 dm=-896 peak_bin=0 scale=4 ffa_scale=1 period=7.108598
Rep. pulse: num_std_devs=4.103 peak_power=594266.8 dm=-896 peak_bin=64 scale=4 ffa_scale=2 period=7.063834
Rep. pulse: num_std_devs=3.782 peak_power=606274.5 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.47992
Rep. pulse: num_std_devs=4.685 peak_power=300039.9 dm=-896 peak_bin=0 scale=4 ffa_scale=2 period=14.05692
Rep. pulse: num_std_devs=4.807 peak_power=306307.9 dm=-896 peak_bin=512 scale=4 ffa_scale=4 period=55.31376
Rep. pulse: num_std_devs=3.716 peak_power=614438.6 dm=-896 peak_bin=0 scale=4 ffa_scale=1 period=3.422208
Rep. pulse: num_std_devs=4.486 peak_power=310217 dm=-896 peak_bin=0 scale=4 ffa_scale=2 period=13.61239
Rep. pulse: num_std_devs=3.6 peak_power=622543.9 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.387451
Rep. pulse: num_std_devs=3.988 peak_power=631220 dm=-896 peak_bin=0 scale=4 ffa_scale=1 period=3.329341
Rep. pulse: num_std_devs=4.621 peak_power=314432.1 dm=-896 peak_bin=0 scale=4 ffa_scale=1 period=6.703948
Rep. pulse: num_std_devs=4.662 peak_power=316521.8 dm=-896 peak_bin=128 scale=4 ffa_scale=2 period=13.31818
Rep. pulse: num_std_devs=4.379 peak_power=318390.4 dm=-896 peak_bin=0 scale=4 ffa_scale=3 period=26.50823
Rep. pulse: num_std_devs=3.582 peak_power=647227.9 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.261013
Rep. pulse: num_std_devs=4.468 peak_power=322573.9 dm=-896 peak_bin=64 scale=4 ffa_scale=1 period=6.534977
Rep. pulse: num_std_devs=4.474 peak_power=320516.8 dm=-896 peak_bin=32 scale=4 ffa_scale=0 period=3.292261
Rep. pulse: num_std_devs=4.404 peak_power=322529.8 dm=-896 peak_bin=64 scale=4 ffa_scale=2 period=13.1167
Rep. pulse: num_std_devs=3.706 peak_power=655608.9 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.205953
Rep. pulse: num_std_devs=4.429 peak_power=328731.6 dm=-896 peak_bin=0 scale=4 ffa_scale=0 period=3.205219
Rep. pulse: num_std_devs=5.492 peak_power=166316 dm=-896 peak_bin=0 scale=4 ffa_scale=0 period=6.37182

class T_remove_radar: total=1.88e+009, N=1, <>=1.88e+009, min=1.88e+009, max=1.88e+009
class T_main_loop_L1: total=7.96e+010, N=2, <>=3.98e+010, min=3.54e+010, max=4.42e+010
class T_FFT_forward: total=7.86e+007, N=1824, <>=4.31e+004, min=3.57e+004, max=8.48e+005
class T_remove_radar_randomize: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_build_chirp_table: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_DataWrite: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_DataWrite_ns: total=0, N=0, <>=0, min=0 max=0
class T_oclReadBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ChirpWrite: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ChirpWrite_ns: total=0, N=0, <>=0, min=0 max=0
class T_dechirp: total=9.79e+007, N=1824, <>=5.37e+004, min=4.74e+004, max=3.74e+005
class Dechirp_ns: total=0, N=0, <>=0, min=0 max=0
class Half_ns: total=0, N=0, <>=0, min=0 max=0
class T_PC_single_pulse_kernel_FFA_update: total=7.06e+010, N=1824, <>=3.87e+007, min=3.73e+007, max=1.80e+008
class PC_ns: total=0, N=0, <>=0, min=0 max=0
class T_oclReadBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_oclWriteBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFT_inverse: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ffa: total=8.75e+009, N=1, <>=8.75e+009, min=8.75e+009, max=8.75e+009

FFA blocks counters:
class T_FFA_fetch: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_tt_build: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_compare: total=3.75e+008, N=337, <>=1.11e+006, min=7.96e+005, max=1.48e+007
class T_FFA_coadd: total=5.02e+006, N=681, <>=7.37e+003, min=4.79e+003, max=4.58e+005
class T_FFA_stride_add: total=5.74e+006, N=294, <>=1.95e+004, min=1.70e+004, max=1.31e+005
class T_GPU_buffer_read_backs: total=0, N=0, <>=0, min=0 max=0
TWIN_FFA OCL_ZERO_COPY USE_OPENCL OPENCL_WRITE USE_INCREASED_PRECISION SMALL_CHIRP_TABLE COMBINED_DECHIRP_KERNEL BLANKIT
rev 2742
GPU device sync requested... ...GPU device synched
08:41:24 (10720): called boinc_finish(0)
[ /stderr ]

------------
AP7_win_x64_AVX_CPU_r2692.exe -verbose / Raistmer_tinyrr.wu :
AppName: AP7_win_x64_AVX_CPU_r2692.exe
AppArgs: -verbose
TaskName: Raistmer_tinyrr.wu
Started at : 08:41:29.615
Ended at : 08:42:58.710
Result : stored as ref for validations.
89.044 secs Elapsed
87.000 secs CPU time

[ stderr ]
08:41:29 (6964): Can't open init data file - running in standalone mode
Not using ap_cmdline.txt-file, using commandline options.
08:41:29 (6964): Can't open init data file - running in standalone mode

Build features: Non-graphics BLANKIT TWINDECHIRP USE_LRINT FFTW USE_INCREASED_PRECISION USE_AVX x64
CPUID: AMD FX(tm)-8150 Eight-Core Processor

Cache: L1=64K L2=2048K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX SSE4A XOP FMA4
AstroPulse v7 Windows x64 rev 2692, V7 match, by Raistmer with support of Lunatics.kwsn.net team.
by Lunatics team. Built with uncommitted modifications
state.fold_buf_size_short=65536; state.fold_buf_size_long=524288

single pulses: 0
repetitive pulses: 30
percent blanked: 0.00
Rep. pulse: num_std_devs=3.572 peak_power=5.649e+005 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.718569
Rep. pulse: num_std_devs=3.715 peak_power=5.65e+005 dm=-896 peak_bin=64 scale=4 ffa_scale=2 period=7.463398
Rep. pulse: num_std_devs=4.008 peak_power=5.736e+005 dm=-896 peak_bin=64 scale=4 ffa_scale=2 period=7.33895
Rep. pulse: num_std_devs=3.949 peak_power=5.776e+005 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.645032
Rep. pulse: num_std_devs=4.252 peak_power=5.821e+005 dm=-896 peak_bin=64 scale=4 ffa_scale=2 period=7.258542
Rep. pulse: num_std_devs=4.927 peak_power=2.919e+005 dm=-896 peak_bin=16 scale=4 ffa_scale=0 period=3.631154
Rep. pulse: num_std_devs=4.7 peak_power=2.939e+005 dm=-896 peak_bin=128 scale=4 ffa_scale=2 period=14.40675
Rep. pulse: num_std_devs=3.828 peak_power=5.857e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=1 period=3.59285
Rep. pulse: num_std_devs=3.504 peak_power=5.895e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=2 period=7.148409
Rep. pulse: num_std_devs=3.98 peak_power=5.941e+005 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.555221
Rep. pulse: num_std_devs=4.824 peak_power=2.981e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=0 period=3.531917
Rep. pulse: num_std_devs=4.62 peak_power=2.979e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=1 period=7.108598
Rep. pulse: num_std_devs=4.103 peak_power=5.943e+005 dm=-896 peak_bin=64 scale=4 ffa_scale=2 period=7.063834
Rep. pulse: num_std_devs=3.782 peak_power=6.063e+005 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.47992
Rep. pulse: num_std_devs=4.685 peak_power=3e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=2 period=14.05692
Rep. pulse: num_std_devs=4.807 peak_power=3.063e+005 dm=-896 peak_bin=512 scale=4 ffa_scale=4 period=55.31376
Rep. pulse: num_std_devs=3.717 peak_power=6.144e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=1 period=3.422208
Rep. pulse: num_std_devs=4.486 peak_power=3.102e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=2 period=13.61239
Rep. pulse: num_std_devs=3.6 peak_power=6.225e+005 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.387451
Rep. pulse: num_std_devs=3.988 peak_power=6.312e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=1 period=3.329341
Rep. pulse: num_std_devs=4.621 peak_power=3.144e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=1 period=6.703948
Rep. pulse: num_std_devs=4.662 peak_power=3.165e+005 dm=-896 peak_bin=128 scale=4 ffa_scale=2 period=13.31818
Rep. pulse: num_std_devs=4.379 peak_power=3.184e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=3 period=26.50823
Rep. pulse: num_std_devs=3.582 peak_power=6.472e+005 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.261013
Rep. pulse: num_std_devs=4.468 peak_power=3.226e+005 dm=-896 peak_bin=64 scale=4 ffa_scale=1 period=6.534977
Rep. pulse: num_std_devs=4.474 peak_power=3.205e+005 dm=-896 peak_bin=32 scale=4 ffa_scale=0 period=3.292261
Rep. pulse: num_std_devs=4.404 peak_power=3.225e+005 dm=-896 peak_bin=64 scale=4 ffa_scale=2 period=13.1167
Rep. pulse: num_std_devs=3.707 peak_power=6.556e+005 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.205953
Rep. pulse: num_std_devs=4.43 peak_power=3.287e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=0 period=3.205219
Rep. pulse: num_std_devs=5.492 peak_power=1.663e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=0 period=6.37182

class T_remove_radar: total=1.42e+009, N=1, <>=1.42e+009, min=1.42e+009, max=1.42e+009
class T_main_loop_L1: total=3.97e+011, N=1, <>=3.97e+011, min=3.97e+011, max=3.97e+011
class T_FFT_forward: total=9.49e+009, N=8188, <>=1.16e+006, min=1.10e+006, max=2.52e+006
class T_remove_radar_randomize: total=1.17e+007, N=8188, <>=1.43e+003, min=8.49e+002, max=6.16e+004
class T_build_chirp_table: total=4.84e+008, N=4, <>=1.21e+008, min=1.19e+008, max=1.25e+008
class T_dechirp: total=4.40e+010, N=262015, <>=1.68e+005, min=3.70e+001, max=2.09e+006
class T_FFT_inverse: total=2.87e+011, N=262015, <>=1.09e+006, min=1.06e+006, max=4.03e+006
class T_ffa: total=1.12e+009, N=1, <>=1.12e+009, min=1.12e+009, max=1.12e+009

FFA blocks counters:
class T_FFA_fetch: total=7.31e+008, N=10461, <>=6.99e+004, min=5.59e+004, max=4.38e+005
class T_FFA_tt_build: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_compare: total=2.66e+008, N=391352, <>=6.80e+002, min=4.20e+001, max=4.39e+005
class T_FFA_coadd: total=5.61e+007, N=391351, <>=1.43e+002, min=4.10e+001, max=5.17e+004
class T_FFA_stride_add: total=7.48e+006, N=83686, <>=8.90e+001, min=3.70e+001, max=2.97e+004
USE_INCREASED_PRECISION SMALL_CHIRP_TABLE BLANKIT TWINDECHIRP USE_LRINT
rev 2692
08:42:56 (6964): called boinc_finish(0)
[ /stderr ]
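
The counter lines above follow a regular `class NAME: total=..., N=..., <>=...` layout. Judging from the numbers, `<>` looks like the mean (`total / N`), and the `min=1.84e+019` on counters with `N=0` looks like an untouched unsigned 64-bit maximum. Both are inferences from this log, not documented behaviour. A minimal Python sketch that parses one counter line and checks the mean against the printed total; the regex and function names are hypothetical and not part of the AstroPulse tools:

```python
import re

# Matches the leading part of a counter line; the min/max fields are ignored
# because their separators are not uniform across lines in the log.
COUNTER = re.compile(
    r"class (?P<name>\w+): total=(?P<total>\S+?), N=(?P<n>\d+), <>=(?P<mean>\S+?),"
)


def parse_counter(line):
    """Return (name, total, n, mean) for one 'class T_...:' line."""
    m = COUNTER.match(line)
    if m is None:
        raise ValueError("not a counter line: " + line)
    return m["name"], float(m["total"]), int(m["n"]), float(m["mean"])


def mean_is_consistent(total, n, mean, rel_tol=0.01):
    """Check mean ~= total / n; the log prints only 3 significant digits."""
    if n == 0:
        return mean == 0
    return abs(total / n - mean) <= rel_tol * abs(mean)


line = ("class T_FFT_forward: total=9.49e+009, N=8188, <>=1.16e+006, "
        "min=1.10e+006, max=2.52e+006")
name, total, n, mean = parse_counter(line)
print(name, n, mean_is_consistent(total, n, mean))
```

The 1% tolerance is an assumption chosen to absorb the rounding of the printed values; the units of the counters are not stated in the log.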
------------
AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe -verbose / Raistmer_tinyrr.wu :
AppName: AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe
AppArgs: -verbose
TaskName: Raistmer_tinyrr.wu
Started at : 08:43:01.848
Ended at : 08:43:10.849
8.837 secs Elapsed
2.781 secs CPU time
Speedup : 96.80%
Ratio : 31.28x
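
The Speedup and Ratio figures above appear to be computed from the two "CPU time" values rather than from elapsed time. That is an inference from the numbers in this log, not something the test script states. A minimal Python sketch of that arithmetic, with function names of my own choosing:

```python
def cpu_ratio(ref_cpu_secs, test_cpu_secs):
    """How many times less CPU time the tested app used than the reference."""
    return ref_cpu_secs / test_cpu_secs


def cpu_speedup_percent(ref_cpu_secs, test_cpu_secs):
    """CPU time saved relative to the reference, as a percentage."""
    return (1.0 - test_cpu_secs / ref_cpu_secs) * 100.0


# CPU times taken from the Raistmer_tinyrr.wu runs in this log.
ref_cpu = 87.000   # AP7_win_x64_AVX_CPU_r2692.exe
gpu_cpu = 2.781    # AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe

print(f"Ratio   : {cpu_ratio(ref_cpu, gpu_cpu):.2f}x")
print(f"Speedup : {cpu_speedup_percent(ref_cpu, gpu_cpu):.2f}%")
```

With these inputs the sketch reproduces the 31.28x and 96.80% shown above. By wall-clock time the picture is different: 89.044 s against 8.837 s elapsed is roughly a 10x difference, since the OpenCL build spends most of its time waiting on the GPU rather than using the CPU.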

ref-AP7_win_x64_AVX_CPU_r2692.exe-Raistmer_tinyrr.wu.res: <ap_signal>40,<pulses>30,<best_pulses>10
result-AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe-Raistmer_tinyrr.wu.res: <ap_signal>40,<pulses>30,<best_pulses>10
All Signals: Weakly similar or Different.
Pulses: Checked 30, 30 , Strongly Similar
Best Pulses: Weakly similar or Different.

-(.\testDatas\ref\ref-AP7_win_x64_AVX_CPU_r2692.exe-Raistmer_tinyrr.wu.res)-
Reportable Single Pulses: 0 [OK], 0 above threshold*THRESHOLD_FUDGE
Reportable Repeating Pulses: 30 [OK]
Single Pulses (Best): 10 [Weak], 0 above threshold*THRESHOLD_FUDGE

-(.\testDatas\result-AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe-Raistmer_tinyrr.wu.res)-
Reportable Single Pulses: 0 [OK], 0 above threshold*THRESHOLD_FUDGE
Reportable Repeating Pulses: 30 [OK]
Single Pulses (Best): 0 [Weak], 0 above threshold*THRESHOLD_FUDGE


[ stderr ]
08:43:01 (4704): Can't open init data file - running in standalone mode
Not using ap_cmdline.txt-file, using commandline options.
08:43:01 (4704): Can't open init data file - running in standalone mode
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
08:43:01 (4704): Can't open init data file - running in standalone mode
WARNING: init_data.xml missing
OpenCL platform detected: Advanced Micro Devices, Inc.
WARNING: BOINC supplied wrong platform!
BOINC assigns device 0
WARNING: BOINC failed to provide OpenCL device, using own enumeration abilities
Used GPU device parameters are:
Number of compute units: 36
Single buffer allocation size: 256MB
Total device global memory: 3072MB
max WG size: 256
local mem type: Real
-unroll default value used: 18
-ffa_block default value used: 9216
-ffa_block_fetch default value used: 4608

Build features: Non-graphics BLANKIT OpenCL TWIN_FFA OCL_ZERO_COPY COMBINED_DECHIRP_KERNEL FFTW USE_INCREASED_PRECISION USE_SSE2 x86
CPUID: AMD FX(tm)-8150 Eight-Core Processor

Cache: L1=64K L2=2048K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX SSE4A XOP FMA4
AstroPulse v7 Windows x86 rev 2742, V7 match, by Raistmer with support of Lunatics.kwsn.net team. SSE2

OpenCL version by Raistmer

oclFFT fix for ATI GPUs by Urs Echternacht
ffa threshold mods by Joe Segur
SSE3 dechirping by JDWhale
Combined dechirp kernel by Frizz
Number of OpenCL platforms: 1


OpenCL Platform Name: AMD Accelerated Parallel Processing
Number of devices: 1
Max compute units: 36
Max work group size: 256
Max clock frequency: 1288Mhz
Max memory allocation: 3221225472
Cache type: Read/Write
Cache line size: 64
Cache size: 16384
Global memory size: 3221225472
Constant buffer size: 3221225472
Max number of constant args: 8
Local memory type: Scratchpad
Local memory size: 32768
Queue properties:
Out-of-Order: No
Name: Ellesmere
Vendor: Advanced Micro Devices, Inc.
Driver version: 2348.4
Version: OpenCL 1.2 AMD-APP (2348.4)
Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_khr_image2d_from_buffer cl_khr_spir cl_khr_gl_event cl_amd_liquid_flash


state.fold_buf_size_short=65536; state.fold_buf_size_long=524288
INFO: excessive number of rep. pulses detected. To save system memory FFA will be redone with decreased FFA block value 256.

single pulses: 0
repetitive pulses: 30
percent blanked: 0.00
Rep. pulse: num_std_devs=3.572 peak_power=564871.4 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.718569
Rep. pulse: num_std_devs=3.714 peak_power=565019.6 dm=-896 peak_bin=64 scale=4 ffa_scale=2 period=7.463398
Rep. pulse: num_std_devs=4.008 peak_power=573564.9 dm=-896 peak_bin=64 scale=4 ffa_scale=2 period=7.33895
Rep. pulse: num_std_devs=3.949 peak_power=577621.9 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.645032
Rep. pulse: num_std_devs=4.252 peak_power=582063.8 dm=-896 peak_bin=64 scale=4 ffa_scale=2 period=7.258542
Rep. pulse: num_std_devs=4.927 peak_power=291947 dm=-896 peak_bin=16 scale=4 ffa_scale=0 period=3.631154
Rep. pulse: num_std_devs=4.7 peak_power=293862.5 dm=-896 peak_bin=128 scale=4 ffa_scale=2 period=14.40675
Rep. pulse: num_std_devs=3.827 peak_power=585730.9 dm=-896 peak_bin=0 scale=4 ffa_scale=1 period=3.59285
Rep. pulse: num_std_devs=3.504 peak_power=589500.9 dm=-896 peak_bin=0 scale=4 ffa_scale=2 period=7.148409
Rep. pulse: num_std_devs=3.98 peak_power=594133.1 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.555221
Rep. pulse: num_std_devs=4.824 peak_power=298068.3 dm=-896 peak_bin=0 scale=4 ffa_scale=0 period=3.531917
Rep. pulse: num_std_devs=4.619 peak_power=297934.8 dm=-896 peak_bin=0 scale=4 ffa_scale=1 period=7.108598
Rep. pulse: num_std_devs=4.103 peak_power=594266.8 dm=-896 peak_bin=64 scale=4 ffa_scale=2 period=7.063834
Rep. pulse: num_std_devs=3.782 peak_power=606274.5 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.47992
Rep. pulse: num_std_devs=4.685 peak_power=300039.9 dm=-896 peak_bin=0 scale=4 ffa_scale=2 period=14.05692
Rep. pulse: num_std_devs=4.807 peak_power=306307.9 dm=-896 peak_bin=512 scale=4 ffa_scale=4 period=55.31376
Rep. pulse: num_std_devs=3.716 peak_power=614438.6 dm=-896 peak_bin=0 scale=4 ffa_scale=1 period=3.422208
Rep. pulse: num_std_devs=4.486 peak_power=310217 dm=-896 peak_bin=0 scale=4 ffa_scale=2 period=13.61239
Rep. pulse: num_std_devs=3.6 peak_power=622543.9 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.387451
Rep. pulse: num_std_devs=3.988 peak_power=631220 dm=-896 peak_bin=0 scale=4 ffa_scale=1 period=3.329341
Rep. pulse: num_std_devs=4.621 peak_power=314432.1 dm=-896 peak_bin=0 scale=4 ffa_scale=1 period=6.703948
Rep. pulse: num_std_devs=4.662 peak_power=316521.8 dm=-896 peak_bin=128 scale=4 ffa_scale=2 period=13.31818
Rep. pulse: num_std_devs=4.379 peak_power=318390.4 dm=-896 peak_bin=0 scale=4 ffa_scale=3 period=26.50823
Rep. pulse: num_std_devs=3.582 peak_power=647227.9 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.261013
Rep. pulse: num_std_devs=4.468 peak_power=322573.9 dm=-896 peak_bin=64 scale=4 ffa_scale=1 period=6.534977
Rep. pulse: num_std_devs=4.474 peak_power=320516.8 dm=-896 peak_bin=32 scale=4 ffa_scale=0 period=3.292261
Rep. pulse: num_std_devs=4.404 peak_power=322529.8 dm=-896 peak_bin=64 scale=4 ffa_scale=2 period=13.1167
Rep. pulse: num_std_devs=3.706 peak_power=655608.9 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.205953
Rep. pulse: num_std_devs=4.429 peak_power=328731.6 dm=-896 peak_bin=0 scale=4 ffa_scale=0 period=3.205219
Rep. pulse: num_std_devs=5.492 peak_power=166316 dm=-896 peak_bin=0 scale=4 ffa_scale=0 period=6.37182

class T_remove_radar: total=1.89e+009, N=1, <>=1.89e+009, min=1.89e+009, max=1.89e+009
class T_main_loop_L1: total=2.66e+010, N=1, <>=2.66e+010, min=2.66e+010, max=2.66e+010
class T_FFT_forward: total=2.02e+007, N=456, <>=4.43e+004, min=3.59e+004, max=7.17e+005
class T_remove_radar_randomize: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_build_chirp_table: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_DataWrite: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_DataWrite_ns: total=0, N=0, <>=0, min=0 max=0
class T_oclReadBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ChirpWrite: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ChirpWrite_ns: total=0, N=0, <>=0, min=0 max=0
class T_dechirp: total=2.49e+007, N=456, <>=5.46e+004, min=4.80e+004, max=1.95e+005
class Dechirp_ns: total=0, N=0, <>=0, min=0 max=0
class Half_ns: total=0, N=0, <>=0, min=0 max=0
class T_PC_single_pulse_kernel_FFA_update: total=1.77e+010, N=456, <>=3.89e+007, min=3.76e+007, max=1.60e+008
class PC_ns: total=0, N=0, <>=0, min=0 max=0
class T_oclReadBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_oclWriteBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFT_inverse: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ffa: total=8.77e+009, N=1, <>=8.77e+009, min=8.77e+009, max=8.77e+009

FFA blocks counters:
class T_FFA_fetch: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_tt_build: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_compare: total=3.76e+008, N=337, <>=1.12e+006, min=8.00e+005, max=1.48e+007
class T_FFA_coadd: total=4.82e+006, N=681, <>=7.08e+003, min=4.59e+003, max=6.07e+005
class T_FFA_stride_add: total=5.58e+006, N=294, <>=1.90e+004, min=1.67e+004, max=1.43e+005
class T_GPU_buffer_read_backs: total=0, N=0, <>=0, min=0 max=0
TWIN_FFA OCL_ZERO_COPY USE_OPENCL OPENCL_WRITE USE_INCREASED_PRECISION SMALL_CHIRP_TABLE COMBINED_DECHIRP_KERNEL BLANKIT
rev 2742
GPU device sync requested... ...GPU device synched
08:43:08 (4704): called boinc_finish(0)
[ /stderr ]
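
The CPU reference run prints `peak_power` in four-digit scientific notation, while the OpenCL build prints it in full, so the two `Rep. pulse` lists cannot be compared as plain text. A minimal Python sketch that parses both forms and compares them numerically; the 0.1% relative tolerance and the function names are assumptions for illustration, and this is not how the project's own validator decides "Strongly Similar":

```python
import math
import re

# key=value pairs such as num_std_devs=3.715 or dm=-896
PAIR = re.compile(r"(\w+)=(\S+)")


def parse_rep_pulse(line):
    """Turn one 'Rep. pulse: key=value ...' line into a dict of floats."""
    if not line.startswith("Rep. pulse:"):
        raise ValueError("not a Rep. pulse line: " + line)
    return {key: float(value) for key, value in PAIR.findall(line)}


def pulses_match(a, b, rel_tol=1e-3):
    """True if both pulses have the same fields and every field is close."""
    return a.keys() == b.keys() and all(
        math.isclose(a[k], b[k], rel_tol=rel_tol) for k in a
    )


# Second pulse of Raistmer_tinyrr.wu, as printed by each build in this log.
cpu_line = ("Rep. pulse: num_std_devs=3.715 peak_power=5.65e+005 dm=-896 "
            "peak_bin=64 scale=4 ffa_scale=2 period=7.463398")
gpu_line = ("Rep. pulse: num_std_devs=3.714 peak_power=565019.6 dm=-896 "
            "peak_bin=64 scale=4 ffa_scale=2 period=7.463398")

print(pulses_match(parse_rep_pulse(cpu_line), parse_rep_pulse(gpu_line)))
```

Pairing the pulses by position, as done here, only works when both runs report the same pulses in the same order, which holds for the Raistmer_tinyrr.wu output above.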

------------
AP7_win_x64_AVX_CPU_r2692.exe -verbose / sigind_v5.wu :
AppName: AP7_win_x64_AVX_CPU_r2692.exe
AppArgs: -verbose
TaskName: sigind_v5.wu
Started at : 08:43:14.221
Ended at : 08:46:54.900
Result : stored as ref for validations.
220.626 secs Elapsed
218.375 secs CPU time

[ stderr ]
08:43:14 (4596): Can't open init data file - running in standalone mode
Not using ap_cmdline.txt-file, using commandline options.
08:43:14 (4596): Can't open init data file - running in standalone mode

Build features: Non-graphics BLANKIT TWINDECHIRP USE_LRINT FFTW USE_INCREASED_PRECISION USE_AVX x64
CPUID: AMD FX(tm)-8150 Eight-Core Processor

Cache: L1=64K L2=2048K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX SSE4A XOP FMA4
AstroPulse v7 Windows x64 rev 2692, V7 match, by Raistmer with support of Lunatics.kwsn.net team.
by Lunatics team. Built with uncommitted modifications
state.fold_buf_size_short=65536; state.fold_buf_size_long=262144

single pulses: 0
repetitive pulses: 9
percent blanked: 61.21
Rep. pulse: num_std_devs=7.085 peak_power=2.436e+006 dm=-3072 peak_bin=512 scale=7 ffa_scale=2 period=55.0985
Rep. pulse: num_std_devs=9.825 peak_power=9.805e+005 dm=2944 peak_bin=11264 scale=7 ffa_scale=2 period=137.7462
Rep. pulse: num_std_devs=8.737 peak_power=4.915e+005 dm=-2944 peak_bin=14848 scale=7 ffa_scale=2 period=275.4913
Rep. pulse: num_std_devs=8.603 peak_power=9.793e+005 dm=3072 peak_bin=28672 scale=7 ffa_scale=3 period=275.4903
Rep. pulse: num_std_devs=7.544 peak_power=4.907e+005 dm=-2944 peak_bin=0 scale=7 ffa_scale=1 period=137.7467
Rep. pulse: num_std_devs=9.826 peak_power=1.955e+006 dm=-2944 peak_bin=0 scale=7 ffa_scale=3 period=137.7451
Rep. pulse: num_std_devs=7.253 peak_power=2.463e+005 dm=3072 peak_bin=8192 scale=7 ffa_scale=1 period=275.4924
Rep. pulse: num_std_devs=6.666 peak_power=2.435e+006 dm=-3072 peak_bin=512 scale=7 ffa_scale=1 period=27.54915
Rep. pulse: num_std_devs=7.222 peak_power=4.866e+006 dm=3072 peak_bin=512 scale=7 ffa_scale=2 period=27.54894

class T_remove_radar: total=1.46e+009, N=1, <>=1.46e+009, min=1.46e+009, max=1.46e+009
class T_main_loop_L1: total=1.00e+012, N=2, <>=5.00e+011, min=5.00e+011, max=5.00e+011
class T_FFT_forward: total=1.49e+010, N=12704, <>=1.17e+006, min=1.11e+006, max=2.99e+006
class T_remove_radar_randomize: total=1.80e+007, N=12704, <>=1.42e+003, min=8.69e+002, max=2.65e+004
class T_build_chirp_table: total=1.98e+009, N=16, <>=1.23e+008, min=1.22e+008, max=1.31e+008
class T_dechirp: total=6.94e+010, N=406528, <>=1.71e+005, min=3.70e+001, max=6.91e+006
class T_FFT_inverse: total=4.47e+011, N=406528, <>=1.10e+006, min=1.06e+006, max=1.51e+007
class T_ffa: total=3.81e+011, N=36, <>=1.06e+010, min=4.08e+009, max=6.35e+010

FFA blocks counters:
class T_FFA_fetch: total=3.15e+011, N=2180484, <>=1.44e+005, min=5.59e+004, max=6.65e+006
class T_FFA_tt_build: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_compare: total=3.09e+010, N=92171996, <>=3.35e+002, min=3.70e+001, max=6.42e+006
class T_FFA_coadd: total=1.73e+010, N=92171996, <>=1.87e+002, min=3.70e+001, max=1.18e+006
class T_FFA_stride_add: total=2.20e+009, N=18683764, <>=1.17e+002, min=3.70e+001, max=2.29e+005
USE_INCREASED_PRECISION SMALL_CHIRP_TABLE BLANKIT TWINDECHIRP USE_LRINT
rev 2692
08:46:52 (4596): called boinc_finish(0)
[ /stderr ]
------------
AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe -verbose / sigind_v5.wu :
AppName: AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe
AppArgs: -verbose
TaskName: sigind_v5.wu
Started at : 08:46:58.035
Ended at : 08:47:21.644
23.440 secs Elapsed
5.422 secs CPU time
Speedup : 97.52%
Ratio : 40.28x

ref-AP7_win_x64_AVX_CPU_r2692.exe-sigind_v5.wu.res: <ap_signal>19,<pulses>9,<best_pulses>10
result-AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe-sigind_v5.wu.res: <ap_signal>40,<pulses>30,<best_pulses>10
All Signals: Weakly similar or Different.
Pulses: pulse at signal 0 has no match (direction -->)
Weakly similar or Different.
Best Pulses: Weakly similar or Different.

-(.\testDatas\ref\ref-AP7_win_x64_AVX_CPU_r2692.exe-sigind_v5.wu.res)-
Reportable Single Pulses: 0 [OK], 0 above threshold*THRESHOLD_FUDGE
Reportable Repeating Pulses: 9 [Weak]
Single Pulses (Best): 10 [Weak], 0 above threshold*THRESHOLD_FUDGE

-(.\testDatas\result-AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe-sigind_v5.wu.res)-
Reportable Single Pulses: 0 [OK], 0 above threshold*THRESHOLD_FUDGE
Reportable Repeating Pulses: 30 [Weak]
Single Pulses (Best): 0 [Weak], 0 above threshold*THRESHOLD_FUDGE


[ stderr ]
08:46:58 (7504): Can't open init data file - running in standalone mode
Not using ap_cmdline.txt-file, using commandline options.
08:46:58 (7504): Can't open init data file - running in standalone mode
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
08:46:58 (7504): Can't open init data file - running in standalone mode
WARNING: init_data.xml missing
OpenCL platform detected: Advanced Micro Devices, Inc.
WARNING: BOINC supplied wrong platform!
BOINC assigns device 0
WARNING: BOINC failed to provide OpenCL device, using own enumeration abilities
Used GPU device parameters are:
Number of compute units: 36
Single buffer allocation size: 256MB
Total device global memory: 3072MB
max WG size: 256
local mem type: Real
-unroll default value used: 18
-ffa_block default value used: 9216
-ffa_block_fetch default value used: 4608

Build features: Non-graphics BLANKIT OpenCL TWIN_FFA OCL_ZERO_COPY COMBINED_DECHIRP_KERNEL FFTW USE_INCREASED_PRECISION USE_SSE2 x86
CPUID: AMD FX(tm)-8150 Eight-Core Processor

Cache: L1=64K L2=2048K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX SSE4A XOP FMA4
AstroPulse v7 Windows x86 rev 2742, V7 match, by Raistmer with support of Lunatics.kwsn.net team. SSE2

OpenCL version by Raistmer

oclFFT fix for ATI GPUs by Urs Echternacht
ffa threshold mods by Joe Segur
SSE3 dechirping by JDWhale
Combined dechirp kernel by Frizz
Number of OpenCL platforms: 1


OpenCL Platform Name: AMD Accelerated Parallel Processing
Number of devices: 1
Max compute units: 36
Max work group size: 256
Max clock frequency: 1288Mhz
Max memory allocation: 3221225472
Cache type: Read/Write
Cache line size: 64
Cache size: 16384
Global memory size: 3221225472
Constant buffer size: 3221225472
Max number of constant args: 8
Local memory type: Scratchpad
Local memory size: 32768
Queue properties:
Out-of-Order: No
Name: Ellesmere
Vendor: Advanced Micro Devices, Inc.
Driver version: 2348.4
Version: OpenCL 1.2 AMD-APP (2348.4)
Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_khr_image2d_from_buffer cl_khr_spir cl_khr_gl_event cl_amd_liquid_flash


state.fold_buf_size_short=65536; state.fold_buf_size_long=262144

single pulses: 0
repetitive pulses: 30
percent blanked: 63.67
Rep. pulse: num_std_devs=8.188 peak_power=78148.52 dm=-3056 peak_bin=2560 scale=4 ffa_scale=5 period=440.5428
Rep. pulse: num_std_devs=9.264 peak_power=155538.4 dm=-3056 peak_bin=2048 scale=4 ffa_scale=6 period=440.7849
Rep. pulse: num_std_devs=6.943 peak_power=39275.07 dm=-3040 peak_bin=2816 scale=4 ffa_scale=4 period=440.442
Rep. pulse: num_std_devs=6.713 peak_power=19880.19 dm=-3040 peak_bin=2176 scale=4 ffa_scale=3 period=440.7916
Rep. pulse: num_std_devs=12 peak_power=1225779 dm=2944 peak_bin=2560 scale=7 ffa_scale=1 period=55.09829
Rep. pulse: num_std_devs=12.89 peak_power=2445335 dm=2944 peak_bin=2560 scale=7 ffa_scale=2 period=55.09787
Rep. pulse: num_std_devs=9.659 peak_power=1223167 dm=2944 peak_bin=9728 scale=7 ffa_scale=2 period=110.1953
Rep. pulse: num_std_devs=14.12 peak_power=4882111 dm=-2944 peak_bin=2048 scale=7 ffa_scale=3 period=55.09808
Rep. pulse: num_std_devs=9.657 peak_power=613767.6 dm=2944 peak_bin=9728 scale=7 ffa_scale=1 period=110.1957
Rep. pulse: num_std_devs=10.33 peak_power=2441262 dm=-2944 peak_bin=2048 scale=7 ffa_scale=3 period=110.1962
Rep. pulse: num_std_devs=10.03 peak_power=614061.3 dm=2944 peak_bin=2688 scale=7 ffa_scale=0 period=55.09808
Rep. pulse: num_std_devs=7.117 peak_power=1220338 dm=-2944 peak_bin=2048 scale=7 ffa_scale=3 period=220.3923
Rep. pulse: num_std_devs=7.854 peak_power=164955.6 dm=-2944 peak_bin=1536 scale=7 ffa_scale=2 period=826.4694
Rep. pulse: num_std_devs=8.689 peak_power=328541.1 dm=-2944 peak_bin=58368 scale=7 ffa_scale=3 period=826.4757
Rep. pulse: num_std_devs=7.201 peak_power=164692.7 dm=-2944 peak_bin=58368 scale=7 ffa_scale=3 period=1652.951
Rep. pulse: num_std_devs=7.01 peak_power=197657.5 dm=-2944 peak_bin=121856 scale=7 ffa_scale=3 period=1377.457
Rep. pulse: num_std_devs=8.036 peak_power=1632401 dm=-2944 peak_bin=2048 scale=7 ffa_scale=3 period=165.2938
Rep. pulse: num_std_devs=12.55 peak_power=494157.2 dm=-2944 peak_bin=30208 scale=7 ffa_scale=2 period=275.4934
Rep. pulse: num_std_devs=10.42 peak_power=247834.9 dm=-2944 peak_bin=65536 scale=7 ffa_scale=2 period=550.9848
Rep. pulse: num_std_devs=10.53 peak_power=247885.2 dm=-2944 peak_bin=23296 scale=7 ffa_scale=1 period=275.4924
Rep. pulse: num_std_devs=13.08 peak_power=983755.1 dm=-2944 peak_bin=16384 scale=7 ffa_scale=3 period=275.4882
Rep. pulse: num_std_devs=10.98 peak_power=493060.2 dm=-2944 peak_bin=58368 scale=7 ffa_scale=3 period=550.9848
Rep. pulse: num_std_devs=8.24 peak_power=124217.7 dm=-2944 peak_bin=58624 scale=7 ffa_scale=1 period=550.9785
Rep. pulse: num_std_devs=8.291 peak_power=124235.5 dm=-2944 peak_bin=23424 scale=7 ffa_scale=0 period=275.4903
Rep. pulse: num_std_devs=7.908 peak_power=246592.3 dm=-2944 peak_bin=71680 scale=7 ffa_scale=3 period=1101.97
Rep. pulse: num_std_devs=8.262 peak_power=124225.3 dm=-2944 peak_bin=136192 scale=7 ffa_scale=2 period=1101.961
Rep. pulse: num_std_devs=7.111 peak_power=977819.5 dm=-2944 peak_bin=5632 scale=7 ffa_scale=2 period=137.7457
Rep. pulse: num_std_devs=7.557 peak_power=612124.1 dm=2944 peak_bin=23552 scale=7 ffa_scale=2 period=220.3932
Rep. pulse: num_std_devs=8.046 peak_power=307545.6 dm=2944 peak_bin=9728 scale=7 ffa_scale=0 period=110.1962
Rep. pulse: num_std_devs=7.081 peak_power=307013.2 dm=2944 peak_bin=23808 scale=7 ffa_scale=1 period=220.3915

class T_remove_radar: total=1.91e+009, N=1, <>=1.91e+009, min=1.91e+009, max=1.91e+009
class T_main_loop_L1: total=9.36e+010, N=2, <>=4.68e+010, min=3.97e+010, max=5.39e+010
class T_FFT_forward: total=1.32e+008, N=1824, <>=7.25e+004, min=3.56e+004, max=9.31e+005
class T_remove_radar_randomize: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_build_chirp_table: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_DataWrite: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_DataWrite_ns: total=0, N=0, <>=0, min=0 max=0
class T_oclReadBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ChirpWrite: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ChirpWrite_ns: total=0, N=0, <>=0, min=0 max=0
class T_dechirp: total=1.03e+008, N=1824, <>=5.62e+004, min=4.74e+004, max=3.42e+005
class Dechirp_ns: total=0, N=0, <>=0, min=0 max=0
class Half_ns: total=0, N=0, <>=0, min=0 max=0
class T_PC_single_pulse_kernel_FFA_update: total=7.31e+010, N=1824, <>=4.01e+007, min=3.77e+007, max=1.61e+008
class PC_ns: total=0, N=0, <>=0, min=0 max=0
class T_oclReadBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_oclWriteBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFT_inverse: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ffa: total=1.30e+010, N=9, <>=1.45e+009, min=7.05e+008, max=7.00e+009

FFA blocks counters:
class T_FFA_fetch: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_tt_build: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_compare: total=8.45e+008, N=114, <>=7.41e+006, min=1.13e+006, max=5.86e+007
class T_FFA_coadd: total=4.89e+006, N=594, <>=8.23e+003, min=4.94e+003, max=5.57e+005
class T_FFA_stride_add: total=2.17e+006, N=101, <>=2.15e+004, min=1.71e+004, max=1.07e+005
class T_GPU_buffer_read_backs: total=0, N=0, <>=0, min=0 max=0
TWIN_FFA OCL_ZERO_COPY USE_OPENCL OPENCL_WRITE USE_INCREASED_PRECISION SMALL_CHIRP_TABLE COMBINED_DECHIRP_KERNEL BLANKIT
rev 2742
GPU device sync requested... ...GPU device synched
08:47:19 (7504): called boinc_finish(0)
[ /stderr ]

------------
AP7_win_x64_AVX_CPU_r2692.exe -verbose / single_pulses.wu :
AppName: AP7_win_x64_AVX_CPU_r2692.exe
AppArgs: -verbose
TaskName: single_pulses.wu
Started at : 08:47:25.050
Ended at : 08:50:21.305
Result : stored as ref for validations.
176.203 secs Elapsed
173.859 secs CPU time

[ stderr ]
08:47:25 (7312): Can't open init data file - running in standalone mode
Not using ap_cmdline.txt-file, using commandline options.
08:47:25 (7312): Can't open init data file - running in standalone mode

Build features: Non-graphics BLANKIT TWINDECHIRP USE_LRINT FFTW USE_INCREASED_PRECISION USE_AVX x64
CPUID: AMD FX(tm)-8150 Eight-Core Processor

Cache: L1=64K L2=2048K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX SSE4A XOP FMA4
AstroPulse v7 Windows x64 rev 2692, V7 match, by Raistmer with support of Lunatics.kwsn.net team.
by Lunatics team. Built with uncommitted modifications
state.fold_buf_size_short=65536; state.fold_buf_size_long=262144

single pulses: 4
repetitive pulses: 30
percent blanked: 0.00
Single pulse: peak_power=38.55 dm=-5720 fft_num=13991936 peak_bin=13999216 scale=2
Single pulse: peak_power=653.6 dm=-5749 fft_num=25804800 peak_bin=25807872 scale=9
Rep. pulse: num_std_devs=5.508 peak_power=2.841e+005 dm=-5719 peak_bin=1024 scale=4 ffa_scale=6 period=238.2936
Rep. pulse: num_std_devs=6.368 peak_power=2.846e+005 dm=-5719 peak_bin=256 scale=4 ffa_scale=3 period=29.81171
Rep. pulse: num_std_devs=6.823 peak_power=2.911e+005 dm=-5719 peak_bin=128 scale=4 ffa_scale=2 period=14.61354
Rep. pulse: num_std_devs=6.403 peak_power=5.678e+005 dm=-5719 peak_bin=0 scale=4 ffa_scale=2 period=7.461121
Rep. pulse: num_std_devs=6.805 peak_power=5.724e+005 dm=-5719 peak_bin=0 scale=4 ffa_scale=1 period=3.692785
Rep. pulse: num_std_devs=6.553 peak_power=2.847e+005 dm=-5719 peak_bin=32 scale=4 ffa_scale=1 period=7.437932
Rep. pulse: num_std_devs=6.55 peak_power=2.847e+005 dm=-5719 peak_bin=0 scale=4 ffa_scale=0 period=3.736143
Rep. pulse: num_std_devs=6.094 peak_power=2.844e+005 dm=-5719 peak_bin=0 scale=4 ffa_scale=7 period=476.8273
Rep. pulse: num_std_devs=6.363 peak_power=2.846e+005 dm=-5719 peak_bin=256 scale=4 ffa_scale=4 period=59.42633
Rep. pulse: num_std_devs=6.413 peak_power=2.867e+005 dm=-5719 peak_bin=512 scale=4 ffa_scale=5 period=117.9295
Rep. pulse: num_std_devs=6.632 peak_power=1.461e+005 dm=-5719 peak_bin=48 scale=4 ffa_scale=0 period=7.310673
Rep. pulse: num_std_devs=5.875 peak_power=1.427e+005 dm=-5719 peak_bin=384 scale=4 ffa_scale=3 period=59.5725
Rep. pulse: num_std_devs=7.214 peak_power=1.453e+005 dm=-5719 peak_bin=128 scale=4 ffa_scale=2 period=29.32267
Rep. pulse: num_std_devs=6.515 peak_power=1.44e+005 dm=-5719 peak_bin=64 scale=4 ffa_scale=1 period=14.78286
Rep. pulse: num_std_devs=5.856 peak_power=1.437e+005 dm=-5719 peak_bin=512 scale=4 ffa_scale=4 period=117.9655
Rep. pulse: num_std_devs=6.542 peak_power=2.868e+005 dm=-5719 peak_bin=0 scale=4 ffa_scale=0 period=3.692785
Rep. pulse: num_std_devs=6.341 peak_power=5.76e+005 dm=-5719 peak_bin=64 scale=4 ffa_scale=2 period=7.357562
Rep. pulse: num_std_devs=6.41 peak_power=2.888e+005 dm=-5719 peak_bin=128 scale=4 ffa_scale=3 period=29.32222
Rep. pulse: num_std_devs=5.853 peak_power=2.884e+005 dm=-5719 peak_bin=0 scale=4 ffa_scale=6 period=234.1665
Rep. pulse: num_std_devs=5.953 peak_power=1.438e+005 dm=-5719 peak_bin=6144 scale=4 ffa_scale=6 period=471.675
Rep. pulse: num_std_devs=6.41 peak_power=2.908e+005 dm=-5719 peak_bin=32 scale=4 ffa_scale=1 period=7.310784
Rep. pulse: num_std_devs=5.819 peak_power=2.884e+005 dm=-5719 peak_bin=0 scale=4 ffa_scale=4 period=58.78869
Rep. pulse: num_std_devs=5.844 peak_power=2.884e+005 dm=-5719 peak_bin=0 scale=4 ffa_scale=7 period=468.6118
Rep. pulse: num_std_devs=6.432 peak_power=1.45e+005 dm=-5719 peak_bin=512 scale=4 ffa_scale=3 period=58.64892
Rep. pulse: num_std_devs=6.584 peak_power=7.301e+004 dm=-5719 peak_bin=576 scale=4 ffa_scale=2 period=58.64624
Rep. pulse: num_std_devs=6.769 peak_power=2.911e+005 dm=-5719 peak_bin=48 scale=4 ffa_scale=0 period=3.655392
Rep. pulse: num_std_devs=6.995 peak_power=5.808e+005 dm=-5719 peak_bin=32 scale=4 ffa_scale=1 period=3.655392
Rep. pulse: num_std_devs=6.128 peak_power=1.459e+005 dm=-5719 peak_bin=128 scale=4 ffa_scale=1 period=14.61354
Rep. pulse: num_std_devs=5.609 peak_power=1.457e+005 dm=-5719 peak_bin=1536 scale=4 ffa_scale=5 period=233.7132
Rep. pulse: num_std_devs=5.673 peak_power=1.457e+005 dm=-5719 peak_bin=1280 scale=4 ffa_scale=4 period=116.7621
Single pulse: peak_power=91.79 dm=-5800 fft_num=3063808 peak_bin=3071328 scale=5
Single pulse: peak_power=63.35 dm=-5803 fft_num=3063808 peak_bin=3071344 scale=4

class T_remove_radar: total=1.47e+009, N=1, <>=1.47e+009, min=1.47e+009, max=1.47e+009
class T_main_loop_L1: total=7.96e+011, N=1, <>=7.96e+011, min=7.96e+011, max=7.96e+011
class T_FFT_forward: total=1.87e+010, N=16376, <>=1.14e+006, min=1.08e+006, max=1.13e+007
class T_remove_radar_randomize: total=2.34e+007, N=16376, <>=1.43e+003, min=8.73e+002, max=5.67e+004
class T_build_chirp_table: total=1.04e+009, N=8, <>=1.31e+008, min=1.28e+008, max=1.37e+008
class T_dechirp: total=8.89e+010, N=524031, <>=1.70e+005, min=3.70e+001, max=6.76e+006
class T_FFT_inverse: total=5.76e+011, N=524031, <>=1.10e+006, min=1.06e+006, max=1.44e+007
class T_ffa: total=1.05e+009, N=1, <>=1.05e+009, min=1.05e+009, max=1.05e+009

FFA blocks counters:
class T_FFA_fetch: total=1.13e+008, N=1570, <>=7.19e+004, min=5.89e+004, max=3.39e+005
class T_FFA_tt_build: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_compare: total=9.13e+008, N=59642, <>=1.53e+004, min=5.70e+001, max=1.97e+006
class T_FFA_coadd: total=9.13e+006, N=59641, <>=1.53e+002, min=4.10e+001, max=3.06e+004
class T_FFA_stride_add: total=1.46e+006, N=12554, <>=1.16e+002, min=3.70e+001, max=2.90e+003
USE_INCREASED_PRECISION SMALL_CHIRP_TABLE BLANKIT TWINDECHIRP USE_LRINT
rev 2692
08:50:19 (7312): called boinc_finish(0)
[ /stderr ]
------------
AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe -verbose / single_pulses.wu :
AppName: AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe
AppArgs: -verbose
TaskName: single_pulses.wu
Started at : 08:50:24.431
Ended at : 08:50:40.513
15.914 secs Elapsed
5.469 secs CPU time
Speedup : 96.85%
Ratio : 31.79x

ref-AP7_win_x64_AVX_CPU_r2692.exe-single_pulses.wu.res: <ap_signal>44,<pulses>34,<best_pulses>10
result-AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe-single_pulses.wu.res: <ap_signal>40,<pulses>30,<best_pulses>10
All Signals: Weakly similar or Different.
Pulses: pulse at signal 0 has no match (direction -->)
Weakly similar or Different.
Best Pulses: Weakly similar or Different.

-(.\testDatas\ref\ref-AP7_win_x64_AVX_CPU_r2692.exe-single_pulses.wu.res)-
Reportable Single Pulses: 4 [Weak], 3 above threshold*THRESHOLD_FUDGE
Reportable Repeating Pulses: 30 [Weak]
Single Pulses (Best): 10 [Weak], 3 above threshold*THRESHOLD_FUDGE

-(.\testDatas\result-AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe-single_pulses.wu.res)-
Reportable Single Pulses: 0 [Weak], 0 above threshold*THRESHOLD_FUDGE
Reportable Repeating Pulses: 30 [Weak]
Single Pulses (Best): 10 [Weak], 0 above threshold*THRESHOLD_FUDGE
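
The two summary lines above (reference vs. result) can be compared mechanically. This is a minimal Python sketch, not part of the benchmark script; the function names are made up for illustration, and it only compares the counts. Equal counts do not imply that validation passes: the best pulses are 10 vs. 10 here, yet they are still reported as "Weakly similar or Different".

```python
import re

# Matches the "<tag>number" pairs in a summary line such as
# "...single_pulses.wu.res: <ap_signal>44,<pulses>34,<best_pulses>10"
COUNT_RE = re.compile(r"<(\w+)>(\d+)")


def parse_counts(summary: str) -> dict:
    """Extract the signal counts from one .res summary line."""
    return {tag: int(value) for tag, value in COUNT_RE.findall(summary)}


def count_differences(ref: dict, result: dict) -> dict:
    """Return {tag: (ref_count, result_count)} for every count that differs."""
    return {
        tag: (ref[tag], result.get(tag))
        for tag in ref
        if ref[tag] != result.get(tag)
    }


if __name__ == "__main__":
    ref = parse_counts(
        "ref-AP7_win_x64_AVX_CPU_r2692.exe-single_pulses.wu.res: "
        "<ap_signal>44,<pulses>34,<best_pulses>10"
    )
    res = parse_counts(
        "result-AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe-single_pulses.wu.res: "
        "<ap_signal>40,<pulses>30,<best_pulses>10"
    )
    print(count_differences(ref, res))
```

For this WU the difference in `pulses` (34 vs. 30) equals the 4 single pulses that the CPU reference reports and the OpenCL result does not.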


[ stderr ]
08:50:24 (6312): Can't open init data file - running in standalone mode
Not using ap_cmdline.txt-file, using commandline options.
08:50:24 (6312): Can't open init data file - running in standalone mode
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
08:50:24 (6312): Can't open init data file - running in standalone mode
WARNING: init_data.xml missing
OpenCL platform detected: Advanced Micro Devices, Inc.
WARNING: BOINC supplied wrong platform!
BOINC assigns device 0
WARNING: BOINC failed to provide OpenCL device, using own enumeration abilities
Used GPU device parameters are:
Number of compute units: 36
Single buffer allocation size: 256MB
Total device global memory: 3072MB
max WG size: 256
local mem type: Real
-unroll default value used: 18
-ffa_block default value used: 9216
-ffa_block_fetch default value used: 4608

Build features: Non-graphics BLANKIT OpenCL TWIN_FFA OCL_ZERO_COPY COMBINED_DECHIRP_KERNEL FFTW USE_INCREASED_PRECISION USE_SSE2 x86
CPUID: AMD FX(tm)-8150 Eight-Core Processor

Cache: L1=64K L2=2048K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX SSE4A XOP FMA4
AstroPulse v7 Windows x86 rev 2742, V7 match, by Raistmer with support of Lunatics.kwsn.net team. SSE2

OpenCL version by Raistmer

oclFFT fix for ATI GPUs by Urs Echternacht
ffa threshold mods by Joe Segur
SSE3 dechirping by JDWhale
Combined dechirp kernel by Frizz
Number of OpenCL platforms: 1


OpenCL Platform Name: AMD Accelerated Parallel Processing
Number of devices: 1
Max compute units: 36
Max work group size: 256
Max clock frequency: 1288Mhz
Max memory allocation: 3221225472
Cache type: Read/Write
Cache line size: 64
Cache size: 16384
Global memory size: 3221225472
Constant buffer size: 3221225472
Max number of constant args: 8
Local memory type: Scratchpad
Local memory size: 32768
Queue properties:
Out-of-Order: No
Name: Ellesmere
Vendor: Advanced Micro Devices, Inc.
Driver version: 2348.4
Version: OpenCL 1.2 AMD-APP (2348.4)
Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_khr_image2d_from_buffer cl_khr_spir cl_khr_gl_event cl_amd_liquid_flash


state.fold_buf_size_short=65536; state.fold_buf_size_long=262144
INFO: excessive number of rep. pulses detected. To save system memory FFA will be redone with decreased FFA block value 256.

single pulses: 0
repetitive pulses: 30
percent blanked: 0.00
Rep. pulse: num_std_devs=6.189 peak_power=284493.8 dm=-5719 peak_bin=512 scale=4 ffa_scale=4 period=59.54069
Rep. pulse: num_std_devs=5.965 peak_power=284351.6 dm=-5719 peak_bin=128 scale=4 ffa_scale=3 period=29.77534
Rep. pulse: num_std_devs=6.117 peak_power=284448.2 dm=-5719 peak_bin=0 scale=4 ffa_scale=2 period=14.88585
Rep. pulse: num_std_devs=6.674 peak_power=284800.3 dm=-5719 peak_bin=0 scale=4 ffa_scale=1 period=7.421947
Rep. pulse: num_std_devs=6.862 peak_power=568296.5 dm=-5719 peak_bin=0 scale=4 ffa_scale=2 period=7.421833
Rep. pulse: num_std_devs=6.65 peak_power=284785.6 dm=-5719 peak_bin=16 scale=4 ffa_scale=0 period=3.724247
Rep. pulse: num_std_devs=6.615 peak_power=568040.1 dm=-5719 peak_bin=32 scale=4 ffa_scale=1 period=3.72965
Rep. pulse: num_std_devs=5.463 peak_power=284034.1 dm=-5719 peak_bin=2048 scale=4 ffa_scale=7 period=476.0785
Rep. pulse: num_std_devs=6.235 peak_power=142851 dm=-5719 peak_bin=1536 scale=4 ffa_scale=4 period=119.562
Rep. pulse: num_std_devs=6.802 peak_power=36376.91 dm=-5719 peak_bin=3584 scale=4 ffa_scale=3 period=239.0913
Rep. pulse: num_std_devs=6.622 peak_power=71983.22 dm=-5719 peak_bin=1664 scale=4 ffa_scale=3 period=119.5456
Rep. pulse: num_std_devs=5.921 peak_power=286390 dm=-5719 peak_bin=0 scale=4 ffa_scale=5 period=118.0286
Rep. pulse: num_std_devs=6.234 peak_power=142850.4 dm=-5719 peak_bin=16 scale=4 ffa_scale=0 period=7.448494
Rep. pulse: num_std_devs=5.674 peak_power=286233.2 dm=-5719 peak_bin=0 scale=4 ffa_scale=6 period=235.9383
Rep. pulse: num_std_devs=6.352 peak_power=144969.6 dm=-5719 peak_bin=32 scale=4 ffa_scale=1 period=14.68193
Rep. pulse: num_std_devs=5.726 peak_power=143675.8 dm=-5719 peak_bin=768 scale=4 ffa_scale=3 period=59.24164
Rep. pulse: num_std_devs=6.095 peak_power=142793.5 dm=-5719 peak_bin=320 scale=4 ffa_scale=2 period=29.70637
Rep. pulse: num_std_devs=6.366 peak_power=71912.17 dm=-5719 peak_bin=256 scale=4 ffa_scale=2 period=59.42724
Rep. pulse: num_std_devs=6.328 peak_power=576008.1 dm=-5719 peak_bin=32 scale=4 ffa_scale=1 period=3.666732
Rep. pulse: num_std_devs=6.597 peak_power=288888.2 dm=-5719 peak_bin=16 scale=4 ffa_scale=0 period=3.659131
Rep. pulse: num_std_devs=5.921 peak_power=288456.3 dm=-5719 peak_bin=256 scale=4 ffa_scale=3 period=29.41095
Rep. pulse: num_std_devs=5.942 peak_power=288469.7 dm=-5719 peak_bin=0 scale=4 ffa_scale=2 period=14.70143
Rep. pulse: num_std_devs=6.033 peak_power=288527.8 dm=-5719 peak_bin=0 scale=4 ffa_scale=4 period=58.7869
Rep. pulse: num_std_devs=5.541 peak_power=288213.4 dm=-5719 peak_bin=0 scale=4 ffa_scale=7 period=469.3847
Rep. pulse: num_std_devs=6.455 peak_power=145012.4 dm=-5719 peak_bin=16 scale=4 ffa_scale=0 period=7.360594
Rep. pulse: num_std_devs=5.721 peak_power=144707.9 dm=-5719 peak_bin=1280 scale=4 ffa_scale=4 period=117.5254
Rep. pulse: num_std_devs=5.772 peak_power=144729 dm=-5719 peak_bin=0 scale=4 ffa_scale=2 period=29.40287
Rep. pulse: num_std_devs=6.192 peak_power=288629.2 dm=-5719 peak_bin=96 scale=4 ffa_scale=1 period=7.332681
Rep. pulse: num_std_devs=6.248 peak_power=575923.3 dm=-5719 peak_bin=64 scale=4 ffa_scale=2 period=7.339622
Rep. pulse: num_std_devs=5.947 peak_power=144801.6 dm=-5719 peak_bin=0 scale=4 ffa_scale=3 period=58.54163

class T_remove_radar: total=1.90e+009, N=1, <>=1.90e+009, min=1.90e+009, max=1.90e+009
class T_main_loop_L1: total=5.90e+010, N=1, <>=5.90e+010, min=5.90e+010, max=5.90e+010
class T_FFT_forward: total=4.34e+007, N=912, <>=4.76e+004, min=3.60e+004, max=1.59e+006
class T_remove_radar_randomize: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_build_chirp_table: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_DataWrite: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_DataWrite_ns: total=0, N=0, <>=0, min=0 max=0
class T_oclReadBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ChirpWrite: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ChirpWrite_ns: total=0, N=0, <>=0, min=0 max=0
class T_dechirp: total=5.09e+007, N=912, <>=5.58e+004, min=4.77e+004, max=2.97e+005
class Dechirp_ns: total=0, N=0, <>=0, min=0 max=0
class Half_ns: total=0, N=0, <>=0, min=0 max=0
class T_PC_single_pulse_kernel_FFA_update: total=3.54e+010, N=912, <>=3.88e+007, min=3.76e+007, max=1.74e+008
class PC_ns: total=0, N=0, <>=0, min=0 max=0
class T_oclReadBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_oclWriteBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFT_inverse: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ffa: total=2.31e+010, N=1, <>=2.31e+010, min=2.31e+010, max=2.31e+010

FFA blocks counters:
class T_FFA_fetch: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_tt_build: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_compare: total=1.52e+009, N=50, <>=3.04e+007, min=1.01e+006, max=2.51e+008
class T_FFA_coadd: total=1.52e+006, N=106, <>=1.43e+004, min=5.05e+003, max=6.01e+005
class T_FFA_stride_add: total=1.65e+006, N=43, <>=3.83e+004, min=2.16e+004, max=1.14e+005
class T_GPU_buffer_read_backs: total=2, N=2, <>=1, min=1 max=1
TWIN_FFA OCL_ZERO_COPY USE_OPENCL OPENCL_WRITE USE_INCREASED_PRECISION SMALL_CHIRP_TABLE COMBINED_DECHIRP_KERNEL BLANKIT
rev 2742
GPU device sync requested... ...GPU device synched
08:50:38 (6312): called boinc_finish(0)
[ /stderr ]
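
The timer lines in the stderr dump above are easier to compare between runs once they are parsed. The Python sketch below rests on two assumptions that the log suggests but the app's source is not quoted to confirm: that the `<>` field is the mean (total / N), which fits e.g. T_FFT_forward (4.34e+007 / 912 is about 4.76e+004), and that min=1.84e+019 on N=0 lines is an unset starting value (it is close to the largest unsigned 64-bit integer). The function name is hypothetical.

```python
import re

# Matches e.g.
# "class T_FFT_forward: total=4.34e+007, N=912, <>=4.76e+004, min=3.60e+004, max=1.59e+006"
# The comma before "max" is optional because the "_ns" lines omit it.
TIMER_RE = re.compile(
    r"class (?P<name>\w+): total=(?P<total>[^,]+), N=(?P<n>\d+), "
    r"<>=(?P<mean>[^,]+), min=(?P<min>[^, ]+),? max=(?P<max>\S+)"
)


def parse_timer(line: str):
    """Return a dict for one timer line, or None if the line is not a timer line."""
    m = TIMER_RE.search(line)
    if m is None:
        return None
    n = int(m.group("n"))
    return {
        "name": m.group("name"),
        "total": float(m.group("total")),
        "n": n,
        "mean": float(m.group("mean")),
        # With N=0 the min field holds a placeholder, so report no minimum.
        "min": float(m.group("min")) if n else None,
        "max": float(m.group("max")),
    }


if __name__ == "__main__":
    t = parse_timer(
        "class T_FFT_forward: total=4.34e+007, N=912, <>=4.76e+004, "
        "min=3.60e+004, max=1.59e+006"
    )
    # Recompute the mean from total and N to cross-check the "<>" field.
    print(t["name"], t["total"] / t["n"], t["mean"])
```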

------------
Quick timetable

WU : #ap_genwis.dat
AP7_win_x64_AVX_CPU_r2692.exe -verbose :
Elapsed 2.629 secs
CPU 0.516 secs
AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe -verbose :
Elapsed 3.499 secs, speedup: -33.09% ratio: 0.75x
CPU 1.219 secs, speedup: -136.24% ratio: 0.42x

WU : LoThresh_v5.dat
AP7_win_x64_AVX_CPU_r2692.exe -verbose :
Elapsed 153.788 secs
CPU 150.875 secs
AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe -verbose :
Elapsed 18.503 secs, speedup: 87.97% ratio: 8.31x
CPU 7.219 secs, speedup: 95.22% ratio: 20.90x

WU : short_ap_21oc08ab_B2_P0_00081_20081130_08605.dat
AP7_win_x64_AVX_CPU_r2692.exe -verbose :
Elapsed 193.870 secs
CPU 191.219 secs
AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe -verbose :
Elapsed 14.065 secs, speedup: 92.75% ratio: 13.78x
CPU 3.422 secs, speedup: 98.21% ratio: 55.88x

WU : ap_Zblank_2LC67.wu
AP7_win_x64_AVX_CPU_r2692.exe -verbose :
Elapsed 433.668 secs
CPU 431.422 secs
AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe -verbose :
Elapsed 23.568 secs, speedup: 94.57% ratio: 18.40x
CPU 3.313 secs, speedup: 99.23% ratio: 130.22x

WU : ap_Zblank_9LC67.wu
AP7_win_x64_AVX_CPU_r2692.exe -verbose :
Elapsed 1938.928 secs
CPU 1933.547 secs
AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe -verbose :
Elapsed 94.465 secs, speedup: 95.13% ratio: 20.53x
CPU 11.672 secs, speedup: 99.40% ratio: 165.66x

WU : JasonShort_v5.wu
AP7_win_x64_AVX_CPU_r2692.exe -verbose :
Elapsed 351.883 secs
CPU 349.047 secs
AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe -verbose :
Elapsed 20.387 secs, speedup: 94.21% ratio: 17.26x
CPU 3.609 secs, speedup: 98.97% ratio: 96.72x

WU : Raistmer_tinyrr.wu
AP7_win_x64_AVX_CPU_r2692.exe -verbose :
Elapsed 89.044 secs
CPU 87.000 secs
AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe -verbose :
Elapsed 8.837 secs, speedup: 90.08% ratio: 10.08x
CPU 2.781 secs, speedup: 96.80% ratio: 31.28x

WU : sigind_v5.wu
AP7_win_x64_AVX_CPU_r2692.exe -verbose :
Elapsed 220.626 secs
CPU 218.375 secs
AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe -verbose :
Elapsed 23.440 secs, speedup: 89.38% ratio: 9.41x
CPU 5.422 secs, speedup: 97.52% ratio: 40.28x

WU : single_pulses.wu
AP7_win_x64_AVX_CPU_r2692.exe -verbose :
Elapsed 176.203 secs
CPU 173.859 secs
AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe -verbose :
Elapsed 15.914 secs, speedup: 90.97% ratio: 11.07x
CPU 5.469 secs, speedup: 96.85% ratio: 31.79x
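
The speedup and ratio figures in the timetable appear to follow from the reference and test times as speedup = (ref - test) / ref and ratio = ref / test. This is inferred from the numbers above, not taken from the benchmark script, so the Python sketch below (function names made up for illustration) is only a way to cross-check the table:

```python
def speedup_percent(ref_secs: float, test_secs: float) -> float:
    """Percentage of the reference time saved by the tested app (negative = slower)."""
    return (ref_secs - test_secs) / ref_secs * 100.0


def ratio(ref_secs: float, test_secs: float) -> float:
    """How many times faster the tested app is than the reference."""
    return ref_secs / test_secs


if __name__ == "__main__":
    # Elapsed times for single_pulses.wu from the timetable above.
    ref, test = 176.203, 15.914
    print(f"speedup: {speedup_percent(ref, test):.2f}% ratio: {ratio(ref, test):.2f}x")
```

Fed the #ap_genwis.dat elapsed times (2.629 and 3.499 secs), the same formulas give the negative speedup shown in the table, since the OpenCL app was slower than the CPU app on that WU.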

------------
11) Message boards : Number crunching : Anything relating to AstroPulse tasks (Message 1875148)
Posted 26 Jun 2017 by Profile Karsten Vinding
Post:
The results of the first run are in.

Not very encouraging...

I ended up running only some of the WUs and skipped some of the long/slow ones, as the results were already pretty clear:


APbench211.cmd
======================================
9 testWU(s) found
(#ap_genwis.dat)
(LoThresh_v5.dat)
(short_ap_21oc08ab_B2_P0_00081_20081130_08605.dat)
(ap_Zblank_2LC67.wu)
(ap_Zblank_9LC67.wu)
(JasonShort_v5.wu)
(Raistmer_tinyrr.wu)
(sigind_v5.wu)
(single_pulses.wu)
1 reference science app(s) found
(AP7_win_x64_AVX_CPU_r2692.exe -verbose)
1 science app(s) found
(AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe -verbose )
======================================
AP7_win_x64_AVX_CPU_r2692.exe -verbose / #ap_genwis.dat :
AppName: AP7_win_x64_AVX_CPU_r2692.exe
AppArgs: -verbose
TaskName: #ap_genwis.dat
Started at : 07:52:33.842
Ended at : 07:52:36.522
2.629 secs Elapsed
0.516 secs CPU time

[ stderr ]
07:52:33 (12048): Can't open init data file - running in standalone mode
Not using ap_cmdline.txt-file, using commandline options.
07:52:33 (12048): Can't open init data file - running in standalone mode

Build features: Non-graphics BLANKIT TWINDECHIRP USE_LRINT FFTW USE_INCREASED_PRECISION USE_AVX x64
CPUID: AMD FX(tm)-8150 Eight-Core Processor

Cache: L1=64K L2=2048K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX SSE4A XOP FMA4
AstroPulse v7 Windows x64 rev 2692, V7 match, by Raistmer with support of Lunatics.kwsn.net team.
by Lunatics team. Built with uncommitted modifications
state.fold_buf_size_short=65536; state.fold_buf_size_long=262144
In ap_remove_radar.cpp: get_indices_to_randomize: num_ffts_forecast < 100. Blanking too much RFI?
percent blanked: 100.00

class T_remove_radar: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_main_loop_L1: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFT_forward: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_remove_radar_randomize: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_build_chirp_table: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_dechirp: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFT_inverse: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ffa: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000

FFA blocks counters:
class T_FFA_fetch: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_tt_build: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_compare: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_coadd: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_stride_add: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
USE_INCREASED_PRECISION SMALL_CHIRP_TABLE BLANKIT TWINDECHIRP USE_LRINT
rev 2692
07:52:34 (12048): called boinc_finish(0)
[ /stderr ]
------------
AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe -verbose / #ap_genwis.dat :
AppName: AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe
AppArgs: -verbose
TaskName: #ap_genwis.dat
Started at : 07:52:39.637
Ended at : 07:52:43.283
3.499 secs Elapsed
1.219 secs CPU time
Speedup : -136.24%
Ratio : 0.42x
Skipping validation, genwis run.

[ stderr ]
07:52:39 (4844): Can't open init data file - running in standalone mode
Not using ap_cmdline.txt-file, using commandline options.
07:52:39 (4844): Can't open init data file - running in standalone mode
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
07:52:39 (4844): Can't open init data file - running in standalone mode
WARNING: init_data.xml missing
OpenCL platform detected: Advanced Micro Devices, Inc.
WARNING: BOINC supplied wrong platform!
BOINC assigns device 0
WARNING: BOINC failed to provide OpenCL device, using own enumeration abilities
Used GPU device parameters are:
Number of compute units: 36
Single buffer allocation size: 256MB
Total device global memory: 3072MB
max WG size: 256
local mem type: Real
-unroll default value used: 18
-ffa_block default value used: 9216
-ffa_block_fetch default value used: 4608

Build features: Non-graphics BLANKIT OpenCL TWIN_FFA OCL_ZERO_COPY COMBINED_DECHIRP_KERNEL FFTW USE_INCREASED_PRECISION USE_SSE2 x86
CPUID: AMD FX(tm)-8150 Eight-Core Processor

Cache: L1=64K L2=2048K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX SSE4A XOP FMA4
AstroPulse v7 Windows x86 rev 2742, V7 match, by Raistmer with support of Lunatics.kwsn.net team. SSE2

OpenCL version by Raistmer

oclFFT fix for ATI GPUs by Urs Echternacht
ffa threshold mods by Joe Segur
SSE3 dechirping by JDWhale
Combined dechirp kernel by Frizz
Number of OpenCL platforms: 1


OpenCL Platform Name: AMD Accelerated Parallel Processing
Number of devices: 1
Max compute units: 36
Max work group size: 256
Max clock frequency: 1288Mhz
Max memory allocation: 3221225472
Cache type: Read/Write
Cache line size: 64
Cache size: 16384
Global memory size: 3221225472
Constant buffer size: 3221225472
Max number of constant args: 8
Local memory type: Scratchpad
Local memory size: 32768
Queue properties:
Out-of-Order: No
Name: Ellesmere
Vendor: Advanced Micro Devices, Inc.
Driver version: 2348.4
Version: OpenCL 1.2 AMD-APP (2348.4)
Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_khr_image2d_from_buffer cl_khr_spir cl_khr_gl_event cl_amd_liquid_flash


state.fold_buf_size_short=65536; state.fold_buf_size_long=262144
In ap_remove_radar.cpp: get_indices_to_randomize: num_ffts_forecast < 100. Blanking too much RFI?
percent blanked: 100.00

class T_remove_radar: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_main_loop_L1: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFT_forward: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_remove_radar_randomize: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_build_chirp_table: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_DataWrite: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_DataWrite_ns: total=0, N=0, <>=0, min=0 max=0
class T_oclReadBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ChirpWrite: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ChirpWrite_ns: total=0, N=0, <>=0, min=0 max=0
class T_dechirp: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class Dechirp_ns: total=0, N=0, <>=0, min=0 max=0
class Half_ns: total=0, N=0, <>=0, min=0 max=0
class T_PC_single_pulse_kernel_FFA_update: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class PC_ns: total=0, N=0, <>=0, min=0 max=0
class T_oclReadBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_oclWriteBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFT_inverse: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ffa: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000

FFA blocks counters:
class T_FFA_fetch: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_tt_build: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_compare: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_coadd: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_stride_add: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_GPU_buffer_read_backs: total=0, N=0, <>=0, min=0 max=0
TWIN_FFA OCL_ZERO_COPY USE_OPENCL OPENCL_WRITE USE_INCREASED_PRECISION SMALL_CHIRP_TABLE COMBINED_DECHIRP_KERNEL BLANKIT
rev 2742
07:52:41 (4844): called boinc_finish(0)
[ /stderr ]
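
A note on reading the timer lines in these stderr dumps: counters that never fired show N=0 together with min=1.84e+019. That value is very close to 2^64 - 1 (about 1.8447e19), so it looks like an unsigned 64-bit "no sample yet" starting value rather than a real measurement. This is my assumption, not something the log states. Below is a minimal Python sketch of how the lines could be parsed while ignoring such empty counters. The names COUNTER_RE and parse_counter are made up for illustration and are not part of the AstroPulse tools.

```python
import re

# Matches both forms seen in the log:
#   class T_ffa: total=4.29e+009, N=8, <>=5.36e+008, min=5.07e+008, max=6.43e+008
#   class PC_ns: total=0, N=0, <>=0, min=0 max=0      (no comma after min)
COUNTER_RE = re.compile(
    r"class (?P<name>\w+): total=(?P<total>\S+?), N=(?P<n>\d+), "
    r"<>=(?P<mean>\S+?), min=(?P<min>\S+?),? max=(?P<max>\S+)"
)


def parse_counter(line):
    """Parse one timer line into a dict, or return None if it is not a timer line.

    For counters with N=0 the min/max/mean fields are set to None, because the
    printed min (1.84e+019) is assumed to be an initial sentinel, not a timing.
    """
    m = COUNTER_RE.search(line)
    if m is None:
        return None
    n = int(m.group("n"))
    empty = n == 0
    return {
        "name": m.group("name"),
        "n": n,
        "total": float(m.group("total")),
        "mean": None if empty else float(m.group("mean")),
        "min": None if empty else float(m.group("min")),
        "max": None if empty else float(m.group("max")),
    }


if __name__ == "__main__":
    sample = [
        "class T_remove_radar: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000",
        "class T_ffa: total=4.29e+009, N=8, <>=5.36e+008, min=5.07e+008, max=6.43e+008",
        "rev 2742",
    ]
    for text in sample:
        print(parse_counter(text))
```

The units of the totals are not given in the log, so the sketch leaves them as plain numbers.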

------------
AP7_win_x64_AVX_CPU_r2692.exe -verbose / LoThresh_v5.dat :
Result cached, skipping execution
153.788 secs Elapsed
150.875 secs CPU time

Stderr.txt : not found
------------
AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe -verbose / LoThresh_v5.dat :
AppName: AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe
AppArgs: -verbose
TaskName: LoThresh_v5.dat
Started at : 07:52:46.612
Ended at : 07:53:05.280
18.503 secs Elapsed
7.219 secs CPU time
Speedup : 95.22%
Ratio : 20.90x
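
For anyone wondering where the Speedup and Ratio figures come from: judging by the numbers in this log, they appear to be computed from the CPU time of the two runs, not from the elapsed time. That is an inference from the values shown here, not something taken from the benchmark tool's source. A small Python sketch (the function name speedup_and_ratio is my own):

```python
def speedup_and_ratio(ref_cpu_secs, test_cpu_secs):
    """Return (speedup in percent, ratio) of a test run against a reference run.

    Assumption: both figures are based on CPU time, which is what the
    numbers in the log are consistent with.
    """
    ratio = ref_cpu_secs / test_cpu_secs
    speedup_pct = (1.0 - test_cpu_secs / ref_cpu_secs) * 100.0
    return speedup_pct, ratio


if __name__ == "__main__":
    # CPU times for LoThresh_v5.dat: 150.875 s (AVX CPU ref) vs 7.219 s (OpenCL)
    speedup, ratio = speedup_and_ratio(150.875, 7.219)
    print(f"Speedup : {speedup:.2f}%")
    print(f"Ratio   : {ratio:.2f}x")
```

With the CPU times from this test it gives 95.22% and 20.90x, the same as reported. Since the OpenCL app does most of its work on the GPU, its CPU time is much lower than its elapsed time, so the ratio based on elapsed time would be considerably smaller (153.788 s against 18.503 s here).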

ref-AP7_win_x64_AVX_CPU_r2692.exe-LoThresh_v5.dat.res: <ap_signal>70,<pulses>60,<best_pulses>10
result-AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe-LoThresh_v5.dat.res: <ap_signal>70,<pulses>60,<best_pulses>10
All Signals: Weakly similar or Different.
Pulses: Checked 60, 60 , Strongly Similar
Best Pulses: Weakly similar or Different.

-(.\testDatas\ref\ref-AP7_win_x64_AVX_CPU_r2692.exe-LoThresh_v5.dat.res)-
Reportable Single Pulses: 30 [OK], 16 above threshold*THRESHOLD_FUDGE
Reportable Repeating Pulses: 30 [OK]
Single Pulses (Best): 10 [Weak], 9 above threshold*THRESHOLD_FUDGE

-(.\testDatas\result-AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe-LoThresh_v5.dat.res)-
Reportable Single Pulses: 30 [OK], 16 above threshold*THRESHOLD_FUDGE
Reportable Repeating Pulses: 30 [OK]
Single Pulses (Best): 10 [Weak], 9 above threshold*THRESHOLD_FUDGE


[ stderr ]
07:52:46 (4564): Can't open init data file - running in standalone mode
Not using ap_cmdline.txt-file, using commandline options.
07:52:46 (4564): Can't open init data file - running in standalone mode
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
07:52:46 (4564): Can't open init data file - running in standalone mode
WARNING: init_data.xml missing
OpenCL platform detected: Advanced Micro Devices, Inc.
WARNING: BOINC supplied wrong platform!
BOINC assigns device 0
WARNING: BOINC failed to provide OpenCL device, using own enumeration abilities
Used GPU device parameters are:
Number of compute units: 36
Single buffer allocation size: 256MB
Total device global memory: 3072MB
max WG size: 256
local mem type: Real
-unroll default value used: 18
-ffa_block default value used: 9216
-ffa_block_fetch default value used: 4608

Build features: Non-graphics BLANKIT OpenCL TWIN_FFA OCL_ZERO_COPY COMBINED_DECHIRP_KERNEL FFTW USE_INCREASED_PRECISION USE_SSE2 x86
CPUID: AMD FX(tm)-8150 Eight-Core Processor

Cache: L1=64K L2=2048K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX SSE4A XOP FMA4
AstroPulse v7 Windows x86 rev 2742, V7 match, by Raistmer with support of Lunatics.kwsn.net team. SSE2

OpenCL version by Raistmer

oclFFT fix for ATI GPUs by Urs Echternacht
ffa threshold mods by Joe Segur
SSE3 dechirping by JDWhale
Combined dechirp kernel by Frizz
Number of OpenCL platforms: 1


OpenCL Platform Name: AMD Accelerated Parallel Processing
Number of devices: 1
Max compute units: 36
Max work group size: 256
Max clock frequency: 1288Mhz
Max memory allocation: 3221225472
Cache type: Read/Write
Cache line size: 64
Cache size: 16384
Global memory size: 3221225472
Constant buffer size: 3221225472
Max number of constant args: 8
Local memory type: Scratchpad
Local memory size: 32768
Queue properties:
Out-of-Order: No
Name: Ellesmere
Vendor: Advanced Micro Devices, Inc.
Driver version: 2348.4
Version: OpenCL 1.2 AMD-APP (2348.4)
Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_khr_image2d_from_buffer cl_khr_spir cl_khr_gl_event cl_amd_liquid_flash


state.fold_buf_size_short=65536; state.fold_buf_size_long=262144
INFO: excessive number of rep. pulses detected. To save system memory FFA will be redone with decreased FFA block value 256.
Found 30 single pulses and 30 repeating pulses, exiting.
percent blanked: 23.07
Single pulse: peak_power=196.372 dm=896 fft_num=7749632 peak_bin=7765760 scale=7
Single pulse: peak_power=617.536 dm=1006 fft_num=12599296 peak_bin=12599296 scale=9
Single pulse: peak_power=122.405 dm=907 fft_num=13713408 peak_bin=13718528 scale=6
Single pulse: peak_power=194.256 dm=908 fft_num=19529728 peak_bin=19543680 scale=7
Single pulse: peak_power=606.276 dm=918 fft_num=19939328 peak_bin=19945984 scale=9
Single pulse: peak_power=81.0368 dm=-915 fft_num=20627456 peak_bin=20635328 scale=5
Single pulse: peak_power=54.6496 dm=-911 fft_num=24805376 peak_bin=24816384 scale=4
Single pulse: peak_power=30.5704 dm=-910 fft_num=27918336 peak_bin=27928028 scale=2
Single pulse: peak_power=23.8134 dm=918 fft_num=14974976 peak_bin=14983100 scale=1
Single pulse: peak_power=21.728 dm=919 fft_num=14974976 peak_bin=14983100 scale=0
Single pulse: peak_power=339.871 dm=-920 fft_num=16662528 peak_bin=16665344 scale=8
Single pulse: peak_power=605.255 dm=-920 fft_num=18579456 peak_bin=18579456 scale=9
Single pulse: peak_power=615.265 dm=937 fft_num=27361280 peak_bin=27371520 scale=9
Single pulse: peak_power=21.5215 dm=-934 fft_num=24346624 peak_bin=24356408 scale=0
Single pulse: peak_power=24.7623 dm=-953 fft_num=7585792 peak_bin=7591104 scale=1
Single pulse: peak_power=333.996 dm=956 fft_num=10092544 peak_bin=10107904 scale=8
Single pulse: peak_power=202.038 dm=989 fft_num=10928128 peak_bin=10931200 scale=7
Single pulse: peak_power=196.911 dm=-962 fft_num=15171584 peak_bin=15175168 scale=7
Single pulse: peak_power=610.667 dm=970 fft_num=15450112 peak_bin=15459328 scale=9
Single pulse: peak_power=334.682 dm=-972 fft_num=17776640 peak_bin=17778944 scale=8
Single pulse: peak_power=333.511 dm=950 fft_num=29081600 peak_bin=29092608 scale=8
Single pulse: peak_power=21.1826 dm=-974 fft_num=3899392 peak_bin=3914823 scale=0
Single pulse: peak_power=21.5014 dm=973 fft_num=9945088 peak_bin=9950747 scale=0
Single pulse: peak_power=24.3755 dm=-966 fft_num=14139392 peak_bin=14148486 scale=1
Single pulse: peak_power=334.28 dm=999 fft_num=10928128 peak_bin=10931200 scale=8
Single pulse: peak_power=605.108 dm=988 fft_num=19742720 peak_bin=19755008 scale=9
Single pulse: peak_power=605.26 dm=-1007 fft_num=12042240 peak_bin=12052480 scale=9
Single pulse: peak_power=607.48 dm=1005 fft_num=17580032 peak_bin=17595392 scale=9
Single pulse: peak_power=31.1783 dm=-1001 fft_num=18939904 peak_bin=18951852 scale=2
Single pulse: peak_power=333.91 dm=1023 fft_num=4358144 peak_bin=4374016 scale=8
Rep. pulse: num_std_devs=7.337 peak_power=9002194 dm=-896 peak_bin=256 scale=7 ffa_scale=1 period=7.473569
Rep. pulse: num_std_devs=8.006 peak_power=1.812754e+007 dm=-896 peak_bin=0 scale=7 ffa_scale=2 period=7.393885
Rep. pulse: num_std_devs=8.392 peak_power=1.812944e+007 dm=-896 peak_bin=0 scale=7 ffa_scale=1 period=3.710053
Rep. pulse: num_std_devs=7.141 peak_power=9001560 dm=-896 peak_bin=256 scale=7 ffa_scale=0 period=3.722784
Rep. pulse: num_std_devs=7.419 peak_power=9068096 dm=-896 peak_bin=512 scale=7 ffa_scale=2 period=14.78535
Rep. pulse: num_std_devs=6.609 peak_power=4503785 dm=-896 peak_bin=256 scale=7 ffa_scale=1 period=14.94714
Rep. pulse: num_std_devs=6.372 peak_power=9064692 dm=-896 peak_bin=1024 scale=7 ffa_scale=3 period=29.62647
Rep. pulse: num_std_devs=6.566 peak_power=4503691 dm=-896 peak_bin=256 scale=7 ffa_scale=0 period=7.461974
Rep. pulse: num_std_devs=7.964 peak_power=1130826 dm=-896 peak_bin=6144 scale=7 ffa_scale=3 period=238.7568
Rep. pulse: num_std_devs=7.435 peak_power=566749.4 dm=-896 peak_bin=28160 scale=7 ffa_scale=2 period=238.7522
Rep. pulse: num_std_devs=7.303 peak_power=284453.9 dm=-896 peak_bin=11776 scale=7 ffa_scale=1 period=238.754
Rep. pulse: num_std_devs=7.052 peak_power=142932.7 dm=-896 peak_bin=23552 scale=7 ffa_scale=0 period=238.754
Rep. pulse: num_std_devs=6.13 peak_power=8998286 dm=-896 peak_bin=0 scale=7 ffa_scale=6 period=238.7467
Rep. pulse: num_std_devs=6.072 peak_power=9063717 dm=-896 peak_bin=2048 scale=7 ffa_scale=4 period=59.21723
Rep. pulse: num_std_devs=7.38 peak_power=9067972 dm=-896 peak_bin=768 scale=7 ffa_scale=1 period=7.392616
Rep. pulse: num_std_devs=6.927 peak_power=4537315 dm=-896 peak_bin=768 scale=7 ffa_scale=1 period=14.78523
Rep. pulse: num_std_devs=8.239 peak_power=9070764 dm=-896 peak_bin=4096 scale=7 ffa_scale=5 period=118.0686
Rep. pulse: num_std_devs=13.16 peak_power=1144632 dm=-896 peak_bin=11264 scale=7 ffa_scale=2 period=118.0673
Rep. pulse: num_std_devs=11.11 peak_power=573647.1 dm=-896 peak_bin=11520 scale=7 ffa_scale=1 period=118.0673
Rep. pulse: num_std_devs=11.86 peak_power=2279187 dm=-896 peak_bin=6144 scale=7 ffa_scale=3 period=118.0673
Rep. pulse: num_std_devs=8.141 peak_power=4540001 dm=-896 peak_bin=6144 scale=7 ffa_scale=4 period=118.0677
Rep. pulse: num_std_devs=9.277 peak_power=287568.7 dm=-896 peak_bin=18176 scale=7 ffa_scale=1 period=236.1337
Rep. pulse: num_std_devs=10.1 peak_power=572878 dm=-896 peak_bin=11264 scale=7 ffa_scale=2 period=236.1346
Rep. pulse: num_std_devs=8.537 peak_power=287173.8 dm=-896 peak_bin=11648 scale=7 ffa_scale=0 period=118.0673
Rep. pulse: num_std_devs=8.948 peak_power=1140107 dm=-896 peak_bin=6144 scale=7 ffa_scale=3 period=236.1337
Rep. pulse: num_std_devs=7.979 peak_power=286876.9 dm=-896 peak_bin=11264 scale=7 ffa_scale=2 period=472.2691
Rep. pulse: num_std_devs=6.872 peak_power=4537193 dm=-896 peak_bin=24576 scale=7 ffa_scale=5 period=236.1346
Rep. pulse: num_std_devs=7.491 peak_power=144131.7 dm=-896 peak_bin=21632 scale=7 ffa_scale=0 period=236.1346
Rep. pulse: num_std_devs=6.703 peak_power=4536819 dm=-896 peak_bin=1536 scale=7 ffa_scale=2 period=29.51682
Rep. pulse: num_std_devs=7.02 peak_power=2271764 dm=-896 peak_bin=16384 scale=7 ffa_scale=4 period=236.1337

class T_remove_radar: total=1.94e+009, N=1, <>=1.94e+009, min=1.94e+009, max=1.94e+009
class T_main_loop_L1: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFT_forward: total=5.54e+007, N=912, <>=6.08e+004, min=3.52e+004, max=1.03e+006
class T_remove_radar_randomize: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_build_chirp_table: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_DataWrite: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_DataWrite_ns: total=0, N=0, <>=0, min=0 max=0
class T_oclReadBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ChirpWrite: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ChirpWrite_ns: total=0, N=0, <>=0, min=0 max=0
class T_dechirp: total=5.03e+007, N=912, <>=5.52e+004, min=4.71e+004, max=2.36e+005
class Dechirp_ns: total=0, N=0, <>=0, min=0 max=0
class Half_ns: total=0, N=0, <>=0, min=0 max=0
class T_PC_single_pulse_kernel_FFA_update: total=3.64e+010, N=912, <>=3.99e+007, min=3.76e+007, max=1.79e+008
class PC_ns: total=0, N=0, <>=0, min=0 max=0
class T_oclReadBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_oclWriteBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFT_inverse: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ffa: total=4.29e+009, N=8, <>=5.36e+008, min=5.07e+008, max=6.43e+008

FFA blocks counters:
class T_FFA_fetch: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_tt_build: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_compare: total=1.48e+009, N=149, <>=9.94e+006, min=8.02e+005, max=7.96e+007
class T_FFA_coadd: total=4.71e+006, N=595, <>=7.91e+003, min=4.54e+003, max=5.93e+005
class T_FFA_stride_add: total=2.97e+006, N=134, <>=2.22e+004, min=1.68e+004, max=1.15e+005
class T_GPU_buffer_read_backs: total=61, N=61, <>=1, min=1 max=1
TWIN_FFA OCL_ZERO_COPY USE_OPENCL OPENCL_WRITE USE_INCREASED_PRECISION SMALL_CHIRP_TABLE COMBINED_DECHIRP_KERNEL BLANKIT
rev 2742
GPU device sync requested... ...GPU device synched
07:53:03 (4564): called boinc_finish(0)
[ /stderr ]

------------
AP7_win_x64_AVX_CPU_r2692.exe -verbose / short_ap_21oc08ab_B2_P0_00081_20081130_08605.dat :
Result cached, skipping execution
193.870 secs Elapsed
191.219 secs CPU time

Stderr.txt : not found
------------
AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe -verbose / short_ap_21oc08ab_B2_P0_00081_20081130_08605.dat :
AppName: AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe
AppArgs: -verbose
TaskName: short_ap_21oc08ab_B2_P0_00081_20081130_08605.dat
Started at : 07:53:08.728
Ended at : 07:53:22.955
14.065 secs Elapsed
3.422 secs CPU time
Speedup : 98.21%
Ratio : 55.88x

ref-AP7_win_x64_AVX_CPU_r2692.exe-short_ap_21oc08ab_B2_P0_00081_20081130_08605.dat.res: <ap_signal>40,<pulses>30,<best_pulses>10
result-AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe-short_ap_21oc08ab_B2_P0_00081_20081130_08605.dat.res: <ap_signal>40,<pulses>30,<best_pulses>10
All Signals: Weakly similar or Different.
Pulses: pulse at signal 21 has no match (direction -->)
Weakly similar or Different.
Best Pulses: Weakly similar or Different.

-(.\testDatas\ref\ref-AP7_win_x64_AVX_CPU_r2692.exe-short_ap_21oc08ab_B2_P0_00081_20081130_08605.dat.res)-
Reportable Single Pulses: 0 [OK], 0 above threshold*THRESHOLD_FUDGE
Reportable Repeating Pulses: 30 [Weak]
Single Pulses (Best): 10 [Weak], 0 above threshold*THRESHOLD_FUDGE

-(.\testDatas\result-AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe-short_ap_21oc08ab_B2_P0_00081_20081130_08605.dat.res)-
Reportable Single Pulses: 0 [OK], 0 above threshold*THRESHOLD_FUDGE
Reportable Repeating Pulses: 30 [Weak]
Single Pulses (Best): 0 [Weak], 0 above threshold*THRESHOLD_FUDGE


[ stderr ]
07:53:08 (10264): Can't open init data file - running in standalone mode
Not using ap_cmdline.txt-file, using commandline options.
07:53:08 (10264): Can't open init data file - running in standalone mode
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
07:53:08 (10264): Can't open init data file - running in standalone mode
WARNING: init_data.xml missing
OpenCL platform detected: Advanced Micro Devices, Inc.
WARNING: BOINC supplied wrong platform!
BOINC assigns device 0
WARNING: BOINC failed to provide OpenCL device, using own enumeration abilities
Used GPU device parameters are:
Number of compute units: 36
Single buffer allocation size: 256MB
Total device global memory: 3072MB
max WG size: 256
local mem type: Real
-unroll default value used: 18
-ffa_block default value used: 9216
-ffa_block_fetch default value used: 4608

Build features: Non-graphics BLANKIT OpenCL TWIN_FFA OCL_ZERO_COPY COMBINED_DECHIRP_KERNEL FFTW USE_INCREASED_PRECISION USE_SSE2 x86
CPUID: AMD FX(tm)-8150 Eight-Core Processor

Cache: L1=64K L2=2048K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX SSE4A XOP FMA4
AstroPulse v7 Windows x86 rev 2742, V7 match, by Raistmer with support of Lunatics.kwsn.net team. SSE2

OpenCL version by Raistmer

oclFFT fix for ATI GPUs by Urs Echternacht
ffa threshold mods by Joe Segur
SSE3 dechirping by JDWhale
Combined dechirp kernel by Frizz
Number of OpenCL platforms: 1


OpenCL Platform Name: AMD Accelerated Parallel Processing
Number of devices: 1
Max compute units: 36
Max work group size: 256
Max clock frequency: 1288Mhz
Max memory allocation: 3221225472
Cache type: Read/Write
Cache line size: 64
Cache size: 16384
Global memory size: 3221225472
Constant buffer size: 3221225472
Max number of constant args: 8
Local memory type: Scratchpad
Local memory size: 32768
Queue properties:
Out-of-Order: No
Name: Ellesmere
Vendor: Advanced Micro Devices, Inc.
Driver version: 2348.4
Version: OpenCL 1.2 AMD-APP (2348.4)
Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_khr_image2d_from_buffer cl_khr_spir cl_khr_gl_event cl_amd_liquid_flash


state.fold_buf_size_short=65536; state.fold_buf_size_long=262144
INFO: excessive number of rep. pulses detected. To save system memory FFA will be redone with decreased FFA block value 256.

single pulses: 0
repetitive pulses: 30
Rep. pulse: num_std_devs=8.633 peak_power=39612.73 dm=896 peak_bin=1280 scale=4 ffa_scale=4 period=440.7513
Rep. pulse: num_std_devs=6.557 peak_power=41282.8 dm=-896 peak_bin=1536 scale=4 ffa_scale=2 period=104.9849
Rep. pulse: num_std_devs=6.994 peak_power=81958.25 dm=-1008 peak_bin=1536 scale=4 ffa_scale=3 period=104.9801
Rep. pulse: num_std_devs=6.847 peak_power=162784.2 dm=-1008 peak_bin=0 scale=4 ffa_scale=4 period=104.9304
Rep. pulse: num_std_devs=6.8 peak_power=324184.9 dm=-1008 peak_bin=0 scale=4 ffa_scale=5 period=104.9032
Rep. pulse: num_std_devs=8.178 peak_power=78145.77 dm=1008 peak_bin=1536 scale=4 ffa_scale=5 period=440.563
Rep. pulse: num_std_devs=8.324 peak_power=155133.8 dm=1008 peak_bin=1024 scale=4 ffa_scale=6 period=440.8521
Rep. pulse: num_std_devs=7.769 peak_power=20027.39 dm=896 peak_bin=1408 scale=4 ffa_scale=3 period=440.6907
Rep. pulse: num_std_devs=7.134 peak_power=54955.18 dm=-1008 peak_bin=3328 scale=4 ffa_scale=4 period=314.8522
Rep. pulse: num_std_devs=6.409 peak_power=108742.3 dm=-992 peak_bin=3072 scale=4 ffa_scale=5 period=314.8522
Rep. pulse: num_std_devs=5.499 peak_power=200412.2 dm=976 peak_bin=1024 scale=4 ffa_scale=5 period=169.5545
Rep. pulse: num_std_devs=5.864 peak_power=200596.7 dm=1008 peak_bin=1024 scale=4 ffa_scale=4 period=84.78241
Rep. pulse: num_std_devs=7.11 peak_power=10168.54 dm=-960 peak_bin=0 scale=4 ffa_scale=2 period=440.8454
Rep. pulse: num_std_devs=4.208 peak_power=398636.6 dm=-976 peak_bin=0 scale=4 ffa_scale=5 period=84.75783
Rep. pulse: num_std_devs=3.785 peak_power=488902.3 dm=1008 peak_bin=1024 scale=4 ffa_scale=6 period=137.7608
Rep. pulse: num_std_devs=6.595 peak_power=81839.29 dm=-1008 peak_bin=1536 scale=4 ffa_scale=4 period=209.9441
Rep. pulse: num_std_devs=5.641 peak_power=162249 dm=-1008 peak_bin=1536 scale=4 ffa_scale=5 period=209.8448
Rep. pulse: num_std_devs=5.931 peak_power=123584.2 dm=1008 peak_bin=1024 scale=4 ffa_scale=5 period=275.5427
Rep. pulse: num_std_devs=4.881 peak_power=246529.8 dm=-1008 peak_bin=0 scale=4 ffa_scale=5 period=137.6516
Rep. pulse: num_std_devs=6.88 peak_power=2703.361 dm=1008 peak_bin=1600 scale=4 ffa_scale=0 period=440.6302
Rep. pulse: num_std_devs=4.908 peak_power=245513.6 dm=1008 peak_bin=1024 scale=4 ffa_scale=6 period=275.4165
Rep. pulse: num_std_devs=6.323 peak_power=1.811921e+007 dm=-896 peak_bin=0 scale=7 ffa_scale=1 period=3.710067
Rep. pulse: num_std_devs=6.848 peak_power=1.812181e+007 dm=-896 peak_bin=512 scale=7 ffa_scale=2 period=7.379515
Rep. pulse: num_std_devs=8.4 peak_power=567476 dm=-896 peak_bin=4608 scale=7 ffa_scale=2 period=238.7513
Rep. pulse: num_std_devs=8.528 peak_power=1131430 dm=-896 peak_bin=4096 scale=7 ffa_scale=3 period=238.7549
Rep. pulse: num_std_devs=8.608 peak_power=285147.1 dm=-896 peak_bin=11776 scale=7 ffa_scale=1 period=238.754
Rep. pulse: num_std_devs=7.863 peak_power=143237 dm=-896 peak_bin=23552 scale=7 ffa_scale=0 period=238.754
Rep. pulse: num_std_devs=6.616 peak_power=8999861 dm=-896 peak_bin=256 scale=7 ffa_scale=0 period=3.722784
Rep. pulse: num_std_devs=6.283 peak_power=9064402 dm=-896 peak_bin=256 scale=7 ffa_scale=1 period=7.382499
Rep. pulse: num_std_devs=8.262 peak_power=143386.5 dm=-896 peak_bin=38144 scale=7 ffa_scale=1 period=476.8

class T_remove_radar: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_main_loop_L1: total=5.25e+010, N=1, <>=5.25e+010, min=5.25e+010, max=5.25e+010
class T_FFT_forward: total=4.13e+007, N=912, <>=4.53e+004, min=3.55e+004, max=7.76e+005
class T_remove_radar_randomize: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_build_chirp_table: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_DataWrite: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_DataWrite_ns: total=0, N=0, <>=0, min=0 max=0
class T_oclReadBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ChirpWrite: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ChirpWrite_ns: total=0, N=0, <>=0, min=0 max=0
class T_dechirp: total=5.03e+007, N=912, <>=5.52e+004, min=4.71e+004, max=3.19e+005
class Dechirp_ns: total=0, N=0, <>=0, min=0 max=0
class Half_ns: total=0, N=0, <>=0, min=0 max=0
class T_PC_single_pulse_kernel_FFA_update: total=3.60e+010, N=912, <>=3.94e+007, min=3.75e+007, max=1.77e+008
class PC_ns: total=0, N=0, <>=0, min=0 max=0
class T_oclReadBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_oclWriteBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFT_inverse: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ffa: total=1.63e+010, N=9, <>=1.82e+009, min=8.91e+008, max=8.18e+009

FFA blocks counters:
class T_FFA_fetch: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_tt_build: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_compare: total=1.49e+009, N=302, <>=4.93e+006, min=8.06e+005, max=1.27e+008
class T_FFA_coadd: total=6.13e+006, N=737, <>=8.32e+003, min=4.69e+003, max=7.17e+005
class T_FFA_stride_add: total=5.45e+006, N=268, <>=2.03e+004, min=1.70e+004, max=1.20e+005
class T_GPU_buffer_read_backs: total=0, N=0, <>=0, min=0 max=0
TWIN_FFA OCL_ZERO_COPY USE_OPENCL OPENCL_WRITE USE_INCREASED_PRECISION SMALL_CHIRP_TABLE COMBINED_DECHIRP_KERNEL BLANKIT
rev 2742
GPU device sync requested... ...GPU device synched
07:53:20 (10264): called boinc_finish(0)
[ /stderr ]

------------
AP7_win_x64_AVX_CPU_r2692.exe -verbose / ap_Zblank_2LC67.wu :
AppName: AP7_win_x64_AVX_CPU_r2692.exe
AppArgs: -verbose
TaskName: ap_Zblank_2LC67.wu
Started at : 07:53:26.392
Ended at : 08:00:40.121
Result : stored as ref for validations.
433.668 secs Elapsed
431.422 secs CPU time

[ stderr ]
07:53:26 (9372): Can't open init data file - running in standalone mode
Not using ap_cmdline.txt-file, using commandline options.
07:53:26 (9372): Can't open init data file - running in standalone mode

Build features: Non-graphics BLANKIT TWINDECHIRP USE_LRINT FFTW USE_INCREASED_PRECISION USE_AVX x64
CPUID: AMD FX(tm)-8150 Eight-Core Processor

Cache: L1=64K L2=2048K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX SSE4A XOP FMA4
AstroPulse v7 Windows x64 rev 2692, V7 match, by Raistmer with support of Lunatics.kwsn.net team.
by Lunatics team. Built with uncommitted modifications
state.fold_buf_size_short=65536; state.fold_buf_size_long=262144

single pulses: 2
repetitive pulses: 3
percent blanked: 0.00
Single pulse: peak_power=87.34 dm=8582 fft_num=27164672 peak_bin=27169792 scale=5
Rep. pulse: num_std_devs=7.225 peak_power=2806 dm=8640 peak_bin=6528 scale=4 ffa_scale=0 period=426.52
Rep. pulse: num_std_devs=7.013 peak_power=2607 dm=-8720 peak_bin=1328 scale=4 ffa_scale=0 period=460.3341
Single pulse: peak_power=129.8 dm=-8823 fft_num=8028160 peak_bin=8029312 scale=6
Rep. pulse: num_std_devs=7.285 peak_power=2.008e+004 dm=8704 peak_bin=82304 scale=7 ffa_scale=0 period=1755.588

class T_remove_radar: total=1.44e+009, N=1, <>=1.44e+009, min=1.44e+009, max=1.44e+009
class T_main_loop_L1: total=1.98e+012, N=2, <>=9.88e+011, min=9.88e+011, max=9.89e+011
class T_FFT_forward: total=3.76e+010, N=32752, <>=1.15e+006, min=1.09e+006, max=4.73e+006
class T_remove_radar_randomize: total=4.55e+007, N=32752, <>=1.39e+003, min=8.55e+002, max=2.72e+004
class T_build_chirp_table: total=2.15e+009, N=16, <>=1.34e+008, min=1.33e+008, max=1.40e+008
class T_dechirp: total=1.75e+011, N=1048064, <>=1.67e+005, min=3.70e+001, max=7.19e+006
class T_FFT_inverse: total=1.16e+012, N=1048064, <>=1.11e+006, min=1.07e+006, max=1.37e+007
class T_ffa: total=3.82e+011, N=36, <>=1.06e+010, min=4.10e+009, max=6.31e+010

FFA blocks counters:
class T_FFA_fetch: total=3.16e+011, N=2180484, <>=1.45e+005, min=5.53e+004, max=4.07e+006
class T_FFA_tt_build: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_compare: total=3.11e+010, N=92171996, <>=3.37e+002, min=3.70e+001, max=1.89e+006
class T_FFA_coadd: total=1.72e+010, N=92171996, <>=1.87e+002, min=3.70e+001, max=1.88e+006
class T_FFA_stride_add: total=2.19e+009, N=18683764, <>=1.17e+002, min=3.70e+001, max=2.60e+005
USE_INCREASED_PRECISION SMALL_CHIRP_TABLE BLANKIT TWINDECHIRP USE_LRINT
rev 2692
08:00:38 (9372): called boinc_finish(0)
[ /stderr ]
------------
AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe -verbose / ap_Zblank_2LC67.wu :
AppName: AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe
AppArgs: -verbose
TaskName: ap_Zblank_2LC67.wu
Started at : 08:00:43.259
Ended at : 08:01:06.995
23.568 secs Elapsed
3.313 secs CPU time
Speedup : 99.23%
Ratio : 130.22x

ref-AP7_win_x64_AVX_CPU_r2692.exe-ap_Zblank_2LC67.wu.res: <ap_signal>15,<pulses>5,<best_pulses>10
result-AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe-ap_Zblank_2LC67.wu.res: <ap_signal>14,<pulses>4,<best_pulses>10
All Signals: Weakly similar or Different.
Pulses: pulse at signal 0 has no match (direction -->)
Weakly similar or Different.
Best Pulses: Weakly similar or Different.

-(.\testDatas\ref\ref-AP7_win_x64_AVX_CPU_r2692.exe-ap_Zblank_2LC67.wu.res)-
Reportable Single Pulses: 2 [Weak], 0 above threshold*THRESHOLD_FUDGE
Reportable Repeating Pulses: 3 [Weak]
Single Pulses (Best): 10 [Weak], 0 above threshold*THRESHOLD_FUDGE

-(.\testDatas\result-AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe-ap_Zblank_2LC67.wu.res)-
Reportable Single Pulses: 4 [Weak], 1 above threshold*THRESHOLD_FUDGE
Reportable Repeating Pulses: 0 [Weak]
Single Pulses (Best): 10 [Weak], 1 above threshold*THRESHOLD_FUDGE


[ stderr ]
08:00:43 (11188): Can't open init data file - running in standalone mode
Not using ap_cmdline.txt-file, using commandline options.
08:00:43 (11188): Can't open init data file - running in standalone mode
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
08:00:43 (11188): Can't open init data file - running in standalone mode
WARNING: init_data.xml missing
OpenCL platform detected: Advanced Micro Devices, Inc.
WARNING: BOINC supplied wrong platform!
BOINC assigns device 0
WARNING: BOINC failed to provide OpenCL device, using own enumeration abilities
Used GPU device parameters are:
Number of compute units: 36
Single buffer allocation size: 256MB
Total device global memory: 3072MB
max WG size: 256
local mem type: Real
-unroll default value used: 18
-ffa_block default value used: 9216
-ffa_block_fetch default value used: 4608

Build features: Non-graphics BLANKIT OpenCL TWIN_FFA OCL_ZERO_COPY COMBINED_DECHIRP_KERNEL FFTW USE_INCREASED_PRECISION USE_SSE2 x86
CPUID: AMD FX(tm)-8150 Eight-Core Processor

Cache: L1=64K L2=2048K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX SSE4A XOP FMA4
AstroPulse v7 Windows x86 rev 2742, V7 match, by Raistmer with support of Lunatics.kwsn.net team. SSE2

OpenCL version by Raistmer

oclFFT fix for ATI GPUs by Urs Echternacht
ffa threshold mods by Joe Segur
SSE3 dechirping by JDWhale
Combined dechirp kernel by Frizz
Number of OpenCL platforms: 1


OpenCL Platform Name: AMD Accelerated Parallel Processing
Number of devices: 1
Max compute units: 36
Max work group size: 256
Max clock frequency: 1288Mhz
Max memory allocation: 3221225472
Cache type: Read/Write
Cache line size: 64
Cache size: 16384
Global memory size: 3221225472
Constant buffer size: 3221225472
Max number of constant args: 8
Local memory type: Scratchpad
Local memory size: 32768
Queue properties:
Out-of-Order: No
Name: Ellesmere
Vendor: Advanced Micro Devices, Inc.
Driver version: 2348.4
Version: OpenCL 1.2 AMD-APP (2348.4)
Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_khr_image2d_from_buffer cl_khr_spir cl_khr_gl_event cl_amd_liquid_flash


state.fold_buf_size_short=65536; state.fold_buf_size_long=262144

single pulses: 4
repetitive pulses: 0
percent blanked: 0.00
Single pulse: peak_power=88.3304 dm=8585 fft_num=27164672 peak_bin=27169792 scale=5
Single pulse: peak_power=61.6616 dm=8692 fft_num=27164672 peak_bin=27169808 scale=4
Single pulse: peak_power=129.884 dm=8759 fft_num=27164672 peak_bin=27169792 scale=6
Single pulse: peak_power=87.025 dm=8821 fft_num=27164672 peak_bin=27169824 scale=5

class T_remove_radar: total=1.93e+009, N=1, <>=1.93e+009, min=1.93e+009, max=1.93e+009
class T_main_loop_L1: total=9.30e+010, N=2, <>=4.65e+010, min=4.63e+010, max=4.67e+010
class T_FFT_forward: total=8.41e+007, N=1824, <>=4.61e+004, min=3.56e+004, max=1.00e+006
class T_remove_radar_randomize: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_build_chirp_table: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_DataWrite: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_DataWrite_ns: total=0, N=0, <>=0, min=0 max=0
class T_oclReadBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ChirpWrite: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ChirpWrite_ns: total=0, N=0, <>=0, min=0 max=0
class T_dechirp: total=9.96e+007, N=1824, <>=5.46e+004, min=4.74e+004, max=2.68e+005
class Dechirp_ns: total=0, N=0, <>=0, min=0 max=0
class Half_ns: total=0, N=0, <>=0, min=0 max=0
class T_PC_single_pulse_kernel_FFA_update: total=7.21e+010, N=1824, <>=3.95e+007, min=3.76e+007, max=1.83e+008
class PC_ns: total=0, N=0, <>=0, min=0 max=0
class T_oclReadBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_oclWriteBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFT_inverse: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ffa: total=1.89e+010, N=18, <>=1.05e+009, min=5.07e+008, max=5.20e+009

FFA blocks counters:
class T_FFA_fetch: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_tt_build: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_compare: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_coadd: total=6.65e+006, N=966, <>=6.89e+003, min=4.75e+003, max=5.67e+005
class T_FFA_stride_add: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_GPU_buffer_read_backs: total=9, N=9, <>=1, min=1 max=1
TWIN_FFA OCL_ZERO_COPY USE_OPENCL OPENCL_WRITE USE_INCREASED_PRECISION SMALL_CHIRP_TABLE COMBINED_DECHIRP_KERNEL BLANKIT
rev 2742
GPU device sync requested... ...GPU device synched
08:01:04 (11188): called boinc_finish(0)
[ /stderr ]

------------
AP7_win_x64_AVX_CPU_r2692.exe -verbose / ap_Zblank_9LC67.wu :
AppName: AP7_win_x64_AVX_CPU_r2692.exe
AppArgs: -verbose
TaskName: ap_Zblank_9LC67.wu
Started at : 08:01:10.402
Ended at : 08:33:29.397
Result : stored as ref for validations.
1938.928 secs Elapsed
1933.547 secs CPU time

[ stderr ]
08:01:10 (8768): Can't open init data file - running in standalone mode
Not using ap_cmdline.txt-file, using commandline options.
08:01:10 (8768): Can't open init data file - running in standalone mode

Build features: Non-graphics BLANKIT TWINDECHIRP USE_LRINT FFTW USE_INCREASED_PRECISION USE_AVX x64
CPUID: AMD FX(tm)-8150 Eight-Core Processor

Cache: L1=64K L2=2048K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX SSE4A XOP FMA4
AstroPulse v7 Windows x64 rev 2692, V7 match, by Raistmer with support of Lunatics.kwsn.net team.
by Lunatics team. Built with uncommitted modifications
state.fold_buf_size_short=65536; state.fold_buf_size_long=262144

single pulses: 4
repetitive pulses: 4
percent blanked: 0.00
Single pulse: peak_power=87.34 dm=8582 fft_num=27164672 peak_bin=27169792 scale=5
Rep. pulse: num_std_devs=7.225 peak_power=2806 dm=8640 peak_bin=6528 scale=4 ffa_scale=0 period=426.52
Rep. pulse: num_std_devs=7.013 peak_power=2607 dm=-8720 peak_bin=1328 scale=4 ffa_scale=0 period=460.3341
Single pulse: peak_power=129.8 dm=-8823 fft_num=8028160 peak_bin=8029312 scale=6
Rep. pulse: num_std_devs=7.285 peak_power=2.008e+004 dm=8704 peak_bin=82304 scale=7 ffa_scale=0 period=1755.588
Single pulse: peak_power=87.03 dm=-9364 fft_num=3473408 peak_bin=3478272 scale=5
Single pulse: peak_power=62.94 dm=-9636 fft_num=2375680 peak_bin=2388944 scale=4
Rep. pulse: num_std_devs=6.924 peak_power=2.252e+004 dm=-9600 peak_bin=140800 scale=7 ffa_scale=0 period=1556.547

class T_remove_radar: total=1.46e+009, N=1, <>=1.46e+009, min=1.46e+009, max=1.46e+009
class T_main_loop_L1: total=8.87e+012, N=9, <>=9.86e+011, min=9.84e+011, max=9.90e+011
class T_FFT_forward: total=1.71e+011, N=147384, <>=1.16e+006, min=1.09e+006, max=1.57e+007
class T_remove_radar_randomize: total=2.09e+008, N=147384, <>=1.42e+003, min=8.32e+002, max=6.76e+004
class T_build_chirp_table: total=1.01e+010, N=72, <>=1.40e+008, min=1.38e+008, max=1.45e+008
class T_dechirp: total=7.97e+011, N=4716288, <>=1.69e+005, min=3.70e+001, max=1.75e+007
class T_FFT_inverse: total=5.18e+012, N=4716288, <>=1.10e+006, min=1.06e+006, max=2.59e+007
class T_ffa: total=1.72e+012, N=162, <>=1.06e+010, min=4.10e+009, max=6.37e+010

FFA blocks counters:
class T_FFA_fetch: total=1.42e+012, N=9812178, <>=1.45e+005, min=5.52e+004, max=1.34e+007
class T_FFA_tt_build: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_compare: total=1.40e+011, N=414773982, <>=3.37e+002, min=3.70e+001, max=1.10e+007
class T_FFA_coadd: total=7.82e+010, N=414773982, <>=1.88e+002, min=3.70e+001, max=1.04e+007
class T_FFA_stride_add: total=9.89e+009, N=84076938, <>=1.17e+002, min=3.70e+001, max=1.80e+006
USE_INCREASED_PRECISION SMALL_CHIRP_TABLE BLANKIT TWINDECHIRP USE_LRINT
rev 2692
08:33:27 (8768): called boinc_finish(0)
[ /stderr ]
------------
AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe -verbose / ap_Zblank_9LC67.wu :
AppName: AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe
AppArgs: -verbose
TaskName: ap_Zblank_9LC67.wu
Started at : 08:33:32.535
Ended at : 08:35:07.161
94.465 secs Elapsed
11.672 secs CPU time
Speedup : 99.40%
Ratio : 165.66x

ref-AP7_win_x64_AVX_CPU_r2692.exe-ap_Zblank_9LC67.wu.res: <ap_signal>18,<pulses>8,<best_pulses>10
result-AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe-ap_Zblank_9LC67.wu.res: <ap_signal>19,<pulses>9,<best_pulses>10
All Signals: Weakly similar or Different.
Pulses: pulse at signal 0 has no match (direction -->)
Weakly similar or Different.
Best Pulses: Weakly similar or Different.

-(.\testDatas\ref\ref-AP7_win_x64_AVX_CPU_r2692.exe-ap_Zblank_9LC67.wu.res)-
Reportable Single Pulses: 4 [Weak], 1 above threshold*THRESHOLD_FUDGE
Reportable Repeating Pulses: 4 [Weak]
Single Pulses (Best): 10 [Weak], 1 above threshold*THRESHOLD_FUDGE

-(.\testDatas\result-AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe-ap_Zblank_9LC67.wu.res)-
Reportable Single Pulses: 9 [Weak], 5 above threshold*THRESHOLD_FUDGE
Reportable Repeating Pulses: 0 [Weak]
Single Pulses (Best): 10 [Weak], 2 above threshold*THRESHOLD_FUDGE


[ stderr ]
08:33:32 (2376): Can't open init data file - running in standalone mode
Not using ap_cmdline.txt-file, using commandline options.
08:33:32 (2376): Can't open init data file - running in standalone mode
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
08:33:32 (2376): Can't open init data file - running in standalone mode
WARNING: init_data.xml missing
OpenCL platform detected: Advanced Micro Devices, Inc.
WARNING: BOINC supplied wrong platform!
BOINC assigns device 0
WARNING: BOINC failed to provide OpenCL device, using own enumeration abilities
Used GPU device parameters are:
Number of compute units: 36
Single buffer allocation size: 256MB
Total device global memory: 3072MB
max WG size: 256
local mem type: Real
-unroll default value used: 18
-ffa_block default value used: 9216
-ffa_block_fetch default value used: 4608

Build features: Non-graphics BLANKIT OpenCL TWIN_FFA OCL_ZERO_COPY COMBINED_DECHIRP_KERNEL FFTW USE_INCREASED_PRECISION USE_SSE2 x86
CPUID: AMD FX(tm)-8150 Eight-Core Processor

Cache: L1=64K L2=2048K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX SSE4A XOP FMA4
AstroPulse v7 Windows x86 rev 2742, V7 match, by Raistmer with support of Lunatics.kwsn.net team. SSE2

OpenCL version by Raistmer

oclFFT fix for ATI GPUs by Urs Echternacht
ffa threshold mods by Joe Segur
SSE3 dechirping by JDWhale
Combined dechirp kernel by Frizz
Number of OpenCL platforms: 1


OpenCL Platform Name: AMD Accelerated Parallel Processing
Number of devices: 1
Max compute units: 36
Max work group size: 256
Max clock frequency: 1288Mhz
Max memory allocation: 3221225472
Cache type: Read/Write
Cache line size: 64
Cache size: 16384
Global memory size: 3221225472
Constant buffer size: 3221225472
Max number of constant args: 8
Local memory type: Scratchpad
Local memory size: 32768
Queue properties:
Out-of-Order: No
Name: Ellesmere
Vendor: Advanced Micro Devices, Inc.
Driver version: 2348.4
Version: OpenCL 1.2 AMD-APP (2348.4)
Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_khr_image2d_from_buffer cl_khr_spir cl_khr_gl_event cl_amd_liquid_flash


state.fold_buf_size_short=65536; state.fold_buf_size_long=262144

single pulses: 9
repetitive pulses: 0
percent blanked: 0.00
Single pulse: peak_power=88.3304 dm=8585 fft_num=27164672 peak_bin=27169792 scale=5
Single pulse: peak_power=61.6616 dm=8692 fft_num=27164672 peak_bin=27169808 scale=4
Single pulse: peak_power=129.884 dm=8759 fft_num=27164672 peak_bin=27169792 scale=6
Single pulse: peak_power=87.025 dm=8821 fft_num=27164672 peak_bin=27169824 scale=5
Single pulse: peak_power=213.249 dm=-8904 fft_num=12173312 peak_bin=12176384 scale=7
Single pulse: peak_power=62.6536 dm=-9473 fft_num=2375680 peak_bin=2388960 scale=4
Single pulse: peak_power=64.0883 dm=-9641 fft_num=2375680 peak_bin=2388944 scale=4
Single pulse: peak_power=88.2957 dm=-9634 fft_num=2375680 peak_bin=2388928 scale=5
Single pulse: peak_power=63.431 dm=-9723 fft_num=6062080 peak_bin=6069664 scale=4

class T_remove_radar: total=1.89e+009, N=1, <>=1.89e+009, min=1.89e+009, max=1.89e+009
class T_main_loop_L1: total=4.19e+011, N=9, <>=4.66e+010, min=4.58e+010, max=5.06e+010
class T_FFT_forward: total=3.73e+008, N=8208, <>=4.55e+004, min=3.54e+004, max=1.31e+006
class T_remove_radar_randomize: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_build_chirp_table: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_DataWrite: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_DataWrite_ns: total=0, N=0, <>=0, min=0 max=0
class T_oclReadBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ChirpWrite: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ChirpWrite_ns: total=0, N=0, <>=0, min=0 max=0
class T_dechirp: total=4.47e+008, N=8208, <>=5.44e+004, min=4.69e+004, max=4.60e+005
class Dechirp_ns: total=0, N=0, <>=0, min=0 max=0
class Half_ns: total=0, N=0, <>=0, min=0 max=0
class T_PC_single_pulse_kernel_FFA_update: total=3.28e+011, N=8208, <>=3.99e+007, min=3.73e+007, max=2.56e+009
class PC_ns: total=0, N=0, <>=0, min=0 max=0
class T_oclReadBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_oclWriteBuf: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFT_inverse: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_ffa: total=8.43e+010, N=81, <>=1.04e+009, min=5.02e+008, max=5.24e+009

FFA blocks counters:
class T_FFA_fetch: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_tt_build: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_compare: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_coadd: total=2.95e+007, N=4347, <>=6.78e+003, min=4.64e+003, max=3.43e+006
class T_FFA_stride_add: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_GPU_buffer_read_backs: total=33, N=33, <>=1, min=1 max=1
TWIN_FFA OCL_ZERO_COPY USE_OPENCL OPENCL_WRITE USE_INCREASED_PRECISION SMALL_CHIRP_TABLE COMBINED_DECHIRP_KERNEL BLANKIT
rev 2742
GPU device sync requested... ...GPU device synched
08:35:05 (2376): called boinc_finish(0)
[ /stderr ]

------------
AP7_win_x64_AVX_CPU_r2692.exe -verbose / JasonShort_v5.wu :
AppName: AP7_win_x64_AVX_CPU_r2692.exe
AppArgs: -verbose
TaskName: JasonShort_v5.wu
Started at : 08:35:10.575
Ended at : 08:41:02.518
Result : stored as ref for validations.
351.883 secs Elapsed
349.047 secs CPU time

[ stderr ]
08:35:10 (8288): Can't open init data file - running in standalone mode
Not using ap_cmdline.txt-file, using commandline options.
08:35:10 (8288): Can't open init data file - running in standalone mode

Build features: Non-graphics BLANKIT TWINDECHIRP USE_LRINT FFTW USE_INCREASED_PRECISION USE_AVX x64
CPUID: AMD FX(tm)-8150 Eight-Core Processor

Cache: L1=64K L2=2048K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX SSE4A XOP FMA4
AstroPulse v7 Windows x64 rev 2692, V7 match, by Raistmer with support of Lunatics.kwsn.net team.
by Lunatics team. Built with uncommitted modifications
state.fold_buf_size_short=65536; state.fold_buf_size_long=262144

single pulses: 0
repetitive pulses: 30
percent blanked: 0.00
Rep. pulse: num_std_devs=3.572 peak_power=5.649e+005 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.718569
Rep. pulse: num_std_devs=3.715 peak_power=5.65e+005 dm=-896 peak_bin=64 scale=4 ffa_scale=2 period=7.463398
Rep. pulse: num_std_devs=4.008 peak_power=5.736e+005 dm=-896 peak_bin=64 scale=4 ffa_scale=2 period=7.33895
Rep. pulse: num_std_devs=3.949 peak_power=5.776e+005 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.645032
Rep. pulse: num_std_devs=4.252 peak_power=5.821e+005 dm=-896 peak_bin=64 scale=4 ffa_scale=2 period=7.258542
Rep. pulse: num_std_devs=4.927 peak_power=2.919e+005 dm=-896 peak_bin=16 scale=4 ffa_scale=0 period=3.631154
Rep. pulse: num_std_devs=4.7 peak_power=2.939e+005 dm=-896 peak_bin=128 scale=4 ffa_scale=2 period=14.40675
Rep. pulse: num_std_devs=3.828 peak_power=5.857e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=1 period=3.59285
Rep. pulse: num_std_devs=3.504 peak_power=5.895e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=2 period=7.148409
Rep. pulse: num_std_devs=3.98 peak_power=5.941e+005 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.555221
Rep. pulse: num_std_devs=4.824 peak_power=2.981e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=0 period=3.531917
Rep. pulse: num_std_devs=4.62 peak_power=2.979e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=1 period=7.108598
Rep. pulse: num_std_devs=4.103 peak_power=5.943e+005 dm=-896 peak_bin=64 scale=4 ffa_scale=2 period=7.063834
Rep. pulse: num_std_devs=3.782 peak_power=6.063e+005 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.47992
Rep. pulse: num_std_devs=4.685 peak_power=3e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=2 period=14.05692
Rep. pulse: num_std_devs=4.807 peak_power=3.063e+005 dm=-896 peak_bin=512 scale=4 ffa_scale=4 period=55.31376
Rep. pulse: num_std_devs=3.717 peak_power=6.144e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=1 period=3.422208
Rep. pulse: num_std_devs=4.486 peak_power=3.102e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=2 period=13.61239
Rep. pulse: num_std_devs=3.6 peak_power=6.225e+005 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.387451
Rep. pulse: num_std_devs=3.988 peak_power=6.312e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=1 period=3.329341
Rep. pulse: num_std_devs=4.621 peak_power=3.144e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=1 period=6.703948
Rep. pulse: num_std_devs=4.662 peak_power=3.165e+005 dm=-896 peak_bin=128 scale=4 ffa_scale=2 period=13.31818
Rep. pulse: num_std_devs=4.379 peak_power=3.184e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=3 period=26.50823
Rep. pulse: num_std_devs=3.582 peak_power=6.472e+005 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.261013
Rep. pulse: num_std_devs=4.468 peak_power=3.226e+005 dm=-896 peak_bin=64 scale=4 ffa_scale=1 period=6.534977
Rep. pulse: num_std_devs=4.474 peak_power=3.205e+005 dm=-896 peak_bin=32 scale=4 ffa_scale=0 period=3.292261
Rep. pulse: num_std_devs=4.404 peak_power=3.225e+005 dm=-896 peak_bin=64 scale=4 ffa_scale=2 period=13.1167
Rep. pulse: num_std_devs=3.707 peak_power=6.556e+005 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.205953
Rep. pulse: num_std_devs=4.43 peak_power=3.287e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=0 period=3.205219
Rep. pulse: num_std_devs=5.492 peak_power=1.663e+005 dm=-896 peak_bin=0 scale=4 ffa_scale=0 period=6.37182

class T_remove_radar: total=1.44e+009, N=1, <>=1.44e+009, min=1.44e+009, max=1.44e+009
class T_main_loop_L1: total=1.60e+012, N=2, <>=8.01e+011, min=7.97e+011, max=8.05e+011
class T_FFT_forward: total=3.82e+010, N=32752, <>=1.17e+006, min=1.09e+006, max=1.67e+007
class T_remove_radar_randomize: total=4.67e+007, N=32752, <>=1.43e+003, min=8.46e+002, max=2.65e+004
class T_build_chirp_table: total=1.91e+009, N=16, <>=1.19e+008, min=1.18e+008, max=1.24e+008
class T_dechirp: total=1.83e+011, N=1048063, <>=1.74e+005, min=3.70e+001, max=1.56e+007
class T_FFT_inverse: total=1.16e+012, N=1048063, <>=1.10e+006, min=1.07e+006, max=4.24e+007
class T_ffa: total=1.13e+009, N=1, <>=1.13e+009, min=1.13e+009, max=1.13e+009

FFA blocks counters:
class T_FFA_fetch: total=7.47e+008, N=10461, <>=7.14e+004, min=5.67e+004, max=4.08e+005
class T_FFA_tt_build: total=0.00e+000, N=0, <>=0.00e+000, min=1.84e+019, max=0.00e+000
class T_FFA_compare: total=2.65e+008, N=391352, <>=6.76e+002, min=4.00e+001, max=3.91e+005
class T_FFA_coadd: total=5.60e+007, N=391351, <>=1.42e+002, min=4.10e+001, max=5.44e+004
class T_FFA_stride_add: total=7.32e+006, N=83686, <>=8.70e+001, min=3.70e+001, max=2.39e+004
USE_INCREASED_PRECISION SMALL_CHIRP_TABLE BLANKIT TWINDECHIRP USE_LRINT
rev 2692
08:41:00 (8288): called boinc_finish(0)
[ /stderr ]
------------
AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe -verbose / JasonShort_v5.wu :
AppName: AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe
AppArgs: -verbose
TaskName: JasonShort_v5.wu
Started at : 08:41:05.653
Ended at : 08:41:26.201
20.387 secs Elapsed
3.609 secs CPU time
Speedup : 98.97%
Ratio : 96.72x

ref-AP7_win_x64_AVX_CPU_r2692.exe-JasonShort_v5.wu.res: <ap_signal>40,<pulses>30,<best_pulses>10
result-AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe-JasonShort_v5.wu.res: <ap_signal>40,<pulses>30,<best_pulses>10
All Signals: Weakly similar or Different.
Pulses: Checked 30, 30 , Strongly Similar
Best Pulses: Weakly similar or Different.

-(.\testDatas\ref\ref-AP7_win_x64_AVX_CPU_r2692.exe-JasonShort_v5.wu.res)-
Reportable Single Pulses: 0 [OK], 0 above threshold*THRESHOLD_FUDGE
Reportable Repeating Pulses: 30 [OK]
Single Pulses (Best): 10 [Weak], 0 above threshold*THRESHOLD_FUDGE

-(.\testDatas\result-AP7_win_x86_SSE2_OpenCL_ATI_r2742.exe-JasonShort_v5.wu.res)-
Reportable Single Pulses: 0 [OK], 0 above threshold*THRESHOLD_FUDGE
Reportable Repeating Pulses: 30 [OK]
Single Pulses (Best): 0 [Weak], 0 above threshold*THRESHOLD_FUDGE


[ stderr ]
08:41:05 (10720): Can't open init data file - running in standalone mode
Not using ap_cmdline.txt-file, using commandline options.
08:41:05 (10720): Can't open init data file - running in standalone mode
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
08:41:05 (10720): Can't open init data file - running in standalone mode
WARNING: init_data.xml missing
OpenCL platform detected: Advanced Micro Devices, Inc.
WARNING: BOINC supplied wrong platform!
BOINC assigns device 0
WARNING: BOINC failed to provide OpenCL device, using own enumeration abilities
Used GPU device parameters are:
Number of compute units: 36
Single buffer allocation size: 256MB
Total device global memory: 3072MB
max WG size: 256
local mem type: Real
-unroll default value used: 18
-ffa_block default value used: 9216
-ffa_block_fetch default value used: 4608

Build features: Non-graphics BLANKIT OpenCL TWIN_FFA OCL_ZERO_COPY COMBINED_DECHIRP_KERNEL FFTW USE_INCREASED_PRECISION USE_SSE2 x86
CPUID: AMD FX(tm)-8150 Eight-Core Processor

Cache: L1=64K L2=2048K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX SSE4A XOP FMA4
AstroPulse v7 Windows x86 rev 2742, V7 match, by Raistmer with support of Lunatics.kwsn.net team. SSE2

OpenCL version by Raistmer

oclFFT fix for ATI GPUs by Urs Echternacht
ffa threshold mods by Joe Segur
SSE3 dechirping by JDWhale
Combined dechirp kernel by Frizz
Number of OpenCL platforms: 1


OpenCL Platform Name: AMD Accelerated Parallel Processing
Number of devices: 1
Max compute units: 36
Max work group size: 256
Max clock frequency: 1288Mhz
Max memory allocation: 3221225472
Cache type: Read/Write
Cache line size: 64
Cache size: 16384
Global memory size: 3221225472
Constant buffer size: 3221225472
Max number of constant args: 8
Local memory type: Scratchpad
Local memory size: 32768
Queue properties:
Out-of-Order: No
Name: Ellesmere
Vendor: Advanced Micro Devices, Inc.
Driver version: 2348.4
Version: OpenCL 1.2 AMD-APP (2348.4)
Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_khr_image2d_from_buffer cl_khr_spir cl_khr_gl_event cl_amd_liquid_flash


state.fold_buf_size_short=65536; state.fold_buf_size_long=262144
INFO: excessive number of rep. pulses detected. To save system memory FFA will be redone with decreased FFA block value 256.

single pulses: 0
repetitive pulses: 30
percent blanked: 0.00
Rep. pulse: num_std_devs=3.572 peak_power=564871.4 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.718569
Rep. pulse: num_std_devs=3.714 peak_power=565019.6 dm=-896 peak_bin=64 scale=4 ffa_scale=2 period=7.463398
Rep. pulse: num_std_devs=4.008 peak_power=573564.9 dm=-896 peak_bin=64 scale=4 ffa_scale=2 period=7.33895
Rep. pulse: num_std_devs=3.949 peak_power=577621.9 dm=-896 peak_bin=32 scale=4 ffa_scale=1 period=3.645032
Rep. pulse: num_std_devs=4.252 peak_power=582063.8 dm=-896 peak_bin=64 scale=4 ffa_scale=2 period=7.258542
Rep. pulse: num_std_devs=4.927 peak_power=291947 dm=-896 peak_bin=16 scale=4 ffa_scale=0 period=3.631154
Rep. pulse: num_std_devs=4.7 peak_power=293862.5 dm=-896 peak_bin=128 scale=4 ffa_scale=2 period=14.40675
Rep. pulse: num_std_devs=3.827 peak_power=585730.9 dm=-896 peak_bin=0 scale=4 ffa_scale=1 period=3.5
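A note on reading the summary lines in the output above: the "Speedup" and "Ratio" figures appear to be derived from the CPU-time values, not from the elapsed times. This is inferred from the numbers in this log, not taken from the benchmark's documentation. Below is a minimal Python sketch of that arithmetic; the function names are hypothetical and the timing values are copied from the log.

```python
# Sketch only: reproduces the "Speedup" and "Ratio" lines from the CPU-time
# values in the log. That the benchmark uses this formula is an inference
# from the numbers, and the function names are made up for illustration.

def cpu_time_ratio(ref_cpu_secs, test_cpu_secs):
    """Reference CPU time divided by the tested app's CPU time."""
    return ref_cpu_secs / test_cpu_secs


def cpu_time_speedup_percent(ref_cpu_secs, test_cpu_secs):
    """Percentage of the reference CPU time that the tested app saved."""
    return (1.0 - test_cpu_secs / ref_cpu_secs) * 100.0


# (ref CPU, test CPU, ref elapsed, test elapsed) in seconds, from the log.
runs = {
    "ap_Zblank_9LC67.wu": (1933.547, 11.672, 1938.928, 94.465),
    "JasonShort_v5.wu": (349.047, 3.609, 351.883, 20.387),
}

for name, (ref_cpu, test_cpu, ref_wall, test_wall) in runs.items():
    print(name)
    print(f"  Speedup (CPU time): {cpu_time_speedup_percent(ref_cpu, test_cpu):.2f}%")
    print(f"  Ratio (CPU time):   {cpu_time_ratio(ref_cpu, test_cpu):.2f}x")
    print(f"  Ratio (elapsed):    {ref_wall / test_wall:.2f}x")
```

On elapsed (wall-clock) time the GPU runs are roughly 20.5x and 17.3x faster than the CPU reference, far less than the 165.66x and 96.72x CPU-time ratios, presumably because the OpenCL app uses little CPU time while the GPU does the work.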
12) Message boards : Number crunching : Anything relating to AstroPulse tasks (Message 1875135)
Posted 26 Jun 2017 by Profile Karsten Vinding
Post:
OK.

The test is running right now. I made a mistake in the first run, so those results were not usable.

I have "reinstalled" (deleted and re-unpacked) APBench.

I will reinstall the HD7770 tonight and try to run on that one too.

Is there anything special I could do to run on both GPUs in the same run?
13) Message boards : Number crunching : Anything relating to AstroPulse tasks (Message 1875049)
Posted 25 Jun 2017 by Profile Karsten Vinding
Post:
I will try to run the tests tomorrow when I'm home from work.

Any way to force APBench to use my RX480 (Device 1) and not my HD7770 (Device 0)?

Edit:
Oh well, went ahead and removed the HD7770 from the system.

I have downloaded all the test WUs and started the test.

It'll probably run for many hours; let's see how it goes.
14) Message boards : Number crunching : Anything relating to AstroPulse tasks (Message 1874997)
Posted 25 Jun 2017 by Profile Karsten Vinding
Post:
I would be willing to do so.

But it has been a long time since I used KNABench or KWSNBench, so I would probably need some guidance.

But I think it would be nice to find out what goes wrong, so that we don't provide invalid data to the project.
15) Message boards : Number crunching : Anything relating to AstroPulse tasks (Message 1874891)
Posted 24 Jun 2017 by Profile Karsten Vinding
Post:
By the way, I tried to do some optimisations following the tips in the included readmes in the Lunatics installer.

Using these settings:
-unroll 18 -ffa_block 16384 -ffa_block_fetch 8192

it seems that my results now validate. At least I have 4 valid WUs (but also 2 inconclusives) since I put in these settings yesterday. No invalids yet.

That is a much higher success rate than before, although it could be down to chance.

I'll keep an eye on it.
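A rough check on the "down to chance" caveat. This is a sketch under loud assumptions: that the invalid rate without the new settings is the roughly 4 in 5 reported in Message 1874766, and that results are independent of each other. Neither is established in this thread.

```python
from math import comb

# Assumption: invalid rate before the new settings, taken from the
# "about 4 out of 5" figure in Message 1874766.
prior_invalid_rate = 4 / 5
p_valid = 1 - prior_invalid_rate


def prob_at_least_k_valid(k, n, p):
    """Binomial probability of at least k valid results out of n,
    assuming each result is independently valid with probability p."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


# Case 1: ignore the 2 inconclusives, 4 valid out of 4 conclusive results.
p_four_of_four = prob_at_least_k_valid(4, 4, p_valid)

# Case 2 (pessimistic): both inconclusives end up invalid, so the question
# becomes at least 4 valid out of 6 results.
p_four_of_six = prob_at_least_k_valid(4, 6, p_valid)

print(f"4 of 4 valid by chance:          {p_four_of_four:.4f}")
print(f"at least 4 of 6 valid by chance: {p_four_of_six:.4f}")
```

Under these assumptions the chance is about 0.16% in the first case and about 1.7% in the pessimistic case, so luck alone looks unlikely, although six results is still a very small sample.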
16) Message boards : Number crunching : Anything relating to AstroPulse tasks (Message 1874890)
Posted 24 Jun 2017 by Profile Karsten Vinding
Post:
I haven't had any problems with crashes or the like with any of the recent drivers on my RX480. I'm currently running 17.6.2, and it's rock solid.

But there are problems with AP crunching, and the oldest driver that supports the RX480 (16.6.2) is newer than 15.12, so the situation is difficult.

As things are now, owners of newer hardware are out of luck. Something should be done about this.
17) Message boards : Number crunching : Anything relating to AstroPulse tasks (Message 1874817)
Posted 24 Jun 2017 by Profile Karsten Vinding
Post:
Do you have any more info on this?

The 15.12 drivers are 1½ years old.

I would think this would have been fixed by now, either by AMD or Lunatics?
18) Message boards : Number crunching : Anything relating to AstroPulse tasks (Message 1874766)
Posted 24 Jun 2017 by Profile Karsten Vinding
Post:
Thanks for your answer, Brent.

Sadly this is not the only invalid I have had. In the last week I have crunched about 20 AP WUs with my GPUs, and about 4 out of 5 have been invalid.

Something is wrong, and I'm wondering what it is.
19) Message boards : Number crunching : Anything relating to AstroPulse tasks (Message 1874764)
Posted 24 Jun 2017 by Profile Karsten Vinding
Post:
That's OK, Wiggo.

Perhaps someone else has an idea.

If I don't find an answer I'll probably disable AP WUs for my GPUs. I don't like producing invalid results.

I was hoping Raistmer would have something to say, he knows the innards of the apps.
20) Message boards : Number crunching : Anything relating to AstroPulse tasks (Message 1874760)
Posted 23 Jun 2017 by Profile Karsten Vinding
Post:
I'm running the Beta 6 lunatics version.

Both my GPUs are running at about 70 degrees C; the fan profiles are set to maintain that temperature.

It seems that the WUs crunched on my R7770 (device 0) get validated (I just got one validated now "https://setiathome.berkeley.edu/workunit.php?wuid=2578079182"), and that the ones crunched on my RX480 (device 1) are invalid.

Neither GPU ever produces an invalid WU when running MB WUs. I don't overclock or do anything special.




 
©2017 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.