Task 8644559744

Name blc64_2bit_guppi_58838_20583_TIC427352241_0083.23268.818.19.28.154.vlar_0
Workunit 3929970332
Created 14 Mar 2020, 4:26:24 UTC
Sent 14 Mar 2020, 4:37:51 UTC
Report deadline 6 May 2020, 9:37:33 UTC
Received 14 Mar 2020, 16:20:43 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8906239
Run time 18 min 18 sec
CPU time 18 min 4 sec
Validate state Valid
Credit 110.30
Device peak FLOPS 413.22 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG)
windows_intelx86
Peak working set size 66.59 MB
Peak swap size 91.58 MB
Peak disk usage 0.04 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 1
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 1
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: AMD Ryzen 3 1200 Quad-Core Processor            

     Cache: L1=64K L2=512K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX SSE4A 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.014799  NumCfft=114841  NumGauss=0  NumPulse=45459058560  NumTriplet=58430191776
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 1


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 3
  Max compute units:				 8
  Max work group size:				 1024
  Max clock frequency:				 1342Mhz
  Max memory allocation:			 1073741824
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 393216
  Global memory size:				 4294967296
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 960
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.19
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics
  Max compute units:				 6
  Max work group size:				 1024
  Max clock frequency:				 1124Mhz
  Max memory allocation:			 536870912
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 98304
  Global memory size:				 2147483648
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 760
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.19
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics
  Max compute units:				 6
  Max work group size:				 1024
  Max clock frequency:				 1137Mhz
  Max memory allocation:			 536870912
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 98304
  Global memory size:				 2147483648
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 760
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.19
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.014799
Used GPU device parameters are:
	Number of compute units: 6
	Single buffer allocation size: 128MB
	Total device global memory: 2048MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Autocorr: peak=17.94504, time=51.54, delay=4.842, d_freq=8190239172.77, chirp=4.2711, fft_len=128k
Pulse: peak=0.4717609, time=45.82, period=0.3165, d_freq=8190239647.7, score=1.001, chirp=-8.2465, fft_len=64 
D:	threshold 0.005741141; unscaled peak power: 0.005743129 exceeds threshold for 0.03463%
Pulse: peak=9.521601, time=45.99, period=26.84, d_freq=8190234130.36, score=1.009, chirp=-26.302, fft_len=4k
D:	threshold 2.776424; unscaled peak power: 2.799003 exceeds threshold for 0.8132%
Spike: peak=24.08343, time=40.09, d_freq=8190240069.51, chirp=-28.342, fft_len=128k
Spike: peak=24.04864, time=40.09, d_freq=8190240069.49, chirp=-28.344, fft_len=128k
Pulse: peak=1.636041, time=45.86, period=2.669, d_freq=8190237310.75, score=1.012, chirp=-28.493, fft_len=1024 
D:	threshold 0.1649357; unscaled peak power: 0.1662051 exceeds threshold for 0.7697%
Pulse: peak=1.475636, time=45.84, period=2.072, d_freq=8190239127.98, score=1.04, chirp=33.572, fft_len=512 
D:	threshold 0.07770368; unscaled peak power: 0.0795323 exceeds threshold for 2.353%
Triplet: peak=11.79514, time=35.08, period=7.069, d_freq=8190243585.57, chirp=-35.671, fft_len=2k
Pulse: peak=1.691868, time=45.86, period=2.367, d_freq=8190234632.91, score=1.05, chirp=-88.349, fft_len=1024 
D:	threshold 0.1603014; unscaled peak power: 0.1652953 exceeds threshold for 3.115%
Triplet: peak=10.57912, time=65.16, period=3.143, d_freq=8190235698.31, chirp=-94.533, fft_len=256 

Best spike: peak=24.08343, time=40.09, d_freq=8190240069.51, chirp=-28.342, fft_len=128k
Best autocorr: peak=17.94504, time=51.54, delay=4.842, d_freq=8190239172.77, chirp=4.2711, fft_len=128k
Best gaussian: peak=0, mean=0, ChiSq=0, time=-2.124e+011, d_freq=0,
	score=-12, null_hyp=0, chirp=0, fft_len=0 
Best pulse: peak=1.691868, time=45.86, period=2.367, d_freq=8190234632.91, score=1.05, chirp=-88.349, fft_len=1024 
Best triplet: peak=11.79514, time=35.08, period=7.069, d_freq=8190243585.57, chirp=-35.671, fft_len=2k
Spike count:    2
Autocorr count: 1
Pulse count:    5
Triplet count:  2
Gaussian count: 0
Wallclock time elapsed since last restart: 1093.7 seconds
Fftlength=32,pass=3:Tune: sum=44248.5(ms); min=4.722(ms); max=66.44(ms); mean=45.24(ms); s_mean=54.22; sleep=45(ms); delta=610; N=978; usual
Fftlength=32,pass=4:Tune: sum=35227.4(ms); min=5.757(ms); max=70.86(ms); mean=40.49(ms); s_mean=55.92; sleep=45(ms); delta=705; N=870; usual
Fftlength=32,pass=5:Tune: sum=28493.3(ms); min=5.462(ms); max=71.09(ms); mean=35.66(ms); s_mean=51.07; sleep=45(ms); delta=788; N=799; usual
Fftlength=64,pass=3:Tune: sum=43754.4(ms); min=3.277(ms); max=73.31(ms); mean=43.8(ms); s_mean=62.62; sleep=60(ms); delta=683; N=999; usual
Fftlength=64,pass=4:Tune: sum=34387.1(ms); min=3.005(ms); max=81.27(ms); mean=36.7(ms); s_mean=45.7; sleep=45(ms); delta=817; N=937; usual
Fftlength=64,pass=5:Tune: sum=26845.9(ms); min=2.797(ms); max=67.99(ms); mean=31.14(ms); s_mean=49.46; sleep=45(ms); delta=867; N=862; usual
Fftlength=128,pass=3:Tune: sum=43662.3(ms); min=1.719(ms); max=69.16(ms); mean=41.74(ms); s_mean=63.4; sleep=60(ms); delta=685; N=1046; usual
Fftlength=128,pass=4:Tune: sum=32708.2(ms); min=1.563(ms); max=83.45(ms); mean=32.35(ms); s_mean=43.27; sleep=45(ms); delta=1035; N=1011; usual
Fftlength=128,pass=5:Tune: sum=26500.8(ms); min=1.47(ms); max=70.74(ms); mean=27.29(ms); s_mean=34.58; sleep=30(ms); delta=994; N=971; usual
Fftlength=256,pass=3:Tune: sum=42445.7(ms); min=0.8402(ms); max=62.19(ms); mean=38.38(ms); s_mean=62.14; sleep=60(ms); delta=688; N=1106; usual
Fftlength=256,pass=4:Tune: sum=32443.8(ms); min=0.8066(ms); max=47.7(ms); mean=30.52(ms); s_mean=47.62; sleep=45(ms); delta=1075; N=1063; usual
Fftlength=256,pass=5:Tune: sum=26214.9(ms); min=0.7471(ms); max=38.47(ms); mean=25.7(ms); s_mean=38.4; sleep=30(ms); delta=1031; N=1020; usual
Fftlength=512,pass=3:Tune: sum=42157.3(ms); min=0.4384(ms); max=31.06(ms); mean=26.82(ms); s_mean=30.98; sleep=30(ms); delta=1578; N=1572; usual
Fftlength=512,pass=4:Tune: sum=33055.1(ms); min=0.4187(ms); max=24.41(ms); mean=21.34(ms); s_mean=24.33; sleep=15(ms); delta=1555; N=1549; usual
Fftlength=512,pass=5:Tune: sum=26609.2(ms); min=0.3949(ms); max=19.63(ms); mean=17.43(ms); s_mean=19.6; sleep=15(ms); delta=1532; N=1527; usual
Fftlength=1024,pass=3:Tune: sum=102992(ms); min=0.2386(ms); max=38.68(ms); mean=36.48(ms); s_mean=38.4; sleep=30(ms); delta=2826; N=2823; high_perf
Fftlength=1024,pass=4:Tune: sum=756.594(ms); min=0.2248(ms); max=12.52(ms); mean=4.85(ms); s_mean=10.83; sleep=0(ms); delta=2816; N=156; usual
Fftlength=1024,pass=5:Tune: sum=627.245(ms); min=0.2062(ms); max=10.33(ms); mean=4.296(ms); s_mean=10.13; sleep=0(ms); delta=2805; N=146; usual
Fftlength=2048,pass=3:Tune: sum=60239.5(ms); min=4.616(ms); max=11.38(ms); mean=11.09(ms); s_mean=11.05; sleep=0(ms); delta=1; N=5433; high_perf
Fftlength=4096,pass=3:Tune: sum=58766.9(ms); min=2.28(ms); max=5.673(ms); mean=5.409(ms); s_mean=5.423; sleep=0(ms); delta=1; N=10865; high_perf
Fftlength=8192,pass=3:Tune: sum=89306.8(ms); min=4.047(ms); max=4.186(ms); mean=4.11(ms); s_mean=4.109; sleep=0(ms); delta=1; N=21731; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=54737,	N=54737,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=1015,	N=1015,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=43367,	N=43367,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=10,	N=10,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=8,	N=8,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=0,	N=0,	<>=0,	min=0	max=0


class PoT_transfer_not_needed:		total=54729,	N=54729,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=1024,	N=1024,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
12:16:53 (7288): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.