Task 8704164380

Name blc15_2bit_guppi_58691_58260_PSR_B0355+54_0022.4840.409.22.45.11.vlar_5
Workunit 3818452655
Created 23 Apr 2020, 7:39:28 UTC
Sent 23 Apr 2020, 7:40:18 UTC
Report deadline 15 Jun 2020, 12:40:00 UTC
Received 23 Apr 2020, 9:42:13 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8906239
Run time 12 min 59 sec
CPU time 12 min 44 sec
Validate state Valid
Credit 95.98
Device peak FLOPS 413.23 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG)
windows_intelx86
Peak working set size 98.30 MB
Peak swap size 126.38 MB
Peak disk usage 0.04 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: AMD Ryzen 3 1200 Quad-Core Processor            

     Cache: L1=64K L2=512K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX SSE4A 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.008248  NumCfft=96681  NumGauss=0  NumPulse=26454649728  NumTriplet=39400748192
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 1


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 3
  Max compute units:				 8
  Max work group size:				 1024
  Max clock frequency:				 1342Mhz
  Max memory allocation:			 1073741824
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 393216
  Global memory size:				 4294967296
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 960
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.19
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics
  Max compute units:				 6
  Max work group size:				 1024
  Max clock frequency:				 1124Mhz
  Max memory allocation:			 536870912
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 98304
  Global memory size:				 2147483648
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 760
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.19
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics
  Max compute units:				 6
  Max work group size:				 1024
  Max clock frequency:				 1137Mhz
  Max memory allocation:			 536870912
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 98304
  Global memory size:				 2147483648
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 760
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.19
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.008248
Used GPU device parameters are:
	Number of compute units: 8
	Single buffer allocation size: 128MB
	Total device global memory: 4096MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Spike: peak=24.15611, time=85.9, d_freq=6266727707.08, chirp=13.286, fft_len=128k
Spike: peak=24.14856, time=85.9, d_freq=6266727707.08, chirp=13.291, fft_len=128k
Triplet: peak=9.943137, time=53.37, period=14.91, d_freq=6266726314.56, chirp=32.389, fft_len=16 
Pulse: peak=5.724741, time=45.84, period=11.78, d_freq=6266721955.39, score=1.007, chirp=57.694, fft_len=512 
D:	threshold 0.2120916; unscaled peak power: 0.2133711 exceeds threshold for 0.6033%
Pulse: peak=3.479953, time=45.86, period=7.181, d_freq=6266730342.21, score=1.067, chirp=-62.375, fft_len=1024 
D:	threshold 0.2823045; unscaled peak power: 0.2968392 exceeds threshold for 5.149%
Pulse: peak=2.636061, time=45.86, period=5.346, d_freq=6266726928.22, score=1.001, chirp=-68.828, fft_len=1024 
D:	threshold 0.223712; unscaled peak power: 0.2238234 exceeds threshold for 0.04979%
Pulse: peak=1.598062, time=45.9, period=2.364, d_freq=6266729417.05, score=1.014, chirp=-94.764, fft_len=2k
D:	threshold 0.3134729; unscaled peak power: 0.3161165 exceeds threshold for 0.8433%
Pulse: peak=2.255449, time=45.86, period=4.144, d_freq=6266732274.12, score=1.006, chirp=95.271, fft_len=1024 
D:	threshold 0.2008121; unscaled peak power: 0.2016991 exceeds threshold for 0.4417%

Best spike: peak=24.15611, time=85.9, d_freq=6266727707.08, chirp=13.286, fft_len=128k
Best autocorr: peak=17.27853, time=40.09, delay=5.3788, d_freq=6266726303.97, chirp=-28.514, fft_len=128k
Best gaussian: peak=0, mean=0, ChiSq=0, time=-2.124e+011, d_freq=0,
	score=-12, null_hyp=0, chirp=0, fft_len=0 
Best pulse: peak=3.479953, time=45.86, period=7.181, d_freq=6266730342.21, score=1.067, chirp=-62.375, fft_len=1024 
Best triplet: peak=9.943137, time=53.37, period=14.91, d_freq=6266726314.56, chirp=32.389, fft_len=16 
Spike count:    2
Autocorr count: 0
Pulse count:    5
Triplet count:  1
Gaussian count: 0
Wallclock time elapsed since last restart: 773.4 seconds
Fftlength=32,pass=3:Tune: sum=26605.7(ms); min=5.755(ms); max=68.39(ms); mean=40.43(ms); s_mean=57.62; sleep=60(ms); delta=607; N=658; usual
Fftlength=32,pass=4:Tune: sum=15992.7(ms); min=4.085(ms); max=51.4(ms); mean=27.15(ms); s_mean=43.58; sleep=45(ms); delta=716; N=589; usual
Fftlength=32,pass=5:Tune: sum=12881.9(ms); min=4.377(ms); max=46.93(ms); mean=23.42(ms); s_mean=34.39; sleep=30(ms); delta=677; N=550; usual
Fftlength=64,pass=3:Tune: sum=21906.8(ms); min=2.737(ms); max=64.84(ms); mean=31.07(ms); s_mean=53.38; sleep=45(ms); delta=713; N=705; usual
Fftlength=64,pass=4:Tune: sum=15218(ms); min=2.288(ms); max=51.02(ms); mean=22.88(ms); s_mean=35.18; sleep=30(ms); delta=728; N=665; usual
Fftlength=64,pass=5:Tune: sum=12090(ms); min=2.235(ms); max=40.71(ms); mean=19.25(ms); s_mean=26.44; sleep=15(ms); delta=691; N=628; usual
Fftlength=128,pass=3:Tune: sum=21533.5(ms); min=1.463(ms); max=63.56(ms); mean=28.37(ms); s_mean=53.43; sleep=45(ms); delta=730; N=759; usual
Fftlength=128,pass=4:Tune: sum=15167.8(ms); min=1.137(ms); max=51.71(ms); mean=21.07(ms); s_mean=37.52; sleep=30(ms); delta=751; N=720; usual
Fftlength=128,pass=5:Tune: sum=12309.1(ms); min=1.136(ms); max=46.54(ms); mean=18.1(ms); s_mean=30.33; sleep=30(ms); delta=711; N=680; usual
Fftlength=256,pass=3:Tune: sum=21920.1(ms); min=0.6836(ms); max=59.52(ms); mean=26.73(ms); s_mean=55.58; sleep=45(ms); delta=691; N=820; usual
Fftlength=256,pass=4:Tune: sum=15786.6(ms); min=0.5573(ms); max=42.84(ms); mean=20.34(ms); s_mean=40.87; sleep=30(ms); delta=791; N=776; usual
Fftlength=256,pass=5:Tune: sum=12990.9(ms); min=0.5923(ms); max=35.58(ms); mean=17.75(ms); s_mean=33.84; sleep=30(ms); delta=747; N=732; usual
Fftlength=512,pass=3:Tune: sum=22561.1(ms); min=0.3472(ms); max=30.05(ms); mean=22.47(ms); s_mean=28.83; sleep=30(ms); delta=1011; N=1004; usual
Fftlength=512,pass=4:Tune: sum=15949.9(ms); min=0.3076(ms); max=21.32(ms); mean=16.26(ms); s_mean=20.37; sleep=15(ms); delta=988; N=981; usual
Fftlength=512,pass=5:Tune: sum=12822.1(ms); min=0.3175(ms); max=16.99(ms); mean=13.37(ms); s_mean=16.31; sleep=15(ms); delta=966; N=959; usual
Fftlength=1024,pass=3:Tune: sum=49516.1(ms); min=0.2213(ms); max=33.18(ms); mean=29.35(ms); s_mean=31.89; sleep=30(ms); delta=1690; N=1687; high_perf
Fftlength=1024,pass=4:Tune: sum=579.025(ms); min=0.1823(ms); max=10.16(ms); mean=3.736(ms); s_mean=8.56; sleep=0(ms); delta=1679; N=155; usual
Fftlength=1024,pass=5:Tune: sum=473.437(ms); min=0.1881(ms); max=8.07(ms); mean=3.311(ms); s_mean=7.768; sleep=0(ms); delta=1667; N=143; usual
Fftlength=2048,pass=3:Tune: sum=50547.9(ms); min=6.851(ms); max=16.61(ms); mean=15.99(ms); s_mean=16.02; sleep=15(ms); delta=1; N=3161; high_perf
Fftlength=4096,pass=3:Tune: sum=57058.8(ms); min=3.697(ms); max=9.283(ms); mean=9.024(ms); s_mean=9.008; sleep=0(ms); delta=1; N=6323; high_perf
Fftlength=8192,pass=3:Tune: sum=31550.5(ms); min=2.475(ms); max=2.524(ms); mean=2.495(ms); s_mean=2.495; sleep=0(ms); delta=1; N=12647; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=36996,	N=36996,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=596,	N=596,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=25229,	N=25229,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=14,	N=14,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=11,	N=11,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=0,	N=0,	<>=0,	min=0	max=0


class PoT_transfer_not_needed:		total=36985,	N=36985,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=608,	N=608,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
04:54:34 (5776): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.