Task 8704164517

Name blc44_2bit_guppi_58838_01073_TIC434234955_0016.14353.0.19.28.103.vlar_3
Workunit 3915465364
Created 23 Apr 2020, 7:39:28 UTC
Sent 23 Apr 2020, 7:40:18 UTC
Report deadline 27 Jul 2020, 16:12:46 UTC
Received 23 Apr 2020, 12:48:52 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8906239
Run time 21 min 54 sec
CPU time 21 min 32 sec
Validate state Valid
Credit 157.48
Device peak FLOPS 413.23 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG)
windows_intelx86
Peak working set size 64.75 MB
Peak swap size 91.25 MB
Peak disk usage 0.04 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 2
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 2
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: AMD Ryzen 3 1200 Quad-Core Processor            

     Cache: L1=64K L2=512K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX SSE4A 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.021837  NumCfft=105867  NumGauss=0  NumPulse=78601879150  NumTriplet=84315336437
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 1


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 3
  Max compute units:				 8
  Max work group size:				 1024
  Max clock frequency:				 1342Mhz
  Max memory allocation:			 1073741824
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 393216
  Global memory size:				 4294967296
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 960
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.19
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics
  Max compute units:				 6
  Max work group size:				 1024
  Max clock frequency:				 1124Mhz
  Max memory allocation:			 536870912
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 98304
  Global memory size:				 2147483648
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 760
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.19
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics
  Max compute units:				 6
  Max work group size:				 1024
  Max clock frequency:				 1137Mhz
  Max memory allocation:			 536870912
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 98304
  Global memory size:				 2147483648
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 760
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.19
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.021837
Used GPU device parameters are:
	Number of compute units: 6
	Single buffer allocation size: 128MB
	Total device global memory: 2048MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Pulse: peak=0.3791629, time=42.07, period=0.1968, d_freq=10322466801.6, score=1.02, chirp=5.939, fft_len=64 
D:	threshold 0.005357303; unscaled peak power: 0.005386788 exceeds threshold for 0.5504%
Pulse: peak=3.776949, time=42.1, period=7.74, d_freq=10322463007, score=1.013, chirp=6.5888, fft_len=1024 
D:	threshold 0.3084746; unscaled peak power: 0.3115324 exceeds threshold for 0.9913%
Spike: peak=24.09425, time=62.99, d_freq=10322468739.1, chirp=27.805, fft_len=128k
Triplet: peak=11.31038, time=62.39, period=19.28, d_freq=10322468451.1, chirp=-31.179, fft_len=128 
Triplet: peak=11.31888, time=62.39, period=19.28, d_freq=10322468451.1, chirp=-31.179, fft_len=128 
Pulse: peak=2.3633, time=49.62, period=4.021, d_freq=10322466050.9, score=1.056, chirp=34.055, fft_len=1024 
D:	threshold 0.2016707; unscaled peak power: 0.2094737 exceeds threshold for 3.869%
Pulse: peak=1.458828, time=49.55, period=1.97, d_freq=10322463702.3, score=1.03, chirp=34.518, fft_len=512 
D:	threshold 0.07466526; unscaled peak power: 0.07596139 exceeds threshold for 1.736%
Pulse: peak=6.494862, time=42.14, period=16.33, d_freq=10322471886.9, score=1.028, chirp=-35.168, fft_len=2k
D:	threshold 0.907074; unscaled peak power: 0.9293213 exceeds threshold for 2.453%
Pulse: peak=3.725936, time=49.66, period=8.165, d_freq=10322471622.6, score=1.021, chirp=-35.168, fft_len=2k
D:	threshold 0.5862373; unscaled peak power: 0.5957617 exceeds threshold for 1.625%
Triplet: peak=11.7209, time=54.81, period=35.08, d_freq=10322464900, chirp=37.766, fft_len=1024 
Triplet: peak=10.51585, time=38.01, period=37.08, d_freq=10322469750.3, chirp=-39.344, fft_len=256 
Pulse: peak=5.471599, time=42.05, period=13.69, d_freq=10322470703.3, score=1.029, chirp=-70.428, fft_len=4k
D:	threshold 1.705582; unscaled peak power: 1.747583 exceeds threshold for 2.463%
Pulse: peak=5.571821, time=49.57, period=13.69, d_freq=10322470174, score=1.048, chirp=-70.428, fft_len=4k
D:	threshold 1.679572; unscaled peak power: 1.747583 exceeds threshold for 4.049%
Pulse: peak=0.7944852, time=42.1, period=0.8403, d_freq=10322465124, score=1.006, chirp=72.006, fft_len=1024 
D:	threshold 0.1121795; unscaled peak power: 0.1124726 exceeds threshold for 0.2613%

Best spike: peak=24.09425, time=62.99, d_freq=10322468739.1, chirp=27.805, fft_len=128k
Best autocorr: peak=16.22533, time=62.99, delay=4.4842, d_freq=10322466709.9, chirp=-17.36, fft_len=128k
Best gaussian: peak=0, mean=0, ChiSq=0, time=-2.124e+011, d_freq=0,
	score=-12, null_hyp=0, chirp=0, fft_len=0 
Best pulse: peak=2.3633, time=49.62, period=4.021, d_freq=10322466050.9, score=1.056, chirp=34.055, fft_len=1024 
Best triplet: peak=11.7209, time=54.81, period=35.08, d_freq=10322464900, chirp=37.766, fft_len=1024 
Spike count:    1
Autocorr count: 0
Pulse count:    9
Triplet count:  4
Gaussian count: 0
Wallclock time elapsed since last restart: 1309.2 seconds
Fftlength=32,pass=3:Tune: sum=61352.5(ms); min=6.199(ms); max=72.97(ms); mean=56.65(ms); s_mean=54.52; sleep=45(ms); delta=339; N=1083; usual
Fftlength=32,pass=4:Tune: sum=48230.9(ms); min=5.726(ms); max=73.79(ms); mean=52.6(ms); s_mean=56.12; sleep=45(ms); delta=387; N=917; usual
Fftlength=32,pass=5:Tune: sum=37848.8(ms); min=5.323(ms); max=75.18(ms); mean=48.34(ms); s_mean=60.38; sleep=60(ms); delta=447; N=783; usual
Fftlength=64,pass=3:Tune: sum=58896.7(ms); min=3.199(ms); max=69.72(ms); mean=53.45(ms); s_mean=58.91; sleep=60(ms); delta=360; N=1102; usual
Fftlength=64,pass=4:Tune: sum=44861.2(ms); min=2.889(ms); max=80.4(ms); mean=48.24(ms); s_mean=47.16; sleep=45(ms); delta=448; N=930; usual
Fftlength=64,pass=5:Tune: sum=35573.9(ms); min=2.725(ms); max=78.68(ms); mean=44.52(ms); s_mean=63.5; sleep=60(ms); delta=503; N=799; usual
Fftlength=128,pass=3:Tune: sum=57564.2(ms); min=1.574(ms); max=78.92(ms); mean=50.36(ms); s_mean=46.96; sleep=45(ms); delta=415; N=1143; usual
Fftlength=128,pass=4:Tune: sum=43349.3(ms); min=1.471(ms); max=85.76(ms); mean=44.14(ms); s_mean=45.74; sleep=45(ms); delta=564; N=982; usual
Fftlength=128,pass=5:Tune: sum=35108.1(ms); min=1.397(ms); max=78.82(ms); mean=43.45(ms); s_mean=64.48; sleep=60(ms); delta=502; N=808; usual
Fftlength=256,pass=3:Tune: sum=57112.4(ms); min=0.8079(ms); max=106.8(ms); mean=47.44(ms); s_mean=48.09; sleep=45(ms); delta=624; N=1204; usual
Fftlength=256,pass=4:Tune: sum=44186.9(ms); min=0.7686(ms); max=81.53(ms); mean=43.45(ms); s_mean=64.8; sleep=60(ms); delta=565; N=1017; usual
Fftlength=256,pass=5:Tune: sum=35933.9(ms); min=0.7244(ms); max=66.49(ms); mean=40.47(ms); s_mean=66.15; sleep=60(ms); delta=502; N=888; usual
Fftlength=512,pass=3:Tune: sum=57187.3(ms); min=0.4307(ms); max=53.65(ms); mean=44.92(ms); s_mean=53.08; sleep=45(ms); delta=1279; N=1273; usual
Fftlength=512,pass=4:Tune: sum=45250.1(ms); min=0.4042(ms); max=42.06(ms); mean=36.08(ms); s_mean=41.97; sleep=30(ms); delta=1259; N=1254; usual
Fftlength=512,pass=5:Tune: sum=37669.4(ms); min=0.3759(ms); max=35.04(ms); mean=30.53(ms); s_mean=34.96; sleep=30(ms); delta=1238; N=1234; usual
Fftlength=1024,pass=3:Tune: sum=81507.2(ms); min=15.67(ms); max=38.68(ms); mean=37.82(ms); s_mean=37.94; sleep=30(ms); delta=1; N=2155; high_perf
Fftlength=2048,pass=3:Tune: sum=86634.5(ms); min=8.426(ms); max=20.62(ms); mean=20.1(ms); s_mean=20.03; sleep=15(ms); delta=1; N=4311; high_perf
Fftlength=4096,pass=3:Tune: sum=83309(ms); min=2.031(ms); max= 10(ms); mean=9.663(ms); s_mean=9.643; sleep=0(ms); delta=1; N=8621; high_perf
Fftlength=8192,pass=3:Tune: sum=106028(ms); min=6.13(ms); max=6.718(ms); mean=6.149(ms); s_mean=6.155; sleep=0(ms); delta=1; N=17243; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=36010,	N=36010,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=919,	N=919,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=34404,	N=34404,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=13,	N=13,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=4,	N=4,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=0,	N=0,	<>=0,	min=0	max=0


class PoT_transfer_not_needed:		total=36001,	N=36001,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=929,	N=929,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
07:49:57 (8088): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.