Task 8703851659

Name 16mr20ad.17726.17654.10.37.171_3
Workunit 3937663283
Created 22 Apr 2020, 16:07:50 UTC
Sent 22 Apr 2020, 16:11:19 UTC
Report deadline 14 Jun 2020, 12:24:27 UTC
Received 22 Apr 2020, 17:21:28 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8322522
Run time 6 min 54 sec
CPU time 6 min 50 sec
Validate state Valid
Credit 88.46
Device peak FLOPS 617.99 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG)
windows_intelx86
Peak working set size 115.77 MB
Peak swap size 155.71 MB
Peak disk usage 0.03 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 1
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 1
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: AMD Ryzen 7 1700 Eight-Core Processor           

     Cache: L1=64K L2=512K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX SSE4A 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.436582  NumCfft=193833  NumGauss=1083282270  NumPulse=226395127658  NumTriplet=452794334736
Currently allocated 229 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 1


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 2
  Max compute units:				 9
  Max work group size:				 1024
  Max clock frequency:				 1784Mhz
  Max memory allocation:			 805306368
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 147456
  Global memory size:				 3221225472
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 1060 3GB
  Vendor:					 NVIDIA Corporation
  Driver version:				 432.00
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer
  Max compute units:				 9
  Max work group size:				 1024
  Max clock frequency:				 1784Mhz
  Max memory allocation:			 805306368
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 147456
  Global memory size:				 3221225472
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 1060 3GB
  Vendor:					 NVIDIA Corporation
  Driver version:				 432.00
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.436582
Used GPU device parameters are:
	Number of compute units: 9
	Single buffer allocation size: 128MB
	Total device global memory: 3072MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Autocorr: peak=17.83408, time=33.55, delay=3.305, d_freq=1419170412.28, chirp=14.615, fft_len=128k
Triplet: peak=13.41553, time=20.74, period=0.6701, d_freq=1419169910.95, chirp=28.907, fft_len=16 
Triplet: peak=13.10048, time=20.74, period=0.6701, d_freq=1419169932.8, chirp=-28.907, fft_len=16 
Triplet: peak=12.86299, time=20.74, period=0.6701, d_freq=1419169932.8, chirp=-28.907, fft_len=16 
Triplet: peak=13.46864, time=20.74, period=0.6701, d_freq=1419169847.16, chirp=-33.037, fft_len=16 
Triplet: peak=13.21555, time=20.74, period=0.6701, d_freq=1419169847.16, chirp=-33.037, fft_len=16 
Triplet: peak=10.62067, time=42.72, period=2.333, d_freq=1419169048.93, chirp=-35.617, fft_len=256 
Triplet: peak=10.7482, time=42.72, period=2.333, d_freq=1419169048.04, chirp=-37.424, fft_len=256 
Triplet: peak=10.228, time=42.21, period=5.348, d_freq=1419173611.52, chirp=46.523, fft_len=1024 
Triplet: peak=10.9004, time=42.21, period=5.348, d_freq=1419173610.14, chirp=46.716, fft_len=1024 
Pulse: peak=6.105624, time=61.84, period=2.084, d_freq=1419171212.4, score=1.022, chirp=-50.071, fft_len=512 
D:	threshold 0.2031329; unscaled peak power: 0.2069159 exceeds threshold for 1.862%
Pulse: peak=6.017261, time=61.84, period=2.084, d_freq=1419171215.47, score=1.007, chirp=-50.33, fft_len=512 
D:	threshold 0.200143; unscaled peak power: 0.2013349 exceeds threshold for 0.5955%
Pulse: peak=9.295311, time=61.84, period=3.985, d_freq=1419171153.76, score=1.01, chirp=56.007, fft_len=512 
D:	threshold 0.3279865; unscaled peak power: 0.3309096 exceeds threshold for 0.8912%
Triplet: peak=12.51097, time=20.74, period=0.6701, d_freq=1419169943.72, chirp=-57.814, fft_len=16 
Triplet: peak=12.28696, time=20.74, period=0.6701, d_freq=1419169943.72, chirp=-57.814, fft_len=16 
Triplet: peak=13.58288, time=20.74, period=0.6701, d_freq=1419169858.08, chirp=-61.944, fft_len=16 
Triplet: peak=13.30728, time=20.74, period=0.6701, d_freq=1419169858.08, chirp=-61.944, fft_len=16 
Triplet: peak=11.60999, time=20.74, period=0.6701, d_freq=1419169954.64, chirp=-86.722, fft_len=16 
Triplet: peak=11.59616, time=20.74, period=0.6701, d_freq=1419169954.64, chirp=-86.722, fft_len=16 
Triplet: peak=13.32206, time=20.74, period=0.6701, d_freq=1419169869, chirp=-90.851, fft_len=16 
Triplet: peak=13.2549, time=20.74, period=0.6701, d_freq=1419169869, chirp=-90.851, fft_len=16 

Best spike: peak=23.61537, time=82.21, d_freq=1419168433.62, chirp=11.239, fft_len=32k
Best autocorr: peak=17.83408, time=33.55, delay=3.305, d_freq=1419170412.28, chirp=14.615, fft_len=128k
Best gaussian: peak=3.941068, mean=0.5669376, ChiSq=1.335687, time=15.94, d_freq=1419174109.3,
	score=-0.997654, null_hyp=2.149742, chirp=4.1259, fft_len=16k
Best pulse: peak=6.105624, time=61.84, period=2.084, d_freq=1419171212.4, score=1.022, chirp=-50.071, fft_len=512 
Best triplet: peak=13.58288, time=20.74, period=0.6701, d_freq=1419169858.08, chirp=-61.944, fft_len=16 
Spike count:    0
Autocorr count: 1
Pulse count:    3
Triplet count:  17
Gaussian count: 0
Wallclock time elapsed since last restart: 408.9 seconds
Fftlength=8,pass=3:Tune: sum=6914.28(ms); min=2.082(ms); max=62.97(ms); mean=33.73(ms); s_mean=42.21; sleep=45(ms); delta=247; N=205; usual
Fftlength=8,pass=4:Tune: sum=4687.6(ms); min=4.772(ms); max=43.91(ms); mean=25.76(ms); s_mean=32.96; sleep=30(ms); delta=245; N=182; usual
Fftlength=8,pass=5:Tune: sum=3428.47(ms); min=2.483(ms); max=33.74(ms); mean=21.84(ms); s_mean=28.34; sleep=30(ms); delta=236; N=157; usual
Fftlength=16,pass=3:Tune: sum=4048.04(ms); min=0.9421(ms); max=37.68(ms); mean=18.74(ms); s_mean=29.7; sleep=30(ms); delta=263; N=216; usual
Fftlength=16,pass=4:Tune: sum=2673.93(ms); min=1.304(ms); max=25.59(ms); mean=14.38(ms); s_mean=20.16; sleep=15(ms); delta=249; N=186; usual
Fftlength=16,pass=5:Tune: sum=1931.07(ms); min=0.8765(ms); max=20.21(ms); mean=11.49(ms); s_mean=14.08; sleep=15(ms); delta=239; N=168; usual
Fftlength=32,pass=3:Tune: sum=2595.31(ms); min=0.6779(ms); max=24.78(ms); mean=11.24(ms); s_mean=16.19; sleep=15(ms); delta=266; N=231; usual
Fftlength=32,pass=4:Tune: sum=1811.62(ms); min=0.5019(ms); max=17.92(ms); mean=8.272(ms); s_mean=11.11; sleep=0(ms); delta=254; N=219; usual
Fftlength=32,pass=5:Tune: sum=1345.29(ms); min=0.3798(ms); max=14.33(ms); mean=6.468(ms); s_mean=8.129; sleep=0(ms); delta=243; N=208; usual
Fftlength=64,pass=3:Tune: sum=2474.41(ms); min=0.2591(ms); max=24.49(ms); mean=9.233(ms); s_mean=11.51; sleep=0(ms); delta=285; N=268; usual
Fftlength=64,pass=4:Tune: sum=1784.9(ms); min=0.2785(ms); max=19.1(ms); mean=6.892(ms); s_mean=8.229; sleep=0(ms); delta=276; N=259; usual
Fftlength=64,pass=5:Tune: sum=1342.16(ms); min=0.2089(ms); max=15.17(ms); mean=5.546(ms); s_mean=14.06; sleep=15(ms); delta=259; N=242; usual
Fftlength=128,pass=3:Tune: sum=6670.54(ms); min=14.55(ms); max=36.62(ms); mean=34.56(ms); s_mean=34.84; sleep=30(ms); delta=1; N=193; high_perf
Fftlength=256,pass=3:Tune: sum=6357.71(ms); min=7.084(ms); max=17.82(ms); mean=16.43(ms); s_mean=16.37; sleep=15(ms); delta=1; N=387; high_perf
Fftlength=512,pass=3:Tune: sum=6203.14(ms); min=2.05(ms); max=9.45(ms); mean=8.004(ms); s_mean=7.998; sleep=0(ms); delta=1; N=775; high_perf
Fftlength=1024,pass=3:Tune: sum=1648.15(ms); min=0.9984(ms); max=1.971(ms); mean=1.064(ms); s_mean=1.032; sleep=0(ms); delta=1; N=1549; usual
Fftlength=2048,pass=3:Tune: sum=1177.9(ms); min=0.3635(ms); max=0.8746(ms); mean=0.3801(ms); s_mean=0.3728; sleep=0(ms); delta=1; N=3099; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=35,	N=35,	<>=1,	min=1	max=1
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=24374,	N=24374,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=397,	N=397,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=12377,	N=12377,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=8,	N=8,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=1,	N=1,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=1,	N=1,	<>=1,	min=1	max=1


class PoT_transfer_not_needed:		total=24367,	N=24367,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=405,	N=405,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
18:52:41 (6844): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.