Task 8697237623

Name 26mr20ac.10530.15706.5.32.113_1
Workunit 3953683156
Created 31 Mar 2020, 4:31:57 UTC
Sent 31 Mar 2020, 6:18:21 UTC
Report deadline 23 May 2020, 17:39:49 UTC
Received 31 Mar 2020, 12:51:35 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8888201
Run time 7 min 28 sec
CPU time 7 min 26 sec
Validate state Valid
Credit 85.86
Device peak FLOPS 1,230.08 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG)
windows_intelx86
Peak working set size 112.07 MB
Peak swap size 151.13 MB
Peak disk usage 0.04 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: AMD Ryzen Threadripper 3960X 24-Core Processor  

     Cache: L1=64K L2=512K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX SSE4A 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.422945  NumCfft=196889  NumGauss=1116694318  NumPulse=226355974487  NumTriplet=452749758727
Currently allocated 229 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 1


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 2
  Max compute units:				 28
  Max work group size:				 1024
  Max clock frequency:				 1531Mhz
  Max memory allocation:			 3221225472
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 1376256
  Global memory size:				 12884901888
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 TITAN X (Pascal)
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.19
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics
  Max compute units:				 19
  Max work group size:				 1024
  Max clock frequency:				 1683Mhz
  Max memory allocation:			 2147483648
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 933888
  Global memory size:				 8589934592
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 1070 Ti
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.19
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.422945
Used GPU device parameters are:
	Number of compute units: 28
	Single buffer allocation size: 128MB
	Total device global memory: 12288MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Pulse: peak=3.273021, time=44.41, period=0.8798, d_freq=1421105544.05, score=1.03, chirp=-11.016, fft_len=128 
D:	threshold 0.03119648; unscaled peak power: 0.03191186 exceeds threshold for 2.293%
Spike: peak=24.63088, time=97.31, d_freq=1421103941.32, chirp=-11.202, fft_len=64k
Spike: peak=26.20865, time=97.31, d_freq=1421103941.31, chirp=-11.221, fft_len=64k
Spike: peak=26.13098, time=97.31, d_freq=1421103941.34, chirp=-11.228, fft_len=64k
Spike: peak=24.92116, time=97.31, d_freq=1421103941.3, chirp=-11.239, fft_len=64k
Spike: peak=26.34449, time=97.31, d_freq=1421103941.33, chirp=-11.246, fft_len=64k
Pulse: peak=3.205247, time=44.41, period=0.8798, d_freq=1421105531.35, score=1.009, chirp=-13.02, fft_len=128 
D:	threshold 0.03192289; unscaled peak power: 0.03213699 exceeds threshold for 0.6707%
Pulse: peak=6.532964, time=69.71, period=2.189, d_freq=1421105853.42, score=1.01, chirp=16.024, fft_len=64 
D:	threshold 0.02888806; unscaled peak power: 0.02913337 exceeds threshold for 0.8492%
Triplet: peak=9.963892, time=80.58, period=3.511, d_freq=1421101323.72, chirp=22.033, fft_len=32 
Triplet: peak=10.84987, time=80.58, period=3.511, d_freq=1421101341.31, chirp=26.039, fft_len=32 
Gaussian: peak=3.733679, mean=0.5240476, ChiSq=1.360866, time=37.75, d_freq=1421099777.24,
	score=1.361258, null_hyp=2.294613, chirp=-29.337, fft_len=16k
Triplet: peak=11.05578, time=80.58, period=3.511, d_freq=1421101358.9, chirp=30.045, fft_len=32 
Triplet: peak=11.51844, time=80.58, period=3.511, d_freq=1421101376.57, chirp=34.052, fft_len=32 
Pulse: peak=0.4662547, time=25.37, period=0.04385, d_freq=1421104127.09, score=1.025, chirp=-48.073, fft_len=16 
D:	threshold 0.001425001; unscaled peak power: 0.001436355 exceeds threshold for 0.7968%
Pulse: peak=0.4719063, time=25.37, period=0.04385, d_freq=1421104127.64, score=1.038, chirp=-72.109, fft_len=16 
D:	threshold 0.001431925; unscaled peak power: 0.001448897 exceeds threshold for 1.185%

Best spike: peak=26.34449, time=97.31, d_freq=1421103941.33, chirp=-11.246, fft_len=64k
Best autocorr: peak=17.56552, time=87.24, delay=1.7149, d_freq=1421103464.42, chirp=-0.58691, fft_len=128k
Best gaussian: peak=3.733679, mean=0.5240476, ChiSq=1.360866, time=37.75, d_freq=1421099777.24,
	score=1.361258, null_hyp=2.294613, chirp=-29.337, fft_len=16k
Best pulse: peak=0.4719063, time=25.37, period=0.04385, d_freq=1421104127.64, score=1.038, chirp=-72.109, fft_len=16 
Best triplet: peak=11.51844, time=80.58, period=3.511, d_freq=1421101376.57, chirp=34.052, fft_len=32 
Spike count:    5
Autocorr count: 0
Pulse count:    5
Triplet count:  4
Gaussian count: 1
Wallclock time elapsed since last restart: 443.9 seconds
Fftlength=8,pass=3:Tune: sum=11116.2(ms); min=3.696(ms); max=94.34(ms); mean=48.33(ms); s_mean=53.97; sleep=45(ms); delta=163; N=230; usual
Fftlength=8,pass=4:Tune: sum=5980.16(ms); min=2.899(ms); max=72.55(ms); mean=31.15(ms); s_mean=50.11; sleep=45(ms); delta=242; N=192; usual
Fftlength=8,pass=5:Tune: sum=4367.64(ms); min=2.719(ms); max=51.21(ms); mean=27.64(ms); s_mean=37.98; sleep=30(ms); delta=241; N=158; usual
Fftlength=16,pass=3:Tune: sum=4386.49(ms); min=1.703(ms); max=52.91(ms); mean=19.94(ms); s_mean=34.15; sleep=30(ms); delta=261; N=220; usual
Fftlength=16,pass=4:Tune: sum=2485.86(ms); min=1.854(ms); max=31.32(ms); mean=13.08(ms); s_mean=25.36; sleep=15(ms); delta=245; N=190; usual
Fftlength=16,pass=5:Tune: sum=1651.46(ms); min=1.371(ms); max=28.8(ms); mean=9.546(ms); s_mean= 19; sleep=15(ms); delta=228; N=173; usual
Fftlength=32,pass=3:Tune: sum=2269.96(ms); min=0.8664(ms); max=30.56(ms); mean=9.458(ms); s_mean=17.4; sleep=15(ms); delta=267; N=240; usual
Fftlength=32,pass=4:Tune: sum=1447.09(ms); min=0.8581(ms); max=25.36(ms); mean=6.319(ms); s_mean=10.51; sleep=0(ms); delta=256; N=229; usual
Fftlength=32,pass=5:Tune: sum=1079.25(ms); min=1.014(ms); max=22.87(ms); mean=4.997(ms); s_mean=6.196; sleep=0(ms); delta=243; N=216; usual
Fftlength=64,pass=3:Tune: sum=1923.74(ms); min=0.5069(ms); max=27.86(ms); mean=6.895(ms); s_mean=11.99; sleep=0(ms); delta=292; N=279; usual
Fftlength=64,pass=4:Tune: sum=1241.72(ms); min=0.3492(ms); max=24.5(ms); mean=4.616(ms); s_mean=6.764; sleep=0(ms); delta=282; N=269; usual
Fftlength=64,pass=5:Tune: sum=985.461(ms); min=0.3052(ms); max=25.08(ms); mean=3.911(ms); s_mean=8.971; sleep=0(ms); delta=265; N=252; usual
Fftlength=128,pass=3:Tune: sum=4977.67(ms); min=6.369(ms); max=33.47(ms); mean=25.01(ms); s_mean=25.09; sleep=15(ms); delta=1; N=199; high_perf
Fftlength=256,pass=3:Tune: sum=2450.5(ms); min=2.599(ms); max=8.982(ms); mean=6.142(ms); s_mean=6.651; sleep=0(ms); delta=1; N=399; high_perf
Fftlength=512,pass=3:Tune: sum=2650.09(ms); min=1.532(ms); max=6.058(ms); mean=3.317(ms); s_mean=3.494; sleep=0(ms); delta=1; N=799; high_perf
Fftlength=1024,pass=3:Tune: sum=2526.84(ms); min=1.026(ms); max=4.316(ms); mean=1.582(ms); s_mean=1.388; sleep=0(ms); delta=1; N=1597; usual
Fftlength=2048,pass=3:Tune: sum=2062.36(ms); min=0.34(ms); max=1.075(ms); mean=0.6455(ms); s_mean=0.8078; sleep=0(ms); delta=1; N=3195; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=18,	N=18,	<>=1,	min=1	max=1
class Gaussian_report:		total=1,	N=1,	<>=1,	min=1	max=1
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=25097,	N=25097,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=436,	N=436,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=12757,	N=12757,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=8,	N=8,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=6,	N=6,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=1,	N=1,	<>=1,	min=1	max=1


class PoT_transfer_not_needed:		total=25095,	N=25095,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=439,	N=439,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
05:42:10 (5456): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.