Task 8703966766

Name 19mr20ab.20813.1703.14.41.42_3
Workunit 3945431842
Created 22 Apr 2020, 21:37:30 UTC
Sent 22 Apr 2020, 21:39:44 UTC
Report deadline 14 Jun 2020, 20:18:23 UTC
Received 23 Apr 2020, 0:46:45 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8888201
Run time 7 min 9 sec
CPU time 7 min 6 sec
Validate state Valid
Credit 92.76
Device peak FLOPS 1,230.09 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG) windows_intelx86
Peak working set size 115.78 MB
Peak swap size 143.52 MB
Peak disk usage 0.03 MB

Stderr output

<core_client_version>7.16.5</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: AMD Ryzen Threadripper 3960X 24-Core Processor  

     Cache: L1=64K L2=512K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX SSE4A 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.434338  NumCfft=194261  NumGauss=1087942450  NumPulse=226451329199  NumTriplet=452833597891
Currently allocated 229 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 1


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 2
  Max compute units:				 28
  Max work group size:				 1024
  Max clock frequency:				 1531Mhz
  Max memory allocation:			 3221225472
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 1376256
  Global memory size:				 12884901888
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 TITAN X (Pascal)
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.19
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics
  Max compute units:				 19
  Max work group size:				 1024
  Max clock frequency:				 1683Mhz
  Max memory allocation:			 2147483648
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 933888
  Global memory size:				 8589934592
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 1070 Ti
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.19
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.434338
Used GPU device parameters are:
	Number of compute units: 28
	Single buffer allocation size: 128MB
	Total device global memory: 12288MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Spike: peak=24.33747, time=87.24, d_freq=1420412429.27, chirp=17.922, fft_len=128k
Pulse: peak=1.624056, time=98.98, period=0.3315, d_freq=1420415011.83, score=1.03, chirp=-78.127, fft_len=128 
D:	threshold 0.0199316; unscaled peak power: 0.02030141 exceeds threshold for 1.855%
Pulse: peak=1.578561, time=98.98, period=0.3315, d_freq=1420414986.4, score=1.001, chirp=-79.155, fft_len=128 
D:	threshold 0.02064284; unscaled peak power: 0.02066131 exceeds threshold for 0.08944%
Pulse: peak=1.668588, time=98.98, period=0.3315, d_freq=1420415011.73, score=1.059, chirp=-81.211, fft_len=128 
D:	threshold 0.02012316; unscaled peak power: 0.02084436 exceeds threshold for 3.584%
Pulse: peak=1.596521, time=98.98, period=0.3315, d_freq=1420415011.63, score=1.013, chirp=-84.295, fft_len=128 
D:	threshold 0.02028218; unscaled peak power: 0.02044172 exceeds threshold for 0.7866%
Pulse: peak=0.9153776, time=43.24, period=0.1446, d_freq=1420412653.41, score=1.012, chirp=-90.463, fft_len=32 
D:	threshold 0.003708588; unscaled peak power: 0.003729595 exceeds threshold for 0.5664%

Best spike: peak=24.33747, time=87.24, d_freq=1420412429.27, chirp=17.922, fft_len=128k
Best autocorr: peak=16.2963, time=100.7, delay=1.9131, d_freq=1420410831.16, chirp=6.7046, fft_len=128k
Best gaussian: peak=3.709804, mean=0.5582039, ChiSq=1.36179, time=94.79, d_freq=1420413330.07,
	score=-1.604256, null_hyp=2.132713, chirp=-90.435, fft_len=16k
Best pulse: peak=1.668588, time=98.98, period=0.3315, d_freq=1420415011.73, score=1.059, chirp=-81.211, fft_len=128 
Best triplet: peak=0, time=-2.125e+011, period=0, d_freq=0, chirp=0, fft_len=0 
Spike count:    1
Autocorr count: 0
Pulse count:    5
Triplet count:  0
Gaussian count: 0
Wallclock time elapsed since last restart: 423.6 seconds
Fftlength=8,pass=3:Tune: sum=6234.95(ms); min=3.423(ms); max=78.49(ms); mean=30.27(ms); s_mean=31.59; sleep=30(ms); delta=252; N=206; usual
Fftlength=8,pass=4:Tune: sum=4006.51(ms); min=1.817(ms); max=50.69(ms); mean=21.89(ms); s_mean=23.26; sleep=15(ms); delta=246; N=183; usual
Fftlength=8,pass=5:Tune: sum=2981.73(ms); min=2.594(ms); max=42.94(ms); mean=18.87(ms); s_mean=20.65; sleep=15(ms); delta=237; N=158; usual
Fftlength=16,pass=3:Tune: sum=3731.07(ms); min=1.46(ms); max=83.17(ms); mean=17.6(ms); s_mean=22.73; sleep=15(ms); delta=259; N=212; usual
Fftlength=16,pass=4:Tune: sum=2195.31(ms); min=1.532(ms); max=29.29(ms); mean= 12(ms); s_mean=14.62; sleep=15(ms); delta=246; N=183; usual
Fftlength=16,pass=5:Tune: sum=1725.63(ms); min=1.512(ms); max=56.68(ms); mean=10.85(ms); s_mean=11.26; sleep=0(ms); delta=238; N=159; usual
Fftlength=32,pass=3:Tune: sum=1981.23(ms); min=0.7875(ms); max=71.7(ms); mean=8.965(ms); s_mean=20.13; sleep=15(ms); delta=268; N=221; usual
Fftlength=32,pass=4:Tune: sum=1188.68(ms); min=0.8366(ms); max=26.36(ms); mean=5.973(ms); s_mean=7.883; sleep=0(ms); delta=262; N=199; usual
Fftlength=32,pass=5:Tune: sum=933.791(ms); min=0.8059(ms); max=33.74(ms); mean=5.336(ms); s_mean=11.6; sleep=0(ms); delta=254; N=175; usual
Fftlength=64,pass=3:Tune: sum=1269.21(ms); min=0.3022(ms); max=57.81(ms); mean=5.18(ms); s_mean=6.289; sleep=0(ms); delta=292; N=245; usual
Fftlength=64,pass=4:Tune: sum=951.143(ms); min=0.5028(ms); max=30.33(ms); mean=4.153(ms); s_mean=11.53; sleep=0(ms); delta=284; N=229; usual
Fftlength=64,pass=5:Tune: sum=809.705(ms); min=0.3164(ms); max=34.14(ms); mean=3.874(ms); s_mean=12.49; sleep=15(ms); delta=264; N=209; usual
Fftlength=128,pass=3:Tune: sum=4174.24(ms); min=5.777(ms); max=36.02(ms); mean=21.41(ms); s_mean=17.52; sleep=15(ms); delta=1; N=195; high_perf
Fftlength=256,pass=3:Tune: sum=2732.38(ms); min=2.671(ms); max=26.57(ms); mean=7.024(ms); s_mean=7.807; sleep=0(ms); delta=1; N=389; high_perf
Fftlength=512,pass=3:Tune: sum=2939.98(ms); min=2.737(ms); max=10.19(ms); mean=3.774(ms); s_mean=4.014; sleep=0(ms); delta=1; N=779; high_perf
Fftlength=1024,pass=3:Tune: sum=2756.06(ms); min=0.9728(ms); max=3.845(ms); mean=1.77(ms); s_mean=2.15; sleep=0(ms); delta=1; N=1557; usual
Fftlength=2048,pass=3:Tune: sum=2421.07(ms); min=0.3604(ms); max=1.62(ms); mean=0.7777(ms); s_mean=0.797; sleep=0(ms); delta=1; N=3113; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=21,	N=21,	<>=1,	min=1	max=1
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=24518,	N=24518,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=361,	N=361,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=12425,	N=12425,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=16,	N=16,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=3,	N=3,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=1,	N=1,	<>=1,	min=1	max=1


class PoT_transfer_not_needed:		total=24506,	N=24506,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=374,	N=374,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
17:30:02 (22260): called boinc_finish(0)

</stderr_txt>
]]>
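
The "exceeds threshold for N%" figures on the `D:` lines in the stderr output can be cross-checked with a little arithmetic. The sketch below assumes the percentage is the relative excess of the unscaled peak power over the threshold, i.e. (peak / threshold - 1) * 100. That formula is an inference from the logged numbers, not something taken from the application source. The names `excess_percent`, `check`, and `LOG_LINES` are hypothetical helpers introduced for illustration only.

```python
import re

# The five "D:" lines copied from the stderr output above.
LOG_LINES = [
    "D:\tthreshold 0.0199316; unscaled peak power: 0.02030141 exceeds threshold for 1.855%",
    "D:\tthreshold 0.02064284; unscaled peak power: 0.02066131 exceeds threshold for 0.08944%",
    "D:\tthreshold 0.02012316; unscaled peak power: 0.02084436 exceeds threshold for 3.584%",
    "D:\tthreshold 0.02028218; unscaled peak power: 0.02044172 exceeds threshold for 0.7866%",
    "D:\tthreshold 0.003708588; unscaled peak power: 0.003729595 exceeds threshold for 0.5664%",
]

# Captures threshold, peak power, and the reported percentage.
PATTERN = re.compile(
    r"threshold\s+([0-9.eE+-]+);\s+unscaled peak power:\s+([0-9.eE+-]+)\s+"
    r"exceeds threshold for\s+([0-9.eE+-]+)%"
)


def excess_percent(threshold, peak):
    """Relative excess of peak over threshold, in percent (assumed formula)."""
    return (peak / threshold - 1.0) * 100.0


def check(line, tolerance=0.01):
    """Return (computed, reported, agrees) for a 'D:' line, or None if no match."""
    match = PATTERN.search(line)
    if match is None:
        return None
    threshold, peak, reported = (float(g) for g in match.groups())
    computed = excess_percent(threshold, peak)
    return computed, reported, abs(computed - reported) <= tolerance


if __name__ == "__main__":
    for line in LOG_LINES:
        computed, reported, agrees = check(line)
        print(f"computed={computed:.4f}%  reported={reported}%  agrees={agrees}")
```

The tolerance of 0.01 percentage points allows for the log printing the threshold and peak power to a limited number of significant digits.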



 