Task 8703966763

Name 19mr20ab.6391.7020.15.42.169_3
Workunit 3945430102
Created 22 Apr 2020, 21:37:30 UTC
Sent 22 Apr 2020, 21:39:44 UTC
Report deadline 14 Jun 2020, 20:23:45 UTC
Received 23 Apr 2020, 2:18:03 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8888201
Run time 9 min 51 sec
CPU time 9 min 49 sec
Validate state Valid
Credit 92.69
Device peak FLOPS 1,230.09 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG)
windows_intelx86
Peak working set size 109.46 MB
Peak swap size 137.56 MB
Peak disk usage 0.03 MB

Stderr output

<core_client_version>7.16.5</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 1
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 1
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: AMD Ryzen Threadripper 3960X 24-Core Processor  

     Cache: L1=64K L2=512K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX SSE4A 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.434256  NumCfft=194367  NumGauss=1089086152  NumPulse=226452985397  NumTriplet=452836924041
Currently allocated 229 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 1


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 2
  Max compute units:				 28
  Max work group size:				 1024
  Max clock frequency:				 1531Mhz
  Max memory allocation:			 3221225472
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 1376256
  Global memory size:				 12884901888
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 TITAN X (Pascal)
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.19
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics
  Max compute units:				 19
  Max work group size:				 1024
  Max clock frequency:				 1683Mhz
  Max memory allocation:			 2147483648
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 933888
  Global memory size:				 8589934592
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 1070 Ti
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.19
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.434256
Used GPU device parameters are:
	Number of compute units: 19
	Single buffer allocation size: 128MB
	Total device global memory: 8192MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Gaussian: peak=3.858734, mean=0.5609397, ChiSq=1.400108, time=69.63, d_freq=1419151322.26,
	score=0.2363674, null_hyp=2.256395, chirp=3.1249, fft_len=16k
Autocorr: peak=18.55302, time=100.7, delay=4.9053, d_freq=1419149789.59, chirp=-5.9707, fft_len=128k
Gaussian: peak=3.656606, mean=0.5328799, ChiSq=1.401284, time=42.78, d_freq=1419151902.71,
	score=1.566338, null_hyp=2.327192, chirp=-27.128, fft_len=16k
Gaussian: peak=4.055913, mean=0.5130715, ChiSq=1.261274, time=44.46, d_freq=1419151857.2,
	score=3.401004, null_hyp=2.344387, chirp=-27.128, fft_len=16k
Gaussian: peak=4.089258, mean=0.5079444, ChiSq=1.30039, time=46.14, d_freq=1419151811.68,
	score=4.067775, null_hyp=2.398979, chirp=-27.128, fft_len=16k
Gaussian: peak=3.73116, mean=0.5428032, ChiSq=1.399839, time=47.82, d_freq=1419151766.17,
	score=0.7470804, null_hyp=2.283309, chirp=-27.128, fft_len=16k
Pulse: peak=1.970101, time=43.31, period=0.4063, d_freq=1419149130.11, score=1.005, chirp=55.452, fft_len=64 
D:	threshold 0.01134446; unscaled peak power: 0.01138423 exceeds threshold for 0.3506%

Best spike: peak=23.87612, time=105.7, d_freq=1419151326.68, chirp=-87.014, fft_len=32k
Best autocorr: peak=18.55302, time=100.7, delay=4.9053, d_freq=1419149789.59, chirp=-5.9707, fft_len=128k
Best gaussian: peak=4.089258, mean=0.5079444, ChiSq=1.30039, time=46.14, d_freq=1419151811.68,
	score=4.067775, null_hyp=2.398979, chirp=-27.128, fft_len=16k
Best pulse: peak=1.970101, time=43.31, period=0.4063, d_freq=1419149130.11, score=1.005, chirp=55.452, fft_len=64 
Best triplet: peak=0, time=-2.125e+011, period=0, d_freq=0, chirp=0, fft_len=0 
Spike count:    0
Autocorr count: 1
Pulse count:    1
Triplet count:  0
Gaussian count: 5
Wallclock time elapsed since last restart: 586.3 seconds
Fftlength=8,pass=3:Tune: sum=6886.24(ms); min=3.743(ms); max=77.61(ms); mean=33.43(ms); s_mean=40.8; sleep=30(ms); delta=253; N=206; usual
Fftlength=8,pass=4:Tune: sum=4440.44(ms); min=3.994(ms); max=75.05(ms); mean=24.26(ms); s_mean=29.64; sleep=30(ms); delta=246; N=183; usual
Fftlength=8,pass=5:Tune: sum=3302.56(ms); min=2.627(ms); max=45.83(ms); mean=20.9(ms); s_mean=23.49; sleep=15(ms); delta=237; N=158; usual
Fftlength=16,pass=3:Tune: sum=3695.3(ms); min=1.101(ms); max=42.02(ms); mean=17.03(ms); s_mean=24.78; sleep=15(ms); delta=264; N=217; usual
Fftlength=16,pass=4:Tune: sum=2407.63(ms); min=1.68(ms); max=36.88(ms); mean=12.61(ms); s_mean=17.17; sleep=15(ms); delta=254; N=191; usual
Fftlength=16,pass=5:Tune: sum=1781.84(ms); min=1.247(ms); max=25.19(ms); mean=10.54(ms); s_mean=12.17; sleep=15(ms); delta=248; N=169; usual
Fftlength=32,pass=3:Tune: sum=2230.39(ms); min=0.982(ms); max=28.85(ms); mean=10.18(ms); s_mean=13.89; sleep=15(ms); delta=266; N=219; usual
Fftlength=32,pass=4:Tune: sum=1500.99(ms); min=0.8049(ms); max=40.22(ms); mean=7.658(ms); s_mean=8.605; sleep=0(ms); delta=259; N=196; usual
Fftlength=32,pass=5:Tune: sum=1119.81(ms); min=0.8622(ms); max=21.5(ms); mean=6.291(ms); s_mean=7.142; sleep=0(ms); delta=253; N=178; usual
Fftlength=64,pass=3:Tune: sum=2317.09(ms); min=0.4578(ms); max=28.34(ms); mean=9.268(ms); s_mean=10.91; sleep=0(ms); delta=287; N=250; usual
Fftlength=64,pass=4:Tune: sum=1534.67(ms); min=0.2273(ms); max=22.25(ms); mean=6.448(ms); s_mean=6.889; sleep=0(ms); delta=275; N=238; usual
Fftlength=64,pass=5:Tune: sum=1141.04(ms); min=0.2316(ms); max=18.06(ms); mean=5.14(ms); s_mean=10.74; sleep=0(ms); delta=259; N=222; usual
Fftlength=128,pass=3:Tune: sum=7776.12(ms); min=10.2(ms); max=220.7(ms); mean=39.88(ms); s_mean=31.68; sleep=30(ms); delta=1; N=195; high_perf
Fftlength=256,pass=3:Tune: sum=7365.03(ms); min=4.392(ms); max=111.4(ms); mean=18.93(ms); s_mean=15.25; sleep=15(ms); delta=1; N=389; high_perf
Fftlength=512,pass=3:Tune: sum=5511.35(ms); min=2.378(ms); max=44.73(ms); mean=7.075(ms); s_mean=6.11; sleep=0(ms); delta=1; N=779; high_perf
Fftlength=1024,pass=3:Tune: sum=2836.78(ms); min=0.9032(ms); max=3.45(ms); mean=1.82(ms); s_mean=2.01; sleep=0(ms); delta=1; N=1559; usual
Fftlength=2048,pass=3:Tune: sum=2708.84(ms); min=0.4014(ms); max=1.482(ms); mean=0.8691(ms); s_mean=0.8804; sleep=0(ms); delta=1; N=3117; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=13,	N=13,	<>=1,	min=1	max=1
class Gaussian_report:		total=5,	N=5,	<>=1,	min=1	max=1
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=24453,	N=24453,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=454,	N=454,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=12445,	N=12445,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=10,	N=10,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=4,	N=4,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=1,	N=1,	<>=1,	min=1	max=1


class PoT_transfer_not_needed:		total=24449,	N=24449,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=459,	N=459,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
19:04:47 (20052): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.