Task 8703606652

Name 02ap10ab.27756.17270.11.38.227_3
Workunit 3911210862
Created 22 Apr 2020, 5:05:54 UTC
Sent 22 Apr 2020, 5:07:47 UTC
Report deadline 14 Jun 2020, 17:27:36 UTC
Received 22 Apr 2020, 5:38:11 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8865639
Run time 4 min 55 sec
CPU time 4 min 52 sec
Validate state Valid
Credit 98.57
Device peak FLOPS 1,029.69 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG)
windows_intelx86
Peak working set size 112.75 MB
Peak swap size 148.62 MB
Peak disk usage 0.03 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 1
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 1
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i7-7700K CPU @ 4.20GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.422099  NumCfft=197199  NumGauss=1120013784  NumPulse=226455438629  NumTriplet=452842202835
Currently allocated 229 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 1


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 2
  Max compute units:				 15
  Max work group size:				 1024
  Max clock frequency:				 1784Mhz
  Max memory allocation:			 2147483648
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 737280
  Global memory size:				 8589934592
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 1070
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.59
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics
  Max compute units:				 15
  Max work group size:				 1024
  Max clock frequency:				 1784Mhz
  Max memory allocation:			 2147483648
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 737280
  Global memory size:				 8589934592
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 1070
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.59
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.422099
Used GPU device parameters are:
	Number of compute units: 15
	Single buffer allocation size: 128MB
	Total device global memory: 8192MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Autocorr: peak=18.31343, time=87.24, delay=6.5096, d_freq=1419718493.41, chirp=19.446, fft_len=128k
Autocorr: peak=19.02936, time=33.55, delay=3.8419, d_freq=1419715801.11, chirp=-29.676, fft_len=128k
Spike: peak=24.97607, time=57.04, d_freq=1419714205.19, chirp=-28.966, fft_len=64k
Spike: peak=24.32063, time=57.04, d_freq=1419714205.18, chirp=-28.985, fft_len=64k
Triplet: peak=9.963138, time=75.24, period=0.3064, d_freq=1419721332.67, chirp=35.947, fft_len=32 
Triplet: peak=9.779763, time=75.24, period=0.3064, d_freq=1419721327.99, chirp=39.941, fft_len=32 
Triplet: peak=9.891174, time=75.24, period=0.3064, d_freq=1419721327.99, chirp=39.941, fft_len=32 
Triplet: peak=9.867546, time=75.24, period=0.3064, d_freq=1419721323.38, chirp=43.936, fft_len=32 
Pulse: peak=10.42262, time=101, period=3.739, d_freq=1419717635.02, score=1.049, chirp=89.868, fft_len=64 
D:	threshold 0.04233487; unscaled peak power: 0.04420426 exceeds threshold for 4.416%
Pulse: peak=10.56457, time=101, period=3.739, d_freq=1419717629.85, score=1.063, chirp=95.859, fft_len=64 
D:	threshold 0.04196709; unscaled peak power: 0.04436479 exceeds threshold for 5.713%

Best spike: peak=24.97607, time=57.04, d_freq=1419714205.19, chirp=-28.966, fft_len=64k
Best autocorr: peak=19.02936, time=33.55, delay=3.8419, d_freq=1419715801.11, chirp=-29.676, fft_len=128k
Best gaussian: peak=3.63574, mean=0.5469897, ChiSq=1.406134, time=61.24, d_freq=1419713357.55,
	score=-0.05696106, null_hyp=2.246932, chirp=-53.585, fft_len=16k
Best pulse: peak=10.56457, time=101, period=3.739, d_freq=1419717629.85, score=1.063, chirp=95.859, fft_len=64 
Best triplet: peak=9.963138, time=75.24, period=0.3064, d_freq=1419721332.67, chirp=35.947, fft_len=32 
Spike count:    2
Autocorr count: 2
Pulse count:    2
Triplet count:  4
Gaussian count: 0
Wallclock time elapsed since last restart: 291.7 seconds
Fftlength=8,pass=3:Tune: sum=6023.33(ms); min=2.705(ms); max=54.39(ms); mean=29.1(ms); s_mean=36.75; sleep=30(ms); delta=258; N=207; usual
Fftlength=8,pass=4:Tune: sum=4078.83(ms); min=2.224(ms); max=37.76(ms); mean=21.58(ms); s_mean=30.41; sleep=30(ms); delta=248; N=189; usual
Fftlength=8,pass=5:Tune: sum=3084.36(ms); min=2.124(ms); max=29.3(ms); mean=17.52(ms); s_mean=20.3; sleep=15(ms); delta=235; N=176; usual
Fftlength=16,pass=3:Tune: sum=3567.58(ms); min=1.248(ms); max=33.62(ms); mean=15.18(ms); s_mean=25.68; sleep=15(ms); delta=264; N=235; usual
Fftlength=16,pass=4:Tune: sum=2508.39(ms); min=0.9922(ms); max=23.67(ms); mean=11.3(ms); s_mean=19.32; sleep=15(ms); delta=251; N=222; usual
Fftlength=16,pass=5:Tune: sum=1918.54(ms); min=1.005(ms); max=19.25(ms); mean=9.18(ms); s_mean=13.87; sleep=15(ms); delta=238; N=209; usual
Fftlength=32,pass=3:Tune: sum=2571.93(ms); min=0.5335(ms); max=22.92(ms); mean=9.93(ms); s_mean=15.26; sleep=15(ms); delta=273; N=259; usual
Fftlength=32,pass=4:Tune: sum=1806.33(ms); min=0.4415(ms); max=17.08(ms); mean=7.313(ms); s_mean=10.71; sleep=0(ms); delta=261; N=247; usual
Fftlength=32,pass=5:Tune: sum=1399.64(ms); min=0.4891(ms); max=14.17(ms); mean=5.956(ms); s_mean=7.976; sleep=0(ms); delta=249; N=235; usual
Fftlength=64,pass=3:Tune: sum=2008.52(ms); min=0.2693(ms); max=19.24(ms); mean=7.148(ms); s_mean=9.213; sleep=0(ms); delta=295; N=281; usual
Fftlength=64,pass=4:Tune: sum=1389.17(ms); min=0.2543(ms); max=14.43(ms); mean=5.183(ms); s_mean=6.441; sleep=0(ms); delta=282; N=268; usual
Fftlength=64,pass=5:Tune: sum=1033.45(ms); min=0.2327(ms); max= 12(ms); mean=4.167(ms); s_mean=10.2; sleep=0(ms); delta=262; N=248; usual
Fftlength=128,pass=3:Tune: sum=4587.27(ms); min=10.05(ms); max=23.48(ms); mean=22.82(ms); s_mean=22.9; sleep=15(ms); delta=1; N=201; high_perf
Fftlength=256,pass=3:Tune: sum=4114.82(ms); min=4.582(ms); max=10.63(ms); mean=10.26(ms); s_mean=10.35; sleep=0(ms); delta=1; N=401; high_perf
Fftlength=512,pass=3:Tune: sum=4183.82(ms); min=2.335(ms); max=5.365(ms); mean=5.223(ms); s_mean=5.226; sleep=0(ms); delta=1; N=801; high_perf
Fftlength=1024,pass=3:Tune: sum=1571.35(ms); min=0.9421(ms); max=1.01(ms); mean=0.9803(ms); s_mean=0.9856; sleep=0(ms); delta=1; N=1603; usual
Fftlength=2048,pass=3:Tune: sum=1042.84(ms); min=0.3136(ms); max=0.3337(ms); mean=0.3254(ms); s_mean=0.327; sleep=0(ms); delta=1; N=3205; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=19,	N=19,	<>=1,	min=1	max=1
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=25313,	N=25313,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=302,	N=302,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=12800,	N=12800,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=9,	N=9,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=5,	N=5,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=1,	N=1,	<>=1,	min=1	max=1


class PoT_transfer_not_needed:		total=25304,	N=25304,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=312,	N=312,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
00:13:04 (8224): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.