Task 8703921457

Name 18mr20aa.18361.4566.5.32.113_3
Workunit 3942430630
Created 22 Apr 2020, 19:37:06 UTC
Sent 22 Apr 2020, 19:38:28 UTC
Report deadline 14 Jun 2020, 17:01:21 UTC
Received 22 Apr 2020, 21:57:00 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8833633
Run time 9 min 46 sec
CPU time 9 min 34 sec
Validate state Valid
Credit 86.81
Device peak FLOPS 321.50 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG) windows_intelx86
Peak working set size 144.98 MB
Peak swap size 165.44 MB
Peak disk usage 0.03 MB
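
The Run time and CPU time fields above are given as human-readable strings. As a rough sketch, they can be converted to seconds and compared. The helper name `duration_to_seconds` is hypothetical, and the `hour` unit is an assumption about how longer durations might be written; only `min` and `sec` appear on this page.

```python
import re

# Seconds per unit. "hour" is an assumption; this page only shows min and sec.
UNIT_SECONDS = {"hour": 3600, "min": 60, "sec": 1}

def duration_to_seconds(text):
    """Convert a string such as '9 min 46 sec' to a number of seconds."""
    total = 0.0
    for value, unit in re.findall(r"(\d+(?:\.\d+)?)\s*(hour|min|sec)", text):
        total += float(value) * UNIT_SECONDS[unit]
    return total

run_time = duration_to_seconds("9 min 46 sec")  # "Run time" field
cpu_time = duration_to_seconds("9 min 34 sec")  # "CPU time" field
cpu_fraction = cpu_time / run_time              # share of run time spent on the CPU
print(run_time, cpu_time, round(cpu_fraction, 3))
```

For this task the CPU time is close to the run time, which can be checked against the wallclock figure reported in the stderr output.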

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i7-8700 CPU @ 3.20GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.435503  NumCfft=193945  NumGauss=1084507886  NumPulse=226397667822  NumTriplet=452799347738
Currently allocated 229 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 2


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 1
  Max compute units:				 6
  Max work group size:				 1024
  Max clock frequency:				 1392Mhz
  Max memory allocation:			 1073741824
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 294912
  Global memory size:				 4294967296
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 1050 Ti
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.59
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics


 OpenCL Platform Name:					 Intel(R) OpenCL
Number of devices:				 1
  Max compute units:				 24
  Max work group size:				 256
  Max clock frequency:				 1200Mhz
  Max memory allocation:			 858992640
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 524288
  Global memory size:				 1717985280
  Constant buffer size:				 858992640
  Max number of constant args:			 8
  Local memory type:				 Scratchpad
  Local memory size:				 65536
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 Intel(R) UHD Graphics 630
  Vendor:					 Intel(R) Corporation
  Driver version:				 26.20.100.7262
  Version:					 OpenCL 2.1 NEO 
  Extensions:					 cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_depth_images cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_intel_subgroups cl_intel_required_subgroup_size cl_intel_subgroups_short cl_khr_spir cl_intel_accelerator cl_intel_media_block_io cl_intel_driver_diagnostics cl_khr_priority_hints cl_khr_throttle_hints cl_khr_create_command_queue cl_khr_fp64 cl_khr_subgroups cl_khr_il_program cl_intel_spirv_device_side_avc_motion_estimation cl_intel_spirv_media_block_io cl_intel_spirv_subgroups cl_khr_spirv_no_integer_wrap_decoration cl_khr_mipmap_image cl_khr_mipmap_image_writes cl_intel_unified_shared_memory_preview cl_intel_planar_yuv cl_intel_packed_yuv cl_intel_motion_estimation cl_intel_device_side_avc_motion_estimation cl_intel_advanced_motion_estimation cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_gl_sharing cl_khr_gl_depth_images cl_khr_gl_event cl_khr_gl_msaa_sharing cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_intel_d3d11_nv12_media_sharing cl_intel_simultaneous_sharing 


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.435503
Used GPU device parameters are:
	Number of compute units: 6
	Single buffer allocation size: 128MB
	Total device global memory: 4096MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Gaussian: peak=4.283696, mean=0.5727571, ChiSq=1.386122, time=86.4, d_freq=1421108208.68,
	score=0.5433699, null_hyp=2.2635, chirp=-10.047, fft_len=16k
Spike: peak=24.13458, time=33.55, d_freq=1421101348.08, chirp=-26.599, fft_len=128k
Spike: peak=25.11204, time=20.13, d_freq=1421104485.6, chirp=-27.834, fft_len=128k
Spike: peak=25.29946, time=20.13, d_freq=1421104485.6, chirp=-27.838, fft_len=128k
Spike: peak=24.49589, time=20.13, d_freq=1421104485.6, chirp=-27.842, fft_len=128k
Triplet: peak=11.54317, time=99.52, period=4.938, d_freq=1421102162.36, chirp=-35.063, fft_len=32 
Triplet: peak=11.40996, time=99.52, period=4.938, d_freq=1421102156.98, chirp=-41.25, fft_len=32 

Best spike: peak=25.29946, time=20.13, d_freq=1421104485.6, chirp=-27.838, fft_len=128k
Best autocorr: peak=17.50769, time=87.24, delay=5.6735, d_freq=1421102495.44, chirp=-11.694, fft_len=128k
Best gaussian: peak=4.283696, mean=0.5727571, ChiSq=1.386122, time=86.4, d_freq=1421108208.68,
	score=0.5433699, null_hyp=2.2635, chirp=-10.047, fft_len=16k
Best pulse: peak=4.082139, time=101.2, period=1.389, d_freq=1421108037.51, score=0.9904, chirp=-58.781, fft_len=512 
Best triplet: peak=11.54317, time=99.52, period=4.938, d_freq=1421102162.36, chirp=-35.063, fft_len=32 
Spike count:    4
Autocorr count: 0
Pulse count:    0
Triplet count:  2
Gaussian count: 1
Wallclock time elapsed since last restart: 582.3 seconds
Fftlength=8,pass=3:Tune: sum=9171.29(ms); min=2.914(ms); max=67.7(ms); mean=43.26(ms); s_mean=52.31; sleep=45(ms); delta=196; N=212; usual
Fftlength=8,pass=4:Tune: sum=6043.28(ms); min=6.529(ms); max=56.62(ms); mean=33.2(ms); s_mean=42.49; sleep=45(ms); delta=245; N=182; usual
Fftlength=8,pass=5:Tune: sum=4410.13(ms); min=3.309(ms); max=44.23(ms); mean=28.09(ms); s_mean=36.44; sleep=30(ms); delta=236; N=157; usual
Fftlength=16,pass=3:Tune: sum=5355.44(ms); min=1.745(ms); max=49.27(ms); mean=24.91(ms); s_mean=40.14; sleep=30(ms); delta=262; N=215; usual
Fftlength=16,pass=4:Tune: sum=3566.01(ms); min=1.036(ms); max=34.86(ms); mean=18.01(ms); s_mean=27.91; sleep=30(ms); delta=245; N=198; usual
Fftlength=16,pass=5:Tune: sum=2502.14(ms); min=1.339(ms); max=25.59(ms); mean=13.53(ms); s_mean=18.37; sleep=15(ms); delta=232; N=185; usual
Fftlength=32,pass=3:Tune: sum=3695.21(ms); min=0.7045(ms); max=34.58(ms); mean=15.33(ms); s_mean=23.04; sleep=15(ms); delta=264; N=241; usual
Fftlength=32,pass=4:Tune: sum=2582.26(ms); min=0.7154(ms); max=25.4(ms); mean=11.28(ms); s_mean=15.95; sleep=15(ms); delta=252; N=229; usual
Fftlength=32,pass=5:Tune: sum=1896.62(ms); min=0.5816(ms); max=20.12(ms); mean=8.74(ms); s_mean=11.37; sleep=0(ms); delta=240; N=217; usual
Fftlength=64,pass=3:Tune: sum=3816.03(ms); min=0.3389(ms); max=37.39(ms); mean=13.93(ms); s_mean=17.62; sleep=15(ms); delta=285; N=274; usual
Fftlength=64,pass=4:Tune: sum=2709.95(ms); min=0.4454(ms); max=28.37(ms); mean=10.42(ms); s_mean=12.75; sleep=15(ms); delta=271; N=260; usual
Fftlength=64,pass=5:Tune: sum=1994.59(ms); min=0.2585(ms); max=23.1(ms); mean=8.141(ms); s_mean=19.65; sleep=15(ms); delta=256; N=245; usual
Fftlength=128,pass=3:Tune: sum=4580.43(ms); min=23.04(ms); max=24.28(ms); mean=23.73(ms); s_mean=23.83; sleep=15(ms); delta=1; N=193; usual
Fftlength=128,pass=4:Tune: sum=3479.05(ms); min=17.12(ms); max=18.5(ms); mean=18.03(ms); s_mean=18.1; sleep=15(ms); delta=1; N=193; usual
Fftlength=128,pass=5:Tune: sum=2638.04(ms); min=12.51(ms); max=14.74(ms); mean=13.67(ms); s_mean=13.67; sleep=15(ms); delta=1; N=193; usual
Fftlength=256,pass=3:Tune: sum=10119.9(ms); min=11.7(ms); max=26.99(ms); mean=26.15(ms); s_mean=26.19; sleep=15(ms); delta=1; N=387; high_perf
Fftlength=512,pass=3:Tune: sum=9997.02(ms); min=5.591(ms); max=13.29(ms); mean=12.9(ms); s_mean=13.01; sleep=15(ms); delta=1; N=775; high_perf
Fftlength=1024,pass=3:Tune: sum=2939.53(ms); min=1.855(ms); max=2.431(ms); mean=1.895(ms); s_mean=1.883; sleep=0(ms); delta=1; N=1551; usual
Fftlength=2048,pass=3:Tune: sum=2065.4(ms); min=0.6554(ms); max=0.8325(ms); mean=0.6656(ms); s_mean=0.6642; sleep=0(ms); delta=1; N=3103; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=11,	N=11,	<>=1,	min=1	max=1
class Gaussian_report:		total=1,	N=1,	<>=1,	min=1	max=1
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=24315,	N=24315,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=484,	N=484,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=12390,	N=12390,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=9,	N=9,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=5,	N=5,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=1,	N=1,	<>=1,	min=1	max=1


class PoT_transfer_not_needed:		total=24310,	N=24310,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=490,	N=490,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
17:35:39 (8492): called boinc_finish(0)

</stderr_txt>
]]>
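
The `Tune` lines in the stderr output share a fixed layout, so they can be pulled apart with a regular expression. The sketch below is only illustrative: the meaning of each field is not documented in the log, and the relation `sum ≈ mean × N` is an observation from the printed numbers, not a stated property of the application. The names `TUNE_RE` and `parse_tune_lines` are hypothetical.

```python
import re

# Two lines copied verbatim from the stderr output above.
SAMPLE = """\
Fftlength=8,pass=3:Tune: sum=9171.29(ms); min=2.914(ms); max=67.7(ms); mean=43.26(ms); s_mean=52.31; sleep=45(ms); delta=196; N=212; usual
Fftlength=256,pass=3:Tune: sum=10119.9(ms); min=11.7(ms); max=26.99(ms); mean=26.15(ms); s_mean=26.19; sleep=15(ms); delta=1; N=387; high_perf
"""

# Only the fields needed for the check are captured; the rest are skipped.
TUNE_RE = re.compile(
    r"Fftlength=(?P<fft_len>\d+),pass=(?P<pass_no>\d+):Tune: "
    r"sum=(?P<sum_ms>[\d.]+)\(ms\); .*?"
    r"mean=(?P<mean_ms>[\d.]+)\(ms\); .*?"
    r"N=(?P<n>\d+); (?P<mode>\w+)"
)

def parse_tune_lines(text):
    """Return one dict per Tune line found in text."""
    rows = []
    for match in TUNE_RE.finditer(text):
        rows.append({
            "fft_len": int(match["fft_len"]),
            "pass_no": int(match["pass_no"]),
            "sum_ms": float(match["sum_ms"]),
            "mean_ms": float(match["mean_ms"]),
            "n": int(match["n"]),
            "mode": match["mode"],
        })
    return rows

rows = parse_tune_lines(SAMPLE)
for row in rows:
    # Assumed relation: the reported sum is roughly mean times sample count.
    estimate = row["mean_ms"] * row["n"]
    print(row["fft_len"], row["pass_no"], row["mode"],
          round(row["sum_ms"] - estimate, 2))
```

Running the same parser over the full stderr text would give one row per FFT length and pass, which is convenient for comparing the `usual` and `high_perf` entries.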



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.