Task 8703841578

Name 16mr20aa.10899.25016.11.38.35.vlar_3
Workunit 3936461744
Created 22 Apr 2020, 16:07:43 UTC
Sent 22 Apr 2020, 16:09:24 UTC
Report deadline 14 Jun 2020, 21:09:06 UTC
Received 23 Apr 2020, 3:56:22 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8914093
Run time 20 min 45 sec
CPU time 20 min 9 sec
Validate state Valid
Credit 157.94
Device peak FLOPS 960.21 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG) windows_intelx86
Peak working set size 136.10 MB
Peak swap size 153.63 MB
Peak disk usage 0.04 MB

Stderr output

<core_client_version>7.16.5</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i7-9750H CPU @ 2.60GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.018810  NumCfft=145999  NumGauss=0  NumPulse=50175070336  NumTriplet=67968952480
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 2


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 1
  Max compute units:				 16
  Max work group size:				 1024
  Max clock frequency:				 1560Mhz
  Max memory allocation:			 1073741824
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 524288
  Global memory size:				 4294967296
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 1650
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.59
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics


 OpenCL Platform Name:					 Intel(R) OpenCL
Number of devices:				 1
  Max compute units:				 24
  Max work group size:				 256
  Max clock frequency:				 1150Mhz
  Max memory allocation:			 858966016
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 524288
  Global memory size:				 1717932032
  Constant buffer size:				 858966016
  Max number of constant args:			 8
  Local memory type:				 Scratchpad
  Local memory size:				 65536
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 Intel(R) UHD Graphics 630
  Vendor:					 Intel(R) Corporation
  Driver version:				 26.20.100.6999
  Version:					 OpenCL 2.1 NEO 
  Extensions:					 cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_depth_images cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_intel_subgroups cl_intel_required_subgroup_size cl_intel_subgroups_short cl_khr_spir cl_intel_accelerator cl_intel_media_block_io cl_intel_driver_diagnostics cl_intel_device_side_avc_motion_estimation cl_khr_priority_hints cl_khr_throttle_hints cl_khr_create_command_queue cl_khr_fp64 cl_khr_subgroups cl_khr_il_program cl_intel_spirv_device_side_avc_motion_estimation cl_intel_spirv_media_block_io cl_intel_spirv_subgroups cl_khr_spirv_no_integer_wrap_decoration cl_khr_mipmap_image cl_khr_mipmap_image_writes cl_intel_planar_yuv cl_intel_packed_yuv cl_intel_motion_estimation cl_intel_advanced_motion_estimation cl_khr_gl_sharing cl_khr_gl_depth_images cl_khr_gl_event cl_khr_gl_msaa_sharing cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_intel_d3d11_nv12_media_sharing cl_intel_simultaneous_sharing 


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.018810
Used GPU device parameters are:
	Number of compute units: 16
	Single buffer allocation size: 128MB
	Total device global memory: 4096MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Pulse: peak=1.555514, time=53.7, period=2.638, d_freq=1420340332.78, score=1.072, chirp=-2.4012, fft_len=256 
D:	threshold 0.03760133; unscaled peak power: 0.03920874 exceeds threshold for 4.275%
Pulse: peak=0.9580578, time=53.7, period=1.125, d_freq=1420341249.43, score=1.041, chirp=10.406, fft_len=256 
D:	threshold 0.03046625; unscaled peak power: 0.03106824 exceeds threshold for 1.976%
Pulse: peak=3.99876, time=53.71, period=10.47, d_freq=1420339963.87, score=1.043, chirp=-19.212, fft_len=512 
D:	threshold 0.147739; unscaled peak power: 0.1528114 exceeds threshold for 3.433%
Triplet: peak=10.18281, time=52.32, period=20.76, d_freq=1420338873.58, chirp=-38.19, fft_len=2k
Pulse: peak=3.481555, time=53.69, period=8.104, d_freq=1420340474.69, score=1.002, chirp=42.158, fft_len=128 
D:	threshold 0.03514956; unscaled peak power: 0.03520159 exceeds threshold for 0.148%
Triplet: peak=12.34, time=51.07, period=43.88, d_freq=1420344402.05, chirp=42.425, fft_len=1024 
Triplet: peak=11.21681, time=17.41, period=8.493, d_freq=1420338829.03, chirp=42.625, fft_len=2k
Triplet: peak=10.16637, time=17.41, period=8.493, d_freq=1420338829.62, chirp=42.659, fft_len=2k
Pulse: peak=0.8525598, time=53.71, period=0.9495, d_freq=1420343889.92, score=1.058, chirp=-73.244, fft_len=512 
D:	threshold 0.05507938; unscaled peak power: 0.05651078 exceeds threshold for 2.599%
Pulse: peak=1.981855, time=53.79, period=4.312, d_freq=1420340024.7, score=1.027, chirp=88.853, fft_len=2k
D:	threshold 0.3744747; unscaled peak power: 0.3810229 exceeds threshold for 1.749%
Spike: peak=24.0223, time=41.94, d_freq=1420343595.23, chirp=94.511, fft_len=32k
Pulse: peak=4.652308, time=53.7, period=11.97, d_freq=1420343085.41, score=1.001, chirp=-96.057, fft_len=256 
D:	threshold 0.08664502; unscaled peak power: 0.08673679 exceeds threshold for 0.1059%
Pulse: peak=1.427361, time=53.71, period=2.261, d_freq=1420344204.45, score=1.008, chirp=-98.992, fft_len=512 
D:	threshold 0.07764658; unscaled peak power: 0.07801428 exceeds threshold for 0.4735%

Best spike: peak=24.0223, time=41.94, d_freq=1420343595.23, chirp=94.511, fft_len=32k
Best autocorr: peak=17.59412, time=33.55, delay=1.3114, d_freq=1420342232.39, chirp=12.979, fft_len=128k
Best gaussian: peak=0, mean=0, ChiSq=0, time=-2.125e+011, d_freq=0,
	score=-12, null_hyp=0, chirp=0, fft_len=0 
Best pulse: peak=1.555514, time=53.7, period=2.638, d_freq=1420340332.78, score=1.072, chirp=-2.4012, fft_len=256 
Best triplet: peak=12.34, time=51.07, period=43.88, d_freq=1420344402.05, chirp=42.425, fft_len=1024 
Spike count:    1
Autocorr count: 0
Pulse count:    8
Triplet count:  4
Gaussian count: 0
Wallclock time elapsed since last restart: 1226.8 seconds
Fftlength=32,pass=3:Tune: sum=61097.9(ms); min=5.325(ms); max=68.29(ms); mean=51.26(ms); s_mean=55.23; sleep=45(ms); delta=498; N=1192; usual
Fftlength=32,pass=4:Tune: sum=27035.8(ms); min=3.912(ms); max=61.5(ms); mean=31.22(ms); s_mean=54.42; sleep=45(ms); delta=989; N=866; usual
Fftlength=32,pass=5:Tune: sum=20471.6(ms); min=3.35(ms); max=51.09(ms); mean=25.3(ms); s_mean=41.12; sleep=30(ms); delta=936; N=809; usual
Fftlength=64,pass=3:Tune: sum=30412(ms); min=2.65(ms); max=64.11(ms); mean=31.26(ms); s_mean=51.14; sleep=45(ms); delta=989; N=973; usual
Fftlength=64,pass=4:Tune: sum=19407.3(ms); min=1.817(ms); max=45.98(ms); mean=21.68(ms); s_mean=31.47; sleep=30(ms); delta=1022; N=895; usual
Fftlength=64,pass=5:Tune: sum=14428.5(ms); min=1.708(ms); max=36.87(ms); mean=17.05(ms); s_mean=22.91; sleep=15(ms); delta=973; N=846; usual
Fftlength=128,pass=3:Tune: sum=27846.5(ms); min=1.391(ms); max=65.74(ms); mean=25.69(ms); s_mean=33.61; sleep=30(ms); delta=1147; N=1084; usual
Fftlength=128,pass=4:Tune: sum=20935.3(ms); min=1.084(ms); max=57.61(ms); mean=20.13(ms); s_mean=27.06; sleep=30(ms); delta=1103; N=1040; usual
Fftlength=128,pass=5:Tune: sum=13845(ms); min=0.9156(ms); max=35.96(ms); mean=13.83(ms); s_mean=15.86; sleep=15(ms); delta=1064; N=1001; usual
Fftlength=256,pass=3:Tune: sum=31029(ms); min=0.6069(ms); max=44.01(ms); mean=26.8(ms); s_mean=42.34; sleep=45(ms); delta=1189; N=1158; usual
Fftlength=256,pass=4:Tune: sum=22403.7(ms); min=0.5371(ms); max=32.66(ms); mean=20.11(ms); s_mean=31.17; sleep=30(ms); delta=1145; N=1114; usual
Fftlength=256,pass=5:Tune: sum=17290.7(ms); min=0.4758(ms); max=25.22(ms); mean=16.19(ms); s_mean=23.9; sleep=15(ms); delta=1099; N=1068; usual
Fftlength=512,pass=3:Tune: sum=31338.7(ms); min=0.3133(ms); max=21.92(ms); mean=18.39(ms); s_mean=20.86; sleep=15(ms); delta=1719; N=1704; usual
Fftlength=512,pass=4:Tune: sum=23995.2(ms); min=0.2603(ms); max=16.8(ms); mean=14.27(ms); s_mean=16.29; sleep=15(ms); delta=1697; N=1682; usual
Fftlength=512,pass=5:Tune: sum=18209.2(ms); min=0.2442(ms); max=12.92(ms); mean=10.98(ms); s_mean=12.22; sleep=15(ms); delta=1674; N=1659; usual
Fftlength=1024,pass=3:Tune: sum=70318.1(ms); min=0.1679(ms); max=31.9(ms); mean=22.67(ms); s_mean=23.57; sleep=15(ms); delta=3109; N=3102; high_perf
Fftlength=1024,pass=4:Tune: sum=399.315(ms); min=0.1396(ms); max=7.391(ms); mean=2.644(ms); s_mean=6.291; sleep=0(ms); delta=3098; N=151; usual
Fftlength=1024,pass=5:Tune: sum=332.071(ms); min=0.1388(ms); max=6.296(ms); mean=2.372(ms); s_mean=5.972; sleep=0(ms); delta=3087; N=140; usual
Fftlength=2048,pass=3:Tune: sum=68608.3(ms); min=4.883(ms); max=11.81(ms); mean=11.44(ms); s_mean=11.46; sleep=0(ms); delta=1; N=5997; high_perf
Fftlength=4096,pass=3:Tune: sum=73713.5(ms); min=2.541(ms); max=6.743(ms); mean=6.146(ms); s_mean=6.149; sleep=0(ms); delta=1; N=11993; high_perf
Fftlength=8192,pass=3:Tune: sum=44042.6(ms); min=1.757(ms); max=2.509(ms); mean=1.836(ms); s_mean=1.837; sleep=0(ms); delta=1; N=23985; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=63910,	N=63910,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=942,	N=942,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=47864,	N=47864,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=13,	N=13,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=12,	N=12,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=0,	N=0,	<>=0,	min=0	max=0


class PoT_transfer_not_needed:		total=63900,	N=63900,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=953,	N=953,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
04:56:11 (17896): called boinc_finish(0)

</stderr_txt>
]]>

©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. Astropulse is funded in part by the NSF through grant AST-0307956.