Task 8704128748

Name 26mr20aa.7638.1299.9.36.141_4
Workunit 3953707604
Created 23 Apr 2020, 5:39:03 UTC
Sent 23 Apr 2020, 5:41:08 UTC
Report deadline 15 Jun 2020, 11:15:54 UTC
Received 23 Apr 2020, 12:01:00 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8740205
Run time 12 min 38 sec
CPU time 12 min 24 sec
Validate state Valid
Credit 81.61
Device peak FLOPS 321.50 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG)
windows_intelx86
Peak working set size 131.69 MB
Peak swap size 149.21 MB
Peak disk usage 0.03 MB

Stderr output

<core_client_version>7.16.5</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i7-8700 CPU @ 3.20GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.428048  NumCfft=195835  NumGauss=1105129676  NumPulse=226443570134  NumTriplet=452854165130
Currently allocated 229 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 2


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 1
  Max compute units:				 6
  Max work group size:				 1024
  Max clock frequency:				 1392Mhz
  Max memory allocation:			 1073741824
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 294912
  Global memory size:				 4294967296
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 1050 Ti
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.19
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics


 OpenCL Platform Name:					 Intel(R) OpenCL
Number of devices:				 1
  Max compute units:				 24
  Max work group size:				 256
  Max clock frequency:				 1200Mhz
  Max memory allocation:			 858966016
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 524288
  Global memory size:				 1717932032
  Constant buffer size:				 858966016
  Max number of constant args:			 8
  Local memory type:				 Scratchpad
  Local memory size:				 65536
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 Intel(R) UHD Graphics 630
  Vendor:					 Intel(R) Corporation
  Driver version:				 25.20.100.6471
  Version:					 OpenCL 2.1 NEO 
  Extensions:					 cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_depth_images cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_intel_subgroups cl_intel_required_subgroup_size cl_intel_subgroups_short cl_khr_spir cl_intel_accelerator cl_intel_media_block_io cl_intel_driver_diagnostics cl_intel_device_side_avc_motion_estimation cl_khr_priority_hints cl_khr_throttle_hints cl_khr_create_command_queue cl_khr_fp64 cl_khr_subgroups cl_khr_il_program cl_khr_mipmap_image cl_khr_mipmap_image_writes cl_intel_planar_yuv cl_intel_packed_yuv cl_intel_motion_estimation cl_intel_advanced_motion_estimation cl_khr_gl_sharing cl_khr_gl_depth_images cl_khr_gl_event cl_khr_gl_msaa_sharing cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_intel_d3d11_nv12_media_sharing cl_intel_simultaneous_sharing 


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.428048
Used GPU device parameters are:
	Number of compute units: 6
	Single buffer allocation size: 128MB
	Total device global memory: 4096MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Pulse: peak=3.314215, time=18.82, period=0.9413, d_freq=1418878784.42, score=1.019, chirp=-8.0956, fft_len=64 
D:	threshold 0.01727924; unscaled peak power: 0.01752913 exceeds threshold for 1.446%
Spike: peak=24.52115, time=100.7, d_freq=1418876564.45, chirp=12.352, fft_len=128k
Spike: peak=24.87666, time=100.7, d_freq=1418876564.45, chirp=12.356, fft_len=128k
Spike: peak=24.54475, time=100.7, d_freq=1418876564.45, chirp=12.359, fft_len=128k

Best spike: peak=24.87666, time=100.7, d_freq=1418876564.45, chirp=12.356, fft_len=128k
Best autocorr: peak=17.02493, time=46.98, delay=0.45619, d_freq=1418876717.8, chirp=-5.0095, fft_len=128k
Best gaussian: peak=3.921632, mean=0.576673, ChiSq=1.406469, time=99.82, d_freq=1418875761.79,
	score=-1.004978, null_hyp=2.193675, chirp=28.949, fft_len=16k
Best pulse: peak=3.314215, time=18.82, period=0.9413, d_freq=1418878784.42, score=1.019, chirp=-8.0956, fft_len=64 
Best triplet: peak=0, time=-2.125e+011, period=0, d_freq=0, chirp=0, fft_len=0 
Spike count:    3
Autocorr count: 0
Pulse count:    1
Triplet count:  0
Gaussian count: 0
Wallclock time elapsed since last restart: 739.7 seconds
Fftlength=8,pass=3:Tune: sum=9713.8(ms); min=7.884(ms); max=69.02(ms); mean=44.56(ms); s_mean=46.09; sleep=45(ms); delta=193; N=218; usual
Fftlength=8,pass=4:Tune: sum=6717.03(ms); min=4.584(ms); max=62.02(ms); mean=36.51(ms); s_mean=50.65; sleep=45(ms); delta=235; N=184; usual
Fftlength=8,pass=5:Tune: sum=4887.33(ms); min=4.092(ms); max=48.55(ms); mean=30.55(ms); s_mean=40.64; sleep=30(ms); delta=239; N=160; usual
Fftlength=16,pass=3:Tune: sum=5627.56(ms); min=1.133(ms); max=51.38(ms); mean=26.05(ms); s_mean=40.45; sleep=30(ms); delta=263; N=216; usual
Fftlength=16,pass=4:Tune: sum=3820.5(ms); min=1.56(ms); max=36.76(ms); mean=19.1(ms); s_mean=29.99; sleep=30(ms); delta=247; N=200; usual
Fftlength=16,pass=5:Tune: sum=2741.12(ms); min=1.094(ms); max=27.11(ms); mean=14.66(ms); s_mean=20.35; sleep=15(ms); delta=234; N=187; usual
Fftlength=32,pass=3:Tune: sum=3747.5(ms); min=0.7547(ms); max=34.62(ms); mean=15.42(ms); s_mean=23.52; sleep=15(ms); delta=266; N=243; usual
Fftlength=32,pass=4:Tune: sum=2663.87(ms); min=0.6748(ms); max=25.83(ms); mean=11.53(ms); s_mean=16.44; sleep=15(ms); delta=254; N=231; usual
Fftlength=32,pass=5:Tune: sum=1946.9(ms); min=0.6154(ms); max= 20(ms); mean=8.89(ms); s_mean=11.89; sleep=0(ms); delta=242; N=219; usual
Fftlength=64,pass=3:Tune: sum=3978.51(ms); min=0.2959(ms); max=38.36(ms); mean=14.36(ms); s_mean=18.21; sleep=15(ms); delta=288; N=277; usual
Fftlength=64,pass=4:Tune: sum=2866.46(ms); min=0.3011(ms); max=30.02(ms); mean=10.74(ms); s_mean=13.29; sleep=15(ms); delta=278; N=267; usual
Fftlength=64,pass=5:Tune: sum=2126.61(ms); min=0.255(ms); max=23.45(ms); mean=8.473(ms); s_mean=21.18; sleep=15(ms); delta=262; N=251; usual
Fftlength=128,pass=3:Tune: sum=4810.66(ms); min=23.65(ms); max=24.87(ms); mean=24.42(ms); s_mean=24.35; sleep=15(ms); delta=1; N=197; usual
Fftlength=128,pass=4:Tune: sum=3679.89(ms); min=17.95(ms); max=19.18(ms); mean=18.68(ms); s_mean=18.65; sleep=15(ms); delta=1; N=197; usual
Fftlength=128,pass=5:Tune: sum=2720.02(ms); min=12.55(ms); max=15.12(ms); mean=13.81(ms); s_mean=13.5; sleep=15(ms); delta=1; N=197; usual
Fftlength=256,pass=3:Tune: sum=10744.6(ms); min=11.74(ms); max=28.09(ms); mean=27.2(ms); s_mean=27.2; sleep=30(ms); delta=1; N=395; high_perf
Fftlength=512,pass=3:Tune: sum=10518.1(ms); min=5.838(ms); max=13.67(ms); mean=13.3(ms); s_mean=13.35; sleep=15(ms); delta=1; N=791; high_perf
Fftlength=1024,pass=3:Tune: sum=3259.97(ms); min=2.012(ms); max=2.41(ms); mean=2.062(ms); s_mean=2.059; sleep=0(ms); delta=1; N=1581; usual
Fftlength=2048,pass=3:Tune: sum=2032.45(ms); min=0.6318(ms); max=0.7965(ms); mean=0.6426(ms); s_mean=0.6431; sleep=0(ms); delta=1; N=3163; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=22,	N=22,	<>=1,	min=1	max=1
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=24820,	N=24820,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=453,	N=453,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=12632,	N=12632,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=5,	N=5,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=3,	N=3,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=1,	N=1,	<>=1,	min=1	max=1


class PoT_transfer_not_needed:		total=24816,	N=24816,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=458,	N=458,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
14:26:58 (10908): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.