Task 8703933158

Name 18mr20ac.19600.4975.10.37.161_3
Workunit 3943511185
Created 22 Apr 2020, 19:37:11 UTC
Sent 22 Apr 2020, 19:40:54 UTC
Report deadline 14 Jun 2020, 16:48:34 UTC
Received 22 Apr 2020, 22:02:56 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8885744
Run time 3 min 13 sec
CPU time 3 min 10 sec
Validate state Valid
Credit 79.40
Device peak FLOPS 3,348.88 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG) windows_intelx86
Peak working set size 145.65 MB
Peak swap size 174.54 MB
Peak disk usage 0.01 MB

Stderr output

<core_client_version>7.16.5</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i9-9900K CPU @ 3.60GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.435738  NumCfft=194035  NumGauss=1085462036  NumPulse=226450004339  NumTriplet=452830915775
Currently allocated 229 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 2


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 1
  Max compute units:				 48
  Max work group size:				 1024
  Max clock frequency:				 1815Mhz
  Max memory allocation:			 2147483648
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 1572864
  Global memory size:				 8589934592
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce RTX 2080 SUPER
  Vendor:					 NVIDIA Corporation
  Driver version:				 445.87
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics


 OpenCL Platform Name:					 Intel(R) OpenCL
Number of devices:				 1
  Max compute units:				 24
  Max work group size:				 256
  Max clock frequency:				 1200Mhz
  Max memory allocation:			 858992640
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 524288
  Global memory size:				 1717985280
  Constant buffer size:				 858992640
  Max number of constant args:			 8
  Local memory type:				 Scratchpad
  Local memory size:				 65536
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 Intel(R) UHD Graphics 630
  Vendor:					 Intel(R) Corporation
  Driver version:				 26.20.100.7263
  Version:					 OpenCL 2.1 NEO 
  Extensions:					 cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_depth_images cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_intel_subgroups cl_intel_required_subgroup_size cl_intel_subgroups_short cl_khr_spir cl_intel_accelerator cl_intel_media_block_io cl_intel_driver_diagnostics cl_khr_priority_hints cl_khr_throttle_hints cl_khr_create_command_queue cl_khr_fp64 cl_khr_subgroups cl_khr_il_program cl_intel_spirv_device_side_avc_motion_estimation cl_intel_spirv_media_block_io cl_intel_spirv_subgroups cl_khr_spirv_no_integer_wrap_decoration cl_khr_mipmap_image cl_khr_mipmap_image_writes cl_intel_unified_shared_memory_preview cl_intel_planar_yuv cl_intel_packed_yuv cl_intel_motion_estimation cl_intel_device_side_avc_motion_estimation cl_intel_advanced_motion_estimation cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_gl_sharing cl_khr_gl_depth_images cl_khr_gl_event cl_khr_gl_msaa_sharing cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_intel_d3d11_nv12_media_sharing cl_intel_simultaneous_sharing 


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.435738
Used GPU device parameters are:
	Number of compute units: 48
	Single buffer allocation size: 128MB
	Total device global memory: 8192MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Spike: peak=24.0906, time=90.6, d_freq=1419067394.84, chirp=20.201, fft_len=64k
Spike: peak=24.64486, time=90.6, d_freq=1419067394.84, chirp=20.215, fft_len=64k
Spike: peak=24.10935, time=87.24, d_freq=1419068938.4, chirp=-21.275, fft_len=128k
Spike: peak=24.50458, time=87.24, d_freq=1419068938.4, chirp=-21.276, fft_len=128k
Pulse: peak=0.5998604, time=49.33, period=0.07078, d_freq=1419067687.2, score=1.018, chirp=24.728, fft_len=32 
D:	threshold 0.003040986; unscaled peak power: 0.003061331 exceeds threshold for 0.669%
Pulse: peak=0.596751, time=49.33, period=0.07078, d_freq=1419067686.8, score=1.013, chirp=37.092, fft_len=32 
D:	threshold 0.003053844; unscaled peak power: 0.0030683 exceeds threshold for 0.4734%
Triplet: peak=9.868783, time=8.9, period=2.641, d_freq=1419068232.14, chirp=-41.729, fft_len=128 
Triplet: peak=10.70276, time=99.52, period=3.826, d_freq=1419076899.43, chirp=-57.699, fft_len=32 
Triplet: peak=10.68895, time=99.52, period=3.826, d_freq=1419076899.43, chirp=-57.699, fft_len=32 
Triplet: peak=11.08371, time=99.52, period=3.826, d_freq=1419076894.61, chirp=-63.88, fft_len=32 
Triplet: peak=11.05153, time=99.52, period=3.826, d_freq=1419076894.61, chirp=-63.88, fft_len=32 
Gaussian: peak=4.072314, mean=0.5565221, ChiSq=1.416522, time=83.05, d_freq=1419074903.82,
	score=0.6584652, null_hyp=2.288405, chirp=89.257, fft_len=16k

Best spike: peak=24.64486, time=90.6, d_freq=1419067394.84, chirp=20.215, fft_len=64k
Best autocorr: peak=17.79655, time=73.82, delay=4.9625, d_freq=1419073639.89, chirp=18.616, fft_len=128k
Best gaussian: peak=4.072314, mean=0.5565221, ChiSq=1.416522, time=83.05, d_freq=1419074903.82,
	score=0.6584652, null_hyp=2.288405, chirp=89.257, fft_len=16k
Best pulse: peak=0.5998604, time=49.33, period=0.07078, d_freq=1419067687.2, score=1.018, chirp=24.728, fft_len=32 
Best triplet: peak=11.08371, time=99.52, period=3.826, d_freq=1419076894.61, chirp=-63.88, fft_len=32 
Spike count:    4
Autocorr count: 0
Pulse count:    2
Triplet count:  5
Gaussian count: 1
Wallclock time elapsed since last restart: 187.0 seconds
Fftlength=8,pass=3:Tune: sum=2858.35(ms); min=1.966(ms); max=26.44(ms); mean=13.94(ms); s_mean=17.02; sleep=15(ms); delta=255; N=205; usual
Fftlength=8,pass=4:Tune: sum=1806.35(ms); min=2.478(ms); max=16.64(ms); mean=9.925(ms); s_mean=12.37; sleep=15(ms); delta=245; N=182; usual
Fftlength=8,pass=5:Tune: sum=1252.48(ms); min=1.808(ms); max= 12(ms); mean=7.978(ms); s_mean=9.861; sleep=0(ms); delta=236; N=157; usual
Fftlength=16,pass=3:Tune: sum=2381.39(ms); min=1.11(ms); max=22.27(ms); mean=11.29(ms); s_mean=18.22; sleep=15(ms); delta=258; N=211; usual
Fftlength=16,pass=4:Tune: sum=1501.04(ms); min=1.141(ms); max=14.44(ms); mean=8.07(ms); s_mean=11.26; sleep=0(ms); delta=249; N=186; usual
Fftlength=16,pass=5:Tune: sum=1066.65(ms); min=0.94(ms); max=10.74(ms); mean=6.544(ms); s_mean=7.637; sleep=0(ms); delta=242; N=163; usual
Fftlength=32,pass=3:Tune: sum=1297.55(ms); min=0.6308(ms); max=12.17(ms); mean=5.898(ms); s_mean=7.966; sleep=0(ms); delta=267; N=220; usual
Fftlength=32,pass=4:Tune: sum=862.081(ms); min=0.3746(ms); max=8.651(ms); mean=4.332(ms); s_mean=5.232; sleep=0(ms); delta=262; N=199; usual
Fftlength=32,pass=5:Tune: sum=595.508(ms); min=0.3948(ms); max=7.029(ms); mean=3.462(ms); s_mean=3.912; sleep=0(ms); delta=251; N=172; usual
Fftlength=64,pass=3:Tune: sum=795.439(ms); min=0.2321(ms); max=7.927(ms); mean=3.273(ms); s_mean=3.643; sleep=0(ms); delta=290; N=243; usual
Fftlength=64,pass=4:Tune: sum=536.692(ms); min=0.1961(ms); max=6.119(ms); mean=2.428(ms); s_mean=4.027; sleep=0(ms); delta=284; N=221; usual
Fftlength=64,pass=5:Tune: sum=389.381(ms); min=0.177(ms); max=4.741(ms); mean=2.071(ms); s_mean=4.09; sleep=0(ms); delta=269; N=188; usual
Fftlength=128,pass=3:Tune: sum=1689.94(ms); min=4.053(ms); max=9.427(ms); mean=8.666(ms); s_mean=8.651; sleep=0(ms); delta=1; N=195; high_perf
Fftlength=256,pass=3:Tune: sum=1669.25(ms); min=2.513(ms); max=4.947(ms); mean=4.291(ms); s_mean=4.397; sleep=0(ms); delta=1; N=389; high_perf
Fftlength=512,pass=3:Tune: sum=1649(ms); min=0.9379(ms); max=2.776(ms); mean=2.122(ms); s_mean=2.066; sleep=0(ms); delta=1; N=777; high_perf
Fftlength=1024,pass=3:Tune: sum=410.937(ms); min=0.254(ms); max=0.554(ms); mean=0.2646(ms); s_mean=0.2637; sleep=0(ms); delta=1; N=1553; usual
Fftlength=2048,pass=3:Tune: sum=374.301(ms); min=0.1149(ms); max=0.2074(ms); mean=0.1205(ms); s_mean=0.1196; sleep=0(ms); delta=1; N=3105; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=18,	N=18,	<>=1,	min=1	max=1
class Gaussian_report:		total=1,	N=1,	<>=1,	min=1	max=1
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=20904,	N=20904,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=3919,	N=3919,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=12406,	N=12406,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=7,	N=7,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=5,	N=5,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=1,	N=1,	<>=1,	min=1	max=1


class PoT_transfer_not_needed:		total=20902,	N=20902,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=3922,	N=3922,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
17:09:01 (14388): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.