Task 8703762390

Name blc44_2bit_guppi_58838_26948_TIC67772767_0103.21393.409.20.29.94.vlar_3
Workunit 3928655162
Created 22 Apr 2020, 12:07:04 UTC
Sent 22 Apr 2020, 12:09:16 UTC
Report deadline 14 Jun 2020, 17:08:58 UTC
Received 22 Apr 2020, 14:02:35 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8838940
Run time 7 min 4 sec
CPU time 6 min 52 sec
Validate state Valid
Credit 103.48
Device peak FLOPS 1,039.44 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG) windows_intelx86
Peak working set size 133.21 MB
Peak swap size 160.84 MB
Peak disk usage 0.04 MB

Stderr output

<core_client_version>7.16.5</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i7-8700K CPU @ 3.70GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.010663  NumCfft=105417  NumGauss=0  NumPulse=35597370496  NumTriplet=48555003296
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 2


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 1
  Max compute units:				 22
  Max work group size:				 1024
  Max clock frequency:				 1228Mhz
  Max memory allocation:			 1610612736
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 1081344
  Global memory size:				 6442450944
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 980 Ti
  Vendor:					 NVIDIA Corporation
  Driver version:				 450.82
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics


 OpenCL Platform Name:					 Intel(R) OpenCL
Number of devices:				 1
  Max compute units:				 24
  Max work group size:				 256
  Max clock frequency:				 1200Mhz
  Max memory allocation:			 858992640
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 524288
  Global memory size:				 1717985280
  Constant buffer size:				 858992640
  Max number of constant args:			 8
  Local memory type:				 Scratchpad
  Local memory size:				 65536
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 Intel(R) UHD Graphics 630
  Vendor:					 Intel(R) Corporation
  Driver version:				 26.20.100.7262
  Version:					 OpenCL 2.1 NEO 
  Extensions:					 cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_depth_images cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_intel_subgroups cl_intel_required_subgroup_size cl_intel_subgroups_short cl_khr_spir cl_intel_accelerator cl_intel_media_block_io cl_intel_driver_diagnostics cl_khr_priority_hints cl_khr_throttle_hints cl_khr_create_command_queue cl_khr_fp64 cl_khr_subgroups cl_khr_il_program cl_intel_spirv_device_side_avc_motion_estimation cl_intel_spirv_media_block_io cl_intel_spirv_subgroups cl_khr_spirv_no_integer_wrap_decoration cl_khr_mipmap_image cl_khr_mipmap_image_writes cl_intel_unified_shared_memory_preview cl_intel_planar_yuv cl_intel_packed_yuv cl_intel_motion_estimation cl_intel_device_side_avc_motion_estimation cl_intel_advanced_motion_estimation cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_gl_sharing cl_khr_gl_depth_images cl_khr_gl_event cl_khr_gl_msaa_sharing cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_intel_d3d11_nv12_media_sharing cl_intel_simultaneous_sharing 


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.010663
Used GPU device parameters are:
	Number of compute units: 22
	Single buffer allocation size: 128MB
	Total device global memory: 6144MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Pulse: peak=2.735809, time=45.84, period=5.704, d_freq=10460062305.4, score=1.014, chirp=-10.343, fft_len=512 
D:	threshold 0.1173873; unscaled peak power: 0.1185825 exceeds threshold for 1.018%
Pulse: peak=6.491954, time=45.81, period=14.71, d_freq=10460060149.6, score=1.039, chirp=24.072, fft_len=32 
D:	threshold 0.01400393; unscaled peak power: 0.01447215 exceeds threshold for 3.343%
Pulse: peak=4.828796, time=45.84, period=10.11, d_freq=10460057734.6, score=1.063, chirp=-30.09, fft_len=512 
D:	threshold 0.1710048; unscaled peak power: 0.1798284 exceeds threshold for 5.16%
GPU device sync requested...  ...GPU device synched
Termination request detected or computations are finished. GPU device synched,  exiting...
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i7-8700K CPU @ 3.70GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.010663  NumCfft=105417  NumGauss=0  NumPulse=35597370496  NumTriplet=48555003296
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768
Restarted at 75.44 percent.
Used GPU device parameters are:
	Number of compute units: 22
	Single buffer allocation size: 128MB
	Total device global memory: 6144MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Pulse: peak=3.74634, time=45.86, period=8.243, d_freq=10460057664.7, score=1.002, chirp=-51.34, fft_len=1024 
D:	threshold 0.2914927; unscaled peak power: 0.2919848 exceeds threshold for 0.1688%
Pulse: peak=3.741683, time=45.99, period=8.411, d_freq=10460056250.5, score=1.049, chirp=55.9, fft_len=4k
D:	threshold 1.196537; unscaled peak power: 1.24217 exceeds threshold for 3.814%
Pulse: peak=9.554779, time=45.99, period=26.49, d_freq=10460061248, score=1.013, chirp=-77.22, fft_len=4k
D:	threshold 2.632569; unscaled peak power: 2.663565 exceeds threshold for 1.177%

Best spike: peak=23.67376, time=85.9, d_freq=10460061793.9, chirp=-2.4878, fft_len=128k
Best autocorr: peak=16.3267, time=51.54, delay=2.7035, d_freq=10460060696.4, chirp=11.191, fft_len=128k
Best gaussian: peak=0, mean=0, ChiSq=0, time=-2.124e+011, d_freq=0,
	score=-12, null_hyp=0, chirp=0, fft_len=0 
Best pulse: peak=4.828796, time=45.84, period=10.11, d_freq=10460057734.6, score=1.063, chirp=-30.09, fft_len=512 
Best triplet: peak=0, time=-2.124e+011, period=0, d_freq=0, chirp=0, fft_len=0 
Spike count:    0
Autocorr count: 0
Pulse count:    6
Triplet count:  0
Gaussian count: 0
Wallclock time elapsed since last restart: 173.1 seconds
Fftlength=32,pass=3:Tune: sum=20017.9(ms); min=6.062(ms); max=67.95(ms); mean=35.94(ms); s_mean=53.18; sleep=45(ms); delta=622; N=557; usual
Fftlength=32,pass=4:Tune: sum=13914.5(ms); min=4.198(ms); max=50.94(ms); mean=26.06(ms); s_mean=41.87; sleep=30(ms); delta=632; N=534; usual
Fftlength=32,pass=5:Tune: sum=11946.1(ms); min=4.818(ms); max=43.09(ms); mean=23.47(ms); s_mean=31.91; sleep=30(ms); delta=596; N=509; usual
Fftlength=64,pass=3:Tune: sum=10528.3(ms); min=2.768(ms); max=36.34(ms); mean=18.12(ms); s_mean=22.82; sleep=15(ms); delta=668; N=581; usual
Fftlength=64,pass=4:Tune: sum=7334.45(ms); min=2.081(ms); max=27.15(ms); mean=13.38(ms); s_mean=23.43; sleep=15(ms); delta=635; N=548; usual
Fftlength=64,pass=5:Tune: sum=4552.57(ms); min=2.318(ms); max=16.68(ms); mean=9.564(ms); s_mean=13.24; sleep=15(ms); delta=607; N=476; usual
Fftlength=128,pass=3:Tune: sum=6205.97(ms); min=1.416(ms); max=21.18(ms); mean=10.17(ms); s_mean=19.26; sleep=15(ms); delta=697; N=610; usual
Fftlength=128,pass=4:Tune: sum=4348.38(ms); min=1.175(ms); max=15.81(ms); mean=7.615(ms); s_mean=13.8; sleep=15(ms); delta=658; N=571; usual
Fftlength=128,pass=5:Tune: sum=3517.33(ms); min=1.167(ms); max=13.88(ms); mean=6.612(ms); s_mean=11.11; sleep=0(ms); delta=619; N=532; usual
Fftlength=256,pass=3:Tune: sum=7280.43(ms); min=0.707(ms); max=45.88(ms); mean=10.49(ms); s_mean=44.19; sleep=45(ms); delta=737; N=694; high_perf
Fftlength=256,pass=4:Tune: sum=3369.73(ms); min=0.5768(ms); max=13.46(ms); mean=5.663(ms); s_mean=13.35; sleep=15(ms); delta=694; N=595; usual
Fftlength=256,pass=5:Tune: sum=2743.57(ms); min=0.5919(ms); max=11.72(ms); mean=4.979(ms); s_mean=11.32; sleep=0(ms); delta=650; N=551; usual
Fftlength=512,pass=3:Tune: sum=11668.6(ms); min=0.3668(ms); max=22.93(ms); mean=14.71(ms); s_mean=22.12; sleep=15(ms); delta=814; N=793; high_perf
Fftlength=512,pass=4:Tune: sum=837.466(ms); min=0.292(ms); max=6.772(ms); mean=2.81(ms); s_mean=6.514; sleep=0(ms); delta=793; N=298; usual
Fftlength=512,pass=5:Tune: sum=681.826(ms); min=0.3038(ms); max=5.751(ms); mean=2.47(ms); s_mean=5.522; sleep=0(ms); delta=771; N=276; usual
Fftlength=1024,pass=3:Tune: sum=12574(ms); min=0.1847(ms); max=11.39(ms); mean=9.762(ms); s_mean=10.87; sleep=0(ms); delta=1298; N=1288; high_perf
Fftlength=1024,pass=4:Tune: sum=202.487(ms); min=0.1547(ms); max=3.285(ms); mean=1.368(ms); s_mean=2.841; sleep=0(ms); delta=1287; N=148; usual
Fftlength=1024,pass=5:Tune: sum=167.694(ms); min=0.1601(ms); max=2.775(ms); mean=1.224(ms); s_mean=2.659; sleep=0(ms); delta=1276; N=137; usual
Fftlength=2048,pass=3:Tune: sum=9357.47(ms); min=1.778(ms); max=4.6(ms); mean=3.935(ms); s_mean=3.928; sleep=0(ms); delta=1; N=2378; high_perf
Fftlength=4096,pass=3:Tune: sum=9237.48(ms); min=0.8164(ms); max=2.221(ms); mean=1.942(ms); s_mean=1.931; sleep=0(ms); delta=1; N=4756; high_perf
Fftlength=8192,pass=3:Tune: sum=8601.55(ms); min=0.8908(ms); max=0.9754(ms); mean=0.9045(ms); s_mean=0.9091; sleep=0(ms); delta=1; N=9510; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=24538,	N=24538,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=1352,	N=1352,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=18979,	N=18979,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=3,	N=3,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=1,	N=1,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=0,	N=0,	<>=0,	min=0	max=0


class PoT_transfer_not_needed:		total=24535,	N=24535,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=1355,	N=1355,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
10:00:00 (150620): called boinc_finish(0)

</stderr_txt>
]]>

©2020 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.