Task 8703579019

Name blc64_2bit_guppi_58838_02594_TIC468880077_0021.16056.0.20.29.1.vlar_3
Workunit 3907936353
Created 21 Apr 2020, 22:52:15 UTC
Sent 21 Apr 2020, 22:52:17 UTC
Report deadline 14 Jun 2020, 3:51:59 UTC
Received 22 Apr 2020, 12:27:53 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8207016
Run time 1 hours 51 min 31 sec
CPU time 28 min 50 sec
Validate state Valid
Credit 114.97
Device peak FLOPS 135.84 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG)
windows_intelx86
Peak working set size 134.05 MB
Peak swap size 161.41 MB
Peak disk usage 0.05 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i7-6700HQ CPU @ 2.60GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
Low-performance GPU detected, default period_iterations_num set to 500
For low-performance GPU path use_sleep enabled with 1ms per iteration, high prec timer enabled
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.012740  NumCfft=115021  NumGauss=0  NumPulse=45647691648  NumTriplet=58618824864
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 2


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 1
  Max compute units:				 3
  Max work group size:				 1024
  Max clock frequency:				 1176Mhz
  Max memory allocation:			 1073741824
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 73728
  Global memory size:				 4294967296
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce 940M
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.23
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics


 OpenCL Platform Name:					 Intel(R) OpenCL
Number of devices:				 1
  Max compute units:				 24
  Max work group size:				 256
  Max clock frequency:				 1050Mhz
  Max memory allocation:			 858992640
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 524288
  Global memory size:				 1717985280
  Constant buffer size:				 858992640
  Max number of constant args:			 8
  Local memory type:				 Scratchpad
  Local memory size:				 65536
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 Intel(R) HD Graphics 530
  Vendor:					 Intel(R) Corporation
  Driver version:				 26.20.100.7263
  Version:					 OpenCL 2.1 NEO 
  Extensions:					 cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_depth_images cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_intel_subgroups cl_intel_required_subgroup_size cl_intel_subgroups_short cl_khr_spir cl_intel_accelerator cl_intel_media_block_io cl_intel_driver_diagnostics cl_khr_priority_hints cl_khr_throttle_hints cl_khr_create_command_queue cl_khr_fp64 cl_khr_subgroups cl_khr_il_program cl_intel_spirv_device_side_avc_motion_estimation cl_intel_spirv_media_block_io cl_intel_spirv_subgroups cl_khr_spirv_no_integer_wrap_decoration cl_khr_mipmap_image cl_khr_mipmap_image_writes cl_intel_unified_shared_memory_preview cl_intel_planar_yuv cl_intel_packed_yuv cl_intel_motion_estimation cl_intel_device_side_avc_motion_estimation cl_intel_advanced_motion_estimation cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_gl_sharing cl_khr_gl_depth_images cl_khr_gl_event cl_khr_gl_msaa_sharing cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_intel_d3d11_nv12_media_sharing cl_intel_simultaneous_sharing 


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.012740
Used GPU device parameters are:
	Number of compute units: 3
	Single buffer allocation size: 128MB
	Total device global memory: 4096MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: yes
	HighPerformanceGPU path: no
period_iterations_num=500
Pulse: peak=0.5610434, time=45.82, period=0.3914, d_freq=8156260345.38, score=1.025, chirp=11.144, fft_len=128 
D:	threshold 0.01197542; unscaled peak power: 0.01207975 exceeds threshold for 0.8712%
Spike: peak=24.1209, time=17.18, d_freq=8156255852.48, chirp=-17.158, fft_len=128k
Spike: peak=24.06549, time=17.18, d_freq=8156255852.48, chirp=-17.163, fft_len=128k
Spike: peak=24.99416, time=17.18, d_freq=8156265719.63, chirp=-29.574, fft_len=128k
Spike: peak=25.15653, time=17.18, d_freq=8156265719.63, chirp=-29.579, fft_len=128k
GPU device sync requested...  ...GPU device synched
Termination request detected or computations are finished. GPU device synched,  exiting...
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i7-6700HQ CPU @ 2.60GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
Low-performance GPU detected, default period_iterations_num set to 500
For low-performance GPU path use_sleep enabled with 1ms per iteration, high prec timer enabled
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.012740  NumCfft=115021  NumGauss=0  NumPulse=45647691648  NumTriplet=58618824864
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768
Restarted at 72.47 percent.
Used GPU device parameters are:
	Number of compute units: 3
	Single buffer allocation size: 128MB
	Total device global memory: 4096MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: yes
	HighPerformanceGPU path: no
period_iterations_num=500
Pulse: peak=4.115025, time=45.86, period=8.254, d_freq=8156261677.39, score=1.101, chirp=-53.89, fft_len=1024 
D:	threshold 0.2968354; unscaled peak power: 0.3204329 exceeds threshold for 7.95%
Pulse: peak=9.473404, time=45.99, period=28.45, d_freq=8156263210.66, score=1.002, chirp=57.06, fft_len=4k
D:	threshold 2.38475; unscaled peak power: 2.388906 exceeds threshold for 0.1743%
Pulse: peak=8.400015, time=45.9, period=22.55, d_freq=8156261776, score=1.102, chirp=57.629, fft_len=2k
D:	threshold 1.103161; unscaled peak power: 1.202678 exceeds threshold for 9.021%
Pulse: peak=10.25388, time=45.9, period=28.28, d_freq=8156256821.4, score=1.057, chirp=-63.091, fft_len=2k
D:	threshold 1.344017; unscaled peak power: 1.413424 exceeds threshold for 5.164%
Pulse: peak=5.661801, time=45.86, period=12.91, d_freq=8156259182.85, score=1.016, chirp=-63.201, fft_len=1024 
D:	threshold 0.3844393; unscaled peak power: 0.3898091 exceeds threshold for 1.397%
GPU device sync requested...  ...GPU device synched
Termination request detected or computations are finished. GPU device synched,  exiting...
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i7-6700HQ CPU @ 2.60GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
Low-performance GPU detected, default period_iterations_num set to 500
For low-performance GPU path use_sleep enabled with 1ms per iteration, high prec timer enabled
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.012740  NumCfft=115021  NumGauss=0  NumPulse=45647691648  NumTriplet=58618824864
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768
Restarted at 86.38 percent.
Used GPU device parameters are:
	Number of compute units: 3
	Single buffer allocation size: 128MB
	Total device global memory: 4096MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: yes
	HighPerformanceGPU path: no
period_iterations_num=500
GPU device sync requested...  ...GPU device synched
Termination request detected or computations are finished. GPU device synched,  exiting...
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i7-6700HQ CPU @ 2.60GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
Low-performance GPU detected, default period_iterations_num set to 500
For low-performance GPU path use_sleep enabled with 1ms per iteration, high prec timer enabled
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.012740  NumCfft=115021  NumGauss=0  NumPulse=45647691648  NumTriplet=58618824864
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768
Restarted at 88.52 percent.
Used GPU device parameters are:
	Number of compute units: 3
	Single buffer allocation size: 128MB
	Total device global memory: 4096MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: yes
	HighPerformanceGPU path: no
period_iterations_num=500
Triplet: peak=10.40353, time=26.41, period=25.46, d_freq=8156256775.74, chirp=-78.598, fft_len=128 
Pulse: peak=6.897832, time=45.84, period=17.29, d_freq=8156256836.3, score=1.039, chirp=-93.702, fft_len=512 
D:	threshold 0.2382169; unscaled peak power: 0.2463213 exceeds threshold for 3.402%
Pulse: peak=2.255178, time=45.86, period=4.273, d_freq=8156261152.58, score=1.006, chirp=-93.848, fft_len=1024 
D:	threshold 0.2029162; unscaled peak power: 0.2037007 exceeds threshold for 0.3866%
Pulse: peak=2.007845, time=45.84, period=2.874, d_freq=8156259686.3, score=1.003, chirp=95.754, fft_len=512 
D:	threshold 0.09198654; unscaled peak power: 0.09217962 exceeds threshold for 0.2099%

Best spike: peak=25.15653, time=17.18, d_freq=8156265719.63, chirp=-29.579, fft_len=128k
Best autocorr: peak=17.46458, time=62.99, delay=4.9995, d_freq=8156259965.39, chirp=-23.474, fft_len=128k
Best gaussian: peak=0, mean=0, ChiSq=0, time=-2.124e+011, d_freq=0,
	score=-12, null_hyp=0, chirp=0, fft_len=0 
Best pulse: peak=8.400015, time=45.9, period=22.55, d_freq=8156261776, score=1.102, chirp=57.629, fft_len=2k
Best triplet: peak=10.40353, time=26.41, period=25.46, d_freq=8156256775.74, chirp=-78.598, fft_len=128 
Spike count:    4
Autocorr count: 0
Pulse count:    9
Triplet count:  1
Gaussian count: 0
Wallclock time elapsed since last restart: 1228.1 seconds
Fftlength=32,pass=3:Tune: sum=46958.5(ms); min=10.45(ms); max=80.44(ms); mean=57.62(ms); s_mean=50.51; sleep=45(ms); delta=144; N=815; usual
Fftlength=32,pass=4:Tune: sum=32033.7(ms); min=6.54(ms); max=74.83(ms); mean=54.57(ms); s_mean=51.11; sleep=45(ms); delta=192; N=587; usual
Fftlength=32,pass=5:Tune: sum=26678.3(ms); min=7.22(ms); max=68.7(ms); mean=53.25(ms); s_mean=55.12; sleep=45(ms); delta=199; N=501; usual
Fftlength=64,pass=3:Tune: sum=46432(ms); min=4.281(ms); max=74.42(ms); mean=56.9(ms); s_mean=53.47; sleep=45(ms); delta=144; N=816; usual
Fftlength=64,pass=4:Tune: sum=31371.1(ms); min=3.608(ms); max=67.99(ms); mean=52.03(ms); s_mean=54.4; sleep=45(ms); delta=192; N=603; usual
Fftlength=64,pass=5:Tune: sum=24767.5(ms); min=3.585(ms); max=68.47(ms); mean=49.53(ms); s_mean=58.67; sleep=60(ms); delta=220; N=500; usual
Fftlength=128,pass=3:Tune: sum=45815.7(ms); min=3.054(ms); max=72.74(ms); mean=57.27(ms); s_mean=59.07; sleep=60(ms); delta=153; N=800; usual
Fftlength=128,pass=4:Tune: sum=32124.3(ms); min=2.524(ms); max=74.4(ms); mean=51.32(ms); s_mean=62.03; sleep=60(ms); delta=205; N=626; usual
Fftlength=128,pass=5:Tune: sum=24408.2(ms); min=1.822(ms); max=69.47(ms); mean=48.14(ms); s_mean=58.78; sleep=60(ms); delta=221; N=507; usual
Fftlength=256,pass=3:Tune: sum=45905.6(ms); min=1.221(ms); max=83.41(ms); mean=54.39(ms); s_mean=47.99; sleep=45(ms); delta=169; N=844; usual
Fftlength=256,pass=4:Tune: sum=32291.7(ms); min=0.9075(ms); max=77.06(ms); mean=50.53(ms); s_mean=64.16; sleep=60(ms); delta=207; N=639; usual
Fftlength=256,pass=5:Tune: sum=25064.8(ms); min=0.9564(ms); max=83.54(ms); mean=46.42(ms); s_mean=67.15; sleep=60(ms); delta=273; N=540; usual
Fftlength=512,pass=3:Tune: sum=46708.5(ms); min=0.6093(ms); max=81.93(ms); mean=53.69(ms); s_mean=42.98; sleep=45(ms); delta=170; N=870; usual
Fftlength=512,pass=4:Tune: sum=34101.3(ms); min=0.5374(ms); max=110.8(ms); mean=49.35(ms); s_mean=48.5; sleep=45(ms); delta=305; N=691; usual
Fftlength=512,pass=5:Tune: sum=26426.2(ms); min=0.5828(ms); max=90.51(ms); mean=45.88(ms); s_mean=56.54; sleep=45(ms); delta=271; N=576; usual
Fftlength=1024,pass=3:Tune: sum=50447.8(ms); min=0.4521(ms); max=87.26(ms); mean=53.21(ms); s_mean=63.02; sleep=60(ms); delta=171; N=948; usual
Fftlength=1024,pass=4:Tune: sum=42774(ms); min=0.4337(ms); max=72.63(ms); mean=52.81(ms); s_mean=65.77; sleep=60(ms); delta=155; N=810; usual
Fftlength=1024,pass=5:Tune: sum=23636.1(ms); min=0.5692(ms); max=43.01(ms); mean=32.42(ms); s_mean=36.45; sleep=30(ms); delta=731; N=729; usual
Fftlength=2048,pass=3:Tune: sum=50449(ms); min=37.56(ms); max=42.3(ms); mean=39.2(ms); s_mean=39.07; sleep=30(ms); delta=1; N=1287; usual
Fftlength=2048,pass=4:Tune: sum=39126.5(ms); min=27.44(ms); max=34.94(ms); mean=30.4(ms); s_mean=31.08; sleep=30(ms); delta=1; N=1287; usual
Fftlength=2048,pass=5:Tune: sum=23601.4(ms); min=17.83(ms); max=22.32(ms); mean=18.34(ms); s_mean=18.12; sleep=15(ms); delta=1; N=1287; usual
Fftlength=4096,pass=3:Tune: sum=52909.3(ms); min=19.72(ms); max=21.16(ms); mean=20.54(ms); s_mean=20.44; sleep=15(ms); delta=1; N=2576; usual
Fftlength=4096,pass=4:Tune: sum=41682.5(ms); min=15.54(ms); max=16.89(ms); mean=16.18(ms); s_mean=16.22; sleep=15(ms); delta=1; N=2576; usual
Fftlength=4096,pass=5:Tune: sum=27236.6(ms); min=10.11(ms); max=11.22(ms); mean=10.57(ms); s_mean=10.56; sleep=0(ms); delta=1; N=2576; usual
Fftlength=8192,pass=3:Tune: sum=23380.7(ms); min=4.147(ms); max=5.389(ms); mean=4.538(ms); s_mean=4.635; sleep=0(ms); delta=1; N=5152; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=12966,	N=12966,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=235,	N=235,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=10276,	N=10276,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=3,	N=3,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=3,	N=3,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=0,	N=0,	<>=0,	min=0	max=0


class PoT_transfer_not_needed:		total=12963,	N=12963,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=238,	N=238,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
08:27:27 (14292): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.