Task 8579488256

Name blc73_2bit_guppi_58693_08905_HIP98801_0143.21225.0.22.45.52.vlar_0
Workunit 3900383146
Created 23 Feb 2020, 16:14:06 UTC
Sent 23 Feb 2020, 16:44:27 UTC
Report deadline 16 Apr 2020, 21:44:09 UTC
Received 24 Feb 2020, 17:50:36 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8826666
Run time 15 min 28 sec
CPU time 15 min 22 sec
Validate state Valid
Credit 123.63
Device peak FLOPS 298.05 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG)
windows_intelx86
Peak working set size 145.29 MB
Peak swap size 163.14 MB
Peak disk usage 0.03 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: Intel(R) Corporation
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i7-8750H CPU @ 2.20GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.036004  NumCfft=107135  NumGauss=0  NumPulse=37395782784  NumTriplet=50355381664
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 2


 OpenCL Platform Name:					 Intel(R) OpenCL
Number of devices:				 1
  Max compute units:				 24
  Max work group size:				 256
  Max clock frequency:				 1100Mhz
  Max memory allocation:			 858992640
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 524288
  Global memory size:				 1717985280
  Constant buffer size:				 858992640
  Max number of constant args:			 8
  Local memory type:				 Scratchpad
  Local memory size:				 65536
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 Intel(R) UHD Graphics 630
  Vendor:					 Intel(R) Corporation
  Driver version:				 26.20.100.7262
  Version:					 OpenCL 2.1 NEO 
  Extensions:					 cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_depth_images cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_intel_subgroups cl_intel_required_subgroup_size cl_intel_subgroups_short cl_khr_spir cl_intel_accelerator cl_intel_media_block_io cl_intel_driver_diagnostics cl_khr_priority_hints cl_khr_throttle_hints cl_khr_create_command_queue cl_khr_fp64 cl_khr_subgroups cl_khr_il_program cl_intel_spirv_device_side_avc_motion_estimation cl_intel_spirv_media_block_io cl_intel_spirv_subgroups cl_khr_spirv_no_integer_wrap_decoration cl_khr_mipmap_image cl_khr_mipmap_image_writes cl_intel_unified_shared_memory_preview cl_intel_planar_yuv cl_intel_packed_yuv cl_intel_motion_estimation cl_intel_device_side_avc_motion_estimation cl_intel_advanced_motion_estimation cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_gl_sharing cl_khr_gl_depth_images cl_khr_gl_event cl_khr_gl_msaa_sharing cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_intel_d3d11_nv12_media_sharing cl_intel_simultaneous_sharing 


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 1
  Max compute units:				 6
  Max work group size:				 1024
  Max clock frequency:				 1290Mhz
  Max memory allocation:			 1073741824
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 98304
  Global memory size:				 4294967296
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 1050 Ti with Max-Q Design
  Vendor:					 NVIDIA Corporation
  Driver version:				 431.84
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.036004
Used GPU device parameters are:
	Number of compute units: 6
	Single buffer allocation size: 128MB
	Total device global memory: 4096MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Triplet: peak=12.03381, time=53.49, period=18.97, d_freq=4433206558.41, chirp=-30.073, fft_len=128 
Pulse: peak=2.753741, time=45.84, period=4.933, d_freq=4433207423.85, score=1.025, chirp=-36.696, fft_len=512 
D:	threshold 0.1145068; unscaled peak power: 0.1166047 exceeds threshold for 1.832%
Pulse: peak=1.917296, time=45.9, period=2.919, d_freq=4433214420.6, score=1, chirp=40.187, fft_len=2k
D:	threshold 0.3491244; unscaled peak power: 0.3491704 exceeds threshold for 0.01317%
Pulse: peak=2.336929, time=45.84, period=4.452, d_freq=4433206794.74, score=1.018, chirp=59.788, fft_len=512 
D:	threshold 0.09974346; unscaled peak power: 0.1010213 exceeds threshold for 1.281%
Pulse: peak=2.752104, time=45.82, period=5.447, d_freq=4433210987.29, score=1, chirp=-87.714, fft_len=256 
D:	threshold 0.05904617; unscaled peak power: 0.05904836 exceeds threshold for 0.003703%
Pulse: peak=6.88439, time=45.82, period=15.69, d_freq=4433210966.38, score=1.017, chirp=-89.145, fft_len=256 
D:	threshold 0.1211962; unscaled peak power: 0.1229691 exceeds threshold for 1.463%

Best spike: peak=23.94209, time=39.37, d_freq=4433207300.82, chirp=15.841, fft_len=16k
Best autocorr: peak=17.48213, time=5.727, delay=1.0312, d_freq=4433212242.05, chirp=-6.5876, fft_len=128k
Best gaussian: peak=0, mean=0, ChiSq=0, time=-2.124e+011, d_freq=0,
	score=-12, null_hyp=0, chirp=0, fft_len=0 
Best pulse: peak=2.753741, time=45.84, period=4.933, d_freq=4433207423.85, score=1.025, chirp=-36.696, fft_len=512 
Best triplet: peak=12.03381, time=53.49, period=18.97, d_freq=4433206558.41, chirp=-30.073, fft_len=128 
Spike count:    0
Autocorr count: 0
Pulse count:    5
Triplet count:  1
Gaussian count: 0
Wallclock time elapsed since last restart: 918.1 seconds
Fftlength=32,pass=3:Tune: sum=35115.2(ms); min=3.976(ms); max=66.99(ms); mean=42.26(ms); s_mean=47.27; sleep=45(ms); delta=655; N=831; usual
Fftlength=32,pass=4:Tune: sum=23437.2(ms); min=3.467(ms); max=63.56(ms); mean=31.33(ms); s_mean=55.28; sleep=45(ms); delta=823; N=748; usual
Fftlength=32,pass=5:Tune: sum=18888.7(ms); min=3.755(ms); max=58.07(ms); mean=26.87(ms); s_mean=43.02; sleep=45(ms); delta=798; N=703; usual
Fftlength=64,pass=3:Tune: sum=32814.9(ms); min=2.533(ms); max=67.89(ms); mean=37.59(ms); s_mean=57.09; sleep=60(ms); delta=688; N=873; usual
Fftlength=64,pass=4:Tune: sum=22209(ms); min=2.068(ms); max=60.34(ms); mean=26.99(ms); s_mean=50.98; sleep=45(ms); delta=855; N=823; usual
Fftlength=64,pass=5:Tune: sum=17544(ms); min=1.84(ms); max=51.18(ms); mean=22.64(ms); s_mean=39.95; sleep=30(ms); delta=822; N=775; usual
Fftlength=128,pass=3:Tune: sum=31729.8(ms); min=1.306(ms); max=64.27(ms); mean=34.27(ms); s_mean=56.31; sleep=45(ms); delta=689; N=926; usual
Fftlength=128,pass=4:Tune: sum=21956.9(ms); min=0.9155(ms); max=62.94(ms); mean=24.7(ms); s_mean=36.94; sleep=30(ms); delta=913; N=889; usual
Fftlength=128,pass=5:Tune: sum=17419(ms); min=0.9533(ms); max=54.63(ms); mean=20.54(ms); s_mean=29.24; sleep=30(ms); delta=871; N=848; usual
Fftlength=256,pass=3:Tune: sum=32250.1(ms); min=0.5723(ms); max=66.74(ms); mean=32.71(ms); s_mean=58.03; sleep=60(ms); delta=690; N=986; usual
Fftlength=256,pass=4:Tune: sum=22440.2(ms); min=0.5003(ms); max=47.73(ms); mean=23.87(ms); s_mean=40.8; sleep=30(ms); delta=952; N=940; usual
Fftlength=256,pass=5:Tune: sum=17924.8(ms); min=0.4863(ms); max=36.15(ms); mean=20.01(ms); s_mean=32.81; sleep=30(ms); delta=907; N=896; usual
Fftlength=512,pass=3:Tune: sum=31746.9(ms); min=0.299(ms); max=33.33(ms); mean=23.87(ms); s_mean=28.46; sleep=30(ms); delta=1336; N=1330; usual
Fftlength=512,pass=4:Tune: sum=23176.8(ms); min=0.2643(ms); max=24.43(ms); mean=17.72(ms); s_mean=20.83; sleep=15(ms); delta=1314; N=1308; usual
Fftlength=512,pass=5:Tune: sum=17227.4(ms); min=0.259(ms); max= 19(ms); mean=13.4(ms); s_mean=15.4; sleep=15(ms); delta=1291; N=1286; usual
Fftlength=1024,pass=3:Tune: sum=70058.6(ms); min=0.1871(ms); max=37.73(ms); mean=29.93(ms); s_mean=31.73; sleep=30(ms); delta=2344; N=2341; high_perf
Fftlength=1024,pass=4:Tune: sum=604.049(ms); min=0.1357(ms); max=10.46(ms); mean=3.872(ms); s_mean=8.726; sleep=0(ms); delta=2334; N=156; usual
Fftlength=1024,pass=5:Tune: sum=482.476(ms); min=0.1407(ms); max=7.934(ms); mean=3.305(ms); s_mean=7.697; sleep=0(ms); delta=2323; N=146; usual
Fftlength=2048,pass=3:Tune: sum=54756.6(ms); min=5.494(ms); max=13.16(ms); mean=12.25(ms); s_mean=12.31; sleep=15(ms); delta=1; N=4469; high_perf
Fftlength=4096,pass=3:Tune: sum=56869(ms); min=2.639(ms); max=6.769(ms); mean=6.362(ms); s_mean=6.364; sleep=0(ms); delta=1; N=8939; high_perf
Fftlength=8192,pass=3:Tune: sum=37226.5(ms); min=2.053(ms); max=2.453(ms); mean=2.082(ms); s_mean=2.079; sleep=0(ms); delta=1; N=17877; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=47223,	N=47223,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=823,	N=823,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=35669,	N=35669,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=14,	N=14,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=12,	N=12,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=0,	N=0,	<>=0,	min=0	max=0


class PoT_transfer_not_needed:		total=47212,	N=47212,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=835,	N=835,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
09:05:02 (7176): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.