Task 8682042304

Name: blc66_2bit_guppi_58838_30369_TIC43647325_0114.13397.0.20.29.201.vlar_0
Workunit: 3946982559
Created: 25 Mar 2020, 2:29:10 UTC
Sent: 25 Mar 2020, 4:51:48 UTC
Report deadline: 14 Jun 2020, 3:16:20 UTC
Received: 25 Mar 2020, 21:31:07 UTC
Server state: Over
Outcome: Success
Client state: Done
Exit status: 0 (0x00000000)
Computer ID: 8908826
Run time: 18 min 54 sec
CPU time: 18 min 50 sec
Validate state: Valid
Credit: 152.98
Device peak FLOPS: 618.07 GFLOPS
Application version: SETI@home v8 v8.22 (opencl_nvidia_SoG) windows_intelx86
Peak working set size: 140.47 MB
Peak swap size: 156.21 MB
Peak disk usage: 0.05 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: Intel(R) Corporation
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i7-8700 CPU @ 3.20GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.028621  NumCfft=117637  NumGauss=0  NumPulse=102906999748  NumTriplet=130478938693
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 2


 OpenCL Platform Name:					 Intel(R) OpenCL
Number of devices:				 1
  Max compute units:				 24
  Max work group size:				 256
  Max clock frequency:				 1200Mhz
  Max memory allocation:			 858992640
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 524288
  Global memory size:				 1717985280
  Constant buffer size:				 858992640
  Max number of constant args:			 8
  Local memory type:				 Scratchpad
  Local memory size:				 65536
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 Intel(R) UHD Graphics 630
  Vendor:					 Intel(R) Corporation
  Driver version:				 26.20.100.7262
  Version:					 OpenCL 2.1 NEO 
  Extensions:					 cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_depth_images cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_intel_subgroups cl_intel_required_subgroup_size cl_intel_subgroups_short cl_khr_spir cl_intel_accelerator cl_intel_media_block_io cl_intel_driver_diagnostics cl_khr_priority_hints cl_khr_throttle_hints cl_khr_create_command_queue cl_khr_fp64 cl_khr_subgroups cl_khr_il_program cl_intel_spirv_device_side_avc_motion_estimation cl_intel_spirv_media_block_io cl_intel_spirv_subgroups cl_khr_spirv_no_integer_wrap_decoration cl_khr_mipmap_image cl_khr_mipmap_image_writes cl_intel_unified_shared_memory_preview cl_intel_planar_yuv cl_intel_packed_yuv cl_intel_motion_estimation cl_intel_device_side_avc_motion_estimation cl_intel_advanced_motion_estimation cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_gl_sharing cl_khr_gl_depth_images cl_khr_gl_event cl_khr_gl_msaa_sharing cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_intel_d3d11_nv12_media_sharing cl_intel_simultaneous_sharing 


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 1
  Max compute units:				 9
  Max work group size:				 1024
  Max clock frequency:				 1784Mhz
  Max memory allocation:			 805306368
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 147456
  Global memory size:				 3221225472
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 1060 3GB
  Vendor:					 NVIDIA Corporation
  Driver version:				 432.00
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.028621
Used GPU device parameters are:
	Number of compute units: 9
	Single buffer allocation size: 128MB
	Total device global memory: 3072MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Pulse: peak=2.526136, time=43.13, period=4.534, d_freq=7695655905.73, score=1.01, chirp=0.29447, fft_len=4k
D:	threshold 0.9057745; unscaled peak power: 0.9124626 exceeds threshold for 0.7384%
Pulse: peak=1.241182, time=48.63, period=1.717, d_freq=7695662015.7, score=1.008, chirp=3.1821, fft_len=1024 
D:	threshold 0.1371375; unscaled peak power: 0.1377288 exceeds threshold for 0.4312%
Pulse: peak=2.322655, time=48.59, period=3.646, d_freq=7695658209.38, score=1.064, chirp=-7.9901, fft_len=2k
D:	threshold 0.3419764; unscaled peak power: 0.3570358 exceeds threshold for 4.404%
Triplet: peak=10.67667, time=46.29, period=28.55, d_freq=7695659533.43, chirp=-9.9613, fft_len=128 
Triplet: peak=10.69583, time=46.29, period=28.55, d_freq=7695659533.43, chirp=-9.9613, fft_len=128 
Pulse: peak=6.7177, time=43.08, period=16.35, d_freq=7695664657.92, score=1.038, chirp=13.558, fft_len=1024 
D:	threshold 0.4669803; unscaled peak power: 0.4824421 exceeds threshold for 3.311%
Pulse: peak=6.75386, time=48.63, period=16.35, d_freq=7695664733.14, score=1.044, chirp=13.558, fft_len=1024 
D:	threshold 0.4648026; unscaled peak power: 0.4824421 exceeds threshold for 3.795%
Autocorr: peak=17.99108, time=51.54, delay=1.928, d_freq=7695660559.54, chirp=17.901, fft_len=128k
Autocorr: peak=17.8121, time=51.54, delay=1.928, d_freq=7695660560.07, chirp=17.911, fft_len=128k
Pulse: peak=1.656967, time=48.56, period=2.492, d_freq=7695660144.44, score=1.006, chirp=-21.307, fft_len=512 
D:	threshold 0.0843441; unscaled peak power: 0.08465414 exceeds threshold for 0.3676%
Triplet: peak=11.94184, time=60.22, period=24.23, d_freq=7695663579.24, chirp=-38.463, fft_len=512 
Pulse: peak=0.3207754, time=48.57, period=0.1401, d_freq=7695660285.65, score=1.019, chirp=46.487, fft_len=64 
D:	threshold 0.005154684; unscaled peak power: 0.005178275 exceeds threshold for 0.4577%
Pulse: peak=1.066951, time=48.58, period=1.28, d_freq=7695656236.54, score=1.004, chirp=54.236, fft_len=256 
D:	threshold 0.03222832; unscaled peak power: 0.03228805 exceeds threshold for 0.1853%
Pulse: peak=2.35586, time=48.56, period=3.657, d_freq=7695664519.79, score=1.033, chirp=57.28, fft_len=512 
D:	threshold 0.1032142; unscaled peak power: 0.1055781 exceeds threshold for 2.29%
Pulse: peak=5.24281, time=49.03, period=13.24, d_freq=7695658048.52, score=1.013, chirp=-69.602, fft_len=8k
D:	threshold 3.160656; unscaled peak power: 3.196408 exceeds threshold for 1.131%
Pulse: peak=5.38014, time=43.13, period=11.45, d_freq=7695659076.99, score=1.018, chirp=72.723, fft_len=4k
D:	threshold 1.578457; unscaled peak power: 1.602811 exceeds threshold for 1.543%
Pulse: peak=5.430589, time=48.5, period=11.45, d_freq=7695659467.42, score=1.028, chirp=72.723, fft_len=4k
D:	threshold 1.566074; unscaled peak power: 1.602811 exceeds threshold for 2.346%
Pulse: peak=5.605093, time=48.59, period=14.18, d_freq=7695661815.58, score=1.027, chirp=-74.54, fft_len=2k
D:	threshold 0.8512302; unscaled peak power: 0.8709215 exceeds threshold for 2.313%
Pulse: peak=7.604752, time=48.59, period=18.85, d_freq=7695657798.53, score=1.004, chirp=-82.806, fft_len=2k
D:	threshold 1.151261; unscaled peak power: 1.155606 exceeds threshold for 0.3774%
Pulse: peak=9.894682, time=43.13, period=26.31, d_freq=7695661890.98, score=1.05, chirp=-87.667, fft_len=4k
D:	threshold 2.476374; unscaled peak power: 2.587417 exceeds threshold for 4.484%
Pulse: peak=9.596508, time=48.5, period=26.31, d_freq=7695661420.33, score=1.018, chirp=-87.667, fft_len=4k
D:	threshold 2.546056; unscaled peak power: 2.587417 exceeds threshold for 1.624%
Pulse: peak=3.6682, time=48.57, period=7.14, d_freq=7695655586.25, score=1.055, chirp=-90.762, fft_len=128 
D:	threshold 0.03521561; unscaled peak power: 0.03670932 exceeds threshold for 4.242%
Pulse: peak=1.057383, time=43.08, period=1.124, d_freq=7695665196.2, score=1.04, chirp=-98.718, fft_len=1024 
D:	threshold 0.1231232; unscaled peak power: 0.1256081 exceeds threshold for 2.018%

Best spike: peak=23.78202, time=25.05, d_freq=7695664088.61, chirp=-27.538, fft_len=16k
Best autocorr: peak=17.99108, time=51.54, delay=1.928, d_freq=7695660559.54, chirp=17.901, fft_len=128k
Best gaussian: peak=0, mean=0, ChiSq=0, time=-2.124e+011, d_freq=0,
	score=-12, null_hyp=0, chirp=0, fft_len=0 
Best pulse: peak=2.322655, time=48.59, period=3.646, d_freq=7695658209.38, score=1.064, chirp=-7.9901, fft_len=2k
Best triplet: peak=11.94184, time=60.22, period=24.23, d_freq=7695663579.24, chirp=-38.463, fft_len=512 
Spike count:    0
Autocorr count: 2
Pulse count:    18
Triplet count:  3
Gaussian count: 0
Wallclock time elapsed since last restart: 1129.1 seconds
Fftlength=32,pass=3:Tune: sum=85717.5(ms); min=4.345(ms); max=73.5(ms); mean=57.72(ms); s_mean=57.17; sleep=60(ms); delta=343; N=1485; usual
Fftlength=32,pass=4:Tune: sum=44880.1(ms); min=3.482(ms); max= 71(ms); mean=46.95(ms); s_mean=58.36; sleep=60(ms); delta=579; N=956; usual
Fftlength=32,pass=5:Tune: sum=31011.4(ms); min=3.424(ms); max=64.1(ms); mean=37.64(ms); s_mean=56.79; sleep=45(ms); delta=689; N=824; usual
Fftlength=64,pass=3:Tune: sum=50949.4(ms); min=2.093(ms); max=79.89(ms); mean=45.29(ms); s_mean=50.5; sleep=45(ms); delta=639; N=1125; usual
Fftlength=64,pass=4:Tune: sum=35074.6(ms); min=1.676(ms); max=72.47(ms); mean=37.67(ms); s_mean=64.33; sleep=60(ms); delta=770; N=931; usual
Fftlength=64,pass=5:Tune: sum=26862(ms); min=1.684(ms); max=72.06(ms); mean=30.56(ms); s_mean=45.32; sleep=45(ms); delta=890; N=879; usual
Fftlength=128,pass=3:Tune: sum=50039.1(ms); min=1.094(ms); max=80.54(ms); mean=43.59(ms); s_mean=66.34; sleep=60(ms); delta=643; N=1148; usual
Fftlength=128,pass=4:Tune: sum=35409(ms); min=1.032(ms); max=89.17(ms); mean=34.34(ms); s_mean=44.94; sleep=45(ms); delta=975; N=1031; usual
Fftlength=128,pass=5:Tune: sum=27881.8(ms); min=0.8871(ms); max=81.59(ms); mean=27.94(ms); s_mean=35.08; sleep=30(ms); delta=1018; N=998; usual
Fftlength=256,pass=3:Tune: sum=49942.5(ms); min=0.5606(ms); max=74.37(ms); mean=41.31(ms); s_mean=41.89; sleep=30(ms); delta=641; N=1209; usual
Fftlength=256,pass=4:Tune: sum=35691(ms); min=0.4504(ms); max=52.73(ms); mean=32.93(ms); s_mean=49.35; sleep=45(ms); delta=1095; N=1084; usual
Fftlength=256,pass=5:Tune: sum=28014.9(ms); min=0.4474(ms); max=41.08(ms); mean=26.86(ms); s_mean=38.65; sleep=30(ms); delta=1053; N=1043; usual
Fftlength=512,pass=3:Tune: sum=35738.3(ms); min=0.3005(ms); max=26.76(ms); mean=21.74(ms); s_mean=24.59; sleep=15(ms); delta=1652; N=1644; usual
Fftlength=512,pass=4:Tune: sum=27307.5(ms); min=0.271(ms); max=20.69(ms); mean=16.84(ms); s_mean=19.37; sleep=15(ms); delta=1630; N=1622; usual
Fftlength=512,pass=5:Tune: sum=21124.4(ms); min=0.2583(ms); max=16.21(ms); mean=13.19(ms); s_mean=14.26; sleep=15(ms); delta=1609; N=1601; usual
Fftlength=1024,pass=3:Tune: sum=86117.5(ms); min=13.21(ms); max=32.27(ms); mean=29.79(ms); s_mean=29.79; sleep=30(ms); delta=1; N=2891; high_perf
Fftlength=2048,pass=3:Tune: sum=85301.6(ms); min=6.497(ms); max=15.98(ms); mean=14.75(ms); s_mean=14.61; sleep=15(ms); delta=1; N=5783; high_perf
Fftlength=4096,pass=3:Tune: sum=85710.8(ms); min=1.66(ms); max=8.497(ms); mean=7.411(ms); s_mean=7.442; sleep=0(ms); delta=1; N=11565; high_perf
Fftlength=8192,pass=3:Tune: sum=57334.7(ms); min=2.445(ms); max=3.263(ms); mean=2.479(ms); s_mean=2.474; sleep=0(ms); delta=1; N=23129; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=57393,	N=57393,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=1155,	N=1155,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=46152,	N=46152,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=17,	N=17,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=7,	N=7,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=0,	N=0,	<>=0,	min=0	max=0


class PoT_transfer_not_needed:		total=57381,	N=57381,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=1168,	N=1168,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
14:29:13 (7580): called boinc_finish(0)

</stderr_txt>
]]>
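
Note on the "D:" lines in the log above: each reported pulse is followed by a line giving a threshold, an unscaled peak power, and a percentage. The percentages appear to be the relative excess of the peak over the threshold, (peak / threshold - 1) * 100. That formula is inferred from the printed numbers, not taken from the application source, so treat it as an assumption. The sketch below recomputes the percentage for three lines copied from the log; the names `excess_percent`, `D_LINE`, and `check_lines` are hypothetical helpers introduced only for this illustration.

```python
import re

# Three "D:" lines copied verbatim from the stderr output above.
SAMPLE_LINES = [
    "D:\tthreshold 0.9057745; unscaled peak power: 0.9124626 exceeds threshold for 0.7384%",
    "D:\tthreshold 0.3419764; unscaled peak power: 0.3570358 exceeds threshold for 4.404%",
    "D:\tthreshold 3.160656; unscaled peak power: 3.196408 exceeds threshold for 1.131%",
]

# Hypothetical pattern for the line layout seen in this log.
D_LINE = re.compile(
    r"threshold\s+([\d.]+); unscaled peak power: ([\d.]+) "
    r"exceeds threshold for ([\d.]+)%"
)


def excess_percent(threshold: float, peak: float) -> float:
    """Relative excess of peak over threshold, in percent (assumed formula)."""
    return (peak / threshold - 1.0) * 100.0


def check_lines(lines):
    """Return (threshold, peak, reported %, recomputed %) for each matching line."""
    rows = []
    for line in lines:
        match = D_LINE.search(line)
        if match is None:
            continue  # not a "D:" threshold line
        threshold, peak, reported = (float(g) for g in match.groups())
        rows.append((threshold, peak, reported, excess_percent(threshold, peak)))
    return rows


if __name__ == "__main__":
    for threshold, peak, reported, computed in check_lines(SAMPLE_LINES):
        print(f"threshold={threshold} peak={peak} "
              f"reported={reported}% recomputed={computed:.4f}%")
```

For these three lines the recomputed values agree with the logged percentages to the precision printed, which is consistent with the assumed formula but does not prove it is what the application computes.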

©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.