Task 8615744010

Name blc44_2bit_guppi_58838_08730_TIC449050247_0041.32368.818.20.29.53.vlar_0
Workunit 3916849845
Created 5 Mar 2020, 20:11:33 UTC
Sent 5 Mar 2020, 20:12:54 UTC
Report deadline 28 Apr 2020, 1:12:36 UTC
Received 5 Mar 2020, 22:32:03 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8885744
Run time 4 min 27 sec
CPU time 4 min 25 sec
Validate state Valid
Credit 107.31
Device peak FLOPS 3,348.95 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG)
windows_intelx86
Peak working set size 137.63 MB
Peak swap size 166.60 MB
Peak disk usage 0.03 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i9-9900K CPU @ 3.60GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.012338  NumCfft=105485  NumGauss=0  NumPulse=35668628352  NumTriplet=48626261152
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 2


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 1
  Max compute units:				 48
  Max work group size:				 1024
  Max clock frequency:				 1815Mhz
  Max memory allocation:			 2147483648
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 1572864
  Global memory size:				 8589934592
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce RTX 2080 SUPER
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.50
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics


 OpenCL Platform Name:					 Intel(R) OpenCL
Number of devices:				 1
  Max compute units:				 24
  Max work group size:				 256
  Max clock frequency:				 1200Mhz
  Max memory allocation:			 858992640
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 524288
  Global memory size:				 1717985280
  Constant buffer size:				 858992640
  Max number of constant args:			 8
  Local memory type:				 Scratchpad
  Local memory size:				 65536
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 Intel(R) UHD Graphics 630
  Vendor:					 Intel(R) Corporation
  Driver version:				 26.20.100.7263
  Version:					 OpenCL 2.1 NEO 
  Extensions:					 cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_depth_images cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_intel_subgroups cl_intel_required_subgroup_size cl_intel_subgroups_short cl_khr_spir cl_intel_accelerator cl_intel_media_block_io cl_intel_driver_diagnostics cl_khr_priority_hints cl_khr_throttle_hints cl_khr_create_command_queue cl_khr_fp64 cl_khr_subgroups cl_khr_il_program cl_intel_spirv_device_side_avc_motion_estimation cl_intel_spirv_media_block_io cl_intel_spirv_subgroups cl_khr_spirv_no_integer_wrap_decoration cl_khr_mipmap_image cl_khr_mipmap_image_writes cl_intel_unified_shared_memory_preview cl_intel_planar_yuv cl_intel_packed_yuv cl_intel_motion_estimation cl_intel_device_side_avc_motion_estimation cl_intel_advanced_motion_estimation cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_gl_sharing cl_khr_gl_depth_images cl_khr_gl_event cl_khr_gl_msaa_sharing cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_intel_d3d11_nv12_media_sharing cl_intel_simultaneous_sharing 


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.012338
Used GPU device parameters are:
	Number of compute units: 48
	Single buffer allocation size: 128MB
	Total device global memory: 8192MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Pulse: peak=10.38503, time=45.86, period=27.69, d_freq=10439088410.5, score=1.045, chirp=9.5716, fft_len=1024 
D:	threshold 0.6948107; unscaled peak power: 0.7231387 exceeds threshold for 4.077%
Spike: peak=24.20671, time=5.727, d_freq=10439081137.3, chirp=-16.997, fft_len=128k
Spike: peak=24.23426, time=85.9, d_freq=10439078255.3, chirp=-28.276, fft_len=128k
Spike: peak=24.58505, time=85.9, d_freq=10439078255.3, chirp=-28.281, fft_len=128k
Pulse: peak=5.456564, time=45.99, period=13.51, d_freq=10439084085.8, score=1.027, chirp=-47.249, fft_len=4k
D:	threshold 1.725624; unscaled peak power: 1.765436 exceeds threshold for 2.307%
Pulse: peak=2.917665, time=45.82, period=5.422, d_freq=10439078986.7, score=1.018, chirp=58.556, fft_len=64 
D:	threshold 0.01501569; unscaled peak power: 0.01521685 exceeds threshold for 1.34%
Pulse: peak=0.5222501, time=45.82, period=0.3492, d_freq=10439088656.7, score=1.105, chirp=90.086, fft_len=64 
D:	threshold 0.005784708; unscaled peak power: 0.005980307 exceeds threshold for 3.381%

Best spike: peak=24.58505, time=85.9, d_freq=10439078255.3, chirp=-28.281, fft_len=128k
Best autocorr: peak=17.0537, time=62.99, delay=3.0749, d_freq=10439082088.8, chirp=-16.035, fft_len=128k
Best gaussian: peak=0, mean=0, ChiSq=0, time=-2.124e+011, d_freq=0,
	score=-12, null_hyp=0, chirp=0, fft_len=0 
Best pulse: peak=0.5222501, time=45.82, period=0.3492, d_freq=10439088656.7, score=1.105, chirp=90.086, fft_len=64 
Best triplet: peak=0, time=-2.124e+011, period=0, d_freq=0, chirp=0, fft_len=0 
Spike count:    3
Autocorr count: 0
Pulse count:    4
Triplet count:  0
Gaussian count: 0
Wallclock time elapsed since last restart: 262.5 seconds
Fftlength=32,pass=3:Tune: sum=25909.6(ms); min=4.434(ms); max=67.64(ms); mean=33.69(ms); s_mean=50.11; sleep=45(ms); delta=802; N=769; usual
Fftlength=32,pass=4:Tune: sum=14633.7(ms); min=2.937(ms); max=41.46(ms); mean=20.55(ms); s_mean=33.52; sleep=30(ms); delta=839; N=712; usual
Fftlength=32,pass=5:Tune: sum=9365.04(ms); min=2.609(ms); max=28.31(ms); mean=14.66(ms); s_mean=19.67; sleep=15(ms); delta=798; N=639; usual
Fftlength=64,pass=3:Tune: sum=13174.1(ms); min=1.995(ms); max=37.59(ms); mean=16.2(ms); s_mean=20.11; sleep=15(ms); delta=908; N=813; usual
Fftlength=64,pass=4:Tune: sum=9136.73(ms); min=1.648(ms); max=27.16(ms); mean=12.02(ms); s_mean=21.48; sleep=15(ms); delta=855; N=760; usual
Fftlength=64,pass=5:Tune: sum=6774.78(ms); min=1.433(ms); max=21.78(ms); mean=9.582(ms); s_mean=15.96; sleep=15(ms); delta=802; N=707; usual
Fftlength=128,pass=3:Tune: sum=7385.31(ms); min=1.004(ms); max=22.07(ms); mean=8.792(ms); s_mean=12.13; sleep=15(ms); delta=935; N=840; usual
Fftlength=128,pass=4:Tune: sum=5091.1(ms); min=0.7908(ms); max=16.61(ms); mean=6.38(ms); s_mean=8.637; sleep=0(ms); delta=893; N=798; usual
Fftlength=128,pass=5:Tune: sum=3038.59(ms); min=0.7233(ms); max=11.21(ms); mean=4.214(ms); s_mean=5.017; sleep=0(ms); delta=864; N=721; usual
Fftlength=256,pass=3:Tune: sum=8671.41(ms); min=0.4699(ms); max=25.03(ms); mean=9.809(ms); s_mean=20.89; sleep=15(ms); delta=979; N=884; high_perf
Fftlength=256,pass=4:Tune: sum=1618.11(ms); min=0.3912(ms); max=9.237(ms); mean=2.98(ms); s_mean=6.551; sleep=0(ms); delta=936; N=543; usual
Fftlength=256,pass=5:Tune: sum=1185.69(ms); min=0.3707(ms); max=6.578(ms); mean=2.376(ms); s_mean=4.9; sleep=0(ms); delta=892; N=499; usual
Fftlength=512,pass=3:Tune: sum=10551.3(ms); min=0.2478(ms); max=14.42(ms); mean=8.502(ms); s_mean=10.88; sleep=0(ms); delta=1288; N=1241; high_perf
Fftlength=512,pass=4:Tune: sum= 395(ms); min=0.2035(ms); max=4.318(ms); mean=1.452(ms); s_mean=3.326; sleep=0(ms); delta=1267; N=272; usual
Fftlength=512,pass=5:Tune: sum=290.237(ms); min=0.1888(ms); max=2.84(ms); mean=1.161(ms); s_mean=2.371; sleep=0(ms); delta=1245; N=250; usual
Fftlength=1024,pass=3:Tune: sum=10915.8(ms); min=0.1126(ms); max=7.772(ms); mean=4.919(ms); s_mean=5.238; sleep=0(ms); delta=2242; N=2219; high_perf
Fftlength=1024,pass=4:Tune: sum=94.673(ms); min=0.1022(ms); max=1.564(ms); mean=0.7013(ms); s_mean=1.295; sleep=0(ms); delta=2231; N=135; usual
Fftlength=1024,pass=5:Tune: sum=69.5763(ms); min=0.09994(ms); max=1.221(ms); mean=0.5611(ms); s_mean=1.105; sleep=0(ms); delta=2220; N=124; usual
Fftlength=2048,pass=3:Tune: sum=10451.5(ms); min=1.09(ms); max=4.084(ms); mean=2.452(ms); s_mean=2.484; sleep=0(ms); delta=1; N=4263; high_perf
Fftlength=4096,pass=3:Tune: sum=9058.65(ms); min=0.4576(ms); max=1.973(ms); mean=1.063(ms); s_mean=1.04; sleep=0(ms); delta=1; N=8525; high_perf
Fftlength=8192,pass=3:Tune: sum=3506.25(ms); min=0.1995(ms); max=0.4526(ms); mean=0.2056(ms); s_mean=0.206; sleep=0(ms); delta=1; N=17051; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=34819,	N=34819,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=11577,	N=11577,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=34022,	N=34022,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=13,	N=13,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=10,	N=10,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=0,	N=0,	<>=0,	min=0	max=0


class PoT_transfer_not_needed:		total=34811,	N=34811,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=11586,	N=11586,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
16:31:10 (20180): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.