Task 8703762453

Name blc64_2bit_guppi_58838_19300_TIC427348923_0079.28137.818.19.28.52.vlar_3
Workunit 3928668518
Created 22 Apr 2020, 12:07:04 UTC
Sent 22 Apr 2020, 12:09:16 UTC
Report deadline 14 Jun 2020, 17:08:58 UTC
Received 22 Apr 2020, 19:53:56 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8838940
Run time 7 min 57 sec
CPU time 7 min 53 sec
Validate state Valid
Credit 115.77
Device peak FLOPS 1,039.44 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG)
windows_intelx86
Peak working set size 133.28 MB
Peak swap size 160.87 MB
Peak disk usage 0.04 MB

Stderr output

<core_client_version>7.16.5</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i7-8700K CPU @ 3.70GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.015309  NumCfft=114891  NumGauss=0  NumPulse=45511475072  NumTriplet=58482608288
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 2


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 1
  Max compute units:				 22
  Max work group size:				 1024
  Max clock frequency:				 1228Mhz
  Max memory allocation:			 1610612736
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 1081344
  Global memory size:				 6442450944
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 980 Ti
  Vendor:					 NVIDIA Corporation
  Driver version:				 450.82
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics


 OpenCL Platform Name:					 Intel(R) OpenCL
Number of devices:				 1
  Max compute units:				 24
  Max work group size:				 256
  Max clock frequency:				 1200Mhz
  Max memory allocation:			 858992640
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 524288
  Global memory size:				 1717985280
  Constant buffer size:				 858992640
  Max number of constant args:			 8
  Local memory type:				 Scratchpad
  Local memory size:				 65536
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 Intel(R) UHD Graphics 630
  Vendor:					 Intel(R) Corporation
  Driver version:				 26.20.100.7262
  Version:					 OpenCL 2.1 NEO 
  Extensions:					 cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_depth_images cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_intel_subgroups cl_intel_required_subgroup_size cl_intel_subgroups_short cl_khr_spir cl_intel_accelerator cl_intel_media_block_io cl_intel_driver_diagnostics cl_khr_priority_hints cl_khr_throttle_hints cl_khr_create_command_queue cl_khr_fp64 cl_khr_subgroups cl_khr_il_program cl_intel_spirv_device_side_avc_motion_estimation cl_intel_spirv_media_block_io cl_intel_spirv_subgroups cl_khr_spirv_no_integer_wrap_decoration cl_khr_mipmap_image cl_khr_mipmap_image_writes cl_intel_unified_shared_memory_preview cl_intel_planar_yuv cl_intel_packed_yuv cl_intel_motion_estimation cl_intel_device_side_avc_motion_estimation cl_intel_advanced_motion_estimation cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_gl_sharing cl_khr_gl_depth_images cl_khr_gl_event cl_khr_gl_msaa_sharing cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_intel_d3d11_nv12_media_sharing cl_intel_simultaneous_sharing 


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.015309
Used GPU device parameters are:
	Number of compute units: 22
	Single buffer allocation size: 128MB
	Total device global memory: 6144MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Triplet: peak=10.32171, time=53.92, period=15.07, d_freq=8180282987.87, chirp=2.3532, fft_len=128 
Pulse: peak=4.023798, time=45.86, period=9.127, d_freq=8180279917.5, score=1.073, chirp=-7.6474, fft_len=1024 
D:	threshold 0.3042566; unscaled peak power: 0.3217266 exceeds threshold for 5.742%
Spike: peak=24.18469, time=62.99, d_freq=8180285069.25, chirp=20.027, fft_len=128k
Triplet: peak=10.53012, time=13.78, period=10.12, d_freq=8180285080.66, chirp=-20.59, fft_len=256 
Triplet: peak=10.20866, time=31.5, period=28.9, d_freq=8180281101.7, chirp=28.238, fft_len=2k
Pulse: peak=2.632918, time=45.86, period=4.616, d_freq=8180281015.76, score=1.004, chirp=42.135, fft_len=1024 
D:	threshold 0.2154826; unscaled peak power: 0.2161283 exceeds threshold for 0.2997%
Triplet: peak=11.40328, time=18.79, period=7.74, d_freq=8180280556.19, chirp=-48.313, fft_len=1024 
Pulse: peak=5.523081, time=45.99, period=12.35, d_freq=8180281644.43, score=1.043, chirp=-52.209, fft_len=4k
D:	threshold 1.640351; unscaled peak power: 1.69984 exceeds threshold for 3.627%

Best spike: peak=24.18469, time=62.99, d_freq=8180285069.25, chirp=20.027, fft_len=128k
Best autocorr: peak=16.6883, time=62.99, delay=2.4136, d_freq=8180283704.15, chirp=17.643, fft_len=128k
Best gaussian: peak=0, mean=0, ChiSq=0, time=-2.124e+011, d_freq=0,
	score=-12, null_hyp=0, chirp=0, fft_len=0 
Best pulse: peak=4.023798, time=45.86, period=9.127, d_freq=8180279917.5, score=1.073, chirp=-7.6474, fft_len=1024 
Best triplet: peak=11.40328, time=18.79, period=7.74, d_freq=8180280556.19, chirp=-48.313, fft_len=1024 
Spike count:    1
Autocorr count: 0
Pulse count:    3
Triplet count:  4
Gaussian count: 0
Wallclock time elapsed since last restart: 472.5 seconds
Fftlength=32,pass=3:Tune: sum=43398.6(ms); min=6.051(ms); max=68.48(ms); mean=44.93(ms); s_mean=57.7; sleep=60(ms); delta=684; N=966; usual
Fftlength=32,pass=4:Tune: sum=29989.7(ms); min=4.316(ms); max=63.82(ms); mean=35.45(ms); s_mean=56.51; sleep=45(ms); delta=824; N=846; usual
Fftlength=32,pass=5:Tune: sum=25478.3(ms); min=4.685(ms); max=66.62(ms); mean=31.69(ms); s_mean=57.96; sleep=60(ms); delta=882; N=804; usual
Fftlength=64,pass=3:Tune: sum=23197.1(ms); min=2.79(ms); max=54.08(ms); mean=25.05(ms); s_mean=42.73; sleep=45(ms); delta=1013; N=926; usual
Fftlength=64,pass=4:Tune: sum=16159.2(ms); min=2.063(ms); max=38.29(ms); mean=18.45(ms); s_mean=28.93; sleep=30(ms); delta=963; N=876; usual
Fftlength=64,pass=5:Tune: sum=9883.77(ms); min=2.32(ms); max=23.31(ms); mean=12.54(ms); s_mean=16.51; sleep=15(ms); delta=919; N=788; usual
Fftlength=128,pass=3:Tune: sum=13865.8(ms); min=1.4(ms); max=32.48(ms); mean=13.98(ms); s_mean=18.64; sleep=15(ms); delta=1079; N=992; usual
Fftlength=128,pass=4:Tune: sum=9707.07(ms); min=1.179(ms); max=23.63(ms); mean=10.19(ms); s_mean=12.75; sleep=15(ms); delta=1040; N=953; usual
Fftlength=128,pass=5:Tune: sum=7901.24(ms); min=1.181(ms); max=20.89(ms); mean=8.645(ms); s_mean=10.51; sleep=0(ms); delta=1001; N=914; usual
Fftlength=256,pass=3:Tune: sum=24206.4(ms); min=0.6974(ms); max=45.64(ms); mean=22.48(ms); s_mean=44.19; sleep=45(ms); delta=1120; N=1077; high_perf
Fftlength=256,pass=4:Tune: sum=3373(ms); min=0.5711(ms); max=13.45(ms); mean=5.669(ms); s_mean=13.37; sleep=15(ms); delta=1077; N=595; usual
Fftlength=256,pass=5:Tune: sum=2750.23(ms); min=0.5968(ms); max=11.72(ms); mean=4.991(ms); s_mean=11.26; sleep=0(ms); delta=1033; N=551; usual
Fftlength=512,pass=3:Tune: sum=28609.7(ms); min=0.3679(ms); max=22.96(ms); mean=18.36(ms); s_mean=22.06; sleep=15(ms); delta=1579; N=1558; high_perf
Fftlength=512,pass=4:Tune: sum=834.834(ms); min=0.2902(ms); max=6.736(ms); mean=2.801(ms); s_mean=6.551; sleep=0(ms); delta=1558; N=298; usual
Fftlength=512,pass=5:Tune: sum=682.419(ms); min=0.3058(ms); max=5.788(ms); mean=2.473(ms); s_mean=5.599; sleep=0(ms); delta=1536; N=276; usual
Fftlength=1024,pass=3:Tune: sum=29233(ms); min=0.1854(ms); max=11.27(ms); mean=10.37(ms); s_mean=10.86; sleep=0(ms); delta=2828; N=2818; high_perf
Fftlength=1024,pass=4:Tune: sum=201.47(ms); min=0.1565(ms); max=3.274(ms); mean=1.361(ms); s_mean=2.835; sleep=0(ms); delta=2817; N=148; usual
Fftlength=1024,pass=5:Tune: sum=166.845(ms); min=0.1592(ms); max=2.781(ms); mean=1.218(ms); s_mean=2.666; sleep=0(ms); delta=2806; N=137; usual
Fftlength=2048,pass=3:Tune: sum=21198.6(ms); min=1.715(ms); max=4.266(ms); mean=3.898(ms); s_mean=3.884; sleep=0(ms); delta=1; N=5439; high_perf
Fftlength=4096,pass=3:Tune: sum=20558.3(ms); min=0.8747(ms); max=2.017(ms); mean=1.89(ms); s_mean=1.873; sleep=0(ms); delta=1; N=10879; high_perf
Fftlength=8192,pass=3:Tune: sum=19519.7(ms); min=0.8871(ms); max=0.9124(ms); mean=0.8971(ms); s_mean=0.8974; sleep=0(ms); delta=1; N=21759; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=52923,	N=52923,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=2879,	N=2879,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=43420,	N=43420,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=7,	N=7,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=5,	N=5,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=0,	N=0,	<>=0,	min=0	max=0


class PoT_transfer_not_needed:		total=52918,	N=52918,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=2885,	N=2885,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
15:46:28 (201916): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.