Task 8703810482

Name 28fe20ab.30065.4566.12.39.224_3
Workunit 3909665359
Created 22 Apr 2020, 14:51:58 UTC
Sent 22 Apr 2020, 14:51:58 UTC
Report deadline 14 Jun 2020, 10:31:44 UTC
Received 25 Apr 2020, 9:24:16 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8068685
Run time 32 min 32 sec
CPU time 2 min 59 sec
Validate state Valid
Credit 112.18
Device peak FLOPS 129.84 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG)
windows_intelx86
Peak working set size 129.18 MB
Peak swap size 146.13 MB
Peak disk usage 0.05 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: Intel(R) Corporation
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i7-6700 CPU @ 3.40GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
Low-performance GPU detected, default period_iterations_num set to 500
For low-performance GPU path use_sleep enabled with 1ms per iteration, high prec timer enabled
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.437099  NumCfft=193681  NumGauss=1081617910  NumPulse=226419478359  NumTriplet=452806473435
Currently allocated 229 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 2


 OpenCL Platform Name:					 Intel(R) OpenCL
Number of devices:				 1
  Max compute units:				 24
  Max work group size:				 256
  Max clock frequency:				 1150Mhz
  Max memory allocation:			 858992640
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 524288
  Global memory size:				 1717985280
  Constant buffer size:				 858992640
  Max number of constant args:			 8
  Local memory type:				 Scratchpad
  Local memory size:				 65536
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 Intel(R) HD Graphics 530
  Vendor:					 Intel(R) Corporation
  Driver version:				 26.20.100.7263
  Version:					 OpenCL 2.1 NEO 
  Extensions:					 cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_depth_images cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_intel_subgroups cl_intel_required_subgroup_size cl_intel_subgroups_short cl_khr_spir cl_intel_accelerator cl_intel_media_block_io cl_intel_driver_diagnostics cl_khr_priority_hints cl_khr_throttle_hints cl_khr_create_command_queue cl_khr_fp64 cl_khr_subgroups cl_khr_il_program cl_intel_spirv_device_side_avc_motion_estimation cl_intel_spirv_media_block_io cl_intel_spirv_subgroups cl_khr_spirv_no_integer_wrap_decoration cl_khr_mipmap_image cl_khr_mipmap_image_writes cl_intel_unified_shared_memory_preview cl_intel_planar_yuv cl_intel_packed_yuv cl_intel_motion_estimation cl_intel_device_side_avc_motion_estimation cl_intel_advanced_motion_estimation cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_gl_sharing cl_khr_gl_depth_images cl_khr_gl_event cl_khr_gl_msaa_sharing cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_intel_d3d11_nv12_media_sharing cl_intel_simultaneous_sharing 


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 1
  Max compute units:				 3
  Max work group size:				 1024
  Max clock frequency:				 1124Mhz
  Max memory allocation:			 536870912
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 49152
  Global memory size:				 2147483648
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 Quadro K620
  Vendor:					 NVIDIA Corporation
  Driver version:				 361.91
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.437099
Used GPU device parameters are:
	Number of compute units: 3
	Single buffer allocation size: 128MB
	Total device global memory: 2048MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: yes
	HighPerformanceGPU path: no
period_iterations_num=500
Pulse: peak=6.827114, time=61.35, period=2.051, d_freq=1419683381.01, score=1.086, chirp=-6.2036, fft_len=128 
D:	threshold 0.05721848; unscaled peak power: 0.06146079 exceeds threshold for 7.414%
Pulse: peak=6.391202, time=61.35, period=2.051, d_freq=1419683393.86, score=1.017, chirp=-7.2379, fft_len=128 
D:	threshold 0.05650856; unscaled peak power: 0.05731779 exceeds threshold for 1.432%
Spike: peak=24.51505, time=33.55, d_freq=1419685027.67, chirp=9.0661, fft_len=128k
Spike: peak=24.44943, time=33.55, d_freq=1419685027.68, chirp=9.0707, fft_len=128k
Pulse: peak=6.825706, time=61.35, period=2.051, d_freq=1419683369, score=1.086, chirp=-11.374, fft_len=128 
D:	threshold 0.05863889; unscaled peak power: 0.06297518 exceeds threshold for 7.395%
Pulse: peak=6.825749, time=61.35, period=2.051, d_freq=1419683381.85, score=1.086, chirp=-12.408, fft_len=128 
D:	threshold 0.05833266; unscaled peak power: 0.06264665 exceeds threshold for 7.396%
Spike: peak=24.11809, time=6.711, d_freq=1419683607.62, chirp=-18.06, fft_len=128k
Pulse: peak=4.470146, time=61.35, period=1.525, d_freq=1419689512.82, score=1.029, chirp=-26.884, fft_len=128 
D:	threshold 0.04225594; unscaled peak power: 0.04324421 exceeds threshold for 2.339%
Triplet: peak=10.87398, time=74.84, period=4.588, d_freq=1419691123.78, chirp=36.189, fft_len=128 
Triplet: peak=11.03559, time=74.84, period=4.588, d_freq=1419691124.89, chirp=37.224, fft_len=128 
Triplet: peak=11.23245, time=74.84, period=4.588, d_freq=1419691125.99, chirp=38.258, fft_len=128 
Pulse: peak=5.4795, time=101.2, period=1.717, d_freq=1419684735.41, score=1.01, chirp=38.258, fft_len=128 
D:	threshold 0.0499313; unscaled peak power: 0.05035435 exceeds threshold for 0.8473%
Triplet: peak=11.16011, time=74.84, period=4.588, d_freq=1419691127.1, chirp=39.292, fft_len=128 
Triplet: peak=10.82185, time=74.84, period=4.588, d_freq=1419691128.13, chirp=40.325, fft_len=128 
Triplet: peak=10.53129, time=74.84, period=4.588, d_freq=1419691129.24, chirp=41.36, fft_len=128 

Best spike: peak=24.51505, time=33.55, d_freq=1419685027.67, chirp=9.0661, fft_len=128k
Best autocorr: peak=17.38911, time=100.7, delay=6.4422, d_freq=1419689623.71, chirp=21.097, fft_len=128k
Best gaussian: peak=3.601461, mean=0.5504479, ChiSq=1.375025, time=89.76, d_freq=1419691415.11,
	score=-0.8154984, null_hyp=2.183604, chirp=-83.648, fft_len=16k
Best pulse: peak=6.827114, time=61.35, period=2.051, d_freq=1419683381.01, score=1.086, chirp=-6.2036, fft_len=128 
Best triplet: peak=11.23245, time=74.84, period=4.588, d_freq=1419691125.99, chirp=38.258, fft_len=128 
Spike count:    3
Autocorr count: 0
Pulse count:    6
Triplet count:  6
Gaussian count: 0
Wallclock time elapsed since last restart: 1948.5 seconds
Fftlength=8,pass=3:Tune: sum=19975.7(ms); min=3.998(ms); max=72.24(ms); mean=58.58(ms); s_mean=41.83; sleep=30(ms); delta=92; N=341; usual
Fftlength=8,pass=4:Tune: sum=14305(ms); min=5.509(ms); max=69.38(ms); mean=55.45(ms); s_mean=54.92; sleep=45(ms); delta=120; N=258; usual
Fftlength=8,pass=5:Tune: sum=10802.2(ms); min=5.818(ms); max=65.51(ms); mean=50.95(ms); s_mean=54.13; sleep=45(ms); delta=135; N=212; usual
Fftlength=16,pass=3:Tune: sum=13851.9(ms); min=2.381(ms); max=70.72(ms); mean=49.47(ms); s_mean=56.99; sleep=45(ms); delta=140; N=280; usual
Fftlength=16,pass=4:Tune: sum=9556.63(ms); min=3.954(ms); max=68.54(ms); mean=41.37(ms); s_mean=46.26; sleep=45(ms); delta=184; N=231; usual
Fftlength=16,pass=5:Tune: sum=7218.32(ms); min=2.096(ms); max=65.83(ms); mean=35.04(ms); s_mean= 57; sleep=60(ms); delta=205; N=206; usual
Fftlength=32,pass=3:Tune: sum=11693.3(ms); min=0.9379(ms); max=71.09(ms); mean=43.8(ms); s_mean=57.76; sleep=60(ms); delta=163; N=267; usual
Fftlength=32,pass=4:Tune: sum=8225.96(ms); min=1.078(ms); max=63.27(ms); mean=34.71(ms); s_mean=55.75; sleep=45(ms); delta=195; N=237; usual
Fftlength=32,pass=5:Tune: sum=6334.12(ms); min=1.21(ms); max=64.74(ms); mean=28.03(ms); s_mean=38.12; sleep=30(ms); delta=237; N=226; usual
Fftlength=64,pass=3:Tune: sum=11940.8(ms); min=1.92(ms); max=70.48(ms); mean=43.26(ms); s_mean=61.8; sleep=60(ms); delta=159; N=276; usual
Fftlength=64,pass=4:Tune: sum=8345.24(ms); min=1.357(ms); max=86.22(ms); mean=31.37(ms); s_mean=38.84; sleep=30(ms); delta=271; N=266; usual
Fftlength=64,pass=5:Tune: sum=6256.92(ms); min=0.6149(ms); max=65.68(ms); mean=24.93(ms); s_mean=58.95; sleep=60(ms); delta=255; N=251; usual
Fftlength=128,pass=3:Tune: sum=13738.4(ms); min=65.59(ms); max=76.24(ms); mean=71.18(ms); s_mean=72.41; sleep=75(ms); delta=1; N=193; usual
Fftlength=128,pass=4:Tune: sum=8995.14(ms); min=44.16(ms); max=50.76(ms); mean=46.85(ms); s_mean=47.01; sleep=45(ms); delta=1; N=192; usual
Fftlength=128,pass=5:Tune: sum=6421.28(ms); min=33.41(ms); max=34.46(ms); mean=33.62(ms); s_mean=33.55; sleep=30(ms); delta=1; N=191; usual
Fftlength=256,pass=3:Tune: sum=13006.5(ms); min=32.38(ms); max=34.58(ms); mean=33.61(ms); s_mean=33.46; sleep=30(ms); delta=1; N=387; usual
Fftlength=256,pass=4:Tune: sum=9638.35(ms); min=24.28(ms); max=25.39(ms); mean=24.91(ms); s_mean=25.06; sleep=15(ms); delta=1; N=387; usual
Fftlength=256,pass=5:Tune: sum=6566.71(ms); min=16.71(ms); max=17.35(ms); mean=16.97(ms); s_mean=16.98; sleep=15(ms); delta=1; N=387; usual
Fftlength=512,pass=3:Tune: sum=28002.5(ms); min=16.43(ms); max=36.78(ms); mean=36.23(ms); s_mean=36.26; sleep=30(ms); delta=1; N=773; high_perf
Fftlength=1024,pass=3:Tune: sum=9136.95(ms); min=5.643(ms); max=6.269(ms); mean=5.906(ms); s_mean=5.934; sleep=0(ms); delta=1; N=1547; usual
Fftlength=2048,pass=3:Tune: sum=5927.16(ms); min=1.838(ms); max=2.092(ms); mean=1.915(ms); s_mean=1.942; sleep=0(ms); delta=1; N=3095; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=53,	N=53,	<>=1,	min=1	max=1
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=24322,	N=24322,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=411,	N=411,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=12357,	N=12357,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=10,	N=10,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=4,	N=4,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=1,	N=1,	<>=1,	min=1	max=1


class PoT_transfer_not_needed:		total=24319,	N=24319,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=415,	N=415,	<>=1,	min=1	max=1

class SleepQuantum:		total=22394.791,	N=12357,	<>=1.8123162,	min=1.3875456	max=1.9972755

GPU device sync requested...  ...GPU device synched
21:01:13 (2464): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.