Task 8695480918

Name 23mr20ac.7912.22562.11.38.223_0
Workunit 3952930178
Created 30 Mar 2020, 20:05:50 UTC
Sent 30 Mar 2020, 20:06:22 UTC
Report deadline 22 May 2020, 23:12:42 UTC
Received 2 Apr 2020, 6:33:09 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8369221
Run time 27 min 6 sec
CPU time 26 min 53 sec
Validate state Valid
Credit 103.32
Device peak FLOPS 226.35 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG)
windows_intelx86
Peak working set size 120.73 MB
Peak swap size 144.55 MB
Peak disk usage 0.04 MB

Stderr output

<core_client_version>7.8.3</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: Intel(R) Corporation
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i7-6700HQ CPU @ 2.60GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.430271  NumCfft=195249  NumGauss=1098774550  NumPulse=226375366912  NumTriplet=452753980650
Currently allocated 229 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 2


 OpenCL Platform Name:					 Intel(R) OpenCL
Number of devices:				 1
  Max compute units:				 24
  Max work group size:				 256
  Max clock frequency:				 1050Mhz
  Max memory allocation:			 858966016
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 524288
  Global memory size:				 1717932032
  Constant buffer size:				 858966016
  Max number of constant args:			 8
  Local memory type:				 Scratchpad
  Local memory size:				 65536
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 Intel(R) HD Graphics 530
  Vendor:					 Intel(R) Corporation
  Driver version:				 23.20.16.4973
  Version:					 OpenCL 2.1 NEO 
  Extensions:					 cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_depth_images cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_intel_subgroups cl_intel_required_subgroup_size cl_intel_subgroups_short cl_khr_spir cl_intel_accelerator cl_intel_media_block_io cl_intel_driver_diagnostics cl_intel_device_side_avc_motion_estimation cl_khr_priority_hints cl_khr_subgroups cl_khr_il_program cl_khr_fp64 cl_intel_planar_yuv cl_intel_packed_yuv cl_intel_motion_estimation cl_intel_advanced_motion_estimation cl_khr_gl_sharing cl_khr_gl_depth_images cl_khr_gl_event cl_khr_gl_msaa_sharing cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_intel_d3d11_nv12_media_sharing cl_intel_simultaneous_sharing 


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 1
  Max compute units:				 5
  Max work group size:				 1024
  Max clock frequency:				 1176Mhz
  Max memory allocation:			 1073741824
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 81920
  Global memory size:				 4294967296
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 960M
  Vendor:					 NVIDIA Corporation
  Driver version:				 376.67
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.430271
Used GPU device parameters are:
	Number of compute units: 5
	Single buffer allocation size: 128MB
	Total device global memory: 4096MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Autocorr: peak=19.73731, time=100.7, delay=3.4803, d_freq=1419680222.06, chirp=24.713, fft_len=128k
Autocorr: peak=17.90961, time=20.13, delay=2.446, d_freq=1419678326.46, chirp=29.409, fft_len=128k
Triplet: peak=9.965421, time=20.87, period=0.5505, d_freq=1419679738.09, chirp=-36.515, fft_len=512 

Best spike: peak=23.55765, time=62.08, d_freq=1419673939.36, chirp=-46.982, fft_len=32k
Best autocorr: peak=19.73731, time=100.7, delay=3.4803, d_freq=1419680222.06, chirp=24.713, fft_len=128k
Best gaussian: peak=3.31931, mean=0.5412068, ChiSq=1.334211, time=51.17, d_freq=1419674097.85,
	score=-1.049648, null_hyp=2.146624, chirp=-78.056, fft_len=16k
Best pulse: peak=1.493899, time=56.18, period=0.3648, d_freq=1419674928.78, score=0.9926, chirp=44.785, fft_len=512 
Best triplet: peak=9.965421, time=20.87, period=0.5505, d_freq=1419679738.09, chirp=-36.515, fft_len=512 
Spike count:    0
Autocorr count: 2
Pulse count:    0
Triplet count:  1
Gaussian count: 0
Wallclock time elapsed since last restart: 1614.5 seconds
Fftlength=8,pass=3:Tune: sum=17062.7(ms); min=6.734(ms); max=71.46(ms); mean=57.07(ms); s_mean=49.72; sleep=45(ms); delta=113; N=299; usual
Fftlength=8,pass=4:Tune: sum=11841(ms); min=8.674(ms); max=68.43(ms); mean=54.07(ms); s_mean=57.3; sleep=60(ms); delta=143; N=219; usual
Fftlength=8,pass=5:Tune: sum=8828.12(ms); min=4.84(ms); max=67.11(ms); mean=50.45(ms); s_mean=56.89; sleep=45(ms); delta=171; N=175; usual
Fftlength=16,pass=3:Tune: sum=9059.21(ms); min=3.214(ms); max=69.54(ms); mean=40.09(ms); s_mean=58.08; sleep=60(ms); delta=214; N=226; usual
Fftlength=16,pass=4:Tune: sum=6262.7(ms); min=2.63(ms); max=59.76(ms); mean=30.4(ms); s_mean=48.75; sleep=45(ms); delta=245; N=206; usual
Fftlength=16,pass=5:Tune: sum=4937.25(ms); min=2.969(ms); max=47.59(ms); mean=25.71(ms); s_mean=36.38; sleep=30(ms); delta=231; N=192; usual
Fftlength=32,pass=3:Tune: sum=6158.19(ms); min=1.632(ms); max=57.81(ms); mean=25.24(ms); s_mean=38.9; sleep=30(ms); delta=263; N=244; usual
Fftlength=32,pass=4:Tune: sum=4263.85(ms); min=0.9447(ms); max=42.02(ms); mean=18.54(ms); s_mean=27.2; sleep=30(ms); delta=249; N=230; usual
Fftlength=32,pass=5:Tune: sum=3582.72(ms); min=1.487(ms); max=36.29(ms); mean=16.59(ms); s_mean=22.37; sleep=15(ms); delta=235; N=216; usual
Fftlength=64,pass=3:Tune: sum=5925.55(ms); min=0.7285(ms); max=54.84(ms); mean=21.09(ms); s_mean=26.06; sleep=15(ms); delta=290; N=281; usual
Fftlength=64,pass=4:Tune: sum=4238.88(ms); min=0.5196(ms); max=44.59(ms); mean=15.53(ms); s_mean=19.16; sleep=15(ms); delta=282; N=273; usual
Fftlength=64,pass=5:Tune: sum=3569.68(ms); min=0.5733(ms); max=36.61(ms); mean=14.11(ms); s_mean=34.28; sleep=30(ms); delta=262; N=253; usual
Fftlength=128,pass=3:Tune: sum=6014.73(ms); min=29.3(ms); max=32.32(ms); mean=30.53(ms); s_mean=30.36; sleep=30(ms); delta=1; N=197; usual
Fftlength=128,pass=4:Tune: sum=5028.76(ms); min=21.87(ms); max=27.84(ms); mean=25.53(ms); s_mean=24.72; sleep=15(ms); delta=1; N=197; usual
Fftlength=128,pass=5:Tune: sum=3485.93(ms); min=17.49(ms); max=18.23(ms); mean=17.7(ms); s_mean=17.58; sleep=15(ms); delta=1; N=197; usual
Fftlength=256,pass=3:Tune: sum=15305.9(ms); min=15.03(ms); max=39.7(ms); mean=38.95(ms); s_mean=39.06; sleep=30(ms); delta=1; N=393; high_perf
Fftlength=512,pass=3:Tune: sum=14618.6(ms); min=8.187(ms); max=18.99(ms); mean=18.62(ms); s_mean=18.64; sleep=15(ms); delta=1; N=785; high_perf
Fftlength=1024,pass=3:Tune: sum=12611.3(ms); min=7.422(ms); max=8.929(ms); mean=8.028(ms); s_mean=8.003; sleep=0(ms); delta=1; N=1571; usual
Fftlength=2048,pass=3:Tune: sum=7867.13(ms); min=2.437(ms); max=2.61(ms); mean=2.503(ms); s_mean=2.498; sleep=0(ms); delta=1; N=3143; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=35,	N=35,	<>=1,	min=1	max=1
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=24782,	N=24782,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=341,	N=341,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=12555,	N=12555,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=6,	N=6,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=4,	N=4,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=1,	N=1,	<>=1,	min=1	max=1


class PoT_transfer_not_needed:		total=24779,	N=24779,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=345,	N=345,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
00:42:20 (10116): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.