Task 8703933258

Name 09fe09ae.18164.9070.7.34.150_3
Workunit 3950094325
Created 22 Apr 2020, 19:37:11 UTC
Sent 22 Apr 2020, 19:40:56 UTC
Report deadline 9 Jun 2020, 22:32:32 UTC
Received 22 Apr 2020, 20:50:37 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8070424
Run time 16 min 29 sec
CPU time 16 min 1 sec
Validate state Valid
Credit 74.88
Device peak FLOPS 226.35 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG) windows_intelx86
Peak working set size 115.44 MB
Peak swap size 141.36 MB
Peak disk usage 0.03 MB

Stderr output

<core_client_version>7.16.5</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i7-6500U CPU @ 2.50GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.575522  NumCfft=169903  NumGauss=821893736  NumPulse=112903847059  NumTriplet=225878939207
Currently allocated 229 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 2


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 1
  Max compute units:				 5
  Max work group size:				 1024
  Max clock frequency:				 1176Mhz
  Max memory allocation:			 536870912
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 122880
  Global memory size:				 2147483648
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 960M
  Vendor:					 NVIDIA Corporation
  Driver version:				 445.87
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics


 OpenCL Platform Name:					 Intel(R) OpenCL
Number of devices:				 1
  Max compute units:				 24
  Max work group size:				 256
  Max clock frequency:				 1050Mhz
  Max memory allocation:			 1561123226
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 524288
  Global memory size:				 1561123226
  Constant buffer size:				 1561123226
  Max number of constant args:			 8
  Local memory type:				 Scratchpad
  Local memory size:				 65536
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 Intel(R) HD Graphics 520
  Vendor:					 Intel(R) Corporation
  Driver version:				 21.20.16.4550
  Version:					 OpenCL 2.0 
  Extensions:					 cl_intel_accelerator cl_intel_advanced_motion_estimation cl_intel_d3d11_nv12_media_sharing cl_intel_driver_diagnostics cl_intel_dx9_media_sharing cl_intel_motion_estimation cl_intel_packed_yuv cl_intel_required_subgroup_size cl_intel_simultaneous_sharing cl_intel_subgroups cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_depth_images cl_khr_dx9_media_sharing cl_khr_fp16 cl_khr_fp64 cl_khr_gl_depth_images cl_khr_gl_event cl_khr_gl_msaa_sharing cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_gl_sharing cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_mipmap_image cl_khr_mipmap_image_writes cl_khr_spir cl_khr_subgroups 


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.575522
Used GPU device parameters are:
	Number of compute units: 5
	Single buffer allocation size: 128MB
	Total device global memory: 2048MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Spike: peak=24.97259, time=45.3, d_freq=1418963696.95, chirp=-45.902, fft_len=32k
Spike: peak=26.28637, time=45.3, d_freq=1418963696.95, chirp=-45.962, fft_len=32k
Spike: peak=26.4357, time=45.3, d_freq=1418963696.96, chirp=-46.021, fft_len=32k
Spike: peak=25.41793, time=45.3, d_freq=1418963696.96, chirp=-46.08, fft_len=32k
Spike: peak=24.07714, time=52.01, d_freq=1418961813.2, chirp=54.879, fft_len=32k
Triplet: peak=9.039398, time=44.22, period=1.648, d_freq=1418962625.77, chirp=-63.955, fft_len=64 

Best spike: peak=26.4357, time=45.3, d_freq=1418963696.96, chirp=-46.021, fft_len=32k
Best autocorr: peak=17.64364, time=100.7, delay=4.421, d_freq=1418963183.47, chirp=-16.493, fft_len=128k
Best gaussian: peak=5.093915, mean=0.5805346, ChiSq=1.328042, time=74.66, d_freq=1418968275.98,
	score=-0.1028748, null_hyp=2.179318, chirp=-67.283, fft_len=16k
Best pulse: peak=7.349272, time=88.66, period=1.885, d_freq=1418960808.9, score=0.9681, chirp=35.38, fft_len=64 
Best triplet: peak=9.039398, time=44.22, period=1.648, d_freq=1418962625.77, chirp=-63.955, fft_len=64 
Spike count:    5
Autocorr count: 0
Pulse count:    0
Triplet count:  1
Gaussian count: 0
Wallclock time elapsed since last restart: 981.1 seconds
Fftlength=8,pass=3:Tune: sum=6298.34(ms); min=2.42(ms); max=69.04(ms); mean=44.35(ms); s_mean=53.86; sleep=45(ms); delta=149; N=142; usual
Fftlength=8,pass=4:Tune: sum=4384.89(ms); min=7.069(ms); max=59.33(ms); mean=37.48(ms); s_mean=49.86; sleep=45(ms); delta=180; N=117; usual
Fftlength=8,pass=5:Tune: sum=3221.9(ms); min=5.845(ms); max=48.73(ms); mean=32.88(ms); s_mean=41.19; sleep=30(ms); delta=177; N=98; usual
Fftlength=16,pass=3:Tune: sum=4422.5(ms); min=2.351(ms); max=53.28(ms); mean=27.47(ms); s_mean=43.35; sleep=45(ms); delta=200; N=161; usual
Fftlength=16,pass=4:Tune: sum=3111.6(ms); min=1.138(ms); max=38.77(ms); mean=20.61(ms); s_mean=30.2; sleep=30(ms); delta=190; N=151; usual
Fftlength=16,pass=5:Tune: sum=2266.9(ms); min=1.209(ms); max=29.63(ms); mean=15.96(ms); s_mean=20.93; sleep=15(ms); delta=181; N=142; usual
Fftlength=32,pass=3:Tune: sum=3073.05(ms); min=0.5933(ms); max=37.28(ms); mean=16.17(ms); s_mean=25.23; sleep=15(ms); delta=199; N=190; usual
Fftlength=32,pass=4:Tune: sum=2152.01(ms); min=0.4916(ms); max=27.02(ms); mean=11.96(ms); s_mean=17.7; sleep=15(ms); delta=189; N=180; usual
Fftlength=32,pass=5:Tune: sum=1691.35(ms); min=0.6394(ms); max=23.06(ms); mean=9.949(ms); s_mean=13.72; sleep=15(ms); delta=179; N=170; usual
Fftlength=64,pass=3:Tune: sum=2853.55(ms); min=0.4495(ms); max=37.81(ms); mean=13.85(ms); s_mean=17.84; sleep=15(ms); delta=215; N=206; usual
Fftlength=64,pass=4:Tune: sum=2097.35(ms); min=0.5869(ms); max=29.17(ms); mean=10.54(ms); s_mean=12.94; sleep=15(ms); delta=208; N=199; usual
Fftlength=64,pass=5:Tune: sum=1407.52(ms); min=0.3589(ms); max=19.94(ms); mean=7.608(ms); s_mean=17.01; sleep=15(ms); delta=194; N=185; usual
Fftlength=128,pass=3:Tune: sum=3268.17(ms); min=21.03(ms); max=23.55(ms); mean=22.23(ms); s_mean=22.38; sleep=15(ms); delta=1; N=147; usual
Fftlength=128,pass=4:Tune: sum=2353.78(ms); min=14.95(ms); max=16.95(ms); mean=16.01(ms); s_mean=16.02; sleep=15(ms); delta=1; N=147; usual
Fftlength=128,pass=5:Tune: sum=1493.77(ms); min=9.773(ms); max=10.82(ms); mean=10.16(ms); s_mean=10.17; sleep=0(ms); delta=1; N=147; usual
Fftlength=256,pass=3:Tune: sum=6817.1(ms); min=10.23(ms); max=23.81(ms); mean=23.27(ms); s_mean=23.34; sleep=15(ms); delta=1; N=293; high_perf
Fftlength=512,pass=3:Tune: sum=5519.58(ms); min=7.498(ms); max=9.926(ms); mean=9.403(ms); s_mean=9.509; sleep=0(ms); delta=1; N=587; usual
Fftlength=1024,pass=3:Tune: sum=3395.68(ms); min=2.363(ms); max=3.054(ms); mean=2.89(ms); s_mean=2.876; sleep=0(ms); delta=1; N=1175; usual
Fftlength=2048,pass=3:Tune: sum=2373.77(ms); min=0.8383(ms); max=1.161(ms); mean=1.01(ms); s_mean=1.015; sleep=0(ms); delta=1; N=2351; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=27,	N=27,	<>=1,	min=1	max=1
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=9076,	N=9076,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=308,	N=308,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=4687,	N=4687,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=3,	N=3,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=2,	N=2,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=1,	N=1,	<>=1,	min=1	max=1


class PoT_transfer_not_needed:		total=9074,	N=9074,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=311,	N=311,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
15:23:22 (25444): called boinc_finish(0)

</stderr_txt>
]]>



 