Task 8703629880

Name 06dc08af.14489.21342.16.43.154_3
Workunit 3911223764
Created 22 Apr 2020, 6:06:06 UTC
Sent 22 Apr 2020, 6:07:09 UTC
Report deadline 16 Jun 2020, 15:32:40 UTC
Received 22 Apr 2020, 19:20:31 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 7924605
Run time 17 min 26 sec
CPU time 17 min 7 sec
Validate state Valid
Credit 92.25
Device peak FLOPS 226.34 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG) windows_intelx86
Peak working set size 134.59 MB
Peak swap size 179.57 MB
Peak disk usage 0.03 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i7-6700HQ CPU @ 2.60GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.386236  NumCfft=206779  NumGauss=1224657212  NumPulse=226456205907  NumTriplet=452877248759
Currently allocated 229 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 2


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 1
  Max compute units:				 5
  Max work group size:				 1024
  Max clock frequency:				 1176Mhz
  Max memory allocation:			 1073741824
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 122880
  Global memory size:				 4294967296
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 960M
  Vendor:					 NVIDIA Corporation
  Driver version:				 441.20
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics


 OpenCL Platform Name:					 Intel(R) OpenCL HD Graphics
Number of devices:				 1
  Max compute units:				 24
  Max work group size:				 256
  Max clock frequency:				 1050Mhz
  Max memory allocation:			 858992640
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 524288
  Global memory size:				 1717985280
  Constant buffer size:				 858992640
  Max number of constant args:			 8
  Local memory type:				 Scratchpad
  Local memory size:				 65536
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 Intel(R) HD Graphics 530
  Vendor:					 Intel(R) Corporation
  Driver version:				 26.20.100.7755
  Version:					 OpenCL 2.1 NEO 
  Extensions:					 cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_depth_images cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_intel_subgroups cl_intel_required_subgroup_size cl_intel_subgroups_short cl_khr_spir cl_intel_accelerator cl_intel_media_block_io cl_intel_driver_diagnostics cl_khr_priority_hints cl_khr_throttle_hints cl_khr_create_command_queue cl_khr_fp64 cl_khr_subgroups cl_khr_il_program cl_intel_spirv_device_side_avc_motion_estimation cl_intel_spirv_media_block_io cl_intel_spirv_subgroups cl_khr_spirv_no_integer_wrap_decoration cl_khr_mipmap_image cl_khr_mipmap_image_writes cl_intel_unified_shared_memory_preview cl_intel_planar_yuv cl_intel_packed_yuv cl_intel_motion_estimation cl_intel_device_side_avc_motion_estimation cl_intel_advanced_motion_estimation cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_gl_sharing cl_khr_gl_depth_images cl_khr_gl_event cl_khr_gl_msaa_sharing cl_intel_dx9_media_sharing cl_khr_dx9_media_sharing cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_intel_d3d11_nv12_media_sharing cl_intel_unified_sharing cl_intel_simultaneous_sharing 


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.386236
Used GPU device parameters are:
	Number of compute units: 5
	Single buffer allocation size: 128MB
	Total device global memory: 4096MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Spike: peak=24.72983, time=87.24, d_freq=1419003003, chirp=4.2451, fft_len=128k
Spike: peak=24.73679, time=87.24, d_freq=1419003003.01, chirp=4.2461, fft_len=128k
Pulse: peak=9.337449, time=20.79, period=4.168, d_freq=1419001352.01, score=1.009, chirp=-16.438, fft_len=512 
D:	threshold 0.337487; unscaled peak power: 0.3401005 exceeds threshold for 0.7744%
Pulse: peak=9.783518, time=20.79, period=4.168, d_freq=1419001347.36, score=1.057, chirp=-17.579, fft_len=512 
D:	threshold 0.3159965; unscaled peak power: 0.3321846 exceeds threshold for 5.123%
Autocorr: peak=18.38684, time=33.55, delay=4.2227, d_freq=1419004521.8, chirp=18.345, fft_len=128k
Autocorr: peak=19.06272, time=33.55, delay=4.2227, d_freq=1419004522.01, chirp=18.351, fft_len=128k

Best spike: peak=24.73679, time=87.24, d_freq=1419003003.01, chirp=4.2461, fft_len=128k
Best autocorr: peak=19.06272, time=33.55, delay=4.2227, d_freq=1419004522.01, chirp=18.351, fft_len=128k
Best gaussian: peak=2.987732, mean=0.5086324, ChiSq=1.342936, time=91.44, d_freq=1419003841.44,
	score=-0.1083527, null_hyp=2.208732, chirp=8.5152, fft_len=16k
Best pulse: peak=9.783518, time=20.79, period=4.168, d_freq=1419001347.36, score=1.057, chirp=-17.579, fft_len=512 
Best triplet: peak=0, time=-2.121e+011, period=0, d_freq=0, chirp=0, fft_len=0 
Spike count:    2
Autocorr count: 2
Pulse count:    2
Triplet count:  0
Gaussian count: 0
Wallclock time elapsed since last restart: 1036.3 seconds
Fftlength=8,pass=3:Tune: sum=13823.2(ms); min=7.057(ms); max=71.91(ms); mean=52.96(ms); s_mean=56.16; sleep=45(ms); delta=158; N=261; usual
Fftlength=8,pass=4:Tune: sum=9605.32(ms); min=7.504(ms); max=67.47(ms); mean=44.06(ms); s_mean=45.73; sleep=45(ms); delta=194; N=218; usual
Fftlength=8,pass=5:Tune: sum=7572.84(ms); min=4.68(ms); max=62.45(ms); mean=38.25(ms); s_mean=51.33; sleep=45(ms); delta=211; N=198; usual
Fftlength=16,pass=3:Tune: sum=9991.02(ms); min=1.405(ms); max=68.63(ms); mean=37.99(ms); s_mean=58.17; sleep=60(ms); delta=237; N=263; usual
Fftlength=16,pass=4:Tune: sum=6866.19(ms); min=2.434(ms); max=57.05(ms); mean=28.03(ms); s_mean=48.01; sleep=45(ms); delta=270; N=245; usual
Fftlength=16,pass=5:Tune: sum=5473.03(ms); min=1.808(ms); max=48.67(ms); mean=23.8(ms); s_mean=38.39; sleep=30(ms); delta=252; N=230; usual
Fftlength=32,pass=3:Tune: sum=8092.74(ms); min=2.083(ms); max=66.44(ms); mean=28.8(ms); s_mean=44.98; sleep=45(ms); delta=295; N=281; usual
Fftlength=32,pass=4:Tune: sum=5600.29(ms); min=1.915(ms); max=47.35(ms); mean=20.74(ms); s_mean=30.71; sleep=30(ms); delta=282; N=270; usual
Fftlength=32,pass=5:Tune: sum=4705.8(ms); min=0.66(ms); max=42.92(ms); mean=18.17(ms); s_mean=25.06; sleep=15(ms); delta=270; N=259; usual
Fftlength=64,pass=3:Tune: sum=7872.82(ms); min=0.5065(ms); max=67.33(ms); mean=25.15(ms); s_mean=32.02; sleep=30(ms); delta=320; N=313; usual
Fftlength=64,pass=4:Tune: sum=5386.65(ms); min=0.4898(ms); max=48.66(ms); mean=17.72(ms); s_mean=21.42; sleep=15(ms); delta=310; N=304; usual
Fftlength=64,pass=5:Tune: sum=4537.23(ms); min=0.4161(ms); max=42.51(ms); mean=15.86(ms); s_mean=40.16; sleep=30(ms); delta=291; N=286; usual
Fftlength=128,pass=3:Tune: sum=7933.15(ms); min=0.2488(ms); max=38.47(ms); mean=24.11(ms); s_mean=36.21; sleep=30(ms); delta=332; N=329; usual
Fftlength=128,pass=4:Tune: sum=5356.69(ms); min=0.2241(ms); max=26.4(ms); mean=16.9(ms); s_mean=24.24; sleep=15(ms); delta=320; N=317; usual
Fftlength=128,pass=5:Tune: sum=4485.41(ms); min=0.213(ms); max=21.88(ms); mean=14.61(ms); s_mean=20.51; sleep=15(ms); delta=309; N=307; usual
Fftlength=256,pass=3:Tune: sum=18441.9(ms); min=18.49(ms); max=42.54(ms); mean=42.01(ms); s_mean=42.07; sleep=45(ms); delta=1; N=439; high_perf
Fftlength=512,pass=3:Tune: sum=14580.1(ms); min=7.198(ms); max=17.08(ms); mean=16.62(ms); s_mean=16.69; sleep=15(ms); delta=1; N=877; high_perf
Fftlength=1024,pass=3:Tune: sum=8322(ms); min=4.687(ms); max=4.815(ms); mean=4.747(ms); s_mean=4.737; sleep=0(ms); delta=1; N=1753; usual
Fftlength=2048,pass=3:Tune: sum=5131.7(ms); min=1.428(ms); max=1.489(ms); mean=1.464(ms); s_mean=1.468; sleep=0(ms); delta=1; N=3505; usual
Fftlength=4096,pass=3:Tune: sum=4798.54(ms); min=0.6753(ms); max=0.733(ms); mean=0.6846(ms); s_mean=0.6845; sleep=0(ms); delta=1; N=7009; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=30,	N=30,	<>=1,	min=1	max=1
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=27587,	N=27587,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=422,	N=422,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=13998,	N=13998,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=7,	N=7,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=2,	N=2,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=1,	N=1,	<>=1,	min=1	max=1


class PoT_transfer_not_needed:		total=27581,	N=27581,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=429,	N=429,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
13:09:10 (23592): called boinc_finish(0)

</stderr_txt>
]]>
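The two `D:` diagnostic lines in the stderr output above report how far each pulse's unscaled peak power exceeds its threshold. The sketch below is a sanity check on those figures, not the application's own code. It assumes the percentage is the relative excess `(peak - threshold) / threshold`, which is an inference from the logged numbers rather than something the log states. The names `excess_percent` and `check_line` are hypothetical helpers introduced for illustration.

```python
import re

# The two "D:" diagnostic lines, copied verbatim from the stderr output.
LOG_LINES = [
    "D:\tthreshold 0.337487; unscaled peak power: 0.3401005 exceeds threshold for 0.7744%",
    "D:\tthreshold 0.3159965; unscaled peak power: 0.3321846 exceeds threshold for 5.123%",
]

# Pulls out the threshold, the peak power, and the reported percentage.
PATTERN = re.compile(
    r"threshold\s+(?P<thr>[\d.]+);\s+unscaled peak power:\s+(?P<peak>[\d.]+)"
    r"\s+exceeds threshold for\s+(?P<pct>[\d.]+)%"
)


def excess_percent(threshold, peak):
    """Relative excess of peak over threshold, in percent (assumed definition)."""
    return (peak - threshold) / threshold * 100.0


def check_line(line):
    """Return (computed_percent, reported_percent) for one diagnostic line."""
    match = PATTERN.search(line)
    if match is None:
        raise ValueError(f"not a threshold diagnostic line: {line!r}")
    thr = float(match.group("thr"))
    peak = float(match.group("peak"))
    reported = float(match.group("pct"))
    return excess_percent(thr, peak), reported


if __name__ == "__main__":
    for line in LOG_LINES:
        computed, reported = check_line(line)
        print(f"computed={computed:.4g}%  reported={reported}%")
```

Under that assumption, the computed values agree with the logged percentages to the precision printed in the log, which suggests (but does not prove) that the relative-excess reading is the intended one.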



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.