Task 8704083510

Name 23mr20ac.23131.23993.16.43.38_3
Workunit 3953042846
Created 23 Apr 2020, 3:38:39 UTC
Sent 23 Apr 2020, 3:39:51 UTC
Report deadline 15 Jun 2020, 7:05:45 UTC
Received 23 Apr 2020, 23:38:22 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8070424
Run time 18 min 17 sec
CPU time 17 min 51 sec
Validate state Valid
Credit 90.24
Device peak FLOPS 226.35 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG)
windows_intelx86
Peak working set size 116.38 MB
Peak swap size 154.41 MB
Peak disk usage 0.03 MB

Stderr output

<core_client_version>7.16.5</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i7-6500U CPU @ 2.50GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.429977  NumCfft=195277  NumGauss=1099011578  NumPulse=226468983638  NumTriplet=452869038922
Currently allocated 229 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 2


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 1
  Max compute units:				 5
  Max work group size:				 1024
  Max clock frequency:				 1176Mhz
  Max memory allocation:			 536870912
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 122880
  Global memory size:				 2147483648
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 960M
  Vendor:					 NVIDIA Corporation
  Driver version:				 445.87
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics


 OpenCL Platform Name:					 Intel(R) OpenCL
Number of devices:				 1
  Max compute units:				 24
  Max work group size:				 256
  Max clock frequency:				 1050Mhz
  Max memory allocation:			 1561123226
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 524288
  Global memory size:				 1561123226
  Constant buffer size:				 1561123226
  Max number of constant args:			 8
  Local memory type:				 Scratchpad
  Local memory size:				 65536
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 Intel(R) HD Graphics 520
  Vendor:					 Intel(R) Corporation
  Driver version:				 21.20.16.4550
  Version:					 OpenCL 2.0 
  Extensions:					 cl_intel_accelerator cl_intel_advanced_motion_estimation cl_intel_d3d11_nv12_media_sharing cl_intel_driver_diagnostics cl_intel_dx9_media_sharing cl_intel_motion_estimation cl_intel_packed_yuv cl_intel_required_subgroup_size cl_intel_simultaneous_sharing cl_intel_subgroups cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_depth_images cl_khr_dx9_media_sharing cl_khr_fp16 cl_khr_fp64 cl_khr_gl_depth_images cl_khr_gl_event cl_khr_gl_msaa_sharing cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_gl_sharing cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_mipmap_image cl_khr_mipmap_image_writes cl_khr_spir cl_khr_subgroups 


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.429977
Used GPU device parameters are:
	Number of compute units: 5
	Single buffer allocation size: 128MB
	Total device global memory: 2048MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Triplet: peak=9.895246, time=71.43, period=1.468, d_freq=1420373730.46, chirp=-33.582, fft_len=128 
Gaussian: peak=4.261994, mean=0.5530661, ChiSq=1.37758, time=83.05, d_freq=1420367571.45,
	score=0.6309061, null_hyp=2.263563, chirp=-34.576, fft_len=16k
Gaussian: peak=4.00004, mean=0.5539509, ChiSq=1.412468, time=84.72, d_freq=1420367513.45,
	score=0.8150639, null_hyp=2.294721, chirp=-34.576, fft_len=16k
Triplet: peak=9.607249, time=71.43, period=1.468, d_freq=1420373734.07, chirp=-34.6, fft_len=128 
Triplet: peak=10.593, time=25.3, period=4.941, d_freq=1420374044.84, chirp=86.499, fft_len=256 

Best spike: peak=23.45389, time=73.82, d_freq=1420374864.33, chirp=-13.778, fft_len=128k
Best autocorr: peak=17.17419, time=73.82, delay=5.3722, d_freq=1420371104.05, chirp=0.13956, fft_len=128k
Best gaussian: peak=4.00004, mean=0.5539509, ChiSq=1.412468, time=84.72, d_freq=1420367513.45,
	score=0.8150639, null_hyp=2.294721, chirp=-34.576, fft_len=16k
Best pulse: peak=3.058585, time=93.59, period=0.7815, d_freq=1420370155.29, score=0.9682, chirp=31.547, fft_len=128 
Best triplet: peak=10.593, time=25.3, period=4.941, d_freq=1420374044.84, chirp=86.499, fft_len=256 
Spike count:    0
Autocorr count: 0
Pulse count:    0
Triplet count:  3
Gaussian count: 2
Wallclock time elapsed since last restart: 1088.5 seconds
Fftlength=8,pass=3:Tune: sum=12953(ms); min=4.347(ms); max=72.54(ms); mean=52.65(ms); s_mean=44.62; sleep=45(ms); delta=147; N=246; usual
Fftlength=8,pass=4:Tune: sum=8932.07(ms); min=3.352(ms); max=64.23(ms); mean=45.81(ms); s_mean=47.65; sleep=45(ms); delta=184; N=195; usual
Fftlength=8,pass=5:Tune: sum=6594.79(ms); min=3.876(ms); max=62.21(ms); mean=40.96(ms); s_mean=45.93; sleep=45(ms); delta=218; N=161; usual
Fftlength=16,pass=3:Tune: sum=7830.56(ms); min=2.061(ms); max=68.71(ms); mean=34.5(ms); s_mean=43.91; sleep=45(ms); delta=247; N=227; usual
Fftlength=16,pass=4:Tune: sum=5488.14(ms); min=2.05(ms); max=51.67(ms); mean=26.01(ms); s_mean=40.86; sleep=30(ms); delta=250; N=211; usual
Fftlength=16,pass=5:Tune: sum=4129.44(ms); min=2.23(ms); max=39.59(ms); mean=20.96(ms); s_mean=29.3; sleep=30(ms); delta=236; N=197; usual
Fftlength=32,pass=3:Tune: sum=5305.07(ms); min=1.091(ms); max=48.99(ms); mean=21.57(ms); s_mean= 33; sleep=30(ms); delta=265; N=246; usual
Fftlength=32,pass=4:Tune: sum=3703.53(ms); min=0.6153(ms); max=35.46(ms); mean=15.83(ms); s_mean=22.72; sleep=15(ms); delta=253; N=234; usual
Fftlength=32,pass=5:Tune: sum=2874.26(ms); min=1.048(ms); max=28.56(ms); mean=12.95(ms); s_mean=16.93; sleep=15(ms); delta=241; N=222; usual
Fftlength=64,pass=3:Tune: sum=5058.29(ms); min=0.6117(ms); max=49.57(ms); mean=18.07(ms); s_mean=23.4; sleep=15(ms); delta=289; N=280; usual
Fftlength=64,pass=4:Tune: sum=3519.21(ms); min=0.6154(ms); max=37.13(ms); mean=13.13(ms); s_mean=16.47; sleep=15(ms); delta=277; N=268; usual
Fftlength=64,pass=5:Tune: sum=2700.98(ms); min=0.4553(ms); max=28.65(ms); mean=10.8(ms); s_mean=25.43; sleep=15(ms); delta=259; N=250; usual
Fftlength=128,pass=3:Tune: sum=5308.05(ms); min=25.47(ms); max=28.88(ms); mean=26.94(ms); s_mean=27.38; sleep=30(ms); delta=1; N=197; usual
Fftlength=128,pass=4:Tune: sum=4056.21(ms); min=18.5(ms); max=21.83(ms); mean=20.69(ms); s_mean=20.58; sleep=15(ms); delta=1; N=196; usual
Fftlength=128,pass=5:Tune: sum=2692.39(ms); min=13.31(ms); max=14.29(ms); mean=13.74(ms); s_mean=13.74; sleep=15(ms); delta=1; N=196; usual
Fftlength=256,pass=3:Tune: sum=12332.7(ms); min=13.83(ms); max=32.05(ms); mean=31.38(ms); s_mean=31.57; sleep=30(ms); delta=1; N=393; high_perf
Fftlength=512,pass=3:Tune: sum=12450(ms); min=6.461(ms); max=16.19(ms); mean=15.82(ms); s_mean=15.78; sleep=15(ms); delta=1; N=787; high_perf
Fftlength=1024,pass=3:Tune: sum=7453.78(ms); min=3.568(ms); max=5.201(ms); mean=4.739(ms); s_mean=4.708; sleep=0(ms); delta=1; N=1573; usual
Fftlength=2048,pass=3:Tune: sum=5101.39(ms); min=1.284(ms); max=1.808(ms); mean=1.622(ms); s_mean=1.634; sleep=0(ms); delta=1; N=3145; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=25,	N=25,	<>=1,	min=1	max=1
class Gaussian_report:		total=2,	N=2,	<>=1,	min=1	max=1
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=24716,	N=24716,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=419,	N=419,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=12559,	N=12559,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=10,	N=10,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=4,	N=4,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=1,	N=1,	<>=1,	min=1	max=1


class PoT_transfer_not_needed:		total=24708,	N=24708,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=428,	N=428,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
18:16:06 (18316): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.