Task 8703771514

Name blc43_2bit_guppi_58838_26948_TIC67772767_0103.9569.818.20.29.77.vlar_3
Workunit 3929375578
Created 22 Apr 2020, 13:07:10 UTC
Sent 22 Apr 2020, 13:07:28 UTC
Report deadline 14 Jun 2020, 18:07:10 UTC
Received 22 Apr 2020, 21:08:05 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 7845794
Run time 11 min 7 sec
CPU time 10 min 50 sec
Validate state Valid
Credit 113.96
Device peak FLOPS 589.08 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG)
windows_intelx86
Peak working set size 99.87 MB
Peak swap size 128.16 MB
Peak disk usage 0.03 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i7-6700K CPU @ 4.00GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.010912  NumCfft=105187  NumGauss=0  NumPulse=35356278656  NumTriplet=48313911456
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 1


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 1
  Max compute units:				 13
  Max work group size:				 1024
  Max clock frequency:				 1177Mhz
  Max memory allocation:			 1073741824
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 638976
  Global memory size:				 4294967296
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 970
  Vendor:					 NVIDIA Corporation
  Driver version:				 445.87
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.010912
Used GPU device parameters are:
	Number of compute units: 13
	Single buffer allocation size: 128MB
	Total device global memory: 4096MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Spike: peak=24.59196, time=51.54, d_freq=10533108599.5, chirp=-8.0637, fft_len=128k
Spike: peak=24.77676, time=51.54, d_freq=10533108599.5, chirp=-8.0688, fft_len=128k
Spike: peak=24.10628, time=51.54, d_freq=10533108599.5, chirp=-8.0739, fft_len=128k
Pulse: peak=5.920416, time=45.82, period=12.25, d_freq=10533109342.7, score=1.017, chirp=-15.15, fft_len=256 
D:	threshold 0.1064972; unscaled peak power: 0.1080157 exceeds threshold for 1.426%
Spike: peak=24.0934, time=40.09, d_freq=10533105255.5, chirp=16.939, fft_len=128k
Spike: peak=24.4145, time=40.09, d_freq=10533105255.5, chirp=16.941, fft_len=128k
GPU device sync requested...  ...GPU device synched
Termination request detected or computations are finished. GPU device synched,  exiting...
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i7-6700K CPU @ 4.00GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.010912  NumCfft=105187  NumGauss=0  NumPulse=35356278656  NumTriplet=48313911456
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768
Restarted at 83.77 percent.
Used GPU device parameters are:
	Number of compute units: 13
	Single buffer allocation size: 128MB
	Total device global memory: 4096MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Pulse: peak=10.60567, time=45.84, period=27.96, d_freq=10533105429.2, score=1.041, chirp=-81.998, fft_len=512 
D:	threshold 0.3417327; unscaled peak power: 0.354477 exceeds threshold for 3.729%
Triplet: peak=11.92861, time=44.22, period=7.326, d_freq=10533103741.7, chirp=87.111, fft_len=256 

Best spike: peak=24.77676, time=51.54, d_freq=10533108599.5, chirp=-8.0688, fft_len=128k
Best autocorr: peak=17.13081, time=62.99, delay=4.3869, d_freq=10533108640.9, chirp=14.031, fft_len=128k
Best gaussian: peak=0, mean=0, ChiSq=0, time=-2.124e+011, d_freq=0,
	score=-12, null_hyp=0, chirp=0, fft_len=0 
Best pulse: peak=10.60567, time=45.84, period=27.96, d_freq=10533105429.2, score=1.041, chirp=-81.998, fft_len=512 
Best triplet: peak=11.92861, time=44.22, period=7.326, d_freq=10533103741.7, chirp=87.111, fft_len=256 
Spike count:    5
Autocorr count: 0
Pulse count:    2
Triplet count:  1
Gaussian count: 0
Wallclock time elapsed since last restart: 186.2 seconds
Fftlength=32,pass=3:Tune: sum=23182.8(ms); min=12.78(ms); max=68.94(ms); mean=47.9(ms); s_mean=54.13; sleep=45(ms); delta=367; N=484; usual
Fftlength=32,pass=4:Tune: sum=8990.97(ms); min=4.809(ms); max=35.97(ms); mean=21.31(ms); s_mean=29.6; sleep=30(ms); delta=525; N=422; usual
Fftlength=32,pass=5:Tune: sum=5820.59(ms); min=4.955(ms); max=28.11(ms); mean=16.4(ms); s_mean=22.9; sleep=15(ms); delta=510; N=355; usual
Fftlength=64,pass=3:Tune: sum=8879.76(ms); min=3.187(ms); max= 37(ms); mean=18.93(ms); s_mean=22.94; sleep=15(ms); delta=546; N=469; usual
Fftlength=64,pass=4:Tune: sum=5167.47(ms); min=2.479(ms); max=21.8(ms); mean=12.3(ms); s_mean=18.48; sleep=15(ms); delta=523; N=420; usual
Fftlength=64,pass=5:Tune: sum=4276.59(ms); min=2.465(ms); max=19.14(ms); mean=10.94(ms); s_mean=14.79; sleep=15(ms); delta=494; N=391; usual
Fftlength=128,pass=3:Tune: sum=7095.13(ms); min=1.544(ms); max=30.56(ms); mean=14.25(ms); s_mean=21.96; sleep=15(ms); delta=549; N=498; usual
Fftlength=128,pass=4:Tune: sum=4948.89(ms); min=1.28(ms); max=22.75(ms); mean=10.46(ms); s_mean=14.98; sleep=15(ms); delta=524; N=473; usual
Fftlength=128,pass=5:Tune: sum=3982.31(ms); min=1.221(ms); max=18.72(ms); mean=8.909(ms); s_mean=11.78; sleep=0(ms); delta=498; N=447; usual
Fftlength=256,pass=3:Tune: sum=6876.35(ms); min=0.826(ms); max=30.93(ms); mean=12.17(ms); s_mean=15.52; sleep=15(ms); delta=590; N=565; usual
Fftlength=256,pass=4:Tune: sum=4818.56(ms); min=0.6406(ms); max=22.86(ms); mean=8.825(ms); s_mean=10.69; sleep=0(ms); delta=571; N=546; usual
Fftlength=256,pass=5:Tune: sum=3919.96(ms); min=0.6241(ms); max=20.84(ms); mean=7.481(ms); s_mean=15.41; sleep=15(ms); delta=549; N=524; usual
Fftlength=512,pass=3:Tune: sum=13064.7(ms); min=0.3853(ms); max=42.18(ms); mean=21.77(ms); s_mean=40.13; sleep=30(ms); delta=612; N=600; high_perf
Fftlength=512,pass=4:Tune: sum=1472.73(ms); min=0.3078(ms); max=12.39(ms); mean=4.797(ms); s_mean=11.84; sleep=0(ms); delta=591; N=307; usual
Fftlength=512,pass=5:Tune: sum=1201.04(ms); min=0.3283(ms); max=10.28(ms); mean=4.229(ms); s_mean=9.936; sleep=0(ms); delta=568; N=284; usual
Fftlength=1024,pass=3:Tune: sum=11536.3(ms); min=0.2017(ms); max=16.19(ms); mean=13.11(ms); s_mean=15.59; sleep=15(ms); delta=892; N=880; high_perf
Fftlength=1024,pass=4:Tune: sum=277.793(ms); min=0.1627(ms); max=4.762(ms); mean=1.89(ms); s_mean=4.052; sleep=0(ms); delta=882; N=147; usual
Fftlength=1024,pass=5:Tune: sum=216.206(ms); min=0.17(ms); max=3.715(ms); mean=1.602(ms); s_mean=3.536; sleep=0(ms); delta=870; N=135; usual
Fftlength=2048,pass=3:Tune: sum=13052.7(ms); min=3.798(ms); max=8.728(ms); mean=8.34(ms); s_mean=8.389; sleep=0(ms); delta=1; N=1565; high_perf
Fftlength=4096,pass=3:Tune: sum=13377.5(ms); min=1.913(ms); max=4.93(ms); mean=4.275(ms); s_mean=4.262; sleep=0(ms); delta=1; N=3129; high_perf
Fftlength=8192,pass=3:Tune: sum=10455(ms); min=1.651(ms); max=1.807(ms); mean=1.67(ms); s_mean=1.671; sleep=0(ms); delta=1; N=6259; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=16768,	N=16768,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=307,	N=307,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=12496,	N=12496,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=1,	N=1,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=1,	N=1,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=0,	N=0,	<>=0,	min=0	max=0


class PoT_transfer_not_needed:		total=16767,	N=16767,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=308,	N=308,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
15:51:07 (29196): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.