Task 8622737382

Name blc44_2bit_guppi_58838_12153_TIC452808876_0052.23626.818.19.28.56.vlar_0
Workunit 3919974381
Created 8 Mar 2020, 2:27:08 UTC
Sent 8 Mar 2020, 2:27:09 UTC
Report deadline 30 Apr 2020, 7:26:51 UTC
Received 8 Mar 2020, 17:24:22 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 7462358
Run time 6 min 54 sec
CPU time 6 min 52 sec
Validate state Valid
Credit 111.87
Device peak FLOPS 1,411.03 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG)
windows_intelx86
Peak working set size 109.09 MB
Peak swap size 137.13 MB
Peak disk usage 0.03 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Xeon(R) CPU E5-2630 v3 @ 2.40GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.015107  NumCfft=105629  NumGauss=0  NumPulse=35819564928  NumTriplet=48777197728
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 1


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 2
  Max compute units:				 20
  Max work group size:				 1024
  Max clock frequency:				 1835Mhz
  Max memory allocation:			 2147483648
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 327680
  Global memory size:				 8589934592
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 1080
  Vendor:					 NVIDIA Corporation
  Driver version:				 419.35
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer
  Max compute units:				 20
  Max work group size:				 1024
  Max clock frequency:				 1835Mhz
  Max memory allocation:			 2147483648
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 327680
  Global memory size:				 8589934592
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 1080
  Vendor:					 NVIDIA Corporation
  Driver version:				 419.35
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.015107
Used GPU device parameters are:
	Number of compute units: 20
	Single buffer allocation size: 128MB
	Total device global memory: 8192MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Pulse: peak=2.750682, time=45.82, period=5.132, d_freq=10395173751.1, score=1.001, chirp=-11.213, fft_len=256 
D:	threshold 0.05927251; unscaled peak power: 0.0593299 exceeds threshold for 0.09682%
Spike: peak=24.55736, time=62.99, d_freq=10395172387.6, chirp=-15.174, fft_len=128k
Spike: peak=24.33405, time=62.99, d_freq=10395172387.6, chirp=-15.175, fft_len=128k
Pulse: peak=2.193576, time=45.82, period=2.977, d_freq=10395167947.7, score=1.051, chirp=20.184, fft_len=128 
D:	threshold 0.02435338; unscaled peak power: 0.02519067 exceeds threshold for 3.438%
Pulse: peak=2.030995, time=45.84, period=2.989, d_freq=10395167456.4, score=1.014, chirp=51.394, fft_len=512 
D:	threshold 0.09474567; unscaled peak power: 0.09561582 exceeds threshold for 0.9184%
Pulse: peak=5.869833, time=45.86, period=14, d_freq=10395169741, score=1.051, chirp=-58.684, fft_len=1024 
D:	threshold 0.3961934; unscaled peak power: 0.4133151 exceeds threshold for 4.322%
Pulse: peak=1.406121, time=45.86, period=2.016, d_freq=10395169568.8, score=1.013, chirp=64.29, fft_len=1024 
D:	threshold 0.1485669; unscaled peak power: 0.149692 exceeds threshold for 0.7573%
Pulse: peak=3.277375, time=45.86, period=7.438, d_freq=10395169679.9, score=1.004, chirp=-70.738, fft_len=1024 
D:	threshold 0.2710377; unscaled peak power: 0.2718575 exceeds threshold for 0.3025%
Triplet: peak=10.49359, time=45.9, period=19.83, d_freq=10395172386.1, chirp=83.727, fft_len=16 
Pulse: peak=9.348285, time=46.17, period=25.41, d_freq=10395177266.6, score=1.02, chirp=88.855, fft_len=8k
D:	threshold 4.947309; unscaled peak power: 5.036165 exceeds threshold for 1.796%

Best spike: peak=24.55736, time=62.99, d_freq=10395172387.6, chirp=-15.174, fft_len=128k
Best autocorr: peak=16.31594, time=28.63, delay=2.1926, d_freq=10395171497.2, chirp=-21.722, fft_len=128k
Best gaussian: peak=0, mean=0, ChiSq=0, time=-2.124e+011, d_freq=0,
	score=-12, null_hyp=0, chirp=0, fft_len=0 
Best pulse: peak=5.869833, time=45.86, period=14, d_freq=10395169741, score=1.051, chirp=-58.684, fft_len=1024 
Best triplet: peak=10.49359, time=45.9, period=19.83, d_freq=10395172386.1, chirp=83.727, fft_len=16 
Spike count:    2
Autocorr count: 0
Pulse count:    7
Triplet count:  1
Gaussian count: 0
Wallclock time elapsed since last restart: 408.2 seconds
Fftlength=32,pass=3:Tune: sum=30894.2(ms); min=4.522(ms); max=64.69(ms); mean=39.26(ms); s_mean=54.16; sleep=45(ms); delta=687; N=787; usual
Fftlength=32,pass=4:Tune: sum=21184.5(ms); min=3.15(ms); max=59.07(ms); mean=28.78(ms); s_mean=49.96; sleep=45(ms); delta=834; N=736; usual
Fftlength=32,pass=5:Tune: sum=10703.3(ms); min=3.256(ms); max=30.67(ms); mean=16.75(ms); s_mean=22.7; sleep=15(ms); delta=798; N=639; usual
Fftlength=64,pass=3:Tune: sum=16904.7(ms); min=2.239(ms); max=46.5(ms); mean=20.44(ms); s_mean=26.91; sleep=15(ms); delta=906; N=827; usual
Fftlength=64,pass=4:Tune: sum=8972.68(ms); min=1.749(ms); max=25.8(ms); mean=12.08(ms); s_mean=21.86; sleep=15(ms); delta=862; N=743; usual
Fftlength=64,pass=5:Tune: sum=5966.89(ms); min=1.655(ms); max=19.08(ms); mean=9.027(ms); s_mean=14.13; sleep=15(ms); delta=820; N=661; usual
Fftlength=128,pass=3:Tune: sum=10343.5(ms); min=1.104(ms); max= 28(ms); mean=12.11(ms); s_mean=18.15; sleep=15(ms); delta=933; N=854; usual
Fftlength=128,pass=4:Tune: sum=7171.84(ms); min=0.8406(ms); max=20.78(ms); mean=8.811(ms); s_mean=12.78; sleep=15(ms); delta=893; N=814; usual
Fftlength=128,pass=5:Tune: sum=5403.23(ms); min=0.8215(ms); max=17.01(ms); mean=6.972(ms); s_mean=9.462; sleep=0(ms); delta=854; N=775; usual
Fftlength=256,pass=3:Tune: sum=16723.4(ms); min=0.5332(ms); max=42.61(ms); mean=17.85(ms); s_mean=41.46; sleep=30(ms); delta=976; N=937; high_perf
Fftlength=256,pass=4:Tune: sum=3132.01(ms); min=0.4301(ms); max=13.35(ms); mean=5.237(ms); s_mean=13.02; sleep=15(ms); delta=932; N=598; usual
Fftlength=256,pass=5:Tune: sum=2397.22(ms); min=0.4178(ms); max=10.7(ms); mean=4.335(ms); s_mean=10.12; sleep=0(ms); delta=887; N=553; usual
Fftlength=512,pass=3:Tune: sum=21004.7(ms); min=0.2918(ms); max=21.78(ms); mean=16.51(ms); s_mean=20.85; sleep=15(ms); delta=1291; N=1272; high_perf
Fftlength=512,pass=4:Tune: sum=788.658(ms); min=0.2232(ms); max=6.642(ms); mean=2.629(ms); s_mean=6.41; sleep=0(ms); delta=1270; N=300; usual
Fftlength=512,pass=5:Tune: sum=599.17(ms); min=0.2181(ms); max=5.253(ms); mean=2.171(ms); s_mean=5.057; sleep=0(ms); delta=1246; N=276; usual
Fftlength=1024,pass=3:Tune: sum=21797.8(ms); min=0.1422(ms); max=12.72(ms); mean=9.722(ms); s_mean=10.37; sleep=0(ms); delta=2251; N=2242; high_perf
Fftlength=1024,pass=4:Tune: sum=190.649(ms); min=0.1167(ms); max=3.247(ms); mean=1.28(ms); s_mean=2.793; sleep=0(ms); delta=2240; N=149; usual
Fftlength=1024,pass=5:Tune: sum=148.012(ms); min=0.112(ms); max=2.509(ms); mean=1.073(ms); s_mean=2.397; sleep=0(ms); delta=2229; N=138; usual
Fftlength=2048,pass=3:Tune: sum=21220.6(ms); min=2.229(ms); max=5.302(ms); mean=4.957(ms); s_mean=4.977; sleep=0(ms); delta=1; N=4281; high_perf
Fftlength=4096,pass=3:Tune: sum=17832.5(ms); min=0.9494(ms); max=2.377(ms); mean=2.083(ms); s_mean=2.092; sleep=0(ms); delta=1; N=8561; high_perf
Fftlength=8192,pass=3:Tune: sum=17299.9(ms); min=0.9801(ms); max=1.192(ms); mean=1.01(ms); s_mean=1.014; sleep=0(ms); delta=1; N=17123; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=44080,	N=44080,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=2460,	N=2460,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=34168,	N=34168,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=11,	N=11,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=9,	N=9,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=0,	N=0,	<>=0,	min=0	max=0


class PoT_transfer_not_needed:		total=44071,	N=44071,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=2470,	N=2470,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
17:19:32 (3632): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.