Task 8675974896

Name blc45_2bit_guppi_58838_10648_TIC449050248_0047.24532.0.20.29.164.vlar_0
Workunit 3944223480
Created 23 Mar 2020, 7:59:11 UTC
Sent 23 Mar 2020, 11:13:31 UTC
Report deadline 26 Jun 2020, 6:18:44 UTC
Received 23 Mar 2020, 12:15:27 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 6011644
Run time 13 min 25 sec
CPU time 13 min 8 sec
Validate state Valid
Credit 136.41
Device peak FLOPS 588.93 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG) windows_intelx86
Peak working set size 122.30 MB
Peak swap size 148.79 MB
Peak disk usage 0.04 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 1
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 1
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: AMD Phenom(tm) II X6 1090T Processor 

     Cache: L1=64K L2=512K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSE4A 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.022040  NumCfft=106337  NumGauss=0  NumPulse=79344566670  NumTriplet=85061859681
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 1


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 2
  Max compute units:				 13
  Max work group size:				 1024
  Max clock frequency:				 1177Mhz
  Max memory allocation:			 1073741824
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 212992
  Global memory size:				 4294967296
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 970
  Vendor:					 NVIDIA Corporation
  Driver version:				 387.92
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer
  Max compute units:				 13
  Max work group size:				 1024
  Max clock frequency:				 1177Mhz
  Max memory allocation:			 1073741824
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 212992
  Global memory size:				 4294967296
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 970
  Vendor:					 NVIDIA Corporation
  Driver version:				 387.92
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.022040
Used GPU device parameters are:
	Number of compute units: 13
	Single buffer allocation size: 128MB
	Total device global memory: 4096MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Spike: peak=24.40605, time=40.09, d_freq=10185468631.5, chirp=-8.8291, fft_len=128k
Spike: peak=25.18459, time=40.09, d_freq=10185468631.5, chirp=-8.8355, fft_len=128k
Spike: peak=25.03636, time=40.09, d_freq=10185468631.5, chirp=-8.838, fft_len=128k
Pulse: peak=7.561471, time=42.41, period=17.06, d_freq=10185468314.1, score=1.028, chirp=-18.129, fft_len=4k
D:	threshold 2.067895; unscaled peak power: 2.119617 exceeds threshold for 2.501%
Pulse: peak=7.486593, time=49.57, period=17.06, d_freq=10185468184.3, score=1.018, chirp=-18.129, fft_len=4k
D:	threshold 2.08614; unscaled peak power: 2.119617 exceeds threshold for 1.605%
Pulse: peak=1.476735, time=49.38, period=2.054, d_freq=10185474435.9, score=1.021, chirp=35.526, fft_len=256 
D:	threshold 0.03830337; unscaled peak power: 0.03877085 exceeds threshold for 1.22%
Pulse: peak=3.78254, time=42.28, period=8.087, d_freq=10185470611.3, score=1.013, chirp=40.378, fft_len=1024 
D:	threshold 0.2916674; unscaled peak power: 0.2945557 exceeds threshold for 0.9903%
Triplet: peak=10.81966, time=30.47, period=19.98, d_freq=10185469141.3, chirp=-51.639, fft_len=512 
Triplet: peak=10.95626, time=30.47, period=19.98, d_freq=10185469141.3, chirp=-51.639, fft_len=512 
Pulse: peak=10.50377, time=42.25, period=23.61, d_freq=10185474148.8, score=1.013, chirp=-81.672, fft_len=256 
D:	threshold 0.1758599; unscaled peak power: 0.1778735 exceeds threshold for 1.145%
Pulse: peak=2.621898, time=42.32, period=4.474, d_freq=10185476236.3, score=1.023, chirp=-99.664, fft_len=2k
D:	threshold 0.4287457; unscaled peak power: 0.4359264 exceeds threshold for 1.675%
Pulse: peak=2.56703, time=49.48, period=4.474, d_freq=10185475522.9, score=1.002, chirp=-99.664, fft_len=2k
D:	threshold 0.4340684; unscaled peak power: 0.4346524 exceeds threshold for 0.1345%

Best spike: peak=25.18459, time=40.09, d_freq=10185468631.5, chirp=-8.8355, fft_len=128k
Best autocorr: peak=16.34716, time=28.63, delay=4.9318, d_freq=10185470894, chirp=10.948, fft_len=128k
Best gaussian: peak=0, mean=0, ChiSq=0, time=-2.124e+011, d_freq=0,
	score=-12, null_hyp=0, chirp=0, fft_len=0 
Best pulse: peak=7.561471, time=42.41, period=17.06, d_freq=10185468314.1, score=1.028, chirp=-18.129, fft_len=4k
Best triplet: peak=10.95626, time=30.47, period=19.98, d_freq=10185469141.3, chirp=-51.639, fft_len=512 
Spike count:    3
Autocorr count: 0
Pulse count:    7
Triplet count:  2
Gaussian count: 0
Wallclock time elapsed since last restart: 799.1 seconds
Fftlength=32,pass=3:Tune: sum=52589.9(ms); min=5.58(ms); max=75.03(ms); mean=52.85(ms); s_mean=43.59; sleep=45(ms); delta=416; N=995; usual
Fftlength=32,pass=4:Tune: sum=36449.8(ms); min=4.494(ms); max=71.84(ms); mean=45.62(ms); s_mean=54.12; sleep=45(ms); delta=522; N=799; usual
Fftlength=32,pass=5:Tune: sum=22173.2(ms); min=4.479(ms); max=67.28(ms); mean=32.32(ms); s_mean=49.78; sleep=45(ms); delta=726; N=686; usual
Fftlength=64,pass=3:Tune: sum=29736.7(ms); min=2.813(ms); max=68.07(ms); mean=35.53(ms); s_mean=48.59; sleep=45(ms); delta=721; N=837; usual
Fftlength=64,pass=4:Tune: sum=21066.6(ms); min=2.308(ms); max=60.75(ms); mean=27.11(ms); s_mean=48.4; sleep=45(ms); delta=828; N=777; usual
Fftlength=64,pass=5:Tune: sum=17212.4(ms); min=2.297(ms); max=52.73(ms); mean=23.55(ms); s_mean=38.91; sleep=30(ms); delta=782; N=731; usual
Fftlength=128,pass=3:Tune: sum=28759.3(ms); min=1.438(ms); max=73.42(ms); mean=32.5(ms); s_mean=49.19; sleep=45(ms); delta=821; N=885; usual
Fftlength=128,pass=4:Tune: sum=20589.9(ms); min=1.133(ms); max=60.9(ms); mean=24.22(ms); s_mean=34.69; sleep=30(ms); delta=875; N=850; usual
Fftlength=128,pass=5:Tune: sum=16863.9(ms); min=1.154(ms); max=54.57(ms); mean=20.79(ms); s_mean=27.82; sleep=30(ms); delta=836; N=811; usual
Fftlength=256,pass=3:Tune: sum=28760.2(ms); min=0.7055(ms); max=54.1(ms); mean=30.56(ms); s_mean=52.46; sleep=45(ms); delta=953; N=941; usual
Fftlength=256,pass=4:Tune: sum=20501.5(ms); min=0.5581(ms); max=38.54(ms); mean=22.8(ms); s_mean=37.45; sleep=30(ms); delta=911; N=899; usual
Fftlength=256,pass=5:Tune: sum=16714.2(ms); min=0.5803(ms); max=32.31(ms); mean=19.48(ms); s_mean=30.62; sleep=30(ms); delta=870; N=858; usual
Fftlength=512,pass=3:Tune: sum=23519.3(ms); min=0.3487(ms); max=23.47(ms); mean=18.32(ms); s_mean=21.19; sleep=15(ms); delta=1296; N=1284; usual
Fftlength=512,pass=4:Tune: sum=17480.8(ms); min=0.3109(ms); max=17.57(ms); mean=13.84(ms); s_mean=15.55; sleep=15(ms); delta=1275; N=1263; usual
Fftlength=512,pass=5:Tune: sum=12984.2(ms); min=0.3192(ms); max=13.01(ms); mean=10.45(ms); s_mean=11.76; sleep=0(ms); delta=1254; N=1242; usual
Fftlength=1024,pass=3:Tune: sum=59647.6(ms); min=11.93(ms); max=28.43(ms); mean=27.3(ms); s_mean=27.42; sleep=30(ms); delta=1; N=2185; high_perf
Fftlength=2048,pass=3:Tune: sum=58243.9(ms); min=5.838(ms); max=13.69(ms); mean=13.33(ms); s_mean=13.33; sleep=15(ms); delta=1; N=4369; high_perf
Fftlength=4096,pass=3:Tune: sum=60465.9(ms); min=2.901(ms); max=7.115(ms); mean=6.921(ms); s_mean=6.893; sleep=0(ms); delta=1; N=8737; high_perf
Fftlength=8192,pass=3:Tune: sum=38481.7(ms); min=2.171(ms); max=2.419(ms); mean=2.202(ms); s_mean=2.205; sleep=0(ms); delta=1; N=17475; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=46500,	N=46500,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=748,	N=748,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=34872,	N=34872,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=13,	N=13,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=8,	N=8,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=0,	N=0,	<>=0,	min=0	max=0


class PoT_transfer_not_needed:		total=46490,	N=46490,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=759,	N=759,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
07:05:30 (6084): called boinc_finish(0)

</stderr_txt>
]]>
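
The "D:" diagnostic lines in the stderr output above report by what percentage the unscaled peak power exceeds the threshold. The figures are consistent with (power / threshold - 1) * 100. The following is a minimal sketch for re-checking that arithmetic, assuming the line layout seen in this log. The names `D_LINE`, `excess_percent` and `check_d_line` are hypothetical helpers, not part of the SETI@home application.

```python
import re

# Assumed layout of a "D:" line, inferred from the samples in this log only.
D_LINE = re.compile(
    r"D:\s+threshold (?P<threshold>[\d.]+); "
    r"unscaled peak power: (?P<power>[\d.]+) "
    r"exceeds threshold for (?P<percent>[\d.]+)%"
)


def excess_percent(threshold, power):
    """Percentage by which power exceeds threshold."""
    return (power / threshold - 1.0) * 100.0


def check_d_line(line, tolerance=0.01):
    """Return (computed percent, True if it agrees with the logged percent)."""
    match = D_LINE.search(line)
    if match is None:
        raise ValueError("not a D: diagnostic line")
    threshold = float(match["threshold"])
    power = float(match["power"])
    reported = float(match["percent"])
    computed = excess_percent(threshold, power)
    return computed, abs(computed - reported) <= tolerance


if __name__ == "__main__":
    sample = ("D:\tthreshold 2.067895; unscaled peak power: 2.119617 "
              "exceeds threshold for 2.501%")
    print(check_d_line(sample))
```

The tolerance of 0.01 percentage points is an arbitrary choice that allows for the rounding of the logged value.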

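In the "Tune:" lines of the stderr output, the logged `mean` appears to equal `sum` divided by `N` (for example 52589.9 ms over 995 samples gives about 52.85 ms). The sketch below parses the `key=value` pairs of one such line and re-checks that relation. It assumes the field layout seen in this log, and `parse_tune_line` and `mean_matches` are hypothetical helper names.

```python
import re


def parse_tune_line(line):
    """Collect every numeric key=value pair on a Tune line into a dict."""
    # Units such as "(ms)" follow the number and are ignored by the pattern.
    return {key: float(value)
            for key, value in re.findall(r"(\w+)=([\d.]+)", line)}


def mean_matches(line, rel_tol=0.01):
    """Return (sum / N, True if it agrees with the logged mean within rel_tol)."""
    fields = parse_tune_line(line)
    computed = fields["sum"] / fields["N"]
    return computed, abs(computed - fields["mean"]) <= rel_tol * fields["mean"]


if __name__ == "__main__":
    sample = ("Fftlength=32,pass=3:Tune: sum=52589.9(ms); min=5.58(ms); "
              "max=75.03(ms); mean=52.85(ms); s_mean=43.59; sleep=45(ms); "
              "delta=416; N=995; usual")
    print(mean_matches(sample))
```

Because `\w+` also matches underscores, `s_mean` is stored under its own key and does not overwrite `mean`.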
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.