Task 8704187625

Name blc43_2bit_guppi_58838_25622_TIC459942762_0099.30364.0.19.28.212.vlar_3
Workunit 3927525983
Created 23 Apr 2020, 8:39:44 UTC
Sent 23 Apr 2020, 8:41:30 UTC
Report deadline 29 Jul 2020, 19:55:24 UTC
Received 23 Apr 2020, 13:53:39 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8906239
Run time 21 min 54 sec
CPU time 21 min 41 sec
Validate state Valid
Credit 155.33
Device peak FLOPS 413.23 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG)
windows_intelx86
Peak working set size 62.03 MB
Peak swap size 87.12 MB
Peak disk usage 0.04 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 1
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 1
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: AMD Ryzen 3 1200 Quad-Core Processor            

     Cache: L1=64K L2=512K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX SSE4A 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.021105  NumCfft=105061  NumGauss=0  NumPulse=75948213090  NumTriplet=81620232009
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 1


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 3
  Max compute units:				 8
  Max work group size:				 1024
  Max clock frequency:				 1342Mhz
  Max memory allocation:			 1073741824
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 393216
  Global memory size:				 4294967296
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 960
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.19
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics
  Max compute units:				 6
  Max work group size:				 1024
  Max clock frequency:				 1124Mhz
  Max memory allocation:			 536870912
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 98304
  Global memory size:				 2147483648
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 760
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.19
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics
  Max compute units:				 6
  Max work group size:				 1024
  Max clock frequency:				 1137Mhz
  Max memory allocation:			 536870912
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 98304
  Global memory size:				 2147483648
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 760
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.19
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.021105
Used GPU device parameters are:
	Number of compute units: 6
	Single buffer allocation size: 128MB
	Total device global memory: 2048MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Triplet: peak=14.24088, time=67.55, period=4.32, d_freq=10569809675.2, chirp=0, fft_len=32 
Triplet: peak=14.23026, time=67.55, period=4.32, d_freq=10569809675.2, chirp=0, fft_len=32 
Pulse: peak=9.533949, time=42.77, period=22.91, d_freq=10569813858.1, score=1.016, chirp=-0.71207, fft_len=4k
D:	threshold 2.593173; unscaled peak power: 2.631541 exceeds threshold for 1.48%
Pulse: peak=3.314929, time=49.12, period=6.241, d_freq=10569814615, score=1.046, chirp=4.8931, fft_len=2k
D:	threshold 0.5193245; unscaled peak power: 0.5373646 exceeds threshold for 3.474%
Pulse: peak=10.19951, time=49.15, period=25.77, d_freq=10569806084.7, score=1.004, chirp=-8.9319, fft_len=512 
D:	threshold 0.3435867; unscaled peak power: 0.3448176 exceeds threshold for 0.3583%
Pulse: peak=1.417421, time=42.5, period=1.963, d_freq=10569809783.5, score=1.046, chirp=26.081, fft_len=2k
D:	threshold 0.2964148; unscaled peak power: 0.3042776 exceeds threshold for 2.653%
Pulse: peak=3.22609, time=49.12, period=5.548, d_freq=10569805903.9, score=1.021, chirp=29.36, fft_len=2k
D:	threshold 0.5494468; unscaled peak power: 0.55814 exceeds threshold for 1.582%
Pulse: peak=2.591456, time=42.5, period=4.578, d_freq=10569813133, score=1.011, chirp=-37.104, fft_len=2k
D:	threshold 0.4446621; unscaled peak power: 0.4483094 exceeds threshold for 0.8202%
Pulse: peak=5.63629, time=42.55, period=11.25, d_freq=10569803312, score=1.017, chirp=54.538, fft_len=1024 
D:	threshold 0.4084766; unscaled peak power: 0.4142928 exceeds threshold for 1.424%
Pulse: peak=3.960705, time=49.15, period=7.142, d_freq=10569804564.6, score=1.041, chirp=-64.42, fft_len=512 
D:	threshold 0.1463816; unscaled peak power: 0.1511127 exceeds threshold for 3.232%
Spike: peak=24.35386, time=18.61, d_freq=10569814092.1, chirp=-93.074, fft_len=32k
Spike: peak=24.00553, time=18.61, d_freq=10569814092.1, chirp=-93.094, fft_len=32k
Pulse: peak=7.684823, time=49.12, period=19.45, d_freq=10569812590.5, score=1.014, chirp=-95.016, fft_len=2k
D:	threshold 1.087569; unscaled peak power: 1.100844 exceeds threshold for 1.221%

Best spike: peak=24.35386, time=18.61, d_freq=10569814092.1, chirp=-93.074, fft_len=32k
Best autocorr: peak=17.79724, time=51.54, delay=4.0662, d_freq=10569808457, chirp=-9.7595, fft_len=128k
Best gaussian: peak=0, mean=0, ChiSq=0, time=-2.124e+011, d_freq=0,
	score=-12, null_hyp=0, chirp=0, fft_len=0 
Best pulse: peak=1.417421, time=42.5, period=1.963, d_freq=10569809783.5, score=1.046, chirp=26.081, fft_len=2k
Best triplet: peak=14.24088, time=67.55, period=4.32, d_freq=10569809675.2, chirp=0, fft_len=32 
Spike count:    2
Autocorr count: 0
Pulse count:    9
Triplet count:  2
Gaussian count: 0
Wallclock time elapsed since last restart: 1309.3 seconds
Fftlength=32,pass=3:Tune: sum=60114(ms); min=6.106(ms); max=73.09(ms); mean=55.97(ms); s_mean=44.87; sleep=45(ms); delta=337; N=1074; usual
Fftlength=32,pass=4:Tune: sum=46333.8(ms); min=5.646(ms); max=68.5(ms); mean=51.65(ms); s_mean=56.42; sleep=45(ms); delta=384; N=897; usual
Fftlength=32,pass=5:Tune: sum=38661(ms); min=5.244(ms); max=69.91(ms); mean=49.44(ms); s_mean=56.61; sleep=45(ms); delta=406; N=782; usual
Fftlength=64,pass=3:Tune: sum=59807.6(ms); min=3.191(ms); max=74.31(ms); mean=53.12(ms); s_mean=46.13; sleep=45(ms); delta=359; N=1126; usual
Fftlength=64,pass=4:Tune: sum=45824.2(ms); min=2.925(ms); max=72.31(ms); mean=48.54(ms); s_mean=56.31; sleep=45(ms); delta=385; N=944; usual
Fftlength=64,pass=5:Tune: sum=36215.1(ms); min=2.742(ms); max=74.58(ms); mean=44.16(ms); s_mean=50.64; sleep=45(ms); delta=503; N=820; usual
Fftlength=128,pass=3:Tune: sum=57831.2(ms); min=1.581(ms); max=81.86(ms); mean=49.73(ms); s_mean=46.26; sleep=45(ms); delta=422; N=1163; usual
Fftlength=128,pass=4:Tune: sum=42998.5(ms); min=1.493(ms); max=96.62(ms); mean=44.33(ms); s_mean=62.83; sleep=60(ms); delta=570; N=970; usual
Fftlength=128,pass=5:Tune: sum=35268(ms); min=1.413(ms); max=76.33(ms); mean=42.29(ms); s_mean=59.05; sleep=60(ms); delta=508; N=834; usual
Fftlength=256,pass=3:Tune: sum=56985.9(ms); min=0.8147(ms); max=109.5(ms); mean=47.97(ms); s_mean=48.09; sleep=45(ms); delta=632; N=1188; usual
Fftlength=256,pass=4:Tune: sum=43769.8(ms); min=0.7749(ms); max=82.58(ms); mean=43.64(ms); s_mean=72.99; sleep=75(ms); delta=570; N=1003; usual
Fftlength=256,pass=5:Tune: sum=35676.3(ms); min=0.7256(ms); max=67.76(ms); mean=40.18(ms); s_mean=50.41; sleep=45(ms); delta=504; N=888; usual
Fftlength=512,pass=3:Tune: sum=56946.5(ms); min=0.429(ms); max=54.6(ms); mean=45.56(ms); s_mean=54.06; sleep=45(ms); delta=336; N=1250; usual
Fftlength=512,pass=4:Tune: sum=44759.6(ms); min=0.4214(ms); max=42.63(ms); mean=36.42(ms); s_mean=42.54; sleep=45(ms); delta=1234; N=1229; usual
Fftlength=512,pass=5:Tune: sum=37421.4(ms); min=0.3795(ms); max=35.66(ms); mean=30.93(ms); s_mean=35.59; sleep=30(ms); delta=1215; N=1210; usual
Fftlength=1024,pass=3:Tune: sum=80552.7(ms); min=16.13(ms); max=41.52(ms); mean=38.27(ms); s_mean=38.26; sleep=30(ms); delta=1; N=2105; high_perf
Fftlength=2048,pass=3:Tune: sum=86060.9(ms); min=8.617(ms); max=21.01(ms); mean=20.45(ms); s_mean=20.43; sleep=15(ms); delta=1; N=4209; high_perf
Fftlength=4096,pass=3:Tune: sum=83401.7(ms); min=2.059(ms); max=10.28(ms); mean=9.906(ms); s_mean=9.876; sleep=0(ms); delta=1; N=8419; high_perf
Fftlength=8192,pass=3:Tune: sum=105877(ms); min=6.268(ms); max=6.973(ms); mean=6.288(ms); s_mean=6.285; sleep=0(ms); delta=1; N=16839; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=45211,	N=45211,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=761,	N=761,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=33598,	N=33598,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=13,	N=13,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=4,	N=4,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=0,	N=0,	<>=0,	min=0	max=0


class PoT_transfer_not_needed:		total=45199,	N=45199,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=774,	N=774,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
09:15:05 (840): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.