Task 8704097904

Name 25mr20aa.32735.13155.9.36.208.vlar_3
Workunit 3953540370
Created 23 Apr 2020, 4:38:46 UTC
Sent 23 Apr 2020, 4:38:52 UTC
Report deadline 15 Jun 2020, 9:38:34 UTC
Received 24 Apr 2020, 4:25:58 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8775524
Run time 12 min 27 sec
CPU time 12 min 9 sec
Validate state Valid
Credit 139.19
Device peak FLOPS 841.43 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG) windows_intelx86
Peak working set size 103.09 MB
Peak swap size 127.46 MB
Peak disk usage 0.04 MB

Stderr output

<core_client_version>7.16.5</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 1
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 1
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i7-5820K CPU @ 3.30GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.011039  NumCfft=146021  NumGauss=0  NumPulse=50198134656  NumTriplet=67992016800
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 1


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 2
  Max compute units:				 16
  Max work group size:				 1024
  Max clock frequency:				 1367Mhz
  Max memory allocation:			 1073741824
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 786432
  Global memory size:				 4294967296
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 980
  Vendor:					 NVIDIA Corporation
  Driver version:				 445.87
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics
  Max compute units:				 16
  Max work group size:				 1024
  Max clock frequency:				 1367Mhz
  Max memory allocation:			 1073741824
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 786432
  Global memory size:				 4294967296
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 980
  Vendor:					 NVIDIA Corporation
  Driver version:				 445.87
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.011039
Used GPU device parameters are:
	Number of compute units: 16
	Single buffer allocation size: 128MB
	Total device global memory: 4096MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Triplet: peak=11.34827, time=39.43, period=31.77, d_freq=1419527464, chirp=-7.1334, fft_len=2k
Pulse: peak=7.988229, time=53.71, period=24.9, d_freq=1419534823.07, score=1.001, chirp=-11.6, fft_len=512 
D:	threshold 0.297184; unscaled peak power: 0.297328 exceeds threshold for 0.04846%
Pulse: peak=3.845025, time=53.71, period=10.54, d_freq=1419534517.43, score=1.003, chirp=-18.001, fft_len=512 
D:	threshold 0.1498944; unscaled peak power: 0.1502346 exceeds threshold for 0.2269%
Spike: peak=24.57102, time=6.711, d_freq=1419527753.48, chirp=-21.899, fft_len=128k
Spike: peak=25.475, time=6.711, d_freq=1419527753.47, chirp=-21.9, fft_len=128k
Spike: peak=25.48282, time=6.711, d_freq=1419527753.47, chirp=-21.901, fft_len=128k
Spike: peak=24.50884, time=6.711, d_freq=1419527753.46, chirp=-21.902, fft_len=128k
Spike: peak=24.60985, time=63.75, d_freq=1419527913.34, chirp=-22.293, fft_len=64k
Pulse: peak=8.072681, time=53.71, period=24.01, d_freq=1419534001.69, score=1.012, chirp=-28.668, fft_len=512 
D:	threshold 0.2737089; unscaled peak power: 0.27672 exceeds threshold for 1.1%
GPU device sync requested...  ...GPU device synched
Termination request detected or computations are finished. GPU device synched,  exiting...
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i7-5820K CPU @ 3.30GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.011039  NumCfft=146021  NumGauss=0  NumPulse=50198134656  NumTriplet=67992016800
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768
Restarted at 56.84 percent.
Used GPU device parameters are:
	Number of compute units: 16
	Single buffer allocation size: 128MB
	Total device global memory: 4096MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Pulse: peak=8.072681, time=53.71, period=24.01, d_freq=1419534001.69, score=1.012, chirp=-28.668, fft_len=512 
D:	threshold 0.2737089; unscaled peak power: 0.27672 exceeds threshold for 1.1%
Pulse: peak=2.026057, time=53.71, period=4.135, d_freq=1419528787.01, score=1.006, chirp=66.001, fft_len=512 
D:	threshold 0.09586708; unscaled peak power: 0.09626543 exceeds threshold for 0.4155%
Pulse: peak=6.781505, time=53.71, period=18.56, d_freq=1419531726.25, score=1.025, chirp=-76.002, fft_len=512 
D:	threshold 0.2333259; unscaled peak power: 0.2383117 exceeds threshold for 2.137%
Pulse: peak=0.7929053, time=53.74, period=1.103, d_freq=1419529619.42, score=1.001, chirp=76.135, fft_len=1024 
D:	threshold 0.1091858; unscaled peak power: 0.1092273 exceeds threshold for 0.03797%
Pulse: peak=3.76422, time=53.74, period=9.699, d_freq=1419535426.67, score=1.007, chirp=-90.869, fft_len=1024 
D:	threshold 0.3004559; unscaled peak power: 0.3020968 exceeds threshold for 0.5462%
Triplet: peak=10.13078, time=68.8, period=4.358, d_freq=1419527642.87, chirp=91.736, fft_len=64 

Best spike: peak=25.48282, time=6.713, d_freq=1419527753.47, chirp=-21.901, fft_len=128k
Best autocorr: peak=17.009, time=73.82, delay=3.2188, d_freq=1419532357.97, chirp=15.009, fft_len=128k
Best gaussian: peak=0, mean=0, ChiSq=0, time=-2.125e+011, d_freq=0,
	score=-12, null_hyp=0, chirp=0, fft_len=0 
Best pulse: peak=6.781505, time=53.71, period=18.56, d_freq=1419531726.25, score=1.025, chirp=-76.002, fft_len=512 
Best triplet: peak=11.34827, time=39.42, period=31.77, d_freq=1419527464, chirp=-7.1334, fft_len=2k
Spike count:    5
Autocorr count: 0
Pulse count:    7
Triplet count:  2
Gaussian count: 0
Wallclock time elapsed since last restart: 438.4 seconds
Fftlength=32,pass=3:Tune: sum=46453(ms); min=5.776(ms); max=71.58(ms); mean=49.47(ms); s_mean=54.19; sleep=45(ms); delta=513; N=939; usual
Fftlength=32,pass=4:Tune: sum=18113.9(ms); min=4.001(ms); max=47.25(ms); mean=24.61(ms); s_mean=37.4; sleep=30(ms); delta=863; N=736; usual
Fftlength=32,pass=5:Tune: sum=15336.1(ms); min=4.541(ms); max=43.01(ms); mean=22.26(ms); s_mean=31.69; sleep=30(ms); delta=816; N=689; usual
Fftlength=64,pass=3:Tune: sum=18801.5(ms); min=2.699(ms); max=48.45(ms); mean=22.54(ms); s_mean=42.66; sleep=45(ms); delta=929; N=834; usual
Fftlength=64,pass=4:Tune: sum=11046.8(ms); min=1.999(ms); max=29.05(ms); mean=14.59(ms); s_mean=24.12; sleep=15(ms); delta=884; N=757; usual
Fftlength=64,pass=5:Tune: sum=9070.89(ms); min=2.232(ms); max=27.14(ms); mean=12.81(ms); s_mean=20.42; sleep=15(ms); delta=835; N=708; usual
Fftlength=128,pass=3:Tune: sum=15490.6(ms); min=1.463(ms); max=39.78(ms); mean=17.23(ms); s_mean=25.32; sleep=15(ms); delta=962; N=899; usual
Fftlength=128,pass=4:Tune: sum=10835.3(ms); min=1.089(ms); max=30.06(ms); mean=12.6(ms); s_mean=17.58; sleep=15(ms); delta=923; N=860; usual
Fftlength=128,pass=5:Tune: sum=8725.49(ms); min=1.12(ms); max=26.07(ms); mean=10.63(ms); s_mean=14.26; sleep=15(ms); delta=884; N=821; usual
Fftlength=256,pass=3:Tune: sum=15171.4(ms); min=0.6977(ms); max=27.6(ms); mean=15.59(ms); s_mean=26.82; sleep=15(ms); delta=1004; N=973; usual
Fftlength=256,pass=4:Tune: sum=10696.1(ms); min=0.5545(ms); max=19.51(ms); mean=11.5(ms); s_mean=18.9; sleep=15(ms); delta=961; N=930; usual
Fftlength=256,pass=5:Tune: sum=8807.88(ms); min=0.5696(ms); max=16.6(ms); mean=9.941(ms); s_mean=15.78; sleep=15(ms); delta=917; N=886; usual
Fftlength=512,pass=3:Tune: sum=32932.7(ms); min=0.349(ms); max=32.33(ms); mean=24.71(ms); s_mean=31.09; sleep=30(ms); delta=1348; N=1333; high_perf
Fftlength=512,pass=4:Tune: sum=1149.18(ms); min=0.2828(ms); max=9.717(ms); mean=3.793(ms); s_mean=9.376; sleep=0(ms); delta=1326; N=303; usual
Fftlength=512,pass=5:Tune: sum=919.837(ms); min=0.2985(ms); max=8.021(ms); mean=3.285(ms); s_mean=7.815; sleep=0(ms); delta=1303; N=280; usual
Fftlength=1024,pass=3:Tune: sum=34277.3(ms); min=0.1813(ms); max= 16(ms); mean=14.53(ms); s_mean=15.46; sleep=15(ms); delta=2366; N=2359; high_perf
Fftlength=1024,pass=4:Tune: sum=276.236(ms); min=0.1492(ms); max=4.692(ms); mean=1.829(ms); s_mean=4.049; sleep=0(ms); delta=2355; N=151; usual
Fftlength=1024,pass=5:Tune: sum=226.343(ms); min=0.1516(ms); max=3.863(ms); mean=1.617(ms); s_mean=3.701; sleep=0(ms); delta=2344; N=140; usual
Fftlength=2048,pass=3:Tune: sum=33483.6(ms); min=3.241(ms); max=8.013(ms); mean=7.418(ms); s_mean=7.479; sleep=0(ms); delta=1; N=4514; high_perf
Fftlength=4096,pass=3:Tune: sum=32703.8(ms); min=1.578(ms); max=4.766(ms); mean=3.622(ms); s_mean=3.615; sleep=0(ms); delta=1; N=9029; high_perf
Fftlength=8192,pass=3:Tune: sum=22411.6(ms); min=1.228(ms); max=1.318(ms); mean=1.241(ms); s_mean=1.243; sleep=0(ms); delta=1; N=18060; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=47914,	N=47914,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=905,	N=905,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=36038,	N=36038,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=5,	N=5,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=5,	N=5,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=0,	N=0,	<>=0,	min=0	max=0


class PoT_transfer_not_needed:		total=47910,	N=47910,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=909,	N=909,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
00:02:58 (21192): called boinc_finish(0)

</stderr_txt>
]]>



 