Task 8593244057

Name 26fe20ab.27798.3748.15.42.98_1
Workunit 3906692641
Created 27 Feb 2020, 23:35:19 UTC
Sent 27 Feb 2020, 23:35:29 UTC
Report deadline 20 Apr 2020, 19:21:58 UTC
Received 28 Feb 2020, 13:38:58 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8146712
Run time 5 min 5 sec
CPU time 4 min 55 sec
Validate state Valid
Credit 103.77
Device peak FLOPS 970.86 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG)
windows_intelx86
Peak working set size 119.54 MB
Peak swap size 143.26 MB
Peak disk usage 0.03 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Maximum single buffer size set to:1024MB
High-performance path selected. If GUI lags occur consider to remove -high_perf option from tuning line
Target kernel sequence time set to 500ms
SpikeFind FFT size threshold override set to:4096
TUNE: kernel 1 now has workgroup size of (64,1,4)
oclFFT global radix override set to:256
oclFFT local radix override set to:16
oclFFT max WG size override set to:256
oclFFT max local FFT size override set to:512
oclFFT number of local memory banks set to:64
oclFFT minimal memory coalesce width set to:64
Running on device number: 1
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 1
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: AMD FX(tm)-6300 Six-Core Processor              

     Cache: L1=64K L2=2048K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX SSE4A XOP FMA4 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.436995  NumCfft=193619  NumGauss=1080919604  NumPulse=226459474911  NumTriplet=452849798865
Currently allocated 1125 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 1


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 3
  Max compute units:				 15
  Max work group size:				 1024
  Max clock frequency:				 1683Mhz
  Max memory allocation:			 2147483648
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 245760
  Global memory size:				 8589934592
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 1070
  Vendor:					 NVIDIA Corporation
  Driver version:				 432.00
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer
  Max compute units:				 15
  Max work group size:				 1024
  Max clock frequency:				 1683Mhz
  Max memory allocation:			 2147483648
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 245760
  Global memory size:				 8589934592
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 1070
  Vendor:					 NVIDIA Corporation
  Driver version:				 432.00
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer
  Max compute units:				 15
  Max work group size:				 1024
  Max clock frequency:				 1683Mhz
  Max memory allocation:			 2147483648
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 245760
  Global memory size:				 8589934592
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 1070
  Vendor:					 NVIDIA Corporation
  Driver version:				 432.00
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.436995
Used GPU device parameters are:
	Number of compute units: 15
	Single buffer allocation size: 1024MB
	Total device global memory: 8192MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: yes
period_iterations_num=50
Spike: peak=24.36699, time=33.55, d_freq=1420954613.08, chirp=-2.9955, fft_len=128k
Triplet: peak=9.888998, time=3.742, period=0.6717, d_freq=1420961609.59, chirp=81.739, fft_len=64 
Triplet: peak=9.926474, time=3.742, period=0.6717, d_freq=1420961613.46, chirp=82.774, fft_len=64 
Triplet: peak=9.929989, time=3.742, period=0.6717, d_freq=1420961617.33, chirp=83.808, fft_len=64 
Triplet: peak=9.899129, time=3.742, period=0.6717, d_freq=1420961621.2, chirp=84.843, fft_len=64 
Triplet: peak=9.834086, time=3.742, period=0.6717, d_freq=1420961625.07, chirp=85.878, fft_len=64 
Triplet: peak=9.735485, time=3.742, period=0.6717, d_freq=1420961628.95, chirp=86.913, fft_len=64 
Triplet: peak=9.60479, time=3.742, period=0.6717, d_freq=1420961632.82, chirp=87.947, fft_len=64 

Best spike: peak=24.36699, time=33.55, d_freq=1420954613.08, chirp=-2.9955, fft_len=128k
Best autocorr: peak=17.47644, time=33.55, delay=5.1493, d_freq=1420957554.32, chirp=15.589, fft_len=128k
Best gaussian: peak=3.746052, mean=0.5529013, ChiSq=1.418348, time=99.82, d_freq=1420952820.82,
	score=-0.1721001, null_hyp=2.245479, chirp=4.3533, fft_len=16k
Best pulse: peak=1.605275, time=12.27, period=0.3665, d_freq=1420953978.6, score=0.9942, chirp=49.664, fft_len=64 
Best triplet: peak=9.929989, time=3.742, period=0.6717, d_freq=1420961617.33, chirp=83.808, fft_len=64 
Spike count:    1
Autocorr count: 0
Pulse count:    0
Triplet count:  7
Gaussian count: 0
Wallclock time elapsed since last restart: 299.0 seconds
Fftlength=8,pass=3:Tune: sum=5185.26(ms); min=19.8(ms); max=186.4(ms); mean=99.72(ms); s_mean=129.4; sleep=120(ms); delta=1056; N=52; usual
Fftlength=8,pass=4:Tune: sum=3414.35(ms); min=21.41(ms); max=125.6(ms); mean=69.68(ms); s_mean=84.25; sleep=75(ms); delta=1008; N=49; usual
Fftlength=8,pass=5:Tune: sum=2401.17(ms); min=6.922(ms); max=93.53(ms); mean=51.09(ms); s_mean=56.58; sleep=45(ms); delta=976; N=47; usual
Fftlength=16,pass=3:Tune: sum=2898.27(ms); min=1.858(ms); max=71.02(ms); mean=35.78(ms); s_mean=55.13; sleep=45(ms); delta=760; N=81; usual
Fftlength=16,pass=4:Tune: sum=1946.93(ms); min=2.754(ms); max=50.33(ms); mean=25.96(ms); s_mean=37.78; sleep=30(ms); delta=712; N=75; usual
Fftlength=16,pass=5:Tune: sum=1409.4(ms); min=2.14(ms); max=39.26(ms); mean=19.85(ms); s_mean=27.29; sleep=30(ms); delta=680; N=71; usual
Fftlength=32,pass=3:Tune: sum=1842.94(ms); min=0.9564(ms); max=33.86(ms); mean=14.18(ms); s_mean=16.37; sleep=15(ms); delta=576; N=130; usual
Fftlength=32,pass=4:Tune: sum=1287.67(ms); min=0.6912(ms); max=25.61(ms); mean=10.3(ms); s_mean=11.37; sleep=0(ms); delta=556; N=125; usual
Fftlength=32,pass=5:Tune: sum=915.267(ms); min=0.5356(ms); max=19.14(ms); mean=7.89(ms); s_mean=16.48; sleep=15(ms); delta=520; N=116; usual
Fftlength=64,pass=3:Tune: sum=2682.57(ms); min=0.3451(ms); max=41.42(ms); mean=14.58(ms); s_mean=41.02; sleep=30(ms); delta=396; N=184; high_perf
Fftlength=64,pass=4:Tune: sum=659.291(ms); min=0.2437(ms); max=13.04(ms); mean=5.071(ms); s_mean=10.04; sleep=0(ms); delta=374; N=130; usual
Fftlength=64,pass=5:Tune: sum=479.136(ms); min=0.2775(ms); max=9.978(ms); mean=4.026(ms); s_mean=9.223; sleep=0(ms); delta=352; N=119; usual
Fftlength=128,pass=3:Tune: sum=4386.23(ms); min=22.24(ms); max=23.22(ms); mean=22.73(ms); s_mean=22.9; sleep=15(ms); delta=1; N=193; usual
Fftlength=256,pass=3:Tune: sum=4068.75(ms); min=10.2(ms); max=10.8(ms); mean=10.51(ms); s_mean=10.47; sleep=0(ms); delta=1; N=387; usual
Fftlength=512,pass=3:Tune: sum=3969.96(ms); min=4.989(ms); max=5.34(ms); mean=5.136(ms); s_mean=5.154; sleep=0(ms); delta=1; N=773; usual
Fftlength=1024,pass=3:Tune: sum=1749.5(ms); min=1.007(ms); max=1.187(ms); mean=1.131(ms); s_mean=1.13; sleep=0(ms); delta=1; N=1547; usual
Fftlength=2048,pass=3:Tune: sum=1234.38(ms); min=0.3594(ms); max=0.4127(ms); mean=0.3991(ms); s_mean=0.4002; sleep=0(ms); delta=1; N=3093; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=20,	N=20,	<>=1,	min=1	max=1
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=24305,	N=24305,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=414,	N=414,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=12358,	N=12358,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=3,	N=3,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=3,	N=3,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=1,	N=1,	<>=1,	min=1	max=1


class PoT_transfer_not_needed:		total=24303,	N=24303,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=417,	N=417,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
08:38:14 (5480): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.