Task 8633672845

Name 25se11ab.4020.13564.15.42.144_1
Workunit 3925018164
Created 11 Mar 2020, 1:46:26 UTC
Sent 11 Mar 2020, 1:57:31 UTC
Report deadline 3 May 2020, 18:13:26 UTC
Received 11 Mar 2020, 21:12:04 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8322522
Run time 30 min
CPU time 28 min 57 sec
Validate state Valid
Credit 119.16
Device peak FLOPS 617.98 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG)
windows_intelx86
Peak working set size 118.64 MB
Peak swap size 145.35 MB
Peak disk usage 0.03 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 1
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 1
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: AMD Ryzen 7 1700 Eight-Core Processor           

     Cache: L1=64K L2=512K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX SSE4A 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.418706  NumCfft=198093  NumGauss=1129767130  NumPulse=226471023522  NumTriplet=452873486198
Currently allocated 229 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 1


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 2
  Max compute units:				 9
  Max work group size:				 1024
  Max clock frequency:				 1784Mhz
  Max memory allocation:			 805306368
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 147456
  Global memory size:				 3221225472
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 1060 3GB
  Vendor:					 NVIDIA Corporation
  Driver version:				 432.00
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer
  Max compute units:				 9
  Max work group size:				 1024
  Max clock frequency:				 1784Mhz
  Max memory allocation:			 805306368
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 147456
  Global memory size:				 3221225472
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 1060 3GB
  Vendor:					 NVIDIA Corporation
  Driver version:				 432.00
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.418706
Used GPU device parameters are:
	Number of compute units: 9
	Single buffer allocation size: 128MB
	Total device global memory: 3072MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Spike: peak=24.04582, time=73.82, d_freq=1418902069.11, chirp=17.453, fft_len=128k
Spike: peak=24.73323, time=31.88, d_freq=1418907430.52, chirp=-34.161, fft_len=32k
Spike: peak=25.07686, time=31.88, d_freq=1418907430.54, chirp=-34.235, fft_len=32k
Spike: peak=24.5413, time=31.88, d_freq=1418907430.5, chirp=-34.264, fft_len=32k

Best spike: peak=25.07686, time=31.88, d_freq=1418907430.54, chirp=-34.235, fft_len=32k
Best autocorr: peak=17.34593, time=46.98, delay=6.7003, d_freq=1418907559.15, chirp=27.868, fft_len=128k
Best gaussian: peak=4.059689, mean=0.5702441, ChiSq=1.315725, time=101.5, d_freq=1418903645.39,
	score=-0.6687775, null_hyp=2.159217, chirp=7.3238, fft_len=16k
Best pulse: peak=0.8692789, time=19.25, period=0.1361, d_freq=1418901595.49, score=0.9817, chirp=35.638, fft_len=64 
Best triplet: peak=0, time=-2.122e+011, period=0, d_freq=0, chirp=0, fft_len=0 
Spike count:    4
Autocorr count: 0
Pulse count:    0
Triplet count:  0
Gaussian count: 0
Wallclock time elapsed since last restart: 1790.2 seconds
Fftlength=8,pass=3:Tune: sum=17158.6(ms); min=6.029(ms); max=228.6(ms); mean=56.26(ms); s_mean=44.43; sleep=45(ms); delta=120; N=305; usual
Fftlength=8,pass=4:Tune: sum=11146.3(ms); min=7.902(ms); max=157.7(ms); mean=48.89(ms); s_mean=50.87; sleep=45(ms); delta=153; N=228; usual
Fftlength=8,pass=5:Tune: sum=8650.74(ms); min=4.747(ms); max=171.2(ms); mean=43.25(ms); s_mean=61.86; sleep=60(ms); delta=171; N=200; usual
Fftlength=16,pass=3:Tune: sum=10944.5(ms); min=1.522(ms); max=265.3(ms); mean=41.77(ms); s_mean=58.78; sleep=60(ms); delta=184; N=262; usual
Fftlength=16,pass=4:Tune: sum=7657.51(ms); min=1.815(ms); max=231.5(ms); mean=32.72(ms); s_mean=54.63; sleep=45(ms); delta=218; N=234; usual
Fftlength=16,pass=5:Tune: sum=5815.68(ms); min=1.64(ms); max=177(ms); mean=26.32(ms); s_mean=39.68; sleep=30(ms); delta=218; N=221; usual
Fftlength=32,pass=3:Tune: sum=8679.11(ms); min=0.6861(ms); max=228.2(ms); mean= 33(ms); s_mean=52.79; sleep=45(ms); delta=219; N=263; usual
Fftlength=32,pass=4:Tune: sum=4903.3(ms); min=1.356(ms); max=93.9(ms); mean=19.61(ms); s_mean=26.78; sleep=15(ms); delta=261; N=250; usual
Fftlength=32,pass=5:Tune: sum=3965.4(ms); min=0.8018(ms); max=95.96(ms); mean=16.73(ms); s_mean=22.37; sleep=15(ms); delta=247; N=237; usual
Fftlength=64,pass=3:Tune: sum=6106.04(ms); min=0.3901(ms); max=122.1(ms); mean=21.2(ms); s_mean=32.37; sleep=30(ms); delta=293; N=288; usual
Fftlength=64,pass=4:Tune: sum=3509.85(ms); min=0.6564(ms); max=151.2(ms); mean=12.58(ms); s_mean=25.27; sleep=15(ms); delta=287; N=279; usual
Fftlength=64,pass=5:Tune: sum=2650.95(ms); min=0.2284(ms); max=79.68(ms); mean=10.16(ms); s_mean=24.01; sleep=15(ms); delta=269; N=261; usual
Fftlength=128,pass=3:Tune: sum=19012.5(ms); min=14.59(ms); max=341.2(ms); mean=93.66(ms); s_mean=104.2; sleep=105(ms); delta=1; N=203; high_perf
Fftlength=256,pass=3:Tune: sum=14571.1(ms); min=9.525(ms); max=198.7(ms); mean=35.98(ms); s_mean=26.55; sleep=15(ms); delta=1; N=405; high_perf
Fftlength=512,pass=3:Tune: sum=9616(ms); min=4.295(ms); max=97.38(ms); mean=11.89(ms); s_mean=10.22; sleep=0(ms); delta=1; N=809; high_perf
Fftlength=1024,pass=3:Tune: sum=3336.42(ms); min=1.111(ms); max=7.197(ms); mean=2.063(ms); s_mean=2.355; sleep=0(ms); delta=1; N=1617; usual
Fftlength=2048,pass=3:Tune: sum=2438.71(ms); min=0.3535(ms); max=2.851(ms); mean=0.7543(ms); s_mean=0.8226; sleep=0(ms); delta=1; N=3233; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=39,	N=39,	<>=1,	min=1	max=1
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=25450,	N=25450,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=389,	N=389,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=12911,	N=12911,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=10,	N=10,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=2,	N=2,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=1,	N=1,	<>=1,	min=1	max=1


class PoT_transfer_not_needed:		total=25441,	N=25441,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=399,	N=399,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
22:04:17 (16816): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.