Task 8704164402

Name 28fe20ab.30237.3339.14.41.94_4
Workunit 3909674856
Created 23 Apr 2020, 7:39:28 UTC
Sent 23 Apr 2020, 7:40:18 UTC
Report deadline 15 Jun 2020, 3:29:28 UTC
Received 23 Apr 2020, 8:41:30 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8906239
Run time 8 min 58 sec
CPU time 8 min 45 sec
Validate state Valid
Credit 98.07
Device peak FLOPS 413.23 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG) windows_intelx86
Peak working set size 65.95 MB
Peak swap size 91.85 MB
Peak disk usage 0.04 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 2
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 2
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: AMD Ryzen 3 1200 Quad-Core Processor            

     Cache: L1=64K L2=512K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX SSE4A 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.436953  NumCfft=193631  NumGauss=1081046568  NumPulse=226457612784  NumTriplet=452846075698
Currently allocated 229 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 1


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 3
  Max compute units:				 8
  Max work group size:				 1024
  Max clock frequency:				 1342Mhz
  Max memory allocation:			 1073741824
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 393216
  Global memory size:				 4294967296
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 960
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.19
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics
  Max compute units:				 6
  Max work group size:				 1024
  Max clock frequency:				 1124Mhz
  Max memory allocation:			 536870912
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 98304
  Global memory size:				 2147483648
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 760
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.19
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics
  Max compute units:				 6
  Max work group size:				 1024
  Max clock frequency:				 1137Mhz
  Max memory allocation:			 536870912
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 98304
  Global memory size:				 2147483648
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GTX 760
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.19
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.436953
Used GPU device parameters are:
	Number of compute units: 6
	Single buffer allocation size: 128MB
	Total device global memory: 2048MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Gaussian: peak=3.699875, mean=0.5250837, ChiSq=1.405751, time=69.63, d_freq=1420917576.31,
	score=1.819183, null_hyp=2.342487, chirp=31.877, fft_len=16k
Triplet: peak=12.9005, time=20.74, period=1.678, d_freq=1420916149.59, chirp=-36.209, fft_len=128 
Triplet: peak=11.14727, time=20.74, period=1.678, d_freq=1420916138.85, chirp=-36.726, fft_len=128 
Triplet: peak=13.37146, time=20.74, period=1.678, d_freq=1420916150.77, chirp=-39.83, fft_len=128 
Triplet: peak=12.59224, time=20.74, period=1.678, d_freq=1420916140.03, chirp=-40.348, fft_len=128 
Triplet: peak=12.9109, time=20.74, period=1.678, d_freq=1420916151.95, chirp=-43.451, fft_len=128 
Triplet: peak=13.21854, time=20.74, period=1.678, d_freq=1420916141.23, chirp=-43.968, fft_len=128 
Triplet: peak=13.01439, time=20.74, period=1.678, d_freq=1420916142.41, chirp=-47.589, fft_len=128 
Triplet: peak=12.13529, time=20.74, period=1.678, d_freq=1420916143.61, chirp=-51.21, fft_len=128 
Triplet: peak=12.21828, time=20.74, period=1.678, d_freq=1420916143.61, chirp=-51.21, fft_len=128 
Triplet: peak=11.17852, time=20.74, period=1.678, d_freq=1420916144.8, chirp=-54.831, fft_len=128 
Triplet: peak=10.72906, time=20.74, period=1.678, d_freq=1420916144.8, chirp=-54.831, fft_len=128 

Best spike: peak=23.64994, time=30.2, d_freq=1420917996.71, chirp=-11.908, fft_len=64k
Best autocorr: peak=17.10462, time=73.82, delay=3.4996, d_freq=1420919459.62, chirp=20.196, fft_len=128k
Best gaussian: peak=3.699875, mean=0.5250837, ChiSq=1.405751, time=69.63, d_freq=1420917576.31,
	score=1.819183, null_hyp=2.342487, chirp=31.877, fft_len=16k
Best pulse: peak=1.393864, time=98.23, period=0.2997, d_freq=1420921425.7, score=0.9626, chirp=16.553, fft_len=16 
Best triplet: peak=13.37146, time=20.74, period=1.678, d_freq=1420916150.77, chirp=-39.83, fft_len=128 
Spike count:    0
Autocorr count: 0
Pulse count:    0
Triplet count:  11
Gaussian count: 1
Wallclock time elapsed since last restart: 533.9 seconds
Fftlength=8,pass=3:Tune: sum=10101.4(ms); min=7.189(ms); max=68.28(ms); mean=45.71(ms); s_mean=51.62; sleep=45(ms); delta=182; N=221; usual
Fftlength=8,pass=4:Tune: sum=7293.18(ms); min=6.585(ms); max=63.22(ms); mean=39.64(ms); s_mean=50.11; sleep=45(ms); delta=211; N=184; usual
Fftlength=8,pass=5:Tune: sum=5463.98(ms); min=3.974(ms); max=55.45(ms); mean=34.8(ms); s_mean=45.78; sleep=45(ms); delta=236; N=157; usual
Fftlength=16,pass=3:Tune: sum=5802.59(ms); min=1.677(ms); max=56.08(ms); mean=27.37(ms); s_mean=45.83; sleep=45(ms); delta=259; N=212; usual
Fftlength=16,pass=4:Tune: sum=4106.61(ms); min=2.07(ms); max=42.85(ms); mean=20.85(ms); s_mean=34.03; sleep=30(ms); delta=244; N=197; usual
Fftlength=16,pass=5:Tune: sum=3001.39(ms); min=1.959(ms); max=31.25(ms); mean=16.67(ms); s_mean=24.12; sleep=15(ms); delta=227; N=180; usual
Fftlength=32,pass=3:Tune: sum=3718.98(ms); min=0.581(ms); max=36.23(ms); mean=15.43(ms); s_mean=23.65; sleep=15(ms); delta=264; N=241; usual
Fftlength=32,pass=4:Tune: sum=2787.9(ms); min=0.8064(ms); max=27.48(ms); mean=12.17(ms); s_mean=17.23; sleep=15(ms); delta=252; N=229; usual
Fftlength=32,pass=5:Tune: sum=2193.46(ms); min=0.7545(ms); max=23.33(ms); mean=10.11(ms); s_mean=13.26; sleep=15(ms); delta=240; N=217; usual
Fftlength=64,pass=3:Tune: sum=3741.77(ms); min=0.4761(ms); max=36.39(ms); mean=13.76(ms); s_mean=17.19; sleep=15(ms); delta=283; N=272; usual
Fftlength=64,pass=4:Tune: sum=2753.26(ms); min=0.3794(ms); max=28.53(ms); mean=10.47(ms); s_mean=12.57; sleep=15(ms); delta=274; N=263; usual
Fftlength=64,pass=5:Tune: sum=2201.64(ms); min=0.3552(ms); max=23.6(ms); mean=8.95(ms); s_mean=21.54; sleep=15(ms); delta=257; N=246; usual
Fftlength=128,pass=3:Tune: sum=4316.28(ms); min=21.85(ms); max=22.74(ms); mean=22.36(ms); s_mean=22.32; sleep=15(ms); delta=1; N=193; usual
Fftlength=128,pass=4:Tune: sum=3235.1(ms); min=16.35(ms); max=17.15(ms); mean=16.76(ms); s_mean=16.76; sleep=15(ms); delta=1; N=193; usual
Fftlength=128,pass=5:Tune: sum=2561.52(ms); min=12.96(ms); max=13.73(ms); mean=13.34(ms); s_mean=13.37; sleep=15(ms); delta=1; N=192; usual
Fftlength=256,pass=3:Tune: sum=10127.4(ms); min=11.1(ms); max=26.64(ms); mean=26.17(ms); s_mean=26.28; sleep=15(ms); delta=1; N=387; high_perf
Fftlength=512,pass=3:Tune: sum=8740.85(ms); min=4.841(ms); max=11.67(ms); mean=11.31(ms); s_mean=11.31; sleep=0(ms); delta=1; N=773; high_perf
Fftlength=1024,pass=3:Tune: sum=7782.48(ms); min=4.962(ms); max=5.573(ms); mean=5.031(ms); s_mean=4.978; sleep=0(ms); delta=1; N=1547; usual
Fftlength=2048,pass=3:Tune: sum=4557.3(ms); min=1.449(ms); max=1.627(ms); mean=1.473(ms); s_mean=1.459; sleep=0(ms); delta=1; N=3093; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=15,	N=15,	<>=1,	min=1	max=1
class Gaussian_report:		total=1,	N=1,	<>=1,	min=1	max=1
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=24336,	N=24336,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=387,	N=387,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=12356,	N=12356,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=7,	N=7,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=3,	N=3,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=1,	N=1,	<>=1,	min=1	max=1


class PoT_transfer_not_needed:		total=24331,	N=24331,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=393,	N=393,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
03:59:19 (4544): called boinc_finish(0)

</stderr_txt>
]]>
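
The per-FFT-length `Tune` lines in the stderr output above appear to report aggregate kernel timings, where `mean` matches `sum / N` (for example, 10101.4 ms / 221 ≈ 45.71 ms for `Fftlength=8,pass=3`). The application does not document these fields here, so that reading is an assumption drawn from the numbers alone. A minimal Python sketch for checking it follows; the helper name `parse_tune_line` and the regular expression are hypothetical and not part of the SETI@home application.

```python
import re

# Hypothetical pattern for one "Tune" line of the stderr output.
# The meaning of each field is inferred from the log, not from documentation.
TUNE_RE = re.compile(
    r"Fftlength=(?P<fftlen>\d+),pass=(?P<pass_>\d+):Tune: "
    r"sum=(?P<sum>[\d.]+)\(ms\); .*?mean=(?P<mean>[\d.]+)\(ms\); .*?N=(?P<n>\d+);"
)


def parse_tune_line(line):
    """Return the numeric fields of a Tune line, or None if it does not match."""
    m = TUNE_RE.search(line)
    if m is None:
        return None
    return {
        "fftlen": int(m.group("fftlen")),
        "pass": int(m.group("pass_")),
        "sum_ms": float(m.group("sum")),
        "mean_ms": float(m.group("mean")),
        "n": int(m.group("n")),
    }


# Sample line copied from the log above.
line = (
    "Fftlength=8,pass=3:Tune: sum=10101.4(ms); min=7.189(ms); max=68.28(ms); "
    "mean=45.71(ms); s_mean=51.62; sleep=45(ms); delta=182; N=221; usual"
)
rec = parse_tune_line(line)

# Compare the reported mean with sum / N (assumed relationship).
print(rec["fftlen"], rec["pass"], rec["sum_ms"] / rec["n"], rec["mean_ms"])
```

Running the same check over every `Tune` line is one way to spot lines whose reported mean diverges from `sum / N`.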



 