Task 8625273537

Name blc44_2bit_guppi_58838_14733_TIC249067445_0060.21711.818.20.29.35.vlar_1
Workunit 3921148863
Created 8 Mar 2020, 19:28:03 UTC
Sent 8 Mar 2020, 19:28:09 UTC
Report deadline 1 May 2020, 0:27:51 UTC
Received 9 Mar 2020, 3:22:40 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8885749
Run time 4 min
CPU time 3 min 54 sec
Validate state Valid
Credit 120.54
Device peak FLOPS 4,311.87 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG) windows_intelx86
Peak working set size 109.11 MB
Peak swap size 136.98 MB
Peak disk usage 0.01 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i9-9900K CPU @ 3.60GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.011492  NumCfft=105679  NumGauss=0  NumPulse=35871981440  NumTriplet=48829614240
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 1


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 1
  Max compute units:				 68
  Max work group size:				 1024
  Max clock frequency:				 1650Mhz
  Max memory allocation:			 2952790016
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 2228224
  Global memory size:				 11811160064
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce RTX 2080 Ti
  Vendor:					 NVIDIA Corporation
  Driver version:				 442.50
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.011492
Used GPU device parameters are:
	Number of compute units: 68
	Single buffer allocation size: 128MB
	Total device global memory: 11264MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Pulse: peak=6.554981, time=45.99, period=15.84, d_freq=10377353486.4, score=1.065, chirp=21.805, fft_len=4k
D:	threshold 1.766139; unscaled peak power: 1.865489 exceeds threshold for 5.625%
Pulse: peak=9.475282, time=45.99, period=28.99, d_freq=10377359280.2, score=1.001, chirp=30.107, fft_len=4k
D:	threshold 2.944803; unscaled peak power: 2.947959 exceeds threshold for 0.1072%
Pulse: peak=6.368633, time=45.9, period=15.61, d_freq=10377350782, score=1.01, chirp=-40.113, fft_len=2k
D:	threshold 0.8824849; unscaled peak power: 0.8901256 exceeds threshold for 0.8658%
Pulse: peak=3.792335, time=45.86, period=8.892, d_freq=10377352429, score=1.012, chirp=44.87, fft_len=1024 
D:	threshold 0.3023257; unscaled peak power: 0.3051911 exceeds threshold for 0.9478%
Pulse: peak=1.653185, time=45.84, period=2.509, d_freq=10377351208.2, score=1.003, chirp=69.218, fft_len=512 
D:	threshold 0.08507257; unscaled peak power: 0.08523544 exceeds threshold for 0.1914%
Pulse: peak=7.62893, time=46.17, period=22.19, d_freq=10377348263.4, score=1.054, chirp=82.907, fft_len=8k
D:	threshold 4.430549; unscaled peak power: 4.640502 exceeds threshold for 4.739%
Pulse: peak=9.299884, time=46.17, period=29.35, d_freq=10377348223, score=1.008, chirp=84.33, fft_len=8k
D:	threshold 5.706918; unscaled peak power: 5.750397 exceeds threshold for 0.7619%

Best spike: peak=23.88209, time=51.54, d_freq=10377356822.7, chirp=-17.668, fft_len=128k
Best autocorr: peak=15.5332, time=5.727, delay=4.9959, d_freq=10377353774.3, chirp=18.519, fft_len=128k
Best gaussian: peak=0, mean=0, ChiSq=0, time=-2.124e+011, d_freq=0,
	score=-12, null_hyp=0, chirp=0, fft_len=0 
Best pulse: peak=6.554982, time=45.99, period=15.84, d_freq=10377353486.4, score=1.065, chirp=21.805, fft_len=4k
Best triplet: peak=0, time=-2.124e+011, period=0, d_freq=0, chirp=0, fft_len=0 
Spike count:    0
Autocorr count: 0
Pulse count:    7
Triplet count:  0
Gaussian count: 0
Wallclock time elapsed since last restart: 234.6 seconds
Fftlength=32,pass=3:Tune: sum=25185.9(ms); min=4.524(ms); max=64.88(ms); mean=32.75(ms); s_mean=45.98; sleep=45(ms); delta=839; N=769; usual
Fftlength=32,pass=4:Tune: sum=14130.3(ms); min=2.837(ms); max=37.43(ms); mean=19.85(ms); s_mean=31.82; sleep=30(ms); delta=839; N=712; usual
Fftlength=32,pass=5:Tune: sum=8881.36(ms); min=2.654(ms); max=24.9(ms); mean=13.9(ms); s_mean=18.97; sleep=15(ms); delta=798; N=639; usual
Fftlength=64,pass=3:Tune: sum=12841.2(ms); min=1.943(ms); max=33.5(ms); mean=15.79(ms); s_mean=19.93; sleep=15(ms); delta=908; N=813; usual
Fftlength=64,pass=4:Tune: sum=7239.16(ms); min=1.433(ms); max=20.13(ms); mean=9.836(ms); s_mean=16.18; sleep=15(ms); delta=863; N=736; usual
Fftlength=64,pass=5:Tune: sum=5097.66(ms); min=1.142(ms); max=14.74(ms); mean=7.497(ms); s_mean=11.37; sleep=0(ms); delta=815; N=680; usual
Fftlength=128,pass=3:Tune: sum=9008.57(ms); min=0.7259(ms); max=23.25(ms); mean=10.4(ms); s_mean=15.39; sleep=15(ms); delta=933; N=866; usual
Fftlength=128,pass=4:Tune: sum=6345.15(ms); min=0.8114(ms); max=18.21(ms); mean=7.682(ms); s_mean=10.99; sleep=0(ms); delta=893; N=826; usual
Fftlength=128,pass=5:Tune: sum=2873.45(ms); min=0.5577(ms); max=8.546(ms); mean=3.931(ms); s_mean=4.854; sleep=0(ms); delta=866; N=731; usual
Fftlength=256,pass=3:Tune: sum=8094.1(ms); min=0.3605(ms); max=21.6(ms); mean=8.885(ms); s_mean=19.53; sleep=15(ms); delta=978; N=911; high_perf
Fftlength=256,pass=4:Tune: sum=1378.94(ms); min=0.3046(ms); max=6.477(ms); mean=2.549(ms); s_mean=5.615; sleep=0(ms); delta=939; N=541; usual
Fftlength=256,pass=5:Tune: sum=934.831(ms); min=0.3543(ms); max=5.183(ms); mean=1.989(ms); s_mean=3.894; sleep=0(ms); delta=902; N=470; usual
Fftlength=512,pass=3:Tune: sum=8300.57(ms); min=0.2465(ms); max=9.784(ms); mean=6.748(ms); s_mean=8.161; sleep=0(ms); delta=1297; N=1230; high_perf
Fftlength=512,pass=4:Tune: sum=300.614(ms); min=0.2002(ms); max=2.504(ms); mean=1.193(ms); s_mean=2.432; sleep=0(ms); delta=1276; N=252; usual
Fftlength=512,pass=5:Tune: sum=222.336(ms); min=0.1862(ms); max=1.941(ms); mean=0.9667(ms); s_mean=1.929; sleep=0(ms); delta=1254; N=230; usual
Fftlength=1024,pass=3:Tune: sum=8292.63(ms); min=0.1249(ms); max=5.345(ms); mean=3.732(ms); s_mean=3.954; sleep=0(ms); delta=2255; N=2222; high_perf
Fftlength=1024,pass=4:Tune: sum=73.1682(ms); min=0.1018(ms); max=1.2(ms); mean=0.5807(ms); s_mean=1.082; sleep=0(ms); delta=2245; N=126; usual
Fftlength=1024,pass=5:Tune: sum=53.9648(ms); min=0.09219(ms); max=0.9398(ms); mean=0.4652(ms); s_mean=0.9181; sleep=0(ms); delta=2235; N=116; usual
Fftlength=2048,pass=3:Tune: sum=7491.01(ms); min=0.7908(ms); max=2.695(ms); mean=1.747(ms); s_mean=1.743; sleep=0(ms); delta=1; N=4287; high_perf
Fftlength=4096,pass=3:Tune: sum=6741.16(ms); min=0.3523(ms); max=1.568(ms); mean=0.7861(ms); s_mean=0.7851; sleep=0(ms); delta=1; N=8575; high_perf
Fftlength=8192,pass=3:Tune: sum=3394.52(ms); min=0.1923(ms); max=0.4402(ms); mean=0.1979(ms); s_mean=0.1968; sleep=0(ms); delta=1; N=17151; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=33860,	N=33860,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=12730,	N=12730,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=34220,	N=34220,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=9,	N=9,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=3,	N=3,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=0,	N=0,	<>=0,	min=0	max=0


class PoT_transfer_not_needed:		total=33852,	N=33852,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=12739,	N=12739,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
03:18:24 (12376): called boinc_finish(0)

</stderr_txt>
]]>



 