Task 8682041887

Name blc45_2bit_guppi_58838_12153_TIC452808876_0052.11443.409.20.29.109.vlar_0
Workunit 3946982402
Created 25 Mar 2020, 2:28:58 UTC
Sent 25 Mar 2020, 4:51:41 UTC
Report deadline 17 May 2020, 9:51:23 UTC
Received 26 Mar 2020, 7:58:42 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8307601
Run time 13 min 43 sec
CPU time 3 min 54 sec
Validate state Valid
Credit 103.60
Device peak FLOPS 106.72 GFLOPS
Application version SETI@home v8 v8.24 (opencl_ati_nocal)
windows_intelx86
Peak working set size 90.68 MB
Peak swap size 116.39 MB
Peak disk usage 0.04 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 2
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: Advanced Micro Devices, Inc.
BOINC assigns device 2
1 slot of 64 used for this instance
Info: BOINC provided OpenCL device ID used
Info: CPU affinity mask used: 4; system mask is ffffff

Build features: SETI8	Non-graphics	OpenCL	OCL_ZERO_COPY	OCL_CHIRP3	FFTW	AMD specific	USE_SSE2	x86	
     CPUID: AMD Eng Sample, ZS262445TCG45_32/26/20_2/16     

     Cache: L1=64K L2=2048K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX SSE4A XOP FMA4 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.011232  NumCfft=106181  NumGauss=0  NumPulse=36398105728  NumTriplet=49355738528
Currently allocated 185 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE2xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 1


 OpenCL Platform Name:					 AMD Accelerated Parallel Processing
Number of devices:				 3
  Max compute units:				 28
  Max work group size:				 256
  Max clock frequency:				 990Mhz
  Max memory allocation:			 1879048192
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 16384
  Global memory size:				 2147483648
  Constant buffer size:				 1879048192
  Max number of constant args:			 8
  Local memory type:				 Scratchpad
  Local memory size:				 32768
  Queue properties:				 
    Out-of-Order:				 No
  Name:						 Tonga
  Vendor:					 Advanced Micro Devices, Inc.
  Driver version:				 2766.5
  Version:					 OpenCL 1.2 AMD-APP (2766.5)
  Extensions:					 cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_khr_image2d_from_buffer cl_khr_spir cl_khr_gl_event cl_amd_liquid_flash 
  Max compute units:				 28
  Max work group size:				 256
  Max clock frequency:				 990Mhz
  Max memory allocation:			 1879048192
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 16384
  Global memory size:				 2147483648
  Constant buffer size:				 1879048192
  Max number of constant args:			 8
  Local memory type:				 Scratchpad
  Local memory size:				 32768
  Queue properties:				 
    Out-of-Order:				 No
  Name:						 Tonga
  Vendor:					 Advanced Micro Devices, Inc.
  Driver version:				 2766.5
  Version:					 OpenCL 1.2 AMD-APP (2766.5)
  Extensions:					 cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_khr_image2d_from_buffer cl_khr_spir cl_khr_gl_event cl_amd_liquid_flash 
  Max compute units:				 28
  Max work group size:				 256
  Max clock frequency:				 990Mhz
  Max memory allocation:			 1879048192
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 16384
  Global memory size:				 2147483648
  Constant buffer size:				 1879048192
  Max number of constant args:			 8
  Local memory type:				 Scratchpad
  Local memory size:				 32768
  Queue properties:				 
    Out-of-Order:				 No
  Name:						 Tonga
  Vendor:					 Advanced Micro Devices, Inc.
  Driver version:				 2766.5
  Version:					 OpenCL 1.2 AMD-APP (2766.5)
  Extensions:					 cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_khr_image2d_from_buffer cl_khr_spir cl_khr_gl_event cl_amd_liquid_flash 


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.011232
Used GPU device parameters are:
	Number of compute units: 28
	Single buffer allocation size: 128MB
	Total device global memory: 2048MB
	max WG size: 256
	local mem type: Real
	LotOfMem path: no
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Spike: peak=24.52505, time=74.45, d_freq=10228785467.3, chirp=-11.081, fft_len=128k
Spike: peak=25.7292, time=74.45, d_freq=10228785467.3, chirp=-11.082, fft_len=128k
Spike: peak=25.59886, time=74.45, d_freq=10228785467.3, chirp=-11.083, fft_len=128k
Spike: peak=24.18813, time=74.45, d_freq=10228785467.3, chirp=-11.085, fft_len=128k
Pulse: peak=3.504818, time=45.84, period=7.125, d_freq=10228781519.4, score=1.052, chirp=-12.873, fft_len=512 
D:	threshold 0.1326266; unscaled peak power: 0.1378889 exceeds threshold for 3.968%
Pulse: peak=2.659923, time=45.9, period=4.966, d_freq=10228788770.8, score=1.036, chirp=-22.022, fft_len=2k
D:	threshold 0.4598086; unscaled peak power: 0.4715607 exceeds threshold for 2.556%
Pulse: peak=5.686728, time=45.9, period=11.86, d_freq=10228784214, score=1.049, chirp=-66.755, fft_len=2k
D:	threshold 0.7633314; unscaled peak power: 0.7948622 exceeds threshold for 4.131%
Pulse: peak=1.501982, time=45.82, period=2.245, d_freq=10228786996.4, score=1.015, chirp=-80.18, fft_len=128 
D:	threshold 0.01950608; unscaled peak power: 0.01968595 exceeds threshold for 0.9221%
Pulse: peak=4.44938, time=45.9, period=9.932, d_freq=10228782264.6, score=1.027, chirp=80.317, fft_len=2k
D:	threshold 0.6495843; unscaled peak power: 0.6637216 exceeds threshold for 2.176%

Best spike: peak=25.7292, time=74.45, d_freq=10228785467.3, chirp=-11.082, fft_len=128k
Best autocorr: peak=16.98872, time=74.45, delay=1.655, d_freq=10228784957.9, chirp=-20.284, fft_len=128k
Best gaussian: peak=0, mean=0, ChiSq=0, time=-2.124e+011, d_freq=0,
	score=-12, null_hyp=0, chirp=0, fft_len=0 
Best pulse: peak=3.504818, time=45.84, period=7.125, d_freq=10228781519.4, score=1.052, chirp=-12.873, fft_len=512 
Best triplet: peak=0, time=-2.124e+011, period=0, d_freq=0, chirp=0, fft_len=0 
Spike count:    4
Autocorr count: 0
Pulse count:    5
Triplet count:  0
Gaussian count: 0
Wallclock time elapsed since last restart: 813.8 seconds
Fftlength=32,pass=3:Tune: sum=51473.4(ms); min=9.896(ms); max=67.78(ms); mean=51.73(ms); s_mean=54.66; sleep=45(ms); delta=423; N=995; usual
Fftlength=32,pass=4:Tune: sum=45939.1(ms); min=8.262(ms); max=70.66(ms); mean=51.04(ms); s_mean=55.03; sleep=45(ms); delta=410; N=900; usual
Fftlength=32,pass=5:Tune: sum=16712.9(ms); min=6.4(ms); max=43.69(ms); mean=26.15(ms); s_mean=34.31; sleep=30(ms); delta=798; N=639; usual
Fftlength=64,pass=3:Tune: sum=26292.1(ms); min=5.013(ms); max=69.03(ms); mean=32.1(ms); s_mean=57.73; sleep=60(ms); delta=912; N=819; usual
Fftlength=64,pass=4:Tune: sum=25345.3(ms); min=3.861(ms); max=72.28(ms); mean=33.53(ms); s_mean=60.89; sleep=60(ms); delta=825; N=756; usual
Fftlength=64,pass=5:Tune: sum=12241.1(ms); min=3.295(ms); max=35.55(ms); mean=17.34(ms); s_mean=27.68; sleep=30(ms); delta=817; N=706; usual
Fftlength=128,pass=3:Tune: sum=22925.5(ms); min=2.605(ms); max=59.25(ms); mean=25.93(ms); s_mean=38.44; sleep=30(ms); delta=939; N=884; usual
Fftlength=128,pass=4:Tune: sum=12777.9(ms); min=2.287(ms); max=36.92(ms); mean=16.05(ms); s_mean=21.71; sleep=15(ms); delta=907; N=796; usual
Fftlength=128,pass=5:Tune: sum=8044.57(ms); min=1.741(ms); max=23.25(ms); mean=10.63(ms); s_mean=13.8; sleep=15(ms); delta=868; N=757; usual
Fftlength=256,pass=3:Tune: sum=16136.4(ms); min=1.281(ms); max=32.03(ms); mean=17.83(ms); s_mean=30.99; sleep=30(ms); delta=988; N=905; usual
Fftlength=256,pass=4:Tune: sum=12013.5(ms); min=1.031(ms); max=23.62(ms); mean=14.34(ms); s_mean=22.65; sleep=15(ms); delta=949; N=838; usual
Fftlength=256,pass=5:Tune: sum=8818.73(ms); min=0.8573(ms); max=17.72(ms); mean=11.11(ms); s_mean=16.67; sleep=15(ms); delta=905; N=794; usual
Fftlength=512,pass=3:Tune: sum=34589.9(ms); min=0.01746(ms); max=56.75(ms); mean=27.54(ms); s_mean=33.36; sleep=30(ms); delta=1311; N=1256; high_perf
Fftlength=512,pass=4:Tune: sum=1190.91(ms); min=0.5135(ms); max=11.1(ms); mean=4.528(ms); s_mean=10.49; sleep=0(ms); delta=1289; N=263; usual
Fftlength=512,pass=5:Tune: sum=870.392(ms); min=0.4299(ms); max=8.429(ms); mean=3.612(ms); s_mean=8.146; sleep=0(ms); delta=1267; N=241; usual
Fftlength=1024,pass=3:Tune: sum=36762.5(ms); min=0.3313(ms); max=498.2(ms); mean=16.27(ms); s_mean=17.02; sleep=15(ms); delta=2277; N=2259; high_perf
Fftlength=1024,pass=4:Tune: sum=293.179(ms); min=0.2658(ms); max=5.545(ms); mean=2.221(ms); s_mean=4.882; sleep=0(ms); delta=2276; N=132; usual
Fftlength=1024,pass=5:Tune: sum=212.807(ms); min=0.2259(ms); max=4.083(ms); mean=1.773(ms); s_mean=3.846; sleep=0(ms); delta=2264; N=120; usual
Fftlength=2048,pass=3:Tune: sum=36672.8(ms); min=3.813(ms); max=8.741(ms); mean=8.429(ms); s_mean=8.396; sleep=0(ms); delta=1; N=4351; high_perf
Fftlength=4096,pass=3:Tune: sum=20932.7(ms); min=1.809(ms); max=4.281(ms); mean=2.406(ms); s_mean=2.345; sleep=0(ms); delta=1; N=8701; high_perf
Fftlength=8192,pass=3:Tune: sum=37174.3(ms); min=0.897(ms); max=2.24(ms); mean=2.136(ms); s_mean=2.136; sleep=0(ms); delta=1; N=17401; high_perf

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=46207,	N=46207,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=885,	N=885,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=34721,	N=34721,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=10,	N=10,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=6,	N=6,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=0,	N=0,	<>=0,	min=0	max=0


class PoT_transfer_not_needed:		total=46200,	N=46200,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=893,	N=893,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
09:57:16 (4648): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.