Task 8703534212

Name blc43_2bit_guppi_58838_18666_TIC427348923_0077.25539.0.19.28.117_2
Workunit 3922958516
Created 20 Apr 2020, 18:54:36 UTC
Sent 20 Apr 2020, 18:58:04 UTC
Report deadline 20 Jul 2020, 1:11:00 UTC
Received 21 Apr 2020, 7:59:20 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8705897
Run time 13 min 35 sec
CPU time 2 min 46 sec
Validate state Valid
Credit 50.93
Device peak FLOPS 65.94 GFLOPS
Application version SETI@home v8 v8.24 (opencl_ati5_nocal)
windows_intelx86
Peak working set size 132.88 MB
Peak swap size 136.94 MB
Peak disk usage 0.04 MB

Stderr output

<core_client_version>7.16.5</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: Advanced Micro Devices, Inc.
BOINC assigns device 0
1 slot of 64 used for this instance
Info: BOINC provided OpenCL device ID used
Info: CPU affinity mask used: 4; system mask is ff

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_HD5xxx	OCL_ZERO_COPY	OCL_CHIRP3	FFTW	AMD specific	USE_SSE2	x86	
     CPUID: AMD Ryzen 5 3550H with Radeon Vega Mobile Gfx   

     Cache: L1=64K L2=512K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX SSE4A 
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.149903  NumCfft=151775  NumGauss=904950914  NumPulse=164878721116  NumTriplet=329729067212
Currently allocated 209 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE2xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

AMD HD5 version by Raistmer

Number of OpenCL platforms:				 1


 OpenCL Platform Name:					 AMD Accelerated Parallel Processing
Number of devices:				 2
  Max compute units:				 14
  Max work group size:				 256
  Max clock frequency:				 1223Mhz
  Max memory allocation:			 3221225472
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 16384
  Global memory size:				 3221225472
  Constant buffer size:				 3221225472
  Max number of constant args:			 8
  Local memory type:				 Scratchpad
  Local memory size:				 32768
  Queue properties:				 
    Out-of-Order:				 No
  Name:						 Baffin
  Vendor:					 Advanced Micro Devices, Inc.
  Driver version:				 2841.19
  Version:					 OpenCL 1.2 AMD-APP (2841.19)
  Extensions:					 cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_khr_image2d_from_buffer cl_khr_spir cl_khr_gl_event cl_amd_liquid_flash 
  Max compute units:				 8
  Max work group size:				 256
  Max clock frequency:				 1201Mhz
  Max memory allocation:			 2503535001
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 16384
  Global memory size:				 3079553024
  Constant buffer size:				 2503535001
  Max number of constant args:			 8
  Local memory type:				 Scratchpad
  Local memory size:				 32768
  Queue properties:				 
    Out-of-Order:				 No
  Name:						 gfx902
  Vendor:					 Advanced Micro Devices, Inc.
  Driver version:				 2841.19 (PAL,HSAIL)
  Version:					 OpenCL 1.2 AMD-APP (2841.19)
  Extensions:					 cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_khr_image2d_from_buffer cl_khr_spir cl_khr_gl_event cl_amd_liquid_flash 


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.149903
Used GPU device parameters are:
	Number of compute units: 14
	Single buffer allocation size: 128MB
	Total device global memory: 3072MB
	max WG size: 256
	local mem type: Real
	LotOfMem path: no
	LowPerformanceGPU path: no
	HighPerformanceGPU path: no
period_iterations_num=50
Triplet: peak=11.02293, time=75.22, period=2.93, d_freq=10504268373.9, chirp=-1.2363, fft_len=64 
Spike: peak=24.40483, time=42.95, d_freq=10504271640, chirp=-3.4524, fft_len=64k
Triplet: peak=10.96302, time=75.22, period=2.93, d_freq=10504268359.8, chirp=-6.1788, fft_len=64 
Triplet: peak=10.59525, time=75.22, period=2.93, d_freq=10504268359.8, chirp=-6.1788, fft_len=64 
Triplet: peak=10.93156, time=75.22, period=2.93, d_freq=10504268352.7, chirp=-8.6514, fft_len=64 
Spike: peak=24.31107, time=71.58, d_freq=10504268704.3, chirp=17.211, fft_len=64k
Spike: peak=24.09426, time=71.58, d_freq=10504268704.3, chirp=17.216, fft_len=64k
Pulse: peak=5.875545, time=36.14, period=1.854, d_freq=10504272040.5, score=1.003, chirp=84.039, fft_len=32 
D:	threshold 0.01331355; unscaled peak power: 0.0133447 exceeds threshold for 0.234%

Best spike: peak=24.40483, time=42.95, d_freq=10504271640, chirp=-3.4524, fft_len=64k
Best autocorr: peak=16.97178, time=17.18, delay=1.5032, d_freq=10504268352.1, chirp=-17.091, fft_len=128k
Best gaussian: peak=2.795178, mean=0.5251512, ChiSq=1.419976, time=52.26, d_freq=10504263464.1,
	score=-2.249868, null_hyp=2.193376, chirp=11.519, fft_len=16k
Best pulse: peak=5.875545, time=36.14, period=1.854, d_freq=10504272040.5, score=1.003, chirp=84.039, fft_len=32 
Best triplet: peak=11.02293, time=75.22, period=2.93, d_freq=10504268373.9, chirp=-1.2363, fft_len=64 
Spike count:    3
Autocorr count: 0
Pulse count:    1
Triplet count:  4
Gaussian count: 0
Wallclock time elapsed since last restart: 808.1 seconds
Fftlength=8,pass=3:Tune: sum=6727.4(ms); min=5.222(ms); max=62.81(ms); mean=34.86(ms); s_mean=50.77; sleep=45(ms); delta=243; N=193; usual
Fftlength=8,pass=4:Tune: sum=4174.77(ms); min=3.752(ms); max=39.46(ms); mean=23.59(ms); s_mean=26.86; sleep=15(ms); delta=240; N=177; usual
Fftlength=8,pass=5:Tune: sum=3012.97(ms); min=3.162(ms); max=29.9(ms); mean=18.37(ms); s_mean=23.18; sleep=15(ms); delta=227; N=164; usual
Fftlength=16,pass=3:Tune: sum=6010.42(ms); min=2.411(ms); max=56.62(ms); mean=28.76(ms); s_mean=44.06; sleep=45(ms); delta=256; N=209; usual
Fftlength=16,pass=4:Tune: sum=4204.86(ms); min=3.672(ms); max=40.93(ms); mean=22.73(ms); s_mean=29.11; sleep=30(ms); delta=248; N=185; usual
Fftlength=16,pass=5:Tune: sum=3020.98(ms); min=1.323(ms); max=31.43(ms); mean=18.65(ms); s_mean=26.83; sleep=15(ms); delta=241; N=162; usual
Fftlength=32,pass=3:Tune: sum=4482.21(ms); min=1.329(ms); max=44.49(ms); mean=21.45(ms); s_mean=36.07; sleep=30(ms); delta=256; N=209; usual
Fftlength=32,pass=4:Tune: sum=2415.84(ms); min=0.8234(ms); max=24.99(ms); mean=13.2(ms); s_mean=19.5; sleep=15(ms); delta=246; N=183; usual
Fftlength=32,pass=5:Tune: sum=1819.27(ms); min=0.7874(ms); max=20.18(ms); mean=11.44(ms); s_mean=14.27; sleep=15(ms); delta=238; N=159; usual
Fftlength=64,pass=3:Tune: sum=3924.05(ms); min=0.5478(ms); max=38.29(ms); mean=17.44(ms); s_mean=22.41; sleep=15(ms); delta=272; N=225; usual
Fftlength=64,pass=4:Tune: sum=2764.05(ms); min=0.429(ms); max=28.49(ms); mean=13.29(ms); s_mean=15.63; sleep=15(ms); delta=263; N=208; usual
Fftlength=64,pass=5:Tune: sum=2064.2(ms); min=0.379(ms); max=23.16(ms); mean=10.48(ms); s_mean=11.54; sleep=0(ms); delta=252; N=197; usual
Fftlength=128,pass=3:Tune: sum=3911.58(ms); min=0.3182(ms); max=25.25(ms); mean=15.58(ms); s_mean=24.41; sleep=15(ms); delta=278; N=251; usual
Fftlength=128,pass=4:Tune: sum=2824.51(ms); min=0.239(ms); max=18.14(ms); mean=11.77(ms); s_mean=17.69; sleep=15(ms); delta=267; N=240; usual
Fftlength=128,pass=5:Tune: sum=2194.13(ms); min=0.2219(ms); max=13.89(ms); mean=9.623(ms); s_mean=13.76; sleep=15(ms); delta=255; N=228; usual
Fftlength=256,pass=3:Tune: sum=8935.39(ms); min=13.78(ms); max=33.56(ms); mean=27.66(ms); s_mean=29.43; sleep=30(ms); delta=1; N=323; high_perf
Fftlength=512,pass=3:Tune: sum=9374.62(ms); min=7.128(ms); max=17.3(ms); mean=14.49(ms); s_mean=15.59; sleep=15(ms); delta=1; N=647; high_perf
Fftlength=1024,pass=3:Tune: sum=7435.29(ms); min=5.709(ms); max=5.945(ms); mean=5.742(ms); s_mean=5.743; sleep=0(ms); delta=1; N=1295; usual
Fftlength=2048,pass=3:Tune: sum=3662.05(ms); min=1.392(ms); max=2.039(ms); mean=1.414(ms); s_mean=1.411; sleep=0(ms); delta=1; N=2589; usual
Fftlength=4096,pass=3:Tune: sum=2684.8(ms); min=0.4946(ms); max=0.9237(ms); mean=0.5184(ms); s_mean=0.5194; sleep=0(ms); delta=1; N=5179; usual

class Gaussian_transfer_not_needed:		total=82822,	N=82822,	<>=1,	min=1	max=1
class Gaussian_transfer_needed:		total=16,	N=16,	<>=1,	min=1	max=1


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=34,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=17,	N=34,	<>=0.5,	min=0	max=1


class Gaussian_new_best:		total=25,	N=25,	<>=1,	min=1	max=1
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=9,	N=9,	<>=1,	min=1	max=1


class PC_triplet_find_hit:		total=20556,	N=20556,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=137,	N=137,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=10343,	N=10343,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=4,	N=4,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=4,	N=4,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=1,	N=1,	<>=1,	min=1	max=1


class PoT_transfer_not_needed:		total=20554,	N=20554,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=140,	N=140,	<>=1,	min=1	max=1

class SleepQuantum:		total=0,	N=0,	<>=0,	min=0	max=0

GPU device sync requested...  ...GPU device synched
08:47:31 (13764): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.