Task 8703902865

Name blc66_2bit_guppi_58838_27577_TIC67772767_0105.9795.409.19.28.205.vlar_3
Workunit 3940965777
Created 22 Apr 2020, 18:36:56 UTC
Sent 22 Apr 2020, 18:38:27 UTC
Report deadline 14 Jun 2020, 23:38:09 UTC
Received 2 May 2020, 20:30:46 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x00000000)
Computer ID 8126787
Run time 2 hours 21 min 3 sec
CPU time 34 min 52 sec
Validate state Valid
Credit 116.72
Device peak FLOPS 87.56 GFLOPS
Application version SETI@home v8 v8.22 (opencl_nvidia_SoG) windows_intelx86
Peak working set size 109.79 MB
Peak swap size 142.34 MB
Peak disk usage 0.05 MB

Stderr output

<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: Intel(R) Corporation
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used

Build features: SETI8	Non-graphics	OpenCL	USE_OPENCL_NV	OCL_ZERO_COPY	SIGNALS_ON_GPU	OCL_CHIRP3	FFTW	USE_SSE3	x86	
     CPUID: Intel(R) Core(TM) i5-4330M CPU @ 2.80GHz 

     Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX 
Low-performance GPU detected, default period_iterations_num set to 500
For low-performance GPU path use_sleep enabled with 1ms per iteration, high prec timer enabled
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl 
ar=0.013868  NumCfft=117671  NumGauss=0  NumPulse=48420676480  NumTriplet=61395610784
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer

OpenCL version by Raistmer, r3584

Number of OpenCL platforms:				 2


 OpenCL Platform Name:					 Intel(R) OpenCL
Number of devices:				 1
  Max compute units:				 20
  Max work group size:				 512
  Max clock frequency:				 400Mhz
  Max memory allocation:			 229536563
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 262144
  Global memory size:				 918146253
  Constant buffer size:				 65536
  Max number of constant args:			 8
  Local memory type:				 Scratchpad
  Local memory size:				 65536
  Queue properties:				 
    Out-of-Order:				 No
  Name:						 Intel(R) HD Graphics 4600
  Vendor:					 Intel(R) Corporation
  Driver version:				 10.18.14.5057
  Version:					 OpenCL 1.2 
  Extensions:					 cl_intel_accelerator cl_intel_advanced_motion_estimation cl_intel_ctz cl_intel_d3d11_nv12_media_sharing cl_intel_dx9_media_sharing cl_intel_motion_estimation cl_intel_simultaneous_sharing cl_intel_subgroups cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_depth_images cl_khr_dx9_media_sharing cl_khr_gl_depth_images cl_khr_gl_event cl_khr_gl_msaa_sharing cl_khr_gl_sharing cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_spir 


 OpenCL Platform Name:					 NVIDIA CUDA
Number of devices:				 1
  Max compute units:				 2
  Max work group size:				 1024
  Max clock frequency:				 758Mhz
  Max memory allocation:			 268435456
  Cache type:					 Read/Write
  Cache line size:				 128
  Cache size:					 32768
  Global memory size:				 1073741824
  Constant buffer size:				 65536
  Max number of constant args:			 9
  Local memory type:				 Scratchpad
  Local memory size:				 49152
  Queue properties:				 
    Out-of-Order:				 Yes
  Name:						 GeForce GT 730M
  Vendor:					 NVIDIA Corporation
  Driver version:				 391.25
  Version:					 OpenCL 1.2 CUDA
  Extensions:					 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer


Work Unit Info:
...............
Credit multiplier is :  2.85
WU true angle range is :  0.013868
Used GPU device parameters are:
	Number of compute units: 2
	Single buffer allocation size: 128MB
	Total device global memory: 1024MB
	max WG size: 1024
	local mem type: Real
	FERMI path used: yes
	LotOfMem path: yes
	LowPerformanceGPU path: yes
	HighPerformanceGPU path: no
period_iterations_num=500
Pulse: peak=1.707438, time=45.84, period=2.315, d_freq=7689850172.49, score=1.039, chirp=-6.3591, fft_len=512 
D:	threshold 0.08424776; unscaled peak power: 0.08628378 exceeds threshold for 2.417%
Pulse: peak=1.977554, time=45.86, period=3.361, d_freq=7689846122.61, score=1.005, chirp=-8.6413, fft_len=1024 
D:	threshold 0.1870431; unscaled peak power: 0.1876311 exceeds threshold for 0.3144%
Triplet: peak=12.85192, time=61.57, period=8.084, d_freq=7689841422.99, chirp=12.166, fft_len=64 
Triplet: peak=10.41123, time=8.098, period=0.1678, d_freq=7689850445.89, chirp=-13.273, fft_len=256 
Triplet: peak=10.3383, time=8.098, period=0.1678, d_freq=7689850445.82, chirp=-18.802, fft_len=256 
Triplet: peak=10.08903, time=8.098, period=0.1678, d_freq=7689850443.58, chirp=-19.079, fft_len=256 
Triplet: peak=10.78621, time=27.82, period=19.3, d_freq=7689849840.97, chirp=-22.396, fft_len=256 
Triplet: peak=10.16757, time=8.098, period=0.1678, d_freq=7689850445.74, chirp=-24.332, fft_len=256 
Triplet: peak=10.23314, time=8.098, period=0.1678, d_freq=7689850445.66, chirp=-29.862, fft_len=256 
Pulse: peak=4.072783, time=45.82, period=7.915, d_freq=7689848097.2, score=1.021, chirp=-40.923, fft_len=128 
D:	threshold 0.03882484; unscaled peak power: 0.03949075 exceeds threshold for 1.715%
Pulse: peak=6.21973, time=45.99, period=15.3, d_freq=7689842422.09, score=1.013, chirp=51.861, fft_len=4k
D:	threshold 1.575421; unscaled peak power: 1.592525 exceeds threshold for 1.086%
Pulse: peak=2.692513, time=45.86, period=5.108, d_freq=7689841493.55, score=1.023, chirp=79.288, fft_len=1024 
D:	threshold 0.2299831; unscaled peak power: 0.2338556 exceeds threshold for 1.684%
Pulse: peak=1.316106, time=45.82, period=1.436, d_freq=7689848180.1, score=1.012, chirp=-91.799, fft_len=128 
D:	threshold 0.01813617; unscaled peak power: 0.01825735 exceeds threshold for 0.6682%
Pulse: peak=5.349666, time=45.99, period=13.96, d_freq=7689845535.54, score=1.005, chirp=92.888, fft_len=4k
D:	threshold 1.675159; unscaled peak power: 1.682749 exceeds threshold for 0.4531%
Triplet: peak=12.19243, time=46.5, period=20.24, d_freq=7689841041.84, chirp=97.33, fft_len=256 
Pulse: peak=3.269602, time=45.86, period=7.36, d_freq=7689841547.95, score=1.002, chirp=98.021, fft_len=1024 
D:	threshold 0.2624692; unscaled peak power: 0.2628635 exceeds threshold for 0.1502%
Pulse: peak=5.62705, time=45.9, period=12.75, d_freq=7689846762.96, score=1.035, chirp=-98.401, fft_len=2k
D:	threshold 0.7584128; unscaled peak power: 0.7809987 exceeds threshold for 2.978%

Best spike: peak=23.83296, time=5.727, d_freq=7689847817.86, chirp=6.0913, fft_len=128k
Best autocorr: peak=17.67622, time=62.99, delay=4.3353, d_freq=7689844628.62, chirp=-22.379, fft_len=128k
Best gaussian: peak=0, mean=0, ChiSq=0, time=-2.124e+011, d_freq=0,
	score=-12, null_hyp=0, chirp=0, fft_len=0 
Best pulse: peak=1.707438, time=45.84, period=2.315, d_freq=7689850172.49, score=1.039, chirp=-6.3591, fft_len=512 
Best triplet: peak=12.85192, time=61.57, period=8.084, d_freq=7689841422.99, chirp=12.166, fft_len=64 
Spike count:    0
Autocorr count: 0
Pulse count:    9
Triplet count:  8
Gaussian count: 0
Wallclock time elapsed since last restart: 8456.4 seconds
Fftlength=32,pass=3:Tune: sum=237550(ms); min=7.555(ms); max=87.16(ms); mean=59.25(ms); s_mean=42.31; sleep=45(ms); delta=128; N=4009; usual
Fftlength=32,pass=4:Tune: sum=168655(ms); min=8.812(ms); max=80.54(ms); mean=58.62(ms); s_mean=54.65; sleep=45(ms); delta=159; N=2877; usual
Fftlength=32,pass=5:Tune: sum=150909(ms); min=7.772(ms); max=73.58(ms); mean=59.88(ms); s_mean=55.99; sleep=45(ms); delta=163; N=2520; usual
Fftlength=64,pass=3:Tune: sum=270110(ms); min=5.714(ms); max=78.34(ms); mean=59.14(ms); s_mean= 47; sleep=45(ms); delta=112; N=4567; usual
Fftlength=64,pass=4:Tune: sum=190834(ms); min=4.458(ms); max=76.11(ms); mean=59.14(ms); s_mean=42.23; sleep=45(ms); delta=144; N=3227; usual
Fftlength=64,pass=5:Tune: sum=141842(ms); min=4.963(ms); max=68.7(ms); mean=58.37(ms); s_mean=56.08; sleep=45(ms); delta=169; N=2430; usual
Fftlength=128,pass=3:Tune: sum=272642(ms); min=2.618(ms); max=76.35(ms); mean=61.73(ms); s_mean=56.57; sleep=45(ms); delta=115; N=4417; usual
Fftlength=128,pass=4:Tune: sum=187176(ms); min=2.776(ms); max=66.97(ms); mean=56.65(ms); s_mean=54.65; sleep=45(ms); delta=137; N=3304; usual
Fftlength=128,pass=5:Tune: sum=140099(ms); min=2.123(ms); max=73.67(ms); mean=61.91(ms); s_mean=62.85; sleep=60(ms); delta=182; N=2263; usual
Fftlength=256,pass=3:Tune: sum=260273(ms); min=2.562(ms); max=72.41(ms); mean=59.21(ms); s_mean=56.68; sleep=45(ms); delta=115; N=4396; usual
Fftlength=256,pass=4:Tune: sum=187734(ms); min=1.527(ms); max=73.8(ms); mean=63.08(ms); s_mean=63.22; sleep=60(ms); delta=155; N=2976; usual
Fftlength=256,pass=5:Tune: sum=146370(ms); min=1.155(ms); max=74.74(ms); mean=57.92(ms); s_mean=58.63; sleep=60(ms); delta=183; N=2527; usual
Fftlength=512,pass=3:Tune: sum=278683(ms); min=0.9462(ms); max=74.04(ms); mean=63.29(ms); s_mean=62.29; sleep=60(ms); delta=116; N=4403; usual
Fftlength=512,pass=4:Tune: sum=213018(ms); min=0.7158(ms); max=80.69(ms); mean=57.84(ms); s_mean=57.18; sleep=60(ms); delta=155; N=3683; usual
Fftlength=512,pass=5:Tune: sum=163175(ms); min=0.7537(ms); max=63.92(ms); mean=54.94(ms); s_mean=55.38; sleep=45(ms); delta=145; N=2970; usual
Fftlength=1024,pass=3:Tune: sum=314666(ms); min=0.5468(ms); max=112.3(ms); mean=61.27(ms); s_mean=48.61; sleep=45(ms); delta=169; N=5136; usual
Fftlength=1024,pass=4:Tune: sum=245750(ms); min=0.4792(ms); max=87.51(ms); mean=58.46(ms); s_mean=47.71; sleep=45(ms); delta=152; N=4204; usual
Fftlength=1024,pass=5:Tune: sum=195885(ms); min=0.4414(ms); max=71.52(ms); mean=58.4(ms); s_mean=64.94; sleep=60(ms); delta=139; N=3354; usual
Fftlength=2048,pass=3:Tune: sum=263180(ms); min=45.05(ms); max=46.04(ms); mean=45.48(ms); s_mean=45.41; sleep=45(ms); delta=1; N=5787; usual
Fftlength=2048,pass=4:Tune: sum=208446(ms); min=35.23(ms); max=36.32(ms); mean=36.03(ms); s_mean=36.04; sleep=30(ms); delta=1; N=5786; usual
Fftlength=2048,pass=5:Tune: sum=166754(ms); min=28.78(ms); max=29.67(ms); mean=28.82(ms); s_mean=28.82; sleep=30(ms); delta=1; N=5786; usual
Fftlength=4096,pass=3:Tune: sum=252084(ms); min=21.73(ms); max=22.22(ms); mean=21.78(ms); s_mean=21.78; sleep=15(ms); delta=1; N=11573; usual
Fftlength=4096,pass=4:Tune: sum=201702(ms); min=17.31(ms); max=17.97(ms); mean=17.43(ms); s_mean=17.36; sleep=15(ms); delta=1; N=11572; usual
Fftlength=4096,pass=5:Tune: sum=160367(ms); min=13.83(ms); max=14.09(ms); mean=13.86(ms); s_mean=13.85; sleep=15(ms); delta=1; N=11572; usual
Fftlength=8192,pass=3:Tune: sum=267368(ms); min=10.44(ms); max=12.89(ms); mean=11.55(ms); s_mean=11.95; sleep=0(ms); delta=1; N=23147; usual

class Gaussian_transfer_not_needed:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_transfer_needed:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_skip1_no_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip2_bad_group_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip3_too_weak_peak:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip4_too_big_ChiSq:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_skip6_low_power:		total=0,	N=0,	<>=0,	min=0	max=0


class Gaussian_new_best:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_report:		total=0,	N=0,	<>=0,	min=0	max=0
class Gaussian_miss:		total=0,	N=0,	<>=0,	min=0	max=0


class PC_triplet_find_hit:		total=57551,	N=57551,	<>=1,	min=1	max=1
class PC_triplet_find_miss:		total=1031,	N=1031,	<>=1,	min=1	max=1


class PC_pulse_find_hit:		total=46190,	N=46190,	<>=1,	min=1	max=1
class PC_pulse_find_miss:		total=13,	N=13,	<>=1,	min=1	max=1
class PC_pulse_find_early_miss:		total=10,	N=10,	<>=1,	min=1	max=1
class PC_pulse_find_2CPU:		total=0,	N=0,	<>=0,	min=0	max=0


class PoT_transfer_not_needed:		total=57541,	N=57541,	<>=1,	min=1	max=1
class PoT_transfer_needed:		total=1042,	N=1042,	<>=1,	min=1	max=1

class SleepQuantum:		total=10143.793,	N=9770,	<>=1.0382593,	min=0.75994802	max=34.988533

GPU device sync requested...  ...GPU device synched
22:34:07 (5992): called boinc_finish(0)

</stderr_txt>
]]>



 
©2020 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.