| Name | blc43_2bit_guppi_58838_26948_TIC67772767_0103.9569.818.20.29.74.vlar_3 |
| Workunit | 3929375572 |
| Created | 22 Apr 2020, 13:07:10 UTC |
| Sent | 22 Apr 2020, 13:07:28 UTC |
| Report deadline | 14 Jun 2020, 18:07:10 UTC |
| Received | 22 Apr 2020, 23:38:53 UTC |
| Server state | Over |
| Outcome | Success |
| Client state | Done |
| Exit status | 0 (0x00000000) |
| Computer ID | 7845794 |
| Run time | 10 min 59 sec |
| CPU time | 10 min 42 sec |
| Validate state | Valid |
| Credit | 107.54 |
| Device peak FLOPS | 589.08 GFLOPS |
| Application version | SETI@home v8 v8.22 (opencl_nvidia_SoG) windows_intelx86 |
| Peak working set size | 99.82 MB |
| Peak swap size | 123.98 MB |
| Peak disk usage | 0.04 MB |
<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used
Build features: SETI8 Non-graphics OpenCL USE_OPENCL_NV OCL_ZERO_COPY SIGNALS_ON_GPU OCL_CHIRP3 FFTW USE_SSE3 x86
CPUID: Intel(R) Core(TM) i7-6700K CPU @ 4.00GHz
Cache: L1=64K L2=256K
CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl
ar=0.010912 NumCfft=105187 NumGauss=0 NumPulse=35356278656 NumTriplet=48313911456
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768
Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale
SETI8 update by Raistmer
OpenCL version by Raistmer, r3584
Number of OpenCL platforms: 1
OpenCL Platform Name: NVIDIA CUDA
Number of devices: 1
Max compute units: 13
Max work group size: 1024
Max clock frequency: 1177Mhz
Max memory allocation: 1073741824
Cache type: Read/Write
Cache line size: 128
Cache size: 638976
Global memory size: 4294967296
Constant buffer size: 65536
Max number of constant args: 9
Local memory type: Scratchpad
Local memory size: 49152
Queue properties:
Out-of-Order: Yes
Name: GeForce GTX 970
Vendor: NVIDIA Corporation
Driver version: 445.87
Version: OpenCL 1.2 CUDA
Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer cl_khr_int64_base_atomics cl_khr_int64_extended_atomics
Work Unit Info:
...............
Credit multiplier is : 2.85
WU true angle range is : 0.010912
Used GPU device parameters are:
Number of compute units: 13
Single buffer allocation size: 128MB
Total device global memory: 4096MB
max WG size: 1024
local mem type: Real
FERMI path used: yes
LotOfMem path: yes
LowPerformanceGPU path: no
HighPerformanceGPU path: no
period_iterations_num=50
Autocorr: peak=19.05127, time=28.63, delay=3.6159, d_freq=10533073414.2, chirp=-0.37063, fft_len=128k
Spike: peak=24.65141, time=20.04, d_freq=10533072315.3, chirp=-1.5739, fft_len=64k
Spike: peak=25.14312, time=20.04, d_freq=10533072315.3, chirp=-1.5993, fft_len=64k
Spike: peak=25.74439, time=20.04, d_freq=10533072315.3, chirp=-1.6094, fft_len=64k
Spike: peak=24.68091, time=20.04, d_freq=10533072315.3, chirp=-1.6348, fft_len=64k
Triplet: peak=10.79704, time=55.03, period=23.83, d_freq=10533072319.6, chirp=-8.7111, fft_len=256
Pulse: peak=3.511743, time=45.82, period=6.585, d_freq=10533072961.9, score=1.012, chirp=11.363, fft_len=128
D: threshold 0.03531863; unscaled peak power: 0.03565049 exceeds threshold for 0.9396%
Triplet: peak=11.3324, time=46.34, period=25.93, d_freq=10533077719, chirp=-33.708, fft_len=256
Pulse: peak=1.653201, time=45.86, period=2.781, d_freq=10533077679.5, score=1.021, chirp=-38.821, fft_len=1024
D: threshold 0.153321; unscaled peak power: 0.1553124 exceeds threshold for 1.299%
Triplet: peak=11.9254, time=19.57, period=11.23, d_freq=10533076567.7, chirp=-63.249, fft_len=512
Pulse: peak=10.15059, time=45.86, period=26.08, d_freq=10533077940, score=1.023, chirp=-76.032, fft_len=1024
D: threshold 0.6754475; unscaled peak power: 0.6898508 exceeds threshold for 2.132%
GPU device sync requested... ...GPU device synched
Termination request detected or computations are finished. GPU device synched, exiting...
Running on device number: 0
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 0
Info: BOINC provided OpenCL device ID used
Build features: SETI8 Non-graphics OpenCL USE_OPENCL_NV OCL_ZERO_COPY SIGNALS_ON_GPU OCL_CHIRP3 FFTW USE_SSE3 x86
CPUID: Intel(R) Core(TM) i7-6700K CPU @ 4.00GHz
Cache: L1=64K L2=256K
CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl
ar=0.010912 NumCfft=105187 NumGauss=0 NumPulse=35356278656 NumTriplet=48313911456
Currently allocated 201 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768
Restarted at 90.40 percent.
Used GPU device parameters are:
Number of compute units: 13
Single buffer allocation size: 128MB
Total device global memory: 4096MB
max WG size: 1024
local mem type: Real
FERMI path used: yes
LotOfMem path: yes
LowPerformanceGPU path: no
HighPerformanceGPU path: no
period_iterations_num=50
Best spike: peak=25.74439, time=20.04, d_freq=10533072315.3, chirp=-1.6094, fft_len=64k
Best autocorr: peak=19.05127, time=28.63, delay=3.6159, d_freq=10533073414.2, chirp=-0.37063, fft_len=128k
Best gaussian: peak=0, mean=0, ChiSq=0, time=-2.124e+011, d_freq=0,
score=-12, null_hyp=0, chirp=0, fft_len=0
Best pulse: peak=10.15059, time=45.86, period=26.08, d_freq=10533077940, score=1.023, chirp=-76.032, fft_len=1024
Best triplet: peak=11.9254, time=19.57, period=11.23, d_freq=10533076567.7, chirp=-63.249, fft_len=512
Spike count: 4
Autocorr count: 1
Pulse count: 3
Triplet count: 3
Gaussian count: 0
Wallclock time elapsed since last restart: 107.4 seconds
Fftlength=32,pass=3:Tune: sum=14152(ms); min=6.42(ms); max=69.28(ms); mean=42.24(ms); s_mean=50.98; sleep=45(ms); delta=372; N=335; usual
Fftlength=32,pass=4:Tune: sum=5667.67(ms); min=4.811(ms); max=28.06(ms); mean=18.17(ms); s_mean=19.29; sleep=15(ms); delta=415; N=312; usual
Fftlength=32,pass=5:Tune: sum=3709.46(ms); min=4.969(ms); max=21.11(ms); mean=14.49(ms); s_mean=18.1; sleep=15(ms); delta=411; N=256; usual
Fftlength=64,pass=3:Tune: sum=5465.83(ms); min=3.339(ms); max=30.1(ms); mean=15.89(ms); s_mean=22.4; sleep=15(ms); delta=421; N=344; usual
Fftlength=64,pass=4:Tune: sum=3171.52(ms); min=2.587(ms); max=17.04(ms); mean=10.36(ms); s_mean=11.5; sleep=0(ms); delta=409; N=306; usual
Fftlength=64,pass=5:Tune: sum=2629.64(ms); min=2.488(ms); max=14.89(ms); mean=9.292(ms); s_mean=12.55; sleep=15(ms); delta=386; N=283; usual
Fftlength=128,pass=3:Tune: sum=4251.51(ms); min=1.586(ms); max=24.55(ms); mean=11.52(ms); s_mean=16.04; sleep=15(ms); delta=420; N=369; usual
Fftlength=128,pass=4:Tune: sum=2972.26(ms); min=1.293(ms); max=16.97(ms); mean=8.468(ms); s_mean=10.54; sleep=0(ms); delta=402; N=351; usual
Fftlength=128,pass=5:Tune: sum=2389.21(ms); min=1.222(ms); max=14.74(ms); mean=7.306(ms); s_mean=12.97; sleep=15(ms); delta=378; N=327; usual
Fftlength=256,pass=3:Tune: sum=4085.58(ms); min=0.8225(ms); max=22.95(ms); mean=10.09(ms); s_mean=16.47; sleep=15(ms); delta=430; N=405; usual
Fftlength=256,pass=4:Tune: sum=2854.3(ms); min=0.6393(ms); max=16.58(ms); mean=7.395(ms); s_mean=11.38; sleep=0(ms); delta=411; N=386; usual
Fftlength=256,pass=5:Tune: sum=2306.02(ms); min=0.6288(ms); max=14.75(ms); mean=6.283(ms); s_mean=9.175; sleep=0(ms); delta=392; N=367; usual
Fftlength=512,pass=3:Tune: sum=6568.61(ms); min=0.3804(ms); max=40.79(ms); mean=14.93(ms); s_mean=40.22; sleep=30(ms); delta=452; N=440; high_perf
Fftlength=512,pass=4:Tune: sum=1476.71(ms); min=0.3083(ms); max=12.46(ms); mean=4.81(ms); s_mean=11.95; sleep=0(ms); delta=431; N=307; usual
Fftlength=512,pass=5:Tune: sum=1202.65(ms); min=0.329(ms); max=10.29(ms); mean=4.235(ms); s_mean=9.916; sleep=0(ms); delta=408; N=284; usual
Fftlength=1024,pass=3:Tune: sum=6551.57(ms); min=0.198(ms); max=15.8(ms); mean=11.68(ms); s_mean=15.35; sleep=15(ms); delta=573; N=561; high_perf
Fftlength=1024,pass=4:Tune: sum=278.129(ms); min=0.1633(ms); max=4.782(ms); mean=1.892(ms); s_mean=4.117; sleep=0(ms); delta=563; N=147; usual
Fftlength=1024,pass=5:Tune: sum=217.408(ms); min=0.1692(ms); max=3.721(ms); mean=1.61(ms); s_mean=3.577; sleep=0(ms); delta=551; N=135; usual
Fftlength=2048,pass=3:Tune: sum=7643.72(ms); min=3.746(ms); max=8.558(ms); mean=8.263(ms); s_mean=8.275; sleep=0(ms); delta=1; N=925; high_perf
Fftlength=4096,pass=3:Tune: sum=7872.67(ms); min=1.881(ms); max=4.452(ms); mean=4.258(ms); s_mean=4.249; sleep=0(ms); delta=1; N=1849; high_perf
Fftlength=8192,pass=3:Tune: sum=6193.77(ms); min=1.653(ms); max=1.896(ms); mean=1.674(ms); s_mean=1.674; sleep=0(ms); delta=1; N=3700; usual
class Gaussian_transfer_not_needed: total=0, N=0, <>=0, min=0 max=0
class Gaussian_transfer_needed: total=0, N=0, <>=0, min=0 max=0
class Gaussian_skip1_no_peak: total=0, N=0, <>=0, min=0 max=0
class Gaussian_skip2_bad_group_peak: total=0, N=0, <>=0, min=0 max=0
class Gaussian_skip3_too_weak_peak: total=0, N=0, <>=0, min=0 max=0
class Gaussian_skip4_too_big_ChiSq: total=0, N=0, <>=0, min=0 max=0
class Gaussian_skip6_low_power: total=0, N=0, <>=0, min=0 max=0
class Gaussian_new_best: total=0, N=0, <>=0, min=0 max=0
class Gaussian_report: total=0, N=0, <>=0, min=0 max=0
class Gaussian_miss: total=0, N=0, <>=0, min=0 max=0
class PC_triplet_find_hit: total=9896, N=9896, <>=1, min=1 max=1
class PC_triplet_find_miss: total=201, N=201, <>=1, min=1 max=1
class PC_pulse_find_hit: total=7389, N=7389, <>=1, min=1 max=1
class PC_pulse_find_miss: total=0, N=0, <>=0, min=0 max=0
class PC_pulse_find_early_miss: total=0, N=0, <>=0, min=0 max=0
class PC_pulse_find_2CPU: total=0, N=0, <>=0, min=0 max=0
class PoT_transfer_not_needed: total=9896, N=9896, <>=1, min=1 max=1
class PoT_transfer_needed: total=201, N=201, <>=1, min=1 max=1
class SleepQuantum: total=0, N=0, <>=0, min=0 max=0
GPU device sync requested... ...GPU device synched
19:27:38 (14260): called boinc_finish(0)
</stderr_txt>
]]>
©2020 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.