| Field | Value |
| --- | --- |
| Name | 26fe20ab.27798.3748.15.42.98_1 |
| Workunit | 3906692641 |
| Created | 27 Feb 2020, 23:35:19 UTC |
| Sent | 27 Feb 2020, 23:35:29 UTC |
| Report deadline | 20 Apr 2020, 19:21:58 UTC |
| Received | 28 Feb 2020, 13:38:58 UTC |
| Server state | Over |
| Outcome | Success |
| Client state | Done |
| Exit status | 0 (0x00000000) |
| Computer ID | 8146712 |
| Run time | 5 min 5 sec |
| CPU time | 4 min 55 sec |
| Validate state | Valid |
| Credit | 103.77 |
| Device peak FLOPS | 970.86 GFLOPS |
| Application version | SETI@home v8 v8.22 (opencl_nvidia_SoG) windows_intelx86 |
| Peak working set size | 119.54 MB |
| Peak swap size | 143.26 MB |
| Peak disk usage | 0.03 MB |
<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
Maximum single buffer size set to:1024MB
High-performance path selected. If GUI lags occur consider to remove -high_perf option from tuning line
Target kernel sequence time set to 500ms
SpikeFind FFT size threshold override set to:4096
TUNE: kernel 1 now has workgroup size of (64,1,4)
oclFFT global radix override set to:256
oclFFT local radix override set to:16
oclFFT max WG size override set to:256
oclFFT max local FFT size override set to:512
oclFFT number of local memory banks set to:64
oclFFT minimal memory coalesce width set to:64
Running on device number: 1
Priority of worker thread raised successfully
Priority of process adjusted successfully, below normal priority class used
OpenCL platform detected: NVIDIA Corporation
BOINC assigns device 1
Info: BOINC provided OpenCL device ID used
Build features: SETI8 Non-graphics OpenCL USE_OPENCL_NV OCL_ZERO_COPY SIGNALS_ON_GPU OCL_CHIRP3 FFTW USE_SSE3 x86
CPUID: AMD FX(tm)-6300 Six-Core Processor
Cache: L1=64K L2=2048K
CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX SSE4A XOP FMA4
OpenCL-kernels filename : MultiBeam_Kernels_r3584.cl
ar=0.436995 NumCfft=193619 NumGauss=1080919604 NumPulse=226459474911 NumTriplet=452849798865
Currently allocated 1125 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768
Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win32 Build 3584 , Ported by : Raistmer, JDWhale
SETI8 update by Raistmer
OpenCL version by Raistmer, r3584
Number of OpenCL platforms: 1
OpenCL Platform Name: NVIDIA CUDA
Number of devices: 3
Max compute units: 15
Max work group size: 1024
Max clock frequency: 1683Mhz
Max memory allocation: 2147483648
Cache type: Read/Write
Cache line size: 128
Cache size: 245760
Global memory size: 8589934592
Constant buffer size: 65536
Max number of constant args: 9
Local memory type: Scratchpad
Local memory size: 49152
Queue properties:
Out-of-Order: Yes
Name: GeForce GTX 1070
Vendor: NVIDIA Corporation
Driver version: 432.00
Version: OpenCL 1.2 CUDA
Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer
Max compute units: 15
Max work group size: 1024
Max clock frequency: 1683Mhz
Max memory allocation: 2147483648
Cache type: Read/Write
Cache line size: 128
Cache size: 245760
Global memory size: 8589934592
Constant buffer size: 65536
Max number of constant args: 9
Local memory type: Scratchpad
Local memory size: 49152
Queue properties:
Out-of-Order: Yes
Name: GeForce GTX 1070
Vendor: NVIDIA Corporation
Driver version: 432.00
Version: OpenCL 1.2 CUDA
Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer
Max compute units: 15
Max work group size: 1024
Max clock frequency: 1683Mhz
Max memory allocation: 2147483648
Cache type: Read/Write
Cache line size: 128
Cache size: 245760
Global memory size: 8589934592
Constant buffer size: 65536
Max number of constant args: 9
Local memory type: Scratchpad
Local memory size: 49152
Queue properties:
Out-of-Order: Yes
Name: GeForce GTX 1070
Vendor: NVIDIA Corporation
Driver version: 432.00
Version: OpenCL 1.2 CUDA
Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_copy_opts cl_nv_create_buffer
Work Unit Info:
...............
Credit multiplier is : 2.85
WU true angle range is : 0.436995
Used GPU device parameters are:
Number of compute units: 15
Single buffer allocation size: 1024MB
Total device global memory: 8192MB
max WG size: 1024
local mem type: Real
FERMI path used: yes
LotOfMem path: yes
LowPerformanceGPU path: no
HighPerformanceGPU path: yes
period_iterations_num=50
Spike: peak=24.36699, time=33.55, d_freq=1420954613.08, chirp=-2.9955, fft_len=128k
Triplet: peak=9.888998, time=3.742, period=0.6717, d_freq=1420961609.59, chirp=81.739, fft_len=64
Triplet: peak=9.926474, time=3.742, period=0.6717, d_freq=1420961613.46, chirp=82.774, fft_len=64
Triplet: peak=9.929989, time=3.742, period=0.6717, d_freq=1420961617.33, chirp=83.808, fft_len=64
Triplet: peak=9.899129, time=3.742, period=0.6717, d_freq=1420961621.2, chirp=84.843, fft_len=64
Triplet: peak=9.834086, time=3.742, period=0.6717, d_freq=1420961625.07, chirp=85.878, fft_len=64
Triplet: peak=9.735485, time=3.742, period=0.6717, d_freq=1420961628.95, chirp=86.913, fft_len=64
Triplet: peak=9.60479, time=3.742, period=0.6717, d_freq=1420961632.82, chirp=87.947, fft_len=64
Best spike: peak=24.36699, time=33.55, d_freq=1420954613.08, chirp=-2.9955, fft_len=128k
Best autocorr: peak=17.47644, time=33.55, delay=5.1493, d_freq=1420957554.32, chirp=15.589, fft_len=128k
Best gaussian: peak=3.746052, mean=0.5529013, ChiSq=1.418348, time=99.82, d_freq=1420952820.82,
score=-0.1721001, null_hyp=2.245479, chirp=4.3533, fft_len=16k
Best pulse: peak=1.605275, time=12.27, period=0.3665, d_freq=1420953978.6, score=0.9942, chirp=49.664, fft_len=64
Best triplet: peak=9.929989, time=3.742, period=0.6717, d_freq=1420961617.33, chirp=83.808, fft_len=64
Spike count: 1
Autocorr count: 0
Pulse count: 0
Triplet count: 7
Gaussian count: 0
Wallclock time elapsed since last restart: 299.0 seconds
Fftlength=8,pass=3:Tune: sum=5185.26(ms); min=19.8(ms); max=186.4(ms); mean=99.72(ms); s_mean=129.4; sleep=120(ms); delta=1056; N=52; usual
Fftlength=8,pass=4:Tune: sum=3414.35(ms); min=21.41(ms); max=125.6(ms); mean=69.68(ms); s_mean=84.25; sleep=75(ms); delta=1008; N=49; usual
Fftlength=8,pass=5:Tune: sum=2401.17(ms); min=6.922(ms); max=93.53(ms); mean=51.09(ms); s_mean=56.58; sleep=45(ms); delta=976; N=47; usual
Fftlength=16,pass=3:Tune: sum=2898.27(ms); min=1.858(ms); max=71.02(ms); mean=35.78(ms); s_mean=55.13; sleep=45(ms); delta=760; N=81; usual
Fftlength=16,pass=4:Tune: sum=1946.93(ms); min=2.754(ms); max=50.33(ms); mean=25.96(ms); s_mean=37.78; sleep=30(ms); delta=712; N=75; usual
Fftlength=16,pass=5:Tune: sum=1409.4(ms); min=2.14(ms); max=39.26(ms); mean=19.85(ms); s_mean=27.29; sleep=30(ms); delta=680; N=71; usual
Fftlength=32,pass=3:Tune: sum=1842.94(ms); min=0.9564(ms); max=33.86(ms); mean=14.18(ms); s_mean=16.37; sleep=15(ms); delta=576; N=130; usual
Fftlength=32,pass=4:Tune: sum=1287.67(ms); min=0.6912(ms); max=25.61(ms); mean=10.3(ms); s_mean=11.37; sleep=0(ms); delta=556; N=125; usual
Fftlength=32,pass=5:Tune: sum=915.267(ms); min=0.5356(ms); max=19.14(ms); mean=7.89(ms); s_mean=16.48; sleep=15(ms); delta=520; N=116; usual
Fftlength=64,pass=3:Tune: sum=2682.57(ms); min=0.3451(ms); max=41.42(ms); mean=14.58(ms); s_mean=41.02; sleep=30(ms); delta=396; N=184; high_perf
Fftlength=64,pass=4:Tune: sum=659.291(ms); min=0.2437(ms); max=13.04(ms); mean=5.071(ms); s_mean=10.04; sleep=0(ms); delta=374; N=130; usual
Fftlength=64,pass=5:Tune: sum=479.136(ms); min=0.2775(ms); max=9.978(ms); mean=4.026(ms); s_mean=9.223; sleep=0(ms); delta=352; N=119; usual
Fftlength=128,pass=3:Tune: sum=4386.23(ms); min=22.24(ms); max=23.22(ms); mean=22.73(ms); s_mean=22.9; sleep=15(ms); delta=1; N=193; usual
Fftlength=256,pass=3:Tune: sum=4068.75(ms); min=10.2(ms); max=10.8(ms); mean=10.51(ms); s_mean=10.47; sleep=0(ms); delta=1; N=387; usual
Fftlength=512,pass=3:Tune: sum=3969.96(ms); min=4.989(ms); max=5.34(ms); mean=5.136(ms); s_mean=5.154; sleep=0(ms); delta=1; N=773; usual
Fftlength=1024,pass=3:Tune: sum=1749.5(ms); min=1.007(ms); max=1.187(ms); mean=1.131(ms); s_mean=1.13; sleep=0(ms); delta=1; N=1547; usual
Fftlength=2048,pass=3:Tune: sum=1234.38(ms); min=0.3594(ms); max=0.4127(ms); mean=0.3991(ms); s_mean=0.4002; sleep=0(ms); delta=1; N=3093; usual
class Gaussian_transfer_not_needed: total=0, N=0, <>=0, min=0 max=0
class Gaussian_transfer_needed: total=0, N=0, <>=0, min=0 max=0
class Gaussian_skip1_no_peak: total=0, N=0, <>=0, min=0 max=0
class Gaussian_skip2_bad_group_peak: total=0, N=0, <>=0, min=0 max=0
class Gaussian_skip3_too_weak_peak: total=0, N=0, <>=0, min=0 max=0
class Gaussian_skip4_too_big_ChiSq: total=0, N=0, <>=0, min=0 max=0
class Gaussian_skip6_low_power: total=0, N=0, <>=0, min=0 max=0
class Gaussian_new_best: total=20, N=20, <>=1, min=1 max=1
class Gaussian_report: total=0, N=0, <>=0, min=0 max=0
class Gaussian_miss: total=0, N=0, <>=0, min=0 max=0
class PC_triplet_find_hit: total=24305, N=24305, <>=1, min=1 max=1
class PC_triplet_find_miss: total=414, N=414, <>=1, min=1 max=1
class PC_pulse_find_hit: total=12358, N=12358, <>=1, min=1 max=1
class PC_pulse_find_miss: total=3, N=3, <>=1, min=1 max=1
class PC_pulse_find_early_miss: total=3, N=3, <>=1, min=1 max=1
class PC_pulse_find_2CPU: total=1, N=1, <>=1, min=1 max=1
class PoT_transfer_not_needed: total=24303, N=24303, <>=1, min=1 max=1
class PoT_transfer_needed: total=417, N=417, <>=1, min=1 max=1
class SleepQuantum: total=0, N=0, <>=0, min=0 max=0
GPU device sync requested... ...GPU device synched
08:38:14 (5480): called boinc_finish(0)
</stderr_txt>
]]>
©2020 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.