Posts by -= Vyper =-


log in
1) Message boards : Number crunching : Monitoring inconclusive GBT validations and harvesting data for testing (Message 1813396)
Posted 1 day ago by Profile -= Vyper =-
however you're wanting to shove unfinished code in for the sakes of performance


Not this time, now i'm more than ever concerned that other applications has precision drifting aswell and this needs to be addressed properly to pinpoint where it all began Deep within those burried lines of code.
2) Message boards : Number crunching : Monitoring inconclusive GBT validations and harvesting data for testing (Message 1813395)
Posted 1 day ago by Profile -= Vyper =-
Let's be clear I'm in full support of Petri's work, however you're wanting to shove unfinished code in for the sakes of performance, when the landscape is turd. The stock Apple situation needs resolution before new technologies are introduced. New technologies are not a solution, but an incremental refinement.


No, not at this Point! I just wanted to post my findings how come the Apple code is not targeted aswell.
It just seems odd and this kind of proved that we must dig further in the original code because in one way or Another some parts of the optimisation tree seems to differ from the Berkeley compiled executable with no optimisations in Place.

I just wanted to Point out that it seems like the precision drifting started way back Before he started to improve it for Nvidia cards and someone needs to dig deeper to see why it has started to not align to original code.

Has someone here a list of applications that you could do a "checklist" on that has a strongly similar of atleast 99.95% or so, the best would ofcourse be 100%.

I didn't know that other applications differed so much more compared to Petris but still considers valid or is that Apple code not distributed by the S@H servers?
Just confused now here.
3) Message boards : Number crunching : GPU FLOPS: Theory vs Reality (Message 1813392)
Posted 1 day ago by Profile -= Vyper =-
Regarding the 750ti this is a snippet of nvidia-smi on my Quad 750Ti host (It's 4 Gigabyte black editions and has been tested from factory a week in a 24/7 Environment)

nvidia-smi
Mon Aug 29 11:25:14 2016
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 367.35 Driver Version: 367.35 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
|===============================+======================+======================|
| 0 GeForce GTX 750 Ti Off | 0000:01:00.0 Off | N/A |
| 39% 57C P0 23W / 46W | 1016MiB / 1998MiB | 100% Default |
+-------------------------------+----------------------+----------------------+
| 1 GeForce GTX 750 Ti Off | 0000:02:00.0 Off | N/A |
| 40% 58C P0 27W / 46W | 1016MiB / 2000MiB | 99% Default |
+-------------------------------+----------------------+----------------------+
| 2 GeForce GTX 750 Ti Off | 0000:04:00.0 Off | N/A |
| 38% 53C P0 25W / 46W | 1016MiB / 2000MiB | 100% Default |
+-------------------------------+----------------------+----------------------+
| 3 GeForce GTX 750 Ti Off | 0000:05:00.0 Off | N/A |
| 37% 51C P0 24W / 46W | 1016MiB / 2000MiB | 98% Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes: GPU Memory |
| GPU PID Type Process name Usage |
|=============================================================================|
| 0 31331 C ...tiathome_x41zc_x86_64-pc-linux-gnu_cuda65 1014MiB |
| 1 31337 C ...tiathome_x41zc_x86_64-pc-linux-gnu_cuda65 1014MiB |
| 2 31322 C ...tiathome_x41zc_x86_64-pc-linux-gnu_cuda65 1014MiB |
| 3 31342 C ...tiathome_x41zc_x86_64-pc-linux-gnu_cuda65 1014MiB |
+-----------------------------------------------------------------------------+
4) Message boards : Number crunching : Monitoring inconclusive GBT validations and harvesting data for testing (Message 1813390)
Posted 1 day ago by Profile -= Vyper =-
I haven't read through this thread completely but i want to post my findings.
Look at these results!
It looks like that Petris code is more accurate than the Apple code in my mind.

Quick compare:

CPU:
Spike: peak=24.09495, time=33.55, d_freq=1420751144.89, chirp=-0.028652, fft_len=128k
Spike: peak=24.42812, time=33.55, d_freq=1420751144.89, chirp=-0.033273, fft_len=128k
Spike: peak=24.31746, time=33.55, d_freq=1420751144.88, chirp=-0.037895, fft_len=128k
Autocorr: peak=19.48975, time=73.82, delay=5.1587, d_freq=1420751429.61, chirp=-7.0919, fft_len=128k
Spike: peak=24.78413, time=68.79, d_freq=1420750198.94, chirp=-95.62, fft_len=32k

APPLE:
Spike: peak=24.09507, time=33.55, d_freq=1420751144.89, chirp=-0.028652, fft_len=128k
Spike: peak=24.4283, time=33.55, d_freq=1420751144.89, chirp=-0.033273, fft_len=128k
Spike: peak=24.31762, time=33.55, d_freq=1420751144.88, chirp=-0.037895, fft_len=128k
Autocorr: peak=19.48845, time=73.82, delay=5.1587, d_freq=1420751429.61, chirp=-7.0919, fft_len=128k
Spike: peak=24.11478, time=68.79, d_freq=1420750198.94, chirp=-95.62, fft_len=32k

PETRI:
Spike: peak=24.09493, time=33.55, d_freq=1420751144.89, chirp=-0.028652, fft_len=128k
Spike: peak=24.42814, time=33.55, d_freq=1420751144.89, chirp=-0.033273, fft_len=128k
Spike: peak=24.31745, time=33.55, d_freq=1420751144.88, chirp=-0.037895, fft_len=128k
Autocorr: peak=19.48978, time=73.82, delay=5.1587, d_freq=1420751429.61, chirp=-7.0919, fft_len=128k
Spike: peak=24.78424, time=68.79, d_freq=1420750198.94, chirp=-95.62, fft_len=32k

This was a WU that has been crunched by cpu, apple and by Petris tweaked code. The problem to me seems like this precision drift started way back or is my assumption wrong?

http://setiathome.berkeley.edu/workunit.php?wuid=2248302050

Complete STDERR below

CPU:

<stderr_txt>

Build features: SETI8 Non-graphics FFTW USE_SSE3 x64
CPUID: Intel(R) Core(TM) i7-4770 CPU @ 3.40GHz

Cache: L1=64K L2=256K

CPU features: FPU TSC PAE CMPXCHG8B APIC SYSENTER MTRR CMOV/CCMP MMX FXSAVE/FXRSTOR SSE SSE2 HT SSE3 SSSE3 FMA3 SSE4.1 SSE4.2 AVX
ar=0.410125 NumCfft=200105 NumGauss=1151864510 NumPulse=226294165221 NumTriplet=452692139687
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768

Windows optimized setiathome_v8 application
Based on Intel, Core 2-optimized v8-nographics V5.13 by Alex Kan
SSE3xj Win64 Build 3330 , Ported by : Raistmer, JDWhale

SETI8 update by Raistmer
Work Unit Info:
...............
Credit multiplier is : 2.85
WU true angle range is : 0.410125
Spike: peak=24.09495, time=33.55, d_freq=1420751144.89, chirp=-0.028652, fft_len=128k
Spike: peak=24.42812, time=33.55, d_freq=1420751144.89, chirp=-0.033273, fft_len=128k
Spike: peak=24.31746, time=33.55, d_freq=1420751144.88, chirp=-0.037895, fft_len=128k
Autocorr: peak=19.48975, time=73.82, delay=5.1587, d_freq=1420751429.61, chirp=-7.0919, fft_len=128k
Spike: peak=24.78413, time=68.79, d_freq=1420750198.94, chirp=-95.62, fft_len=32k

Best spike: peak=24.78413, time=68.79, d_freq=1420750198.94, chirp=-95.62, fft_len=32k
Best autocorr: peak=19.48975, time=73.82, delay=5.1587, d_freq=1420751429.61, chirp=-7.0919, fft_len=128k
Best gaussian: peak=2.818341, mean=0.5207298, ChiSq=1.416708, time=12.58, d_freq=1420752531.85,
score=-0.9713898, null_hyp=2.204585, chirp=10.797, fft_len=16k
Best pulse: peak=3.243532, time=6.544, period=0.9724, d_freq=1420754890.84, score=0.9946, chirp=-87.382, fft_len=64
Best triplet: peak=0, time=-2.122e+011, period=0, d_freq=0, chirp=0, fft_len=0


Flopcounter: 38853887058724.039000

Spike count: 4
Autocorr count: 1
Pulse count: 0
Triplet count: 0
Gaussian count: 0
Wallclock time elapsed since last restart: 8021.9 seconds

10:32:47 (3944): called boinc_finish(0)

</stderr_txt>

APPLE:

<stderr_txt>
OpenCL platform detected: Apple
Number of OpenCL devices found : 1
BOINC assigns slot on device #0.
Info: BOINC provided OpenCL device ID used
DOUBLE_FP supported.
cl_khr_fp64 supported.
cl_APPLE_fp64_basic_ops supported.
FERMI : true

Build features: SETI8 Non-graphics OpenCL USE_OPENCL_NV OCL_ZERO_COPY OCL_CHIRP3 ASYNC_SPIKE FFTW SSE3 64bit
System: Darwin x86_64 Kernel: 15.6.0
CPU : Intel(R) Core(TM) i5-3330S CPU @ 2.70GHz
GenuineIntel x86, Family 6 Model 58 Stepping 9
Features : FPU TSC PAE APIC MTRR MMX SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX1.0

OpenCL-kernels filename : MultiBeam_Kernels_r3321.cl
ar=0.410125 NumCfft=200105 NumGauss=1151864510 NumPulse=226294165221 NumTriplet=452692139687
Currently allocated 209 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768
OS X optimized setiathome_v8 application
Version info: SSE3x (Intel, Core 2-optimized v8-nographics) V5.13 by Alex Kan
SSE3x OS X 64bit Build 3321 , Ported by : Raistmer, JDWhale, Urs Echternacht


OpenCL version by Raistmer, r3321

Number of OpenCL platforms: 1


OpenCL Platform Name: Apple
Number of devices: 1
Max compute units: 2
Max work group size: 1024
Max clock frequency: 745Mhz
Max memory allocation: 134217728
Cache type: None
Cache line size: 0
Cache size: 0
Global memory size: 536870912
Constant buffer size: 65536
Max number of constant args: 9
Local memory type: Scratchpad
Local memory size: 49152
Queue properties:
Out-of-Order: No
Name: GeForce GT 640M
Vendor: NVIDIA
Driver version: 10.10.13 310.42.25f01
Version: OpenCL 1.2
Extensions: cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions cl_APPLE_clut cl_APPLE_query_kernel_names cl_APPLE_gl_sharing cl_khr_gl_event cl_khr_byte_addressable_store cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_APPLE_fp64_basic_ops cl_khr_fp64 cl_khr_3d_image_writes cl_khr_depth_images cl_khr_gl_depth_images cl_khr_gl_msaa_sharing cl_khr_image2d_from_buffer cl_APPLE_ycbcr_422 cl_APPLE_rgb_422


Work Unit Info:
...............
Credit multiplier is : 2.85
WU true angle range is : 0.410125
Used GPU device parameters are:
Number of compute units: 2
Single buffer allocation size: 128MB
Total device global memory: 512MB
max WG size: 1024
local mem type: Real
FERMI path used: yes
LotOfMem path: no
period_iterations_num=50
Spike: peak=24.09507, time=33.55, d_freq=1420751144.89, chirp=-0.028652, fft_len=128k
Spike: peak=24.4283, time=33.55, d_freq=1420751144.89, chirp=-0.033273, fft_len=128k
Spike: peak=24.31762, time=33.55, d_freq=1420751144.88, chirp=-0.037895, fft_len=128k
GPU device sync requested... ...GPU device synched
Termination request detected or computations are finished. GPU device synched, exiting...
OpenCL platform detected: Apple
Number of OpenCL devices found : 1
BOINC assigns slot on device #0.
Info: BOINC provided OpenCL device ID used
DOUBLE_FP supported.
cl_khr_fp64 supported.
cl_APPLE_fp64_basic_ops supported.
FERMI : true

Build features: SETI8 Non-graphics OpenCL USE_OPENCL_NV OCL_ZERO_COPY OCL_CHIRP3 ASYNC_SPIKE FFTW SSE3 64bit
System: Darwin x86_64 Kernel: 15.6.0
CPU : Intel(R) Core(TM) i5-3330S CPU @ 2.70GHz
GenuineIntel x86, Family 6 Model 58 Stepping 9
Features : FPU TSC PAE APIC MTRR MMX SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX1.0

OpenCL-kernels filename : MultiBeam_Kernels_r3321.cl
ar=0.410125 NumCfft=200105 NumGauss=1151864510 NumPulse=226294165221 NumTriplet=452692139687
Currently allocated 209 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768
OS X optimized setiathome_v8 application
Version info: SSE3x (Intel, Core 2-optimized v8-nographics) V5.13 by Alex Kan
SSE3x OS X 64bit Build 3321 , Ported by : Raistmer, JDWhale, Urs Echternacht


OpenCL version by Raistmer, r3321

Number of OpenCL platforms: 1


OpenCL Platform Name: Apple
Number of devices: 1
Max compute units: 2
Max work group size: 1024
Max clock frequency: 745Mhz
Max memory allocation: 134217728
Cache type: None
Cache line size: 0
Cache size: 0
Global memory size: 536870912
Constant buffer size: 65536
Max number of constant args: 9
Local memory type: Scratchpad
Local memory size: 49152
Queue properties:
Out-of-Order: No
Name: GeForce GT 640M
Vendor: NVIDIA
Driver version: 10.10.13 310.42.25f01
Version: OpenCL 1.2
Extensions: cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions cl_APPLE_clut cl_APPLE_query_kernel_names cl_APPLE_gl_sharing cl_khr_gl_event cl_khr_byte_addressable_store cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_APPLE_fp64_basic_ops cl_khr_fp64 cl_khr_3d_image_writes cl_khr_depth_images cl_khr_gl_depth_images cl_khr_gl_msaa_sharing cl_khr_image2d_from_buffer cl_APPLE_ycbcr_422 cl_APPLE_rgb_422


Work Unit Info:
...............
Credit multiplier is : 2.85
WU true angle range is : 0.410125
Used GPU device parameters are:
Number of compute units: 2
Single buffer allocation size: 128MB
Total device global memory: 512MB
max WG size: 1024
local mem type: Real
FERMI path used: yes
LotOfMem path: no
period_iterations_num=50
Spike: peak=24.09507, time=33.55, d_freq=1420751144.89, chirp=-0.028652, fft_len=128k
Spike: peak=24.4283, time=33.55, d_freq=1420751144.89, chirp=-0.033273, fft_len=128k
Spike: peak=24.31762, time=33.55, d_freq=1420751144.88, chirp=-0.037895, fft_len=128k
GPU device sync requested... ...GPU device synched
Termination request detected or computations are finished. GPU device synched, exiting...
OpenCL platform detected: Apple
Number of OpenCL devices found : 1
BOINC assigns slot on device #0.
Info: BOINC provided OpenCL device ID used
DOUBLE_FP supported.
cl_khr_fp64 supported.
cl_APPLE_fp64_basic_ops supported.
FERMI : true

Build features: SETI8 Non-graphics OpenCL USE_OPENCL_NV OCL_ZERO_COPY OCL_CHIRP3 ASYNC_SPIKE FFTW SSE3 64bit
System: Darwin x86_64 Kernel: 15.6.0
CPU : Intel(R) Core(TM) i5-3330S CPU @ 2.70GHz
GenuineIntel x86, Family 6 Model 58 Stepping 9
Features : FPU TSC PAE APIC MTRR MMX SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX1.0

OpenCL-kernels filename : MultiBeam_Kernels_r3321.cl
ar=0.410125 NumCfft=200105 NumGauss=1151864510 NumPulse=226294165221 NumTriplet=452692139687
Currently allocated 209 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768
Restarted at 2.00 percent.
Used GPU device parameters are:
Number of compute units: 2
Single buffer allocation size: 128MB
Total device global memory: 512MB
max WG size: 1024
local mem type: Real
FERMI path used: yes
LotOfMem path: no
period_iterations_num=50
GPU device sync requested... ...GPU device synched
Termination request detected or computations are finished. GPU device synched, exiting...
OpenCL platform detected: Apple
Number of OpenCL devices found : 1
BOINC assigns slot on device #0.
Info: BOINC provided OpenCL device ID used
DOUBLE_FP supported.
cl_khr_fp64 supported.
cl_APPLE_fp64_basic_ops supported.
FERMI : true

Build features: SETI8 Non-graphics OpenCL USE_OPENCL_NV OCL_ZERO_COPY OCL_CHIRP3 ASYNC_SPIKE FFTW SSE3 64bit
System: Darwin x86_64 Kernel: 15.6.0
CPU : Intel(R) Core(TM) i5-3330S CPU @ 2.70GHz
GenuineIntel x86, Family 6 Model 58 Stepping 9
Features : FPU TSC PAE APIC MTRR MMX SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX1.0

OpenCL-kernels filename : MultiBeam_Kernels_r3321.cl
ar=0.410125 NumCfft=200105 NumGauss=1151864510 NumPulse=226294165221 NumTriplet=452692139687
Currently allocated 209 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768
Restarted at 2.00 percent.
Used GPU device parameters are:
Number of compute units: 2
Single buffer allocation size: 128MB
Total device global memory: 512MB
max WG size: 1024
local mem type: Real
FERMI path used: yes
LotOfMem path: no
period_iterations_num=50
Autocorr: peak=19.48845, time=73.82, delay=5.1587, d_freq=1420751429.61, chirp=-7.0919, fft_len=128k
GPU device sync requested... ...GPU device synched
Termination request detected or computations are finished. GPU device synched, exiting...
OpenCL platform detected: Apple
Number of OpenCL devices found : 1
BOINC assigns slot on device #0.
Info: BOINC provided OpenCL device ID used
DOUBLE_FP supported.
cl_khr_fp64 supported.
cl_APPLE_fp64_basic_ops supported.
FERMI : true

Build features: SETI8 Non-graphics OpenCL USE_OPENCL_NV OCL_ZERO_COPY OCL_CHIRP3 ASYNC_SPIKE FFTW SSE3 64bit
System: Darwin x86_64 Kernel: 15.6.0
CPU : Intel(R) Core(TM) i5-3330S CPU @ 2.70GHz
GenuineIntel x86, Family 6 Model 58 Stepping 9
Features : FPU TSC PAE APIC MTRR MMX SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX1.0

OpenCL-kernels filename : MultiBeam_Kernels_r3321.cl
ar=0.410125 NumCfft=200105 NumGauss=1151864510 NumPulse=226294165221 NumTriplet=452692139687
Currently allocated 209 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768
Restarted at 44.13 percent.
Used GPU device parameters are:
Number of compute units: 2
Single buffer allocation size: 128MB
Total device global memory: 512MB
max WG size: 1024
local mem type: Real
FERMI path used: yes
LotOfMem path: no
period_iterations_num=50
GPU device sync requested... ...GPU device synched
Termination request detected or computations are finished. GPU device synched, exiting...
OpenCL platform detected: Apple
Number of OpenCL devices found : 1
BOINC assigns slot on device #0.
Info: BOINC provided OpenCL device ID used
DOUBLE_FP supported.
cl_khr_fp64 supported.
cl_APPLE_fp64_basic_ops supported.
FERMI : true

Build features: SETI8 Non-graphics OpenCL USE_OPENCL_NV OCL_ZERO_COPY OCL_CHIRP3 ASYNC_SPIKE FFTW SSE3 64bit
System: Darwin x86_64 Kernel: 15.6.0
CPU : Intel(R) Core(TM) i5-3330S CPU @ 2.70GHz
GenuineIntel x86, Family 6 Model 58 Stepping 9
Features : FPU TSC PAE APIC MTRR MMX SSE SSE2 HT SSE3 SSSE3 SSE4.1 SSE4.2 AVX1.0

OpenCL-kernels filename : MultiBeam_Kernels_r3321.cl
ar=0.410125 NumCfft=200105 NumGauss=1151864510 NumPulse=226294165221 NumTriplet=452692139687
Currently allocated 209 MB for GPU buffers
In v_BaseLineSmooth: NumDataPoints=1048576, BoxCarLength=8192, NumPointsInChunk=32768
Restarted at 66.08 percent.
Used GPU device parameters are:
Number of compute units: 2
Single buffer allocation size: 128MB
Total device global memory: 512MB
max WG size: 1024
local mem type: Real
FERMI path used: yes
LotOfMem path: no
period_iterations_num=50
Spike: peak=24.11478, time=68.79, d_freq=1420750198.94, chirp=-95.62, fft_len=32k

Best spike: peak=24.4283, time=33.56, d_freq=1420751144.89, chirp=-0.033273, fft_len=128k
Best autocorr: peak=19.48845, time=73.82, delay=5.1587, d_freq=1420751429.61, chirp=-7.0919, fft_len=128k
Best gaussian: peak=2.805941, mean=0.5229927, ChiSq=1.398399, time=12.58, d_freq=1420752531.85,
score=-1.300901, null_hyp=2.174957, chirp=10.797, fft_len=16k
Best pulse: peak=3.245201, time=6.54, period=0.9724, d_freq=1420754890.84, score=0.9951, chirp=-87.382, fft_len=64
Best triplet: peak=0, time=-2.122e+11, period=0, d_freq=0, chirp=0, fft_len=0


Flopcounter: 19269428530118.300781

Spike count: 4
Autocorr count: 1
Pulse count: 0
Triplet count: 0
Gaussian count: 0
Time cpu in use since last restart: 18.1 seconds
GPU device sync requested... ...GPU device synched
16:01:17 (34168): called boinc_finish(0)

</stderr_txt>

PETRI:

<stderr_txt>
setiathome_CUDA: Found 4 CUDA device(s):
Device 1: GeForce GTX 750 Ti, 1998 MiB, regsPerBlock 65536
computeCap 5.0, multiProcs 5
pciBusID = 1, pciSlotID = 0
Device 2: GeForce GTX 750 Ti, 2000 MiB, regsPerBlock 65536
computeCap 5.0, multiProcs 5
pciBusID = 2, pciSlotID = 0
Device 3: GeForce GTX 750 Ti, 2000 MiB, regsPerBlock 65536
computeCap 5.0, multiProcs 5
pciBusID = 4, pciSlotID = 0
Device 4: GeForce GTX 750 Ti, 2000 MiB, regsPerBlock 65536
computeCap 5.0, multiProcs 5
pciBusID = 5, pciSlotID = 0
In cudaAcc_initializeDevice(): Boinc passed DevPref 2
setiathome_CUDA: CUDA Device 2 specified, checking...
Device 2: GeForce GTX 750 Ti is okay
SETI@home using CUDA accelerated device GeForce GTX 750 Ti
Using pfb = 8 from command line args
Using pfp = 240 from command line args
Using unroll = 12 from command line args

setiathome v8 enhanced x41p_zi3d, Cuda 7.50 special
Compiled with NVCC 8.0, using 6.5 libraries. Modifications done by petri33.



Detected setiathome_enhanced_v8 task. Autocorrelations enabled, size 128k elements.
Work Unit Info:
...............
WU true angle range is : 0.410125
Sigma 3
Thread call stack limit is: 1k
Spike: peak=24.09493, time=33.55, d_freq=1420751144.89, chirp=-0.028652, fft_len=128k
Spike: peak=24.42814, time=33.55, d_freq=1420751144.89, chirp=-0.033273, fft_len=128k
Spike: peak=24.31745, time=33.55, d_freq=1420751144.88, chirp=-0.037895, fft_len=128k
Autocorr: peak=19.48978, time=73.82, delay=5.1587, d_freq=1420751429.61, chirp=-7.0919, fft_len=128k
Spike: peak=24.78424, time=68.79, d_freq=1420750198.94, chirp=-95.62, fft_len=32k
cudaAcc_free() called...
cudaAcc_free() running...
cudaAcc_free() PulseFind freed...
cudaAcc_free() Gaussfit freed...
cudaAcc_free() AutoCorrelation freed...
1,2,3,4,5,6,7,8,9,10,10,11,12,cudaAcc_free() DONE.
13
Best spike: peak=24.78424, time=68.79, d_freq=1420750198.94, chirp=-95.62, fft_len=32k
Best autocorr: peak=19.48978, time=73.82, delay=5.1587, d_freq=1420751429.61, chirp=-7.0919, fft_len=128k
Best gaussian: peak=2.818346, mean=0.5207286, ChiSq=1.416716, time=12.58, d_freq=1420752531.85,
score=-0.9712276, null_hyp=2.204598, chirp=10.797, fft_len=16k
Best pulse: peak=3.243528, time=6.544, period=0.9724, d_freq=1420754890.84, score=0.9946, chirp=-87.382, fft_len=64
Best triplet: peak=0, time=-2.122e+11, period=0, d_freq=0, chirp=0, fft_len=0

Flopcounter: 40132345887159.960938

Spike count: 4
Autocorr count: 1
Pulse count: 0
Triplet count: 0
Gaussian count: 0
09:44:58 (30909): called boinc_finish(0)

</stderr_txt>
5) Message boards : Number crunching : Intel HD Graphics 530 - Keeps "Postponing"... (Message 1763793)
Posted 10 Feb 2016 by Profile -= Vyper =-
Well this is not new!

I reported this issue here for the devs but couldnt get any apropriate real guidance so i gave up.

http://lunatics.kwsn.info/index.php/topic,1763.msg59314.html#msg59314

This is to the developer board and i have the exact same issue as you TimeLord04. (Non approved members cannot reach this post though)

In my case i had a Intel 5775C with Iris Pro 6200 iGPU and i had issues with running Astropulse work but not MB7 work.

Tried without any parameters at all and reduced values! Nothing worked so i gave up. Now i've exchanged the 5775c to a 6700K instead so i dont need to tweak the rig as its processor is more mobile in its design that it is a desktop. (And wont be using its GPU until i hear that it seems to work again :) )

Very good IPC though due to the 128MB eDRAM on the chip.

Good luck all!
6) Message boards : Number crunching : Intel 5th gen GPU with SETI v8 issue? (Message 1758995)
Posted 24 Jan 2016 by Profile -= Vyper =-
+1, Thanks, Vyper.
Always good to get experimental confirmation to any theories.


No worries!

Glad to be to assistance one way or Another.
I still got that pesky Astropulse aborting error compared to the V7,V8 apps so i've disabled Astropulse OpenCL for now.
7) Message boards : Number crunching : Intel 5th gen GPU with SETI v8 issue? (Message 1758707)
Posted 23 Jan 2016 by Profile -= Vyper =-
If you want you can adjust so you push the cpu harder. If it's a smalltop, notebook i wouldnt suggest it but if it is a regular computer with big cpu fan etc you can up the limits so you push it harder.

Please try to stay below 80 degrees C to be safe.
8) Message boards : Number crunching : Intel 5th gen GPU with SETI v8 issue? (Message 1758640)
Posted 23 Jan 2016 by Profile -= Vyper =-
Have you tried https://downloadcenter.intel.com/download/24075/Intel-Extreme-Tuning-Utility-Intel-XTU- so it isn't harder on the gpu resulting in TDP max kicking in to slow the cpu/gpu down compared to V7.

I have reported earlier issues with my Broadwell and Iris Pro 6200 and it reaches it ceilings directly with AP and sometimes it aborts it due to errors.
9) Message boards : Cafe SETI : I ain't played for anybody in years............... (Message 1725621)
Posted 14 Sep 2015 by Profile -= Vyper =-
Familiar? :)

10) Message boards : Number crunching : Lunatics Windows Installer v0.43a Maintenance Release (Message 1597592)
Posted 6 Nov 2014 by Profile -= Vyper =-


Choose HD4 version for MB7 please.


Why??

Is there any performance difference between the two versions? I have two 5970's that fly with AP wu's but are slow is h*ll with HD5? Could it be that the pre HD5 version is much faster on ATI's?
11) Message boards : Number crunching : AP V7 (Message 1584549)
Posted 10 Oct 2014 by Profile -= Vyper =-
Nope.. http://setiathome.berkeley.edu/results.php?hostid=7374749&offset=0&show_names=0&state=4&appid=20

AP7 credit is low as hell. Thats not that good really.
12) Message boards : Number crunching : Creadit Scoring. (Message 1583895)
Posted 9 Oct 2014 by Profile -= Vyper =-
And if running an older card, use the nvidia 320 driver. It seems to be the one of the last the runs old hardware well.

Newer drivers break opencl etc.
13) Message boards : Number crunching : CPU power required to drive 4 AMD GPU ? (Message 1583893)
Posted 9 Oct 2014 by Profile -= Vyper =-
The impact is when the workunits requires more cpu to Crunch, if i recall blanking is done by cpu.
And ofcourse the initials of every wu is made by the cpu. So the stronger cpu the more utilisation on the gpu and higher productivity.
Best thing is to monitor with gpu-z how much gpu utilisation you have on average if you are in the 95% range almost all the time then no need to bother.
More in parallell is often better as you have set it than fewer until the time when bus/cpu becomes bottleneck.
14) Message boards : Number crunching : CPU power required to drive 4 AMD GPU ? (Message 1583891)
Posted 9 Oct 2014 by Profile -= Vyper =-
As i suspected when i checked your webpage. Those that have been running coinmining are soon returning to distributed computing as it's not sustainable anylonger to mine due to powerbill is higher than the relative income. :)

Welcome with your rig at S@H :)
15) Message boards : Number crunching : AP V7 (Message 1583515)
Posted 8 Oct 2014 by Profile -= Vyper =-
Got ya.. Thanks all for the clearup.

Yes i have a Nvidia card in this case! :)
16) Message boards : Number crunching : AP V7 (Message 1583502)
Posted 8 Oct 2014 by Profile -= Vyper =-
It says 701 in version, in the apps section i read 705 at http://setiathome.berkeley.edu/apps.php

Shouldnt it say 705 in the app_info instead of 701 as in the aistub file?
17) Message boards : Number crunching : AP V7 (Message 1583497)
Posted 8 Oct 2014 by Profile -= Vyper =-
An app_info example would be great aswell.
Or is it copy paste from a AP6 config and just changing the numbers?
If someone got a working AP7 app_info with new apps i would gladly try it aswell.

Mike's packages include an 'AIstub' file - a segment of information in app_info.xml format which can be added to an existing app_info file to add AP V7 functionality. Add as many as you wish, for CPU and/or any of the three GPU manufacturers.

No-one has tested them on the main project yet, because there's no work available - that's another reason why the installer will be slow arriving, because I need to test it with data after I've finished writing it.


There!! Thanks!
18) Message boards : Number crunching : AP V7 (Message 1583490)
Posted 8 Oct 2014 by Profile -= Vyper =-
An app_info example would be great aswell.
Or is it copy paste from a AP6 config and just changing the numbers?
If someone got a working AP7 app_info with new apps i would gladly try it aswell.
19) Message boards : Cafe SETI : Fundraising time again..... (Message 1569655)
Posted 9 Sep 2014 by Profile -= Vyper =-
Agree fully, like wth?

Interfere, how can Money interfere in any way? Just my 2 cents.
Very odd and strange!
20) Message boards : Cafe SETI : What are you YouTubing today? (Message 1525934)
Posted 8 Jun 2014 by Profile -= Vyper =-
https://www.youtube.com/watch?v=PWGzPVxtxYs


Next 20

Copyright © 2016 University of California