Posts by ralphw

1) Questions and Answers : GPU applications : NVIDIA fan controls (Linux MINT 18, Driver Version 367.57, two NVidia GTX 950s) (Message 1831601)
Posted 17 days ago by Profile ralphw
The problem seems to be that manual fan control is enabled by the "Coolbits" option in xorg.conf

Problem 1 (solved): xorg.conf gets overwritten with new values upon reboot. Enabling nogpumanager in GRUB doesn't fix this in Linux MINT 18, so I had to use restore to using chattr +i on the /etc/X11/xorg.conf file.

Problem 2 (working on): even though the line
"Coolbits" "12" is now preserved for both device entries, only one NVidia device allows for manual control of fan speed. Working on a solution using this Folding@home page as a guide:

https://foldingforum.org/viewtopic.php?p=267165
It requires setting up multiple X Servers, one per GPU.

I'd prefer to easier configuration options for multiple GPUs and multiple monitors, along with a more aggressive adaptive fan control system. I'm not overclocking, but SETI@home pushes GPUs hard.
2) Questions and Answers : GPU applications : NVIDIA fan controls (Linux MINT 18, Driver Version 367.57, two NVidia GTX 950s) (Message 1831561)
Posted 18 days ago by Profile ralphw
So I'm thinking that there is a driver compatibility issue with Linux Mint 18, that might be preventing things from working. This guide shows use of the PPA install method with Linux MINT 18.
https://johners.tech/2016/07/installing-the-latest-nvidia-graphics-drivers-on-linux-mint-18/

I was using the PPA for graphics drivers, but am going to try switching to the .run installation method (downloading the driver directly from NVidia).
(This post talks about CUDA support, but recommends NOT using the PPA method - https://devtalk.nvidia.com/default/topic/955464/cuda-setup-and-installation/gtx-1080-cuda-8-0-on-linux-mint-18-problems-setting-up-/#reply)

Not sure if this will work any better, but I need to try something to combat the frustration of (works with 14.04 / doesn't work with MINT 18) syndrome.

Fan controls in NVIDIA seem important when I'm running GPU apps on SETI.
90 degree GPU temps seem too high.
3) Questions and Answers : GPU applications : NVIDIA fan controls (Linux MINT 18, Driver Version 367.57, two NVidia GTX 950s) (Message 1831536)
Posted 18 days ago by Profile ralphw
Those look interesting, but the NVIDIA driver controls are supposed to be able to regulate this. Aside from googling (which I'm trying), how can I be sure the driver version is "proper" and supports the fan speed setting capability?
4) Questions and Answers : GPU applications : NVIDIA fan controls (Linux MINT 18, Driver Version 367.57, two NVidia GTX 950s) (Message 1831446)
Posted 18 days ago by Profile ralphw
I'm running SETI@home 8.0 applications on Linux Mint 18 (based on Ubuntu 16.04).
GPU temps are extemely high, and I feel I need better speed controls for the GPU fans.

>nvidia-smi
>Sat Nov 19 12:00:51 2016
>+-----------------------------------------------------------------------------+
>| NVIDIA-SMI 367.57 Driver Version: 367.57 |
>|-------------------------------+----------------------+----------------------+
>| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
>| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
>|===============================+======================+======================|
>| 0 GeForce GTX 950 Off | 0000:01:00.0 On | N/A |
>| 0% 40C P0 23W / 125W | 214MiB / 1996MiB | 0% Default |
>+-------------------------------+----------------------+----------------------+
>| 1 GeForce GTX 950 Off | 0000:02:00.0 Off | N/A |
>| 0% 25C P8 6W / 125W | 1MiB / 1996MiB | 0% Default |
>+-------------------------------+----------------------+----------------------+

....


Despite playing with the "coolbits" settings in the Xorg.conf file, no controls to set the fans are ever activated.
It also seems like the settings should be under the "device" section, but the nvidia-settings CLI option puts them in the "screens" section.

I have two NVidia GTX 950s in this system.

When runnning SETI@home, GPU temps would peak briefly at 90 degrees C, which is about 40 degrees too hot.
So I'm looking for a way to turn the fans on more or continuously when running GPU work units.
5) Questions and Answers : Wish list : Request: Linux client use DMI info (base board manuf/product) to populate "Model" field for computer (Message 1758007)
Posted 22 Jan 2016 by Profile ralphw
When you look at computer details, the "Product Name" field shows the same information as what's in the "Model" field in the summary list of computers:

The detail page for this mac shows "MacBookPro11,3" in the Product Name field.

On the summary page (listing all computers), the same field is visible under the column "Model". I don't know why the field name changes between the summary and detail page...

So Boinc must support the transmission of this information somehow.
6) Questions and Answers : Wish list : Request: Linux client use DMI info (base board manuf/product) to populate "Model" field for computer (Message 1757649)
Posted 20 Jan 2016 by Profile ralphw
When I look at "My Computers", I see only one system that has populated the computer "Model field".

I'd love for this to populate for an PC motherboard as well, what would I have to do to manually populate this?

Can it be done at a workunit level?

(Here is what "dmidecode -t baseboard" shows on the system I'm using right now)

Handle 0x0002, DMI type 2, 15 bytes
Base Board Information
Manufacturer: Gigabyte Technology Co., Ltd.
Product Name: Z97X-SOC Force
7) Message boards : Number crunching : Intel / OpenCL binaries for Linux (2015) (Message 1756480)
Posted 15 Jan 2016 by Profile ralphw
My experiment with Beignet doesn't appear to be successful with Astropulse 7.0.8.
My machine has three GPUs (Nvidia GTX 750Ti), plus the Intel HD 4600 graphics.

I think it's getting confused, thinking it sees two Nvidias, but trying to use the Intel. Would trying an older NVidia driver help?



https://setiathome.berkeley.edu/result.php?resultid=4662135832
....

<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
Running on device number: 0
OpenCL platform detected: NVIDIA Corporation
Number of OpenCL devices found : 2
BOINC assigns slot on device #0.
Info: BOINC provided OpenCL device ID used
Used GPU device parameters are:
Number of compute units: 5
Single buffer allocation size: 256MB
Total device global memory: 2047MB
max WG size: 1024
local mem type: Real
FERMI path used: yes
-unroll default value used: 5
-ffa_block default value used: 1280
-ffa_block_fetch default value used: 640
AstroPulse v7.08
Linux 64 bit, Rev 2751, OpenCL version by Raistmer, GPU mode
V7, by Raistmer ported to Linux by Lunatics.kwsn.net team.
by Urs Echternacht
ffa threshold mods by Joe Segur
SSE3 dechirping by JDWhale using SSE3 emulation

Build features: Non-graphics OpenCL USE_OPENCL_NV OCL_ZERO_COPY OPENCL_WRITE COMBINED_DECHIRP_KERNEL SMALL_CHIRP_TABLE TWIN_FFA FFTW BLANKIT USE_INCREASED_PRECISION SSE2 64bit
System: Linux x86_64 Kernel: 3.16.0-38-generic
CPU : Intel(R) Core(TM) i7-4790K CPU @ 4.00GHz
8 core(s), Speed : 4300.000 MHz
L1 : 64 KB, Cache : 8192 KB

Number of OpenCL platforms: 2


OpenCL Platform Name: NVIDIA CUDA
Number of devices: 2
Max compute units: 5
Max work group size: 1024
Max clock frequency: 1254Mhz
Max memory allocation: 536821760
Cache type: Read/Write
Cache line size: 128
Cache size: 81920
Global memory size: 2147287040
Constant buffer size: 65536
Max number of constant args: 9
Local memory type: Scratchpad
Local memory size: 49152
Queue properties:
Out-of-Order: Yes
Name: GeForce GTX 750 Ti
Vendor: NVIDIA Corporation
Driver version: 352.63
Version: OpenCL 1.2 CUDA
Extensions: cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_copy_opts cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64
Max compute units: 5
Max work group size: 1024
Max clock frequency: 1254Mhz
Max memory allocation: 536821760
Cache type: Read/Write
Cache line size: 128
Cache size: 81920
Global memory size: 2147287040
Constant buffer size: 65536
Max number of constant args: 9
Local memory type: Scratchpad
Local memory size: 49152
Queue properties:
Out-of-Order: Yes
Name: GeForce GTX 750 Ti
Vendor: NVIDIA Corporation
Driver version: 352.63
Version: OpenCL 1.2 CUDA
Extensions: cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_copy_opts cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64


OpenCL Platform Name: Experiment Intel Gen OCL Driver
DRM_IOCTL_I915_GEM_APERTURE failed: Invalid argument
Assuming 131072kB available aperture size.
May lead to reduced performance or incorrect rendering.
get chip id failed: -1 [22]
param: 4, val: 0
SIGSEGV: segmentation violation
Stack trace (15 frames):
../../projects/setiathome.berkeley.edu/astropulse_7.08_x86_64-pc-linux-gnu__opencl_nvidia_100(boinc_catch_signal+0x4d)[0x4c6fdd]
/lib/x86_64-linux-gnu/libpthread.so.0(+0x10340)[0x7fc8c8194340]
/usr/lib/x86_64-linux-gnu/libdrm_intel.so.1(drm_intel_bufmgr_gem_enable_reuse+0x0)[0x7fc8c2f4d0d0]
/usr/lib/beignet/libcl.so(+0x13b58)[0x7fc8c4a5eb58]
/usr/lib/beignet/libcl.so(+0x13d07)[0x7fc8c4a5ed07]
/usr/lib/beignet/libcl.so(+0x13e21)[0x7fc8c4a5ee21]
/usr/lib/beignet/libcl.so(+0x13f08)[0x7fc8c4a5ef08]
/usr/lib/beignet/libcl.so(+0xf45d)[0x7fc8c4a5a45d]
/usr/lib/beignet/libcl.so(+0xf521)[0x7fc8c4a5a521]
../../projects/setiathome.berkeley.edu/astropulse_7.08_x86_64-pc-linux-gnu__opencl_nvidia_100[0x4885df]
../../projects/setiathome.berkeley.edu/astropulse_7.08_x86_64-pc-linux-gnu__opencl_nvidia_100[0x488dfc]
../../projects/setiathome.berkeley.edu/astropulse_7.08_x86_64-pc-linux-gnu__opencl_nvidia_100[0x461896]
../../projects/setiathome.berkeley.edu/astropulse_7.08_x86_64-pc-linux-gnu__opencl_nvidia_100[0x46a205]
/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf5)[0x7fc8c719dec5]
../../projects/setiathome.berkeley.edu/astropulse_7.08_x86_64-pc-linux-gnu__opencl_nvidia_100[0x40bd89]

Exiting...

</stderr_txt>
]]>
8) Questions and Answers : Unix/Linux : One of my computers is "lost" (Message 1756478)
Posted 15 Jan 2016 by Profile ralphw
Sometimes BOINC assigns separate identities to the same computer by mistake. You can correct this by merging old identities with the newest one.

This list below seems pretty strange.

It looks like I should ALWAYS do an "add project" by signing up with email/password, instead of using the authenticator in the account_info.xml file.

>> Here's the output from when I went into the "merge computers" section.

Host 1161925 has an incompatible OS: (Linux, Darwin)
Host 1284323 has an incompatible OS: (Linux, Microsoft Windows 2000)
Host 2175038 has an incompatible CPU: (GenuineIntel Intel(R) Core(TM) i7-4790K CPU @ 4.00GHz [Family 6 Model 60 Stepping 3], GenuineIntel Intel(R) Pentium(R) 4 CPU 3.00GHz)
Host 5070334 has an incompatible OS: (Linux, Microsoft Windows XP)
Host 5080967 has an incompatible OS: (Linux, Darwin)
Host 6437752 has overlapping lifetime: (13 Jan 2016 - 15 Jan 2016), (25 Feb 2012 - 14 Jan 2016)
Host 7582484 has an incompatible OS: (Linux, Darwin)
Host 7834880 has an incompatible CPU: (GenuineIntel Intel(R) Core(TM) i7-4790K CPU @ 4.00GHz [Family 6 Model 60 Stepping 3], GenuineIntel Intel(R) Pentium(R) CPU J2900 @ 2.41GHz [Family 6 Model 55 Stepping 8])
Host 7837118 has overlapping lifetime: (13 Jan 2016 - 15 Jan 2016), (29 Nov 2015 - 15 Jan 2016)
Host 7874920 has overlapping lifetime: (13 Jan 2016 - 15 Jan 2016), (5 Jan 2016 - 15 Jan 2016)
No hosts are eligible for merging with this one.
9) Questions and Answers : Unix/Linux : One of my computers is "lost" (Message 1755914)
Posted 13 Jan 2016 by Profile ralphw
Yes, both are visible now.

I'm still wondering what happened, since it was missing from the earlier list of "my computers".

I did put the USB->Wireless adapter back in its original place, and turned that machine back on.
10) Questions and Answers : Unix/Linux : One of my computers is "lost" (Message 1755896)
Posted 13 Jan 2016 by Profile ralphw
A processor with a XEON CPU, Locutus, is no longer listed under my 'computers'.

How does BOINC/SETI determine the Computer ID?
My stats no longer seem to list 'locutus', a XEON 3550 on a HP Z400 board.

I did two things within an hour of each other.

First, I moved the USB-> Wireless adapter from 'locutus' to a different computer, named 'Asimov'. Did the computer ID move with the Mac address?
(They run the same version of Linux mint, if that matters.)

I also moved my SETI account info.xml file for SETI from a third computer to Asimov.

But I still see 'Sagan' (oldest computer - core i7-920) and 'Asimov' (newest computer, Intel 4790k) as independent systems.

I'm stumped as to how this happened.

- Ralph
11) Message boards : Number crunching : Intel / OpenCL binaries for Linux (2015) (Message 1754032)
Posted 5 Jan 2016 by Profile ralphw
Is there direct support for the Intel Graphics (HD 4600) chip?

I'm now using a motherboard with an Intel i7 4790k, plus a couple of NVIDIA GEFORCE GTX 950s. Having all three doing GPU tasks would be wonderful.

The OS X Science app seems to be able to do this, though it seems both chips are using OpenCL support:


  • NVIDIA GeForce GT 750M (2048MB) OpenCL: 1.2,
  • INTEL Iris Pro (1536MB) OpenCL: 1.2



If I need to install Darwin (Unix) on this Linux (Unix) box, I'll give that a shot if it means more RAC.

12) Message boards : News : SETI@home Version 8 has been released. (Message 1753770)
Posted 4 Jan 2016 by Profile ralphw
Thanks for the update. I had noticed my RAC had dropped off, more due to networking issues than anything else.

I see that I'm running a combination of SETI@home v7 and SETI@home v8.

Do I need to do anything to the BOINC software at this point?

Computing for science on Unix,
13) Message boards : Number crunching : Different Setting for more than 1 GPU (Message 1745663)
Posted 29 Nov 2015 by Profile ralphw
If I could chose the time-space tradeoffs for each card, that would be helpful.
While Astropulse is more constrained by memory bandwidth, it would be great if there were explicit controls available to maximize GPU utilization for the specific architecture, for *all* of the SETI@home science clients.

<device0> (for example, NVidia 950 w/ 4G RAM)
<unroll>32</unroll>
<ffa_block>12288</ffa_block>
<ffa_block_fetch>16384</ffa_block_fetch>
</device0>

<device1> (for example, NVidia 750 with 2G RAM)
<unroll>16</unroll>
<ffa_block>6144</ffa_block>
<ffa_block_fetch>8192</ffa_block_fetch>
</device1>

I'm making assumptions here:
- that more loop unrolling is better
- more unrolling needs more RAM on the host and the GPU, but it gives more memory bandwidth.
- it's sad that compilers can't implement this in 2015.


Is there a utility that can be run to calculate the optimal SETI settings for a specific GPU, from benchmarking runs? (NVidia, AMD, or Intel)
14) Message boards : Number crunching : Intel / OpenCL binaries for Linux (2015) (Message 1745469)
Posted 28 Nov 2015 by Profile ralphw
I'll play around with Beignet this weekend.

Intel HD Graphics Desktop (Bay Trail) is the GPU family - just 4 execution units for the GPU - it's designed for low power consumption - a "notebook GPU"
http://setiathome.berkeley.edu/show_host_detail.php?hostid=7834880

www.notebookcheck.net/Intel-Pentium-J2900-Desktop-Processor.105900.0.html
has some additional details on clock rates, etc.
15) Message boards : Number crunching : Intel / OpenCL binaries for Linux (2015) (Message 1745083)
Posted 26 Nov 2015 by Profile ralphw
I'm looking for answers to two questions:

1) Does the "Intel HD" graphics capability baked into the J2900 processor offer any potential benefit to SETI crunching?

26-Nov-2015 11:34:34 [---] No usable GPUs found
26-Nov-2015 11:34:34 [---] Creating new client state file
26-Nov-2015 11:34:34 [---] Host name: xyzzy
26-Nov-2015 11:34:34 [---] Processor: 4 GenuineIntel Intel(R) Pentium(R) CPU J2
900 @ 2.41GHz [Family 6 Model 55 Stepping 8]
26-Nov-2015 11:34:34 [---] Processor features: fpu vme de pse tsc msr pae mce cx
8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht
tm pbe syscall nx rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xto
pology nonstop_tsc aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ss
se3 cx16 xtpr pdcm sse4_1 sse4_2 movbe popcnt tsc_deadline_timer rdrand lahf_lm
3dnowprefetch ida arat epb dtherm tpr_shadow vnmi flexpriority ept vpid tsc_adju
st smep erms

2) Linux 64-bit builds seem to offer no OpenCL/Intel client. Is there a minimum level of Intel GPU capability that can be supported? (The J2900 only claims to have 4 cores, vs hundreds of cores in other Intel graphics features)

Here's the list of SETI@home apps and the hardware they support

Windows/x86
7.07 (opencl_intel_gpu_sah) 14 Jul 2015 (19,276 GigaFLOPS)

Mac OS/X 10.3+
7.00 29 May 2013, (162 GigaFLOPS)

Mac OS X/Intel
7.00 29 May 2013 (1,655 GigaFLOPS)

Linux/x86_64
7.01 29 May 2013 (39,858 GigaFLOPS)
7.08 (opencl_ati5_cat132) 18 Sep 2015 (97 GigaFLOPS)
7.08 (opencl_ati5_nocal) 18 Sep 2015 (163 GigaFLOPS)
7.08 (opencl_ati5_sah) 18 Sep 2015 (0 GigaFLOPS)
7.08 (opencl_ati_cat132) 18 Sep 2015 (60 GigaFLOPS)
7.08 (opencl_ati_nocal) 18 Sep 2015 (15 GigaFLOPS)
7.08 (opencl_ati_sah) 18 Sep 2015 (29 GigaFLOPS)
7.08 (opencl_nvidia_sah) 18 Sep 2015 (3,672 GigaFLOPS)
NO opencl_intel client here
[/quote]
16) Message boards : Number crunching : Welcome to the 15 year Club (Message 1741112)
Posted 10 Nov 2015 by Profile ralphw
I was getting ready to move in May 2014, so was inactive then. I'm happy to be crunching data again.

I've upgraded my main computer a little bit since 1999, but this CPU is from 2009, so it's not TOO old yet as long as I put a new GPU in once in a while.

I'd love to see if I can compile a usable client for my Playstation 2 - it has the Linux kit, but (sadly) only 16M of RAM. The "emotion engine" was quite a chip in it's day - when 2 Million polygons per second rendering was a good rate.
17) Message boards : Number crunching : Optimizing for happiness with NVIDIA GPU (Message 1740402)
Posted 7 Nov 2015 by Profile ralphw
Yes, I meant Credit - Recent Average Credit (RAC). Not WorkUnits.

10,000 Workunits per day would mean I wouldn't need my furnace to heat my home anymore!
18) Message boards : Number crunching : Optimizing for happiness with NVIDIA GPU (Message 1739041)
Posted 2 Nov 2015 by Profile ralphw
The story so far:


  • Running the stock OpenCL client for AP and SETI 7.01/7.08
  • Updated my drivers to support 950GTX and w 750GTX Ti in the same box
  • processing about 6800 workunits per day (75% of a single core, plus all GPU work)



I'm ready to try out an optimized client, what do you recommend for an 8-core Intel i7 with three GPUs (one is about 50% "faster" than the other two.

I think I can achieve 10,000 WUs/day

19) Message boards : Number crunching : Since NVIDIA 950GTX cards exist, why don't they show up in GPU stats? (Message 1738827)
Posted 1 Nov 2015 by Profile ralphw
When I check the "top GPU stats", I don't see any Nvidia 950 GTX statistics for Linux (or any other platform, for that matter):

What determines whether a GPU model is recognized as a distinct one?
20) Message boards : Number crunching : NVIDIA GPU blues (750Ti and 250) (Message 1737720)
Posted 28 Oct 2015 by Profile ralphw
I added a GTX 950, though I'm not currently seeing the peak GFLOPS of that card reflected in time to do GPU workunits.

OpenCL: GPU 0: GeForce GTX 950 (driver 352.55, device OpenCL 1.2 CUDA, 2047MB, 1790MB available, 3208 GFLOPS peak)
OpenCL: GPU 1: GeForce GTX 750 Ti (driver 352.55, device OpenCL 1.2 CUDA, 2048MB, 2011MB available, 2409 GFLOPS peak)
OpenCL: GPU 2: GeForce GTX 750 Ti (driver 352.55, device OpenCL 1.2 CUDA, 2048MB, 2011MB available, 2409 GFLOPS peak)

At any rate, I updated the driver from 340.X to 352.55 as well. I'll play around to see if I can get CUDA workunits to process, but I think I'm ready to try an optimized client now.


Next 20


 
©2016 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.