Posts by Sean

1) Message boards : Number crunching : Flakey AMD/ATI GPUs, including RX 5700 XT, Cross Validating, polluting the Database (Message 2019519)
Posted 18 Nov 2019 by Profile Sean Project Donor
Post:
Yet another instance: https://setiathome.berkeley.edu/workunit.php?wuid=3744949125
2) Message boards : Cafe SETI : 20th Anniversary T-shirts - More in stock (Message 1989125)
Posted 7 Apr 2019 by Profile Sean Project Donor
Post:
I'm in for 1 medium.
3) Message boards : Number crunching : Developing AMD GPU Utilities (Message 1982156)
Posted 25 Feb 2019 by Profile Sean Project Donor
Post:
amdgpu-ls output is now:
./amdgpu-ls
AMD Wattman features enabled: 0xffffffff
amdgpu version: 18.50-725072
2 AMD GPUs detected
Error: HW file doesn't exist: /sys/class/drm/card1/device/hwmon/hwmon1/power1_cap_max
Error: HW file doesn't exist: /sys/class/drm/card1/device/hwmon/hwmon1/power1_cap
Error: HW file doesn't exist: /sys/class/drm/card1/device/hwmon/hwmon1/power1_average
Error: HW file doesn't exist: /sys/class/drm/card1/device/hwmon/hwmon1/temp1_input
Error: HW file doesn't exist: /sys/class/drm/card1/device/hwmon/hwmon1/temp1_crit
Error: HW file doesn't exist: /sys/class/drm/card1/device/hwmon/hwmon1/fan1_enable
Error: HW file doesn't exist: /sys/class/drm/card1/device/hwmon/hwmon1/fan1_target
Error: HW file doesn't exist: /sys/class/drm/card1/device/hwmon/hwmon1/fan1_input
Error: HW file doesn't exist: /sys/class/drm/card1/device/hwmon/hwmon1/fan1_max
Error: HW file doesn't exist: /sys/class/drm/card1/device/hwmon/hwmon1/pwm1_enable
Error: HW file doesn't exist: /sys/class/drm/card1/device/hwmon/hwmon1/pwm1
Error: HW file doesn't exist: /sys/class/drm/card1/device/hwmon/hwmon1/pwm1_max
Error: HW file doesn't exist: /sys/class/drm/card1/device/hwmon/hwmon1/in0_label
Error: HW file doesn't exist: /sys/class/drm/card0/device/hwmon/hwmon0/power1_cap_max
Error: HW file doesn't exist: /sys/class/drm/card0/device/hwmon/hwmon0/power1_cap
Error: HW file doesn't exist: /sys/class/drm/card0/device/hwmon/hwmon0/power1_average
Error: HW file doesn't exist: /sys/class/drm/card0/device/hwmon/hwmon0/temp1_input
Error: HW file doesn't exist: /sys/class/drm/card0/device/hwmon/hwmon0/temp1_crit
Error: HW file doesn't exist: /sys/class/drm/card0/device/hwmon/hwmon0/fan1_enable
Error: HW file doesn't exist: /sys/class/drm/card0/device/hwmon/hwmon0/fan1_target
Error: HW file doesn't exist: /sys/class/drm/card0/device/hwmon/hwmon0/fan1_input
Error: HW file doesn't exist: /sys/class/drm/card0/device/hwmon/hwmon0/fan1_max
Error: HW file doesn't exist: /sys/class/drm/card0/device/hwmon/hwmon0/pwm1_enable
Error: HW file doesn't exist: /sys/class/drm/card0/device/hwmon/hwmon0/pwm1
Error: HW file doesn't exist: /sys/class/drm/card0/device/hwmon/hwmon0/pwm1_max
Error: HW file doesn't exist: /sys/class/drm/card0/device/hwmon/hwmon0/in0_label
2 are Compatible

Error: Invalid pstate entry: /sys/class/drm/card0/device/pp_od_clk_voltage
Error: Invalid pstate entry: /sys/class/drm/card0/device/pp_od_clk_voltage
Error: Invalid pstate entry: /sys/class/drm/card0/device/pp_od_clk_voltage
UUID: 5d20111fb1d24b97a38ea653c57c55af
Card Model:  Vega 10 XT [Radeon RX Vega 64]
Short Card Model:  RX Vega 64
Card Number: 1
Card Path: /sys/class/drm/card1/device/
PCIe ID: 06:00.0
Driver: amdgpu
HWmon: /sys/class/drm/card1/device/hwmon/hwmon1/
Current Power (W): -1
Power Cap (W): -1
Power Cap Range (W): [-1, -1]
Fan Enable: -1
Fan PWM Mode: [-1, 'UNK']
Current Fan PWM (%): -1
Current Fan Speed (rpm): -1
Fan Target Speed (rpm): -1
Fan Speed Range (rpm): [-1, -1]
Fan PWM Range (%): [-1, -1]
Current Temp (C): -1
Critical Temp (C): -1
Current VddGFX (mV): -1
Vddc Range: ['800mV', '1200mV']
Current Loading (%): 83
Link Speed: 8 GT/s
Link Width: 16
vBIOS Version: 113-D0500300-101
Current SCLK P-State: 7
Current SCLK: 1590Mhz 
SCLK Range: ['852MHz', '2400MHz']
Current MCLK P-State: 3
Current MCLK: 800Mhz 
MCLK Range: ['167MHz', '1500MHz']
Power Performance Mode: 2-VIDEO
Power Force Performance Level: auto

UUID: 2ffbc1178e06458783b121e71dc487bd
Card Model:  Device 081e
Short Card Model:  Device 081e
Card Number: 0
Card Path: /sys/class/drm/card0/device/
PCIe ID: 03:00.0
Driver: amdgpu
HWmon: /sys/class/drm/card0/device/hwmon/hwmon0/
Current Power (W): -1
Power Cap (W): -1
Power Cap Range (W): [-1, -1]
Fan Enable: -1
Fan PWM Mode: [-1, 'UNK']
Current Fan PWM (%): -1
Current Fan Speed (rpm): -1
Fan Target Speed (rpm): -1
Fan Speed Range (rpm): [-1, -1]
Fan PWM Range (%): [-1, -1]
Current Temp (C): -1
Critical Temp (C): -1
Current VddGFX (mV): -1
Vddc Range: ['', '']
Current Loading (%): 97
Link Speed: 8 GT/s
Link Width: 16
vBIOS Version: 113-D3600200-105
Current SCLK P-State: -1
Current SCLK: 
SCLK Range: ['808Mhz', '2200Mhz']
Current MCLK P-State: -1
Current MCLK: 
MCLK Range: ['351Mhz', '1200Mhz']
Power Performance Mode: 2-VIDEO
Power Force Performance Level: auto

Running Ubuntu 18.04 LTS.
I am on GitHub, although I will say this is the first time I have actually used it.
The thread you linked has the GRUB setting as "0xffffffff". Is there a difference between that and "0xffff7fff"?
4) Message boards : Number crunching : Developing AMD GPU Utilities (Message 1982091)
Posted 25 Feb 2019 by Profile Sean Project Donor
Post:
Master branch initially. I've now switched to the v2.1.0 branch and the results appear the same.
Output is the same with and without using sudo.

amdgpu-ls outputs:
./amdgpu-ls
AMD Wattman features enabled: 0xffff7fff
amdgpu version: 18.50-725072
2 AMD GPUs detected
2 are Compatible

Traceback (most recent call last):
  File "./amdgpu-ls", line 136, in <module>
    main()
  File "./amdgpu-ls", line 124, in main
    gpu_list.get_pstates()
  File "/home/sean/amdgpu-utils/GPUmodules/GPUmodules.py", line 512, in get_pstates
    v.get_pstates()
  File "/home/sean/amdgpu-utils/GPUmodules/GPUmodules.py", line 311, in get_pstates
    self.sclk_state[lineitems[0]] = [lineitems[1],lineitems[2]]
IndexError: list index out of range
5) Message boards : Number crunching : Developing AMD GPU Utilities (Message 1982089)
Posted 25 Feb 2019 by Profile Sean Project Donor
Post:
Here are my results with the Vega56 and Radeon VII
amdgpu-monitor outputs:
┌────────────┬────────────┬────────────┐
│Card #      │card1       │card0       │
├────────────┼────────────┼────────────┤
│Model       │ RX Vega 64 │ Device 081e│
│Load %      │88          │97          │
│Power (W)   │-1          │-1          │
│Power Cap (W│-1          │-1          │
│T (C)       │-1          │-1          │
│VddGFX (mV) │-1          │-1          │
│Sclk (MHz)  │1590Mhz     │            │
│Sclk Pstate │7           │-1          │
│Mclk (MHz)  │800Mhz      │            │
│Mclk Pstate │3           │-1          │
│Perf Mode   │2-VIDEO     │2-VIDEO     │
└────────────┴────────────┴────────────┘

amdgpu-pac gives me the following:
~/amdgpu-utils$ sudo ./amdgpu-pac
AMD Wattman features enabled: 0xffff7fff
amdgpu version: 18.50-725072
2 AMD GPUs detected
Traceback (most recent call last):
  File "./amdgpu-pac", line 758, in <module>
    main()
  File "./amdgpu-pac", line 731, in main
    gpu_list.get_pstates()
  File "/home/sean/amdgpu-utils/GPUmodules/GPUmodules.py", line 457, in get_pstates
    v.get_pstates()
  File "/home/sean/amdgpu-utils/GPUmodules/GPUmodules.py", line 320, in get_pstates
    self.sclk_state[lineitems[0]] = [lineitems[1],lineitems[2]]
IndexError: list index out of range

Something I'm doing wrong?
6) Message boards : Number crunching : Developing AMD GPU Utilities (Message 1982020)
Posted 24 Feb 2019 by Profile Sean Project Donor
Post:
I really appreciate your efforts in developing this. I haven't installed your utilities just yet, but plan to sometime during the next week. Thank you!
7) Message boards : Number crunching : Radeon VII Seti performance vs 1080ti SoG? (Message 1981122)
Posted 19 Feb 2019 by Profile Sean Project Donor
Post:
Up and crunching now under Ubuntu 18.04. Vega 56 and Radeon VII in the same system.

https://setiathome.berkeley.edu/show_host_detail.php?hostid=6993114
8) Message boards : Number crunching : Radeon VII Seti performance vs 1080ti SoG? (Message 1980616)
Posted 16 Feb 2019 by Profile Sean Project Donor
Post:
I'm curious to see how your VII performs over time. I also got a VII and was able to run a dozen or so work units on it, but ran into the same problem that I had with my Vega56: It seems to get through some units just fine but then runs into others that cause display driver crashes/restarts or outright system hangs. The same system runs work units on a R9-280x and simultaneously mines on the Vega (now VII) for weeks on end, so I don't think power is the issue. Anyway, I'm planning to try crunching under Linux this weekend.
9) Questions and Answers : GPU applications : C2D + RX460 (Message 1924514)
Posted 14 Mar 2018 by Profile Sean Project Donor
Post:
https://setiathome.berkeley.edu/show_host_detail.php?hostid=8203828

Depends on what you consider to be worth your "time/power" :)
10) Questions and Answers : GPU applications : Gpu overheat on AMD Sapphire Tri-X R9 290 (Message 1903760)
Posted 30 Nov 2017 by Profile Sean Project Donor
Post:
In your BOINC data directory there may (or may not) be a file called cc_config.xml

https://boinc.berkeley.edu/wiki/Client_configuration has in depth details on creating and editing this file, but adding:

    <ignore_ati_dev>0</ignore_ati_dev>


should stop BOINC from using your Radeon.

11) Questions and Answers : GPU applications : AMD Rx Vega 56 issues (Message 1893506)
Posted 6 Oct 2017 by Profile Sean Project Donor
Post:
Scanning thru the first page of your results under the inconclusive column, the RX280 is throwing more than the Vega 56.

I hate to admit it, but that's because I've had the Vega solely on mining duty for the past couple weeks.
The 280x and CPU I've kept dedicated to SETI tasks. The 280x seems particularly well suited for SETI - I can't remember the last time it gave an invalid result and I'm quite happy with 30k RAC from the machine. I almost wish I had just bought another 280x instead of the Vega!
12) Message boards : Number crunching : It only took about 7 months (old cruncher thread)... (Message 1891781)
Posted 25 Sep 2017 by Profile Sean Project Donor
Post:
But my Pentium 3 rig has now completed the amount of work that my main rig does in one day!
Too bad those SLI Voodoo2's can't help crunch :)

https://setiathome.berkeley.edu/show_host_detail.php?hostid=8203828
13) Questions and Answers : GPU applications : AMD Rx Vega 56 issues (Message 1891420)
Posted 22 Sep 2017 by Profile Sean Project Donor
Post:
The one error was a (Unknown error) - exit code -6 (0xfffffffa) error

I guess I should have mentioned that the error(s) occurred when I crashed the system while tinkering, not during normal operation.
I will take a look at different drivers though!
14) Questions and Answers : GPU applications : AMD Rx Vega 56 issues (Message 1891401)
Posted 22 Sep 2017 by Profile Sean Project Donor
Post:
I've recently added a Vega 56 to my main SETI crunching box:
https://setiathome.berkeley.edu/show_host_detail.php?hostid=6993114

Despite what the details say, the system has a Vega 56 and a R9 280x running together (not 2 Vega cards). I initially ran just the Vega card to try to work out any problems and it seemed like after updating to the newest AMD HD5 SoG app and setting the Vega to run at its base GPU clock speed, tasks were (mostly) validating. At that point, I added the 280x back in and now both cards are running tasks.

So the main issue is that the Vega is still producing quite a few invalids and inconclusives (which I suspect will turn invalid). It also seems that when the Vega and 280x run similar point value tasks, the 280x consistently finishes them slightly faster (and with no errors).

Some potentially important details:
I'm using a 1000 watt platinum power supply (I can mine 24/7 on both cards with no problems)
I am running the AMD HD5 app with settings tuned for the 280x. Maybe Vega doesn't work well with those settings? I'll try reverting to the default settings this evening.
I wiped the old drivers with 'Display Driver Uninstaller' and did a clean install of Crimson 17.9.2.
I run 2 tasks on both cards simultaneously and keep 1 CPU core free for each GPU task.

I would really appreciate feedback from anyone else running a Vega.

Thanks!
15) Questions and Answers : Unix/Linux : Dual Radeon HD7770, Ubuntu 12.04, Not showing full GPU memory? (Message 1868906)
Posted 23 May 2017 by Profile Sean Project Donor
Post:
Question pertains to this host: https://setiathome.berkeley.edu/show_host_detail.php?hostid=7184655

This system has been running mostly trouble free (which is why I hesitate to 'upgrade' to a newer Ubuntu) for a few years now with a single HD7770.
To boost its RAC, I decided to add a second and identical Radeon. It took some fiddling to get it to work, but once it was working, the S@H host information indicated that the cards were something like 930mb each whereas before it showed the full 1024mb on the single card. Well, that didn't bother me too much but then I goofed it up and had to reinitialize the driver for a second time. Now it is saying the cards are 899mb... what's happening?

Thanks in advance!
16) Questions and Answers : Unix/Linux : Trying to move boinc-client folder to a USB flash drive (Message 1839678)
Posted 2 Jan 2017 by Profile Sean Project Donor
Post:
It's working now! For anyone who might be wondering:
    sudo gedit /usr/share/applications/boinc-manager.desktop
    Changed the Exec= line to "Exec=boincmgr -d /media/me/usbdrive/boinc-client"
    Made sure the boinc-client folder permissions were set "sudo chown -R boinc:boinc /media/me/usbdrive/boinc-client"
    Start the boinc service, launch the manager, and it's working with the data directory on the usb drive


For whatever reason, I still have to manually restart the boinc-client service after rebooting, but I can live with that. I should only be rebooting for updates and power outages.

17) Questions and Answers : Unix/Linux : Trying to move boinc-client folder to a USB flash drive (Message 1839650)
Posted 2 Jan 2017 by Profile Sean Project Donor
Post:
Perhaps making a little progress figuring this out, but I'm still not sure why it isn't working the way I think it should...

I went back to trying the symlink method:
    boinc-client folder moved to the flash drive
    symlink created pointing /var/lib/boinc-client to the boinc-client directory on the flash drive
    restart the boinc service and launch the manager
    result = boinc manager stuck on trying to connect to the client



Now I discovered a bit by accident running "boinc" from the command line (with or without sudo) gives me some useful information about the data directory:

    02-Jan-2017 10:53:07 [---] Data directory: /home/me
    Why is it trying to use my home directory? (edit: figured out it's just going with the current active directory)



OK, so now if I run "sudo boinc --dir /media/me/usbdrive/boinc-client" I get:

    02-Jan-2017 11:13:05 [---] Data directory: /media/me/usbdrive/boinc-client
    Looking good so far...
    Launch the manager and success!



Maybe I can just change the command line for the launcher icon to force the directory?

18) Questions and Answers : Unix/Linux : Trying to move boinc-client folder to a USB flash drive (Message 1839299)
Posted 1 Jan 2017 by Profile Sean Project Donor
Post:
So I've tried a different approach:
    1. Stop the boinc client = sudo service boinc-client stop
    2. Copy the existing data to the new location = sudo cp -R /var/lib/boinc-client /media/me/usbdrive/boinc-client
    3. Make sure permissions are set correctly = sudo chown boinc:boinc -R /media/me/usbdrive/boinc-client/*
    4. Edit the /etc/default/boinc-client file to point to the new data directory = sudo gedit /etc/default/boinc-client and look for the entry that starts with # This is the data directory of the BOINC core client. Add an # in front of the BOINC_DIR="/var/lib/boinc-client" line and then add the following below that line BOINC_DIR="/media/me/usbdrive/boinc-client" and save the file then exit the text editor.
    5. Start the boinc client = sudo service boinc-client start



I also edited the init.d file similar to what was done in step 4.
BOINC manager starts up fine now and appears to be working, but the event log is still saying "Data directory: /var/lib/boinc-client"

Is there something else I'm missing?

19) Questions and Answers : Unix/Linux : Trying to move boinc-client folder to a USB flash drive (Message 1839214)
Posted 31 Dec 2016 by Profile Sean Project Donor
Post:
I've recently assembled a new system (i7-6700T, 32GB ram, M.2 SSD, Ubuntu 16.04 LTS) and I'm trying to relocate the boinc-client folder to a USB flash drive to reduce write activity on the SSD (I'd rather let it chew on an $8 flash drive than a $200 SSD).

I have successfully moved the boinc-client folder to my home directory via the following:
    sudo mv /var/lib/boinc-client /home/me/boinc-client
    sudo ln -s /home/me/boinc-client /var/lib/boinc-client


And that is what I have running at the moment.

However, when I try (from a fresh install):

    sudo mv /var/lib/boinc-client /media/me/usbdrive/boinc-client
    sudo ln -s /media/me/usbdrive/boinc-client /var/lib/boinc-client


Boinc manager is unable to connect to the client. The manager screen will sit on "Connecting to client" for minute or so before popping up an error along the lines of "The client has closed unexpectedly several times in the last several minutes, try again?"
The USB drive is formatted to EXT4 and it does appear that the move command and link command are working correctly. Permissions also appear to be correct for the client folder on the flash drive.

At the risk of making this more complicated, I do have BOINC running on two other older Ubuntu systems. On the older machines, I just downloaded the .sh file and then extracted it and ran BOINC out of my home folder. I tried this on the new system, but unfortunately it looks like version 7.4 wants libraries that are not present in Ubuntu 16 and I could not find anywhere to download a version 7.6 .sh directly (had to install thru Ubuntu software center).

20) Message boards : Number crunching : v8 issues on very old cruncher (Message 1773505)
Posted 23 Mar 2016 by Profile Sean Project Donor
Post:
If you're looking to throw the processing muscle of that Pentium MMX behind another project, PrimeGrid seems to be running fine so far on my K6-2+.

I also tried einstein@home and lhc@home. Neither worked.


Next 20


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.