Ubuntu 16.04.4

Message boards : Number crunching : Ubuntu 16.04.4
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile RueiKe Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 14 Feb 16
Posts: 403
Credit: 215,394,539
RAC: 314,599
Taiwan
Message 1922029 - Posted: 2 Mar 2018, 5:22:31 UTC

Has anyone seen stability issues since upgrading to 16.04.4? I have had 2 system crashes and 2 cases of GPU compute hanging or giving computation errors in the few days since I have upgraded. This is my Threadripper/ProDuo system 8365846. Perhaps it is related to Meltdown/Spectre fixes or maybe I need to reinstall GPU drivers after the upgrade. I did the upgrade during the Tuesday downtime and since then there has been a lot more Arecibo work, which could also be a different stress on my OC.
Instagram: rpc_labs
ID: 1922029 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 6193
Credit: 442,138,070
RAC: 1,023,062
United States
Message 1922033 - Posted: 2 Mar 2018, 6:05:00 UTC - in response to Message 1922029.  

The day Ubuntu released the Meltdown/Spectre fixes in a new kernel and concomitant with Nvidia driver 384.111, caused issues for both Juan and myself with crashes and corrupted tasks. If you hadn't updated since that, likely you got bit too. Both Juan and myself are up to the HWE release kernel 4.13.0-36 now. And moved on to Nvidia driver 390.25.
Seti@Home classic workunits:20,676 CPU time:74,226 hours
ID: 1922033 · Report as offensive
Profile RueiKe Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 14 Feb 16
Posts: 403
Credit: 215,394,539
RAC: 314,599
Taiwan
Message 1922040 - Posted: 2 Mar 2018, 6:57:01 UTC - in response to Message 1922033.  

Good to know that I am not alone! I am running the latest GPU driver, so maybe I will just try a reinstall. Seems like I am running the same kernel release as you, 4.13.0-36-generic in my case. I was planning to test out adding a 7th GPU to the system, but the instability I am facing will complicate that.

The day Ubuntu released the Meltdown/Spectre fixes in a new kernel and concomitant with Nvidia driver 384.111, caused issues for both Juan and myself with crashes and corrupted tasks. If you hadn't updated since that, likely you got bit too. Both Juan and myself are up to the HWE release kernel 4.13.0-36 now. And moved on to Nvidia driver 390.25.

Instagram: rpc_labs
ID: 1922040 · Report as offensive
Profile RueiKe Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 14 Feb 16
Posts: 403
Credit: 215,394,539
RAC: 314,599
Taiwan
Message 1922090 - Posted: 2 Mar 2018, 12:51:29 UTC

Latest update: After reinstalling drivers, everything is running fine. Probably need to see it run a week to be certain all is well. I did have an issue after uninstalling AMD GPU drivers that I had never seen with 16.04.3. The system would not enter desktop after login. It would just cycle back to login screen. After I reinstalled drivers while in recovery mode, the problem was resolved.

I also attempted to install a 4th graphics card, by adding a Nano in addition to the 3 ProDuo cards. This was not successful. It gave an error about graphics before even getting to the login screen. Given the new problem I had with uninstall in 16.04.4, I have to wonder if it would work in 16.04.3.
Instagram: rpc_labs
ID: 1922090 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 6193
Credit: 442,138,070
RAC: 1,023,062
United States
Message 1922124 - Posted: 2 Mar 2018, 16:50:45 UTC - in response to Message 1922090.  
Last modified: 2 Mar 2018, 16:54:31 UTC

That is a common problem with lots of posts in the Linux help forums. I believe the most common issue is with permissions on the xauthority file and is caused by the removal of video drivers and and the reset of the x.org server configuration. You might want to search on "endless login loop"
[Edit] Did the work for you. Stuck in login loop (Ubuntu 16.04)

Basically what you already discovered.
Seti@Home classic workunits:20,676 CPU time:74,226 hours
ID: 1922124 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 6193
Credit: 442,138,070
RAC: 1,023,062
United States
Message 1922125 - Posted: 2 Mar 2018, 17:00:49 UTC

I know that the Nvidia driver 384.111 had the Spectre side-channel attack fix in it and was probably the reason why it trashed our tasks. The issue was what we see over in Microsoft land with no OpenCL or CUDA support in the driver when Microsoft releases a video driver. The fix is always to get the current video driver directly from Nvidia.

Maybe the same thing happened with the ATI drivers.
Seti@Home classic workunits:20,676 CPU time:74,226 hours
ID: 1922125 · Report as offensive
Profile RueiKe Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 14 Feb 16
Posts: 403
Credit: 215,394,539
RAC: 314,599
Taiwan
Message 1922244 - Posted: 3 Mar 2018, 0:38:28 UTC - in response to Message 1922124.  

Hi Keith, I had done a quick search as I first encountered the problem, but I quickly went the brute force approach of just reinstalling drivers in recovery mode as I did not want to spend my entire Friday night working on it. Hopefully my approach doesn't leave any latent issues.

I also found that 17.50 drivers are not compatible with 16.04.4. One way not to have an issue is to uninstall the drivers before kernel upgrade and reinstall after. AMD driver releases for Linux are quarterly, so the release for this quarter should be compatible. I am considering doing a clean install of 16.04.4 from iso before installing the new drivers when available. I will also retry the 4th GPU card install afterward.

That is a common problem with lots of posts in the Linux help forums. I believe the most common issue is with permissions on the xauthority file and is caused by the removal of video drivers and and the reset of the x.org server configuration. You might want to search on "endless login loop"
[Edit] Did the work for you. Stuck in login loop (Ubuntu 16.04)

Basically what you already discovered.

Instagram: rpc_labs
ID: 1922244 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 6193
Credit: 442,138,070
RAC: 1,023,062
United States
Message 1922252 - Posted: 3 Mar 2018, 1:00:25 UTC - in response to Message 1922244.  

Interesting, from the notes on 17.50 driver page, it said expected to be fixed in 16.04.4. Guess that didn't happen. I assume that is with the open source AMDGPU-Pro drivers. Have you looked at the additional open source Vulkan driver AMDVLK yet? AMD Open-Source Driver For Vulkan "AMDVLK" Is Now Available
Seti@Home classic workunits:20,676 CPU time:74,226 hours
ID: 1922252 · Report as offensive
Profile RueiKe Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 14 Feb 16
Posts: 403
Credit: 215,394,539
RAC: 314,599
Taiwan
Message 1922294 - Posted: 3 Mar 2018, 3:42:22 UTC - in response to Message 1922252.  

I think the issue mentioned on the 17.50 page is concerning a screen corruption issue (minor problem that I have seen). Seems like changes in the Kernel breaks the 17.50 unistall script. Also, there could be other compatibility issues. Hopefully the 1Q18 release comes out soon. I have not tried any open source drivers. Not enough time to investigate.

Interesting, from the notes on 17.50 driver page, it said expected to be fixed in 16.04.4. Guess that didn't happen. I assume that is with the open source AMDGPU-Pro drivers. Have you looked at the additional open source Vulkan driver AMDVLK yet? AMD Open-Source Driver For Vulkan "AMDVLK" Is Now Available

Instagram: rpc_labs
ID: 1922294 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 6193
Credit: 442,138,070
RAC: 1,023,062
United States
Message 1922298 - Posted: 3 Mar 2018, 3:52:16 UTC - in response to Message 1922294.  

I think the issue mentioned on the 17.50 page is concerning a screen corruption issue (minor problem that I have seen). Seems like changes in the Kernel breaks the 17.50 unistall script. Also, there could be other compatibility issues. Hopefully the 1Q18 release comes out soon. I have not tried any open source drivers. Not enough time to investigate.

Interesting, from the notes on 17.50 driver page, it said expected to be fixed in 16.04.4. Guess that didn't happen. I assume that is with the open source AMDGPU-Pro drivers. Have you looked at the additional open source Vulkan driver AMDVLK yet? AMD Open-Source Driver For Vulkan "AMDVLK" Is Now Available

But you are actually already running open source AMD drivers in 16.04.4. The proprietary fglrx drivers got removed in 16.04. The 16.04 repository supplies the open-source AMDGPU-PRO driver.

I was just wondering if you had tried or tested the new Vulkan drivers. Supposed to be better because it has a more modern HAL layer for the newer hardware.
Seti@Home classic workunits:20,676 CPU time:74,226 hours
ID: 1922298 · Report as offensive
Al Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Avatar

Send message
Joined: 3 Apr 99
Posts: 1676
Credit: 395,900,042
RAC: 292,382
United States
Message 1922321 - Posted: 3 Mar 2018, 4:33:33 UTC

Quick question for you Ubuntu gurus, I noticed the version was 16.04, which means that next month (approximately), 18.04 will be released as the new LTS version. Anyone hear about what might be new and improved since the last LTS release a couple years ago? 2 years is seemingly almost forever in software years...

ID: 1922321 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 6193
Credit: 442,138,070
RAC: 1,023,062
United States
Message 1922341 - Posted: 3 Mar 2018, 5:27:06 UTC - in response to Message 1922321.  

I'm looking forward to it to. Actually quite a few changes. Different Desktop, new kernel and full support for Ryzen and Threadripper. That is what I'm interested in the most. Biggest change for the 16.04 user will be the removal of Unity and putting the Gnome Desktop in place of that. Also, thankfully, they are dropping the Wayland graphics manager and sticking with good old Xorg. I was dreading the upset to Nvidia users on Wayland. This website is posting all the latest 18.04 LTS news and is being updated as things change.

Ubuntu 18.04 LTS Release Date and New Features
Seti@Home classic workunits:20,676 CPU time:74,226 hours
ID: 1922341 · Report as offensive

Message boards : Number crunching : Ubuntu 16.04.4


 
©2018 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.