Ubuntu 18.04 desktop x64 and iGPU

Message boards : Number crunching : Ubuntu 18.04 desktop x64 and iGPU
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4

AuthorMessage
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1973789 - Posted: 6 Jan 2019, 11:04:03 UTC - in response to Message 1973787.  

You are running the CUDA9.0 version. Maybe it´s better if you try to run with: setiathome v8 enhanced x41p_V0.97b2, Cuda 9.20 special who was the latest avaliable for the CUDA 9 builds.

If you can´t find it PM and i will send it for you.

I have this one: "setiathome_x41p_V0.97b2_Linux-Pascal+_cuda92"

Will it run with 396 driver ?


From Ian & Steve post:

CUDA90/91 i believe can use the the normal repository driver (version 380 something?)
CUDA92 will need at least nvidia driver 396 or higher. 
CUDA100 will need at least nvidia driver 410 or higher. 


So i believe yes.
ID: 1973789 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1973794 - Posted: 6 Jan 2019, 12:39:12 UTC - in response to Message 1973779.  

  PID USER      PR  NI    VIRT    RES    SHR S  %CPU %MEM     TIME+ COMMAND                                                                                                      
 5996 r         39  19   52788  35956   4536 R  99,7  0,4  22:08.57 MBv8_8.22r3711_                                                                                              
 5939 r         39  19   52756  35928   4536 R  97,7  0,4  35:56.98 MBv8_8.22r3711_                                                                                              
 5937 r         39  19   53264  43464   4536 R  95,7  0,5  35:58.28 MBv8_8.22r3711_                                                                                              
 6010 r         39  19   53264  43460   4536 R  91,7  0,5  18:53.41 MBv8_8.22r3711_                                                                                              
 6069 r         30  10 14,901g 650504 340880 R   8,6  8,0   0:17.73 setiathome_x41p                                                                                              
  973 root     -51   0       0      0      0 S   2,7  0,0  24:23.97 irq/30-nvidia                                                                                                
 1105 r         20   0 4011528 332288  93460 S   0,7  4,1   1:08.97 gnome-shell 
The way I see it, you aren't taking you're own advice and freeing a CPU core to run the GPU. The biggest user after the CPUs is the GPU. Since the CPUs are trying to use every bit of your 4 cores, the GPU has to use part of what the CPUs are trying to use. It's a typical SETI Cluster, both the CPUs and GPU run slower because CPU time isn't allotted to the GPU. Happens a Lot around here.

I would suggest you read your own ReadMe and free a CPU core for the GPU before changing anything else.
;-)
ID: 1973794 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1973808 - Posted: 6 Jan 2019, 14:43:04 UTC
Last modified: 6 Jan 2019, 14:50:46 UTC

Following the suggestion of Tbar, i suggest you to do the test first without running any CPU WU. Do the tests on the GPU only and when you be satisfied with the test slowly increase some CPU tasks. Take some time to test with or without -nobs. In my host that makes a difference of a pair of seconds. Probably the sweet spot on your 4 core will be 2 CPU + 1 GPU. But that needs to be tested as usual.
ID: 1973808 · Report as offensive
Sleepy
Volunteer tester
Avatar

Send message
Joined: 21 May 99
Posts: 219
Credit: 98,947,784
RAC: 28,360
Italy
Message 1973811 - Posted: 6 Jan 2019, 15:02:58 UTC - in response to Message 1973447.  

dont think you can run the monitor off the iGPU while crunching on the GPUs.
Some lot of water has flowed since this message, but anyway I reply, hoping it is still useful.

I do on one of my crunchers.
You just need to prepare an xorg.conf file.
I put the relevant links to information I followed here: http://setiathome.berkeley.edu/forum_thread.php?id=81271&postid=1952070.
You can skip all the driver installation things and just rely on the xorg.conf information. That is all that it takes, after you are are already running the right NVidia drivers. I realised that after some time.
It works as a charm.

If you need more help I am here.
If I can give back just 0.00001 % what Raistmer hase been giving to the community I will be happy to.
ID: 1973811 · Report as offensive
Ian&Steve C.
Avatar

Send message
Joined: 28 Sep 99
Posts: 4267
Credit: 1,282,604,591
RAC: 6,640
United States
Message 1973828 - Posted: 6 Jan 2019, 17:10:29 UTC - in response to Message 1973811.  

Can you elaborate? You say all you have to do is prepare the xorg config file, but the post you linked to says otherwise

- I found no need to edit xorg.conf.

Seti@Home classic workunits: 29,492 CPU time: 134,419 hours

ID: 1973828 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 1973837 - Posted: 6 Jan 2019, 17:39:21 UTC - in response to Message 1973811.  

dont think you can run the monitor off the iGPU while crunching on the GPUs.
Some lot of water has flowed since this message, but anyway I reply, hoping it is still useful.

I do on one of my crunchers.
You just need to prepare an xorg.conf file.
I put the relevant links to information I followed here: http://setiathome.berkeley.edu/forum_thread.php?id=81271&postid=1952070.
You can skip all the driver installation things and just rely on the xorg.conf information. That is all that it takes, after you are are already running the right NVidia drivers. I realised that after some time.
It works as a charm.

If you need more help I am here.
If I can give back just 0.00001 % what Raistmer hase been giving to the community I will be happy to.


Thanks! I'll just keep this info for now and return to multi-GPU configs later when single one will be tuned enough.

Regarding CPU stealing cycles from GPU - yep, I'll test it on this setup thoroughly.

Unfortunately, there are inconclusives started to develop, This one, for example:
http://setiathome.berkeley.edu/result.php?resultid=7308235024
GPU found additional Triplet. CPU missed it. As one can see from stderr, there are many triplets in single PoT detected so app switched to re-processing on CPU. SoG does the same with triplets.
That is, CPU processing somehow different. To ensure correct operation of CUDA app is the main target, optimal performance on particula host is nice but less important.
I'll try to catch that WU and check it with different apps.
SETI apps news
We're not gonna fight them. We're gonna transcend them.
ID: 1973837 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1973840 - Posted: 6 Jan 2019, 18:19:48 UTC - in response to Message 1973778.  

Please advise some tools like Windows's GPU-Z/CPU-Z for Linux.
For CPU usage I could use top command, what about GPU?

TOP covers just about everything related to cpu.

You have a couple of tools for the gpu. You have the Nvidia X Server Settings app installed by the drivers in your applications. You can always look at the gpu/s for their temps, fan speeds and clock frequencies.

nvidia-smi on the command line shows utilization, temps and power usage.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1973840 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 1973884 - Posted: 6 Jan 2019, 21:42:37 UTC

Here https://cloud.mail.ru/public/HzTH/FczfHokkg I will put TestCases for CUDA Special app along with testing results.
For now excess Triplet under investigation.
SETI apps news
We're not gonna fight them. We're gonna transcend them.
ID: 1973884 · Report as offensive
Juhani Karjanlahti Project Donor
Volunteer tester

Send message
Joined: 23 Jan 03
Posts: 15
Credit: 83,675,733
RAC: 149
Finland
Message 1973993 - Posted: 7 Jan 2019, 6:45:25 UTC - in response to Message 1973837.  

If I remember right, upgrading Special app to version x41p_V0.97b2 should help with missing pulses
You can find it here https://drive.google.com/file/d/1y6u53aJ_1nKqWdi0pngmAWKTB3hK4QWO/view?usp=sharing
ID: 1973993 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1974021 - Posted: 7 Jan 2019, 8:10:02 UTC - in response to Message 1973993.  
Last modified: 7 Jan 2019, 8:17:18 UTC

The version I gave Raistmer is the latest version. It is the version I recommended to Eric for BETA.
Of course, I haven't heard anything from Eric since...and then Raistmer showed up, finally willing to run Linux. If Eric gets a version from Me it will be the version I gave Raistmer, which works with the repository drivers for 16.04 & 18.04.
<shrugs>
ID: 1974021 · Report as offensive
Juhani Karjanlahti Project Donor
Volunteer tester

Send message
Joined: 23 Jan 03
Posts: 15
Credit: 83,675,733
RAC: 149
Finland
Message 1974025 - Posted: 7 Jan 2019, 8:36:32 UTC - in response to Message 1974021.  

Only tried to help. Let me explain:
I was earlier looking some Raistmers validated tasks and stderr output indicates use of "setiathome v8 enhanced x41p_V0.97, Cuda 9.00 special", not "x41p_V0.97b2", which I found on my tasks.
So I thought he is running earlier version of Special Sauce.
ID: 1974025 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1974371 - Posted: 9 Jan 2019, 22:59:26 UTC

BTW, I was trying to find success with my new ASUS mining board which seems to suffer stalled tasks when more than 8 gpus are used. I decided to try the older OSes since the newer ones weren't working very well. First try was with Ubuntu 14.04.5, where I found they have NV driver 384.130 in the repository. 384.130 works with the cuda90 App, so, that means the App will work with the repository drivers for 14.04, 16.04, and 18.04. 5 Years of Linux system repository drivers will work with the App Raistmer is running, which is what a SETI App needs to do. Now that he has corrected his setup, the App is just as fast as any of the others even without running nobs, which is what my tests show. Looks good to me, in fact, it seems the server decided his inconclusive results were far too low and introduced him to Mr. Bad Wingman.

In other news, 14.04.5 worked fine with 8 gpus but wouldn't use more than 8. 15.04 on the other hand seems to work fine with 10 gpus, without any stalls. It's a shame the ASUS board seems a little slower than the Gigabyte mining board. I'll have to try the Gigabyte one again when the replacement arrives. Now to try 12 on the ASUS board, I had 12 working on the Gigabyte board just before it died.
ID: 1974371 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1975975 - Posted: 19 Jan 2019, 22:58:11 UTC - in response to Message 1973884.  

Here https://cloud.mail.ru/public/HzTH/FczfHokkg I will put TestCases for CUDA Special app along with testing results.
For now excess Triplet under investigation.
It seems all these Arecibo tasks have the Inconclusive rates much higher than normal. As I mentioned to Eric, the App does much better on the BLC tasks. Looking at your current Inconclusive results it appears there are 4 BLC GPU tasks, the rest are Arecibos, with most being Bad Best Pulses on the shorter tasks. The Bad Best pulses are another one of those problems that has been present for a Very long time, about as long as the resuming suspended tasks problem. At least we know it's not associated with the Unroll function, I believe the Unroll is only used on the VLARs, and most of the Inconclusive tasks are not VLARs. Oh well...
ID: 1975975 · Report as offensive
Previous · 1 · 2 · 3 · 4

Message boards : Number crunching : Ubuntu 18.04 desktop x64 and iGPU


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.