Setting up Linux to crunch CUDA90 and above for Windows users

Message boards : Number crunching : Setting up Linux to crunch CUDA90 and above for Windows users
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 112 · 113 · 114 · 115 · 116 · 117 · 118 . . . 162 · Next

AuthorMessage
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 2001440 - Posted: 7 Jul 2019, 1:08:12 UTC - in response to Message 2001433.  

If the zi3v code was originally targeted at CUDA8, I think we should try that first. Should only need the download and installation of the CUDA8 package and then change the configuration to point at the CUDA8 directories. I need to speak with Sam about that. We have given up on the 0.97 code branch.


. . 0.97 works fine for me :) It is a pity it doesn't for CUDA60 ...

Stephen

? ?

Stephen, I am confusing you obviously with carrying on two conversations in the thread. There is no reason why the zi3v app couldn't be compiled for x86_64 with the CUDA 8 toolkit for the GT 730 in Linux x86_64.

https://developer.nvidia.com/cuda-80-ga2-download-archive

I am just screwed that I don't have any choice for a lesser CUDA toolkit other than CUDA10 for my Jetson Nano Tegra X1 ARM64 platform.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 2001440 · Report as offensive     Reply Quote
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 2001441 - Posted: 7 Jul 2019, 1:12:05 UTC - in response to Message 2001439.  

Actually, it may work with 6.5. It says *aarch64 - 64-bit ARM CPU architecture, like that found on Jetson TX1 onwards* https://docs.nvidia.com/cuda/cuda-installation-guide-linux/index.html#cross-platform You just need to install Ubuntu 14.04, read the manual, and chose the correct packages.
Good Luck, and Have Fun ;-)
ID: 2001441 · Report as offensive     Reply Quote
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 2001442 - Posted: 7 Jul 2019, 1:18:49 UTC - in response to Message 2001441.  

Thanks for that TBar. I actually wasn't thinking of cross-compiling on Ubuntu for aarch64. I was just thinking of compiling natively like we have done so far on the Nano.

I think cross-compiling in Ubuntu 14.04 might be the only way to accomplish getting the zi3v app compiled for CUDA6.5.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 2001442 · Report as offensive     Reply Quote
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 2001456 - Posted: 7 Jul 2019, 2:52:18 UTC - in response to Message 2001442.  

After looking at it a little harder, it seems the only ARMv8 64-bit Toolkit is the one meant to run on the aarch64 device itself, and is labeled "Generic CUDA Toolkit", https://developer.nvidia.com/cuda-toolkit-65
The one for Ubuntu 14.04 Cross-Compile is only 32 bit. So, try removing the cuda 10 Toolkit and see if the one here works on the Tegra X1, http://developer.download.nvidia.com/compute/cuda/6_5/rel/installers/cuda_6.5.14_linux_aarch64_native.run It does appear 6.5 is the only Toolkit with a section labeled Linux ARM...
ID: 2001456 · Report as offensive     Reply Quote
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 2001520 - Posted: 7 Jul 2019, 15:07:03 UTC - in response to Message 2001440.  

Stephen, I am confusing you obviously with carrying on two conversations in the thread. There is no reason why the zi3v app couldn't be compiled for x86_64 with the CUDA 8 toolkit for the GT 730 in Linux x86_64.
https://developer.nvidia.com/cuda-80-ga2-download-archive
I am just screwed that I don't have any choice for a lesser CUDA toolkit other than CUDA10 for my Jetson Nano Tegra X1 ARM64 platform.


. . OK, well TBar must not have used that resource or option when he compiled the CUDA80 version he made available because when I tested for him, while it ran on the GT730 it produced a lot of results with errors, though not all were in error. But His Cuda60 release worked without errors. I cannot follow up on that because my GT730 is now in a new home ...

Stephen

< shrug >
ID: 2001520 · Report as offensive     Reply Quote
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 2001545 - Posted: 7 Jul 2019, 19:50:42 UTC

Well the ball is again in my court. Just need to install the CUDA6.5 toolkit and make another run at compiling. I really want to drop the number of Inconclusives even if the crunch times go longer.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 2001545 · Report as offensive     Reply Quote
Loren Datlof

Send message
Joined: 24 Jan 14
Posts: 73
Credit: 19,652,385
RAC: 0
United States
Message 2001556 - Posted: 7 Jul 2019, 22:34:57 UTC

Two quick questions about upgrading from the special app that runs CUDA 90 to the one that runs CUDA101:

1. Will the 101 WUs crunch faster than the 90WUs?
2. If so, do I have to exhaust my cache before the upgrade to avoid computation errors?

The hosts I am looking to upgrade:

https://setiathome.berkeley.edu/show_host_detail.php?hostid=8657594

https://setiathome.berkeley.edu/show_host_detail.php?hostid=8706475
ID: 2001556 · Report as offensive     Reply Quote
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 2001558 - Posted: 7 Jul 2019, 23:25:25 UTC - in response to Message 2001556.  

I would say to not bother since the CUDA101 is more suited for the new Turing cards to see any performance benefit. Since you have older Pascal low-mid range cards, I would just stick with the CUDA90 app.

If you do decide to change don't do anything more than change the appname in the two places in the app_info. Don't do anything else. You don't have to run your cache down at all. Just stop BOINC, edit the new filename, save the app_info and restart BOINC
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 2001558 · Report as offensive     Reply Quote
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 2001562 - Posted: 8 Jul 2019, 0:03:05 UTC
Last modified: 8 Jul 2019, 0:03:56 UTC

. . @ Keith

. . Speaking of updating, as a volunteer who has used both Cuda90 0.97 and 0.98b over several platforms. would you say there is any significant improvement in run times with the latter version? That is, do you feel there has been any improvement in efficiency?

Stephen

? ?
ID: 2001562 · Report as offensive     Reply Quote
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 2001570 - Posted: 8 Jul 2019, 1:09:40 UTC - in response to Message 2001562.  

I would say there is an improvement in the 0.98 code compared to 0.97. I proved that to myself when I benched the betas for Petri before wide release. But Ian states that his beta of the 0.98b1 CUDA10 with driver version 410 beats out the 0.98b1 CUDA101 version with drivers 418. He kept his beta release and is running that to great effect. I was not a tester of that version so never got a chance to benchmark it.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 2001570 · Report as offensive     Reply Quote
Loren Datlof

Send message
Joined: 24 Jan 14
Posts: 73
Credit: 19,652,385
RAC: 0
United States
Message 2001571 - Posted: 8 Jul 2019, 1:12:55 UTC - in response to Message 2001558.  

I would say to not bother since the CUDA101 is more suited for the new Turing cards to see any performance benefit. Since you have older Pascal low-mid range cards, I would just stick with the CUDA90 app.

If you do decide to change don't do anything more than change the appname in the two places in the app_info. Don't do anything else. You don't have to run your cache down at all. Just stop BOINC, edit the new filename, save the app_info and restart BOINC

Thanks for the info. Not knowing that the 101 is for Turing cards I am running the 101 on two hosts with GTX 750 Ti cards. Would it make sense to downgrade to 90 or just leave well enough alone?
ID: 2001571 · Report as offensive     Reply Quote
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 2001573 - Posted: 8 Jul 2019, 1:53:48 UTC - in response to Message 2001571.  

Don't know for the low level cards how they would respond to the CUDA90 app. Try it and report back.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 2001573 · Report as offensive     Reply Quote
Loren Datlof

Send message
Joined: 24 Jan 14
Posts: 73
Credit: 19,652,385
RAC: 0
United States
Message 2001574 - Posted: 8 Jul 2019, 2:04:29 UTC - in response to Message 2001573.  

Don't know for the low level cards how they would respond to the CUDA90 app. Try it and report back.


I am already running the CUDA90 app on similar cards and it seems pretty fast to me. See this host
https://setiathome.berkeley.edu/show_host_detail.php?hostid=8657594
ID: 2001574 · Report as offensive     Reply Quote
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 2001575 - Posted: 8 Jul 2019, 2:20:58 UTC - in response to Message 2001574.  
Last modified: 8 Jul 2019, 2:36:05 UTC

Try to find a host with a 1060 that is running the CUDA101 app for comparison.

[Edit] Or download Rick's benchmark tool and do the comparison between the two apps yourself in a controlled test with the same WU's.

https://github.com/Ricks-Lab/benchMT
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 2001575 · Report as offensive     Reply Quote
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 2001576 - Posted: 8 Jul 2019, 2:41:33 UTC - in response to Message 2001575.  

Try to find a host with a 1060 that is running the CUDA101 app for comparison.
Like this 1?

https://setiathome.berkeley.edu/show_host_detail.php?hostid=8150019

Cheers.
ID: 2001576 · Report as offensive     Reply Quote
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 2001584 - Posted: 8 Jul 2019, 3:43:34 UTC

Comparing the two looks like the CUDA101 app has the best and most consistent times. Loren, you have more cpu resources and I don't see any cpu tasks being run or at least not for Seti, so lack of cpu support for the gpu tasks is not likely. The 1060 3GB cards on Vyper's host seem to be turning in better times and more consistently.

I would say go for the CUDA101 app and see if you can replicate the same times as Vyper's host.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 2001584 · Report as offensive     Reply Quote
Loren Datlof

Send message
Joined: 24 Jan 14
Posts: 73
Credit: 19,652,385
RAC: 0
United States
Message 2001591 - Posted: 8 Jul 2019, 5:06:45 UTC

Just replacing those two lines in the app_info.xml caused computation errors. So I am setting NNT and will then change to CUDA101.
ID: 2001591 · Report as offensive     Reply Quote
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 2001601 - Posted: 8 Jul 2019, 7:16:47 UTC - in response to Message 2001591.  

So, are you running the required 418 series drivers? The readme states that is necessary to run the CUDA101 application in the same sentence it tells you to swap appnames in the app_info file.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 2001601 · Report as offensive     Reply Quote
Ian&Steve C.
Avatar

Send message
Joined: 28 Sep 99
Posts: 4267
Credit: 1,282,604,591
RAC: 6,640
United States
Message 2001625 - Posted: 8 Jul 2019, 13:11:34 UTC - in response to Message 2001601.  
Last modified: 8 Jul 2019, 13:14:52 UTC

Yeah I’m guessing he tried it on the system with 396 drivers.

He currently has 2 qualifying systems. One running 418 drivers and another running 430.

He has a 3rd system with a 1050ti and 390 drivers, and a 4th with some 1060s and 396 drivers. It looks like that’s the one he tried. He’ll have to upgrade the drivers to run 10.1 on those systems.
Seti@Home classic workunits: 29,492 CPU time: 134,419 hours

ID: 2001625 · Report as offensive     Reply Quote
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 2001634 - Posted: 8 Jul 2019, 14:14:58 UTC - in response to Message 2001570.  

I would say there is an improvement in the 0.98 code compared to 0.97. I proved that to myself when I benched the betas for Petri before wide release. But Ian states that his beta of the 0.98b1 CUDA10 with driver version 410 beats out the 0.98b1 CUDA101 version with drivers 418. He kept his beta release and is running that to great effect. I was not a tester of that version so never got a chance to benchmark it.


. . Would you say that improvement (0.98b over 0.97) is enough to bother updating? I am running 0.97 on both the rig with 970s and the rig with the 1050ti (which sadly is using the repository version of BOINC) so after reading things like the comment that the 101 version is only useful on Turing cards I am not confident it would make any significant difference on my rigs.

Stephen

? ?
ID: 2001634 · Report as offensive     Reply Quote
Previous · 1 . . . 112 · 113 · 114 · 115 · 116 · 117 · 118 . . . 162 · Next

Message boards : Number crunching : Setting up Linux to crunch CUDA90 and above for Windows users


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.