Step by Step, compile NVIDIA/MB CUDA app under Linux (Fedora 19)

Message boards : Number crunching : Step by Step, compile NVIDIA/MB CUDA app under Linux (Fedora 19)
Message board moderation

To post messages, you must log in.

Previous · 1 · 2

AuthorMessage
Profile ivan
Volunteer tester
Avatar

Send message
Joined: 5 Mar 01
Posts: 783
Credit: 348,560,338
RAC: 223
United Kingdom
Message 1527048 - Posted: 11 Jun 2014, 21:12:51 UTC - in response to Message 1526489.  
Last modified: 11 Jun 2014, 21:15:13 UTC

**And again, I get "Compiling entry function...sm_10, sm_11, sm_12, sm_13, sm_21, and sm_20", but nothing for sm_30 or sm_35

Maxwell Compatability Guide says to use
/usr/local/cuda/bin/nvcc
  -gencode=arch=compute_20,code=sm_20
  -gencode=arch=compute_30,code=sm_30
  -gencode=arch=compute_35,code=sm_35
  -gencode=arch=compute_35,code=compute_35
  -O2 -o mykernel.o -c mykernel.cu

Note the omission of the 1xs -- on compiling my hologram application with CUDA 6 on Windows (haven't upgraded Linux yet) I got warnings that they are deprecated and soon to be removed. OTOH, after putting all the recommended options into the Windows compile my GTX 750 Ti still ran my reconstructions at exactly the same speed.
ID: 1527048 · Report as offensive
Profile petri33
Volunteer tester

Send message
Joined: 6 Jun 02
Posts: 1668
Credit: 623,086,772
RAC: 156
Finland
Message 1527063 - Posted: 11 Jun 2014, 21:46:14 UTC - in response to Message 1527048.  

How about Maxwell being compute 50 ...

-gencode=arch=compute_50,code=sm_50
-gencode=arch=compute_50,code=compute_50

Just a guess.
To overcome Heisenbergs:
"You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones
ID: 1527063 · Report as offensive
Profile ivan
Volunteer tester
Avatar

Send message
Joined: 5 Mar 01
Posts: 783
Credit: 348,560,338
RAC: 223
United Kingdom
Message 1527107 - Posted: 12 Jun 2014, 0:17:03 UTC - in response to Message 1527063.  
Last modified: 12 Jun 2014, 0:36:31 UTC

How about Maxwell being compute 50 ...

-gencode=arch=compute_50,code=sm_50
-gencode=arch=compute_50,code=compute_50

Just a guess.

Now that's a puzzle. Yes, my results are saying "computeCap 5.0". I'll have to check tomorrow whether the URL I quoted matches the documentation I have on my Windows systems.

[Edit]
Actually, due to the wonders of broadband, ssh, and cygwin:
admD405@W7-SE-D304-01 /cygdrive/d/ProgramData/HoloMov/HoloMovie
$ grep sm HoloMovie_vs2010.vcxproj
      <CodeGeneration>compute_10,sm_10;compute_20,sm_20</CodeGeneration>
      <CodeGeneration>compute_10,sm_10;compute_20,sm_20</CodeGeneration>
      <CodeGeneration>compute_10,sm_10;compute_20,sm_20</CodeGeneration>
 <CodeGeneration>compute_20,sm_20;compute_30,sm_30;compute_35,sm_35;compute_50,sm_50;compute_50,compute_50</CodeGeneration>
so yes, my latest docs did specify sm_50 -- but I still need to check that I actually applied it to x64 Release version...
[/Edit]
ID: 1527107 · Report as offensive
Previous · 1 · 2

Message boards : Number crunching : Step by Step, compile NVIDIA/MB CUDA app under Linux (Fedora 19)


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.