I've Built a Couple OSX CUDA Apps...

Author	Message
Chris Adamek Volunteer tester Send message Joined: 15 May 99 Posts: 251 Credit: 434,772,072 RAC: 236	Message 1766535 - Posted: 20 Feb 2016, 19:36:32 UTC - in response to Message 1766497. I looked at that after someone suggested it in the Apple Forums. Mine are greyed out, don't work for Mavericks & Yosemite. The One for Lion still works though. Seems Apple has removed the Previous Free versions and now only offers El Capitan as a free version. Someone said if you email the Apple Store they will offer the Previous Free versions for around $20. Don't know if that's correct or not, I just read it on the internet ;-) That's odd, both show up as downloadable on my machines... Let me see if I can figure out an easy way to host a couple of 5GB files. Lol ID: 1766535 ·

TBar Volunteer tester Send message Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768	Message 1766563 - Posted: 20 Feb 2016, 21:33:42 UTC Here's another Darwin 11.4.2 machine failing with the CPU Apps, http://setiathome.berkeley.edu/show_host_detail.php?hostid=6250696. Seems it's the LapTops with an i5 or i7 CPU having the problems. It also seems they mostly work with the SETI@home v8 8.03 i686-apple-darwin App. Unfortunately, the Server has decided the App that Fails is Faster than the App that sometimes works and is determined to keep sending the App that Fails. Perhaps a Server Tweak to only send SETI@home v8 8.03 i686-apple-darwin to Hosts running Darwin 11.4.2 with an i5 or i7 CPU would work? With a hint to upgrade the OS to run the Optimized CPU Apps? There are a number of Older Hosts without an i5 or i7 that can't update to a newer OS and don't have this CPU problem. So, whatever is done should only affect the Hosts with the i-series CPUs. ID: 1766563 ·

TBar Volunteer tester Send message Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768	Message 1766626 - Posted: 21 Feb 2016, 4:25:26 UTC OK, I think this will work; MB CPU Apps for SETIv8. The SSE41 App will work with Core2 CPUs with at least SSE 4.1 in Snow Leopard and above, and with AVX CPUs in Mountain Lion and above. The AVX App will work with AVX CPUs in Mountain Lion and above. http://www.arkayn.us/forum/index.php?topic=191.msg4369#msg4369 Now to see if we can get these CPU Apps that work with something higher than SSE3 onto Beta. ID: 1766626 ·

TBar Volunteer tester Send message Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768	Message 1769935 - Posted: 6 Mar 2016, 2:20:14 UTC Still No new Mac Apps on Beta... In other News, it appears I've finally managed to fix the problem with boinc-master in Mavericks and above. Seems it's simply NOT building the Libraries by just running Make. It works in Mountain Lion and below. In Yosemite after running Make you have to cd to the api & lib folders and run Make there as well...else it doesn't build the Required Libraries. This happens with both boinc-master versions 7.5 and 7.7. Unfortunately, the Apps I've built so far aren't any faster than those compiled in Mountain Lion. I'm also Still getting the Object.h Error with Petris' code whereas I don't get that Error with Jasons' code. It would be nice to compile the New code with NVCC 7.5... /usr/include/dispatch/object.h(145): error: expected an identifier /usr/include/dispatch/object.h(367): error: identifier "dispatch_block_t" is undefined 2 errors detected in the compilation of "/var/folders/6c/hy0ffg_90sz3wbxfk7gzw3800000gn/T//tmpxft_0000d160_00000000-19_cudaAcceleratio n.compute_52.cpp1.ii". I'm also seeing a large number of these errors; In file included from analyzeFuncs.cpp:128: ./cuda/cudaAcceleration.h:160:35: error: invalid output constraint '=f' in asm asm("ld.global.f32 %0, [%1];" : "=f"(v) : "l"(addr+offset)); but, I can Kill those by just removing what appears to be something dealing with a GTX 780. Being able to compile in Yosemite is a MAJOR breakthrough... ID: 1769935 ·

petri33 Volunteer tester Send message Joined: 6 Jun 02 Posts: 1668 Credit: 623,086,772 RAC: 156	Message 1769969 - Posted: 6 Mar 2016, 7:56:16 UTC - in response to Message 1769935. To fix dispatch_block_t error see http://hamelot.co.uk/programming/osx-gcc-dispatch_block_t-has-not-been-declared-invalid-typedef/ Still No new Mac Apps on Beta... In other News, it appears I've finally managed to fix the problem with boinc-master in Mavericks and above. Seems it's simply NOT building the Libraries by just running Make. It works in Mountain Lion and below. In Yosemite after running Make you have to cd to the api & lib folders and run Make there as well...else it doesn't build the Required Libraries. This happens with both boinc-master versions 7.5 and 7.7. Unfortunately, the Apps I've built so far aren't any faster than those compiled in Mountain Lion. I'm also Still getting the Object.h Error with Petris' code whereas I don't get that Error with Jasons' code. It would be nice to compile the New code with NVCC 7.5... /usr/include/dispatch/object.h(145): error: expected an identifier /usr/include/dispatch/object.h(367): error: identifier "dispatch_block_t" is undefined 2 errors detected in the compilation of "/var/folders/6c/hy0ffg_90sz3wbxfk7gzw3800000gn/T//tmpxft_0000d160_00000000-19_cudaAcceleratio n.compute_52.cpp1.ii". I'm also seeing a large number of these errors; In file included from analyzeFuncs.cpp:128: ./cuda/cudaAcceleration.h:160:35: error: invalid output constraint '=f' in asm asm("ld.global.f32 %0, [%1];" : "=f"(v) : "l"(addr+offset)); but, I can Kill those by just removing what appears to be something dealing with a GTX 780. Being able to compile in Yosemite is a MAJOR breakthrough... To overcome Heisenbergs: "You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones ID: 1769969 ·

jason_gee Volunteer developer Volunteer tester Send message Joined: 24 Nov 06 Posts: 7489 Credit: 91,093,184 RAC: 0	Message 1769972 - Posted: 6 Mar 2016, 8:07:38 UTC - in response to Message 1769969. Should work fine. It's those kindof holes messing up cross platform. Price to pay living on the bleeding edge :D "Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions. ID: 1769972 ·

petri33 Volunteer tester Send message Joined: 6 Jun 02 Posts: 1668 Credit: 623,086,772 RAC: 156	Message 1769973 - Posted: 6 Mar 2016, 8:07:59 UTC - in response to Message 1769969. The other error in line 160 149 #if (__CUDA_ARCH__ > 350) 150 _device__ float inline LDG_f_ca(float addr, const int offset) 151 { 152 float v; 153 asm("ld.global.ca.nc.f32 %0, [%1];" : "=f"(v) : "l"(addr+offset)); 154 return v; 155 } 156 #else 157 __device__ float inline LDG_f_ca(float addr, const int offset) 158 { 159 float v; 160 asm("ld.global.f32 %0, [%1];" : "=f"(v) : "l"(addr+offset)); 161 return v; 162 } 163 #endif That is valid PTX code. If the mac cuda compiler does not understand that You can replace it with this: __device__ float inline LDG_f_ca(float * __restrict__ addr, const int offset) { return addr[offset]; // slower than original PTX } To overcome Heisenbergs: "You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones ID: 1769973 ·

jason_gee Volunteer developer Volunteer tester Send message Joined: 24 Nov 06 Posts: 7489 Credit: 91,093,184 RAC: 0	Message 1769975 - Posted: 6 Mar 2016, 8:10:26 UTC - in response to Message 1769973. Hmmm, if that's part of the lgamma code, cost could be excessive. Any idea what nv/Apple might have broken in there ? "Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions. ID: 1769975 ·

TBar Volunteer tester Send message Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768	Message 1769978 - Posted: 6 Mar 2016, 8:17:35 UTC - in response to Message 1769969. I found the same suggestion here, http://stackoverflow.com/questions/27976312/how-to-cope-with-non-gcc-compatible-code-in-os-x-yosemite-core-headers It doesn't work though. I even tried the Availability.h change with it. I've moved the MacOSX10.10.sdk/usr/include folder to the usr folder as it seems the compiler can't find it buried in the Xcode App. After the failure I realized the Answer doesn't apply to the problem. I'm Not using GCC, I'm using Clang, the answer is for people using GCC, header file contains code that can be processed by clang but not GCC. I don't have the problem running Jasons' code, I've managed to use NVCC 7.5 to compile setiathome_x41zi_x86_64-apple-darwin_cuda75. It works too... I have a feeling if I could compile your code with ToolKit 7.5 in Yosemite it might work better than ToolKit 6.5 in Mountain Lion. ID: 1769978 ·

petri33 Volunteer tester Send message Joined: 6 Jun 02 Posts: 1668 Credit: 623,086,772 RAC: 156	Message 1769979 - Posted: 6 Mar 2016, 8:18:05 UTC - in response to Message 1769975. Hmmm, if that's part of the lgamma code, cost could be excessive. Any idea what nv/Apple might have broken in there ? I do not know about Apple. If TBar could tell something more about the errors. Is the line 160 the only one or if he fixes the line 160 what is the next line that produces error(s). My guess is that all ld. -ptx lines. To overcome Heisenbergs: "You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones ID: 1769979 ·

petri33 Volunteer tester Send message Joined: 6 Jun 02 Posts: 1668 Credit: 623,086,772 RAC: 156	Message 1769980 - Posted: 6 Mar 2016, 8:22:34 UTC Something causes object.h to be included. Can you see any difference in #include statements in the file that causes the error. CudaAcceleration.cu might be the file to look. To overcome Heisenbergs: "You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones ID: 1769980 ·

jason_gee Volunteer developer Volunteer tester Send message Joined: 24 Nov 06 Posts: 7489 Credit: 91,093,184 RAC: 0	Message 1769981 - Posted: 6 Mar 2016, 8:25:46 UTC - in response to Message 1769979. Well when we get it all sussed out (eventually) it needs to go to nv (which I'll help with). It's the gcc to clang transition that's throwing me, and maintaining both in working order, well... wonder if we have to start deprecating some things on the stock side (which I'm purposely resisting) "Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions. ID: 1769981 ·

petri33 Volunteer tester Send message Joined: 6 Jun 02 Posts: 1668 Credit: 623,086,772 RAC: 156	Message 1769982 - Posted: 6 Mar 2016, 8:27:36 UTC - in response to Message 1769980. Something causes object.h to be included. Can you see any difference in #include statements in the file that causes the error. CudaAcceleration.cu might be the file to look. Try changing my code in CudaAcceleration.cu near line 50 to #ifdef _WIN32 #include "nvapi_device.h" #endif That makes the mac version not to inlcude nvapi_device.h I do not know if that helps or if it breaks some other code then. To overcome Heisenbergs: "You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones ID: 1769982 ·

jason_gee Volunteer developer Volunteer tester Send message Joined: 24 Nov 06 Posts: 7489 Credit: 91,093,184 RAC: 0	Message 1769983 - Posted: 6 Mar 2016, 8:29:33 UTC - in response to Message 1769980. Something causes object.h to be included. Can you see any difference in #include statements in the file that causes the error. CudaAcceleration.cu might be the file to look. Will be able to have a dig a bit later, though suspect some kindof include deeper in. Probably just a nameclash that needs an #ifndef guard "Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions. ID: 1769983 ·

jason_gee Volunteer developer Volunteer tester Send message Joined: 24 Nov 06 Posts: 7489 Credit: 91,093,184 RAC: 0	Message 1769984 - Posted: 6 Mar 2016, 8:30:47 UTC - in response to Message 1769982. Something causes object.h to be included. Can you see any difference in #include statements in the file that causes the error. CudaAcceleration.cu might be the file to look. Try changing my code in CudaAcceleration.cu near line 50 to #ifdef _WIN32 #include "nvapi_device.h" #endif That makes the mac version not to inlcude nvapi_device.h I do not know if that helps or if it breaks some other code then. Might do, either way that include is my Windows driver level one, so should be inactive anyway. "Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions. ID: 1769984 ·

TBar Volunteer tester Send message Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768	Message 1769987 - Posted: 6 Mar 2016, 9:07:25 UTC - in response to Message 1769979. Hmmm, if that's part of the lgamma code, cost could be excessive. Any idea what nv/Apple might have broken in there ? I do not know about Apple. If TBar could tell something more about the errors. Is the line 160 the only one or if he fixes the line 160 what is the next line that produces error(s). My guess is that all ld. -ptx lines. I'm getting numerous groups similar to; In file included from confsettings.cpp:6: ./cuda/cudaAcceleration.h:160:35: error: invalid output constraint '=f' in asm asm("ld.global.f32 %0, [%1];" : "=f"(v) : "l"(addr+offset)); ./cuda/cudaAcceleration.h:168:50: error: invalid output constraint '=f' in asm asm("ld.global.ca.nc.v2.f32 {%0, %1}, [%2];" : "=f"(v.x), "=f"(v.y) : "l"(addr+offset)); ./cuda/cudaAcceleration.h:176:58: error: invalid output constraint '=f' in asm asm("ld.global.ca.nc.v4.f32 {%0, %1, %2, %3}, [%4];" : "=f"(v.x), "=f"(v.y), "=f"(v.z), "=f"(v.w) : "l"(addr+offset)); ./cuda/cudaAcceleration.h:184:41: error: invalid output constraint '=f' in asm asm("ld.global.cg.nc.f32 %0, [%1];" : "=f"(v) : "l"(addr+offset)); ./cuda/cudaAcceleration.h:191:50: error: invalid output constraint '=f' in asm asm("ld.global.cg.nc.v2.f32 {%0, %1}, [%2];" : "=f"(v.x), "=f"(v.y) : "l"(addr+offset)); ./cuda/cudaAcceleration.h:198:58: error: invalid output constraint '=f' in asm asm("ld.global.cg.nc.v4.f32 {%0, %1, %2, %3}, [%4];" : "=f"(v.x), "=f"(v.y), "=f"(v.z), "=f"(v.w) : "l"(addr+offset)); ./cuda/cudaAcceleration.h:206:41: error: invalid output constraint '=f' in asm asm("ld.global.cs.nc.f32 %0, [%1];" : "=f"(v) : "l"(addr+offset)); ./cuda/cudaAcceleration.h:213:50: error: invalid output constraint '=f' in asm asm("ld.global.cs.nc.v2.f32 {%0, %1}, [%2];" : "=f"(v.x), "=f"(v.y) : "l"(addr+offset)); ./cuda/cudaAcceleration.h:220:58: error: invalid output constraint '=f' in asm asm("ld.global.cs.nc.v4.f32 {%0, %1, %2, %3}, [%4];" : "=f"(v.x), "=f"(v.y), "=f"(v.z), "=f"(v.w) : "l"(addr+offset)); ./cuda/cudaAcceleration.h:228:41: error: invalid output constraint '=f' in asm asm("ld.global.lu.nc.f32 %0, [%1];" : "=f"(v) : "l"(addr+offset)); ./cuda/cudaAcceleration.h:235:50: error: invalid output constraint '=f' in asm asm("ld.global.lu.nc.v2.f32 {%0, %1}, [%2];" : "=f"(v.x), "=f"(v.y) : "l"(addr+offset)); ./cuda/cudaAcceleration.h:242:58: error: invalid output constraint '=f' in asm asm("ld.global.lu.nc.v4.f32 {%0, %1, %2, %3}, [%4];" : "=f"(v.x), "=f"(v.y), "=f"(v.z), "=f"(v.w) : "l"(addr+offset)); ./cuda/cudaAcceleration.h:251:58: error: invalid output constraint '=f' in asm asm("ld.global.cs.nc.v4.f32 {%0, %1, %2, %3}, [%4];" : "=f"(v.x), "=f"(v.y), "=f"(v.z), "=f"(v.w) : "l"(addr+offset)); ./cuda/cudaAcceleration.h:260:43: error: invalid output constraint '=f' in asm asm("ld.global.cs.nc.f32 {%0}, [%1];" : "=f"(v) : "l"(addr+offset)); ./cuda/cudaAcceleration.h:280:39: error: invalid output constraint '=f' in asm asm("mul.rn.sat.f32 %0, %1, %2 ;" : "=f"(res) : "f"(a), "f"(b)); ./cuda/cudaAcceleration.h:294:36: error: invalid output constraint '=f' in asm asm("cvt.rni.f32.f32 %0, %1 ;" : "=f"(res) : "f"(a)); ./cuda/cudaAcceleration.h:301:35: error: invalid output constraint '=f' in asm asm("cvt.rz.f32.f64 %0, %1 ;" : "=f"(res) : "d"(a)); The next group starts; In file included from analyzeFuncs.cpp:128: ./cuda/cudaAcceleration.h:160:35: error: invalid output constraint '=f' in asm asm("ld.global.f32 %0, [%1];" : "=f"(v) : "l"(addr+offset)); ./cuda/cudaAcceleration.h:168:50: error: invalid output constraint '=f' in asm asm("ld.global.ca.nc.v2.f32 {%0, %1}, [%2];" : "=f"(v.x), "=f"(v.y) : "l"(addr+offset)); etc etc It's very late here, I'll have to try things earlier... ID: 1769987 ·

TBar Volunteer tester Send message Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768	Message 1770214 - Posted: 7 Mar 2016, 9:16:23 UTC - in response to Message 1769984. Something causes object.h to be included. Can you see any difference in #include statements in the file that causes the error. CudaAcceleration.cu might be the file to look. Try changing my code in CudaAcceleration.cu near line 50 to #ifdef _WIN32 #include "nvapi_device.h" #endif That makes the mac version not to inlcude nvapi_device.h I do not know if that helps or if it breaks some other code then. Might do, either way that include is my Windows driver level one, so should be inactive anyway. Still No luck. If I remove everything in cudaAcceleration.h between; #if (__CUDA_ARCH__ > 350) and #ifdef USE_STOCKHAM The Errors are reduced down to just repeated lines such as; /usr/include/dispatch/object.h(143): error: expected an identifier /usr/include/dispatch/object.h(362): error: identifier "dispatch_block_t" is undefined /usr/include/dispatch/object.h(143): error: expected an identifier /usr/include/dispatch/object.h(362): error: identifier "dispatch_block_t" is undefined cuda/cudaAcc_gaussfit.cu(375): error: identifier "LDG_f_ca" is undefined cuda/cudaAcc_gaussfit.cu(410): error: identifier "LDG_f_ca" is undefined cuda/cudaAcc_gaussfit.cu(419): error: identifier "LDG_f_ca" is undefined cuda/cudaAcc_gaussfit.cu(439): error: identifier "LDG_f_ca" is undefined cuda/cudaAcc_gaussfit.cu(662): error: identifier "LDG_f_ca" is undefined cuda/cudaAcc_gaussfit.cu(222): error: identifier "LDG_f_ca" is undefined detected during instantiation of "void GaussFit_kernel<ul_FftLength>(float, result_flag , __nv_bool) [with ul_FftLength=8]" (1131): here cuda/cudaAcc_gaussfit.cu(227): error: identifier "LDG_f_ca" is undefined detected during instantiation of "void GaussFit_kernel<ul_FftLength>(float, result_flag , __nv_bool) [with ul_FftLength=8]" (1131): here etc, etc... cuda/cudaAcc_PowerSpectrum.cu(31): error: identifier "LDG_f4_cs" is undefined cuda/cudaAcc_PowerSpectrum.cu(50): error: identifier "LDG_f2_cs" is undefined cuda/cudaAcc_pulsefind.cu(382): error: identifier "LDG_f_cs" is undefined cuda/cudaAcc_pulsefind.cu(397): error: identifier "LDG_f_cs" is undefined cuda/cudaAcc_pulsefind.cu(420): error: identifier "LDG_f_ca" is undefined cuda/cudaAcc_pulsefind.cu(194): error: identifier "LDG_f_cs" is undefined etc, etc.... and finally; cuda/cudaAcc_autocorr.cu(226): error: identifier "LDG_f2_cs" is undefined Strange I don't see these Errors in Mountain Lion with Xcode 5.1.1 and ToolKit 6.5 They appeared in Yosemite with Xcode 7.2.1 and ToolKit 7.5. Again, I can compile other SETI Apps without any problems, including setiathome_x41zi_x86_64-apple-darwin_cuda75 ID: 1770214 ·

petri33 Volunteer tester Send message Joined: 6 Jun 02 Posts: 1668 Credit: 623,086,772 RAC: 156	Message 1770230 - Posted: 7 Mar 2016, 11:00:44 UTC - in response to Message 1770214. Something causes object.h to be included. Can you see any difference in #include statements in the file that causes the error. CudaAcceleration.cu might be the file to look. Try changing my code in CudaAcceleration.cu near line 50 to #ifdef _WIN32 #include "nvapi_device.h" #endif That makes the mac version not to inlcude nvapi_device.h I do not know if that helps or if it breaks some other code then. Might do, either way that include is my Windows driver level one, so should be inactive anyway. Still No luck. If I remove everything in cudaAcceleration.h between; #if (__CUDA_ARCH__ > 350) and #ifdef USE_STOCKHAM The Errors are reduced down to just repeated lines such as; /usr/include/dispatch/object.h(143): error: expected an identifier /usr/include/dispatch/object.h(362): error: identifier "dispatch_block_t" is undefined /usr/include/dispatch/object.h(143): error: expected an identifier /usr/include/dispatch/object.h(362): error: identifier "dispatch_block_t" is undefined cuda/cudaAcc_gaussfit.cu(375): error: identifier "LDG_f_ca" is undefined cuda/cudaAcc_gaussfit.cu(410): error: identifier "LDG_f_ca" is undefined cuda/cudaAcc_gaussfit.cu(419): error: identifier "LDG_f_ca" is undefined cuda/cudaAcc_gaussfit.cu(439): error: identifier "LDG_f_ca" is undefined cuda/cudaAcc_gaussfit.cu(662): error: identifier "LDG_f_ca" is undefined cuda/cudaAcc_gaussfit.cu(222): error: identifier "LDG_f_ca" is undefined detected during instantiation of "void GaussFit_kernel<ul_FftLength>(float, result_flag , __nv_bool) [with ul_FftLength=8]" (1131): here cuda/cudaAcc_gaussfit.cu(227): error: identifier "LDG_f_ca" is undefined detected during instantiation of "void GaussFit_kernel<ul_FftLength>(float, result_flag , __nv_bool) [with ul_FftLength=8]" (1131): here etc, etc... cuda/cudaAcc_PowerSpectrum.cu(31): error: identifier "LDG_f4_cs" is undefined cuda/cudaAcc_PowerSpectrum.cu(50): error: identifier "LDG_f2_cs" is undefined cuda/cudaAcc_pulsefind.cu(382): error: identifier "LDG_f_cs" is undefined cuda/cudaAcc_pulsefind.cu(397): error: identifier "LDG_f_cs" is undefined cuda/cudaAcc_pulsefind.cu(420): error: identifier "LDG_f_ca" is undefined cuda/cudaAcc_pulsefind.cu(194): error: identifier "LDG_f_cs" is undefined etc, etc.... and finally; cuda/cudaAcc_autocorr.cu(226): error: identifier "LDG_f2_cs" is undefined Strange I don't see these Errors in Mountain Lion with Xcode 5.1.1 and ToolKit 6.5 They appeared in Yosemite with Xcode 7.2.1 and ToolKit 7.5. Again, I can compile other SETI Apps without any problems, including setiathome_x41zi_x86_64-apple-darwin_cuda75 The LDG_... are defined in cudaAcceleration.h and they should not be removed. Something causes object.h to be included. To overcome Heisenbergs: "You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones ID: 1770230 ·

petri33 Volunteer tester Send message Joined: 6 Jun 02 Posts: 1668 Credit: 623,086,772 RAC: 156	Message 1770231 - Posted: 7 Mar 2016, 11:08:48 UTC the asm statements that the LDG inline functions use are valid. The error "invalid output constraint '=f' in asm" is not giving much information. "=f" means an output variable that is of type float. Could there be a compiler setting that causes floats to be treated as doubles. If that is the case, you could try "=d" but that would probably slow things down and/or introduce calculation errors. To overcome Heisenbergs: "You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones ID: 1770231 ·

petri33 Volunteer tester Send message Joined: 6 Jun 02 Posts: 1668 Credit: 623,086,772 RAC: 156	Message 1770233 - Posted: 7 Mar 2016, 11:16:16 UTC Do you use Makefile and make command to build? Check Makefile for "6.5". There may be wrong INCLUDE path settings. Set to 7.5. Or if not using Makefile then check the settings from your compiler-environment. PATH? try runnning nvcc --version from command line. ... To overcome Heisenbergs: "You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones ID: 1770233 ·

©2024 University of California

SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.