Mac OS Sierra
Author | Message |
---|---|
Urs Echternacht Joined: 15 May 99 Posts: 692 Credit: 135,197,781 RAC: 211 |
Thanks for running the test. It looks like your MacBookPro does not have any issues and reported correct results already before the test. No need for you to run any of these tests here. _\|/_ U r s |
TBar Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 |
Yes, thanks for running the test. Up until now I thought all the iGPUs were having this problem; it's interesting to see that some are working correctly. Laurent's machine runs the CPU tasks well by themselves. I wonder how many CPU tasks he has running: https://setiathome.berkeley.edu/results.php?hostid=8177300&offset=200 |
W3Perl Joined: 29 Apr 99 Posts: 251 Credit: 3,696,783,867 RAC: 12,606 |
Yes, thanks for running the test. Up until now I thought all the iGPUs were having this problem, it's interesting to see some are working correctly.
I reran the benchmark test with the same aux binary... it works this time, I don't know why! Here is the output:
KWSN-Darwin-MBbench v2.1.07
Running on domisses-MacBook-Pro.local at Mon Jan 16 08:45:37 2017
---------------------------------------------------
Starting benchmark run...
---------------------------------------------------
Listing wu-file(s) in /testWUs :
18dc09ah.26284.16432.6.33.125.wu
reference_work_unit_r3215.wu
Listing executable(s) in /APPS :
MBv8_8.22r3586_Intel_ssse3_x86_64-apple-darwin
Listing executable in /REF_APPs :
MBv8_8.06r3366_avx_x86_64-apple-darwin
---------------------------------------------------
Current WU: 18dc09ah.26284.16432.6.33.125.wu
---------------------------------------------------
Running default app with command : MBv8_8.06r3366_avx_x86_64-apple-darwin
1348.48 real 1344.56 user 1.43 sys
Elapsed Time: ………………………………… 1349 seconds
---------------------------------------------------
Running app with command : MBv8_8.22r3586_Intel_ssse3_x86_64-apple-darwin
1642.60 real 45.41 user 55.45 sys
Elapsed Time : ……………………………… 1642 seconds
Speed compared to default : 82 %
-----------------
Comparing results
               ------------- R1:R2 ------------  ------------- R2:R1 ------------
                Exact Super Tight  Good   Bad     Exact Super Tight  Good   Bad
Spike               0     0     0     0     0         0     0     0     0     0
Autocorr            0     0     0     0     0         0     0     0     0     1
Gaussian            0     0     0     0     0         0     0     0     0     0
Pulse               0     0     0     0     1         0     0     0     0     1
Triplet             0     2     2     3     0         0     2     2     3     0
Best Spike          0     0     1     1     0         0     0     1     1     0
Best Autocorr       0     0     0     1     0         0     0     0     1     0
Best Gaussian       1     1     1     1     0         1     1     1     1     0
Best Pulse          0     0     0     0     1         0     0     0     0     1
Best Triplet        0     0     0     1     0         0     0     0     1     0
                 ----  ----  ----  ----  ----      ----  ----  ----  ----  ----
                    1     3     4     7     2         1     3     4     7     3
Unmatched signal(s) in R1 at line(s) 393 473
Unmatched signal(s) in R2 at line(s) 359 410 490
For R1:R2 matched signals only, Q= ????
Result : Weakly similar.
---------------------------------------------------
Done with 18dc09ah.26284.16432.6.33.125.wu.
Current WU: reference_work_unit_r3215.wu
---------------------------------------------------
Running default app with command : MBv8_8.06r3366_avx_x86_64-apple-darwin
1094.72 real 1091.32 user 1.04 sys
Elapsed Time: ………………………………… 1094 seconds
---------------------------------------------------
Running app with command : MBv8_8.22r3586_Intel_ssse3_x86_64-apple-darwin
935.59 real 69.43 user 32.42 sys
Elapsed Time : ……………………………… 936 seconds
Speed compared to default : 116 %
-----------------
Comparing results
               ------------- R1:R2 ------------  ------------- R2:R1 ------------
                Exact Super Tight  Good   Bad     Exact Super Tight  Good   Bad
Spike               0     8    10    13     0         0     8    10    13     0
Autocorr            0     1     1     1     0         0     1     1     1     0
Gaussian            0     0     0     2     4         0     0     0     2     3
Pulse               0     0     0     0     0         0     0     0     0     0
Triplet             0     0     1     2     0         0     0     1     2     0
Best Spike          0     1     1     1     0         0     1     1     1     0
Best Autocorr       0     1     1     1     0         0     1     1     1     0
Best Gaussian       0     0     0     1     0         0     0     0     1     0
Best Pulse          0     0     0     0     1         0     0     0     0     1
Best Triplet        0     0     1     1     0         0     0     1     1     0
                 ----  ----  ----  ----  ----      ----  ----  ----  ----  ----
                    0    11    15    22     5         0    11    15    22     4
Unmatched signal(s) in R1 at line(s) 499 526 553 634 720
Unmatched signal(s) in R2 at line(s) 499 526 607 693
For R1:R2 matched signals only, Q= 17.35%
Result : Weakly similar.
---------------------------------------------------
Done with reference_work_unit_r3215.wu.
Done with Benchmark run! Removing temporary files! |
TBar Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 |
OK, the i7-3520M with AVX 1 works, the i7-6700HQ with AVX 2 doesn't. That reminds me of a similar discussion at Beta with the Windows iGPU App. I keep thinking about how Apple doesn't use the 'other' OpenCL platforms, and just uses their own Apple OpenCL platform for all OpenCL devices. As in, there isn't an Intel OpenCL platform; it's called Apple: "OpenCL platform detected: Apple", "OpenCL Platform Name: Apple" (https://setiathome.berkeley.edu/result.php?resultid=5431092300). But I'm having a hard time seeing where the platform name would make a difference when compiling the App. I dunno, I keep thinking about how that i7-6700 would work with an AVX 2 App though. |
Urs Echternacht Joined: 15 May 99 Posts: 692 Credit: 135,197,781 RAC: 211 |
Thanks Laurent for that first complete test run. It shows perfectly that the first change is not the needed one. @TBar: The main difference between the iGPUs in Core ix 3xxx / Core ix 4xxx and those in Core ix 5xxx / Core ix 6xxx / Core ix 7xxx is that the first two series are purely 32-bit internally (single precision, i.e. float) while the newer series can also support 64-bit (double precision), if the driver supports that feature. An app that needs single precision on the newer series iGPUs, with the same numerical precision as on the older series iGPUs, therefore needs some testing work. In my tests a few months ago avx2 was slower than avx. Probably Joe Segur was correct in assuming exactly that after finishing his work on the avx CPU apps. On to preparing the next three tests ... _\|/_ U r s |
TBar Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 |
Hmmm, I remember people telling me ssse3 was faster than sse41 on Macs. However, it turns out on most Macs the sse41 App is about 10% faster. Same with the AVX App, it's about 10% faster than the sse41 App. According to Intel, AVX2 should be faster still. Of course, I don't have an AVX2 CPU...but I can still compile an App. I just can't test it, sorta like the iGPU App and the current AVX App. Maybe I'll try compiling an AVX2 App some day. |
W3Perl Joined: 29 Apr 99 Posts: 251 Credit: 3,696,783,867 RAC: 12,606 |
Hmmm, I remember people telling me ssse3 was faster than sse41 on Macs. However, it turns out on most Macs the sse41 App is about 10% faster. Same with the AVX App, it's about 10% faster than the sse41 App. According to Intel, AVX2 should be faster still. Of course, I don't have an AVX2 CPU...but I can still compile an App. I just can't test it, sorta like the iGPU App and the current AVX App. Maybe I'll try compiling an AVX2 App some day. I'm ready to test and/or compile if required (sorry, I cannot send you my new laptop ;) |
Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
AFAIK AVX2 mostly adds 256-bit integer instructions to the instruction set. We mostly use float ones instead. There are a few new instructions that could be useful for us too, like gather or permutations, but testing is needed. According to the VC++ dev blog, they didn't use gather, at least on Haswell, because although it was implemented it was slower than other ways of doing the same work. Nothing new in that. Support for some assembly instruction doesn't mean it will be faster to use it. The same was once true of Venice, which supported SSE3 but so inefficiently that it was better to stay with SSE2 only. SETI apps news We're not gonna fight them. We're gonna transcend them. |
TBar Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 |
There is some sort of problem with FFTW when trying to compile with --enable-avx2. I keep getting the same Error; Stack dump: You get the same Error when compiling in Darwin 15.5. So, I was only able to compile FFTW with --enable-avx. The CPU App compiled OK using the AVX2 tags. I posted it at Crunchers Anonymous, http://www.arkayn.us/forum/index.php?action=dlattach;topic=191.0;attach=892. Just make sure you run it in the Benchmark package to see if it works. Replace the MBv8_8.22r3586_Intel_ssse3_x86_64-apple-darwin App in the APPS folder with MBv8_8.22r3588_avx2_x86_64-apple-darwin and the benchmark will compare it to the AVX App. |
W3Perl Joined: 29 Apr 99 Posts: 251 Credit: 3,696,783,867 RAC: 12,606 |
There is some sort of problem with FFTW when trying to compile with the --enable-avx2. I keep getting the same Error;
KWSN-Darwin-MBbench v2.1.07
Running on domisses-MacBook-Pro.local at Wed Jan 18 09:36:30 2017
---------------------------------------------------
Starting benchmark run...
---------------------------------------------------
Listing wu-file(s) in /testWUs :
18dc09ah.26284.16432.6.33.125.wu
reference_work_unit_r3215.wu
Listing executable(s) in /APPS :
MBv8_8.22r3588_avx2_x86_64-apple-darwin
Listing executable in /REF_APPs :
MBv8_8.06r3366_avx_x86_64-apple-darwin
---------------------------------------------------
Current WU: 18dc09ah.26284.16432.6.33.125.wu
---------------------------------------------------
Running default app with command : MBv8_8.06r3366_avx_x86_64-apple-darwin
1368.64 real 1364.40 user 1.68 sys
Elapsed Time: ………………………………… 1369 seconds
---------------------------------------------------
Running app with command : MBv8_8.22r3588_avx2_x86_64-apple-darwin
0.03 real 0.02 user 0.00 sys
MBv8_8.22r3588_avx2_x86_64-apple-darwin failed to run on your system.
Continuing with next (if any)...
---------------------------------------------------
Done with 18dc09ah.26284.16432.6.33.125.wu.
Current WU: reference_work_unit_r3215.wu
---------------------------------------------------
Running default app with command : MBv8_8.06r3366_avx_x86_64-apple-darwin
1088.36 real 1084.95 user 1.08 sys
Elapsed Time: ………………………………… 1089 seconds
---------------------------------------------------
Running app with command : MBv8_8.22r3588_avx2_x86_64-apple-darwin
0.03 real 0.02 user 0.00 sys
MBv8_8.22r3588_avx2_x86_64-apple-darwin failed to run on your system.
Continuing with next (if any)...
---------------------------------------------------
Done with reference_work_unit_r3215.wu. |
TBar Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 |
Did it work long enough to leave a stderr in the testData folder? The stderr file, if it left one, might say why it didn't run. If it didn't leave a stderr, you could try creating an empty folder, placing a Work Unit in the folder and naming it work_unit.sah, placing the App in the folder with the work_unit.sah, and then running it from the terminal. cd to the folder and run ./MBv8_8.22r3588_avx2_x86_64-apple-darwin That usually leaves a stderr with the error in it. Oh well, can't say I didn't try. Google isn't much help. It might need an older system or a newer version of Xcode, or something else entirely. Back to the new cuda build... |
W3Perl Joined: 29 Apr 99 Posts: 251 Credit: 3,696,783,867 RAC: 12,606 |
Did it work long enough to leave a stderr in the testData folder? It might have a reason why it didn't run in the stderr file, if it left one.
15:08:04 (16543): Can't open init data file - running in standalone mode
15:08:04 (16543): Can't open init data file - running in standalone mode
Not using mb_cmdline.txt-file, using commandline options.
SIGSEGV: segmentation violation
Crashed executable name: MBv8_8.22r3588_avx2_x86_64-apple-darwin
Machine type Intel x86-64h Haswell (64-bit executable)
System version: Macintosh OS 10.12.1 build 16B2659
Wed Jan 18 15:08:04 2017
0 MBv8_8.22r3588_avx2_x86_64-apple-darwin 0x000000010ae198f4 std::__1::__tree<std::__1::__value_type<int, PROCINFO>, std::__1::__map_value_compare<int, std::__1::__value_type<int, PROCINFO>, std::__1::less<int>, true>, std::__1::allocator<std::__1::__value_type<int, PROCINFO> > >::__insert_unique(std::__1::__value_type<int, PROCINFO> const&) + 1076
1 MBv8_8.22r3588_avx2_x86_64-apple-darwin 0x000000010ae07fd6 COPROCS::clear() + 4214
2 libsystem_platform.dylib 0x00007fff9287ebba _sigtramp + 26
3 ??? 0x000000000000ffff 0x0 + 65535
4 MBv8_8.22r3588_avx2_x86_64-apple-darwin 0x000000010acb2652 std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> > x_csv_encode<char>(char const*, unsigned long) + 4178
5 MBv8_8.22r3588_avx2_x86_64-apple-darwin 0x000000010acae235 void std::__1::vector<unsigned char, std::__1::allocator<unsigned char> >::__push_back_slow_path<unsigned char const>(unsigned char const&) + 22629
6 MBv8_8.22r3588_avx2_x86_64-apple-darwin 0x000000010acb48aa std::__1::basic_ostream<char, std::__1::char_traits<char> >& std::__1::__put_character_sequence<char, std::__1::char_traits<char> >(std::__1::basic_ostream<char, std::__1::char_traits<char> >&, char const*, unsigned long) + 1050
7 MBv8_8.22r3588_avx2_x86_64-apple-darwin 0x000000010ac9b342 MBv8_8.22r3588_avx2_x86_64-apple-darwin + 1319746
8 libdyld.dylib 0x00007fff92671255 start + 1
9 ???
0x0000000000000001 0x0 + 1 Thread 0 crashed with X86 Thread State (64-bit): rax: 0x0100001f rbx: 0x00000000 rcx: 0x7fff550a39d8 rdx: 0x00000028 rdi: 0x7fff550a3a40 rsi: 0x00000003 rbp: 0x7fff550a3a20 rsp: 0x7fff550a39d8 r8: 0x00000607 r9: 0x00000000 r10: 0x000003b0 r11: 0x00000206 r12: 0x000003b0 r13: 0x00000028 r14: 0x7fff550a3a40 r15: 0x00000607 rip: 0x7fff9279841a rfl: 0x00000206 Binary Images Description: 0x10ab59000 - 0x10aec4fff /Users/domisse/KWSN-OSX-bench-MB_v2.1.07/alpha/./MBv8_8.22r3588_avx2_x86_64-apple-darwin 0x7fff79f7a000 - 0x7fff79f7afff /System/Library/Frameworks/Accelerate.framework/Versions/A/Accelerate 0x7fff79f93000 - 0x7fff7a4acfff /System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vImage.framework/Versions/A/vImage 0x7fff7a4ad000 - 0x7fff7a61dfff /System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/libBLAS.dylib 0x7fff7a61e000 - 0x7fff7a632fff /System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/libBNNS.dylib 0x7fff7a633000 - 0x7fff7aa29fff /System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/libLAPACK.dylib 0x7fff7aa2a000 - 0x7fff7aa40fff /System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/libLinearAlgebra.dylib 0x7fff7aa41000 - 0x7fff7aa47fff /System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/libQuadrature.dylib 0x7fff7aa48000 - 0x7fff7aa5cfff /System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/libSparseBLAS.dylib 0x7fff7aa5d000 - 0x7fff7abe4fff /System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/libvDSP.dylib 0x7fff7abe5000 - 0x7fff7ac97fff /System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/libvMisc.dylib 0x7fff7ac98000 - 0x7fff7ac98fff 
/System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/vecLib 0x7fff7bd2f000 - 0x7fff7bd2ffff /System/Library/Frameworks/ApplicationServices.framework/Versions/A/ApplicationServices 0x7fff7bd30000 - 0x7fff7bd9efff /System/Library/Frameworks/ApplicationServices.framework/Versions/A/Frameworks/ATS.framework/Versions/A/ATS 0x7fff7be38000 - 0x7fff7bf66fff /System/Library/Frameworks/ApplicationServices.framework/Versions/A/Frameworks/ATS.framework/Versions/A/Resources/libFontParser.dylib 0x7fff7bf67000 - 0x7fff7bfb1fff /System/Library/Frameworks/ApplicationServices.framework/Versions/A/Frameworks/ATS.framework/Versions/A/Resources/libFontRegistry.dylib 0x7fff7c0ac000 - 0x7fff7c156fff /System/Library/Frameworks/ApplicationServices.framework/Versions/A/Frameworks/ColorSync.framework/Versions/A/ColorSync 0x7fff7c157000 - 0x7fff7c1a7fff /System/Library/Frameworks/ApplicationServices.framework/Versions/A/Frameworks/HIServices.framework/Versions/A/HIServices 0x7fff7c1a8000 - 0x7fff7c1b7fff /System/Library/Frameworks/ApplicationServices.framework/Versions/A/Frameworks/LangAnalysis.framework/Versions/A/LangAnalysis 0x7fff7c1b8000 - 0x7fff7c205fff /System/Library/Frameworks/ApplicationServices.framework/Versions/A/Frameworks/PrintCore.framework/Versions/A/PrintCore 0x7fff7c206000 - 0x7fff7c241fff /System/Library/Frameworks/ApplicationServices.framework/Versions/A/Frameworks/QD.framework/Versions/A/QD 0x7fff7c242000 - 0x7fff7c24dfff /System/Library/Frameworks/ApplicationServices.framework/Versions/A/Frameworks/SpeechSynthesis.framework/Versions/A/SpeechSynthesis 0x7fff7c24e000 - 0x7fff7c45cfff /System/Library/Frameworks/AudioToolbox.framework/Versions/A/AudioToolbox 0x7fff7c45d000 - 0x7fff7c45dfff /System/Library/Frameworks/AudioUnit.framework/Versions/A/AudioUnit 0x7fff7c5c6000 - 0x7fff7c94afff /System/Library/Frameworks/CFNetwork.framework/Versions/A/CFNetwork 0x7fff7c964000 - 0x7fff7c964fff 
/System/Library/Frameworks/Carbon.framework/Versions/A/Carbon 0x7fff7c965000 - 0x7fff7c968fff /System/Library/Frameworks/Carbon.framework/Versions/A/Frameworks/CommonPanels.framework/Versions/A/CommonPanels 0x7fff7c969000 - 0x7fff7cc71fff /System/Library/Frameworks/Carbon.framework/Versions/A/Frameworks/HIToolbox.framework/Versions/A/HIToolbox 0x7fff7cc72000 - 0x7fff7cc75fff /System/Library/Frameworks/Carbon.framework/Versions/A/Frameworks/Help.framework/Versions/A/Help 0x7fff7cc76000 - 0x7fff7cc7bfff /System/Library/Frameworks/Carbon.framework/Versions/A/Frameworks/ImageCapture.framework/Versions/A/ImageCapture 0x7fff7cc7c000 - 0x7fff7cd13fff /System/Library/Frameworks/Carbon.framework/Versions/A/Frameworks/Ink.framework/Versions/A/Ink 0x7fff7cd14000 - 0x7fff7cd2efff /System/Library/Frameworks/Carbon.framework/Versions/A/Frameworks/OpenScripting.framework/Versions/A/OpenScripting 0x7fff7cd2f000 - 0x7fff7cd30fff /System/Library/Frameworks/Carbon.framework/Versions/A/Frameworks/Print.framework/Versions/A/Print 0x7fff7cd31000 - 0x7fff7cd33fff /System/Library/Frameworks/Carbon.framework/Versions/A/Frameworks/SecurityHI.framework/Versions/A/SecurityHI 0x7fff7cd34000 - 0x7fff7cd3afff /System/Library/Frameworks/Carbon.framework/Versions/A/Frameworks/SpeechRecognition.framework/Versions/A/SpeechRecognition 0x7fff7cf59000 - 0x7fff7cfe9fff /System/Library/Frameworks/CoreAudio.framework/Versions/A/CoreAudio 0x7fff7cfea000 - 0x7fff7cffdfff /System/Library/Frameworks/CoreBluetooth.framework/Versions/A/CoreBluetooth 0x7fff7cffe000 - 0x7fff7d2f4fff /System/Library/Frameworks/CoreData.framework/Versions/A/CoreData 0x7fff7d2f5000 - 0x7fff7d381fff /System/Library/Frameworks/CoreDisplay.framework/Versions/A/CoreDisplay 0x7fff7d382000 - 0x7fff7d81bfff /System/Library/Frameworks/CoreFoundation.framework/Versions/A/CoreFoundation 0x7fff7d81c000 - 0x7fff7de9afff /System/Library/Frameworks/CoreGraphics.framework/Versions/A/CoreGraphics 0x7fff7de9b000 - 0x7fff7e0dafff 
/System/Library/Frameworks/CoreImage.framework/Versions/A/CoreImage 0x7fff7e235000 - 0x7fff7e235fff /System/Library/Frameworks/CoreServices.framework/Versions/A/CoreServices 0x7fff7e236000 - 0x7fff7e287fff /System/Library/Frameworks/CoreServices.framework/Versions/A/Frameworks/AE.framework/Versions/A/AE 0x7fff7e288000 - 0x7fff7e563fff /System/Library/Frameworks/CoreServices.framework/Versions/A/Frameworks/CarbonCore.framework/Versions/A/CarbonCore 0x7fff7e564000 - 0x7fff7e597fff /System/Library/Frameworks/CoreServices.framework/Versions/A/Frameworks/DictionaryServices.framework/Versions/A/DictionaryServices 0x7fff7e598000 - 0x7fff7e5a0fff /System/Library/Frameworks/CoreServices.framework/Versions/A/Frameworks/FSEvents.framework/Versions/A/FSEvents 0x7fff7e5a1000 - 0x7fff7e70cfff /System/Library/Frameworks/CoreServices.framework/Versions/A/Frameworks/LaunchServices.framework/Versions/A/LaunchServices 0x7fff7e70d000 - 0x7fff7e7bdfff /System/Library/Frameworks/CoreServices.framework/Versions/A/Frameworks/Metadata.framework/Versions/A/Metadata 0x7fff7e7be000 - 0x7fff7e81dfff /System/Library/Frameworks/CoreServices.framework/Versions/A/Frameworks/OSServices.framework/Versions/A/OSServices 0x7fff7e81e000 - 0x7fff7e88efff /System/Library/Frameworks/CoreServices.framework/Versions/A/Frameworks/SearchKit.framework/Versions/A/SearchKit 0x7fff7e88f000 - 0x7fff7e8d5fff /System/Library/Frameworks/CoreServices.framework/Versions/A/Frameworks/SharedFileList.framework/Versions/A/SharedFileList 0x7fff7e962000 - 0x7fff7eaadfff /System/Library/Frameworks/CoreText.framework/Versions/A/CoreText 0x7fff7eaae000 - 0x7fff7eae3fff /System/Library/Frameworks/CoreVideo.framework/Versions/A/CoreVideo 0x7fff7eae4000 - 0x7fff7eb55fff /System/Library/Frameworks/CoreWLAN.framework/Versions/A/CoreWLAN 0x7fff7ec50000 - 0x7fff7ec55fff /System/Library/Frameworks/DiskArbitration.framework/Versions/A/DiskArbitration 0x7fff7ede4000 - 0x7fff7f18bfff 
/System/Library/Frameworks/Foundation.framework/Versions/C/Foundation 0x7fff7f1b7000 - 0x7fff7f1e8fff /System/Library/Frameworks/GSS.framework/Versions/A/GSS 0x7fff7f29c000 - 0x7fff7f33ffff /System/Library/Frameworks/IOBluetooth.framework/Versions/A/IOBluetooth 0x7fff7f340000 - 0x7fff7f3d5fff /System/Library/Frameworks/IOKit.framework/Versions/A/IOKit 0x7fff7f3d6000 - 0x7fff7f3dcfff /System/Library/Frameworks/IOSurface.framework/Versions/A/IOSurface 0x7fff7f430000 - 0x7fff7f586fff /System/Library/Frameworks/ImageIO.framework/Versions/A/ImageIO 0x7fff7f587000 - 0x7fff7f58bfff /System/Library/Frameworks/ImageIO.framework/Versions/A/Resources/libGIF.dylib 0x7fff7f58c000 - 0x7fff7f67cfff /System/Library/Frameworks/ImageIO.framework/Versions/A/Resources/libJP2.dylib 0x7fff7f67d000 - 0x7fff7f6a0fff /System/Library/Frameworks/ImageIO.framework/Versions/A/Resources/libJPEG.dylib 0x7fff7f6a1000 - 0x7fff7f6c8fff /System/Library/Frameworks/ImageIO.framework/Versions/A/Resources/libPng.dylib 0x7fff7f6c9000 - 0x7fff7f6cbfff /System/Library/Frameworks/ImageIO.framework/Versions/A/Resources/libRadiance.dylib 0x7fff7f6cc000 - 0x7fff7f725fff /System/Library/Frameworks/ImageIO.framework/Versions/A/Resources/libTIFF.dylib 0x7fff802e7000 - 0x7fff80300fff /System/Library/Frameworks/Kerberos.framework/Versions/A/Kerberos 0x7fff80af7000 - 0x7fff80b4efff /System/Library/Frameworks/Metal.framework/Versions/A/Metal 0x7fff81264000 - 0x7fff8126cfff /System/Library/Frameworks/NetFS.framework/Versions/A/NetFS 0x7fff81611000 - 0x7fff8165ffff /System/Library/Frameworks/OpenCL.framework/Versions/A/OpenCL 0x7fff81660000 - 0x7fff81679fff /System/Library/Frameworks/OpenDirectory.framework/Versions/A/Frameworks/CFOpenDirectory.framework/Versions/A/CFOpenDirectory 0x7fff8167a000 - 0x7fff81685fff /System/Library/Frameworks/OpenDirectory.framework/Versions/A/OpenDirectory 0x7fff81686000 - 0x7fff81688fff /System/Library/Frameworks/OpenGL.framework/Versions/A/Libraries/libCVMSPluginSupport.dylib 
0x7fff81689000 - 0x7fff8168cfff /System/Library/Frameworks/OpenGL.framework/Versions/A/Libraries/libCoreFSCache.dylib 0x7fff8168d000 - 0x7fff81690fff /System/Library/Frameworks/OpenGL.framework/Versions/A/Libraries/libCoreVMClient.dylib 0x7fff81691000 - 0x7fff81699fff /System/Library/Frameworks/OpenGL.framework/Versions/A/Libraries/libGFXShared.dylib 0x7fff8169a000 - 0x7fff816a5fff /System/Library/Frameworks/OpenGL.framework/Versions/A/Libraries/libGL.dylib 0x7fff816a6000 - 0x7fff816e2fff /System/Library/Frameworks/OpenGL.framework/Versions/A/Libraries/libGLImage.dylib 0x7fff8185a000 - 0x7fff8189afff /System/Library/Frameworks/OpenGL.framework/Versions/A/Libraries/libGLU.dylib 0x7fff82202000 - 0x7fff82210fff /System/Library/Frameworks/OpenGL.framework/Versions/A/OpenGL 0x7fff82f22000 - 0x7fff8311cfff /System/Library/Frameworks/QuartzCore.framework/Versions/A/QuartzCore 0x7fff83682000 - 0x7fff8399efff /System/Library/Frameworks/Security.framework/Versions/A/Security 0x7fff8399f000 - 0x7fff83a15fff /System/Library/Frameworks/SecurityFoundation.framework/Versions/A/SecurityFoundation 0x7fff83a3f000 - 0x7fff83a42fff /System/Library/Frameworks/ServiceManagement.framework/Versions/A/ServiceManagement 0x7fff83dc7000 - 0x7fff83e38fff /System/Library/Frameworks/SystemConfiguration.framework/Versions/A/SystemConfiguration 0x7fff86490000 - 0x7fff864b2fff /System/Library/PrivateFrameworks/Apple80211.framework/Versions/A/Apple80211 0x7fff865ab000 - 0x7fff86636fff /System/Library/PrivateFrameworks/AppleJPEG.framework/Versions/A/AppleJPEG 0x7fff8809a000 - 0x7fff880a3fff /System/Library/PrivateFrameworks/CommonAuth.framework/Versions/A/CommonAuth 0x7fff887bf000 - 0x7fff887d0fff /System/Library/PrivateFrameworks/CoreEmoji.framework/Versions/A/CoreEmoji 0x7fff88df6000 - 0x7fff88f34fff /System/Library/PrivateFrameworks/CoreUI.framework/Versions/A/CoreUI 0x7fff88f35000 - 0x7fff88fd4fff /System/Library/PrivateFrameworks/CoreUtils.framework/Versions/A/CoreUtils 0x7fff89024000 - 
0x7fff89089fff /System/Library/PrivateFrameworks/CoreWiFi.framework/Versions/A/CoreWiFi 0x7fff8908a000 - 0x7fff89097fff /System/Library/PrivateFrameworks/CrashReporterSupport.framework/Versions/A/CrashReporterSupport 0x7fff89238000 - 0x7fff89349fff /System/Library/PrivateFrameworks/DesktopServicesPriv.framework/Versions/A/DesktopServicesPriv 0x7fff8961b000 - 0x7fff89a4cfff /System/Library/PrivateFrameworks/FaceCore.framework/Versions/A/FaceCore 0x7fff8ad95000 - 0x7fff8ad95fff /System/Library/PrivateFrameworks/GPUCompiler.framework/libmetal_timestamp.dylib 0x7fff8b772000 - 0x7fff8b7e8fff /System/Library/PrivateFrameworks/Heimdal.framework/Versions/A/Heimdal 0x7fff8bdfb000 - 0x7fff8be02fff /System/Library/PrivateFrameworks/IOAccelerator.framework/Versions/A/IOAccelerator 0x7fff8be04000 - 0x7fff8be18fff /System/Library/PrivateFrameworks/IOPresentment.framework/Versions/A/IOPresentment 0x7fff8be19000 - 0x7fff8be3bfff /System/Library/PrivateFrameworks/IconServices.framework/Versions/A/IconServices 0x7fff8bf1d000 - 0x7fff8c0d1fff /System/Library/PrivateFrameworks/LanguageModeling.framework/Versions/A/LanguageModeling 0x7fff8c983000 - 0x7fff8c9fbfff /System/Library/PrivateFrameworks/MetalPerformanceShaders.framework/Versions/A/MetalPerformanceShaders 0x7fff8cb7d000 - 0x7fff8cba5fff /System/Library/PrivateFrameworks/MultitouchSupport.framework/Versions/A/MultitouchSupport 0x7fff8cc52000 - 0x7fff8cc5cfff /System/Library/PrivateFrameworks/NetAuth.framework/Versions/A/NetAuth 0x7fff8dba2000 - 0x7fff8dbbcfff /System/Library/PrivateFrameworks/ProtocolBuffer.framework/Versions/A/ProtocolBuffer 0x7fff8e8f1000 - 0x7fff8e96dfff /System/Library/PrivateFrameworks/Sharing.framework/Versions/A/Sharing 0x7fff8e98e000 - 0x7fff8ebe9fff /System/Library/PrivateFrameworks/SkyLight.framework/Versions/A/SkyLight 0x7fff8edc7000 - 0x7fff8edd3fff /System/Library/PrivateFrameworks/SpeechRecognitionCore.framework/Versions/A/SpeechRecognitionCore 0x7fff8f921000 - 0x7fff8f927fff 
/System/Library/PrivateFrameworks/TCC.framework/Versions/A/TCC 0x7fff8f9b3000 - 0x7fff8fb74fff /System/Library/PrivateFrameworks/TextureIO.framework/Versions/A/TextureIO 0x7fff90d8b000 - 0x7fff90d8dfff /System/Library/PrivateFrameworks/login.framework/Versions/A/Frameworks/loginsupport.framework/Versions/A/loginsupport 0x7fff90de2000 - 0x7fff90dfdfff /usr/lib/libCRFSuite.dylib 0x7fff90dfe000 - 0x7fff90e09fff /usr/lib/libChineseTokenizer.dylib 0x7fff90e9b000 - 0x7fff90e9cfff /usr/lib/libDiagnosticMessagesClient.dylib 0x7fff90e9d000 - 0x7fff910b0fff /usr/lib/libFosl_dynamic.dylib 0x7fff910d4000 - 0x7fff910d4fff /usr/lib/libOpenScriptingUtil.dylib 0x7fff910da000 - 0x7fff910dbfff /usr/lib/libSystem.B.dylib 0x7fff91147000 - 0x7fff91172fff /usr/lib/libarchive.2.dylib 0x7fff91173000 - 0x7fff911e7fff /usr/lib/libate.dylib 0x7fff911eb000 - 0x7fff911ebfff /usr/lib/libauto.dylib 0x7fff911ec000 - 0x7fff911fcfff /usr/lib/libbsm.0.dylib 0x7fff911fd000 - 0x7fff9120bfff /usr/lib/libbz2.1.0.dylib 0x7fff9120c000 - 0x7fff91262fff /usr/lib/libc++.1.dylib 0x7fff91263000 - 0x7fff9128dfff /usr/lib/libc++abi.dylib 0x7fff9128e000 - 0x7fff9129efff /usr/lib/libcmph.dylib 0x7fff9129f000 - 0x7fff912b4fff /usr/lib/libcompression.dylib 0x7fff912b5000 - 0x7fff912b5fff /usr/lib/libcoretls.dylib 0x7fff912b6000 - 0x7fff912b7fff /usr/lib/libcoretls_cfhelpers.dylib 0x7fff915f3000 - 0x7fff91646fff /usr/lib/libcups.2.dylib 0x7fff916c0000 - 0x7fff916c0fff /usr/lib/libenergytrace.dylib 0x7fff916d0000 - 0x7fff916d5fff /usr/lib/libheimdal-asn1.dylib 0x7fff916d6000 - 0x7fff917c8fff /usr/lib/libiconv.2.dylib 0x7fff917c9000 - 0x7fff919eefff /usr/lib/libicucore.A.dylib 0x7fff919f4000 - 0x7fff919f5fff /usr/lib/liblangid.dylib 0x7fff919f6000 - 0x7fff91a0ffff /usr/lib/liblzma.5.dylib 0x7fff91a10000 - 0x7fff91a26fff /usr/lib/libmarisa.dylib 0x7fff91a27000 - 0x7fff91cd1fff /usr/lib/libmecabra.dylib 0x7fff91d04000 - 0x7fff91d7dfff /usr/lib/libnetwork.dylib 0x7fff91d7e000 - 0x7fff9214efff /usr/lib/libobjc.A.dylib 
0x7fff92151000 - 0x7fff92155fff /usr/lib/libpam.2.dylib 0x7fff92156000 - 0x7fff92186fff /usr/lib/libpcap.A.dylib 0x7fff921a4000 - 0x7fff921c0fff /usr/lib/libresolv.9.dylib 0x7fff92210000 - 0x7fff92358fff /usr/lib/libsqlite3.dylib 0x7fff9244a000 - 0x7fff92457fff /usr/lib/libxar.1.dylib 0x7fff9245b000 - 0x7fff9254afff /usr/lib/libxml2.2.dylib 0x7fff9254b000 - 0x7fff92574fff /usr/lib/libxslt.1.dylib 0x7fff92575000 - 0x7fff92586fff /usr/lib/libz.1.dylib 0x7fff92595000 - 0x7fff92599fff /usr/lib/system/libcache.dylib 0x7fff9259a000 - 0x7fff925a4fff /usr/lib/system/libcommonCrypto.dylib 0x7fff925a5000 - 0x7fff925acfff /usr/lib/system/libcompiler_rt.dylib 0x7fff925ad000 - 0x7fff925b5fff /usr/lib/system/libcopyfile.dylib 0x7fff925b6000 - 0x7fff92638fff /usr/lib/system/libcorecrypto.dylib 0x7fff92639000 - 0x7fff9266bfff /usr/lib/system/libdispatch.dylib 0x7fff9266c000 - 0x7fff92671fff /usr/lib/system/libdyld.dylib 0x7fff92672000 - 0x7fff92672fff /usr/lib/system/libkeymgr.dylib 0x7fff92673000 - 0x7fff9267ffff /usr/lib/system/libkxld.dylib 0x7fff92680000 - 0x7fff92680fff /usr/lib/system/liblaunch.dylib 0x7fff92681000 - 0x7fff92686fff /usr/lib/system/libmacho.dylib 0x7fff92687000 - 0x7fff92689fff /usr/lib/system/libquarantine.dylib 0x7fff9268a000 - 0x7fff9268bfff /usr/lib/system/libremovefile.dylib 0x7fff9268c000 - 0x7fff926a4fff /usr/lib/system/libsystem_asl.dylib 0x7fff926a5000 - 0x7fff926a5fff /usr/lib/system/libsystem_blocks.dylib 0x7fff926a6000 - 0x7fff92733fff /usr/lib/system/libsystem_c.dylib 0x7fff92734000 - 0x7fff92737fff /usr/lib/system/libsystem_configuration.dylib 0x7fff92738000 - 0x7fff9273bfff /usr/lib/system/libsystem_coreservices.dylib 0x7fff9273c000 - 0x7fff92754fff /usr/lib/system/libsystem_coretls.dylib 0x7fff92755000 - 0x7fff9275bfff /usr/lib/system/libsystem_dnssd.dylib 0x7fff9275c000 - 0x7fff92785fff /usr/lib/system/libsystem_info.dylib 0x7fff92786000 - 0x7fff927a8fff /usr/lib/system/libsystem_kernel.dylib 0x7fff927a9000 - 0x7fff927f0fff 
/usr/lib/system/libsystem_m.dylib 0x7fff927f1000 - 0x7fff9280ffff /usr/lib/system/libsystem_malloc.dylib 0x7fff92810000 - 0x7fff92867fff /usr/lib/system/libsystem_network.dylib 0x7fff92868000 - 0x7fff92871fff /usr/lib/system/libsystem_networkextension.dylib 0x7fff92872000 - 0x7fff9287bfff /usr/lib/system/libsystem_notify.dylib 0x7fff9287c000 - 0x7fff92884fff /usr/lib/system/libsystem_platform.dylib 0x7fff92885000 - 0x7fff9288ffff /usr/lib/system/libsystem_pthread.dylib 0x7fff92890000 - 0x7fff92893fff /usr/lib/system/libsystem_sandbox.dylib 0x7fff92894000 - 0x7fff92895fff /usr/lib/system/libsystem_secinit.dylib 0x7fff92896000 - 0x7fff9289dfff /usr/lib/system/libsystem_symptoms.dylib 0x7fff9289e000 - 0x7fff928befff /usr/lib/system/libsystem_trace.dylib 0x7fff928bf000 - 0x7fff928c4fff /usr/lib/system/libunwind.dylib 0x7fff928c5000 - 0x7fff928eefff /usr/lib/system/libxpc.dylib Exiting... ![]() |
Urs Echternacht Joined: 15 May 99 Posts: 692 Credit: 135,197,781 RAC: 211 |
There is some sort of problem with FFTW when trying to compile with the --enable-avx2. I keep getting the same Error; Are you sure about "-target-cpu core2" ? _\|/_ U r s |
TBar Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 |
Seems FFTW is inserting that all by itself. Strange that it works with --enable-avx but not with --enable-avx2. Trying ./configure CC="/usr/bin/clang" --target=corei7 --enable-float --enable-avx --enable-avx2 doesn't work either; it fails with the same Error. In fact, nothing seems to work, not even the simplest ./configure --enable-float --enable-avx2. I'm thinking it needs a newer Xcode than 6.1.1, but a newer Xcode doesn't work with Petri's code. So, I'll stay with 6.1.1. |
W3Perl Joined: 29 Apr 99 Posts: 251 Credit: 3,696,783,867 RAC: 12,606 |
Seems FFTW is inserting that all by itself. Strange that it works with --enable-avx but not with --enable-avx2. macOS Sierra uses Xcode 8.2. |
TBar Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 |
Are you sure about "-target-cpu core2"? Using CFLAGS="-mtune=corei7" in the configure line gets rid of the above. But then you start getting "fatal error: error in backend: 32-bit absolute addressing is not supported in 64-bit mode" at the same location: /bin/sh ../../../libtool --tag=CC --mode=compile /usr/bin/gcc -DHAVE_CONFIG_H -I. -I../../.. -I../../../kernel -I../../../dft -I../../../dft/simd -I../../../simd-support -mavx2 -mfma -mtune=corei7 -MT t1fuv_2.lo -MD -MP -MF .deps/t1fuv_2.Tpo -c -o t1fuv_2.lo t1fuv_2.c Adding --host=x86_64-apple-darwin --target=x86_64-apple-darwin --build=x86_64-apple-darwin doesn't help. So, have you tried compiling FFTW 3.3.5, or the new 3.3.6-pl1, using --enable-avx2? |
TBar Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 |
"fatal error: error in backend: 32-bit absolute addressing is not supported in 64-bit mode" I think I've solved it. The Memory Error should have been the big clue. Seems FFTW is having the same problem as when compiling the SETI Apps: the compiler can't find <malloc.h>. After jumping through a number of hoops, it appears it's that simple. So, I was able to compile the new FFTW 3.3.6 using ./configure CC="/usr/bin/clang" --host=x86_64-apple-darwin --target=x86_64-apple-darwin --enable-float --enable-threads --with-combined-threads --enable-sse2 --enable-avx --enable-avx2 and make a couple of new CPU Apps. I'm testing the sse41 version now. |
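For anyone who wants to try different SIMD combinations without retyping the line each time, here is a minimal sketch that assembles the same configure command as above. The function name `fftw_configure_cmd` and its parameters are made up for illustration; only the flags themselves come from the post, and it only prints the command, it doesn't run it.

```python
import shlex

def fftw_configure_cmd(simd=("sse2", "avx", "avx2"),
                       cc="/usr/bin/clang",
                       triple="x86_64-apple-darwin"):
    """Build the FFTW ./configure argument list quoted in the post.

    Hypothetical helper: the name and parameters are illustrative only.
    """
    args = [
        "./configure",
        f"CC={cc}",
        f"--host={triple}",
        f"--target={triple}",
        "--enable-float",           # single-precision build
        "--enable-threads",
        "--with-combined-threads",
    ]
    # One --enable-<simd> switch per requested SIMD extension.
    args += [f"--enable-{name}" for name in simd]
    return args

if __name__ == "__main__":
    # Print a copy-and-paste version of the command line.
    print(shlex.join(fftw_configure_cmd()))
```

Passing `simd=("avx",)` gives the avx-only variant that was reported as working with the older Xcode.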
TBar Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 |
Still not having any success trying to compile FFTW with just the avx2 SIMD, and using multiple selections doesn't work very well. I'll try one more attempt with FFTW using avx and the App using avx2. If this doesn't work we'll just have to settle for the avx App for now. AVX2_Test.zip |
Joined: 5 May 99 Posts: 39 Credit: 28,049,113 RAC: 87 |
Still not having any success trying to compile FFTW with just the avx2 SIMD, and using multiple selections doesn't work very well. @TBar I successfully compiled fftw-3.3.6-pl1 using AVX2 SIMD and the MBv8 App using the same SIMD. After that I ran the KWSN-OSX-bench comparing your App and mine, and this is the result: Starting benchmark run... --------------------------------------------------- Listing wu-file(s) in /testWUs : test_work_unit.wu Listing executable(s) in /APPS : MBv8_8.17r3588_avx2_x86_64-apple-darwin Listing executable in /REF_APPs : MBv8_8.22r3603_avx2_x86_64-apple-darwin --------------------------------------------------- Current WU: test_work_unit.wu --------------------------------------------------- Running default app with command : MBv8_8.22r3603_avx2_x86_64-apple-darwin Elapsed Time: ………………………………… 1256 seconds --------------------------------------------------- Running app with command : MBv8_8.17r3588_avx2_x86_64-apple-darwin Elapsed Time : ……………………………… 1284 seconds Speed compared to default : 97 % ----------------- Comparing results Result : Strongly similar, Q= 99.82% --------------------------------------------------- Done with test_work_unit.wu. As you can see, on my i7 4770K Haswell the results are strongly similar, and the same thing happens when I compare my App compiled with AVX SIMD against the one compiled with AVX2 SIMD. Gianfranco I don't want to believe, I want to know! |
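The "Speed compared to default" figure in the benchmark output can be reproduced from the two elapsed times. This is only a sketch of the arithmetic: the function name is made up, and the assumption that the script divides the reference time by the tested time and drops the fraction is inferred from the numbers posted in this thread, not taken from the benchmark script itself.

```python
def speed_vs_default(ref_seconds: int, app_seconds: int) -> int:
    """Percent speed of the tested app relative to the reference app.

    Assumption: reference time / tested time, with the fraction dropped.
    Rounding would give 98 for the run above, so truncation fits the log.
    """
    return (100 * ref_seconds) // app_seconds

if __name__ == "__main__":
    # Run above: reference 1256 s, tested app 1284 s.
    print(speed_vs_default(1256, 1284))  # 97
    # Earlier run in this thread: reference 1349 s, tested app 1642 s.
    print(speed_vs_default(1349, 1642))  # 82
```

A value below 100 means the tested app took longer than the reference app on that work unit.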
TBar Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 |
I compiled with success fftw-3.3.6-pl1 using AVX2 SIMD and the MBv8 App using the same SIMD.... After seeing that, I downloaded Xcode 7.3.1, installed it in OSX 10.11.6, and was finally able to compile 3.3.6 using AVX2. The AVX2 CPU App compiled without any trouble; now it needs to be tested. I decided to try compiling an sse41 CPU App using the same AVX2 FFTW library and it's giving the wrong results. I suppose you can't use the same AVX2 FFTW library with an SSE41 CPU, although FFTW hints that you can: "You need compiler that supports the given SIMD extensions, but FFTW will try to detect at runtime whether the CPU supports these extensions. That is, you can compile with --enable-avx and the code will still run on a CPU without AVX support." Hopefully the new AVX2 version will work better than the sse41 version did. AVX2_Test2.zip So, on your machine there isn't any difference in run-times between AVX and AVX2? |
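Since the question is which SIMD level a given CPU can actually run, here is a small sketch for checking that before picking an App. It assumes an Intel Mac, where `sysctl -n machdep.cpu.features` and `sysctl -n machdep.cpu.leaf7_features` print space-separated feature names; the function name and the sample strings are illustrative and not copied from any machine in this thread.

```python
def supported_simd(features: str, leaf7_features: str) -> dict:
    """Report SSE4.1 / AVX / AVX2 support from sysctl feature strings.

    Assumption: the inputs are the outputs of
      sysctl -n machdep.cpu.features
      sysctl -n machdep.cpu.leaf7_features
    on an Intel Mac, where AVX is listed as "AVX1.0" and AVX2 as "AVX2".
    """
    base = set(features.upper().split())
    leaf7 = set(leaf7_features.upper().split())
    return {
        "sse41": "SSE4.1" in base,
        "avx": "AVX1.0" in base,
        "avx2": "AVX2" in leaf7,
    }

if __name__ == "__main__":
    # Illustrative, shortened sample strings (not real output).
    sample_features = "FPU SSE SSE2 SSE3 SSSE3 SSE4.1 SSE4.2 AVX1.0"
    sample_leaf7 = "SMEP ERMS BMI1 AVX2 BMI2"
    print(supported_simd(sample_features, sample_leaf7))
```

This only tells you what the CPU reports; it says nothing about whether a particular FFTW build picks the right code path at runtime.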
©2025 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.