Mac Seti Enhanced Optimized |
Message boards : Number crunching : Mac Seti Enhanced Optimized
| Author | Message |
|---|---|
|
I stated in another thread that I was trying to build a faster Mac Enhanced client. | |
| ID: 317103 | | |
|
Thanks boog, it's great that you have worked on optimizing the Mac version! I'm trying it out now, I'll let you know how it runs. | |
| ID: 317376 | | |
Thanks boog, it's great that you have worked on optimizing the Mac version! I'm trying it out now, I'll let you know how it runs. You replied to it! lol http://setiathome.berkeley.edu/forum_thread.php?id=30811 I hope it runs as well on all hardware as it did on mine. I also wish that I could get the performance increase that crunch3r got out of his! ____________ | |
| ID: 317416 | | |
... If you can get your hands on the Intel compiler and the IPP libraries for OS X on Intel Macs... That'll surely boost the performance on Intel Macs. Regarding the G4/G5 Macs... well, I know that Alex Kan is looking at it, but I wouldn't expect much speed improvement. P.S. Did you compile FFTW with FMA enabled? At least the G5 is capable of FMA, and it should give a little performance increase. ____________ Have a bad Day! | |
| ID: 317840 | | |
... I compiled fftw with altivec and fma enabled for both the g4 and g5, along with other options (took lots of research and trial and error). And on the intel Macs, I wish I could afford to get one right now, might be a bit before I can get the $$ up to get one of the intel iMacs, or mini for that matter. ____________ | |
| ID: 317983 | | |
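For anyone wondering what "FFTW with AltiVec and FMA enabled" looks like in practice, here is a rough sketch that only prints a configure command line for FFTW 3.x. The `--enable-float`, `--enable-altivec` and `--enable-fma` switches are FFTW's own configure options (AltiVec support is single precision only, hence `--enable-float`). The function name `fftw_configure_cmd` and the `-O3 -mcpu=...` values are assumptions for illustration, not the flags actually used in this thread.

```shell
#!/bin/sh
# Sketch only: print a plausible FFTW 3.x configure line for a PowerPC build.
# fftw_configure_cmd is a hypothetical helper; the CFLAGS are assumed values.
fftw_configure_cmd() {
    cpu="$1"   # "g4" or "g5"
    case "$cpu" in
        g4) mcpu="7450" ;;   # assumed GCC target for a G4
        g5) mcpu="970" ;;    # assumed GCC target for a G5
        *)  echo "unknown cpu: $cpu" >&2; return 1 ;;
    esac
    # FFTW's AltiVec code paths are single precision, so --enable-float is needed.
    echo "./configure --enable-float --enable-altivec --enable-fma CFLAGS=\"-O3 -mcpu=$mcpu\""
}
```

Running `fftw_configure_cmd g5` prints the command so you can review it before pasting it into the FFTW source directory. The "other options" mentioned in the post above are not known and are not reproduced here.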
... IMHO selling the G4 and G5 Macs as fast as you can is the best you can do atm to get the most money out of them. PPC apps are dead and won't be able to compete in any future applications. (Remember the change from 68k to PPC? This time the transition will go even faster, since Apple already developed Darwin x86 alongside the PPC version.) Just think of what will happen if Adobe decides to compile Photoshop with the Intel compiler... Maybe only a G5 Quad is capable of competing with a dual-core Intel. ____________ Have a bad Day! | |
| ID: 317987 | | |
IMHO Maybe I could name my computer "Charity" and take donations to get an intel mini and the intel compiler? lol ____________ | |
| ID: 318010 | | |
IMHO Well, at least you can try to get yourself into Intel's beta test program for the compiler and the IPP library on OS X... ;) I'm pretty sure if you mention what you'd like to do with the compiler they'll grant you access. P.S. I would have distributed optimized Mac binaries myself, but I really don't have the time to look into it any further. ATM I have to maintain more Windows & Linux apps on different platforms than I can handle. Taking on one more is not possible atm. Maybe later, once the whole thing has settled a bit. EDIT If you'd like me to add your app to my webpage, just contact me ;) You can find my mail address on my page. /EDIT ____________ Have a bad Day! | |
| ID: 318024 | | |
Alex is working on it now! :) And his first release, which is very unfinished, is about as fast as mine without heavy compiler flags! I'm actually excited to see what his final result will bring! ____________ | |
| ID: 319087 | | |
|
After some trials, alex over at Team MacNN has outdone himself! | |
| ID: 329190 | | |
|
Bump! | |
| ID: 331152 | | |
|
Thanks Alex! | |
| ID: 332351 | | |
|
Nice performance from my puny 1.42 GHz G4...one of the last results included a 3.73 GHz P4 EE.. | |
| ID: 332514 | | |
|
FYI. | |
| ID: 332760 | | |
|
Well, | |
| ID: 332766 | | |
|
Should be @ 5%-7% faster than the G5 compile on G4. | |
| ID: 332768 | | |
FYI. I just installed it on my [edit] imac [/edit] PPC G3 350 and it seems to be working fine so far. I'll keep you guys posted as to successful completion and times. Current estimated time for the work unit is 41 hours. Messages when installed follow:
Fri Jun 9 23:51:16 2006||Starting BOINC client version 5.4.9 for powerpc-apple-darwin
Fri Jun 9 23:51:16 2006||libcurl/7.15.3 OpenSSL/0.9.7i zlib/1.2.3
Fri Jun 9 23:51:16 2006||Data directory: /Library/Application Support/BOINC Data
Fri Jun 9 23:51:17 2006|SETI@home|Found app_info.xml; using anonymous platform
Fri Jun 9 23:51:17 2006||Processor: 1 Power Macintosh PowerMac2,1
Fri Jun 9 23:51:17 2006||Memory: 192.00 MB physical, 0 bytes virtual
Fri Jun 9 23:51:17 2006||Disk: 6.38 GB total, 3.43 GB free
Fri Jun 9 23:51:17 2006|SETI@home|URL: http://setiathome.berkeley.edu/; Computer ID: 2440182; location: home; project prefs: home
Fri Jun 9 23:51:17 2006|SETI@home Beta Test|URL: http://setiweb.ssl.berkeley.edu/beta/; Computer ID: 7249; location: home; project prefs: default
Fri Jun 9 23:51:17 2006||General prefs: from SETI@home (last modified 2005-10-16 18:17:04)
Fri Jun 9 23:51:17 2006||General prefs: no separate prefs for home; using your defaults
Fri Jun 9 23:51:17 2006||Listening on port 31416
Fri Jun 9 23:51:17 2006|SETI@home Beta Test|Resuming task 01jn01aa.1212.13761.254826.3.235_1 using setiathome_enhanced version 513
Fri Jun 9 23:51:17 2006|SETI@home|Deferring task 21fe99ab.27562.17218.336072.3.71_1
Fri Jun 9 23:51:23 2006||Rescheduling CPU: result suspended, resumed or aborted by user
Fri Jun 9 23:51:24 2006|SETI@home Beta Test|Pausing task 01jn01aa.1212.13761.254826.3.235_1 (removed from memory)
Fri Jun 9 23:51:24 2006|SETI@home|Restarting task 21fe99ab.27562.17218.336072.3.71_1 using setiathome_enhanced version 513
Please advise if you need any additional info, and thanks for all your work with this! ____________ | |
| ID: 332783 | | |
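The "Found app_info.xml; using anonymous platform" line in the log above is BOINC reporting that it found a hand-installed application description in the project folder. As a rough illustration of what such a file contains, the sketch below writes a minimal `app_info.xml`. The app name `setiathome_enhanced` and version `513` come from the log. The helper name `write_app_info` and the binary file name you pass in are placeholders, since the actual file names shipped with the optimized builds are not given in this thread.

```shell
#!/bin/sh
# Sketch only: write a minimal BOINC anonymous-platform app_info.xml.
# write_app_info is a hypothetical helper; the binary name is whatever
# file you actually downloaded (unknown here, so passed as an argument).
write_app_info() {
    dir="$1"      # project directory that will hold app_info.xml
    binary="$2"   # file name of the optimized executable in that directory
    cat > "$dir/app_info.xml" <<EOF
<app_info>
    <app>
        <name>setiathome_enhanced</name>
    </app>
    <file_info>
        <name>$binary</name>
        <executable/>
    </file_info>
    <app_version>
        <app_name>setiathome_enhanced</app_name>
        <version_num>513</version_num>
        <file_ref>
            <file_name>$binary</file_name>
            <main_program/>
        </file_ref>
    </app_version>
</app_info>
EOF
}
```

The executable named in the file is expected to sit in the same project folder as `app_info.xml`. If a download already ships its own `app_info.xml`, use that one rather than this sketch.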
I just installed it on my [edit] imac [/edit] PPC G3 350 and it seems to be working fine so far. I'll keep you guys posted as to successful completion and times. Current estimated time for the work unit is 41 hours. I haven't released any compiles for G3, so you should revert to the stock client. In fact, I can guarantee that you'll run into problems as soon as you hit the parts of the code that use Altivec, which are all over the place. Thank you all for your messages of support, although I recognize a bunch of you from the MacNN forums... :) | |
| ID: 332904 | | |
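Since the optimized builds depend on AltiVec, it may help to check for it before installing. A minimal sketch, assuming your OS X version exposes the `hw.optional.altivec` sysctl key (it reports 1 on AltiVec-capable G4/G5 machines). The function name `has_altivec` and its optional override argument are made up for this example.

```shell
#!/bin/sh
# Sketch only: succeed if the machine reports AltiVec support.
# has_altivec is a hypothetical helper. An explicit first argument
# overrides the sysctl lookup, which is handy for dry runs.
has_altivec() {
    # If the key is missing or sysctl is unavailable, value stays empty
    # and the check simply fails.
    value="${1:-$(sysctl -n hw.optional.altivec 2>/dev/null || true)}"
    [ "$value" = "1" ]
}
```

Typical use would be something like `has_altivec && echo "OK to try the G4/G5 build" || echo "stay on the stock client"`. On a G3 the check should fail, which matches the advice above to revert to the stock client.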
I just installed it on my [edit] imac [/edit] PPC G3 350 and it seems to be working fine so far. I'll keep you guys posted as to successful completion and times. Current estimated time for the work unit is 41 hours. I didn't have any errors yet, but I followed your advice and removed the optimized client. Upon resuming the work, however, I immediately got an unrecoverable error. Oh, well. Can you recommend an optimized client for G3s (does one exist)? Jack ____________ | |
| ID: 333123 | | |
|
So far, it seems that for results with the same credit claim, the G4 compile is really around 8-10% faster than the G5 one (on my G4 MacMini). Great work Alex! | |
| ID: 333420 | | |
|
Hi Alex, | |
| ID: 333557 | | |
Let me know when you've got your v5 code tarball on your web site. I'd like to see what you've done to see if any of it can be folded back into the main client... Check your email. :) Also, the source to v5 is up, so anyone else who's curious can now look and see what I've done. This also makes it a lot harder for me to decide that I don't want my client to be used or distributed anymore. ;) | |
| ID: 333578 | | |
|
Now that I see you are here Alex, I would like to say thank you for your efforts - they are appreciated very much (and Rick's work too)! (I'm not a member at the other forums so I can't post there) | |
| ID: 333750 | | |
|
OK, can anyone walk me through running fft_test3 (wisdom) in single user mode? I know how to do it in Terminal, I'm just looking for the most accurate wisdom file I can get. | |
| ID: 336829 | | |
OK, can anyone walk me through running fft_test3 (wisdom) in single user mode? I know how to do it in Terminal, I'm just looking for the most accurate wisdom file I can get.
Here's what I do:
1) Place fft_test3 in Applications.
2) Shut down the computer.
3) Reboot and hold Command + S while the computer restarts.
4) You will see a black screen with white text and a command prompt at the root directory.
5) You will need to mount the drive read-write to write the file. At the command prompt, type: /sbin/mount -uw / (NOTE the space after w).
6) At the command line, type: cd Applications
7) Type: ls to list the directory contents. You should see the fft_test3 file listed.
8) Type: ./fft_test3 and hit enter.
9) The program SHOULD run and write the Wisdom.sah file to the Applications directory.
10) Once complete, type: reboot and OS X will load the desktop as normal.
11) Make sure BOINC is closed.
12) Go to Applications and move the Wisdom.sah file to your Library/Application Support/BOINC Data/projects/setiathome.berkeley.edu folder.
13) Restart BOINC.
FYI: If you get a permission error after step 8, type "chmod 755 fft_test3" at the command prompt and hit enter, then retype ./fft_test3 and hit enter. You can also use the "sudo" command if your station is a stand-alone and security concerns are minimal. There are other ways; these are two I have used.
Note: You may want to wait until Alex posts a revised fft_test3. When I compared this latest wisdom to the previous version, it appeared slower overall on my G4. Alex identified the problem and will intro an update shortly. Watch the Team MacNN forum thread for an update. Hope this helps. Good luck! ____________ | |
| ID: 337018 | | |
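Steps 11 to 13 above (moving the finished wisdom file into the project folder) can be sketched as a small shell helper for use after rebooting back into OS X. The file name `Wisdom.sah` and the `projects/setiathome.berkeley.edu` folder come from the post. The function name `install_wisdom` and its two arguments are made up for this example, and it assumes BOINC has already been shut down.

```shell
#!/bin/sh
# Sketch only: move a freshly generated Wisdom.sah into the SETI@home
# project folder. install_wisdom is a hypothetical helper.
install_wisdom() {
    src_dir="$1"    # directory where fft_test3 wrote Wisdom.sah
    data_dir="$2"   # BOINC data directory
    project_dir="$data_dir/projects/setiathome.berkeley.edu"
    if [ ! -f "$src_dir/Wisdom.sah" ]; then
        echo "no Wisdom.sah in $src_dir; run ./fft_test3 there first" >&2
        return 1
    fi
    if [ ! -d "$project_dir" ]; then
        echo "project directory not found: $project_dir" >&2
        return 1
    fi
    # Refuses to run without a wisdom file; replaces any older copy.
    mv "$src_dir/Wisdom.sah" "$project_dir/Wisdom.sah"
}
```

With the default install location shown in the BOINC startup log earlier in the thread, the call would look like `install_wisdom /Applications "/Library/Application Support/BOINC Data"`. You may need to prefix it with elevated permissions depending on who owns the BOINC Data folder.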
|
Thanks Gecko, just what I needed. I'm starting to get a tiny bit familiar with the CLI, but I'm still a LONG way from knowing what the fsck I am doing. | |
| ID: 337404 | | |
Thanks Gecko, just what I needed. I'm starting to get a tiny bit familiar with the CLI, but I'm still a LONG way from knowing what the fsck I am doing. You bet. I'm still an infant myself w/ the command line and learn a little at a time. I don't see the update on the forum unless the original post from the 12th has the updated file, BUT the post doesn't show that it was edited, so I think this is still the original build he intends to revise. I encourage you to join the MacNN forum. Real good group of guys that you can count on for help. VERY strong support and appreciation for Alex's (& Rick's & Mikkyo's) efforts. They've kept our PPC rigs competitive with the best out there and the Quads at the top of the computer list (except for that 32 CPU SMP rig currently on top). ____________ | |
| ID: 337436 | | |
|
Bump! | |
| ID: 342034 | | |
Needs to be on the front page, it's just too good to hide. Indeed. @Alex Kan: Could you post links to your current G4 & G5 versions in the New Optimized Apps thread? That way they’ll be easier for people to find. Or do you still consider them to be ‘in beta’? (FWIW they’re working just fine so far on three G4s here, all running OS 10.3.9.) ____________ | |
| ID: 342724 | | |
@Alex Kan: Could you post links to your current G4 & G5 versions in the New Optimized Apps thread? That way they’ll be easier for people to find. Or do you still consider them to be ‘in beta’? (FWIW they’re working just fine so far on three G4s here, all running OS 10.3.9.) Second that motion. I was wondering why you hadn't done that yet, Alex, since the other posts there were just OT anyway. Your client is the first new *released* optimized client, so it should really be there. Regards, Simon. ____________ Donate to SETI@Home via PayPal! Optimized SETI@Home apps + Information | |
| ID: 342777 | | |
I was wondering why you hadn't done that yet, Alex, since the other posts there were just OT anyway. Because I think I still have a v6 left in me. Your clients are turning over really impressive times on AMD chips, which leads me to suspect that your times on Intel chips will be even more impressive. Yes, you're reading that right--I feel threatened. :P That, and I realized that I can make (and even force) some assumptions that a compiler can't in a few places, so I might as well take care of those. I wish there was something like your MKL libraries for FFTs on PowerPC, though. FFTW is much faster than vDSP (Apple's performance library) at the FFT sizes that matter to us, so it's really my only option. Writing a faster FFT is well outside my area of expertise. :/ | |
| ID: 343441 | | |
|
"Alex Kan" wrote:
Have you considered getting in touch with the FFTW folks and asking them about further optimizations to FFTW? At least one of the original authors is now a Prof at MIT and they =all= are "performance hot-rodders". They want FFTW to be the fastest FFT available on any HW and under any OS. Given your facility with assembly hacking and your experience trying to make the s@hE app faster, talking to the FFTW folks could be very fruitful. ____________ | |
| ID: 343487 | | |
Perhaps this might interest Akos Fekete? He appears to thrive on interesting challenges, though he does have a full plate these days w/ Einstein's S5 rollover. Still, he's been very accessible on the Einstein forum. ____________ | |
| ID: 343499 | | |
Perhaps this might interest Akos Fekete? He appears to thrive on interesting challenges, though he does have a full plate these days w/ Einstein's S5 rollover. Still, he's been very accessible on the Einstein forum. He's stated that he only works on Windows code, not even Linux/x86, so I doubt he could help much with Mac/PPC. ____________ | |
| ID: 343521 | | |
Have you considered getting in touch with the FFTW folks and asking them about further optimizations to FFTW? Either that, or I could talk to Apple about how much vDSP on PowerPC can/will be improved. I might do it if I feel the FFT is really becoming a bottleneck. For now, optimizing the analysis functions and all the associated housekeeping still makes a noticeable difference. Check out the New Optimized Apps Links thread--I've posted links to v6 in there. | |
| ID: 344526 | | |
http://setiathome.berkeley.edu/forum_thread.php?id=31810#344522 Thanks for that, Alex. I'm sure a lot of people will be glad to get their hands on those. And thanks for using that thread. Hopefully others will follow you there when they have their apps ready, and it'll be much easier to find the right app for the right machine in the future. The idea for the thread wasn't mine (I got it from the Einstein forum), but it works so well there that I thought it should be tried here. | |
| ID: 344579 | | |
|
Just installed v6 and watching it crunch :o) Too soon for results, but we shall see. | |
| ID: 344610 | | |
|
If you don't use graphics (or can live without them), go to the optimized apps links thread and snag the newest version for yet another speedup! Thanks to Simon Zadra of KWSN for pointing out that SETI runs slower when compiled with graphics, regardless of whether or not they're actually used. | |
| ID: 349708 | | |
|
You sure are welcome, Alex. | |
| ID: 349710 | | |
If you don't use graphics (or can live without them), go to the optimized apps links thread and snag the newest version for yet another speedup! Thanks to Simon Zadra of KWSN for pointing out that SETI runs slower when compiled with graphics, regardless of whether or not they're actually used. Excellent, that was quick! I'll be installing them as soon as I get the chance. ____________ | |
| ID: 349819 | | |
If you don't use graphics (or can live without them), go to the optimized apps links thread and snag the newest version for yet another speedup! Thanks to Simon Zadra of KWSN for pointing out that SETI runs slower when compiled with graphics, regardless of whether or not they're actually used. Hi Alex, I tried your transposed PoT in the CVS repository, but it generates a wrong result (I emailed Eric about this). Will you fix it? Eric wrote that he didn't have time to try it. ____________ Luckiest in the world. WMD = Weapon of Mass Distraction. Click this table. | |
| ID: 349838 | | |
|
Hi Alex, | |
| ID: 350413 | | |
You've done amazing things :o) I believe the speedup vs. the standard Mac app is larger than with the PC ones. I have timing numbers from the DP G5 1.8 I've been testing on, but I figured I'd hold off on posting them until you update your Pentium D numbers to take into account the second stick of RAM, since I imagine that'll give you a sizable speed gain. Is there data how much RAC a G5 can pull per core/total? Look at the G5s in the Top 10--most of them are running some variety of v6, and all of them are running one of my clients, judging from the work unit times. I think halimedia's Quad is still leveling off in terms of RAC, but even it probably won't end up at more than 2300 or so. (That Woodcrest at the top is something else entirely.) | |
| ID: 350584 | | |
|
Niiiiice numbers. | |
| ID: 350592 | | |
|
I've posted the source code to v6. Sorry about the delay, if anyone was looking to get their hands on it. | |
| ID: 352630 | | |
|
Before I leave for a while and forget about them, here's those benchmark numbers on a Dual 1.8 GHz G5 from a while back. (I had to manually adjust the chirp limits to 1/2.5, since IIRC in the package I downloaded, they were set to 4/10.) | |
| ID: 352665 | | |
|
Reviving an old thread to post some news... | |
| ID: 387178 | | |
| Copyright © 2009 University of California |