Message boards :
Number crunching :
ATTN Tetsuji Maverick- RE: SSE3 for AMD Dual Core
Message board moderation
Author | Message |
---|---|
Daniel Schaalma Send message Joined: 28 May 99 Posts: 297 Credit: 16,953,703 RAC: 0 |
Tetsuji, I will soon be putting a couple of new machines online, the AMD Athlon64 X2 Dual Core 4800+ units, and perhaps a Dual Core Opteron machine. All these CPU's have the SSE3 Instruction set, and I was wondering if you have any plans to compile a new version of the SETI science app to make use of SSE3 on these new AMD CPU's. The current SSE3 science app is evidently Intel specific. Thanks for your time, and BIG thanks for all the current optimized apps! Happy Crunching! Regards, Daniel. |
Tetsuji Maverick Rai Send message Joined: 25 Apr 99 Posts: 518 Credit: 90,863 RAC: 0 |
No I don't have any plan to build further applications. Sorry... And yes, SSE3 optimization code of ICC always checks the processor. So SSE2 version (W version) is the best way. There's little difference between SSE3 (P version) and SSE2 (W or N version) even on P4. Look http://www.marisan.nl/seti/reference.htm or other benchmark pages. SSE3 instruction set is just a small addition to SSE2. Maybe I can hack Intel's library technically and legally, but it won't benefit so much. Since I have no AMD chips, I will need your help for testing (just run small testing programs and give me the results.) It may be an interesting challenge. BTW marisan's English phpBB board looks hackable (I haven't tried.) Luckiest in the world. WMD = Weapon of Mass Distraction. Click this table. |
-= Vyper =- Send message Joined: 5 Sep 99 Posts: 1652 Credit: 1,065,191,981 RAC: 2,537 |
|
Ned Slider Send message Joined: 12 Oct 01 Posts: 668 Credit: 4,375,315 RAC: 0 |
There isn't really any real advantage to using SSE3. I built otherwise identical SSE2 and SSE3 clients for AMD Athlon64 on Linux and they benchmarked within a few seconds of each other (the SSE2 client was actually a few seconds faster!). This was on an Athlon64, not an X2, and on Linux, not Windows, but I see no reason for it to be any different. Ned *** My Guide to Compiling Optimised BOINC and SETI Clients *** *** Download Optimised BOINC and SETI Clients for Linux Here *** |
Tetsuji Maverick Rai Send message Joined: 25 Apr 99 Posts: 518 Credit: 90,863 RAC: 0 |
|
Hans Dorn Send message Joined: 3 Apr 99 Posts: 2262 Credit: 26,448,570 RAC: 0 |
BTW my RAC is zero :) Strange... Everyone except you is crunching away with your client :o) Regards Hans |
Tetsuji Maverick Rai Send message Joined: 25 Apr 99 Posts: 518 Credit: 90,863 RAC: 0 |
BTW my RAC is zero :) It's because now I'm up to the ears in beta test project. As you see in my computers, only two results are present in the database now. All my 3 machines are running setiathome_enhanced for beta test project. Sometimes (for ex. during the long outage, or while I'm compiling new version of crunchers) I attach to other projects. Beta is more efficient, but takes longer time. And now I hope major bugs are fixed. Now optimizatin is for testing/debugging in beta project to me. That's one of the reasons I'm reluctant on aggressive optimization now. Luckiest in the world. WMD = Weapon of Mass Distraction. Click this table. |
Tetsuji Maverick Rai Send message Joined: 25 Apr 99 Posts: 518 Credit: 90,863 RAC: 0 |
PS: And I am not concerned with credits so much (though if I told I'm not interested in credits at all, it would be a lie.....) So I am pouring full computer power to beta project whose credits don't appear in the stats. |
Hans Dorn Send message Joined: 3 Apr 99 Posts: 2262 Credit: 26,448,570 RAC: 0 |
PS: Fine. I'll be leeching .... erm, downloading your patches as soon as seti_enhanced gets out :o) Regards Hans |
Speedy67 & Friends Send message Joined: 14 Jul 99 Posts: 335 Credit: 1,178,138 RAC: 0 |
|
Tetsuji Maverick Rai Send message Joined: 25 Apr 99 Posts: 518 Credit: 90,863 RAC: 0 |
Hi TMR, To tell the truth, I tried just after I registered :) It's not hackable to me. I wrote "It looks hackable" because it looked older. In the bottom, it says "Powered by phpBB © 2001, 2002 phpBB Group ;" Now I think it means the skin (or template?) of the forums is old. As you may know, earlier versions of phpBB (2.0.12 or earlier) are very easy to hack. By changing cookie, any member can be the admin, and hijack the forums. For further details see this. BOINC synergy or another stats forum (I forgot) had this problem before, and I emailed to the admin. Much earlier versions (2.0.9? or earlier) had more serious problem. Anyway when I find a vulnerability I won't post in the public, instead email to the admin ASAP :) Luckiest in the world. WMD = Weapon of Mass Distraction. Click this table. |
Chilean Send message Joined: 6 Apr 03 Posts: 498 Credit: 3,200,504 RAC: 0 |
No I don't have any plan to build further applications. Sorry... And yes, SSE3 optimization code of ICC always checks the processor. So SSE2 version (W version) is the best way. There's little difference between SSE3 (P version) and SSE2 (W or N version) even on P4. Look http://www.marisan.nl/seti/reference.htm or other benchmark pages. Odd.... on the benchamarks... your version 7 is faster than your 8.1 |
Speedy67 & Friends Send message Joined: 14 Jul 99 Posts: 335 Credit: 1,178,138 RAC: 0 |
Hi TMR, I know, I try to keep things up-to-date. I was running 2.0.16, updated to 2.0.17 today just to be sure. Thanks for the concern anyway. :) Why it says © 2001, 2002 I don't know... I'm looking into it. Maybe something's not as is should be, and you turn out to be right after all.. :)
Thanks. :) Greetings, Sander |
Tetsuji Maverick Rai Send message Joined: 25 Apr 99 Posts: 518 Credit: 90,863 RAC: 0 |
yes, on my p4 boxes, 8.1 was always faster... Several people reported 7 was faster on this forum. I don't know why. So you'd better try both versions yourself on your boxes to your content. |
[BOINCstats] Willy Send message Joined: 4 Mar 01 Posts: 202 Credit: 152,243 RAC: 0 |
BOINC synergy or another stats forum (I forgot) had this problem before, and I emailed to the admin. I guess that was me. I don't trust the security of PHPBB anymore. I updated after the warning, and kept up-to-date as soon as an update was available. Still my server was hacked by using a PHPBB exploit even though I was running the lastest (4 days old) version op PHPBB. But this is all way off topic. Join team BOINCstats |
Tetsuji Maverick Rai Send message Joined: 25 Apr 99 Posts: 518 Credit: 90,863 RAC: 0 |
BOINC synergy or another stats forum (I forgot) had this problem before, and I emailed to the admin. sorry to mention it here :( But in reality there are still a lot of forums using older versions of phpBB. Luckiest in the world. WMD = Weapon of Mass Distraction. Click this table. |
Karsten Vinding Send message Joined: 18 May 99 Posts: 239 Credit: 25,201,931 RAC: 11 |
Regarding making a specialized version with SSE3 for AMD 64 with these instructions. I can't help you with the programming, but I found a handy little tool called iccpatch. It can be found on this page, together with some documentation: http://www.swallowtail.org/naughty-intel.html#patches This checks binaries compiled with ICC9 for specific processor checks, and if it finds any it replaces them with noop commands. The only thing it leaves behind is the check for the processors abilities, and if the processor reports ability to run SSE3, it will work on the processor, regardless if its an AMD or Intel processor. I have only checked the program on my AMD Sempron, with the SSE versions of TMR's clients, and it identifies and removes some commands. I don't see any difference in crunch time on my machine, and didn't expect to either. But I'm positive the program could be used to make the SSE3 compiles work on AMD64 processors with SSE3 as well. |
Tetsuji Maverick Rai Send message Joined: 25 Apr 99 Posts: 518 Credit: 90,863 RAC: 0 |
Tetsuji, I will soon be putting a couple of new machines online, the AMD Athlon64 X2 Dual Core 4800+ units, and perhaps a Dual Core Opteron machine. All these CPU's have the SSE3 Instruction set, and I was wondering if you have any plans to compile a new version of the SETI science app to make use of SSE3 on these new AMD CPU's. The current SSE3 science app is evidently Intel specific. Thanks for your time, and BIG thanks for all the current optimized apps! I found it how to cheat ICC (not IPP yet and forever), but I also found it illegal (see this post for details) and so I won't tell how to or build it. Sorry..... Luckiest in the world. WMD = Weapon of Mass Distraction. Click this table. |
Daniel Schaalma Send message Joined: 28 May 99 Posts: 297 Credit: 16,953,703 RAC: 0 |
No I don't have any plan to build further applications. Sorry... And yes, SSE3 optimization code of ICC always checks the processor. So SSE2 version (W version) is the best way. There's little difference between SSE3 (P version) and SSE2 (W or N version) even on P4. Look http://www.marisan.nl/seti/reference.htm or other benchmark pages. Tetsuji, thanks much for the info. I was not aware that there was so little difference between the SSE2 and SSE3 instruction set. You'll have to forgive my ignorance, as I have not done any programming since 1983, and even then, I was only working with BASIC, and it didn't require a compiler. I guess I should break down and buy a copy of Visual C++ and start learning it. I will let you know when I get the new dual core chip. I should be in a position to buy it in about two weeks. I would be happy to do some test crunching on it for you, if there is anything that you plan on coding for it. Sorry it took so long to reply, but I have been sick with pneumonia, and I have not spent much time here at the keyboard. Regards, Daniel. |
James Von Schmittou Send message Joined: 4 Jun 02 Posts: 5 Credit: 983,878 RAC: 0 |
i have a dual core on the way, and would be willing to test the SSE3 version "if you make it". |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.