| Author | Message |
|
|
|
The original thread can be found here -
http://setiathome.berkeley.edu/forum_thread.php?id=31655
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
Any work performed or produced as a result of use of the Materials cannot be performed or produced for the benefit of other parties for a fee, compensation or any other reimbursement or remuneration.
This could possibly impinge on seti/BOINC funding to UCB.
I don't see how, as long as the person using the compiler isn’t being paid to do so by UCB (or anyone else). But, as I’ve said before, IANAL.
I agree with Odysseus here.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
I'm having trouble with CVS - I know someone posted how to get the enhanced CVS sources, the ones you get with the command from http://boinc.berkeley.edu/source_code.php are 4.xx. Can anyone paste the correct CVS checkout command again please? I'm too tired to wade through threads...
Regards,
Simon.
--edit
Never mind, found it by browsing the CVS web viewer.
cvs -d :pserver:anonymous:@alien.ssl.berkeley.edu:/home/cvs/cvsroot checkout -r setiathome_enhanced seti_boinc
- for anyone that's interested :o) That will grab the correct sources for you.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
FYI:
CVS sources compiled without real problems for me on Windows.
Someone forgot to remove a file from the project - amd64AnalyzeFuncs.cpp. It doesn't exist on the CVS anymore, but it's in the project file still.
Remove it, builds fine.
I will be testing this with my flags and ICC/IPP vs. my 5.15 builds and see if anything changes.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
and times, nothink
but times are ...
have you Comparisons with the original Client
Greetings from Germany NRW
Ulli 

|
|
|
|
|
|
As soon as I have results, I'll post them here :o)
Default 5.12 vs. Default 5.15 vs. Default 5.17
VS
Cruncher 5.12 My 5.15 My 5.17.
All on the same host, so the comparison should be halfway fair.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
As soon as I have results, I'll post them here :o)
Default 5.12 vs. Default 5.15 vs. Default 5.17
VS
Cruncher 5.12 My 5.15 My 5.17.
All on the same host, so the comparison should be halfway fair.
Regards,
Simon.
Simon,
Are you using the "Express" (free) version of Visual C?
-- Ned
____________
|
|
|
|
|
|
No, because I don't have a free 2003 version (and the 2005 express version croaks on some not-so-clean syntax somewhere, I forget).
Someone mentioned a free GUI-less 2003 version that's around - since the command-line options are in Visual Studio, I'm pretty sure that could be used to build.
For anyone interested in having Microsoft licenses available for personal use, you should really check out Microsoft certification...hint hint. Stop pirating, start getting free stuff from MS! Exams are much less expensive than licenses, but licenses are included when you take exams. Edumacate yourself ;o)
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
I'll ask that any links to stable clients be posted in the New Optimised Apps Links --READ ONLY THREAD thread.
If they are test clients, please keep them here. |
|
|
|
|
|
Ageless, that is exactly why I have not yet posted anything there.
I won't release things unless I'm sure about the license issues. Should I release anything, it will have been tested exhaustively on a number of different hosts beforehand.
I do want to make sure *everything* is as right as it could be before I take that step - can't speak for others trying their hand at optimizing.
Also, I have a couple of people as a test group. I won't risk a test build being less than perfect and getting distributed to lots of people, so I'm keeping it small for now.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
I won't risk a test build being less than perfect and getting distributed to lots of people, so I'm keeping it small for now.
Regards,
Simon.
probly wise move.. Can't wait to see all your efforts in action. My personal (and i'm sure from many others too) thanks for keeping to the struggle to build optimized apps and working so hard already.
Same goes to all others that have been doing their own work too. Its all much appreciated.
|
|
|
|
|
|
Yes it is by many of us, Very much appreciated! Excited also.:)
____________
|
|
|
|
|
|
Thanks again :o)
Well, after spending a few hours trying to get the CVS 5.17 sources to compile, I'm giving up on them for now.
Instead, I'll schedule a nice test run while I sleep - default vs. Crunch3r vs. mine on Windows, same on Linux, all for SSE2.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
Okay, here is the first round of test results. Bear in mind a test run takes me around 1-2 hours all in all (with collating results and rebooting twice).
The chirp_limit has been adjusted (to 1/2.5) so the units take comparatively less time, or one run would take 12-20 hours.
testWU-1 (AR: 0.60488405578199)
Windows 32-Bit
Default 5.15 with graphics
23m 33s (1411 seconds)
Crunch3r 5.12 SSE2
11m 30s (690 seconds) - 51.1% quicker
My 5.15 SSE2
11m 25s (685 seconds) - 51.5% quicker
Those 5 seconds difference are well within standard result variance, so I'd calll it a draw at this AR on Windows.
Linux 32-Bit
Default 5.12 (no X-Win installed, so no graphics? not sure)
19m 30s (1150 seconds)
Crunch3r 5.12 SSE2
14m 16s (856 seconds) - 25.57% quicker
My 5.15 SSE2
12m 49s (769 seconds) - 33.13% quicker
--------------------------------
Linux vs. Windows
Default client
Linux is 18.5% quicker.
Crunch3rs 5.12
Windows is 19.4% quicker.
My 5.15 builds
Windows is 12.2% quicker.
So to sum things up, Windows does seem quicker at this AR with optimized clients, but slower with the default (due to the graphics? probably).
Next unit I'll test has an AR of ~0.44, so scaling should be different there.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
Those look like fantastic results. Nice :) |
|
|
|
|
|
From what I have read about licensing issues, is not legal to distribuite any optimized app withouth the sources.
Here are some optimized apps, but no sources:
http://realtec.dyndns.org/web/vbforum/showthread.php?t=1245
____________
|
|
|
|
|
|
Simon, that is some very nice work there. Thanks for your efforts, and everyone else who is working on this as well!
|
|
|
|
|
|
baracutio has been notified that not offering source code for stuff you're distributing is against the GPL.
He used to offer Crunch3r's clients but took them down after being asked.
However, you're correct - that optimized BOINC app should also have a source package available to accompany it. It is not an optimized cruncher, though, but an optimized BOINC client.
Be aware that only made a difference when claimed credit depended on benchmarks. It doesn't anymore. So the real difference between using those and the default ones is zero.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
From what I have read about licensing issues, is not legal to distribuite any optimized app withouth the sources.
Here are some optimized apps, but no sources:
http://realtec.dyndns.org/web/vbforum/showthread.php?t=1245
For clarification purposes: The GPL does NOT require the source to be distributed with the executables. What it says is that the source must be made available if requested. This can be accomplished in many different ways and still be IAW the GPL. A text file with a link to the source can be included in the distribution or a simple statement telling people how to request it.
____________

|
|
|
|
|
|
Thanks for the clarification Sarge!
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
Thanks for the clarification Sarge!
No problem. :)
____________

|
|
|
|
|
|
Stats for second WU:
testWU-2 (AR: 0.4437317022742)
Windows 32-Bit
Default 5.15 with graphics
27m 37s (1657 seconds)
Crunch3r 5.12 SSE2
13m 04s (784 seconds) - 52.7% quicker
My 5.15 SSE2
13m 10s (790 seconds) - 52.6% quicker
Those 6 seconds difference are again well within standard result variance, so I'd calll it a draw at this AR on Windows (between optimized clients), too.
Linux 32-Bit
Default 5.12 (no X-Win installed, so no graphics? not sure)
23m 04s (1384 seconds)
Crunch3r 5.12 SSE2
16m 34s (994 seconds) - 28.2% quicker
My 5.15 SSE2
14m 52s (892 seconds) - 35.6% quicker
--------------------------------
Linux vs. Windows
Default client
Linux is 16.5% quicker.
Crunch3rs 5.12
Windows is 21.1% quicker.
My 5.15 builds
Windows is 11.4% quicker.
So the results seem to be pretty repeatable for the first two.
Next one coming right up.
Okay, here is the first round of test results. Bear in mind a test run takes me around 1-2 hours all in all (with collating results and rebooting twice).
The chirp_limit has been adjusted (to 1/2.5) so the units take comparatively less time, or one run would take 12-20 hours.
testWU-1 (AR: 0.60488405578199)
Windows 32-Bit
Default 5.15 with graphics
23m 33s (1411 seconds)
Crunch3r 5.12 SSE2
11m 30s (690 seconds) - 51.1% quicker
My 5.15 SSE2
11m 25s (685 seconds) - 51.5% quicker
Those 5 seconds difference are well within standard result variance, so I'd calll it a draw at this AR on Windows.
Linux 32-Bit
Default 5.12 (no X-Win installed, so no graphics? not sure)
19m 30s (1150 seconds)
Crunch3r 5.12 SSE2
14m 16s (856 seconds) - 25.57% quicker
My 5.15 SSE2
12m 49s (769 seconds) - 33.13% quicker
--------------------------------
Linux vs. Windows
Default client
Linux is 18.5% quicker.
Crunch3rs 5.12
Windows is 19.4% quicker.
My 5.15 builds
Windows is 12.2% quicker.
So to sum things up, Windows does seem quicker at this AR with optimized clients, but slower with the default (due to the graphics? probably).
Next unit I'll test has an AR of ~0.44, so scaling should be different there.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
Stats for third WU:
testWU-4 (AR: 1.2796485198966)
Windows 32-Bit
Default 5.15 with graphics
10m 46s (646 seconds)
Crunch3r 5.12 SSE2
4m 19s (259 seconds) - 59.9% quicker
My 5.15 SSE2
4m 17s (257 seconds) - 60.0% quicker
Those 2 seconds difference are again well within standard result variance, so I'd call it a draw at this AR on Windows (between optimized clients), too.
Linux 32-Bit
Default 5.12 (no X-Win installed, so no graphics? not sure)
8m 00s (480 seconds)
Crunch3r 5.12 SSE2
5m 02s (302 seconds) - 37.1% quicker
My 5.15 SSE2
4m 45s (285 seconds) - 40.6% quicker
--------------------------------
Linux vs. Windows
Default client
Linux is 25.7% quicker.
Crunch3rs 5.12
Windows is 14.2% quicker.
My 5.15 builds
Windows is 9.8% quicker.
The same sort of scaling seems to apply, roughly.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
Simon, are you planning on making your benchmark system available to the public at some point? I'm sure other people would be more than willing to help contribute benchmarks, not to mention that I'm kind of curious as to how a G5 running the Mac client stacks up. :P
Also, are these tests still being run on the 3.06 GHz P4 laptop you mentioned before? |
|
|
|
|
|
Well, this all sounds very, very interesting to us all, but where are the clients to download and test this whole things for us all? ;)
____________
Aloha, Uli
 |
|
|
|
|
|
No,
these are all being run on a Pentium-D 805 right now (stock).
Also, because the computer store didn't have two Corsair modules, it's only running single channel until they can restock. Still, it makes for very comparable results.
I installed Windows XP 32 and 64 Bit, and Linux 32 and 64 Bit (so a total of 4 Operating Systems) on this computer. This should enable me to do repeatable results without too much variance, because this system is doing nothing else.
Of course you can have the WUs I'm using to crunch these tests as well as the shell scripts I made to compare clients.
The WUs can be found here along with a little benchmark shell script.
Uli, I'll put together an SSE2 Linux package after I had some more sleep. Check the announcements thread tomorrow.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
(...) Uli, I'll put together an SSE2 package after I had some more sleep. Check the announcements thread tomorrow. Thanks, i'll come after that on monday, cause the SSE3 machine is out of reach right now, and now i only have a very decent 3Dnow! and a retired SSE machine at hand ;)
____________
Aloha, Uli
 |
|
|
|
|
|
Bam.
'nuff said :o)
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
During testing, I've found that the Linux default client I've been using to test doesn't contain graphics and is between 10 and 30% quicker than Windows with graphics!
If it's not too much of a stretch code-wise, it would really be a cool feature to be able to turn off the graphics via the BOINC manager, for example. Right now, if they're compiled into the client, they are ALWAYS drawn, not only when you're actually looking at them (when screensaver runs or you select "view graphics" in the manager).
It really seems a waste of perfectly good cycles even if you don't use an optimized client.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
@ KWSN - Chicken of Angnor
Thanks for all yor work. Just noticed your linux app availible. I'm sure you've made a lot of happy people :)
Didn't know the graphics were always drawn. Hopefully in the future they'll be seperated from the main app then it would be simple to just delete them if you don't want them or, as you say, an option in boinc would be nice.
Thanks again. Looking forward to your windows version being ready for release. |
|
|
|
|
Bam.
'nuff said :o)
Regards,
Simon.
Fantastic job Simon!
Congratulations and THANKS!!!.....and not only for an excellent opt. ap, but MOST importantly, for leading the charge and pulling the great spirit in this forum back together from the chaos that was consuming it a few weeks back. It is great to see the collaboration and support from everyone, and especially the support of those who have led previous opt. efforts that today's knowledge is built upon.
Well done!
____________
|
|
|
|
|
|
First of all, congratulations on releasing a client! I'm sure you've made a lot of people happy, and there will be even more happy people in the coming days... :)
If it's not too much of a stretch code-wise, it would really be a cool feature to be able to turn off the graphics via the BOINC manager, for example. Right now, if they're compiled into the client, they are ALWAYS drawn, not only when you're actually looking at them (when screensaver runs or you select "view graphics" in the manager).
It really seems a waste of perfectly good cycles even if you don't use an optimized client.
Wow, I hadn't realized this--just looked through the initialization code to confirm. Looks like I'm going to have to reissue my v6 clients to take this into account. And I thought I was being nice in addressing that tiny minority that wanted their optimized clients to have graphics as well... |
|
|
|
|
|
Good Stuff, Simon.
Any idea as to when you will have WinXP SSE2 and SSE3 clients available?
____________
|
|
|
|
|
Wow, I hadn't realized this--just looked through the initialization code to confirm. Looks like I'm going to have to reissue my v6 clients to take this into account. And I thought I was being nice in addressing that tiny minority that wanted their optimized clients to have graphics as well...
Could you offer both versions, one fully optimized and one with graphics? The screensaver may be a ‘selling point’ for some, especially those who are nostalgic for Classic. ;)
What proportion of its CPU time does the app actually spend on graphics? I presume most of the ‘heavy lifting’ for rendering & animating is done by the video board.
____________
 |
|
|
|
|
Bam.
'nuff said :o)
Regards,
Simon.
Hello!
Sorry, but it failed here (just copied seti + app_info.xml)
Slackware 10.1, P4:
./setiathome-5.15-sse2
Segmentation fault
and with boinc:
2006-06-26 06:44:02 [SETI@home] Unrecoverable error for result 18mr99ab.11394.5617.329836.3.246_0 (process got signal 11)
justr tried on another box, supporting SSE3. (Slackware 10.0)
Works!
Maybe some SSE3 found its way to the code?
BR
Iztok
my CPU, no sse3, does not work:
processor : 0
vendor_id : GenuineIntel
cpu family : 15
model : 3
model name : Intel(R) Pentium(R) 4 CPU 3.00GHz
stepping : 4
cpu MHz : 3001.345
cache size : 1024 KB
physical id : 0
siblings : 2
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 5
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe pni monitor ds_cpl cid xtpr
bogomips : 5931.00
another PC, sse3, works!
processor : 0
vendor_id : GenuineIntel
cpu family : 15
model : 3
model name : Intel(R) Pentium(R) 4 CPU 3.40GHz
stepping : 4
cpu MHz : 3394.635
cache size : 1024 KB
physical id : 0
siblings : 2
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 5
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe pni monitor ds_cpl cid xtpr
bogomips : 6701.05
|
|
|
|
|
|
more info on different CPUs:
works on P4 with sse2 and ss.
Works on Opteron 285.
fails on Xeon (both old and new)
fails on Pentium-M (laptop)
They all run Slack, different -current versions.
Test: before updating boinc, I just run ./setiathome:
iztok@az:~/ulov/setiathome-5.15-sse2$ ./setiathome-5.15-sse2
Segmentation fault
iztok@az:~/ulov/setiathome-5.15-sse2$ cat /etc/slackware-version
Slackware 10.1.0
BR, thanks
Iztok
/proc/cpuinfo for boxes:
fails:
processor : 3
vendor_id : GenuineIntel
cpu family : 15
model : 2
model name : Intel(R) XEON(TM) CPU 2.40GHz
stepping : 4
cpu MHz : 2392.263
cache size : 512 KB
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 2
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm
bogomips : 4771.02
processor : 3
vendor_id : GenuineIntel
cpu family : 15
model : 4
model name : Intel(R) Xeon(TM) CPU 3.20GHz
stepping : 3
cpu MHz : 3192.826
cache size : 2048 KB
physical id : 3
siblings : 2
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 5
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe lm pni monitor ds_cpl cid cx16 xtpr
bogomips : 6373.37
processor : 0
vendor_id : GenuineIntel
cpu family : 6
model : 13
model name : Intel(R) Pentium(R) M processor 1.80GHz
stepping : 6
cpu MHz : 1796.547
cache size : 64 KB
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 2
wp : yes
flags : fpu vme de pse tsc msr mce cx8 sep mtrr pge mca cmov pat clflush dts acpi mmx fxsr sse sse2 ss tm pbe tm2 est
bogomips : 3578.26
|
|
|
|
|
|
To anyone wanting the Visual C++ Toolkit 2003 (The free command line version from Microsoft), I didn't see it on their web site anymore. Instead they are pointing to their Visual Studio 2005 Express Edition (also free). That is located at http://msdn.microsoft.com/vstudio/express/]http://msdn.microsoft.com/vstudio/express/. I believe the Visual C++ Toolkit 2003 also had some sort of conflict also. If anyone still wants it however, I will try to locate it on my backup cds. I may have thrown it though. :( If I find it, I'll probably have to give it to someone elso to host as my i-net connection is at work. I doubt my boss would appreciate it. <BIG EVIL GRIN>
No, because I don't have a free 2003 version (and the 2005 express version croaks on some not-so-clean syntax somewhere, I forget).
Someone mentioned a free GUI-less 2003 version that's around - since the command-line options are in Visual Studio, I'm pretty sure that could be used to build.
For anyone interested in having Microsoft licenses available for personal use, you should really check out Microsoft certification...hint hint. Stop pirating, start getting free stuff from MS! Exams are much less expensive than licenses, but licenses are included when you take exams. Edumacate yourself ;o)
Regards,
Simon.
____________
William D. Gagliardi
 |
|
|
|
|
|
Just installed this client on My Dual Xeon 2.8 Ghz BOX. Running fine i'll give u some news about Crunch times when it finishes =)
____________
|
|
|
|
|
|
Simon, could you make an optimized client for FreeBSD?
I tried your optimized client with Linux emulator on FreeBSD, but it didn't work.
____________
|
|
|
|
|
|
Oops -
since some of you seem to be having trouble executing that binary on SSE2 systems, I must have mixed up some flags.
Since I recompiled what I had before (that worked on all SSE2 hosts) and thought I used the same script, it should have been okay.
But I think I switched -axWN to -xWN and that was what's wrong.
Silly mistake, nobody's perfect :o)
I'll update the binary in an hour or so when I get it recompiled.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
Good Stuff, Simon.
Any idea as to when you will have WinXP SSE2 and SSE3 clients available?
As soon as Intel release a Win32 free for personal non-commercial use package...
Right now, according to their page, that version is ONLY available for Linux. Their FTP server says different, and the same licenses work. So while you can use them this way, the legality of doing that is very unclear.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
Any idea as to when you will have WinXP SSE2 and SSE3 clients available?
As soon as Intel release a Win32 free for personal non-commercial use package...
Right now, according to their page, that version is ONLY available for Linux. Their FTP server says different, and the same licenses work. So while you can use them this way, the legality of doing that is very unclear.
Regards,
Simon.
...and what did Eben Moglen say to clarify the situation when you got in touch with him?
____________
|
|
|
|
|
|
I did not contact Eben yet.
Not because I don't think it's necessary but because I simply haven't found time yet.
However, at the Intel site, they mention SPECIFICALLY that the only operating system these free for personal non-com use packages are offered on is Linux. So that's a pretty clear statement (even though, like I said, you can manually download the necessary files and the license works).
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
The downloadable client package has been updated with an executable that should now really run on all SSE2-enabled systems.
The URL is still the same, but it now downloads a file called "setiathome-5.15-sse2-v2.tar.gz".
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
104 downloads and counting :o)
Wow. I didn't think it was gonna go that quickly after only being online for less than 16 hours...
So there were some more compatibility issues, namely: it wouldn't run on Linux 2.4 systems, only on 2.6.
That's now fixed, the links are the same, the package is updated. So have at it people :o) Bandwidth is not a problem.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
TobyVolunteer tester
 Send message Joined: Oct 26 00 Posts: 1001 Credit: 2,538,273 RAC: 1,572

|
|
Add one more download! Trying it on my workstation here at work. Dual core AMD64. It picked up 2 work units already in progress under the standard 5.12 client. this one and this one should be returned some time within the next few hours.
Ni!
____________
A member of The Knights Who Say NI!
For rankings, history graphs and more, check out:
My BOINC stats site |
|
|
|
|
|
I'll be watching :o)
Ni!
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
I did not contact Eben yet.
Not because I don't think it's necessary but because I simply haven't found time yet.
However, at the Intel site, they mention SPECIFICALLY that the only operating system these free for personal non-com use packages are offered on is Linux. So that's a pretty clear statement (even though, like I said, you can manually download the necessary files and the license works).
Regards,
Simon.
Obviously, Crunch3r found some legal way of creating and distributing his optimized WinXP apps.
Perhaps asking him about it would be useful?
____________
|
|
|
Eric KorpelaForum moderator Project administrator Project developer Project scientist
 Send message Joined: Apr 3 99 Posts: 562 Credit: 2,110,707 RAC: 330

|
|
Just a note to optimizers. 5.17 isn't ready for prime time (or even beta) yet. I've got some code issues to deal with in the short term. In particular the x86_64 chirp routine is untested.
On the bright side, that means people who want to send in optimized code for specific processors have time to do so.
Eric
____________
 |
|
|
|
|
|
EricVonDaniken,
to my knowledge he bought a license, and that's how :o)
Just my 0.02 bucks.
Thanks for the notice, Eric! (Korpela)
5.17 didn't seem to work too well no matter what I tried, so I gave up on it for now (CVS) until further notice. The apps are at 5.15, based on your source tarballs.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
EricVonDaniken,
to my knowledge he bought a license, and that's how :o)
Regards,
Simon.
A= can you verify that so we are sure of what thw truth is?
B= how much does a license cost?
____________
|
|
|
|
|
I did not contact Eben yet.
Not because I don't think it's necessary but because I simply haven't found time yet.
However, at the Intel site, they mention SPECIFICALLY that the only operating system these free for personal non-com use packages are offered on is Linux. So that's a pretty clear statement (even though, like I said, you can manually download the necessary files and the license works).
Regards,
Simon.
Obviously, Crunch3r found some legal way of creating and distributing his optimized WinXP apps.
Perhaps asking him about it would be useful?
From memory (and I could be wrong, it was a while ago) someone made a donation which allowed Crunch3r to either purchase it, or the donation was the license itself. I even believe it was specificly to allow the optimised apps to be made for SETI@home. This leads to the point that if someone donated to allow him to make the optimised SETI apps then isn't he showing a huge amount of bad form by removing them.
Live long and crunch.
____________
Paul
(S@H1 8888)
  And proud of it! |
|
|
Hans DornVolunteer developer Volunteer tester
 Send message Joined: Apr 3 99 Posts: 2236 Credit: 12,503,941 RAC: 2,003

|
|
A litlle OT, but Akos had to pull his latest patches because of validation issues.
A discussion about running optimized or not has started in this thread.
Some people are leaving Einstein already.
Regards Hans
____________
|
|
|
|
|
From memory (and I could be wrong, it was a while ago) someone made a donation which allowed Crunch3r to either purchase it, or the donation was the license itself. I even believe it was specificly to allow the optimised apps to be made for SETI@home. This leads to the point that if someone donated to allow him to make the optimised SETI apps then isn't he showing a huge amount of bad form by removing them.
Live long and crunch.
I remember the same thing happening. It was shortly after TMR released the first optimized app for windows. I don't know if Crunch3r was the recipient though...I seem to remember that whoever it was wished to remain annonomous.
____________
Boinc....Boinc....Boinc....Boinc |
|
|
|
|
|
Oi, 649 posts...
And about the donation and related questions...er...no idea, don't really want to comment, nothing good could come of it.
I had a different idea, which is to write to Intel directly (marketing or one the CEO's office) about supporting Seti@Home and giving a license to the project for the express purpose of offering optimized clients for everyone.
Not sure how that will go, I haven't written and sent that letter (yes, snail mail) yet, but I will.
If Intel doesn't want to do that, then I'd try a donation drive towards the same goal - not giving this licence to any one person but to the project.
I really don't believe, even though I spent lots of time doing what needed doing, that I am entitled to any recompensation. Sure, it would be nice, but noone asked me to, right?
I can only hope Intel notices my letter and is interested :o)
After all, there are a lot of us crunchers, and we all need hardware to crunch with...and the project could always use some more server hardware, too, but that's a different thing entirely (and I will not ask them for it in my letter...alluding to it maybe).
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
A litlle OT, but Akos had to pull his latest patches because of validation issues.
A discussion about running optimized or not has started in this thread.
Some people are leaving Einstein already.
Regards Hans
I think the gist of it is not so much the stable versions don't pass muster, but rather they aren't certified for use under the study protocol.
Still, it's causing a "mini flapdoodle" at the moment. ;-)
Alinator
|
|
|
Pappa Forum moderator Volunteer tester
 Send message Joined: Jan 9 00 Posts: 2115 Credit: 5,110,193 RAC: 9,101

|
|
The Gas Giant
Sorry but this is old news. Crunch3r does programing as a part of "business" he purchases Software to allow him to work... In this case Seti BOINC benifited... If he had a PayPal Link to help defer costs for time he donated here then "fine" No One, Paid for His Licenses! He paid for them... The sad part is that all the "credit mongers" should have been the First to Step Forward to Donate Money... But then the should have also dontated to Seti! They were getting Prime Credit for Cost/Hour....
So at this point, it is go figure! Someone wrung the neck of the goose that laid golden eggs... I hope they enjoyed the meal... For me the whole affair was was very unappitizing...
From memory (and I could be wrong, it was a while ago) someone made a donation which allowed Crunch3r to either purchase it, or the donation was the license itself. I even believe it was specificly to allow the optimised apps to be made for SETI@home. This leads to the point that if someone donated to allow him to make the optimised SETI apps then isn't he showing a huge amount of bad form by removing them.
Live long and crunch.
Pappa
____________
Please consider a Donation to the Seti Project.
 |
|
|
Hans DornVolunteer developer Volunteer tester
 Send message Joined: Apr 3 99 Posts: 2236 Credit: 12,503,941 RAC: 2,003

|
I had a different idea, which is to write to Intel directly (marketing or one the CEO's office) about supporting Seti@Home and giving a license to the project for the express purpose of offering optimized clients for everyone.
Regards,
Simon.
IMO the Seti project is pretty good PR for Intel's compiler, libraries and CPUs too.
Regards Hans
____________
|
|
|
|
|
|
Pappa: Like I said, I believe he paid for them :o) Enough drama.
and Hans, I believe that too.
Which is why I'm taking the time to try and do that right, or they'll just ignore me.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
A litlle OT, but Akos had to pull his latest patches because of validation issues.
Perhaps I just didn't sit down and really read what to do with the Einstein stuff, but it just seemed to be more complex than the "Average" user would be able to install. All the patching of .DAT files and whatnot... It shouldn't have been so public...and if patching of files was required then someone should've got an RTPatch license...IMHO.
I didn't bother with an optimized S5. Short run times for me were about 45 minutes with the long ones running at about 7.00 to 7.25 hours..on the stock app.
____________
 |
|
|
Pappa Forum moderator Volunteer tester
 Send message Joined: Jan 9 00 Posts: 2115 Credit: 5,110,193 RAC: 9,101

|
|
Simon
I wish you Luck! If you want a reference they you can give them my lowly al.setiboinc (at) Gmail.com addresss. I will be happy to add references and explain what is happening and how they "could" gain inroads into a "Sun Shop"...
From reading various BOINc Developers Froums, there are inroads into detecting the type of CPU to allow delivoring the correct Application according to the CPU type...
So Progress can be made, IF a Few people Try!
Oi, 649 posts...
And about the donation and related questions...er...no idea, don't really want to comment, nothing good could come of it.
I had a different idea, which is to write to Intel directly (marketing or one the CEO's office) about supporting Seti@Home and giving a license to the project for the express purpose of offering optimized clients for everyone.
Not sure how that will go, I haven't written and sent that letter (yes, snail mail) yet, but I will.
If Intel doesn't want to do that, then I'd try a donation drive towards the same goal - not giving this licence to any one person but to the project.
I really don't believe, even though I spent lots of time doing what needed doing, that I am entitled to any recompensation. Sure, it would be nice, but noone asked me to, right?
I can only hope Intel notices my letter and is interested :o)
After all, there are a lot of us crunchers, and we all need hardware to crunch with...and the project could always use some more server hardware, too, but that's a different thing entirely (and I will not ask them for it in my letter...alluding to it maybe).
Regards,
Simon.
____________
Please consider a Donation to the Seti Project.
 |
|
|
Hans DornVolunteer developer Volunteer tester
 Send message Joined: Apr 3 99 Posts: 2236 Credit: 12,503,941 RAC: 2,003

|
I think the gist of it is not so much the stable versions don't pass muster, but rather they aren't certified for use under the study protocol.
Alinator
I don't know if this argument holds water. IMO their validator should take care of this.
There are other sources for erroneous results:
- faulty hardware
- errors in the official apps
Regards Hans
____________
|
|
|
|
|
|
Well,
the whole problem for Akosf and with the Einstein apps is that they're closed source and he had only compiled code to optimize using manual assembly tuning.
The Seti@Home and BOINC clients are both GPL, the Einstein ones aren't. So I'm really not surprised (but sad for Akosf) that they decided to disallow (some of?) his apps.
And yeah, if you distribute a program, you should have a license for it...RTPatch isn't open source or free - but that's never stopped anyone from distributing it, heh...if you have a license, you're allowed to, as well - no idea whether he had one or not. There are free alternatives that are easy to use (even GUI-based).
So anyway, I've always just crunched for Seti, and that won't change anytime soon. No disrespect to other projects, but I'm staying right here :o)
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
Pappa Forum moderator Volunteer tester
 Send message Joined: Jan 9 00 Posts: 2115 Credit: 5,110,193 RAC: 9,101

|
|
Brian
A portion went into the S5 Application, now he is looking at what is happening and going to work... So If you look and read there are many people working on it... For Seti that work is happening in Seti Beta...
A litlle OT, but Akos had to pull his latest patches because of validation issues.
Perhaps I just didn't sit down and really read what to do with the Einstein stuff, but it just seemed to be more complex than the "Average" user would be able to install. All the patching of .DAT files and whatnot... It shouldn't have been so public...and if patching of files was required then someone should've got an RTPatch license...IMHO.
I didn't bother with an optimized S5. Short run times for me were about 45 minutes with the long ones running at about 7.00 to 7.25 hours..on the stock app.
____________
Please consider a Donation to the Seti Project.
 |
|
|
|
|
A litlle OT, but Akos had to pull his latest patches because of validation issues.
A discussion about running optimized or not has started in this thread.
Some people are leaving Einstein already.
Regards Hans
A combination of issues. In the testing, some patches indeed produced invalid results and this was "live testing" so those WUs were being uploaded and failing validation.
Biggest issue appears to be that Einstein proj. mgt. only recognizes "official aps" as sanctioned for doing actual science. Unlike S@H, Einstein tightly controls modifications to the applications. Akos's previous S4 opt. aps were released as Official "Betas" after approval of project Mgt. HIs last validating patch, S5T0712 was @ 35% faster than the std application. It would seem there wasn't unanimous approval of his efforts w/ the project and he was requested to tell everyone that "the party was over"....rather abruptly w/o any warning. Needless to say, folks while certainly appreciative and supportive of Akos, are pretty peeved at E@H project Mgt. They're still adjusting to very same things we all went through with migration to Enhanced in May ie. dramatically increased crunching time and credit methodology changes. Many of the comments and discussions on their board right now will look quite familiar to S@H forum participants.
____________
|
|
|
Pappa Forum moderator Volunteer tester
 Send message Joined: Jan 9 00 Posts: 2115 Credit: 5,110,193 RAC: 9,101

|
|
Hans
Sorry, You forgot one...
Thier Validator with the new Appilication could also be an issue... In Alpha/Beta it is a Bug! After Release it is an "un-documented feature."
I think the gist of it is not so much the stable versions don't pass muster, but rather they aren't certified for use under the study protocol.
Alinator
I don't know if this argument holds water. IMO their validator should take care of this.
There are other sources for erroneous results:
- faulty hardware
- errors in the official apps
Regards Hans
Regards
Pappa
____________
Please consider a Donation to the Seti Project.
 |
|
|
Hans DornVolunteer developer Volunteer tester
 Send message Joined: Apr 3 99 Posts: 2236 Credit: 12,503,941 RAC: 2,003

|
Hans, I believe that too.
Which is why I'm taking the time to try and do that right, or they'll just ignore me.
Regards,
Simon.
While you're at it, could you mention that I want a couple of woodcrests NOW :o)
Regards Hans
____________
|
|
|
|
|
Simon
I wish you Luck! If you want a reference they you can give them my lowly al.setiboinc (at) Gmail.com addresss. I will be happy to add references and explain what is happening and how they "could" gain inroads into a "Sun Shop"...
From reading various BOINc Developers Froums, there are inroads into detecting the type of CPU to allow delivoring the correct Application according to the CPU type...
So Progress can be made, IF a Few people Try!
Exactly! So that's why I'm trying to spearhead a friendly effort at getting us all to enjoy the potential benefits.
I would be glad if I can include you in my references, thanks for offering. I really don't know what to do first right now, good thing I still have time off from work (anyone looking for a good NT/Linux/Slowaris/BSD sysadmin / web developer / multi-os wizard? *grin*).
Seriously though, I'm really trying hard to make everything I do as transparent as possible to avoid any possible drama.
So far, it's going well :o)
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
Hans, I believe that too.
Which is why I'm taking the time to try and do that right, or they'll just ignore me.
Regards,
Simon.
While you're at it, could you mention that I want a couple of woodcrests NOW :o)
Regards Hans
Lol.
Yes, yes. A couple of Woodcrests to do the heavy lifting, and a couple of Core 2 Duo X6800s to play with. Er, there goes my mind :o)
Wouldn't we just love that.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
HI simon
Hi Pappa
Hi Hans
Pappa: Like I said, I believe he paid for them :o) Enough drama.
and Hans, I believe that too.
Which is why I'm taking the time to try and do that right, or they'll just ignore me.
Regards,
Simon.
Some old history from ..... 30 Nov 2005 ... 13:37:48 UTC ...
To be honest i think the windows version will be faster.
But you're talking about your new client, which you won't release,right?
Actually I don't want to buy ipp for $199 and icc for $399.
Crunch3r,
I have a fully licensed copy of MS Visual C++ 2003, but my trial versions of IPP, ICC and the Intel Math Kernel Library just expired as well. I haven't purchased them because I am so new to it, that I couldn't get it to compile without a buttload of error messages and warnings in the compiler output. If you would be willing to release your source code to me, I would be willing to buy a fully licensed copy of ICC & IPP, compile it with your instructions, and send it back to you to release to the public. You and your other source code contributors can take FULL credit for the clients, and I would get to learn a few things in the process. If you would be interested in this arrangement, let me know. If all I would need is the ICC and IPP, I should be able to swing the cost in about 2 weeks. If I would also need the Math Kernel Library, then it will take about 3 or 4 weeks to get the funds.
Regards, Daniel.
That Sounds good to me.
the above was copied from here .............. Posted .. on ... 30 Nov 2005 13:37:48 UTC
http://setiathome.berkeley.edu/forum_thread.php?id=23410#199350
Regards
Byron
|
|
|
|
|
|
The first thread used by TMR when trying to optimize a windows version with the Eval version is called "Compiling faster Windows client with Intel C++ compiler (and fftw?). It's an interesting read. This development sounds much like that one. TMR discusses licenses and such, there are even links to intel compiler forums, and many things that KWSN - Chicken of Angnor might find helpful.
Also, in the follow up thread called 2nd: Compiling faster Windows client with Intel C++ compiler (and fftw?), TMR Identifies the person/s who set him up with the license in this post.
Hope this helps
tony |
|
|
|
|
|
Thanks a lot for the links!
Lots of useful info. Huge threads, too :o) It'll take me a while to skim over them ...
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
and the third thread is 3nd: Compiling faster Windows client with Intel C++ compiler (and fftw?) |
|
|
|
|
|
and shortly after the 3rd thread came, New Linux client compiled with Intel C++ compiler? |
|
|
|
|
|
then, compile a faster linux client using ipp instead of fftw
then, Will you make optimized Windows client if I give the source?,
then, "Legal' optimized client for Windows,
then, Oh heck, just search TMR's post from 383 days ago and sooner, I'll stop the list here, but it continues for a while.
tony
NOTE: these were posted in chronological order of posting |
|
|
Pappa Forum moderator Volunteer tester
 Send message Joined: Jan 9 00 Posts: 2115 Credit: 5,110,193 RAC: 9,101

|
|
Tony
Thank You!
then, compile a faster linux client using ipp instead of fftw
then, Will you make optimized Windows client if I give the source?,
then, "Legal' optimized client for Windows,
then, Oh heck, just search TMR's post from 383 days ago and sooner, I'll stop the list here, but it continues for a while.
tony
NOTE: these were posted in chronological order of posting
____________
Please consider a Donation to the Seti Project.
 |
|
|
Pappa Forum moderator Volunteer tester
 Send message Joined: Jan 9 00 Posts: 2115 Credit: 5,110,193 RAC: 9,101

|
|
Byron
Thank You!
HI simon
Hi Pappa
Hi Hans
Pappa: Like I said, I believe he paid for them :o) Enough drama.
and Hans, I believe that too.
Which is why I'm taking the time to try and do that right, or they'll just ignore me.
Regards,
Simon.
Some old history from ..... 30 Nov 2005 ... 13:37:48 UTC ...
To be honest i think the windows version will be faster.
But you're talking about your new client, which you won't release,right?
Actually I don't want to buy ipp for $199 and icc for $399.
Crunch3r,
I have a fully licensed copy of MS Visual C++ 2003, but my trial versions of IPP, ICC and the Intel Math Kernel Library just expired as well. I haven't purchased them because I am so new to it, that I couldn't get it to compile without a buttload of error messages and warnings in the compiler output. If you would be willing to release your source code to me, I would be willing to buy a fully licensed copy of ICC & IPP, compile it with your instructions, and send it back to you to release to the public. You and your other source code contributors can take FULL credit for the clients, and I would get to learn a few things in the process. If you would be interested in this arrangement, let me know. If all I would need is the ICC and IPP, I should be able to swing the cost in about 2 weeks. If I would also need the Math Kernel Library, then it will take about 3 or 4 weeks to get the funds.
Regards, Daniel.
That Sounds good to me.
the above was copied from here .............. Posted .. on ... 30 Nov 2005 13:37:48 UTC
http://setiathome.berkeley.edu/forum_thread.php?id=23410#199350
Regards
Byron
____________
Please consider a Donation to the Seti Project.
 |
|
|
|
|
|
OK one more, a shortcut to the posts of TMR are here. I started from the oldest but when I got to something like post 380 or 390 (383 days ago), I quit working forward |
|
|
|
|
I don't know if this argument holds water. IMO their validator should take care of this.
There are other sources for erroneous results:
- faulty hardware
- errors in the official apps
Regards Hans
Very true, but then there is the case where 2 "faulty" test versions get teamed up, or a single FTV gets selected as the canonical. Since they went to an IR of 2, there's a pretty good chance that could happen.
I think they just want to limit the number of variables present when they post process the MSD at the end of the S5 run.
Alinator
|
|
|
Hans DornVolunteer developer Volunteer tester
 Send message Joined: Apr 3 99 Posts: 2236 Credit: 12,503,941 RAC: 2,003

|
Very true, but then there is the case where 2 "faulty" test versions get teamed up, or a single FTV gets selected as the canonical. Since they went to an IR of 2, there's a pretty good chance that could happen.
I think they just want to limit the number of variables present when they post process the MSD at the end of the S5 run.
Alinator
Yep. With an IR of 2 the chances are pretty high that bad results get validated.
It's a pity that there's no offline validator for Einstein.
Regards Hans
____________
|
|
|
|
|
|
Yeah,
thank Tetsuji for rescmp! Made my life a lot easier in testing result validity. Thanks again for sending me a link Hans.
In any case, it really seems there's some momentum gaining again here on the forums (as well as good will and helpful, friendly posts). Let's keep it up people :o)
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
Yeah,
thank Tetsuji for rescmp! Made my life a lot easier in testing result validity. Thanks again for sending me a link Hans.
In any case, it really seems there's some momentum gaining again here on the forums (as well as good will and helpful, friendly posts). Let's keep it up people :o)
Regards,
Simon.
LOL
The First Law of BOINC:
All forum flaps blow over eventually.
:-D
BTW keep plugging at it, my slugs desperately need help! ;-)
Alinator
|
|
|
|
|
|
I've said it before:
I'm not going away (read: you ain't gettin' rid of me THAT easily! :P) anytime soon - and I'll keep trying to get Windows as well as Linux apps released (and ultimately to get them to everyone like outlined before).
It may all take a bit of time, but rest assured it will get done. 16000 hits in the first 24h can't be that wrong :o) Good bit of motivation. Almost 1000 unique visitors, too.
Regards,
Simon.
P.S.: I'll huff and I'll puff and I'll blow your house down! *grin* "a storm in a glass of water", as a german proverb goes.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
BTW keep plugging at it, my slugs desperately need help! ;-)
Oh dear... You're running K6 processors? Are they at least K6-2?
____________
 |
|
|
|
|
BTW keep plugging at it, my slugs desperately need help! ;-)
Oh dear... You're running K6 processors? Are they at least K6-2?
K6-2/500's, but I have them running EAH and PAH, my Intels are over here right now, although one of the K6's picked up an SE when PAH went down with the power outage over the weekend.
Alinator
|
|
|
|
|
BTW keep plugging at it, my slugs desperately need help! ;-)
Oh dear... You're running K6 processors? Are they at least K6-2?
K6-2/500's, but I have them running EAH and my Intels over here right now.
Alinator
Ah... Well I have a K6-2/500 that's been sitting dormant for over 3 years now... The mb is really twitchy... I think it is an ASUS MVP3 chipset (or something like that). I picked it up because my DFI bit the dust, so I thought I'd try to get to AGP. The DFI was a workhorse (430TX chipset, overclocked to 6x83), while the ASUS board fussed about every timing...
____________
 |
|
|
|
|
Ah... Well I have a K6-2/500 that's been sitting dormant for over 3 years now... The mb is really twitchy... I think it is an ASUS MVP3 chipset (or something like that). I picked it up because my DFI bit the dust, so I thought I'd try to get to AGP. The DFI was a workhorse (430TX chipset, overclocked to 6x83), while the ASUS board fussed about every timing...
Yeah I read the MVP3 ASUS was fussy, and that was why I got the P5A which uses the ALi Aladdin V chipset and does AGP 2X as well with a BIOS and chipset driver upgrade. It's the best performing of the three.
The other two are a Compaq Presario 1200XL118 laptop and an HP Pavillion 6623. The integrated video and Phoenix BIOS hurts their performance somewhat although the Presario can come close to the P5A.
BTW, I do have a K6/300 on an ABIT TX5, and man it's virtually bulletproof (knock on wood). It been on virtually 24/7 for going on eight years without missing a beat.
Alinator
|
|
|
|
|
|
Oi,
I used to have a P5-AB (non-ATX version) which worked really nicely and overclocked my K6-2 333 to 400+ nicely and stably.
Damn, that was long ago...96?97? Big step up from the P133 I had before, that's for sure....mmh, memories of my first 3D card, a Voodoo 2, and playing the first games in real 3D...
The P133 ran at 166 and was my first ever overclock ;o) Necessary overclock, too, because the Microprose F1 sim wouldn't run at full detail without skipping frames at 133...before 3D accelerators, that was.
Cheers,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
LOL, the K6/300 was my step up from a 486DX-66.
It felt like going from a garbage scow to a warp driven starship! :-)
My own personal equipment that is. ;-)
Alinator |
|
|
|
|
|
When Crunch3r started he worked primarily with Linux/unix. Here's a quick link to his posts.
The first thread where I see made his windows version is Optmised Seti App, but there's not much here, then Optimized BOINC 5.2.7 for Windows and Linux from the 3rd week of November 2005, then !!! WANTED !!! OPTIMISED CLIENT -> WINDOWS x86 32bit or 64bit, then Optimized BOINC 5.2.13 for Windows, then BOINC 5.2.14 optimization on linux., then Part 2 - OPTIMISED CLIENT -> WINDOWS x86 32bit or 64bit
After this he stopped making optimized Boinc clients and went to SE to try and tweak the SE seti app. You might search those boards for more info.
tony
|
|
|
|
|
|
Thanks Tony!
I already read through most of the threads and found some helpful info there.
It also seems to jibe with what I read out of the Windows license stuff for the Intel packages - namely, that you can't really release stuff compiled with the trial version (and there is no non-com version on Windows according to the Intel page, only commercial trial versions which expire).
So that's why I decided to only release Linux for now, since my use and distribution classifies as personal and non-commercial according to the license distributed by Intel (and my understanding of it).
Barring further developments (I already posted about the as yet unsent snail mail letter), that won't change quickly. I have access to a commercial license but it ain't mine, so that's not an option. Having to ask people to download code, compile it for me then send it back each time just doesn't work, not to mention them maybe having issues and/or getting flak for their kind help.
So for now I'm mulling how exactly to word what I'm writing, who all to include, etc etc. as well as trying to figure out why my Linux MMX/SSE version doesn't want to run on AMD hosts (gives me "illegal instruction" when I try no matter what optimization level).
A Windows How-To is also in the works even if some of the tools necessary are not free (but they're cheap on eBay ;o) ).
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
Oi,
I used to have a P5-AB (non-ATX version) which worked really nicely and overclocked my K6-2 333 to 400+ nicely and stably.
Damn, that was long ago...96?97? Big step up from the P133 I had before, that's for sure....mmh, memories of my first 3D card, a Voodoo 2, and playing the first games in real 3D...
The P133 ran at 166 and was my first ever overclock ;o) Necessary overclock, too, because the Microprose F1 sim wouldn't run at full detail without skipping frames at 133...before 3D accelerators, that was.
Cheers,
Simon.
Yeah It was around '96 or so, I had a Gigabyte GA5AX that had the same ALi Aladdin V chipset and I got a K6-3 to 550MHz, I never was able to get It to 600MHz, but It ran. Seems like a long time ago, My how time passes when You're havin fun.
____________
My Farm/PCs Specs
 |
|
|
|
|
Yeah It was around '96 or so, I had a Gigabyte GA5AX that had the same ALi Aladdin V chipset and I got a K6-3 to 550MHz, I never was able to get It to 600MHz, but It ran. Seems like a long time ago, My how time passes when You're havin fun.
I've got a never used -III/475 sitting around, I'll add the Gigabyte to my list of "keep my eye out for's". :-)
Sheesh, in looking through the bins, I don't even remember where some of this stuff came from anymore! LOL
Alinator
|
|
|
TobyVolunteer tester
 Send message Joined: Oct 26 00 Posts: 1001 Credit: 2,538,273 RAC: 1,572

|
|
Well the two results I linked to above have been reported. I was the first to report so I'll have to wait a while to see them validated.
And in the meantime 2 more have finished:
346850557 and 346850627.
Looks like they took 4.3 hours or so. The initial "to completion" estimate was 7.5 hours.
____________
A member of The Knights Who Say NI!
For rankings, history graphs and more, check out:
My BOINC stats site |
|
|
|
|
Yeah It was around '96 or so, I had a Gigabyte GA5AX that had the same ALi Aladdin V chipset and I got a K6-3 to 550MHz, I never was able to get It to 600MHz, but It ran. Seems like a long time ago, My how time passes when You're havin fun.
I've got a never used -III/475 sitting around, I'll add the Gigabyte to my list of "keep my eye out for's". :-)
Sheesh, in looking through the bins, I don't even remember where some of this stuff came from anymore! LOL
Alinator
Try ebay, There's some mention there.
ebay search ga-5ax motherboards, ram and cpus
____________
My Farm/PCs Specs
 |
|
|
|
|
Well the two results I linked to above have been reported. I was the first to report so I'll have to wait a while to see them validated.
And in the meantime 2 more have finished:
346850557 and 346850627.
Looks like they took 4.3 hours or so. The initial "to completion" estimate was 7.5 hours.
Very close to what my Athlon64 3500+ is doing using it (about half the time/2x speedup compared to default client) - it has only one core and only 512K cache, but runs at 2.2 GHz as well (dual channel, S939, 1GB PC3200 CL2 RAM).
I'm still doing runs Linux vs. Windows on the same Hardware, and I'm not convinced my benchmark results are correct (using shortened WUs), there Windows seems quicker.
On real WUs, the Linux client seems quicker, but I need to get less VLAR WUs to check that thoroughly ;o)
Ni!
I took the liberty of posting on the KWSN forums, as well. Someone mistook me for Alex Kan, so I linked his apps too ;o)
Regards,
Simon.
P.S. Joker could you please edit that URL so it only shows a name instead of the whole URL? Page width and formatting breaks again otherwise :o)
<url=http://a.b.c>Name for Link</url>
Replace <> with [], of course.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
Well the two results I linked to above have been reported. I was the first to report so I'll have to wait a while to see them validated.
And in the meantime 2 more have finished:
346850557 and 346850627.
Looks like they took 4.3 hours or so. The initial "to completion" estimate was 7.5 hours.
Very close to what my Athlon64 3500+ is doing using it (about half the time/2x speedup compared to default client) - it has only one core and only 512K cache, but runs at 2.2 GHz as well (dual channel, S939, 1GB PC3200 CL2 RAM).
I'm still doing runs Linux vs. Windows on the same Hardware, and I'm not convinced my benchmark results are correct (using shortened WUs), there Windows seems quicker.
On real WUs, the Linux client seems quicker, but I need to get less VLAR WUs to check that thoroughly ;o)
Ni!
I took the liberty of posting on the KWSN forums, as well. Someone mistook me for Alex Kan, so I linked his apps too ;o)
Regards,
Simon.
P.S. Joker could you please edit that URL so it only shows a name instead of the whole URL? Page width and formatting breaks again otherwise :o)
<url=http://a.b.c>Name for Link</url>
Replace <> with [], of course.
I used bb code in the 1st place, I never use html here, even though that is a trivial thing to do.
____________
My Farm/PCs Specs
 |
|
|
|
|
|
Thanks!
Kind of sucks that you can only edit posts for one hour, sorry if I sounded patronizing :o)
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
Thanks!
Kind of sucks that you can only edit posts for one hour, sorry if I sounded patronizing :o)
Regards,
Simon.
It's ok. Here's a Suzuki Katana that's been Customized somewhat and Yes I'm a fan of the man o steel.

____________
My Farm/PCs Specs
 |
|
|
|
|
Well the two results I linked to above have been reported. I was the first to report so I'll have to wait a while to see them validated.
And in the meantime 2 more have finished:
346850557 and 346850627.
Looks like they took 4.3 hours or so. The initial "to completion" estimate was 7.5 hours.
Take a look at this WU (or all that host's WUs).
Compare the first result (Athlon64 3800+ using Crunch3r's 5.12 on Windows) and the 3rd, (my Athlon64 3500+, using my Linux SSE2 build in a Linux VM on Windows).
This is a high-credit WU, of which there are really not a lot. One host took almost 180.000 seconds to return it.
There haven't been a lot of other optimized clients among the results I've looked at, and this seems a pretty close match (with 200 MHz clock speed advantage on the 3800+'s side).
Tests are underway to see which one is quicker on which hardware at which AR - AMD and Intel both.
I'm also going to investigate the unlikely case that Linux is actually quicker running in a VM on Windows than on its own...cause it sure seems like it so far, believe it or not.
Regards,
Simon.
P.S. on an unrelated note, I noticed with satisfaction that my pending credit is over 3000 again (hardly went above 1000 with enhanced so far), so it seems I'm returning more results than before.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
And thanks again Alex Kan!
He has released a version of his OS X G4/G5 optimized clients without graphics. This should make Macs even faster :o)
Kudos and hats off to him, as usual.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
Hmm, that does look good. 16627.984375 (3800) vs. 12023.583137 (3500). Although the CPU time was less, the granted credit result with your version was very interesting. [edit] Compare that to one of my results with a Pentium 4 532 3.06 GHz, 1 MB L2.
Yee Haw!
____________
|
|
|
|
|
Well the two results I linked to above have been reported. I was the first to report so I'll have to wait a while to see them validated.
And in the meantime 2 more have finished:
346850557 and 346850627.
Looks like they took 4.3 hours or so. The initial "to completion" estimate was 7.5 hours.
Take a look at this WU (or all that host's WUs).
Compare the first result (Athlon64 3800+ using Crunch3r's 5.12 on Windows) and the 3rd, (my Athlon64 3500+, using my Linux SSE2 build in a Linux VM on Windows).
This is a high-credit WU, of which there are really not a lot. One host took almost 180.000 seconds to return it.
There haven't been a lot of other optimized clients among the results I've looked at, and this seems a pretty close match (with 200 MHz clock speed advantage on the 3800+'s side).
Tests are underway to see which one is quicker on which hardware at which AR - AMD and Intel both.
I'm also going to investigate the unlikely case that Linux is actually quicker running in a VM on Windows than on its own...cause it sure seems like it so far, believe it or not.
Regards,
Simon.
P.S. on an unrelated note, I noticed with satisfaction that my pending credit is over 3000 again (hardly went above 1000 with enhanced so far), so it seems I'm returning more results than before.
Mine was above 3000 too come to think about It, Seems strange, And now It's around 2950 or so, Of course the valiators are workin It looks like, There seems to be a mystery a foot Me thinks and No I'm not Sherlock Holmes. It just sounds cool to Me. ;) I'm getting near the 500,000 club too, Then all I have left is to slowly close on the Uncle B's, Hardly a fair fight, My 10GHz vs 144GHz, Even with all modern cpus at near 44GHz soon, I don't think I can match them, I feel like a Mouse attackin an Elephant. Where's an elephant swatter when I need one. Ho hum.
____________
My Farm/PCs Specs
 |
|
|
TobyVolunteer tester
 Send message Joined: Oct 26 00 Posts: 1001 Credit: 2,538,273 RAC: 1,572

|
|
Just finished another. Looking at my results I see it claimed exactly the same credit as a work unit I finished with the standard client right before I switched which I assume means they are the same AR?
stock client = 11,615 seconds and optimized = 7,660 seconds. In both cases, credit claimed was 31.13
____________
A member of The Knights Who Say NI!
For rankings, history graphs and more, check out:
My BOINC stats site |
|
|
|
|
|
The AR is exactly the same on those two results, yes.
You'll notice different sorts of speedups with different ARs. The ones that take longest seem to be the quickest compared to the default client.
I'll shortly be removing my optimized apps for a couple of WUs on my hosts so I can get an idea how it looks.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
It also seems to jibe with what I read out of the Windows license stuff for the Intel packages - namely, that you can't really release stuff compiled with the trial version (and there is no non-com version on Windows according to the Intel page, only commercial trial versions which expire).
So that's why I decided to only release Linux for now, since my use and distribution classifies as personal and non-commercial according to the license distributed by Intel (and my understanding of it).
Barring further developments (I already posted about the as yet unsent snail mail letter), that won't change quickly. I have access to a commercial license but it ain't mine, so that's not an option. Having to ask people to download code, compile it for me then send it back each time just doesn't work, not to mention them maybe having issues and/or getting flak for their kind help.
Regards,
Simon.
How much do all the relevant licenses cost?
____________
|
|
|
|
|
|
Around 900 dollars AFAIK, but I haven't checked closely. Sadly, the licenses are time-limited, I believe that's for one year and includes ICC, IPP and MKL (the latter two being library packages).
Intel pricing page
Also don't forget both Linux and Windows (and maybe even OS X, for X86-based Macs) are required, so that doubles/triples normal license costs.
--edit
Intel Compiler for Linux - $399
Intel Compiler for Windows - $399
Intel Compiler for OS X - $399
IPP - $199
MKL - $399 (maybe not necessary, but probably faster than fftw).
From the pricing page, it seems that all supported OS flavours are included when you buy one license for the library packages. The compiler has to be licensed once per OS.
There's a promotional MacOS package available that includes Compiler, IPP and MKL and costs $549. Since the libraries seem to license cross-OS, that might be the cheapest path overall.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
Intel Compiler for Linux - $399
Intel Compiler for Windows - $399
Intel Compiler for OS X - $399
IPP - $199
MKL - $399 (maybe not necessary, but probably faster than fftw).
Yes, give me 2, please. :>
I've been buying intel chips since like 1989. Intel owes ME a lot, so I'll ask them to give me a license as a present. And then I'll give it to you.
I'll let you know how my petition goes. ;)
____________
/Mav

We have lingered long enough on the shores of the cosmic ocean.
We are ready at last to set sail for the stars.
(Carl Sagan) |
|
|
|
|
Around 900 dollars AFAIK, but I haven't checked closely. Sadly, the licenses are time-limited, I believe that's for one year and includes ICC, IPP and MKL (the latter two being library packages).
Simon.
Strange question.
Does all this legal rubish stop you from realeasing your 'un-compiled' work? before you've compiled it with the intel pakage its your intelectual property/ under gpl ect though obviously but they cant say you've broke the intel or any compilers license agreement can they?
What i mean is have the program you've made ready - compile it to test on your system - if it works then release the pre-compile source or tarball or what ever its called. Then we download it and compile it with the free :) intel (or whatever) compiler that anyone can download. :) With instructions would be helpfull :)
Sorry if i'm missing something obvious but i've only just started to try to learn to code myself and i wont even bother to try and understand the legal cr*p. |
|
|
|
|
|
No, the source code is your's to do with as you see fit.
The license only applies once you run it through the compiler and wish to release for public use.
I guess that was the point Maverick was making earlier, he can still work on optimizing with the Intel products, but he can't release them anymore because his license expired.
Alinator
<edit> On second thought, there might be a problem with providing the exact config files for the compiler, but I would think there's nothing wrong with including a text "crib" sheet to help folks along. |
|
|
|
|
No, the source code is your's to do with as you see fit.
The license only applies once you run it through the compiler and wish to release for public use.
I guess that was the point Maverick was making earlier, he can still work on optimizing with the Intel products, but he can't release them anymore because his license expired.
Alinator
So someone could release there optimized source material with some instructions on compiling and when to get the compilers(free?) and they wouldn't be breaking any rules. :)
Now just wish i know how to do all that. I doubt compileing is as easy as all that.
<edit> On second thought, there might be a problem with providing the exact config files for the compiler, but I would think there's nothing wrong with include a text "crib" sheet to help folks along.
Oh. darn it. Thanks for the info anyway. |
|
|
|
|
So someone could release there optimized source material with some instructions on compiling and when to get the compilers(free?) and they wouldn't be breaking any rules. :)
Now just wish i know how to do all that. I doubt compileing is as easy as all that.
Note see edit above.
Yep, that much is certain, it's a lot harder than it sounds!
Alinator
|
|
|
|
|
Around $900 AFAIK, but I haven't checked closely. Sadly, the licenses are time-limited, I believe that's for one year and includes ICC, IPP and MKL (the latter two being library packages).
Intel pricing page
Also don't forget both Linux and Windows (and maybe even OS X, for X86-based Macs) are required, so that doubles/triples normal license costs.
--edit
Intel Compiler for Linux - $399
Intel Compiler for Windows - $399
Intel Compiler for OS X - $399
IPP - $199
MKL - $399 (maybe not necessary, but probably faster than fftw).
From the pricing page, it seems that all supported OS flavours are included when you buy one license for the library packages. The compiler has to be licensed once per OS.
There's a promotional MacOS package available that includes Compiler, IPP and MKL and costs $549. Since the libraries seem to license cross-OS, that might be the cheapest path overall.
Regards,
Simon.
I find it hard to believe that Crunch3r was paying $900 US every year just so he could make BOINC and s@h optimizations!
Something does not make sense here. How is Berkeley making WinXP SW using the Intel stuff? How is MIT (FFTW)?
There has to be some form of academic/non-commercial/non-profit site licensing scheme available; and most likely one that is not time-bombed.
There are far more WinXP hosts than Linux hosts running BOINC, so it seems fairly important that we find a way to make those hosts run a effeciently and effectively as possible.
____________
|
|
|
|
|
EricVonDaniken,
to my knowledge he bought a license, and that's how :o)
Regards,
Simon.
A= can you verify that so we are sure of what thw truth is?
B= how much does a license cost?
A= if you search the forums, I think you'll find where he asked for donations.
B= I see someone has already given pricing.
____________
|
|
|
|
|
Around $900 AFAIK, but I haven't checked closely. Sadly, the licenses are time-limited, I believe that's for one year and includes ICC, IPP and MKL (the latter two being library packages).
Intel pricing page
Also don't forget both Linux and Windows (and maybe even OS X, for X86-based Macs) are required, so that doubles/triples normal license costs.
--edit
Intel Compiler for Linux - $399
Intel Compiler for Windows - $399
Intel Compiler for OS X - $399
IPP - $199
MKL - $399 (maybe not necessary, but probably faster than fftw).
From the pricing page, it seems that all supported OS flavours are included when you buy one license for the library packages. The compiler has to be licensed once per OS.
There's a promotional MacOS package available that includes Compiler, IPP and MKL and costs $549. Since the libraries seem to license cross-OS, that might be the cheapest path overall.
Regards,
Simon.
I find it hard to believe that Crunch3r was paying $900 US every year just so he could make BOINC and s@h optimizations!
All of the licenses I saw are perpetual, the support contracts do run out.
Are the support contracts needed? Probably not.
Something does not make sense here. How is Berkeley making WinXP SW using the Intel stuff? How is MIT (FFTW)?
There are several other compilers that are free (GCC) or less expensive.
It is important to remember that Crunch3r's big claim to fame was that he was not using the same compiler as Berkeley, and he licensed some libraries that are not free (IPP), and then told the compiler to produce code tailored to specific processors.
I think TMR did the work to figure out ICC/IPP (someone please correct me if I'm wrong) and ICC may or may not be the best optimizing compier out there. Moving existing source from one compiler to another is work, and it's not the fun kind of work, it's just a slow grind to find all the differences between compilers.
There has to be some form of academic/non-commercial/non-profit site licensing scheme available; and most likely one that is not time-bombed.
There are far more WinXP hosts than Linux hosts running BOINC, so it seems fairly important that we find a way to make those hosts run a effeciently and effectively as possible.
One way to do this is to simply publish an "optimization cookbook" with links to available tools, installation instructions and etc.
Then anyone can download an eval. license for the compiler and libraries, and build their own. They just can't distribute the compiled code without violating the Intel license.
Another way is to find someone (surely out of the 460,000-odd users out there someone must have it) who already has a license and have them do the final compiles for us. There are ways to do that without violating the license, and without making the compiler owner do lots of work.
____________
|
|
|
|
|
|
Craziness.
The best way to solve the problem is for Berekeley and/or BOINC to get the right licensing.
Frankly, I'd bet Berkeley does have a site Intel license for these tools and we just do not know it.
____________
|
|
|
|
|
|
And I believe you'd be wrong Eric -
where do you see any inkling that Berkeley have even used any of the Intel packages themselves? The official client is compiled using GCC(Linux) and Microsoft Visual C++ (Windows) and NOT using Intel's package.
No, I do not believe that Berkeley or Seti@Home has a license for them.
And yes, it seems that Crunch3rs license was paid for using donations via paypal for at least a part of the cost. I haven't asked for anything similar because I don't think it's the right way.
TMR said his license ran out (he also found a sponsor) - so if only the support packages run out, he should actually still be able to release things with it. That also would alleviate some of the licensing cost (the recurring part), obviously.
I agree that there is only a problem releasing Apps on Windows right now BUT - that only goes for me. The project itself cannot use the personal free for non-commercial tools, namely because it's not a person as such.
So to use them, they would definitely need a License for Linux, too.
There are of course academic license programs available (just check their site), for which the project may qualify. In any case, that is something that I can neither influence nor do for them - all I can do, as promised, is write to Intel and see whether they would like to help us out.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
On 6/16/06, Charles Elliott wrote:
> Why don't you use the Intel compiler and libraries? You
> would not have to do much, if any, re-writing as it can
> produce runtime versions that pick and choose what instructions
> to use based on the capabilities of the CPU.
I don't think the compiler actually does what you are suggesting here.
(I'm prepared to be convinced otherwise, though). The last version I
tried didn't (a couple years ago).
> Moreover, in the studies I have read the Intel tools are the
> best in the industry and produce the fastest executables.
Most of the time we have to develop to the most common tools rather
than the fastest. The cost in my time required to switch compilers is
far larger than the cost of the compiler. GCC, OTOH, works everywhere
on every platform.
Eric
the above was copied from here ............
http://www.ssl.berkeley.edu/pipermail/boinc_opt/2006-June/000764.html
|
|
|
|
|
|
Thanks Byron,
I'm subscribed to that list already :o)
Actually, you can now do exactly what Eric Korpela said in that message you couldn't, which is have one executable that combines optimization for a couple of platforms (didn't use to be possible but now works).
The relevant compiler switches on Windows are /QaX and /QxX where X is either K, W, N or B (K-> P3/SSE, W ->Generic SSE2, N -> P4 SSE2, B -> P4 SSE3). So you can also combine more than one to make it look like /QaKWNB, which would make a monolithic executable that's optimized from P3 upwards and executes everywhere.
It'll be larger than the original, of course, and it has about a 3% performance penalty vs. a "pure" optimized binary. Compared to the default, though, that's still massively quicker.
A point to remember, to be sure.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
The official client is compiled using GCC(Linux) and Microsoft Visual C++ (Windows) and NOT using Intel's package.
...
Simon.
Eric Korpela switched to DevC++/MinGW for the Windows builds in Beta starting with the 5.10 version, and has used that combination for all Windows releases here. He also keeps the VC++ project files updated, of course. Joe
|
|
|
|
|
|
...where do you see any inkling that Berkeley have even used any of the Intel packages themselves? The official client is compiled using GCC(Linux) and Microsoft Visual C++ (Windows) and NOT using Intel's package.
I'm not just talking about s@h or even BOINC here.
I'm talking about Berkeley and possibly even the U of C as whole.
There's a =ton= of academic research programming going on there, and most of it is on Wintel or Lintel boxes. It seems inconceivable that Berkeley as a whole, or possible even U of C, doesn't have some sort of "sweetheart deal" with Intel in regards to Intel's x86 libraries and build tools.
The "parallel path" is that if Intel's stuff really is making code that is 2-3x faster for SWAR than g++/gcc, that's a Problem the FSF compiler folks should care a great deal about.
However we do it, ~500-750K hosts burning 2-3x as much electricity, time, etc per result than they should be is Something That Needs to be Fixed.
...especially since this problem is going to get worse as more modern CPU's are introduced. The trend is towards specialized HW and instruction sets for things like FP and embarassingly parallel code.
The diffrence in performance of "ordinary" FP and SSE4 optimized FP on Core2 CPUs is something like between 1:4 to 1:8
____________
|
|
|
|
|
|
Well,
be that as it may - the Seti@Home project is a separate entity and I don't think its entitled to use Berkeley's licenses. They may of course have a deal with Intel in place, but that's really neither here nor there. Your friend may be married to a Supermodel, but does that mean she's going to sleep in your bed? :o) (no answers please :P)
I do agree that this problem is only going to worsen (like you said, optimized SSE code vs. traditional FPU code performance and similar things), and that the most favourable solution would be to offer a client that's as efficient as possible to EVERYONE.
Retaining users and making the most efficient use of the donated CPU time should also be on top of our list of things to do, so I'm with you there.
I'll repeat what I said before - as I have zero experience dealing with the "parallel path" (the FSF?), so feel free to jump in - I'm sure you can explain as coherently to them as to us what this is all about.
For my part, I'm about to finish a draft of my letter, which I'll be sending to Eric Korpela asking whether it's objectionable to the project in that form or not. I don't want to barge ahead without him/the project staff at least knowing what's happening.
I'll also post it (or a link to a copy of it) here.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
As to Windows binaries,
I just checked the license for Intel's evaluation versions, and it has this to say:
EVALUATION LICENSE: If you are using the Materials under the control of an Evaluation license, you as an individual may use the Materials only for internal evaluation purposes and only for the term of the evaluation time period, which is controlled by the license key code for the Materials. NOTWITHSTANDING ANYTHING TO THE CONTRARY ELSEWHERE IN THIS AGREEMENT, YOU MAY NOT DISTRIBUTE ANY PORTION OF THE MATERIALS, AND THE APPLICATION AND/OR PRODUCT DEVELOPED BY YOU MAY ONLY BE USED FOR EVALUATION PURPOSES AND ONLY FOR THE TERM OF THE EVALUATION PERIOD. You may install copies of the Materials on an unlimited number of computers provided that you are the only individual using the Materials and only one copy of the Materials is in use at any one time. A separate license is required for each additional use and/or individual user in all other cases. Intel will provide you with a license code key that enables the Materials for an Evaluation license. If you are an entity, Intel grants you the right to designate one individual within your organization to have the sole right to use the Materials in the manner provided above.
So yes, I could indeed use them to compile a client, which I could even distribute for TESTING PURPOSES ONLY and which could not legally be used after those 30 days (or distributed!).
I couldn't stop people "testing" it on BOINC for credits if I did that, either.
That's a no-go for me. I will not release stuff that I know to be taking advantage of a legal technicality (requiring people to confirm an EULA that makes them promise to use it only until trial expiration and then delete would put me in the clear, legally...but not ethically).
So unless a dark horse suddenly appears, no go on the Windows client from me for now.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
Er,
I'll be the first to point out I might be wrong about the non-com versions for Windows.
The same license states, a paragraph before (I should have looked and recognized this earlier):
NONCOMMERCIAL-USE LICENSE: If you are using the Materials under the control of a Noncommercial-Use license, you as an individual may use the Materials only for non-business use where you receive no fee, salary or any other forms of compensation. The Materials may not be used for any other purpose, whether "for profit" or "not for profit." Any work performed or produced as a result of use of the Materials cannot be performed or produced for the benefit of other parties for a fee, compensation or any other reimbursement or remuneration. You may install copies of the Materials on an unlimited number of computers provided that you are the only individual using the Materials and only one copy of the Materials is in use at any one time. A separate license is required for each additional use and/or individual user in all other cases. Intel will provide you with a license code key that enables the Materials for a Noncommercial-Use license. If you obtained a time-limited Noncommercial-Use license, the duration (time period) of your license and your ability to use the Materials is limited to the time period of the obtained license, which is controlled by the license key code for the Materials. If you are an entity, Intel grants you the right to designate one individual within your organization to have the sole right to use the Materials in the manner provided above.
Now that is on a page that SPECIFICALLY states this is a Windows version! However, there are only Linux non-com downloads available (linked on the Intel site).
When you receive an email from them with your Linux non-com license, the link points at ftp://download.intel.com, which is an anonymous FTP. Installers and packages for all their products and all operating systems are available there.
Also, like noted before, the library package licenses are for all supported operating systems, so Linux licenses will work on Windows, too. The Linux non-com ICC license also seemed to work fine...
Now, this dilemma is something that's worthy of putting to the FSF, to me. Had they not specifically included the non-commercial version paragraph into the license for the WINDOWS version, I would not have thought so.
Whatcha think?
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
I will not release stuff that I know to be taking advantage of a legal technicality (requiring people to confirm an EULA that makes them promise to use it only until trial expiration and then delete would put me in the clear, legally...but not ethically).
Sounds like that might also conflict with the GPL for the original sources, which forbids the imposition of further conditions on licensees.
<edit>Let me add that although I’m a Mac-oriented person and therefore not particularly interested in Windows or Linux optimizations, I congratulate you on the circumspect and methodical manner in which you’ve been proceeding (according to what I’ve seen ‘from the sidelines’) with respect to both testing for validity (protecting the potential scientific results) and investigating legal issues (protecting the project & its volunteers).</edit>
____________
 |
|
|
|
|
|
Thats not a license condition, but a distribution condition (technicality, but still).
And it's not where I want to go anyway :o)
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
EricVonDaniken,
to my knowledge he bought a license, and that's how :o)
Regards,
Simon.
A= can you verify that so we are sure of what thw truth is?
B= how much does a license cost?
License is found at Intel web site - I just purchased it, cost of about $600.00 USD for the compiler, distribution, and Primatives.
____________
|
|
|
|
|
|
If you want to get in touch with me (for example to get sources and instructions how to compile them), contact me at simon <insert funny sign here> zadra.org.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
Hans DornVolunteer developer Volunteer tester
 Send message Joined: Apr 3 99 Posts: 2236 Credit: 12,503,941 RAC: 2,003

|
License is found at Intel web site - I just purchased it, cost of about $600.00 USD for the compiler, distribution, and Primatives.
Wow!
Have one on me:

Cheers :o)
Regards Hans
____________
|
|
|
|
|
|
Second that :o)
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
I will not release stuff that I know to be taking advantage of a legal technicality (requiring people to confirm an EULA that makes them promise to use it only until trial expiration and then delete would put me in the clear, legally...but not ethically).
Sounds like that might also conflict with the GPL for the original sources, which forbids the imposition of further conditions on licensees.
<edit>Let me add that although I’m a Mac-oriented person and therefore not particularly interested in Windows or Linux optimizations, I congratulate you on the circumspect and methodical manner in which you’ve been proceeding (according to what I’ve seen ‘from the sidelines’) with respect to both testing for validity (protecting the potential scientific results) and investigating legal issues (protecting the project & its volunteers).</edit>
Thanks.
You might say recent events have somewhat influenced that, but a methodical and public approach seems best in any case. So far, there has been an amazing lack of drama, and I'd like to continue that.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
Craziness.
The best way to solve the problem is for Berekeley and/or BOINC to get the right licensing.
Frankly, I'd bet Berkeley does have a site Intel license for these tools and we just do not know it.
At this point, this is really just for us fanatics: people who are willing to do the work to load an app. that is exactly suited to the CPU.
The current alpha test version of BOINC detects the various capabilities of the processor.
Once it knows what the specific CPU can do, it can tell that to the project servers. Those servers need to know how to interpret that.
Then the project(s) can deliver applications more closely matched to the CPU.
Then it makes sense for the project to officially produce clients that use something better than a 486.
Until then, the default client has to run on almost everything.
____________
|
|
|
|
|
Craziness.
The best way to solve the problem is for Berekeley and/or BOINC to get the right licensing.
Frankly, I'd bet Berkeley does have a site Intel license for these tools and we just do not know it.
At this point, this is really just for us fanatics: people who are willing to do the work to load an app. that is exactly suited to the CPU.
The current alpha test version of BOINC detects the various capabilities of the processor.
Once it knows what the specific CPU can do, it can tell that to the project servers. Those servers need to know how to interpret that.
Then the project(s) can deliver applications more closely matched to the CPU.
Then it makes sense for the project to officially produce clients that use something better than a 486.
Until then, the default client has to run on almost everything.
I agree, Besides a 486 is old, Maybe not as old as a 6502 cpu, But It's old and I'd like to see someone try and run Linux on a 6502 cpu with 48k ram. I don't think crunching or what ever is even possible on something that old. :D ;)
____________
My Farm/PCs Specs
 |
|
|
Hans DornVolunteer developer Volunteer tester
 Send message Joined: Apr 3 99 Posts: 2236 Credit: 12,503,941 RAC: 2,003

|
I'd like to see someone try and run Linux on a 6502 cpu with 48k ram. :D ;)
Easy.
Write an emulator for a more recent CPU (ARM, etc..) and use your disk drive as main memory :o)
Just install a tiny linux distro when you're done.
Regards Hans
____________
|
|
|
|
|
I'd like to see someone try and run Linux on a 6502 cpu with 48k ram. :D ;)
Easy.
Write an emulator for a more recent CPU (ARM, etc..) and use your disk drive as main memory :o)
Just install a tiny linux distro when you're done.
Regards Hans
Easy? It would have to be all in Machine language, As the 6502 can only address 48k of ram and was a 1.79MHz cpu.
____________
My Farm/PCs Specs
 |
|
|
|
|
I'd like to see someone try and run Linux on a 6502 cpu with 48k ram. :D ;)
Easy.
Write an emulator for a more recent CPU (ARM, etc..) and use your disk drive as main memory :o)
Just install a tiny linux distro when you're done.
Regards Hans
Easy? It would have to be all in Machine language, As the 6502 can only address 48k of ram and was a 1.79MHz cpu.
Showing my age here... It could actually address 64K RAM native (though early machines came with a whopping 4K RAM, like the original Apple ][, not the ][+ or even the //e... LOL). The 6502 was later tricked into addressing 128K of RAM by "bank switching". It was originally a 1.00 MHz CPU, though later iterations got up to a whopping 3.6 MHz (TransWarp Accelerator).
The Apple //gs used to run a C-Shell and programming environment based on unix with a 2.8MHz 65816 (16-bit offspring of the 6502). It was called GNO, for Gno's Not Orca. It could address 8 MB RAM I believe. There was even one accessible via telnet at one point. It actually compiled and ran Mines of Moria (old text-based mapping adventure game) quite well. I solved it and killed the Balrog and thus survived to rule the Mines. But yes, number crunching of the type we do with BOINC would have been years-long ventures for a single WU, if limitations of cache and memory addressing were worked around.
____________

Kung Fu and Powerlifting |
|
|
|
|
I'd like to see someone try and run Linux on a 6502 cpu with 48k ram. :D ;)
Easy.
Write an emulator for a more recent CPU (ARM, etc..) and use your disk drive as main memory :o)
Just install a tiny linux distro when you're done.
Regards Hans
Easy? It would have to be all in Machine language, As the 6502 can only address 48k of ram and was a 1.79MHz cpu.
Showing my age here... It could actually address 64K RAM native (though early machines came with a whopping 4K RAM, like the original Apple ][, not the ][+ or even the //e... LOL). The 6502 was later tricked into addressing 128K of RAM by "bank switching". It was originally a 1.00 MHz CPU, though later iterations got up to a whopping 3.6 MHz (TransWarp Accelerator).
The Apple //gs used to run a C-Shell and programming environment based on unix with a 2.8MHz 65816 (16-bit offspring of the 6502). It was called GNO, for Gno's Not Orca. It could address 8 MB RAM I believe. There was even one accessible via telnet at one point. It actually compiled and ran Mines of Moria (old text-based mapping adventure game) quite well. I solved it and killed the Balrog and thus survived to rule the Mines. But yes, number crunching of the type we do with BOINC would have been years-long ventures for a single WU, if limitations of cache and memory addressing were worked around.
Of course the 6502 can go to 64k, Without an OS loaded that is and the Atari 400/800 OS was about 12k, The last Atari XE could go to 128k of course, But Apple was the only company to use the 65816 cpu, Atari never thought It was worth bothering with.
____________
My Farm/PCs Specs
 |
|
|
|
|
Showing my age here... It could actually address 64K RAM native (though early machines came with a whopping 4K RAM, like the original Apple ][, not the ][+ or even the //e... LOL). The 6502 was later tricked into addressing 128K of RAM by "bank switching". It was originally a 1.00 MHz CPU, though later iterations got up to a whopping 3.6 MHz (TransWarp Accelerator).
The Apple //gs used to run a C-Shell and programming environment based on unix with a 2.8MHz 65816 (16-bit offspring of the 6502). It was called GNO, for Gno's Not Orca. It could address 8 MB RAM I believe. There was even one accessible via telnet at one point. It actually compiled and ran Mines of Moria (old text-based mapping adventure game) quite well. I solved it and killed the Balrog and thus survived to rule the Mines. But yes, number crunching of the type we do with BOINC would have been years-long ventures for a single WU, if limitations of cache and memory addressing were worked around.
Of course the 6502 can go to 64k, Without an OS loaded that is and the Atari 400/800 OS was about 12k, The last Atari XE could go to 128k of course, But Apple was the only company to use the 65816 cpu, Atari never thought It was worth bothering with.
;-) Just responding to the stated clock-speed and address-space. Different OSes (and langs) installed of course affected the _available_ address-space.
Th 65816 was actually arguably useful in its incarnation in the //gs - the team working on GS/OS was the first to develop a color Finder (the desktop) and countless niceties that later made their way back up the chain to be used in MacOS. What's more, since we're interested in better-optimized code, the engineering team had to do it with such a slow processor that they actually coded in assembly to hand-tweak the workings for maximum speed, since the compilers were nowhere near up to that kind of optimization. In fact they were so good at it and UI ideas, many of them were 'stolen' away from the //gs development team and made to work on Mac dev. Andy McFadden is one notable example.
Ah, the good ol' days. LOL |
|
|
|
|
The "parallel path" is that if Intel's stuff really is making code that is 2-3x faster for SWAR than g++/gcc, that's a Problem the FSF compiler folks should care a great deal about.
I don't think the difference is as large as that. Simon posted these:
testWU-2 (AR: 0.4437317022742)
Windows 32-Bit
Default 5.15 with graphics
27m 37s (1657 seconds)
Crunch3r 5.12 SSE2
13m 04s (784 seconds) - 52.7% quicker
My 5.15 SSE2
13m 10s (790 seconds) - 52.6% quicker
However, note carefully that 5.15 was running with graphics. Using a -nographics command line argument would have reduced its time. Even so, since it is built with i386 code there would be a significant lag compared to builds using SSE2 optimizations. When someone makes a DevC++/MinGW SSE2 build without graphics there will be a way to compare. I'm making slow progress toward that, but it will take a couple more days to see if I reach the goal. Joe |
|
|
|
|
|
Simon (KWSN), I've been following your work with great interest, but I only have Athlon XPs (so, no SSE2) or higher... Any thoughts on doing SSE or 3DNow! optimized apps sometime? Or are those somehow not applicable? Thanks
____________

Kung Fu and Powerlifting |
|
|
|
|
|
Josef,
it should be noted that the tests on Linux were with a client that had no graphics and DID go faster. Hence the smaller speedup of standard vs. optimized on Linux - see my benchmark posts.
I had no graphics-less Windows version to test with, so that's why the tests used this version - which I got via BOINC, so it's the one everyone is using.
Also be aware that these times do not scale to all ARs. On some the speedup vs. unoptimized is close to 3x, so that's not entirely untrue. On average, I'd say it's about 30-50% faster on most WUs.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
Showing my age here... It could actually address 64K RAM native (though early machines came with a whopping 4K RAM, like the original Apple ][, not the ][+ or even the //e... LOL). The 6502 was later tricked into addressing 128K of RAM by "bank switching". It was originally a 1.00 MHz CPU, though later iterations got up to a whopping 3.6 MHz (TransWarp Accelerator).
The Apple //gs used to run a C-Shell and programming environment based on unix with a 2.8MHz 65816 (16-bit offspring of the 6502). It was called GNO, for Gno's Not Orca. It could address 8 MB RAM I believe. There was even one accessible via telnet at one point. It actually compiled and ran Mines of Moria (old text-based mapping adventure game) quite well. I solved it and killed the Balrog and thus survived to rule the Mines. But yes, number crunching of the type we do with BOINC would have been years-long ventures for a single WU, if limitations of cache and memory addressing were worked around.
Of course the 6502 can go to 64k, Without an OS loaded that is and the Atari 400/800 OS was about 12k, The last Atari XE could go to 128k of course, But Apple was the only company to use the 65816 cpu, Atari never thought It was worth bothering with.
;-) Just responding to the stated clock-speed and address-space. Different OSes (and langs) installed of course affected the _available_ address-space.
Th 65816 was actually arguably useful in its incarnation in the //gs - the team working on GS/OS was the first to develop a color Finder (the desktop) and countless niceties that later made their way back up the chain to be used in MacOS. What's more, since we're interested in better-optimized code, the engineering team had to do it with such a slow processor that they actually coded in assembly to hand-tweak the workings for maximum speed, since the compilers were nowhere near up to that kind of optimization. In fact they were so good at it and UI ideas, many of them were 'stolen' away from the //gs development team and made to work on Mac dev. Andy McFadden is one notable example.
Ah, the good ol' days. LOL
Yep, I watched and I wished Atari had made use of those cpus, The Tramiels wouldn't hear of It, They wanted the 68000 cpus instead and then made sure that If You wanted more ram, You had to buy a new Computer instead and then even though the cpu could address 4Mb ram, The custom MMU could only see 2Mb which was a design flaw and so only 2Mb could be addressed, No slots, Just a cheap as hell computer, All It needed was an internal battery, hdd, ethernet and/or dialup modem and It would have been a portable(Later named the Stacy and It had no ethernet), Stupid, stupid decisons. They were cheap guys.
Yep the good ol' days,
____________
My Farm/PCs Specs
 |
|
|
|
|
|
Josef (and others):
Benchmark posts
That one and the next one have some results you can use to compare. They're a bit tough to find in that long thread.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
I agree, Besides a 486 is old, Maybe not as old as a 6502 cpu, But It's old and I'd like to see someone try and run Linux on a 6502 cpu with 48k ram. I don't think crunching or what ever is even possible on something that old. :D ;)
There aren't that many differences (besides clock speed) from a 486 to an early Pentium, IIRC.
The cool stuff starts with the later K6's and PII's.
____________
|
|
|
|
|
I agree, Besides a 486 is old, Maybe not as old as a 6502 cpu, But It's old and I'd like to see someone try and run Linux on a 6502 cpu with 48k ram. I don't think crunching or what ever is even possible on something that old. :D ;)
There aren't that many differences (besides clock speed) from a 486 to an early Pentium, IIRC.
The cool stuff starts with the later K6's and PII's.
True, And I've seen them all. Soon I'll be 46 on July 6th, Party on the 1st though as members of My extended family can only be together all at once then. But I crunch on and on. The Uncle B's are in front of Me and I must pass them somehow.
____________
My Farm/PCs Specs
 |
|
|
|
|
EricVonDaniken,
to my knowledge he bought a license, and that's how :o)
Regards,
Simon.
A= can you verify that so we are sure of what thw truth is?
B= how much does a license cost?
License is found at Intel web site - I just purchased it, cost of about $600.00 USD for the compiler, distribution, and Primatives.
This thread may interest you. It contains a link to detailed instructions for Windows (complete with Screenshots) and a source code package you can use to get started.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
EricVonDaniken,
to my knowledge he bought a license, and that's how :o)
Regards,
Simon.
A= can you verify that so we are sure of what thw truth is?
B= how much does a license cost?
License is found at Intel web site - I just purchased it, cost of about $600.00 USD for the compiler, distribution, and Primatives.
This thread may interest you. It contains a link to detailed instructions for Windows (complete with Screenshots) and a source code package you can use to get started.
Regards,
Simon.
merci for this very interesting link
seti_britta
|
|
|
|
|
|
De rien!
So far, the source package hasn't been downloaded, but I'm reasonably sure that will change. The Linux source package was downloaded 29 times already, the SSE2-optimized Linux client 205 times.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
Hi, KWSN-CHicken of Angnor!
I followed your link to the instruction for windows to make my own compiled client. So far no problem, but when I unpacked your file, containing boinc and seti_boinc a password was needed for some files ???
So, are these protected files necessary or what is about this password?
Edit: It's ok now, it was just a lag of diskspace...
____________
|
|
|
|
|
|
Lol :o)
I was wondering...no, the file is not passworded, of course.
Good that you got it to unpack! I hope you can get it to work for you, the sources should compile if you install all the stuff.
I might have forgotten something with the MKL library - check the How-To again in a bit to see what I mean (it's the MKL FFTW wrapper library).
The same instructions as in the Linux version work, almost. It's just that you have to install or download parts of an SDK from Microsoft that has a program called "nmake" (free) and put them in your path somewhere, then go to "C:\\Program Files\\Intel\\MKL\\8.0.2\\examples\\fftw2mkl" and execute "nmake lib32" in a command window there. Then copy "_results\\ffwt2mkl.lib" to "C:\\Program Files\\Intel\\MKL\\8.0.2\\ia32\\lib\\".
Otherwise, it will probably give you a linker error (I'm reasonably sure I put that .lib file in the libraries it should include in the project file).
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
What is the current test state of the Windows optimized?
____________

Computers:
AMD 64 3500+ w/Win2k <dead>
AMD 1.4ghz w/Win98SE <dead>
AMD 1ghz Duron w/Win98SE <dead>
AMD 1ghz w/Win98SE <dead>
AMD 64 X2 4000+ w/Vista Home Premium |
|
|
|
|
|
Test state?
In my tests, it's been running mighty fine. The only problem (and what's keeping me from releasing) is the license.
Results are valid, some of my boxes are using my Windows build already. I just can't legally distribute things I compile until Intel allow people to use their compiler and library packages for free and non-commercial tools for Windows just like for Linux.
However, someone just bought a license, and I'm sincerely hoping he reads and follows my instructions so everyone can get an optimized Windows version.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
Mmkay,
I made a rather startling discovery, and it's starting to firm up.
I've alluded to this before - my first test system was a Virtual Machine (VMWare hosted on Windows) that ran Linux. I did this so I wouldn't have to dedicate a full HD or partition to it and it wouldn't affect my system otherwise.
Well, during testing, the times I got were out of this world (very, VERY quick). In later testing (all done outside a VM), I never got very close to them anymore. This, while annoying, made me think hard about what could be different - since stuff in a VM is using the same hardware as when you install an operating system normally, it should generally be around the same speed (except for graphics, which I disabled anyway).
It didn't seem to be that case, though.
So I bought a P-D 805 system last week which I'm using to benchmark on now because it has 4 operating systems (Windows/Linux 32/64 Bit) installed on it. The times it was pulling with my optimized builds were quite okay even though it was running single channel until today (finally got my second DDR2 module), but they were really nowhere close to what I expected given my first test runs on the VM.
So I thought, let's install a VM on this computer, as well. After that was done, I started crunching some WUs on it, and guess what: it's a lightning bolt.
You can compare the following 4 hosts - they're all one and the same computer with different operating systems:
Linux in a VM on Windows 32 Bit
Windows 32 Bit
Windows 64 Bit
Linux 32 Bit
There aren't a whole big lot of results to compare yet, but the 58.70/58.69 units are all VLARs which take the longest time of all.
Lin32 in a VM average for WLAR WU: ~10600s
Win32 average for VLAR WU: ~16600s
Win64 average for VLAR WU: ~16200s
Lin32 average for VLAR WU: ~17500s (no VLAR yet, _estimated_!)
So that's a HUGE difference there...I still have to benchmark Win32/64 with dual channel again as it should be significantly quicker as well, but not THAT much.
So compare my other host, which has been running dual channel all the time (Athlon64 3500+).
Lin32 in a VM - 8000-10000s/VLAR WU
Windows 32 Bit - ~13400s/VLAR WU
That is more than a little surprising, I must say.
Currently, it looks as if crunching in a VM gives extra speed. It's not logical, but that's how it is.
Average speedup of VM vs. native/same hardware:
!!!!!FIFTY PERCENT!!!!!!
Now THAT's something to write home about.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
Josef,
it should be noted that the tests on Linux were with a client that had no graphics and DID go faster. Hence the smaller speedup of standard vs. optimized on Linux - see my benchmark posts.
I had no graphics-less Windows version to test with, so that's why the tests used this version - which I got via BOINC, so it's the one everyone is using.
Certainly 5.15 needs to be the reference, I was only noting that the appearance of Intel builds being 2x to 3x faster than gcc builds is not a fair comparison.
Windows 5.15 can be run with a -nographics argument in standalone, and that gives another data point to consider. Unfortunately the -nographics argument causes a non-graphic build to quit with an error, which makes automating tests more complex.
One thing I discovered in the last two days is that running standalone with graphics but minimizing the graphics window as soon as it appears also reduces the run time to very near the -nographics case. What I don't know yet is whether running with BOINC but not showing graphics is similar to the minimized window situation.
Also be aware that these times do not scale to all ARs. On some the speedup vs. unoptimized is close to 3x, so that's not entirely untrue. On average, I'd say it's about 30-50% faster on most WUs.
Regards,
Simon.
I think that's a fair estimate. There may be lessons to be learned from the effects at different ARs, an analysis might give clues to where some algorithms can be improved. Joe |
|
|
ML1Volunteer tester Send message Joined: Nov 25 01 Posts: 4238 Credit: 1,387,611 RAC: 1,980

|
...Currently, it looks as if crunching in a VM gives extra speed. It's not logical, but that's how it is.
Average speedup of VM vs. native/same hardware:
!!!!!FIFTY PERCENT!!!!!!
Now THAT's something to write home about.
Very interesting indeed!
1: Is the speedup real (wall-clock time) or some fiction of time reporting?
2: Why? What's different in the VM world compared to just the native hardware??
Keep with the kool developments there! ;-)
Regards,
Martin
____________
Mandriva Linux A user friendly OS!
Optimised clients
Boinc HELP |
|
|
|
|
|
Ahh, -nographics :o)
I'll be using that and doing a new run of the same WUs with the default cruncher.
We will see what happens.
Thanks for the info, as always!
Simon
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
...Currently, it looks as if crunching in a VM gives extra speed. It's not logical, but that's how it is.
Average speedup of VM vs. native/same hardware:
!!!!!FIFTY PERCENT!!!!!!
Now THAT's something to write home about.
Very interesting indeed!
1: Is the speedup real (wall-clock time) or some fiction of time reporting?
2: Why? What's different in the VM world compared to just the native hardware??
Keep with the kool developments there! ;-)
Regards,
Martin
No, this is in *real* time. The clock in the VM is exactly the same as on the Host OS it's running on. Check the times recorded in the results :o)
As to the difference - THERE AIN'T ANY! Rather, it should be slower due to added overhead, if anything. That's why I'm so amazed. I just can't believe that Linux would work more quickly hosted on Windows. Results show that this is the case, however, on more than one host (though I'm still not positive it's correct and not due to some strange warp in space-time).
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
First KWSN, thanks for all your work and experiments :)
Lin32 in a VM average for WLAR WU: ~10600s
Win32 average for VLAR WU: ~16600s
Win64 average for VLAR WU: ~16200s
Lin32 average for VLAR WU: ~17500s (no VLAR yet, _estimated_!)
Average speedup of VM vs. native/same hardware:
!!!!!FIFTY PERCENT!!!!!!
Probably silly idea (I don't use/know VMWare), but if
win32 is faster than lin32
win64 is faster than win32
lin32 in a VM is faster than win64...
what about win64 in a win64 hosted VM ? *%)
____________
|
|
|
|
|
|
@KWSN - Chicken of Angnor:
I compiled the application, but I got a lot of warnings. The most are:
"../../db/schema_master.h(31): warning #187: use of "=" where "==" may have been intended
return !(db_is_open=0);"
But I got 1 catastrophic error (in project boinc_seti):
"analyzeFuncs.cpp
..\\analyzeFuncs.cpp(70): catastrophic error: could not open source file "ipp_w7.h"
#include <ipp_w7.h>"
I did all the changes exactly like in the instruction.
Is it possible, to discuss this via ICQ (My Nr.: 310857779)?
____________
|
|
|
|
|
|
That file is in c:\\program files\\intel\\ipp\\8.0.2\\ia32\\tools\\staticlib.
This path should be in the Include paths (check there and whether it's correct).
You can ignore the warnings, I get them too. They don't matter, really.
Good luck :o)
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
KWSN: Can you hit me again in ICQ, something got wrong. I saw the window, but I could not write to you...
____________
|
|
|
|
|
First KWSN, thanks for all your work and experiments :)
Lin32 in a VM average for WLAR WU: ~10600s
Win32 average for VLAR WU: ~16600s
Win64 average for VLAR WU: ~16200s
Lin32 average for VLAR WU: ~17500s (no VLAR yet, _estimated_!)
Average speedup of VM vs. native/same hardware:
!!!!!FIFTY PERCENT!!!!!!
Probably silly idea (I don't use/know VMWare), but if
win32 is faster than lin32
win64 is faster than win32
lin32 in a VM is faster than win64...
what about win64 in a win64 hosted VM ? *%)
Ah, it was too nice...couldn't believe it was true anyway, and it isn't.
I've since used a real stopwatch to see whether an hour in real time equals an hour in processing time. Well, it doesn't. Seems Virtual Machines live in some kind of time-dilation field. The system clock keeps going correctly, but the elapsed time goes like molasses - between 20 and 40 percent more slowly than normal. So I'd say that the apparent gains from running in a VM are purely in the numbers, which can lie too as can be seen.
Must be going near light speed to make time go more slowly...*grin*
In any case, guess I can not recommend crunching in a VM after all, as it seems to skew the results.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
TobyVolunteer tester
 Send message Joined: Oct 26 00 Posts: 1001 Credit: 2,538,273 RAC: 1,572

|
|
Do you have the vmware tools installed in the guest OS (linux)? I saw some truly strange timing issues in my vm hosts before I got the guest tools installed. Even after installing it, there can be some weird quirks but it does help out a lot.
I am also happy to report that my dual core is happily crunching along and its results are finally being validated. No errors so far. Looks like your careful testing has been successful.
____________
A member of The Knights Who Say NI!
For rankings, history graphs and more, check out:
My BOINC stats site |
|
|
|
|
|
Thanks Toby!
And yes, I guess the tools should help with timing issues there.
I still don't get why the system clock works correctly but the program counts time more slowly though, but I'll put that down to VM peculiarities.
I'll test more after installing the VMWare tools.
Ni!
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
TobyVolunteer tester
 Send message Joined: Oct 26 00 Posts: 1001 Credit: 2,538,273 RAC: 1,572

|
|
FYI the two dual core machines on my account are indeed different machines and are virtually identical except that one runs amd64 linux and one runs 32 bit windows. The linux one also has 2 GB of memory to the windows' 1 but other than that they both have the same motherboard and similar hardware all around. This might be useful for some comparisons as well... All the recent work units on the linux one are of course processed with your optimized app but if you go back a few pages you can compare how the two did with the stock app.
linux host (at work)
windows host (at home)
____________
A member of The Knights Who Say NI!
For rankings, history graphs and more, check out:
My BOINC stats site |
|
|
|
|
There aren't that many differences (besides clock speed) from a 486 to an early Pentium, IIRC.
The cool stuff starts with the later K6's and PII's.
True, And I've seen them all. Soon I'll be 46 on July 6th, Party on the 1st though as members of My extended family can only be together all at once then. But I crunch on and on. The Uncle B's are in front of Me and I must pass them somehow.
LOL, I hear ya - just turned 41 myself a few weeks ago. Fun to watch the evolution from no personal computers to hobby kits and on and on until here we are today, giving away our spare CPU cycles for projects like this.
____________

Kung Fu and Powerlifting |
|
|
|
|
|
KWSN- Chicken of Angnor wrote:
So I bought a P-D 805 system last week which I'm using to benchmark on now because it has 4 operating systems (Windows/Linux 32/64 Bit) installed on it. The times it was pulling with my optimized builds were quite okay even though it was running single channel until today (finally got my second DDR2 module), but they were really nowhere close to what I expected given my first test runs on the VM.
???
If you can afford to buy all this HW on a whim, why not just pay $600 for an Intel license?
____________
|
|
|
|
|
|
Who said it was all on a whim? Also, how is my budget anyone else's but my business? Why do you think I bought a system for bottom-dollar?
Eric, not to belittle you, but you keep asking me why not this why not that. So let me ask you: why not you? No technical ability required here. Just money.
I haven't asked for anyone to donate anything to me (and will not). Why not spend a further 600 dollars? Rent, for one. I'd like to be keep living where I am, thank you. So er...kindly refrain from comments like that, it's a bit much to expect me to cough it up just because everyone would feel so much happier if I did. Really.
Add to that the time I spent putting together easy-to-follow instructions for people, testing, benchmarking...
You couldn't pay me for it, most likely, if I started calculating how much someone would have to pay me to do what I did in this short time. I really cannot afford it, else I would have already bought it. Clear enough?
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
Who said it was all on a whim? Also, how is my budget anyone else's but my business? Why do you think I bought a system for bottom-dollar?
Eric, not to belittle you, but you keep asking me why not this why not that. So let me ask you: why not you? No technical ability required here. Just money.
I haven't asked for anyone to donate anything to me (and will not). Why not spend a further 600 dollars? Rent, for one. I'd like to be keep living where I am, thank you. So er...kindly refrain from comments like that, it's a bit much to expect me to cough it up just because everyone would feel so much happier if I did. Really.
Add to that the time I spent putting together easy-to-follow instructions for people, testing, benchmarking...
You couldn't pay me for it, most likely, if I started calculating how much someone would have to pay me to do what I did in this short time. I really cannot afford it, else I would have already bought it. Clear enough?
Regards,
Simon.
Very well put Simon. Thank you once again for all your hard work.
____________
Account frozen... |
|
|
|
|
|
Hi again. I've temporarily got a hold of VS .net 2003 :-)
I've come across the same problem as Bluesilvegreen. The ipp_7t.h file doesn't exist an my computer :(. i think it maybe because the only version i could find on intels website is ipp 5.1 whilst your seems to be 8.0.2 first time and versin 5.0 next. oh and i dont have a staticlib folder in tools. i have a stubliib in the main directory thought?
Will this cause me roblems. Not worried by all the warings as you say
Whilst mooching around i noticed the 'optimizaion' tab in c/c++ folder icon when you've rightclicked into properties. I sthis redundent since you've used the command line route.. I ask because the commands are slightly didderent forprocessor specific. QaxP for pentium 4 SSE3 maybe that might get me better performance in my settup?
Also is the MKL math stuff automatically used when you've included them as per your instructions. I ask because in preproccesor commands you added use_IPP. does it need use_MKL?
[EDIT]
Went through more thorought and it seems its all to do with trying to impliments SSE ect. You prbly knew that already. looking thruoght that part the code i can see i dont have any of those #include files except ipp.h:
// In order to use IPP, set -DUSE_IPP and one of -DUSE_SSE3, -DUSE_SSE2,
// -DUSE_SSE or nothing(generic), IPP precedes FFTW, ooura // TMR
#if defined(USE_IPP)
#pragma message ("-----IPP-----")
#if defined(USE_SSE3)
#define T7 1
#pragma message ("-----sse3-----")
#include <ipp_t7.h>
#elif defined(USE_SSE2)
#define W7 1
#pragma message ("-----sse2-----")
#include <ipp_w7.h>
#elif defined(USE_SSE)
#define A6 1
#pragma message ("-----sse-----")
#include <ipp_a6.h>
#else
#pragma message ("-----mmx-----")
#include <ipp_px.h>
#endif // T7
#include <ipp.h>
#elif defined(USE_FFTWF)
#pragma message ("----FFTW----")
#include "fftw3.h"
#else
#pragma message ("----ooura----")
#include "fft8g.h"
#endif // USE_IPP
these are the files i do have. Any chance one of them is the right one and i can edit the code to use it instead?:
ipp.h - ippac.h - ippalign.h - ippcc.h - ippch.h - ippcore.h - ippcv.h - ippdefs.h - ippi.h - ippj.h - ippm.h - ipps.h - ippsc.h - ippsr.h - ippvc.h - ippvm.h
Thanks very much again. Made this all extremely interesting. learnt more going thought your tutorial than did in the last week with a couple of old c++ and VB books |
|
|
|
|
|
Following up my observations:
Windows 5.15 can be run with a -nographics argument in standalone, and that gives another data point to consider. Unfortunately the -nographics argument causes a non-graphic build to quit with an error, which makes automating tests more complex.
One thing I discovered in the last two days is that running standalone with graphics but minimizing the graphics window as soon as it appears also reduces the run time to very near the -nographics case. What I don't know yet is whether running with BOINC but not showing graphics is similar to the minimized window situation.
Running the stock 5.15 with BOINC, I checked the amount of CPU time each of the setiathome threads was using. With graphics off, after an hour of run time the main worker thread had almost all the time, another thread about 2 seconds. Turning graphics on for the WU started accumulating time in another thread at about 5 to 10 percent CPU usage, turning them off again that thread stopped using any appreciable time.
My take is that a graphics build adds considerably less than 1% to crunch time if graphics are not turned on. Using the -nographics argument when doing standalone testing to compare against optimized builds seems justified. Joe |
|
|
|
|
|
Quick question Joe,
So starting the CC from the command line like:
path\\boinc.exe -allow_remote_gui_rpc -nographics
will kill the graphics loop for win boxes?
Alinator
PS: Talking about the stock build here. |
|
|
|
|
KWSN- Chicken of Angnor wrote:
So I bought a P-D 805 system last week which I'm using to benchmark on now because it has 4 operating systems (Windows/Linux 32/64 Bit) installed on it. The times it was pulling with my optimized builds were quite okay even though it was running single channel until today (finally got my second DDR2 module), but they were really nowhere close to what I expected given my first test runs on the VM.
???
If you can afford to buy all this HW on a whim, why not just pay $600 for an Intel license?
Erik,
Why should this matter - if you want it - you go buy it. It is his money, and he can do with it what ever he likes, this includes flushing it down the toilet, or spending it on ice cream and cake. He is in no way obligated to pay for anything for this project.
____________
|
|
|
|
|
Around $900 AFAIK, but I haven't checked closely. Sadly, the licenses are time-limited, I believe that's for one year and includes ICC, IPP and MKL (the latter two being library packages).
Intel pricing page
Also don't forget both Linux and Windows (and maybe even OS X, for X86-based Macs) are required, so that doubles/triples normal license costs.
--edit
Intel Compiler for Linux - $399
Intel Compiler for Windows - $399
Intel Compiler for OS X - $399
IPP - $199
MKL - $399 (maybe not necessary, but probably faster than fftw).
From the pricing page, it seems that all supported OS flavours are included when you buy one license for the library packages. The compiler has to be licensed once per OS.
There's a promotional MacOS package available that includes Compiler, IPP and MKL and costs $549. Since the libraries seem to license cross-OS, that might be the cheapest path overall.
Regards,
Simon.
I find it hard to believe that Crunch3r was paying $900 US every year just so he could make BOINC and s@h optimizations!
Erik,
Why do you find it so difficult to believe that Crunch3r was not paying this, it is not so hard to understand. Why do you think he got so upset and left the project ?
____________
|
|
|
|
|
|
Kevin,
I have some sort-of indecent questions - since you bought the licences, will you be releasing any clients to the public?
Or will you keep it Team-only (or just to yourself)? Please, I'm not trying to offend you, just want to know.
You buying the license was the reason I put in some extra hours for the Windows How-To - not that this obligates you in any way, obviously :o)
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
Hi again. I've temporarily got a hold of VS .net 2003 :-)
I've come across the same problem as Bluesilvegreen. The ipp_7t.h file doesn't exist an my computer :(. i think it maybe because the only version i could find on intels website is ipp 5.1 whilst your seems to be 8.0.2 first time and versin 5.0 next. oh and i dont have a staticlib folder in tools. i have a stubliib in the main directory thought?
Will this cause me roblems. Not worried by all the warings as you say
Whilst mooching around i noticed the 'optimizaion' tab in c/c++ folder icon when you've rightclicked into properties. I sthis redundent since you've used the command line route.. I ask because the commands are slightly didderent forprocessor specific. QaxP for pentium 4 SSE3 maybe that might get me better performance in my settup?
Also is the MKL math stuff automatically used when you've included them as per your instructions. I ask because in preproccesor commands you added use_IPP. does it need use_MKL?
[EDIT]
Went through more thorought and it seems its all to do with trying to impliments SSE ect. You prbly knew that already. looking thruoght that part the code i can see i dont have any of those #include files except ipp.h:
// In order to use IPP, set -DUSE_IPP and one of -DUSE_SSE3, -DUSE_SSE2,
// -DUSE_SSE or nothing(generic), IPP precedes FFTW, ooura // TMR
#if defined(USE_IPP)
#pragma message ("-----IPP-----")
#if defined(USE_SSE3)
#define T7 1
#pragma message ("-----sse3-----")
#include <ipp_t7.h>
#elif defined(USE_SSE2)
#define W7 1
#pragma message ("-----sse2-----")
#include <ipp_w7.h>
#elif defined(USE_SSE)
#define A6 1
#pragma message ("-----sse-----")
#include <ipp_a6.h>
#else
#pragma message ("-----mmx-----")
#include <ipp_px.h>
#endif // T7
#include <ipp.h>
#elif defined(USE_FFTWF)
#pragma message ("----FFTW----")
#include "fftw3.h"
#else
#pragma message ("----ooura----")
#include "fft8g.h"
#endif // USE_IPP
these are the files i do have. Any chance one of them is the right one and i can edit the code to use it instead?:
ipp.h - ippac.h - ippalign.h - ippcc.h - ippch.h - ippcore.h - ippcv.h - ippdefs.h - ippi.h - ippj.h - ippm.h - ipps.h - ippsc.h - ippsr.h - ippvc.h - ippvm.h
Thanks very much again. Made this all extremely interesting. learnt more going thought your tutorial than did in the last week with a couple of old c++ and VB books
Pepperammi, check the "How to make your own optimized Windows Seti@Home" thread for an answer, please :o)
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
H.Volunteer tester Send message Joined: Jun 26 06 Posts: 63 Credit: 1,192 RAC: 0
|
|
I have put both of Simons HOW-TO's up for sticky. They will stay here for at least a week. Or until the next moderator passes through and finds it too cluttering.
H. |
|
|
|
|
|
Thank you H.!
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
Like Josef stated, testing an optimized build without graphics vs. the standard with graphics is not quite fair. So - here are some revised numbers:
Default 5.15 with -nographics
9m 36s (576 seconds)
So that compares to 646 seconds before, and is a sizable difference of 70 seconds or 10.8% vs. with graphics.
So the revised speedup for my clients (on this WU, but it should hold true elsewhere) -
Crunch3r 5.12 SSE2
4m 19s (259 seconds) - 55.0% quicker
My 5.15 SSE2
4m 17s (257 seconds) - 55.38% quicker
I'd expect 5-10% less speedup in the scores posted before.
Regards,
Simon.
Stats for third WU:
testWU-4 (AR: 1.2796485198966)
Windows 32-Bit
Default 5.15 with graphics
10m 46s (646 seconds)
Crunch3r 5.12 SSE2
4m 19s (259 seconds) - 59.9% quicker
My 5.15 SSE2
4m 17s (257 seconds) - 60.0% quicker
Those 2 seconds difference are again well within standard result variance, so I'd call it a draw at this AR on Windows (between optimized clients), too.
Linux 32-Bit
Default 5.12 (no X-Win installed, so no graphics? not sure)
8m 00s (480 seconds)
Crunch3r 5.12 SSE2
5m 02s (302 seconds) - 37.1% quicker
My 5.15 SSE2
4m 45s (285 seconds) - 40.6% quicker
--------------------------------
Linux vs. Windows
Default client
Linux is 25.7% quicker.
Crunch3rs 5.12
Windows is 14.2% quicker.
My 5.15 builds
Windows is 9.8% quicker.
The same sort of scaling seems to apply, roughly.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
Great stuff KWSN! just finished first test unit and its strongly similar (great) and I tested it against crunch3r's 5.12 too;
Pentium D 830 3.21Ghz, Dual channel memory, Windows XP
Default Standard; 20:40 (1240seconds)
My SSE3 from your instructions; 9:23 (563secs) -54.6%
Crunch3r's 5.12 SSE2; 9:17 (557secs) -55%
Going to test a SSE2 compiled version too as sometimes I find them faster on this machine. Also going to try the other test units too for the different AR's.
Eventually i'll get round to having a go on my HT machine ;)
Also i'm going to turn off my blank screensaver for the next test because it maybe gave crunch3r's a very slight advantage when it blanked screen halfway through default and all of crunch3r's
[EDIT]
Sorry my numbers were with the default with graphics. |
|
|
|
|
|
Great results :)
I've heard from a couple of people who have successfully made their own Windows crunchers now. Their benchmark times - or rather, the relative speedup - are very close to what you're getting (and me as well).
So it seems as if it's reproducible across different hardware (and by different people ;o) ) - I had tested it on several installations here, but it's great to see the instructions work for you guys, too.
Thanks for all the supportive posts, a definite ego-booster there!
I'm still waiting to hear back from Kevin - since he bought the licenses, right now he seems one of the few people around here who could actually release a binary to the public.
Oh, and Pepperammi, when you repeat benchmark runs you will notice there is some slight variance in times, usually. So I would think that your client and Crunch3rs is very much on equal terms. Time will tell :o)
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
Great results :)
I've heard from a couple of people who have successfully made their own Windows crunchers now. Their benchmark times - or rather, the relative speedup - are very close to what you're getting (and me as well).
So it seems as if it's reproducible across different hardware (and by different people ;o) ) - I had tested it on several installations here, but it's great to see the instructions work for you guys, too.
Thanks for all the supportive posts, a definite ego-booster there!
I'm still waiting to hear back from Kevin - since he bought the licenses, right now he seems one of the few people around here who could actually release a binary to the public.
Oh, and Pepperammi, when you repeat benchmark runs you will notice there is some slight variance in times, usually. So I would think that your client and Crunch3rs is very much on equal terms. Time will tell :o)
Regards,
Simon.
Pretty soon It's going to be. Crunch3r? Who's that? ;) Your How to threads have been sticked. Cool. :D
____________
My Farm/PCs Specs
 |
|
|
|
|
Quick question Joe,
So starting the CC from the command line like:
path\\boinc.exe -allow_remote_gui_rpc -nographics
will kill the graphics loop for win boxes?
Alinator
PS: Talking about the stock build here.
Sorry I wasn't clear, the -nographics argument is only available when testing the setiathome app standalone (without BOINC):
path\\setiathome_5.15_windows_intelx86.exe -nographics
Joe |
|
|
|
|
Sorry I wasn't clear, the -nographics argument is only available when testing the setiathome app standalone (without BOINC):
path\\setiathome_5.15_windows_intelx86.exe -nographics[/pre]
To me that sounds like a usefull boinc feature: to be able to spawn the app with the -nographics option (or any other, maybe project-specific, options). Would such a feature give everybody who uses it (I personally never use graphics) a 5-10% speedup?
____________
|
|
|
|
|
Sorry I wasn't clear, the -nographics argument is only available when testing the setiathome app standalone (without BOINC):
path\\setiathome_5.15_windows_intelx86.exe -nographics[/pre]
To me that sounds like a usefull boinc feature: to be able to spawn the app with the -nographics option (or any other, maybe project-specific, options). Would such a feature give everybody who uses it (I personally never use graphics) a 5-10% speedup?
maybe its possible to code this into boinc or the app. Maybe not the option to choose but that when the graphics window is closed then graphics are dissabled an initialised when you open the graphics.
Not very good way of doing it but the simplest i can think of is reprogram boinc so it automatically loads tha seti app with '-nographics' and when yo press that show graphic button it just quickly stops the app and loads again without the '-nographics' and continues where it left off. And agin when you close the window it stops the app and restarts with '-nographics'. It may have problems with preemtied ect. I'l have a look. not saying i'll come up with something-i don't know enought but i find it interesting.
you'd have to consider the screensver part too though. Ensure it does the same there.
@ KWSN
Is there anyway to change how '1D FFTs' are handled? Is it possible (without getting extremely complicated)? Anyway to single/sort them out to work them differently? or unload them to a different part the app to be done elsewhere then when computing then bring the results back to continue as they would normally. Might come into trouble doing that because its using IPP now?
Hope i made sence there. Doesn't to me :)
I'll have some more complete benchmark numbers after a few more runs. I started again because remembered i was tinckering around when couldn't get it to work because of that critical error(thanks again) so i recompiled to see if i maybe missed a change i made. To sum up- doing it again with your exact instruction for SSE3 nocked about five more seconds off. Thanks for the info on repeat benchmarks. i spotted that. luckily it only fluctuates by about 2seconds when i does |
|
|
|
|
|
There any kind of software (free) to monitor work load on the gpu? I cant find one. sounds nuts but i've had a really basic go at including gpu use in the app using http://gamma.cs.unc.edu/GPUFFTW/documentation.html (got got bored of benchmarks ;-) ). Their insutruntions are so overly simple that i thought what the hell even if i dont believe it.
Anyway tried those instructions and compiled with USE_FFTWF so it'll probly be slower but i was just interested if it'd work. I can't tell... all i got at the mo is temp levels to go by and they do pop up 2-4c now and then...?
times i got where on test WU 3
12min01secs with this 'bodgit' app
11min28secs with SSE3 compiled
Not as huge difference as i thought would make as theres no SSE or IPP. Or i got it wrong |
|
|
Hans DornVolunteer developer Volunteer tester
 Send message Joined: Apr 3 99 Posts: 2236 Credit: 12,503,941 RAC: 2,003

|
There any kind of software (free) to monitor work load on the gpu? I cant find one. sounds nuts but i've had a really basic go at including gpu use in the app using http://gamma.cs.unc.edu/GPUFFTW/documentation.html (got got bored of benchmarks ;-) ). Their insutruntions are so overly simple that i thought what the hell even if i dont believe it.
Anyway tried those instructions and compiled with USE_FFTWF so it'll probly be slower but i was just interested if it'd work. I can't tell... all i got at the mo is temp levels to go by and they do pop up 2-4c now and then...?
times i got where on test WU 3
12min01secs with this 'bodgit' app
11min28secs with SSE3 compiled
Not as huge difference as i thought would make as theres no SSE or IPP. Or i got it wrong
Hi Pepperami,
GPUFFTW does not have the same API as fftw3f, so it won't work as a drop-in replacement.
Have you tried compiling the GPUFFTW example?
Regards Hans
____________
|
|
|
|
|
Hi Pepperami,
GPUFFTW does not have the same API as fftw3f, so it won't work as a drop-in replacement.
Have you tried compiling the GPUFFTW example?
Regards Hans
Hi Hans,
I guessed not to replace the fftw so i just added the gpufftw include as well. Like said very basic attempt (not very good) :). was wondering if the compiler would be able to make a little sence out of it.
surprised all that messing hasn't seriously affected the speed and its still valid.
I did have a quick look throught the example and it cetainly shows a lot more setup needed. It interesting to go through anyway. I need to learn a lot more.
Thank you.
|
|
|
|
|
|
Guys can someone link me to a seti_enchanced shortened reference unit?
The one you are using maybe? ;)
Thanks and regards.
____________
/Mav

We have lingered long enough on the shores of the cosmic ocean.
We are ready at last to set sail for the stars.
(Carl Sagan) |
|
|
|
|
Guys can someone link me to a seti_enchanced shortened reference unit?
The one you are using maybe? ;)
Thanks and regards.
theres five of them in KWSN's source download from the Windows How-To. Plus an invaluable benchmark script to compare the speed and another invauable tool to test your results againts the defaut reference unit.
About to finish fifth wu and ill have average times to put up.
|
|
|
|
|
|
Also, I put it them for download before for Josef Segur.
So here's the URL to just the WUs:
http://www.zadra.org/seti_enhanced/testWUs.zip
Cheers,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
Good news :o)
I'm in final testing of an SSE-optimized Linux cruncher. No numbers yet, but it seems to run fine across platforms. As soon as I can get some independent test and verification results, I'll put it up for download.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
Sorry I wasn't clear, the -nographics argument is only available when testing the setiathome app standalone (without BOINC):
path\\setiathome_5.15_windows_intelx86.exe -nographics[/pre]
To me that sounds like a usefull boinc feature: to be able to spawn the app with the -nographics option (or any other, maybe project-specific, options). Would such a feature give everybody who uses it (I personally never use graphics) a 5-10% speedup?
My testing indicates no change is needed. If you don't turn on the graphics in BOINC the graphics thread doesn't use enough CPU time to matter, probably much less than 1%.
For this project, there's also control of the graphics from the project preference settings. Included in that is a setting for the maximum percent of CPU time the graphics can use. The default is 50%, but it could be set to zero.
Joe |
|
|
|
|
|
Ok. So for someone who knows absolutely nothing about programming, is this something that I could do? Where would I find everything I need to download to begin doing this? I'm very interested in tryin this out.
Also, I know that you have test WU's, do you run your cache dry of SETI WU's, plug these in, disable network access? I'm pretty illiterate when it comes down to this stuff. But, I'm interested in learning. Thanks for all the work you've done Simon. :-)
Jeremy
____________
Account frozen... |
|
|
|
|
|
Hi KB7RZF,
I made two How-To's, one for Windows, one for Linux, that have both been stickied. They have links to the necessary stuff and information how to install and use it. Those instructions are geared towards people with little to no experience as much as I could make it so.
Linux How-To
Windows How-To
Hope that helped,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
Hi KB7RZF,
I made two How-To's, one for Windows, one for Linux, that have both been stickied. They have links to the necessary stuff and information how to install and use it. Those instructions are geared towards people with little to no experience as much as I could make it so.
Linux How-To
Windows How-To
Hope that helped,
Simon.
Thanks for those, forgot about um. :-) I have no clue where to find C++ 2003 at all. Where would I come across this?
____________
Account frozen... |
|
|
|
|
|
It's a commercial product. eBay or the like should have used version for sale pretty cheaply.
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
|
Oh yeah,
just saw this among the other hosts on one of my WUs -
Windows optimized S@H Enhanced application by Daniel Schaalma -SETI.USA
Developed for Team SETI.USA(www.setiusa.net) JOIN Team SETI.USA!!!
Special Thanks to Crunch3r, Hans Dorn, TMR, Harold Naparst, and Financier!
Version info: Windows Intel P4 SSE2 V5.15c by Daniel Schaalma -SETI.USA
Work Unit Info:
...............
WU true angle range is : 0.5692
Flopcounter: 12903131849229.531000
So I guess team Seti.Usa has already gotten their own optimized windows client. kevint, you haven't answered whether you want to release anything to the public. Judging by what I'm seeing, it doesn't look likely.
Also, if someone's already testing this, why do it as "anonymous"? Seems a bit fishy to me - also, why test it online? Grab a few reference WUs and do it offline, where problems don't affect the project as a whole.
All in all, I'm moderately surprised you a) didn't announce anything b) are doing it in this sort-of furtive way (forgive my choice of words, but that's what it seems like to me). Oh, and thanks for deciding for me whether I wanted to include a team ad into my clients...you managed to put it once on each line on average.
Anyway, props to Daniel for getting his build to work :o)
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
TobyVolunteer tester
 Send message Joined: Oct 26 00 Posts: 1001 Credit: 2,538,273 RAC: 1,572

|
|
6,000 seconds slower on a CPU that is only 200 MHz slower? At least it looks like he hasn't managed to beat your times yet :)
____________
A member of The Knights Who Say NI!
For rankings, history graphs and more, check out:
My BOINC stats site |
|
|
|
|
|
@ KWSN
Sorry. Another question. If you remeber i asked about threading. Anyway i had another thought on that. (you know all this already so might as well scip to the second paragraph) On a HT machine you can run two units -will take longer but because doing two they're both done quicker. I remember on the old SAH app when running optimised some people got more efficient crunching with HT off.
My question is would you get better results (time wise) from running a optimized app two units with HT on - or limit SAH to 1cpu and running a threaded optimised app? i know the compiler can only do so much to thread the app because it wasn't writen threaded.
Still currently testing the SSE3 app on my HT machine and i got my first result in (hybrid) and is valid. few other shorts. All valid and granted.
http://setiathome.berkeley.edu/result.php?resultid=345128248
http://setiathome.berkeley.edu/result.php?resultid=348661796
http://setiathome.berkeley.edu/result.php?resultid=348661792
http://setiathome.berkeley.edu/result.php?resultid=348661807
Need to rmember to add a personal bit to the version report.
|
|
|
|
|
|
Oi,
Pepperammi, you seem to be picking up a lot in a short time :o) In my experience, HT has been quicker than disabling it. Yes, you can crunch a WU in a shorter amount of time if you switch it off, say by 33% (estimate). So then you switch on HT, and you have two virtual CPUs crunching 2 results at the same time, each 33% slower than before.
So say the time taken without HT is 1 time unit. The time taken with HT is 1.33. Let's do 10 WUs. This will take 10 time units without HT, and 5x1.33 with HT, which is 6.65. So you see, even with HT taking longer per WU, you will crank out about 50% more results (33% less time taken).
Hope that helped :)
Oh, and right now (as this was set before) all apps are set to use multi-threaded libs in Windows - I'm doing some testing whether anything changes if I set it to use single-threaded stuff instead.
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
Also, if someone's already testing this, why do it as "anonymous"? Seems a bit fishy to me - also, why test it online? Grab a few reference WUs and do it offline, where problems don't affect the project as a whole.
All in all, I'm moderately surprised you a) didn't announce anything b) are doing it in this sort-of furtive way (forgive my choice of words, but that's what it seems like to me). Oh, and thanks for deciding for me whether I wanted to include a team ad into my clients...you managed to put it once on each line on average.
Anyway, props to Daniel for getting his build to work :o)
Regards,
Simon.
You are making some wayward assumptions there Simon!! First, you assume that it was a test when actually all of the testing took place using reference WUs and on the beta site as is proper. Your assumption that something was fishy and or furtive is also way off base since the WU you saw was from one of our members who has always elected to be anonymous. But by far the worse assumption you made was that it was "your client". The client was made from scratch using the nightly tarballs and Daniel's own programming knowledge with input from the people listed. As a matter of fact it was being developed at about the same time you were doing yours and your assumption that it was your work that was changed is false. The Team voted to keep this within the Team since it was funded by one of our members. So please in the future have your facts straight before making cloaked accusations.
____________

|
|
|
|
|
|
Sarge,
nowhere did I assume it was my client - you misunderstood me there.
And no, I don't assume it was my work.
When I make accusations, I don't cloak them. When I ask about things, I do so in the form of a question. If you understood my questions as an accusation, you inferred that meaning.
Also, I honestly salute Daniel for getting his build to work - there was no sarcasm included (nor is there now).
So no, that's not what I'm on about, and please don't try to infer I am :o)
Instead, seeing that message, I was already thinking about whether to include one line with a link where to get updates for the client and including my team name. That's what I'm talking about.
And again - when I assume something, I say "it seems", not "these are the facts" or something. So thanks for explaining - but please don't put words in my mouth.
Also, I had asked kevint to post whether he planned to release anything to the public or not. As it seems, that question has been answered (by a vote, as you say), but not by anyone from Seti.USA until right now. Again, doing things behind closed doors is something I cannot understand. Even if you wanted to keep it to yourself, why not even tell anyone? Seti.USA has always been most vociferous in other regards.
What I find worthy of accusation is this: Seti.USA caring only for Seti.USA, not for SETI. Of course, the vote was a team decision, but it just goes to strengthen that point. I'm willing to be persuaded my opinion here is wrong (in a reasonable discussion), but so far, I have been less than impressed.
I recognize it was funded by one of your members (seems it was kevint) and that you are in no way required to share. That's why you're Seti.USA and I'm a KWSN ;o)
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
nowhere did I assume it was my client - you misunderstood me there.
And no, I don't assume it was my work.
My apologies for apparently doing what I accused you of. Yes I did misunderstand that part of your statement.
However, comments like:
Seems a bit fishy to me
and
are doing it in this sort-of furtive way
Would have been better made to us in private rather than in this public forum. By making them publicly they became inflammatory and will be seen as accusatory by many people. So I elected to answer them as accusations.
____________

|
|
|
|
|
Oh, and right now (as this was set before) all apps are set to use multi-threaded libs in Windows - I'm doing some testing whether anything changes if I set it to use single-threaded stuff instead.
Regards,
Simon.
Thanks. I think you answered it more in your last paragraph. :)
I meant if HT-enabled and limit SAH to 1cpu it useto on the old app (for me) only use the 1 cpu and the other logical was free and report only using 50%cpu. Not sure if still the case, Havn't messed with it for a long time.
so you could have;
2WUs running two optimized apps together -one each physical/logical cpu and as you say they'l be done faster overall than with HT off running 1WU.
Or (?) 1WU HT-Enabled and SAH set to 1cpu but the app optimized-with-Threading and this i think should utilise both physical and logical cpu? I just wasn't sure if this setup might utilize the cpu and its threading abilities more intelligently/efficiently than two opti-apps fighting over it. I suppose this would only come into play (if at all) if boinc set affinity but windows probly does all this already.
I think you've already answered all this though because as you say its compiled with mutithreading already. :) sorry if i've wasted your time.
P.S. learning most of this from the forums and your how-to pages :) |
|
|
|
|
nowhere did I assume it was my client - you misunderstood me there.
And no, I don't assume it was my work.
My apologies for apparently doing what I accused you of. Yes I did misunderstand that part of your statement.
However, comments like:
Seems a bit fishy to me
and
are doing it in this sort-of furtive way
Would have been better made to us in private rather than in this public forum. By making them publicly they became inflammatory and will be seen as accusatory by many people. So I elected to answer them as accusations.
That's okay, maybe I used the wrong words. If so, I apologize.
Private forums are exactly what I'm trying to avoid here, like I said, I aim for maximum transparency.
Thanks for answering!
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
Oh, and right now (as this was set before) all apps are set to use multi-threaded libs in Windows - I'm doing some testing whether anything changes if I set it to use single-threaded stuff instead.
Regards,
Simon.
Thanks. I think you answered it more in your last paragraph. :)
I meant if HT-enabled and limit SAH to 1cpu it useto on the old app (for me) only use the 1 cpu and the other logical was free and report only using 50%cpu. Not sure if still the case, Havn't messed with it for a long time.
so you could have;
...
Pepperammi,
be aware that the applications using multi-threaded libs does not mean they are specially optimized to take advantage of multithreading, as well! That's a separate setting you can find in "C/C++ - Optimization - Parallelization" (scroll down all the way, it's the last). I haven't played with it much yet, so I'm very interested in your results.
As to it using 50% of available CPU - no matter whether it's a HT system or a real SMP system, on a 2-CPU machine 100% usage on one CPU shows as 50% overall. I'm not sure whether setting CPU affinity will change anything there. Optimizing for Parallelization may cause both logical/physical CPUs to share the load and overall system usage still be 50%, maybe. Check your CPU graphs in task manager when you do benchmarking :o)
Regards,
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|
Pepperammi,
be aware that the applications using multi-threaded libs does not mean they are specially optimized to take advantage of multithreading, as well! That's a separate setting you can find in "C/C++ - Optimization - Parallelization" (scroll down all the way, it's the last). I haven't played with it much yet, so I'm very interested in your results.
As to it using 50% of available CPU - no matter whether it's a HT system or a real SMP system, on a 2-CPU machine 100% usage on one CPU shows as 50% overall. I'm not sure whether setting CPU affinity will change anything there. Optimizing for Parallelization may cause both logical/physical CPUs to share the load and overall system usage still be 50%, maybe. Check your CPU graphs in task manager when you do benchmarking :o)
Regards,
Simon.
Ah. Thanks. I'm still just checking that the app is making valid results on the HT machine but yea I will try the different single-threaded and Parallelization options you mention. Will be interesting to see if any difference. I'll try the parallelization first if your already trying single-threaded. |
|
|
|
|
|
Hi Simon,
First, let me start out by offering my congratulations on releasing your Linux apps. I have not had the time to spend here reading these boards for a while. I first started working on an optimized app just after Crunch3r resigned from the Seti Community. My team has supplied me with a fully licensed version of ICC and IPP, to produce optimized apps for our team. It was decided that after the turmoil that led to Crunch3r's loss to our community, that we would build apps for team members only, to avoid a repeat of such events. I would like to reassure you that my apps were very thoroughly tested in "standalone" mode, and all results checked with TMR's "rescmp", and ALL completed results were graded "Strongly Similar". After the "standalone" testing phase, these apps were then thoroughly tested on Seti Beta. In that phase, there were NO INVALID results, except through installation errors by one or two testers. It was only after very thorough testing that the apps were released here for production. I would NEVER release an app for production without thorough testing.
I wish you the best of luck in your app development, Simon, and I just hope that by your releasing your apps to everyone, that you don't suffer the same misfortunes as Crunch3r. Take care, and again, great job on getting your builds released!
[edit]We also decided to remain silent about our apps, to attempt to avoid the flaming that will unfortunately, enevitably appear, for as long as physically possible. I am sorry if you took this to mean that we had some sort of nefarious plans, but that is not the case. We have been keeping our comments mostly among team members, in order that people on all sides can cool off for a while, and we can all get back to crunching.[/edit]
Regards, Daniel.
____________
   |
|
|
|
|
Around $900 AFAIK, but I haven't checked closely. Sadly, the licenses are time-limited, I believe that's for one year and includes ICC, IPP and MKL (the latter two being library packages).
Intel pricing page
Also don't forget both Linux and Windows (and maybe even OS X, for X86-based Macs) are required, so that doubles/triples normal license costs.
--edit
Intel Compiler for Linux - $399
Intel Compiler for Windows - $399
Intel Compiler for OS X - $399
IPP - $199
MKL - $399 (maybe not necessary, but probably faster than fftw).
From the pricing page, it seems that all supported OS flavours are included when you buy one license for the library packages. The compiler has to be licensed once per OS.
There's a promotional MacOS package available that includes Compiler, IPP and MKL and costs $549. Since the libraries seem to license cross-OS, that might be the cheapest path overall.
Regards,
Simon.
I find it hard to believe that Crunch3r was paying $900 US every year just so he could make BOINC and s@h optimizations!
Erik,
Why do you find it so difficult to believe that Crunch3r was not paying this, it is not so hard to understand. Why do you think he got so upset and left the project ?
ACTUALLY, I believe Crunch3r's copy was donated by another Seti user.
It has been awhile, but I do remember this from A LONG TIME AGO now, a user had a copy and was not using it.
____________
 |
|
|
|
|
|
Thanks for your answer, Daniel!
I was aware that you were already working on an optimized build when I started doing my stuff, so that's why I was sure you had gotten there on your own.
Thanks for explaining the testing regimen you put your clients through, I commend you for taking extra care.
I do have to disagree on your decision to cut down on the flaming by keeping things private. I do believe that (even had I not noticed it) this will cause more turmoil in the long run exactly because you didn't tell anyone beforehand.
Also best of luck in your further app development :o) I'm pretty sure I won't let things like that happen - hence my policy of total transparency. That's really the best way to stop rumours - never give anyone anything to wonder about.
Just my 2 cents...
Simon.
____________
Donate to SETI@Home via PayPal!
Optimized SETI@Home apps + Information |
|
|
|
|