64-bit App Build Windows XP x64

Message boards : Number crunching : 64-bit App Build Windows XP x64

kevint
Volunteer tester

Joined: 17 May 99
Posts: 414
Credit: 11,680,240
RAC: 0
United States
Message 420009 - Posted: 13 Sep 2006, 19:04:00 UTC - in response to Message 419904.  

Here's the first unit crunched. One other unit is reporting, but it was an invalid WU made by a Pentium M (1.86GHz). It still has not been validated yet, but at least it has not errored out.
http://setiathome.berkeley.edu/result.php?resultid=378760482

<stderr_txt>
Optimized Windows SETI@Home Enhanced application
Version info: Windows EMT64 64-bit V5.15 Test 1
Modified & Compiled by Bob Delkhoon (DeNitro)
Source downloaded from Simon Zadra (KWSN - Chicken of Angnor)

CPU real speed: 5920 MHz

Work Unit Info
True angle range: 0.426464

Flopcounter: 16093243232437.539000

Spike count: 1
Pulse count: 0
Triplet count: 4
Gaussian count: 2
</stderr_txt>



This does not look like it is being tested on the beta site. Is this a production release already? Or should it be fully tested on Beta before it is allowed into production?


ID: 420009
KWSN - Chicken of Angnor
Volunteer developer
Volunteer tester
Joined: 9 Jul 99
Posts: 1199
Credit: 6,615,780
RAC: 0
Austria
Message 420063 - Posted: 13 Sep 2006, 22:14:18 UTC - in response to Message 419661.  
Last modified: 13 Sep 2006, 22:39:00 UTC

Great to hear it's working, and very sorry about that mix-up. Again, I'm not sure what the speed difference will be. On my machine, I don't see a significant difference over Simon's (awesome) optimized app that this was based on, but hopefully it's a start.

Simon, sure, I'd like to work together. There's some more I want to change in the code base to make it fully cross-compatible between 32-bit and 64-bit builds. There are also still a few warnings I want to look into. This was just a rough edit to get valid results and check initial performance, since I'm kind of time-limited with school at the moment. As long as it's not any slower than the 32-bit apps, I hope it's an OK start.

Again, PLEASE remember, this is just a test. Make sure you do a test run first on your platform with test work units before you try it on the actual BOINC app!!!

Thanks,
--Bob Delkhoon (DeNitro)

Great :o)

I've given you access to the pre-release and development boards on lunatics.at, check your personal messages there for more information.

Running a benchmark (offline) of your version vs. mine currently, will let you know the results. The second build you posted now runs fine on my system.

Regards,
Simon.

<edit>
Run results indicate that your app is quicker on some WUs and the same speed on others as my 32-bit ones. Great work! All my test WUs had valid results.
</edit>
Donate to SETI@Home via PayPal!

Optimized SETI@Home apps + Information
ID: 420063
Benher
Volunteer developer
Volunteer tester
Joined: 25 Jul 99
Posts: 517
Credit: 465,152
RAC: 0
United States
Message 420234 - Posted: 14 Sep 2006, 7:00:10 UTC - in response to Message 419914.  
Last modified: 14 Sep 2006, 7:00:39 UTC

Benher mentioned that the code is not taking good advantage of all the registers available in the IA32 or x86-64 architecture. Fixing this might require non-trivial understanding of the source.

I think when Ben originally said this, he wasn't implying that it was a problem to be fixed. :) A lot of SETI code is simple loopy code, and there are arguments against doing a ton of loop unrolling on Core2, like a 64-byte loop buffer and the increase in code size that you've taken by moving to x86-64.


Hey Alex,

Can you think of a function in SETI that might benefit from 16 x 4 SIMD registers? Would it keep the register pressure down somewhere? Your most complex SIMD use in SETI seems to be your SSE3 chirp, but the compiler seemed to use only registers and not store values in temporary memory when I looked at the assembly output.

=Ben
ID: 420234
Sutehk
Volunteer tester
Joined: 11 Jun 99
Posts: 42
Credit: 1,443,674
RAC: 0
United States
Message 420309 - Posted: 14 Sep 2006, 13:19:10 UTC

Thought you guys might like to know: I'm not sure if it's my computer or the app, but I have gotten a few "client errors".

stderr out is as follows:

<core_client_version>5.4.11</core_client_version>
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>

Check it out here: http://setiathome.berkeley.edu/result.php?resultid=379101082
ID: 420309
Pepperammi
Joined: 3 Apr 99
Posts: 200
Credit: 737,775
RAC: 0
United Kingdom
Message 420411 - Posted: 14 Sep 2006, 16:18:47 UTC - in response to Message 420309.  

Hi. I haven't been back to SAH for a bit, and now I find this :).

I've grabbed the app and have started full testing for validity and run time with Kna's nice benchmark tool. I've used the Imagecfg tool to limit it to one core, as I have been doing for the other testing.

I'll just let it run and let you know how it goes.

Nice Work :D
ID: 420411
Pepperammi
Joined: 3 Apr 99
Posts: 200
Credit: 737,775
RAC: 0
United Kingdom
Message 420579 - Posted: 14 Sep 2006, 20:45:35 UTC

Here's a full copy of the test run results. Included in the run are KWSN's sse3-1.3-p4 app and Ben's test sse3-xP-ipo-ben-B, as they're currently the fastest on this machine. This gives something to compare times against, as well as the default 5.15 app.

All results were strongly similar.

Thank you Bob (DeNitro)

============================================================================
============
default-515.exe -nographics / testWU-1.wu :
Started at : 17:14:21
Ended at : 17:26:33
Elapsed time: 732 seconds
CPU time : 695 secs - Flops: 0 (rev: 0.00)

Stderr.txt :
Can't set up shared mem: -1
ar=0.604884 NumCfft=3005 NumGauss= 16222892 NumPulse= 3112845952 NumTriplet= 405094842368
------------
5.15-DeNitro-emt64.exe / testWU-1.wu :
Started at : 17:26:33
Ended at : 17:34:26
Elapsed time: 473 seconds
CPU time : 401 secs - Flops: 282842 (rev: 0.00)

Result : strongly similar.
Stderr.txt :
Can't set up shared mem: -1
Optimized Windows SETI@Home Enhanced application
Version info: Windows EMT64 64-bit V5.15 Test 1
Modified & Compiled by Bob Delkhoon (DeNitro)
Source downloaded from Simon Zadra (KWSN - Chicken of Angnor)

CPU real speed: 3209 MHz

Work Unit Info
True angle range: 0.604884

Flopcounter: 636752282842.789430

Spike count: 0
Pulse count: 0
Triplet count: 0
Gaussian count: 0
------------
5.15-kwsn-sse3-p4.exe / testWU-1.wu :
Started at : 17:34:26
Ended at : 17:41:39
Elapsed time: 432 seconds
CPU time : 414 secs - Flops: 331767 (rev: 1.30)

Result : strongly similar.
Stderr.txt :
Can't set up shared mem: -1
Optimized Windows SETI@Home Enhanced application
Version info: Windows P4 SSE3 32-bit V5.15 'Chicken Good!' (R-1.3|+freq|xP+)
Compiled by Simon Zadra (KWSN - Chicken of Angnor) - Member of the Knights who say Ni! (http://www.kwsn.net)
Download Updates at: http://www.zadra.org/seti_enhanced/

CPU real speed: 3210 MHz

Work Unit Info
True angle range: 0.604884

Flopcounter: 636752331767.789060

Spike count: 0
Pulse count: 0
Triplet count: 0
Gaussian count: 0
------------
sse3-xP-ipo-ben-B.exe / testWU-1.wu :
Started at : 17:41:39
Ended at : 17:49:35
Elapsed time: 476 seconds
CPU time : 415 secs - Flops: 276161 (rev: 1.50)

Result : strongly similar.
Stderr.txt :
Can't set up shared mem: -1
Optimized Windows SETI@Home Enhanced application
Version info: Windows SSE3 32-bit V5.15 'Chicken Good!' (R-1.5|+freq|xP [Ben Opt])
Compiled by Simon Zadra (KWSN - Chicken of Angnor) - Member of the Knights who say Ni! (http://www.kwsn.net)
Download Updates at: http://www.zadra.org/seti_enhanced/

CPUID: 'Intel PD Pentium D (Smithfield)'
cpus: 1 cores: 2 threads: 2 cache: L1=16K L2=1024K L3=0K
features: mmx sse sse2 sse3
speed: 3209 MHz

Work Unit Info
True angle range: 0.604884

Flopcounter: 636727276161

Spike count: 0
Pulse count: 0
Triplet count: 0
Gaussian count: 0
------------
default-515.exe -nographics / testWU-2.wu :
Started at : 17:49:35
Ended at : 18:03:49
Elapsed time: 854 seconds
CPU time : 835 secs - Flops: 0 (rev: 0.00)

Stderr.txt :
Can't set up shared mem: -1
ar=0.443732 NumCfft=3539 NumGauss= 22080918 NumPulse= 4202890368 NumTriplet= 457641934848
------------
5.15-DeNitro-emt64.exe / testWU-2.wu :
Started at : 18:03:49
Ended at : 18:12:14
Elapsed time: 505 seconds
CPU time : 473 secs - Flops: 721325 (rev: 0.00)

Result : strongly similar.
Stderr.txt :
Can't set up shared mem: -1
Optimized Windows SETI@Home Enhanced application
Version info: Windows EMT64 64-bit V5.15 Test 1
Modified & Compiled by Bob Delkhoon (DeNitro)
Source downloaded from Simon Zadra (KWSN - Chicken of Angnor)

CPU real speed: 3209 MHz

Work Unit Info
True angle range: 0.443732

Flopcounter: 785466721325.214600

Spike count: 0
Pulse count: 0
Triplet count: 0
Gaussian count: 0
------------
5.15-kwsn-sse3-p4.exe / testWU-2.wu :
Started at : 18:12:14
Ended at : 18:20:30
Elapsed time: 496 seconds
CPU time : 472 secs - Flops: 635430 (rev: 1.30)

Result : strongly similar.
Stderr.txt :
Can't set up shared mem: -1
Optimized Windows SETI@Home Enhanced application
Version info: Windows P4 SSE3 32-bit V5.15 'Chicken Good!' (R-1.3|+freq|xP+)
Compiled by Simon Zadra (KWSN - Chicken of Angnor) - Member of the Knights who say Ni! (http://www.kwsn.net)
Download Updates at: http://www.zadra.org/seti_enhanced/

CPU real speed: 3210 MHz

Work Unit Info
True angle range: 0.443732

Flopcounter: 785466635430.214600

Spike count: 0
Pulse count: 0
Triplet count: 0
Gaussian count: 0
------------
sse3-xP-ipo-ben-B.exe / testWU-2.wu :
Started at : 18:20:31
Ended at : 18:29:06
Elapsed time: 515 seconds
CPU time : 470 secs - Flops: 321748 (rev: 1.50)

Result : strongly similar.
Stderr.txt :
Can't set up shared mem: -1
Optimized Windows SETI@Home Enhanced application
Version info: Windows SSE3 32-bit V5.15 'Chicken Good!' (R-1.5|+freq|xP [Ben Opt])
Compiled by Simon Zadra (KWSN - Chicken of Angnor) - Member of the Knights who say Ni! (http://www.kwsn.net)
Download Updates at: http://www.zadra.org/seti_enhanced/

CPUID: 'Intel PD Pentium D (Smithfield)'
cpus: 1 cores: 2 threads: 2 cache: L1=16K L2=1024K L3=0K
features: mmx sse sse2 sse3
speed: 3209 MHz

Work Unit Info
True angle range: 0.443732

Flopcounter: 785445321748

Spike count: 0
Pulse count: 0
Triplet count: 0
Gaussian count: 0
------------
default-515.exe -nographics / testWU-3.wu :
Started at : 18:29:07
Ended at : 18:45:06
Elapsed time: 958 seconds
CPU time : 871 secs - Flops: 0 (rev: 0.00)

Stderr.txt :
Can't set up shared mem: -1
ar=0.425877 NumCfft=3631 NumGauss= 23035130 NumPulse= 4453686656 NumTriplet= 526987509760
------------
5.15-DeNitro-emt64.exe / testWU-3.wu :
Started at : 18:45:06
Ended at : 18:54:11
Elapsed time: 545 seconds
CPU time : 528 secs - Flops: 224326 (rev: 0.00)

Result : strongly similar.
Stderr.txt :
Can't set up shared mem: -1
Optimized Windows SETI@Home Enhanced application
Version info: Windows EMT64 64-bit V5.15 Test 1
Modified & Compiled by Bob Delkhoon (DeNitro)
Source downloaded from Simon Zadra (KWSN - Chicken of Angnor)

CPU real speed: 3209 MHz

Work Unit Info
True angle range: 0.425877

Flopcounter: 817199224326.900760

Spike count: 0
Pulse count: 0
Triplet count: 1
Gaussian count: 0
------------
5.15-kwsn-sse3-p4.exe / testWU-3.wu :
Started at : 18:54:12
Ended at : 19:03:01
Elapsed time: 529 seconds
CPU time : 473 secs - Flops: 347850 (rev: 1.30)

Result : strongly similar.
Stderr.txt :
Can't set up shared mem: -1
Optimized Windows SETI@Home Enhanced application
Version info: Windows P4 SSE3 32-bit V5.15 'Chicken Good!' (R-1.3|+freq|xP+)
Compiled by Simon Zadra (KWSN - Chicken of Angnor) - Member of the Knights who say Ni! (http://www.kwsn.net)
Download Updates at: http://www.zadra.org/seti_enhanced/

CPU real speed: 3210 MHz

Work Unit Info
True angle range: 0.425877

Flopcounter: 817199347850.900760

Spike count: 0
Pulse count: 0
Triplet count: 1
Gaussian count: 0
------------
sse3-xP-ipo-ben-B.exe / testWU-3.wu :
Started at : 19:03:01
Ended at : 19:12:08
Elapsed time: 546 seconds
CPU time : 534 secs - Flops: 11798 (rev: 1.50)

Result : strongly similar.
Stderr.txt :
Can't set up shared mem: -1
Optimized Windows SETI@Home Enhanced application
Version info: Windows SSE3 32-bit V5.15 'Chicken Good!' (R-1.5|+freq|xP [Ben Opt])
Compiled by Simon Zadra (KWSN - Chicken of Angnor) - Member of the Knights who say Ni! (http://www.kwsn.net)
Download Updates at: http://www.zadra.org/seti_enhanced/

CPUID: 'Intel PD Pentium D (Smithfield)'
cpus: 1 cores: 2 threads: 2 cache: L1=16K L2=1024K L3=0K
features: mmx sse sse2 sse3
speed: 3209 MHz

Work Unit Info
True angle range: 0.425877

Flopcounter: 817183011798

Spike count: 0
Pulse count: 0
Triplet count: 1
Gaussian count: 0
------------
default-515.exe -nographics / testWU-4.wu :
Started at : 19:12:08
Ended at : 19:15:41
Elapsed time: 213 seconds
CPU time : 172 secs - Flops: 0 (rev: 0.00)

Stderr.txt :
Can't set up shared mem: -1
ar=1.279649 NumCfft=1663 NumGauss= 0 NumPulse= 843636224 NumTriplet= 301563183104
------------
5.15-DeNitro-emt64.exe / testWU-4.wu :
Started at : 19:15:41
Ended at : 19:17:38
Elapsed time: 117 seconds
CPU time : 59 secs - Flops: 680427 (rev: 0.00)

Result : strongly similar.
Stderr.txt :
Can't set up shared mem: -1
Optimized Windows SETI@Home Enhanced application
Version info: Windows EMT64 64-bit V5.15 Test 1
Modified & Compiled by Bob Delkhoon (DeNitro)
Source downloaded from Simon Zadra (KWSN - Chicken of Angnor)

CPU real speed: 3209 MHz

Work Unit Info
True angle range: 1.279649

Flopcounter: 243253680427.522030

Spike count: 0
Pulse count: 0
Triplet count: 0
Gaussian count: 0
------------
5.15-kwsn-sse3-p4.exe / testWU-4.wu :
Started at : 19:17:38
Ended at : 19:19:37
Elapsed time: 119 seconds
CPU time : 59 secs - Flops: 680437 (rev: 1.30)

Result : strongly similar.
Stderr.txt :
Can't set up shared mem: -1
Optimized Windows SETI@Home Enhanced application
Version info: Windows P4 SSE3 32-bit V5.15 'Chicken Good!' (R-1.3|+freq|xP+)
Compiled by Simon Zadra (KWSN - Chicken of Angnor) - Member of the Knights who say Ni! (http://www.kwsn.net)
Download Updates at: http://www.zadra.org/seti_enhanced/

CPU real speed: 3210 MHz

Work Unit Info
True angle range: 1.279649

Flopcounter: 243253680437.522030

Spike count: 0
Pulse count: 0
Triplet count: 0
Gaussian count: 0
------------
sse3-xP-ipo-ben-B.exe / testWU-4.wu :
Started at : 19:19:38
Ended at : 19:22:22
Elapsed time: 164 seconds
CPU time : 114 secs - Flops: 770583 (rev: 1.50)

Result : strongly similar.
Stderr.txt :
Can't set up shared mem: -1
Optimized Windows SETI@Home Enhanced application
Version info: Windows SSE3 32-bit V5.15 'Chicken Good!' (R-1.5|+freq|xP [Ben Opt])
Compiled by Simon Zadra (KWSN - Chicken of Angnor) - Member of the Knights who say Ni! (http://www.kwsn.net)
Download Updates at: http://www.zadra.org/seti_enhanced/

CPUID: 'Intel PD Pentium D (Smithfield)'
cpus: 1 cores: 2 threads: 2 cache: L1=16K L2=1024K L3=0K
features: mmx sse sse2 sse3
speed: 3209 MHz

Work Unit Info
True angle range: 1.279649

Flopcounter: 243250770583

Spike count: 0
Pulse count: 0
Triplet count: 0
Gaussian count: 0
------------
default-515.exe -nographics / testWU-5.wu :
Started at : 19:22:22
Ended at : 19:37:05
Elapsed time: 883 seconds
CPU time : 816 secs - Flops: 0 (rev: 0.00)

Stderr.txt :
Can't set up shared mem: -1
ar=0.439957 NumCfft=3557 NumGauss= 22302084 NumPulse= 4217568640 NumTriplet= 457910337536
------------
5.15-DeNitro-emt64.exe / testWU-5.wu :
Started at : 19:37:06
Ended at : 19:45:35
Elapsed time: 509 seconds
CPU time : 474 secs - Flops: 794676 (rev: 0.00)

Result : strongly similar.
Stderr.txt :
Can't set up shared mem: -1
Optimized Windows SETI@Home Enhanced application
Version info: Windows EMT64 64-bit V5.15 Test 1
Modified & Compiled by Bob Delkhoon (DeNitro)
Source downloaded from Simon Zadra (KWSN - Chicken of Angnor)

CPU real speed: 3209 MHz

Work Unit Info
True angle range: 0.439957

Flopcounter: 790412794676.046260

Spike count: 0
Pulse count: 0
Triplet count: 1
Gaussian count: 0
------------
5.15-kwsn-sse3-p4.exe / testWU-5.wu :
Started at : 19:45:35
Ended at : 19:53:52
Elapsed time: 497 seconds
CPU time : 475 secs - Flops: 810695 (rev: 1.30)

Result : strongly similar.
Stderr.txt :
Can't set up shared mem: -1
Optimized Windows SETI@Home Enhanced application
Version info: Windows P4 SSE3 32-bit V5.15 'Chicken Good!' (R-1.3|+freq|xP+)
Compiled by Simon Zadra (KWSN - Chicken of Angnor) - Member of the Knights who say Ni! (http://www.kwsn.net)
Download Updates at: http://www.zadra.org/seti_enhanced/

CPU real speed: 3209 MHz

Work Unit Info
True angle range: 0.439957

Flopcounter: 790412810695.046260

Spike count: 0
Pulse count: 0
Triplet count: 1
Gaussian count: 0
------------
sse3-xP-ipo-ben-B.exe / testWU-5.wu :
Started at : 19:53:52
Ended at : 20:02:29
Elapsed time: 517 seconds
CPU time : 472 secs - Flops: 291752 (rev: 1.50)

Result : strongly similar.
Stderr.txt :
Can't set up shared mem: -1
Optimized Windows SETI@Home Enhanced application
Version info: Windows SSE3 32-bit V5.15 'Chicken Good!' (R-1.5|+freq|xP [Ben Opt])
Compiled by Simon Zadra (KWSN - Chicken of Angnor) - Member of the Knights who say Ni! (http://www.kwsn.net)
Download Updates at: http://www.zadra.org/seti_enhanced/

CPUID: 'Intel PD Pentium D (Smithfield)'
cpus: 1 cores: 2 threads: 2 cache: L1=16K L2=1024K L3=0K
features: mmx sse sse2 sse3
speed: 3209 MHz

Work Unit Info
True angle range: 0.439957

Flopcounter: 790391291752

Spike count: 0
Pulse count: 0
Triplet count: 1
Gaussian count: 0
------------
default-515.exe -nographics / testWU-6.wu :
Started at : 20:02:30
Ended at : 20:25:59
Elapsed time: 1409 seconds
CPU time : 1350 secs - Flops: 0 (rev: 0.00)

Stderr.txt :
Can't set up shared mem: -1
ar=0.033858 NumCfft=2557 NumGauss= 0 NumPulse= 8272970368 NumTriplet= 542472847360
------------
5.15-DeNitro-emt64.exe / testWU-6.wu :
Started at : 20:25:59
Ended at : 20:35:12
Elapsed time: 553 seconds
CPU time : 531 secs - Flops: 295541 (rev: 0.00)

Result : strongly similar.
Stderr.txt :
Can't set up shared mem: -1
Optimized Windows SETI@Home Enhanced application
Version info: Windows EMT64 64-bit V5.15 Test 1
Modified & Compiled by Bob Delkhoon (DeNitro)
Source downloaded from Simon Zadra (KWSN - Chicken of Angnor)

CPU real speed: 3209 MHz

Work Unit Info
True angle range: 0.033858

Flopcounter: 745955295541.180420

Spike count: 0
Pulse count: 0
Triplet count: 0
Gaussian count: 0
------------
5.15-kwsn-sse3-p4.exe / testWU-6.wu :
Started at : 20:35:13
Ended at : 20:44:33
Elapsed time: 560 seconds
CPU time : 535 secs - Flops: 295541 (rev: 1.30)

Result : strongly similar.
Stderr.txt :
Can't set up shared mem: -1
Optimized Windows SETI@Home Enhanced application
Version info: Windows P4 SSE3 32-bit V5.15 'Chicken Good!' (R-1.3|+freq|xP+)
Compiled by Simon Zadra (KWSN - Chicken of Angnor) - Member of the Knights who say Ni! (http://www.kwsn.net)
Download Updates at: http://www.zadra.org/seti_enhanced/

CPU real speed: 3209 MHz

Work Unit Info
True angle range: 0.033858

Flopcounter: 745955295541.180420

Spike count: 0
Pulse count: 0
Triplet count: 0
Gaussian count: 0
------------
sse3-xP-ipo-ben-B.exe / testWU-6.wu :
Started at : 20:44:33
Ended at : 20:53:05
Elapsed time: 512 seconds
CPU time : 476 secs - Flops: 397150 (rev: 1.50)

Result : strongly similar.
Stderr.txt :
Can't set up shared mem: -1
Optimized Windows SETI@Home Enhanced application
Version info: Windows SSE3 32-bit V5.15 'Chicken Good!' (R-1.5|+freq|xP [Ben Opt])
Compiled by Simon Zadra (KWSN - Chicken of Angnor) - Member of the Knights who say Ni! (http://www.kwsn.net)
Download Updates at: http://www.zadra.org/seti_enhanced/

CPUID: 'Intel PD Pentium D (Smithfield)'
cpus: 1 cores: 2 threads: 2 cache: L1=16K L2=1024K L3=0K
features: mmx sse sse2 sse3
speed: 3209 MHz

Work Unit Info
True angle range: 0.033858

Flopcounter: 745950397150

Spike count: 0
Pulse count: 0
Triplet count: 0
Gaussian count: 0
------------
default-515.exe -nographics / testWU-7.wu :
Started at : 20:53:05
Ended at : 21:02:55
Elapsed time: 590 seconds
CPU time : 533 secs - Flops: 0 (rev: 0.00)

Stderr.txt :
Can't set up shared mem: -1
ar=0.775000 NumCfft=1169 NumGauss= 6087368 NumPulse= 1224262784 NumTriplet= 304384294912
------------
5.15-DeNitro-emt64.exe / testWU-7.wu :
Started at : 21:02:55
Ended at : 21:08:25
Elapsed time: 330 seconds
CPU time : 295 secs - Flops: 68628 (rev: 0.00)

Result : strongly similar.
Stderr.txt :
Can't set up shared mem: -1
Optimized Windows SETI@Home Enhanced application
Version info: Windows EMT64 64-bit V5.15 Test 1
Modified & Compiled by Bob Delkhoon (DeNitro)
Source downloaded from Simon Zadra (KWSN - Chicken of Angnor)

CPU real speed: 3209 MHz

Work Unit Info
True angle range: 0.775000

Flopcounter: 266377068628.250240

Spike count: 7
Pulse count: 5
Triplet count: 5
Gaussian count: 6
------------
5.15-kwsn-sse3-p4.exe / testWU-7.wu :
Started at : 21:08:25
Ended at : 21:13:21
Elapsed time: 296 seconds
CPU time : 238 secs - Flops: 114356 (rev: 1.30)

Result : strongly similar.
Stderr.txt :
Can't set up shared mem: -1
Optimized Windows SETI@Home Enhanced application
Version info: Windows P4 SSE3 32-bit V5.15 'Chicken Good!' (R-1.3|+freq|xP+)
Compiled by Simon Zadra (KWSN - Chicken of Angnor) - Member of the Knights who say Ni! (http://www.kwsn.net)
Download Updates at: http://www.zadra.org/seti_enhanced/

CPU real speed: 3209 MHz

Work Unit Info
True angle range: 0.775000

Flopcounter: 266377114356.250240

Spike count: 7
Pulse count: 5
Triplet count: 5
Gaussian count: 6
------------
sse3-xP-ipo-ben-B.exe / testWU-7.wu :
Started at : 21:13:21
Ended at : 21:18:34
Elapsed time: 313 seconds
CPU time : 297 secs - Flops: 562621 (rev: 1.50)

Result : strongly similar.
Stderr.txt :
Can't set up shared mem: -1
Optimized Windows SETI@Home Enhanced application
Version info: Windows SSE3 32-bit V5.15 'Chicken Good!' (R-1.5|+freq|xP [Ben Opt])
Compiled by Simon Zadra (KWSN - Chicken of Angnor) - Member of the Knights who say Ni! (http://www.kwsn.net)
Download Updates at: http://www.zadra.org/seti_enhanced/

CPUID: 'Intel PD Pentium D (Smithfield)'
cpus: 1 cores: 2 threads: 2 cache: L1=16K L2=1024K L3=0K
features: mmx sse sse2 sse3
speed: 3209 MHz

Work Unit Info
True angle range: 0.775000

Flopcounter: 266372562621

Spike count: 7
Pulse count: 5
Triplet count: 5
Gaussian count: 6
------------
CPU:
Number of CPUs 2
Specification Intel(R) Pentium(R) D CPU 3.00GHz
Code name SmithField
Instructions Sets MMX, SSE, SSE2, SSE3, EM64T
Clock Speed 2995.9 MHz
Stock frequency 3000 MHz
Clock multiplier x14.0
Front Side Bus Frequency 214.0 MHz
Bus Speed 856.0 MHz
L1 Data Cache 16 KBytes, 8-way set associative, 64 Bytes line size
L2 Cache 1024 KBytes, 8-way set associative, 64 Bytes line size
------------
Chipset:
Northbridge Intel i945P rev. A1
Southbridge Intel 82801GB (ICH7) rev. 01
------------
RAM:
Memory Type DDR2
Memory Size 3072 MBytes
Memory Frequency 285.3 MHz (3:4)
Max bandwidth PC4300 (266 MHz)
CAS# 4.0
RAS# to CAS# 4
RAS# Precharge 4
Cycle Time (tRAS) 12
------------
OS:
Windows Version Microsoft Windows XP x64 Professional Service Pack 2 (Build 3790)
============
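
For anyone who wants to compare the runs above without reading every block, here is a minimal Python sketch that pulls the CPU times out of a log in this format and computes a speedup ratio against the default app. The function names (`parse_cpu_times`, `speedup_vs`) are made up for illustration and are not part of the benchmark tool; the sample text is copied from the testWU-1 entries above, and the line patterns are an assumption based only on what this log looks like.

```python
import re
from collections import defaultdict

# Header line, e.g. "5.15-DeNitro-emt64.exe / testWU-1.wu :" (flags may follow the exe name).
HEADER = re.compile(r"^(?P<app>\S+\.exe)(?:\s+\S+)*\s*/\s*(?P<wu>\S+\.wu)\s*:$")
# Timing line, e.g. "CPU time : 401 secs - Flops: 282842 (rev: 0.00)".
CPU_TIME = re.compile(r"^CPU time\s*:\s*(?P<secs>\d+)\s*secs")


def parse_cpu_times(log_text):
    """Return {app: {work_unit: cpu_seconds}} for every run found in the log."""
    times = defaultdict(dict)
    current = None  # (app, work unit) of the run whose timing line is still expected
    for raw in log_text.splitlines():
        line = raw.strip()
        header = HEADER.match(line)
        if header:
            current = (header.group("app"), header.group("wu"))
            continue
        cpu = CPU_TIME.match(line)
        if cpu and current:
            app, wu = current
            times[app][wu] = int(cpu.group("secs"))
            current = None
    return dict(times)


def speedup_vs(times, baseline, app):
    """Total baseline CPU time divided by total app CPU time, over shared work units."""
    shared = times[baseline].keys() & times[app].keys()
    return sum(times[baseline][wu] for wu in shared) / sum(times[app][wu] for wu in shared)


SAMPLE = """\
default-515.exe -nographics / testWU-1.wu :
Elapsed time: 732 seconds
CPU time : 695 secs - Flops: 0 (rev: 0.00)
------------
5.15-DeNitro-emt64.exe / testWU-1.wu :
Elapsed time: 473 seconds
CPU time : 401 secs - Flops: 282842 (rev: 0.00)
"""

times = parse_cpu_times(SAMPLE)
ratio = speedup_vs(times, "default-515.exe", "5.15-DeNitro-emt64.exe")
print(f"64-bit build vs default on testWU-1: {ratio:.2f}x")
```

Adding up the CPU times listed above by hand over all seven test WUs gives 5272 secs for the default app against 2761 secs for the 64-bit build, roughly 1.9x, with the two 32-bit SSE3 builds landing close to the same total (2666 and 2778 secs).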
ID: 420579
Bob Delkhoon
Joined: 15 May 99
Posts: 11
Credit: 201,827
RAC: 0
United States
Message 420867 - Posted: 15 Sep 2006, 2:18:55 UTC

Thank you guys for the feedback. I believe I've been able to pull a little more speed out of the 64-bit app. It's hard to tell from the quicker test units how much faster it is, but the results were valid. I'm going to run the client overnight to see if there's a noticeable speed difference and make sure it's all OK. If it checks out, I'll probably submit the code over on Simon's developer board tomorrow.

Side note: It is supposed to be "EM64T", sorry about that. Got it mixed with AMD64 =)

Thanks,
Bob Delkhoon (DeNitro)
ID: 420867
BORG
Volunteer tester
Joined: 3 Aug 99
Posts: 305
Credit: 6,157,052
RAC: 0
Canada
Message 420872 - Posted: 15 Sep 2006, 2:27:36 UTC - in response to Message 420867.  

Thank you guys for the feedback. I believe I've been able to pull a little more speed out of the 64-bit app. It's hard to tell from the quicker test units how much faster it is, but the results were valid. I'm going to run the client overnight to see if there's a noticeable speed difference and make sure it's all OK. If it checks out, I'll probably submit the code over on Simon's developer board tomorrow.

Side note: It is supposed to be "EM64T", sorry about that. Got it mixed with AMD64 =)

Thanks,
Bob Delkhoon (DeNitro)


It runs faster on the Intel machine than on the AMD. If anything, it slowed down the AMD, so I removed it.

Running great on the Intel machine.

Borg :-)
ID: 420872
Alex Kan
Volunteer developer
Joined: 4 Dec 03
Posts: 127
Credit: 29,269
RAC: 0
United States
Message 420931 - Posted: 15 Sep 2006, 5:05:55 UTC - in response to Message 420234.  

Can you think of a function in SETI that might benefit from 16 x 4 SIMD registers? Would it keep the register pressure down somewhere? Your most complex SIMD use in SETI seems to be your SSE3 chirp, but the compiler seemed to use only registers and not store values in temporary memory when I looked at the assembly output.

Had you not brought up my SSE3 chirp, I would have said the same thing--the main loop's data dependencies aren't particularly serial. Given the length of the loop, I can't guarantee how well x86 processors will schedule around instruction latencies, so if I had extra registers, I'd try unrolling it wider to see what effect it had. It's still less certain than on PowerPC, where I'm dealing with an in-order vector unit and can tell what's a win just from examining the generated assembly.

As for the rest of the functions, I doubt too many of them would benefit much, if at all.
ID: 420931
KWSN - Chicken of Angnor
Volunteer developer
Volunteer tester
Joined: 9 Jul 99
Posts: 1199
Credit: 6,615,780
RAC: 0
Austria
Message 421350 - Posted: 15 Sep 2006, 19:41:43 UTC - in response to Message 419914.  
Last modified: 15 Sep 2006, 19:42:04 UTC


[...]
So, EricVonDaniken, have you signed up on Simon's board yet?

Yup, he just did.

Eric, you've got access to the pre-release forum now. Check back on lunatics.at, you will see an extra forum category (as well as an extra Downloads section).

Regards,
Simon.
ID: 421350
KWSN - Chicken of Angnor
Volunteer developer
Volunteer tester
Joined: 9 Jul 99
Posts: 1199
Credit: 6,615,780
RAC: 0
Austria
Message 421509 - Posted: 16 Sep 2006, 2:46:25 UTC
Last modified: 16 Sep 2006, 2:47:40 UTC

Hi Joker,

interesting that it's returning invalid results - haven't had many reports of such things happening (other than from massive overclocks that went just that little bit too far). Are you sure it's the 1.3 app? The 1.2 and 1.3 only differ in that 1.3 adds CPU MHz detection, not in any other significant way. The function used is part of IPP, and seems to work reliably enough.

I'll check out your registration troubles and manually unlock your account. Sorry it's got to be a bit complicated to sign up - there have already been a few unsavory spam messages posted before, so now there are a couple of measures against that in place, of which the registration confirmation is only one.

Should still work, but sometimes it just doesn't :o)

Regards,
Simon.

<edit>Account is activated. Enjoy!</edit>
ID: 421509
KB7RZF
Volunteer tester
Joined: 15 Aug 99
Posts: 9549
Credit: 3,308,926
RAC: 2
United States
Message 421520 - Posted: 16 Sep 2006, 3:00:48 UTC

Hmm, ever since I installed the 1.3 version of the SSE3 app on my computer, I have not had one invalid result. This is on a Dell computer: P4 2.8 GHz, 512 MB RAM, Windows XP Home SP2. Crunching like a madman. Question, though: CPU-Z lists EM64T under the instruction sets. Do I need to be on a 64-bit operating system to take advantage of that, or can I do anything with it on 32-bit?
ID: 421520
KB7RZF
Volunteer tester
Joined: 15 Aug 99
Posts: 9549
Credit: 3,308,926
RAC: 2
United States
Message 421527 - Posted: 16 Sep 2006, 3:05:48 UTC - in response to Message 421526.  

Hmm, ever since I installed the 1.3 version of the SSE3 app on my computer, I have not had one invalid result. This is on a Dell computer: P4 2.8 GHz, 512 MB RAM, Windows XP Home SP2. Crunching like a madman. Question, though: CPU-Z lists EM64T under the instruction sets. Do I need to be on a 64-bit operating system to take advantage of that, or can I do anything with it on 32-bit?

You need XP x64 at the very least. No 32-bit XP OS will run a 64-bit app or program.

Ahh, ok. Thanks. I just wasn't too sure. Don't know enough about all that stuff. Hehe. Thanks again.
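
A rough way to check which kind of Windows is installed from a script, assuming Python is available on the machine: Windows sets the `PROCESSOR_ARCHITECTURE` environment variable to the architecture seen by the current process, and sets `PROCESSOR_ARCHITEW6432` only when a 32-bit process runs on a 64-bit OS. The helper name `windows_os_is_64bit` is made up for illustration. This only describes the installed OS; it says nothing about whether the CPU itself supports EM64T (CPU-Z already answers that).

```python
import os

# Architecture strings that Windows uses for 64-bit systems.
SIXTY_FOUR_BIT = {"AMD64", "IA64", "ARM64"}


def windows_os_is_64bit(env=None):
    """Guess whether the running Windows installation is 64-bit from its environment."""
    env = os.environ if env is None else env
    # PROCESSOR_ARCHITEW6432 is only present for 32-bit processes on 64-bit Windows.
    arch = env.get("PROCESSOR_ARCHITEW6432") or env.get("PROCESSOR_ARCHITECTURE", "")
    return arch.upper() in SIXTY_FOUR_BIT


if __name__ == "__main__":
    print("64-bit Windows" if windows_os_is_64bit() else "32-bit Windows (or not Windows)")
```

The environment is passed in as a plain mapping so the check can be tried against made-up values without needing both kinds of machine.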
ID: 421527
Bob Delkhoon
Joined: 15 May 99
Posts: 11
Credit: 201,827
RAC: 0
United States
Message 421632 - Posted: 16 Sep 2006, 6:43:07 UTC - in response to Message 421369.  

I compiled a second test. It will probably be the last test I put out. I'm going to post the code over on Simon's page and most likely just submit changes to him there to keep things organized. The results seem pretty good so far.

This build is about 80% smaller and 5%-6% faster on the tests I did.
http://tiger.towson.edu/~bdelkh1/setiathome-5.15-DeNitro-em64t_test2.rar

Results may vary. I only have one x64 machine to test code on, so it's hard to tell how it will run on other CPU types.

PLEASE make sure to try this on a test work unit and validate the result before putting this into BOINC!

Hey Simon, your 1.3 SSE2 Generic has handed out invalid results; it did 4 in a row. I went back to 1.2 on one PC, and I may do the same on the other one, which is using the 64-bit app made by Bob, if it makes any more invalid results.

I also tried registering for an account on your forum. I tried both my email addresses, and somehow neither of them likes your confirmation emails, so I can read but not log in or post.

This most likely doesn't apply here, but a few months back one of my machines that only crunches started randomly giving out errors. I ran Memtest86 on it to test the RAM and, sure enough, after about 30 minutes errors started appearing. There were only 2 sticks of RAM in the system, so by trial and error I figured out which one was bad and left only the good one in. There's probably only a slim chance the RAM is the problem, but it's pretty easy to check with that software.

Thanks,
Bob Delkhoon
ID: 421632
KWSN - Chicken of Angnor
Volunteer developer
Volunteer tester
Joined: 9 Jul 99
Posts: 1199
Credit: 6,615,780
RAC: 0
Austria
Message 421873 - Posted: 16 Sep 2006, 17:29:46 UTC

Joker,

again, the only code change from 1.2 to 1.3 was the insertion of CPU MHz detection.

There is no "1.2 Code base" as such :o)

HTH,
Simon.
ID: 421873
KWSN - Chicken of Angnor
Volunteer developer
Volunteer tester
Joined: 9 Jul 99
Posts: 1199
Credit: 6,615,780
RAC: 0
Austria
Message 422163 - Posted: 16 Sep 2006, 23:43:59 UTC

In a word, no.

I'm sorry you're having trouble with it, but - 10005 Windows app downloads say differently.

I won't remove that code, as I find it pretty useful. In fact, there will be a little more and more specific output in a future release version about your CPU and its capabilities.

Should you need any specific 1.2 release version, I still have the .exe files.

Regards,
Simon.
ID: 422163
KWSN - Chicken of Angnor
Volunteer developer
Volunteer tester
Joined: 9 Jul 99
Posts: 1199
Credit: 6,615,780
RAC: 0
Austria
Message 422347 - Posted: 17 Sep 2006, 15:20:09 UTC

Okay,

I'll say it a third time: the only difference between 1.2 and 1.3 is the addition of the CPU MHz output.

There is no 1.2 source archive online anymore.

In analyzeFuncs.cpp, search for "KWSN".

In the code block following the first search hit, you will find something like

#ifdef USE_IPP
[3 lines of code]
#endif

Remove them, including the #ifdef and #endif lines, and you're at 1.2.
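
For anyone who would rather script that edit than do it by hand, a minimal sketch in Python follows. It assumes the guarded block contains no nested conditionals, and it works on text you pass in, so reading and writing analyzeFuncs.cpp is left to you. The function name `strip_guarded_block` and the sample text are hypothetical; the real three lines inside the block are not shown in this thread, so placeholders stand in for them.

```python
def strip_guarded_block(source, marker="KWSN", guard="#ifdef USE_IPP"):
    """Remove the first `guard` ... `#endif` block found at or after the first `marker` hit."""
    lines = source.splitlines(keepends=True)
    hit = next((i for i, line in enumerate(lines) if marker in line), None)
    if hit is None:
        return source  # marker not found: leave the text alone
    begin = next((i for i in range(hit, len(lines)) if lines[i].strip() == guard), None)
    if begin is None:
        return source  # no guarded block after the marker
    end = next(
        (i for i in range(begin + 1, len(lines)) if lines[i].strip().startswith("#endif")),
        None,
    )
    if end is None:
        return source  # unterminated block: safer to change nothing
    return "".join(lines[:begin] + lines[end + 1:])


# Hypothetical stand-in text; the real contents of analyzeFuncs.cpp are not shown here.
SAMPLE = (
    "// KWSN banner\n"
    "before();\n"
    "#ifdef USE_IPP\n"
    "placeholder_1();\n"
    "placeholder_2();\n"
    "placeholder_3();\n"
    "#endif\n"
    "after();\n"
)

print(strip_guarded_block(SAMPLE), end="")
```

Compare the result against the original with a diff before rebuilding, since a simple line match like this cannot tell whether it found the intended block.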

Regards,
Simon.
ID: 422347
Bob Delkhoon
Joined: 15 May 99
Posts: 11
Credit: 201,827
RAC: 0
United States
Message 444737 - Posted: 27 Oct 2006, 21:01:24 UTC

I've been working with Simon and a few guys on his developer's board. They've been working on some great optimizations to the seti code. I've been incorporating the changes into 64-bit builds and doing testing on them. I don't have any kind of timeframe on new clients but the progress is looking very good.

-Babak Delkhoon
ID: 444737
jeffusa
Joined: 21 Aug 02
Posts: 224
Credit: 1,809,275
RAC: 0
United States
Message 444861 - Posted: 28 Oct 2006, 0:56:15 UTC - in response to Message 444804.  
Last modified: 28 Oct 2006, 0:58:12 UTC

Good to know. Until then, it's full speed ahead on the 64-bit road for one of my crunchers. (I have 4: 2 completed and running, 1 needing only a pair of 275 CPUs and HSFs, and the last needing only a case for the P5W + C2Q CPU.) I also have two others (2 Abit AW9D-MAX motherboards), but sadly no CPUs yet, as I'm trying to get a 14'x60' single-wide mobile home I bought prepped and moved from Las Vegas, NV to Yermo, CA. Then I'll have to find a place that can do the setup on it (maybe the park can for a small fee).

This will be one of my newest crunchers. Sadly, I'm waiting on an OS and a case. And yep, it's a C2Q CPU.


Wow! You got a C2Q? Where did you score that? It would be cool to see it running if you still lived in Las Vegas. Why the move to Yermo? Where is that, by the way?!

ID: 444861 · Report as offensive
Profile jeffusa
Avatar

Send message
Joined: 21 Aug 02
Posts: 224
Credit: 1,809,275
RAC: 0
United States
Message 444863 - Posted: 28 Oct 2006, 0:59:35 UTC

Regarding all the talk about SSE: I ran the auto-optimized SETI client, and it said the regular SSE-optimized version was the best for my E6700 Core 2 Duo machine. Both SSE2 and SSE3 tested worse.
ID: 444863
©2025 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.