Windows port of Alex v8 code

Message boards : Number crunching : Windows port of Alex v8 code
zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65749
Credit: 55,293,173
RAC: 49
United States
Message 732889 - Posted: 31 Mar 2008, 15:42:46 UTC

Yeah, thanks to all the coders here who are actively porting Alex's V8 code to Windows. Possibly parity at last with the Macs, at least for those machines that aren't or can't be overclocked. For those that can overclock well, production should increase a lot. Exactly how much I don't know, but it looks like it will go up nonetheless. :D :)
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's

John Clark
Volunteer tester
Joined: 29 Sep 99
Posts: 16515
Credit: 4,418,829
RAC: 0
United Kingdom
Message 732928 - Posted: 31 Mar 2008, 18:13:16 UTC - in response to Message 732889.  

Yeah, thanks to all the coders here who are actively porting Alex's V8 code to Windows. Possibly parity at last with the Macs, at least for those machines that aren't or can't be overclocked. For those that can overclock well, production should increase a lot. Exactly how much I don't know, but it looks like it will go up nonetheless. :D :)


You are right there Joker. Production will go up when the port is released to the wild.
It's good to be back amongst friends and colleagues




zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65749
Credit: 55,293,173
RAC: 49
United States
Message 732942 - Posted: 31 Mar 2008, 19:23:21 UTC - in response to Message 732928.  
Last modified: 31 Mar 2008, 19:25:25 UTC

Yeah, thanks to all the coders here who are actively porting Alex's V8 code to Windows. Possibly parity at last with the Macs, at least for those machines that aren't or can't be overclocked. For those that can overclock well, production should increase a lot. Exactly how much I don't know, but it looks like it will go up nonetheless. :D :)


You are right there Joker. Production will go up when the port is released to the wild.

Thanks. Extra text moved to its own thread by yours truly, The Joker.
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's

Fred J. Verster
Volunteer tester
Joined: 21 Apr 04
Posts: 3252
Credit: 31,903,643
RAC: 0
Netherlands
Message 732944 - Posted: 31 Mar 2008, 19:28:44 UTC - in response to Message 732928.  
Last modified: 31 Mar 2008, 19:40:21 UTC

Yeah, thanks to all the coders here who are actively porting Alex's V8 code to Windows. Possibly parity at last with the Macs, at least for those machines that aren't or can't be overclocked. For those that can overclock well, production should increase a lot. Exactly how much I don't know, but it looks like it will go up nonetheless. :D :)


You are right there Joker. Production will go up when the port is released to the wild.



A very good job, JDWhale. This will surely work fine on quads, e.g. the Q6600.

One of my two Q6600s has a clock of 3325MHz, FSB 1486. The other is 'stock', with Crunch3r's SSSE3, but the OC'ed one beats the stock one, running the optimized app.

Your results are looking promising; keep up the good work.

David
Volunteer tester
Joined: 19 May 99
Posts: 411
Credit: 1,426,457
RAC: 0
Australia
Message 732949 - Posted: 31 Mar 2008, 19:54:10 UTC - in response to Message 732944.  

One of my two Q6600s has a clock of 3325MHz, FSB 1486. The other is 'stock', with Crunch3r's SSSE3, but the OC'ed one beats the stock one, running the optimized app.


My two Q6600s are now running at 3105 (1380 FSB) and 3150 (1400 FSB). It's amazing how well the Q6600s will overclock without really trying.


kittyman
Volunteer tester
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 732999 - Posted: 31 Mar 2008, 22:47:33 UTC - in response to Message 732859.  



Thanks to all for the work!!

I don't have much time to read this long thread every day.

I hope the final release of the new apps will be published in the New Optimized Apps Links -- READ ONLY THREAD so that all can profit from it.


When the new app is ready for everybody's use, I am sure there will be an announcement and links posted there.....

"Freedom is just Chaos, with better lighting." Alan Dean Foster


zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65749
Credit: 55,293,173
RAC: 49
United States
Message 733000 - Posted: 31 Mar 2008, 22:53:06 UTC - in response to Message 732999.  



Thanks to all for the work!!

I don't have much time to read this long thread every day.

I hope the final release of the new apps will be published in the New Optimized Apps Links -- READ ONLY THREAD so that all can profit from it.


When the new app is ready for everybody's use, I am sure there will be an announcement and links posted there.....

Sounds good to Me.
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's

OzzFan
Volunteer tester
Joined: 9 Apr 02
Posts: 15691
Credit: 84,761,841
RAC: 28
United States
Message 733041 - Posted: 1 Apr 2008, 1:03:41 UTC

JDWhale: I'm sure Ozzy would give some devil horns for your effort! Rock on! :-)

JDWhale
Volunteer tester
Joined: 6 Apr 99
Posts: 921
Credit: 21,935,817
RAC: 3
United States
Message 733067 - Posted: 1 Apr 2008, 3:11:46 UTC - in response to Message 732731.  
Last modified: 1 Apr 2008, 3:20:25 UTC

Host = Thurston: Q6600 01 Apr 02:22 UDT

Update on Thurston's RAC 6500 Challenge shows that RAC climbed ~230 yesterday and it seems the slope trended higher. I haven't even performed any creative scheduling yet, well not very creative anyway ;-) He was even crunching on 3 cores for part of the day while running some benchmarks. So I wonder what's up...?

Any guesses on what day he'll reach 6500?
Will he even reach 6500?
Any predictions on where he'll top out relative to Mark's "Frozen Penny"?
Maybe BoincStats can help with the predictions, I think I'll go have a look.

[edit] Boincstats says he did 6,432 on 31 Mar... two weeks like that and 6500 is in reach. [/edit]

Cheers,
JDWhale

@OzzFan - Back @ you! Seen Iron Man yet?
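
The "two weeks like that" guess can be sanity-checked with a rough model. This is only a minimal sketch: it assumes RAC behaves as an exponentially weighted average of daily credit with a one-week half-life, and that daily credit stays constant. The starting RAC of 4,000 is a hypothetical placeholder, not a figure from this thread; only the 6,432 daily figure comes from the post.

```python
def projected_rac(start_rac, daily_credit, days, half_life_days=7.0):
    """Project RAC after `days` of constant daily credit.

    Assumes RAC is an exponentially weighted average whose memory of
    the starting value halves every `half_life_days`.
    """
    decay = 0.5 ** (days / half_life_days)
    return daily_credit + (start_rac - daily_credit) * decay


START_RAC = 4000.0     # hypothetical starting point, not from the thread
DAILY_CREDIT = 6432.0  # BoincStats figure quoted in the post

for day in (1, 7, 14, 28):
    print(day, round(projected_rac(START_RAC, DAILY_CREDIT, day)))
```

Under this model RAC approaches the sustained daily credit from below, so a host averaging 6,432 per day would level off near 6,432; passing 6,500 would need daily credit somewhat above that.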

kittyman
Volunteer tester
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 733141 - Posted: 1 Apr 2008, 9:24:18 UTC - in response to Message 733067.  

Host = Thurston: Q6600 01 Apr 02:22 UDT

Update on Thurston's RAC 6500 Challenge shows that RAC climbed ~230 yesterday and it seems the slope trended higher. I haven't even performed any creative scheduling yet, well not very creative anyway ;-) He was even crunching on 3 cores for part of the day while running some benchmarks. So I wonder what's up...?

Any guesses on what day he'll reach 6500?
Will he even reach 6500?
Any predictions on where he'll top out relative to Mark's "Frozen Penny"?
Maybe BoincStats can help with the predictions, I think I'll go have a look.

[edit] Boincstats says he did 6,432 on 31 Mar... two weeks like that and 6500 is in reach. [/edit]

Cheers,
JDWhale

@OzzFan - Back @ you! Seen Iron Man yet?


Good luck with Thurston......
Just keep in mind that I drop my Frozen Penny back to 2.4v code in between the AK ported test runs I have been doing.....I will not be running the ported code full time until official release.....but the testing sure is fun....LOL.

"Freedom is just Chaos, with better lighting." Alan Dean Foster


JDWhale
Volunteer tester
Joined: 6 Apr 99
Posts: 921
Credit: 21,935,817
RAC: 3
United States
Message 733151 - Posted: 1 Apr 2008, 10:27:41 UTC - in response to Message 733141.  
Last modified: 1 Apr 2008, 10:28:36 UTC


Good luck with Thurston......
Just keep in mind that I drop my Frozen Penny back to 2.4v code in between the AK ported test runs I have been doing.....I will not be running the ported code full time until official release.....but the testing sure is fun....LOL.


I'm counting on you running 2.4V as the only way Thurston could even come close to catching Penny. Remember that Thurston is only a 3.2GHz Q6600 chasing your 4.58GHz X9650 Extreme... You've got almost 1.5 times the clock rate and 1.5 times the L2 cache. I estimate that my WhalePort of AK code is ~1.6 times the speed of the 2.4V code... This might prove an interesting challenge ;-)

I estimate that if Penny were running the ported code full time, she just might reach a RAC of 10,000. I wouldn't stand a chance in hell once the "Chicken Plus" port goes public... So, a little friendly "exhibition match" to demonstrate ported potential for us "little Q6600 guys" in the meantime.... I might be substituting in other "minor tweaks" to help gain an advantage if I can find some, so watch your back, Penny ;-)

Cheers,
JDWhale
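
For anyone who wants to play with the numbers above, here is a rough back-of-envelope sketch. It assumes throughput scales linearly with clock rate times a per-clock code factor, and it ignores the L2 cache and architecture differences entirely, so treat it as an illustration of the reasoning rather than a prediction. The function name is made up for this example; the clock rates and the ~1.6x estimate are the ones quoted in the post.

```python
THURSTON_CLOCK_GHZ = 3.2  # Q6600, as quoted above
PENNY_CLOCK_GHZ = 4.58    # X9650, as quoted above
PORT_SPEEDUP = 1.6        # estimated WhalePort speed relative to 2.4V code


def relative_throughput(clock_a, code_a, clock_b, code_b):
    """Throughput of host A relative to host B under the linear assumption."""
    return (clock_a * code_a) / (clock_b * code_b)


# Penny's clock advantage: "almost 1.5 times".
clock_ratio = PENNY_CLOCK_GHZ / THURSTON_CLOCK_GHZ

# Thurston on ported code against Penny on 2.4V code.
port_vs_old = relative_throughput(THURSTON_CLOCK_GHZ, PORT_SPEEDUP,
                                  PENNY_CLOCK_GHZ, 1.0)

# Both hosts on ported code: only the clock difference remains.
port_vs_port = relative_throughput(THURSTON_CLOCK_GHZ, PORT_SPEEDUP,
                                   PENNY_CLOCK_GHZ, PORT_SPEEDUP)

print(f"clock ratio (Penny/Thurston): {clock_ratio:.2f}")
print(f"Thurston ported vs Penny 2.4V: {port_vs_old:.2f}")
print(f"Thurston ported vs Penny ported: {port_vs_port:.2f}")
```

On these assumptions the port gives Thurston a modest edge while Penny stays on 2.4V, and that edge disappears once both hosts run ported code.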

Fred W
Volunteer tester
Joined: 13 Jun 99
Posts: 2524
Credit: 11,954,210
RAC: 0
United Kingdom
Message 733153 - Posted: 1 Apr 2008, 10:49:12 UTC - in response to Message 733151.  

I'm counting on you running 2.4V as the only way Thurston could even come close to catching Penny. Remember that Thurston is only a 3.2GHz Q6600 chasing your 4.58GHz X9650 Extreme... You've got almost 1.5 times the clock rate and 1.5 times the L2 cache. I estimate that my WhalePort of AK code is ~1.6 times the speed of the 2.4V code... This might prove an interesting challenge ;-)

I estimate that if Penny were running the ported code full time, she just might reach a RAC of 10,000. I wouldn't stand a chance in hell once the "Chicken Plus" port goes public... So, a little friendly "exhibition match" to demonstrate ported potential for us "little Q6600 guys" in the meantime.... I might be substituting in other "minor tweaks" to help gain an advantage if I can find some, so watch your back, Penny ;-)

Cheers,
JDWhale


Looks like you might have a chance as long as you don't snag too many VLAR WU's (not much chance!).

According to my records, Thurston is 38% slower than the Frozen Penny at VLAR but 1% FASTER at 0.22 - 0.4 AR and 9% FASTER at 0.4 - 0.5 AR (mean values).

Woo-hoo... Friendly competition.

F.

John Clark
Volunteer tester
Joined: 29 Sep 99
Posts: 16515
Credit: 4,418,829
RAC: 0
United Kingdom
Message 733158 - Posted: 1 Apr 2008, 11:49:35 UTC - in response to Message 733153.  
Last modified: 1 Apr 2008, 11:50:50 UTC

I estimate that if Penny were running the ported code full time, she just might reach a RAC of 10,000. I wouldn't stand a chance in hell once the "Chicken Plus" port goes public... So, a little friendly "exhibition match" to demonstrate ported potential for us "little Q6600 guys" in the meantime.... I might be substituting in other "minor tweaks" to help gain an advantage if I can find some, so watch your back, Penny ;-)

Cheers,
JDWhale


Looks like you might have a chance as long as you don't snag too many VLAR WU's (not much chance!).

According to my records, Thurston is 38% slower than the Frozen Penny at VLAR but 1% FASTER at 0.22 - 0.4 AR and 9% FASTER at 0.4 - 0.5 AR (mean values).

Woo-hoo... Friendly competition.

F.


Fred W

I am running a non-frozen Penny on air @ 4.0GHz. This rig uses Win XP Pro and the Crunch3r V2.4 optimised client.

How does a slightly slower Penny, on air, compare to Thurston?

Salivating for the public release of the JD/Alex, Chicken +, code port!
It's good to be back amongst friends and colleagues




JDWhale
Volunteer tester
Joined: 6 Apr 99
Posts: 921
Credit: 21,935,817
RAC: 3
United States
Message 733167 - Posted: 1 Apr 2008, 12:25:42 UTC - in response to Message 733158.  
Last modified: 1 Apr 2008, 12:27:49 UTC


Fred W

I am running a non-frozen Penny on air @ 4.0GHz. This rig uses Win XP Pro and the Crunch3r V2.4 optimised client.

How does a slightly slower Penny, on air, compare to Thurston?

Salivating for the public release of the JD/Alex, Chicken +, code port!


Though Fred can provide more widespread analysis and total assessment... A quick peek at the 0.39 AR 72.23 credit WUs that we're both choking on right now shows me that Thurston is ~16% faster than your Penny at this AR. Of course you're running at 4000MHz and Thurston is only at 3200MHz, so the comparison is not at all fair ;-)

Whereas Mark's "Frozen" Penny running at 4580MHz with 2.4V SSE4 shows virtually the same CPU times as Thurston running AK WhalePort v0.2 at the same AR.

BOINC On..On...
JDWhale
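
The two comparisons above can be turned into an implied per-clock gain for the ported code. This is a hedged sketch only: it assumes throughput scales linearly with clock rate, reads "~16% faster" as a throughput ratio of 1.16, reads "virtually the same CPU times" as a ratio of 1.00, and ignores that the Q6600 and the Penny chips are different designs. The function name is invented for illustration.

```python
def implied_per_clock_gain(new_clock_mhz, old_clock_mhz, throughput_ratio):
    """Per-clock advantage of the new-code host over the old-code host.

    throughput_ratio is new-code host throughput divided by old-code
    host throughput at the same angle range.
    """
    return throughput_ratio * old_clock_mhz / new_clock_mhz


# Thurston (3200MHz, WhalePort v0.2) vs the Frozen Penny (4580MHz, 2.4V).
gain_vs_frozen = implied_per_clock_gain(3200, 4580, 1.00)

# Thurston vs the air-cooled Penny (4000MHz, 2.4V), read as 16% faster.
gain_vs_air = implied_per_clock_gain(3200, 4000, 1.16)

print(f"vs Frozen Penny: {gain_vs_frozen:.2f}x per clock")
print(f"vs Penny on air: {gain_vs_air:.2f}x per clock")
```

Both data points land in the same neighbourhood, a bit under 1.5x per clock at this AR, which is at least consistent between the two hosts.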

John Clark
Volunteer tester
Joined: 29 Sep 99
Posts: 16515
Credit: 4,418,829
RAC: 0
United Kingdom
Message 733173 - Posted: 1 Apr 2008, 12:52:00 UTC - in response to Message 733167.  
Last modified: 1 Apr 2008, 13:06:37 UTC

Though Fred can provide more widespread analysis and total assessment... A quick peek at the 0.39 AR 72.23 credit WUs that we're both choking on right now shows me that Thurston is ~16% faster than your Penny at this AR. Of course you're running at 4000MHz and Thurston is only at 3200MHz, so the comparison is not at all fair ;-)

Whereas Mark's "Frozen" Penny running at 4580MHz with 2.4V SSE4 shows virtually the same CPU times as Thurston running AK WhalePort v0.2 at the same AR.

BOINC On..On...
JDWhale


Taking a fairly surface view of your statement ... this suggests, at an AR of ~0.39, that the software of the Chicken+ code is worth at least ~600MHz of CPU clock and the difference in die fabrication (45nm compared to 65nm)? All this on already optimised code!!
It's good to be back amongst friends and colleagues




Fred W
Volunteer tester
Joined: 13 Jun 99
Posts: 2524
Credit: 11,954,210
RAC: 0
United Kingdom
Message 733183 - Posted: 1 Apr 2008, 13:59:47 UTC - in response to Message 733158.  

Fred W

I am running a non-frozen Penny on air @ 4.0GHz. This rig uses Win XP Pro and the Crunch3r V2.4 optimised client.

How does a slightly slower Penny, on air, compare to Thurston?

Salivating for the public release of the JD/Alex, Chicken +, code port!

Demonstrating the danger of small sample sizes, your baby looks slightly faster than Thurston at VLAR... I think the other AR's are more representative:


[Linked comparison image not preserved]

F.

JDWhale
Volunteer tester
Joined: 6 Apr 99
Posts: 921
Credit: 21,935,817
RAC: 3
United States
Message 733186 - Posted: 1 Apr 2008, 14:12:00 UTC - in response to Message 733173.  

Though Fred can provide more widespread analysis and total assessment... A quick peek at the 0.39 AR 72.23 credit WUs that we're both choking on right now shows me that Thurston is ~16% faster than your Penny at this AR. Of course you're running at 4000MHz and Thurston is only at 3200MHz, so the comparison is not at all fair ;-)

Whereas Mark's "Frozen" Penny running at 4580MHz with 2.4V SSE4 shows virtually the same CPU times as Thurston running AK WhalePort v0.2 at the same AR.

BOINC On..On...
JDWhale


Taking a fairly surface view of your statement ... this suggests, at an AR of ~0.39, that the software of the Chicken+ code is worth at least ~600MHz of CPU clock and the difference in die fabrication (45nm compared to 65nm)? All this on already optimised code!!


Mind you that neither Mark nor I are running Chicken+ (sorry if this is not the preferred moniker). Mark might run it for short bursts for benchmarking. I'm running WhalePort v0.2 and am holding nothing back and disclosing everything I've learned and done with the crew working on the Chicken+ port. As said before, I'm restricted by Intel from sharing WhalePort, but I do not want my short-term effort to be lost. I hate to reiterate that point; please understand my position.

By "already optimized", I assume you're referring to the earlier Lunatics KWSN, Cruncher built 2.4V version. It's true that that code was/is "optimized" and demonstrates remarkable performance over the "stock" app. But the source code in "Alex Kan src-v8.tar" is a total rewrite of several functions that heavily leverage the SIMD instruction sets on more than one level (SSE2, SSE3, SSSE3). Add in differently coded algorithms based on processor characteristics and you can see that the code is complex and lends itself to many possible permutations. Give those folks doing all the work at Chicken+ the credit they deserve; that is where your "ported" app will likely come from. I'm just a big-mouthed fish that got lucky with some "trial development tools".

IMO... At first and second look, some functions bear absolutely no resemblance to each other. The methods of optimization are beyond my comprehension; I have no working knowledge of SIMD instructions and often find myself laboring over just a few lines of code trying to understand the method. This isn't the stuff you're likely to learn at school without at least a half dozen prerequisites, more likely only after already being in the workplace a while and paying Intel dearly for a customized course/workshop. Just my opinion, YMMV.

I don't pretend to be a coding expert and fall far short on Windows code (my working knowledge is in low-level 2D graphics, 3D graphics (OpenGL) and performance computing on the SGI Shared Memory MultiProcessor supercomputers of yesteryear). Like I've stated before, "Kudos to the folks that understand & develop this stuff". I'm just a porter that prefers hitting golf balls.

I think I'll head to the practice range now.

Cheers,
John

[AF>france>pas-de-calais]symaski62
Volunteer tester
Joined: 12 Aug 05
Posts: 258
Credit: 100,548
RAC: 0
France
Message 733188 - Posted: 1 Apr 2008, 14:14:55 UTC

WU : 26ja07ab.16641.18068.9.7.202
WU true angle range is : 1.242116
CPU time : 3210.625 sec
CPU type : GenuineIntel Intel(R) Pentium(R) Dual CPU E2160 @ 1.80GHz [x86 Family 6 Model 15 Stepping 13]
Number of CPUs : 2
Operating System : Microsoft Windows Vista Home Edition, Service Pack 1,(06.00.6001.00)

----------------------------------------

no comment ? mini 1.27 Angle Range ?

SETI@Home Informational message -9 result_overflow
I have a general disability of 80% and make a great effort for the community and to express myself; thank you for being understanding.

JDWhale
Volunteer tester
Joined: 6 Apr 99
Posts: 921
Credit: 21,935,817
RAC: 3
United States
Message 733197 - Posted: 1 Apr 2008, 14:27:15 UTC - in response to Message 733188.  
Last modified: 1 Apr 2008, 14:29:20 UTC

WU : 26ja07ab.16641.18068.9.7.202
WU true angle range is : 1.242116
CPU time : 3210.625 sec
CPU type : GenuineIntel Intel(R) Pentium(R) Dual CPU E2160 @ 1.80GHz [x86 Family 6 Model 15 Stepping 13]
Number of CPUs : 2
Operating System : Microsoft Windows Vista Home Edition, Service Pack 1,(06.00.6001.00)

----------------------------------------

no comment ? mini 1.27 Angle Range ?


WU : 28dc06ag.14133.23794.16.7.218
WU true angle range is : 2.387831
CPU time : 725.108248 sec
CPU type : GenuineIntel Intel(R) Core(TM)2 Quad CPU Q6600 @ 2.40GHz [x86 Family 6 Model 15 Stepping 11]
Number of CPUs : 4
Operating System : Microsoft Windows Vista Professional Edition, (06.00.6000.00)

[edit] wuid=244705636 [/edit]

Geek@Play
Volunteer tester
Joined: 31 Jul 01
Posts: 2467
Credit: 86,146,931
RAC: 0
United States
Message 733201 - Posted: 1 Apr 2008, 14:35:52 UTC - in response to Message 733186.  

Mind you that neither Mark nor I are running Chicken+ (sorry if this is not the preferred moniker). Mark might run it for short bursts for benchmarking. I'm running WhalePort v0.2 and am holding nothing back and disclosing everything I've learned and done with the crew working on the Chicken+ port. As said before, I'm restricted by Intel from sharing WhalePort, but I do not want my short-term effort to be lost. I hate to reiterate that point; please understand my position.

By "already optimized", I assume you're referring to the earlier Lunatics KWSN, Cruncher built 2.4V version. It's true that that code was/is "optimized" and demonstrates remarkable performance over the "stock" app. But the source code in "Alex Kan src-v8.tar" is a total rewrite of several functions that heavily leverage the SIMD instruction sets on more than one level (SSE2, SSE3, SSSE3). Add in differently coded algorithms based on processor characteristics and you can see that the code is complex and lends itself to many possible permutations. Give those folks doing all the work at Chicken+ the credit they deserve; that is where your "ported" app will likely come from. I'm just a big-mouthed fish that got lucky with some "trial development tools".

IMO... At first and second look, some functions bear absolutely no resemblance to each other. The methods of optimization are beyond my comprehension; I have no working knowledge of SIMD instructions and often find myself laboring over just a few lines of code trying to understand the method. This isn't the stuff you're likely to learn at school without at least a half dozen prerequisites, more likely only after already being in the workplace a while and paying Intel dearly for a customized course/workshop. Just my opinion, YMMV.

I don't pretend to be a coding expert and fall far short on Windows code (my working knowledge is in low-level 2D graphics, 3D graphics (OpenGL) and performance computing on the SGI Shared Memory MultiProcessor supercomputers of yesteryear). Like I've stated before, "Kudos to the folks that understand & develop this stuff". I'm just a porter that prefers hitting golf balls.

I think I'll head to the practice range now.

Cheers,
John


John, you're being too modest.

The AK source has been around for a year or more just waiting for someone to pick up the challenge and port it into a Windows application. If it were easy it would have been done long ago by someone else. Your being lucky has nothing to do with it. You accepted the challenge and brought new insight to the problem many had been looking at. It brings truth to the statement that “sometimes a new brain will see something that others have overlooked”.

John, accept your well earned accolades. You have earned your place here.

 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.