Windows port of Alex v8 code

Message boards : Number crunching : Windows port of Alex v8 code
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 37 · 38 · 39 · 40 · 41 · 42 · 43 . . . 50 · Next

AuthorMessage
Profile David
Volunteer tester
Avatar

Send message
Joined: 19 May 99
Posts: 411
Credit: 1,426,457
RAC: 0
Australia
Message 743403 - Posted: 24 Apr 2008, 4:53:38 UTC - in response to Message 743343.  

I will pull the plug on Ginger when her cache empties in a couple days.


Jeez, sounds rather dramatic. What did Ginger do to you to deserve such a nasty fate... :)

My focus is now pointed at "burning in" the release candidates so there are fewer (hopefully none), surprises upon release.


Only real way is to widen the test group. Look what happens with each Microsoft release as its tested with a small and knowledgeable userbase, but give it to the idiots out in public and they break it quicker than a McDonalds Toy.

and have refrained from cancelling WUs where my wingam was running BOINC 4.45.


I dont look that hard, but have been caught by a few running the old version. Mt RAC is jumping by a few hundred now & then, but I blame the WU's rather than anything else. Plus I had a heap on the David_Quad_core that failed for some reason, but took a few thousand seconds to do it, and gave me 0 credits - argh


ID: 743403 · Report as offensive
Profile JDWhale
Volunteer tester
Avatar

Send message
Joined: 6 Apr 99
Posts: 921
Credit: 21,935,817
RAC: 3
United States
Message 743515 - Posted: 24 Apr 2008, 12:30:13 UTC - in response to Message 743403.  

I will pull the plug on Ginger when her cache empties in a couple days.


Jeez, sounds rather dramatic. What did Ginger do to you to deserve such a nasty fate... :)


Summer is approaching quickly here in SE Texas. Room temps are at 27C and Ginger stays in heat. Coupled with her energy draw puts me over my project budget of 1kW since Wrongway came online. She may be going down, but she is not going out. Is destined to my 21YO daughter come June.

My focus is now pointed at "burning in" the release candidates so there are fewer (hopefully none), surprises upon release.


Only real way is to widen the test group. Look what happens with each Microsoft release as its tested with a small and knowledgeable userbase, but give it to the idiots out in public and they break it quicker than a McDonalds Toy.


I'm trying everything I know to break RC1... Used it as a coffee holder one morning... Had it fetch the post another...Manning the BBQ last night... Solid as a rock! We're talking "Top Shelf" stuff here.

and have refrained from cancelling WUs where my wingam was running BOINC 4.45.


I dont look that hard, but have been caught by a few running the old version. Mt RAC is jumping by a few hundred now & then, but I blame the WU's rather than anything else. Plus I had a heap on the David_Quad_core that failed for some reason, but took a few thousand seconds to do it, and gave me 0 credits - argh

I hate those... and sometimes you get a gaggle of WUs with the same wingman, who happens to go missing leaving you hold the pending bag for a couple months.

Yes, modus operandi(MO) for Thurston avoid most "pending" pitfalls as credit is usually granted when the result is returned ("checked but no concensus" is the exception). As such I would expect RAC to stabilize 2 weeks after change is applied as long as he doesn't run out of work. That is one of the pitfalls of this MO. Without many pending, his RAC will drop like a rock when the work runs out... Thus a large cache is essential. Without another interruption, I expect Thurston to reach stable RAC in about 72-96 hours, target is still RAC 7090.

G'Day,
JDWhale
ID: 743515 · Report as offensive
Profile mr.kjellen
Volunteer tester
Avatar

Send message
Joined: 4 Jan 01
Posts: 195
Credit: 71,324,196
RAC: 0
Sweden
Message 743557 - Posted: 24 Apr 2008, 14:09:52 UTC

I see a Whale roaming in the waters of the top 20...Congrats JD.

/Anton
ID: 743557 · Report as offensive
Profile Francois Piednoel
Avatar

Send message
Joined: 14 Jun 00
Posts: 898
Credit: 5,969,361
RAC: 0
United States
Message 743558 - Posted: 24 Apr 2008, 14:11:50 UTC
Last modified: 24 Apr 2008, 14:12:41 UTC

Starting this morning, one of my Skulltrail will be running John (JDWhale) version of the Alex V8 code, based on his request. I will use the other machine to try to catch up with my own code :) , that is a big task, because Alex/John code is amazingly fast.

I am so busy preparing the next step of the hardware ... We will be able to see how my actual design can do on the best code ;)

Francois
ID: 743558 · Report as offensive
Profile JDWhale
Volunteer tester
Avatar

Send message
Joined: 6 Apr 99
Posts: 921
Credit: 21,935,817
RAC: 3
United States
Message 743577 - Posted: 24 Apr 2008, 15:10:53 UTC - in response to Message 743558.  

Starting this morning, one of my Skulltrail will be running John (JDWhale) version of the Alex V8 code, based on his request. I will use the other machine to try to catch up with my own code :) , that is a big task, because Alex/John code is amazingly fast.

I am so busy preparing the next step of the hardware ... We will be able to see how my actual design can do on the best code ;)

Francois



Thank you for allowing AK_WhalePort to run on your Skulltrail. The version I provided is V0.2X built using SSE4.1 IPP 32-bit. Since I do not have any hardware to test this version (45nm chip) I can only hope that this client version will perform well. I will be watching and waiting to see some results post.

Note: AK_WhalePort is a "verbatim" windows port of Alex v-8 code... I only did what was necessary to get the source code to build & link substituting Intel IPP calls for Apple proprietary DSP functions.

Again... Thank you Francois..

Best regards,
JDWhale


@mr.kjellen - Thanks... except for untold misfortune by the leaders I can not expect Thurston to rise any more... Since starting this quest/race the RAC to reach top 20 has risen by maybe 300-500.
ID: 743577 · Report as offensive
Profile Francois Piednoel
Avatar

Send message
Joined: 14 Jun 00
Posts: 898
Credit: 5,969,361
RAC: 0
United States
Message 743594 - Posted: 24 Apr 2008, 16:14:29 UTC - in response to Message 743577.  

Starting this morning, one of my Skulltrail will be running John (JDWhale) version of the Alex V8 code, based on his request. I will use the other machine to try to catch up with my own code :) , that is a big task, because Alex/John code is amazingly fast.

I am so busy preparing the next step of the hardware ... We will be able to see how my actual design can do on the best code ;)

Francois



Thank you for allowing AK_WhalePort to run on your Skulltrail. The version I provided is V0.2X built using SSE4.1 IPP 32-bit. Since I do not have any hardware to test this version (45nm chip) I can only hope that this client version will perform well. I will be watching and waiting to see some results post.

Note: AK_WhalePort is a "verbatim" windows port of Alex v-8 code... I only did what was necessary to get the source code to build & link substituting Intel IPP calls for Apple proprietary DSP functions.

Again... Thank you Francois..

Best regards,
JDWhale


@mr.kjellen - Thanks... except for untold misfortune by the leaders I can not expect Thurston to rise any more... Since starting this quest/race the RAC to reach top 20 has risen by maybe 300-500.



It is now running, not generating errors, the L2 cache miss are much lower than before, nice work from Alex and team. this is the machine you want to look at: hostid=4081480


I am jalous :)

V / Who? / Francois
ID: 743594 · Report as offensive
W-K 666 Project Donor
Volunteer tester

Send message
Joined: 18 May 99
Posts: 19078
Credit: 40,757,560
RAC: 67
United Kingdom
Message 743595 - Posted: 24 Apr 2008, 16:25:40 UTC

Right, I'm setup and waiting, patiently, for these new apps. Just connected new host Q9450 at 15:42 UTC first two results returned. Still at stock speed etc.
Actually my sons, but haven't told him components and new 24" widescreen monitor are here yet, but he has paid me, so maybe I better. It will be connected via his account when he takes it to his place.
ID: 743595 · Report as offensive
Profile John Clark
Volunteer tester
Avatar

Send message
Joined: 29 Sep 99
Posts: 16515
Credit: 4,418,829
RAC: 0
United Kingdom
Message 743597 - Posted: 24 Apr 2008, 16:27:05 UTC - in response to Message 743577.  
Last modified: 24 Apr 2008, 16:28:20 UTC

Starting this morning, one of my Skulltrail will be running John (JDWhale) version of the Alex V8 code, based on his request. I will use the other machine to try to catch up with my own code :) , that is a big task, because Alex/John code is amazingly fast.

I am so busy preparing the next step of the hardware ... We will be able to see how my actual design can do on the best code ;)Francois


Thank you for allowing AK_WhalePort to run on your Skulltrail. The version I provided is V0.2X built using SSE4.1 IPP 32-bit. Since I do not have any hardware to test this version (45nm chip) I can only hope that this client version will perform well. I will be watching and waiting to see some results post.

Note: AK_WhalePort is a "verbatim" windows port of Alex v-8 code... I only did what was necessary to get the source code to build & link substituting Intel IPP calls for Apple proprietary DSP functions.

Again... Thank you Francois..

Best regards,
JDWhale


Excellent addition to see how the Alex code port, care of JD, will perform.

@mr.kjellen - Thanks... except for untold misfortune by the leaders I can not expect Thurston to rise any more... Since starting this quest/race the RAC to reach top 20 has risen by maybe 300-500.


It is interesting to see how the top 20 entry requirement has changed in the last 12 months.

My first Quad was a QX6,700 clocked to 3.1GHz from stock. This stabilised at it's current RAC while reaching a maximum host position of 19 in the top 20. So, it looks like the entry RAC is heading to double that required 12 - 14 months ago.
It's good to be back amongst friends and colleagues



ID: 743597 · Report as offensive
Profile ML1
Volunteer moderator
Volunteer tester

Send message
Joined: 25 Nov 01
Posts: 20331
Credit: 7,508,002
RAC: 20
United Kingdom
Message 743610 - Posted: 24 Apr 2008, 17:11:12 UTC - in response to Message 743558.  

Starting this morning, one of my Skulltrail will be running John (JDWhale) version of the Alex V8 code, based on his request. I will use the other machine to try to catch up with my own code :) ...

A most excellent test... But why only 32-bit?

Good luck and may the best code win!

(Please, just the results ;-) )

Happy crunchin,
Martin

See new freedom: Mageia Linux
Take a look for yourself: Linux Format
The Future is what We all make IT (GPLv3)
ID: 743610 · Report as offensive
Profile JDWhale
Volunteer tester
Avatar

Send message
Joined: 6 Apr 99
Posts: 921
Credit: 21,935,817
RAC: 3
United States
Message 743620 - Posted: 24 Apr 2008, 17:34:36 UTC - in response to Message 743610.  

Starting this morning, one of my Skulltrail will be running John (JDWhale) version of the Alex V8 code, based on his request. I will use the other machine to try to catch up with my own code :) ...

A most excellent test... But why only 32-bit?

Good luck and may the best code win!

(Please, just the results ;-) )

Happy crunchin,
Martin



First results have posted hostid=4081480&offset=1620

These early results are hybrids, partially crunched with another client. "Purebreed" should be a bit faster ;-)

@Martin - I only DL and install Intel ICC & IPP 32-bit evaluation copies.
So the "real" clients will be even faster. In addition, 32-bit runs on both 32 & 64 bit OS.
ID: 743620 · Report as offensive
Profile Geek@Play
Volunteer tester
Avatar

Send message
Joined: 31 Jul 01
Posts: 2467
Credit: 86,146,931
RAC: 0
United States
Message 743627 - Posted: 24 Apr 2008, 17:59:50 UTC

With the release of the new app's about 1 week away, I have one question.

Can someone explain in layman's terms........Why is Alex Kan's V8 source code so much faster than what we had before?
ID: 743627 · Report as offensive
Profile Francois Piednoel
Avatar

Send message
Joined: 14 Jun 00
Posts: 898
Credit: 5,969,361
RAC: 0
United States
Message 743629 - Posted: 24 Apr 2008, 18:05:54 UTC
Last modified: 24 Apr 2008, 18:19:01 UTC

here it is!

wow!

Francois
ID: 743629 · Report as offensive
Profile Francois Piednoel
Avatar

Send message
Joined: 14 Jun 00
Posts: 898
Credit: 5,969,361
RAC: 0
United States
Message 743630 - Posted: 24 Apr 2008, 18:11:34 UTC - in response to Message 743627.  

With the release of the new app's about 1 week away, I have one question.

Can someone explain in layman's terms........Why is Alex Kan's V8 source code so much faster than what we had before?

From what I can tell with vtune, Alex code did improve dramatically the data locality, The L2 cache miss ratio decrease by a factor of 10, and L1 cache is much more efficent.Alex can probably confirm it, I don t have the "accelerator.h", so, I can t confirm from a code point of view.

Who?
ID: 743630 · Report as offensive
Profile JDWhale
Volunteer tester
Avatar

Send message
Joined: 6 Apr 99
Posts: 921
Credit: 21,935,817
RAC: 3
United States
Message 743641 - Posted: 24 Apr 2008, 18:36:23 UTC - in response to Message 743627.  
Last modified: 24 Apr 2008, 18:50:42 UTC

With the release of the new app's about 1 week away, I have one question.

Can someone explain in layman's terms........Why is Alex Kan's V8 source code so much faster than what we had before?


I'll try one example...

When wanting to find the MEAN of a vector(1D array), typically a programmer would write ~5 lines of code to loop over the all the elements to gather a sum, then divide the sum by the vector length. Simple enough and easy to read/understand, even for most non-programmers.

Alex code might implement the same operation using maybe 100 lines of code, split the array into 4 or eight pipes, feed the "vector" part of the processor allowing 4 or 8 sums to take place in the same amount of CPU clicks as a single sum operation. Accumulate the sum from the separate pipes, then do the division.

Not very intuitive, but making much more efficient use of the CPU and making the code much more complex to read and to write and to comprehend.
Also requires coding some of the optimizations multiple times, once for each CPUs capabilities to gain maximum performance on each CPU type.

This process uses what are called "intrinsic functions" that allow "low level" access to the CPU capabilities. This set of intrinsics change from CPU to CPU and are sometimes refered to as SSE, SSE2, SSE3, SSSE3, SSE4.1, MMX, ...etc.
Explaining the need for more than one client, depending on which "instructions" your host(CPU) can understand.


This was just the most simple of examples of how Alex speed up the SETI client.

Regards,
JDWhale



[edit]
The Skulltrail has changed hostid and "detached"... hostid=4262769&offset=1800
[/edit]
ID: 743641 · Report as offensive
Profile JDWhale
Volunteer tester
Avatar

Send message
Joined: 6 Apr 99
Posts: 921
Credit: 21,935,817
RAC: 3
United States
Message 743649 - Posted: 24 Apr 2008, 19:03:46 UTC - in response to Message 743629.  
Last modified: 24 Apr 2008, 19:25:16 UTC

[edit]message deleted[/edit]

The Skulltrail has changed hostid and "detached"... New host is

hostid=4262769
ID: 743649 · Report as offensive
Profile Sir Ulli
Volunteer tester
Avatar

Send message
Joined: 21 Oct 99
Posts: 2246
Credit: 6,136,250
RAC: 0
Germany
Message 743655 - Posted: 24 Apr 2008, 19:33:37 UTC

one Question, when is the code be official..

for all...

Greetings from Germany NRW
Ulli




ID: 743655 · Report as offensive
Fred W
Volunteer tester

Send message
Joined: 13 Jun 99
Posts: 2524
Credit: 11,954,210
RAC: 0
United Kingdom
Message 743657 - Posted: 24 Apr 2008, 19:35:26 UTC - in response to Message 743655.  

one Question, when is the code be official..

for all...

Greetings from Germany NRW
Ulli





To the best of my knowledge, current target is May 1st. Unless anyone in the know wishes to revise that estimate.

F.
ID: 743657 · Report as offensive
Profile Francois Piednoel
Avatar

Send message
Joined: 14 Jun 00
Posts: 898
Credit: 5,969,361
RAC: 0
United States
Message 743668 - Posted: 24 Apr 2008, 20:23:05 UTC - in response to Message 743649.  

[edit]message deleted[/edit]

The Skulltrail has changed hostid and "detached"... New host is

hostid=4262769


that was weird ... Can't explain what happenned...

Francois
ID: 743668 · Report as offensive
Profile JDWhale
Volunteer tester
Avatar

Send message
Joined: 6 Apr 99
Posts: 921
Credit: 21,935,817
RAC: 3
United States
Message 743677 - Posted: 24 Apr 2008, 20:55:59 UTC - in response to Message 743668.  

[edit]message deleted[/edit]

The Skulltrail has changed hostid and "detached"... New host is

hostid=4262769


that was weird ... Can't explain what happenned...

Francois



If you crashed... Mark had mentioned he needed to raise vCore a bit to account for the extra stress being put on his OC penryn with the new client.

You probably have more information/resources on that requirement than I could help with. Check your BOINC log. About 4 days ago Thurston suffered from a "random project reset" where all WUs were deleted and detatched. More details on this were posted to this thread... like I said about 4 days ago.

Since then I pointed a small fan directly on my memory and adjusted vDimm18 a bit. I don't have a lot of confidence in my BALLISTIX DDR2-1066 dimms.

Looking good,
John
ID: 743677 · Report as offensive
Profile David
Volunteer tester
Avatar

Send message
Joined: 19 May 99
Posts: 411
Credit: 1,426,457
RAC: 0
Australia
Message 743702 - Posted: 24 Apr 2008, 21:53:55 UTC - in response to Message 743677.  

Since then I pointed a small fan directly on my memory and adjusted vDimm18 a bit. I don't have a lot of confidence in my BALLISTIX DDR2-1066 dimms.

I recently bought 2 1GB sticks as well and it took less than 2 days before I removed them and went back to the Geil 800 sticks - even running under 1066 the Ballistix showed errors and poor memory speed with various memory tests and benchmark programs.

Hard to imagine why 1066 ram is slower than 800 (at 840) ram. Ah well, such is life.
ID: 743702 · Report as offensive
Previous · 1 . . . 37 · 38 · 39 · 40 · 41 · 42 · 43 . . . 50 · Next

Message boards : Number crunching : Windows port of Alex v8 code


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.