Conroe in SETI? :)

Message boards : Number crunching : Conroe in SETI? :)
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
Saimek

Send message
Joined: 25 Jan 00
Posts: 121
Credit: 454,423
RAC: 0
Poland
Message 341251 - Posted: 18 Jun 2006, 14:14:03 UTC
Last modified: 18 Jun 2006, 14:14:53 UTC

This may be the 1st conroe in seti.... 2,66 Ghz.... 4 logic units = 2core*2(HT) = 4? :>

nice times... 65 credit units in ~6000sec...

HERE

http://setiathome.berkeley.edu/show_host_detail.php?hostid=2458557


ID: 341251 · Report as offensive
Hans Dorn
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 3 Apr 99
Posts: 2262
Credit: 26,448,570
RAC: 0
Germany
Message 341254 - Posted: 18 Jun 2006, 14:16:57 UTC

Pretty impressive.

I would guess this is a dual woodcrest xeon machine.

Regards Hans.

P.S: Still no sign of them in the online shops, grrmph.

ID: 341254 · Report as offensive
EricVonDaniken

Send message
Joined: 17 Apr 04
Posts: 177
Credit: 67,881
RAC: 0
United States
Message 341302 - Posted: 18 Jun 2006, 15:49:21 UTC

That may be the fastest s@h client host currently crunching...

...and it is not even going at full speed yet since it does not look like it is running SSE optimized code.
ID: 341302 · Report as offensive
Urs Echternacht
Volunteer tester
Avatar

Send message
Joined: 15 May 99
Posts: 692
Credit: 135,197,781
RAC: 211
Germany
Message 341306 - Posted: 18 Jun 2006, 15:53:51 UTC - in response to Message 341302.  

That may be the fastest s@h client host currently crunching...

...and it is not even going at full speed yet since it does not look like it is running SSE optimized code.


If you look at the results, it is actually running at full speed with an optimized SSE3 setiapp.
_\|/_
U r s
ID: 341306 · Report as offensive
Ingleside
Volunteer developer

Send message
Joined: 4 Feb 03
Posts: 1546
Credit: 15,832,022
RAC: 13
Norway
Message 341330 - Posted: 18 Jun 2006, 16:43:23 UTC

Hmm, on average 34.3 CS/h per "core". :drool:

This should be 3292 CS/day for this 4-way system.

Also, for "normal" angle-range, averages 1.6h/wu, meaning 15 wu/day per core...
At these speeds, Seti_Enhanced-wu doesn't look long afterall...
ID: 341330 · Report as offensive
EricVonDaniken

Send message
Joined: 17 Apr 04
Posts: 177
Credit: 67,881
RAC: 0
United States
Message 341338 - Posted: 18 Jun 2006, 16:56:19 UTC - in response to Message 341306.  

That may be the fastest s@h client host currently crunching...

...and it is not even going at full speed yet since it does not look like it is running SSE optimized code.


If you look at the results, it is actually running at full speed with an optimized SSE3 setiapp.

*smacks self in forehead for being unobservant*
I stand corrected. Thanks.

In that case, I suspect this is =not= a 4 core Woodcrest system since a 2.66GHz 4C Woodcrest system rates to be even faster than this... (!)

ID: 341338 · Report as offensive
Profile Clyde C. Phillips, III

Send message
Joined: 2 Aug 00
Posts: 1851
Credit: 5,955,047
RAC: 0
United States
Message 341480 - Posted: 18 Jun 2006, 19:06:39 UTC

With his Conroe he's doing just about as well (per thread) as my Pentium D950s with half-hour units (25 cobblestones per hour). However he's doing much better with the longer units, about 40 cobblestones per thread per hour conpared to my 14 per hour. And- he's got four threads to my two.
ID: 341480 · Report as offensive
Profile KWSN - Chicken of Angnor
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 9 Jul 99
Posts: 1199
Credit: 6,615,780
RAC: 0
Austria
Message 341489 - Posted: 18 Jun 2006, 19:15:04 UTC

Conroes are FPU monsters compared to any previous CPUs - even Athlon64s, which for the past few years have been leading in this regard.

Also, their shared-cache approach instead of exclusive is getting them extra gains vs. other dual core architectures, even if the integrated memory controller isn't ready yet (next version, then...CSI).

All in all, on the Desktop front, it'll be the best for a while. It's a pity Intel won't be able to back it up with numbers (Core 2 will make up for less than 20% of their volume far into 2007).

But still - "precioussssssssssss", as Gollum would say. Damn, but I want one :o)

Regards
Donate to SETI@Home via PayPal!

Optimized SETI@Home apps + Information
ID: 341489 · Report as offensive
W-K 666 Project Donor
Volunteer tester

Send message
Joined: 18 May 99
Posts: 19716
Credit: 40,757,560
RAC: 67
United Kingdom
Message 341936 - Posted: 19 Jun 2006, 4:15:29 UTC - in response to Message 341927.  

This may be the 1st conroe in seti.... 2,66 Ghz.... 4 logic units = 2core*2(HT) = 4? :>

nice times... 65 credit units in ~6000sec...

HERE

http://setiathome.berkeley.edu/show_host_detail.php?hostid=2458557



Conroe has 4MB of cache......2MB for each core. This computer says it only has 1MB of cache.

Thats a problem with BOINC all cpu's report 1 MB of Cache.
ID: 341936 · Report as offensive
Profile Clyde C. Phillips, III

Send message
Joined: 2 Aug 00
Posts: 1851
Credit: 5,955,047
RAC: 0
United States
Message 342484 - Posted: 19 Jun 2006, 19:01:04 UTC

One reason why Gurra's computer is doing very well is that he's using Crunch3r's cruncher. I see, now, that his machine is número uno on the list of top computers with an RAC of over 2,000. But that doesn't explain nearly fully why BOTH of my P950s together are only doing about 1,100, even without a single erroneous unit in a week. They should be doing almost as well as his single Conroe, I should think. His machine eats up 60 to 64-cobblestone units whereas mine balk at them (4-1/2 hours), and especially the 56-cobblestone ones which take eight or nine hours. It could be that Corsair Value Select (667) memory which has a reported CAS of a huge 5. Or it might be the default cruncher.
ID: 342484 · Report as offensive
Hans Dorn
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 3 Apr 99
Posts: 2262
Credit: 26,448,570
RAC: 0
Germany
Message 342492 - Posted: 19 Jun 2006, 19:10:38 UTC - in response to Message 342484.  
Last modified: 19 Jun 2006, 19:12:02 UTC

One reason why Gurra's computer is doing very well is that he's using Crunch3r's cruncher. I see, now, that his machine is número uno on the list of top computers with an RAC of over 2,000. But that doesn't explain nearly fully why BOTH of my P950s together are only doing about 1,100, even without a single erroneous unit in a week. They should be doing almost as well as his single Conroe, I should think. His machine eats up 60 to 64-cobblestone units whereas mine balk at them (4-1/2 hours), and especially the 56-cobblestone ones which take eight or nine hours. It could be that Corsair Value Select (667) memory which has a reported CAS of a huge 5. Or it might be the default cruncher.


This machine has 2 "woodcrest" xeons, sporting a 1333 Mhz FSB :o)
(I'm guessing here, but that's the only plausible candidate...)

Intels pentium-M architecture always was pretty fast, but they improved it significantly with their new "core2" (conroe, woodcrest et al.)

Regards Hans

P.S: The optimized crunchers are significantly faster than the stock one.
ID: 342492 · Report as offensive
Kim Vater
Volunteer tester

Send message
Joined: 27 May 99
Posts: 227
Credit: 22,743,307
RAC: 0
Norway
Message 342582 - Posted: 19 Jun 2006, 20:55:21 UTC - in response to Message 342492.  
Last modified: 19 Jun 2006, 20:57:58 UTC



Regards Hans

P.S: The optimized crunchers are significantly faster than the stock one.


I fully agree on that ;-)
(have a look at my 3 hosts - P4 Northwood & Prescott s.478)

Kiva
Greetings from Norway

Crunch3er & AK-V8 Inside
ID: 342582 · Report as offensive
EricVonDaniken

Send message
Joined: 17 Apr 04
Posts: 177
Credit: 67,881
RAC: 0
United States
Message 342587 - Posted: 19 Jun 2006, 20:57:58 UTC

Gurra's host does not seem fast enough to be 2 2C 2.66GHz Woodcrests.
Especially since it is running SSE optimized apps.

If it is a 2S Woodcrest system, Gurra is not getting expected performance out of it.
ID: 342587 · Report as offensive
Hans Dorn
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 3 Apr 99
Posts: 2262
Credit: 26,448,570
RAC: 0
Germany
Message 342750 - Posted: 19 Jun 2006, 23:35:06 UTC

On a related note: Dell is giving away xeons for free ATM.
(Offer ends Jun-27).

Methinks they're making room for the 51xx xeons :o)

Regards Hans
ID: 342750 · Report as offensive
Profile Gecko
Volunteer tester
Avatar

Send message
Joined: 17 Nov 99
Posts: 454
Credit: 6,946,910
RAC: 47
United States
Message 342865 - Posted: 20 Jun 2006, 2:03:17 UTC - in response to Message 342587.  
Last modified: 20 Jun 2006, 2:30:51 UTC

Gurra's host does not seem fast enough to be 2 2C 2.66GHz Woodcrests.
Especially since it is running SSE optimized apps.

If it is a 2S Woodcrest system, Gurra is not getting expected performance out of it.


I indeed think it could be a DP , 4 core Xeon system.
Crunch3r's ap is @ 25-40% faster than standard x86 S@H ap depending on the comparison.
Alex's G5 optimized ap is @ 80%-90% faster than standard S@H PPC ap.

If one were comparing apples to apples applications w/ similar efficiency improvements, Gurra's host would be blowing the G5 Quads into the weeds, RAC @ 5500-6000+. Plus, his RAC probably isn't done climbing yet as it is.

If a host running Crunch3r's 5.12 runs at 1/2 the RAC vs. Crunch3r's best non-enhanced ap, that would equate to each Woodcrest core at @ 1144 RAC under the old system. Don't recall seeing too many single CPU, single core, non HT hosts at stock clock speed w/ that high of RAC.

Can only imagine what that rig could do w/ an ap actually optimized for the new Core 2 architecture.
It would be awe-inspiring.
ID: 342865 · Report as offensive
Profile KWSN - Chicken of Angnor
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 9 Jul 99
Posts: 1199
Credit: 6,615,780
RAC: 0
Austria
Message 342868 - Posted: 20 Jun 2006, 2:08:25 UTC

ICC 9.1 supports Core 2/Duo natively and will accelerate things even faster.

I'm still salivating at the prospect of getting one - I've been meaning to upgrade my box but have kept delaying first for X2s and then for Conroe.

They also overclock like mad on default voltage and air cooling.

BTW, Crunch3rs optimized app isn't the quickest Linux optimized code anymore ;)

My code runs between 2x to 4x faster on the same host than the default client does. Same as Alex', by the way. It's incredible what he did, my MacMini is a credit monster now.
Donate to SETI@Home via PayPal!

Optimized SETI@Home apps + Information
ID: 342868 · Report as offensive
Profile Gecko
Volunteer tester
Avatar

Send message
Joined: 17 Nov 99
Posts: 454
Credit: 6,946,910
RAC: 47
United States
Message 342871 - Posted: 20 Jun 2006, 2:44:07 UTC - in response to Message 342868.  

ICC 9.1 supports Core 2/Duo natively and will accelerate things even faster.

I'm still salivating at the prospect of getting one - I've been meaning to upgrade my box but have kept delaying first for X2s and then for Conroe.

They also overclock like mad on default voltage and air cooling.

BTW, Crunch3rs optimized app isn't the quickest Linux optimized code anymore ;)

My code runs between 2x to 4x faster on the same host than the default client does. Same as Alex', by the way. It's incredible what he did, my MacMini is a credit monster now.



LOT's of Mac-fanatic forums have been speculating the last few weeks that the new PowerMac may be an Apple-exclusive 3GHZ Woodcrest. It would make perfect sense (which is exactly why the assumptions may be wrong in typical Apple counter-logic), since it would be the fullfillment of the 3GHZ promise, ala Intel instead of IBM, and a 1333 FSB would trump current 1.25GHZ. The marketing campaign would compare this event to the "2nd coming". A Quad model of this for $3500, and.....drool city!

Try to build a 4 core Woodcrest on a server board w/ similar features and other desktop multi-media hardware for same. New PowerMac may be well worth the wait : )
ID: 342871 · Report as offensive
EricVonDaniken

Send message
Joined: 17 Apr 04
Posts: 177
Credit: 67,881
RAC: 0
United States
Message 343278 - Posted: 20 Jun 2006, 12:34:36 UTC
Last modified: 20 Jun 2006, 12:44:59 UTC

To put things into proper perspective, a 4C 2.66 Intel 5150 box running SSE 2/3 optimized apps (SSE4 optimizations would require the Core2 specific compiler support and libraries Simon mentioned) should be

2-4x faster per core than standard because it is running the optimized app

4x faster per core than a Core Duo due to having 2x as many FP/SSE units, each 128b wide rather than 64b wide (1cycle FP per FPU rather than 2cycles)

2.66/2.16= 16/13= ~1.23x faster per core than a T2600 due to clock rate.

2x faster aggregate speed than a Core Duo since it it has 4C rather than 2C

=> 2*(16/13)*4= 9.85x faster aggregate than the current fastest shipping Yonah.
...running a =non= optimized app.

=> 19.69x - 39.38x faster than above running optimized apps.
Memory bandwidth limitations are probably going to hold the performance increase down to closer to 19x rather than 39x under most circumstances.

A 2.16GHz T2600 Yonah has a stable 24x7x365 s@h RAC in excess of 900...
ID: 343278 · Report as offensive
Hans Dorn
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 3 Apr 99
Posts: 2262
Credit: 26,448,570
RAC: 0
Germany
Message 343507 - Posted: 20 Jun 2006, 17:04:48 UTC - in response to Message 343475.  


Yeah Intel is still using a fsb, Instead of something like Hyper Transport. Just think without the bottleneck how fast It would be. Oh well, what a shame.


Intel's new server chipset (i5000p et al) has 2 separate FSB interfaces, one per CPU, running at 1333Mhz.

Memory is FB-DIMM with 4 individual 666MHz channels.

This is a very clean setup with no artificial limitations coming from the chipset.
The memory bandwith is higher than for a similar opteron system.


Regards Hans
ID: 343507 · Report as offensive
EricVonDaniken

Send message
Joined: 17 Apr 04
Posts: 177
Credit: 67,881
RAC: 0
United States
Message 343523 - Posted: 20 Jun 2006, 19:39:20 UTC - in response to Message 343507.  
Last modified: 20 Jun 2006, 19:42:26 UTC


Yeah Intel is still using a fsb, Instead of something like Hyper Transport. Just think without the bottleneck how fast It would be. Oh well, what a shame.


Intel's new server chipset (i5000p et al) has 2 separate FSB interfaces, one per CPU, running at 1333Mhz.

Memory is FB-DIMM with 4 individual 666MHz channels.

This is a very clean setup with no artificial limitations coming from the chipset.
The memory bandwith is higher than for a similar opteron system.

Regards Hans

Only In Theory. In Practice, even the the mighty 5160 can only do ~5GBps memory bandwidth while the Opteron's weigh in at ~6GBps.

What Woodcrest =does= seem to have is enormous small footprint bandwidth.
L1 cache bandwidth is ~300-375GBps (!) {in contrast, a T2600 has ~45GBps of L1 cache bandwidth} and Woodcrest systems seem to have more RAM bandwidth than current similar Opteron systems up to ~48MB footprints.

All this may change once the AM2 based and especially the K8L based Opteron systems are available, but traditionally Intel does very well in the smaller footprint arena.

ID: 343523 · Report as offensive
1 · 2 · Next

Message boards : Number crunching : Conroe in SETI? :)


 
©2025 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.