Lunatics Windows Installer v0.39 release notes

Message boards : Number crunching : Lunatics Windows Installer v0.39 release notes
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 4 · 5 · 6 · 7 · 8 · 9 · 10 . . . 12 · Next

AuthorMessage
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14655
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1176307 - Posted: 7 Dec 2011, 16:55:46 UTC - in response to Message 1176306.  

What one possibly can say here it seems, is that the SSSE3x build may be faster on true multicore CPU's, like the Core2 range, while the SSE3 is faster on HT enabled CPU's like my i3, and the ATOM.

Well, yes and no.

That's exactly the effect I saw on my i5 - somewhat to everyone's surprise - so it's good to have it confirmed.

But I'm pretty sure it isn't anything directly related to the number of cores, or hyperthreading as such. I'd bet (subject to checking by the real hardware gurus) that it's more to do with the internal micro-architecture of the various generations of Intel CPUs.
ID: 1176307 · Report as offensive
Profile jason_gee
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 24 Nov 06
Posts: 7489
Credit: 91,093,184
RAC: 0
Australia
Message 1176323 - Posted: 7 Dec 2011, 18:36:52 UTC - in response to Message 1176311.  
Last modified: 7 Dec 2011, 18:39:54 UTC

What one possibly can say here it seems, is that the SSSE3x build may be faster on true multicore CPU's, like the Core2 range, while the SSE3 is faster on HT enabled CPU's like my i3, and the ATOM.

Well, yes and no.

That's exactly the effect I saw on my i5 - somewhat to everyone's surprise - so it's good to have it confirmed.

But I'm pretty sure it isn't anything directly related to the number of cores, or hyperthreading as such. I'd bet (subject to checking by the real hardware gurus) that it's more to do with the internal micro-architecture of the various generations of Intel CPUs.


Well, I was just speculating. It would be interesting to know the real reason why some CPU's although they are capable of the SSSE3x, runs MB faster with SSE3. Maybe in time some hardware gury will answer that.


The reasoning put forward by Joe Segur involved that much of Alex's Core2 (SSSE3 & up in our builds) optimisation was there related to handling misaligned memory accesses. A big problem traditionally when dealing with large amounts of data at strange strides. Since the later architectures have more sheer transistors dedicated to handling those weird alignments of data, the extra code really becomes extra unneeded overhead. I agree with that so far.

I further proposed that having L3 cache also closer resembles later Pentium 4 (SSE3) architectures, so the compiler optimisations for those may be dealing with memory more appropriately as well.

As far as I'm aware, neither of us have done sufficient testing to verify those completely, but this test helps push further along those directions. Understanding what the architecture does well comes in handy, and more efficient memory handling is going to be a very useful thing to have confirmed.

Jason
"Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions.
ID: 1176323 · Report as offensive
MikeN

Send message
Joined: 24 Jan 11
Posts: 319
Credit: 64,719,409
RAC: 85
United Kingdom
Message 1176326 - Posted: 7 Dec 2011, 19:39:01 UTC

I have just installed lunatics v0.39 on my four PCs http://setiathome.berkeley.edu/hosts_user.php however I am not sure that I have used the optimal apps. In the past I simply looked for the most advanced feature that the CPU would support (SSE4 or SSSE3) and used that. However, following the advice that the SSE4 app only works best for fast processors and that SSE3 is better than SSSE3 for Intel i3-7, this time I used the SSE3 app for my Intel(R) Xeon(R) CPU X3430 @ 2.40GHz and SSSE3 for the other three machines (Intel(R) Core(TM)2 Duo CPU E4500 @ 2.20GHz; Intel(R) Core(TM)2 Duo CPU E7500 @ 2.93GHz and Pentium(R) Dual-Core CPU T4200 @ 2.00GHz). None of my processors support hyperthreading. Is this correct?
ID: 1176326 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14655
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1176338 - Posted: 7 Dec 2011, 20:43:37 UTC - in response to Message 1176326.  
Last modified: 7 Dec 2011, 20:47:36 UTC

I have just installed lunatics v0.39 on my four PCs http://setiathome.berkeley.edu/hosts_user.php however I am not sure that I have used the optimal apps. In the past I simply looked for the most advanced feature that the CPU would support (SSE4 or SSSE3) and used that. However, following the advice that the SSE4 app only works best for fast processors and that SSE3 is better than SSSE3 for Intel i3-7, this time I used the SSE3 app for my Intel(R) Xeon(R) CPU X3430 @ 2.40GHz and SSSE3 for the other three machines (Intel(R) Core(TM)2 Duo CPU E4500 @ 2.20GHz; Intel(R) Core(TM)2 Duo CPU E7500 @ 2.93GHz and Pentium(R) Dual-Core CPU T4200 @ 2.00GHz). None of my processors support hyperthreading. Is this correct?

It's a long time since I looked at the List of Intel Xeon microprocessors - boy, are there a lot of them!

Your X3430 looks like a 45nm Lynnfield with SSE4.2 and TurboBoost - so yes, that sounds like the same class of beastie as an i3. I'd give it a try with the SSE3, but as you may have gathered, this isn't an exact science yet. If you feel like it, give it a comparison run with SSSE3x - that's likely to be the only other good one, SSE4 certainly wasn't any benefit on my E5320 Clovertowns.

Edit - if anybody's keeping notes, the processor I noticed the SSE3 app running fast on is a 32nm Arrandale i5 - that's a mobile version of the first-generation i5 series, before they added AVX.
ID: 1176338 · Report as offensive
Dave

Send message
Joined: 29 Mar 02
Posts: 778
Credit: 25,001,396
RAC: 0
United Kingdom
Message 1176355 - Posted: 7 Dec 2011, 21:56:03 UTC

The 486 was simpler...
ID: 1176355 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14655
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1176357 - Posted: 7 Dec 2011, 22:00:41 UTC - in response to Message 1176355.  

The 486 was simpler...

I rather liked the Z80 myself ;-)
ID: 1176357 · Report as offensive
Grutte Pier [Wa Oars]~MAB The Frisian
Avatar

Send message
Joined: 14 Nov 10
Posts: 18
Credit: 2,020,802
RAC: 0
Netherlands
Message 1176359 - Posted: 7 Dec 2011, 22:13:44 UTC

My first computer had a 6502A
ID: 1176359 · Report as offensive
bill

Send message
Joined: 16 Jun 99
Posts: 861
Credit: 29,352,955
RAC: 0
United States
Message 1176364 - Posted: 7 Dec 2011, 22:45:04 UTC

Fingers, with a later upgrade to toes as an
auxiliary processor.
ID: 1176364 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 65841
Credit: 55,293,173
RAC: 49
United States
Message 1176395 - Posted: 8 Dec 2011, 4:04:36 UTC - in response to Message 1176359.  

ID: 1176395 · Report as offensive
EdwardPF
Volunteer tester

Send message
Joined: 26 Jul 99
Posts: 389
Credit: 236,772,605
RAC: 374
United States
Message 1176408 - Posted: 8 Dec 2011, 5:17:24 UTC
Last modified: 8 Dec 2011, 5:18:18 UTC

This may be off thread (and may make no sense) but ...

With all the talk of sse3, ssse3, and sse4.1 (not to mention MMX etc) ...

Proposition:

On an Intel hyperthreading processor, wouldn't it make most sense to run the most architecturally divergent processes on the same CPU? I.E. something like - sse3 assigned affinity to even "cores" and sse4.1 to odd "cores" so that the chip can use idle registers (or whatever) on both halves of the chip??

I would have no idea what this would mean in real life but it seems like a good idea ...

Hyperthreading the same code on both sides of a chip should only provide an advantage based on timing of instruction arrivals but no advantage in utilizing idle hardware .

or am I all wet ??

Ed F
ID: 1176408 · Report as offensive
LadyL
Volunteer tester
Avatar

Send message
Joined: 14 Sep 11
Posts: 1679
Credit: 5,230,097
RAC: 0
Message 1177990 - Posted: 14 Dec 2011, 9:03:05 UTC

Thank you Sten-Arne. Your exhaustive test is really appreaciated.
ID: 1177990 · Report as offensive
aad

Send message
Joined: 3 Apr 99
Posts: 101
Credit: 204,131,099
RAC: 26
Netherlands
Message 1179150 - Posted: 18 Dec 2011, 23:13:09 UTC

Anyone tried the 11.12 drivers with 0.39 and ATI 6xxx card?

Or the 12.1 preview drivers?
http://support.amd.com/us/kbarticles/Pages/Catalyst121Previewdriver.aspx
ID: 1179150 · Report as offensive
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34272
Credit: 79,922,639
RAC: 80
Germany
Message 1179206 - Posted: 19 Dec 2011, 8:25:56 UTC
Last modified: 19 Dec 2011, 8:26:38 UTC

Yes, both drivers are safe to use with both 5xxx and 6xxx cards.

The 100% CPU bug is now fixed.


With each crime and every kindness we birth our future.
ID: 1179206 · Report as offensive
aad

Send message
Joined: 3 Apr 99
Posts: 101
Credit: 204,131,099
RAC: 26
Netherlands
Message 1179583 - Posted: 20 Dec 2011, 17:26:01 UTC - in response to Message 1179206.  

Yes, both drivers are safe to use with both 5xxx and 6xxx cards.

The 100% CPU bug is now fixed.


Thanks Mike!
Guess i'll try the 12.1 preview with my HD6970.

Can anybody drop me a link for the prog that cleans any driver-left-overs?
Had a fresh install and can't remember the name/url.
ID: 1179583 · Report as offensive
Profile skildude
Avatar

Send message
Joined: 4 Oct 00
Posts: 9541
Credit: 50,759,529
RAC: 60
Yemen
Message 1179585 - Posted: 20 Dec 2011, 17:27:43 UTC - in response to Message 1179583.  

http://www.guru3d.com/category/driversweeper/
there you go. I'll give the 11.12 a try as well


In a rich man's house there is no place to spit but his face.
Diogenes Of Sinope
ID: 1179585 · Report as offensive
Profile ivan
Volunteer tester
Avatar

Send message
Joined: 5 Mar 01
Posts: 783
Credit: 348,560,338
RAC: 223
United Kingdom
Message 1179610 - Posted: 20 Dec 2011, 23:25:46 UTC - in response to Message 1176359.  

My first computer had a 6502A

National Semiconductor SC/MP (we eventually used the SC/MP-2 NMOS version) -- helped build a 4-processor multiprocessor system with shared-memory communication to run a computer-controlled Fabry-Perot etalon and its DAS, in 1978, and installed it in Antarctica in 1980. In the meantime I built a Philips 2650 kit published in Electronics Australia -- I never did build their PDP-8 clone based on TTL SSL chips despite working with a PDP-8/e for my PhD.
ID: 1179610 · Report as offensive
Profile BilBg
Volunteer tester
Avatar

Send message
Joined: 27 May 07
Posts: 3720
Credit: 9,385,827
RAC: 0
Bulgaria
Message 1179927 - Posted: 22 Dec 2011, 7:02:50 UTC - in response to Message 1179585.  
Last modified: 22 Dec 2011, 7:11:03 UTC

http://www.guru3d.com/category/driversweeper/
there you go. I'll give the 11.12 a try as well

Guru3D no longer supplies Driver Sweeper

"
Publisher: Guru3D.com / Phyxion (FnF)

Attention - Guru3D Driver Sweeper has been discontinued as such you will not find any download links here any longer.
"
http://downloads.guru3d.com/Guru3D---Driver-Sweeper-(Setup)_d1655.html


But the new version of Driver Sweeper 3.2.0 is here:
http://phyxion.net/item/driver-sweeper.html


I found the link here (Nvidia Forceware Tweak Guide):
http://www.tweakguides.com/NVFORCE_3.html

Similar Guide for ATI cards (somehow old - 2009) (ATI Catalyst Tweak Guide):
http://www.tweakguides.com/ATICAT_1.html


 


- ALF - "Find out what you don't do well ..... then don't do it!" :)
 
ID: 1179927 · Report as offensive
Profile RottenMutt
Avatar

Send message
Joined: 15 Mar 01
Posts: 1011
Credit: 230,314,058
RAC: 0
United States
Message 1180515 - Posted: 24 Dec 2011, 21:28:44 UTC

with V0.39
is it faster to run two work units on a gtx 580?
what about on a gtx 580 two or three???
ID: 1180515 · Report as offensive
Profile SciManStev Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Jun 99
Posts: 6653
Credit: 121,090,076
RAC: 0
United States
Message 1180518 - Posted: 24 Dec 2011, 21:33:46 UTC - in response to Message 1180515.  
Last modified: 24 Dec 2011, 22:12:07 UTC

with V0.39
is it faster to run two work units on a gtx 580?
what about on a gtx 580 two or three???

The basic rule, is to increase the number of wu until you get to 97% or higher GPU usage. This is assuming your GPU's have the memory to run multiple tasks. Every rig is a bit different, but try adding one at a time until you get to 97% or better usage. Beyond that you run into the law of diminishing returns.

Steve
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website
ID: 1180518 · Report as offensive
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34272
Credit: 79,922,639
RAC: 80
Germany
Message 1180523 - Posted: 24 Dec 2011, 21:46:29 UTC

You can easily run 3 with a 580.
Maybe 4.

Check what Steve said of course.



With each crime and every kindness we birth our future.
ID: 1180523 · Report as offensive
Previous · 1 . . . 4 · 5 · 6 · 7 · 8 · 9 · 10 . . . 12 · Next

Message boards : Number crunching : Lunatics Windows Installer v0.39 release notes


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.