Astropulse FAQ

Message boards : Number crunching : Astropulse FAQ
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 7 · 8 · 9 · 10 · 11 · 12 · 13 . . . 14 · Next

AuthorMessage
Profile Bill Walker
Avatar

Send message
Joined: 4 Sep 99
Posts: 3868
Credit: 2,697,267
RAC: 0
Canada
Message 865040 - Posted: 13 Feb 2009, 14:41:25 UTC - in response to Message 865016.  


You can leave r103 active with the existing app_info.xml file if you want, and you will get an occasional resend from the unfinished v5.00 work: or you can switch to the stock v5.03 application.


Please forgive a simple question, from a simple person.

Can I leave my existing app_info.xml file, to continue using optimized MB, and somehow switch on stock V5.03 AP? Or do I need a new app_info? I need real easy step by step answers here ;).


ID: 865040 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 865044 - Posted: 13 Feb 2009, 14:49:05 UTC - in response to Message 865040.  

Strangely enough i've just doctored my app_info to see if i can get stock astropulse v5 work, as well as optimised AP 5.00 work and MB 6.03 work,
and i've just PM'd it to Richard, but i haven't managed to get any work yet,
further changes might be needed.

Claggy
ID: 865044 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14679
Credit: 200,643,578
RAC: 874
United Kingdom
Message 865046 - Posted: 13 Feb 2009, 14:51:27 UTC - in response to Message 865040.  


You can leave r103 active with the existing app_info.xml file if you want, and you will get an occasional resend from the unfinished v5.00 work: or you can switch to the stock v5.03 application.

Please forgive a simple question, from a simple person.

Can I leave my existing app_info.xml file, to continue using optimized MB, and somehow switch on stock V5.03 AP? Or do I need a new app_info? I need real easy step by step answers here ;).

The simple answer is 'No'.

You can crunch optimised MB and stock AP, but to do so you have to modify app_info.xml and download the application files yourself. It's what we did back in August 2008, when Astropulse was first released and there were no optimised versions available.
ID: 865046 · Report as offensive
Profile Bill Walker
Avatar

Send message
Joined: 4 Sep 99
Posts: 3868
Credit: 2,697,267
RAC: 0
Canada
Message 865048 - Posted: 13 Feb 2009, 14:56:27 UTC - in response to Message 865046.  

Thanks Richard. I think I'll leave things as they are, until the new optimized AP is available, along with a new app_info.

ID: 865048 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14679
Credit: 200,643,578
RAC: 874
United Kingdom
Message 865050 - Posted: 13 Feb 2009, 15:03:22 UTC - in response to Message 865044.  

Strangely enough i've just doctored my app_info to see if i can get stock astropulse v5 work, as well as optimised AP 5.00 work and MB 6.03 work,
and i've just PM'd it to Richard, but i haven't managed to get any work yet,
further changes might be needed.

Claggy

You have a PM. A couple of changes needed, one asap, please.
ID: 865050 · Report as offensive
Profile Debs
Volunteer tester
Avatar

Send message
Joined: 27 Feb 08
Posts: 20
Credit: 265,401
RAC: 0
Message 865341 - Posted: 14 Feb 2009, 11:58:20 UTC - in response to Message 863118.  

Incorrect, It is a Microsoft Visual Studio build, and No, Intel compiler doesn't do that unless you use certain dynamic dispatch features, which we don't.


Then the executable contains a lot of redundant data. I opened it as a plain text file and did a search on "Intel", and got a large number of strings such as:

GenuineIntel (ie the CPUID string that specifically identifies Intel CPUs)
Intel(R) Core(TM) Duo processors and compatible Intel processors with supplemental Streaming SIMD Extensions 3 (SSSE3) instruction support
Intel(R) Pentium(R) 4 and compatible Intel processors with Streaming SIMD Extensions 3 (SSE3) instruction support

and many more. There are no strings which mention AMD or any other CPU manufacturer. It seems strange that it would have strings which identify specific Intel CPUs if it is not specifically looking for them in preference to other CPU manufacturers.

Also, the SSE3 app on a C2D is finishing work in approx 8 hours or less, but the SSE app is taking in the region of 45-50 hours on an early Athlon 64. I'm going to run the SSE app on the C2D when there is enough work so I can compare it one two systems at similar speeds and see if I am right. I do not believe the difference between SSE and SSE3 would explain that high a difference between the running times (ie 15 credits per hours on an Athlon 64 at 2.4GHz and 100 credits/hour on a C2D at 2.66GHz).
ID: 865341 · Report as offensive
Profile jason_gee
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 24 Nov 06
Posts: 7489
Credit: 91,093,184
RAC: 0
Australia
Message 865345 - Posted: 14 Feb 2009, 12:18:41 UTC - in response to Message 865341.  
Last modified: 14 Feb 2009, 12:32:44 UTC

Incorrect, It is a Microsoft Visual Studio build, and No, Intel compiler doesn't do that unless you use certain dynamic dispatch features, which we don't.


Then the executable contains a lot of redundant data. I opened it as a plain text file and did a search on "Intel", and got a large number of strings such as:

GenuineIntel (ie the CPUID string that specifically identifies Intel CPUs)
Intel(R) Core(TM) Duo processors and compatible Intel processors with supplemental Streaming SIMD Extensions 3 (SSSE3) instruction support
Intel(R) Pentium(R) 4 and compatible Intel processors with Streaming SIMD Extensions 3 (SSE3) instruction support

and many more. There are no strings which mention AMD or any other CPU manufacturer. It seems strange that it would have strings which identify specific Intel CPUs if it is not specifically looking for them in preference to other CPU manufacturers.

Also, the SSE3 app on a C2D is finishing work in approx 8 hours or less, but the SSE app is taking in the region of 45-50 hours on an early Athlon 64. I'm going to run the SSE app on the C2D when there is enough work so I can compare it one two systems at similar speeds and see if I am right. I do not believe the difference between SSE and SSE3 would explain that high a difference between the running times (ie 15 credits per hours on an Athlon 64 at 2.4GHz and 100 credits/hour on a C2D at 2.66GHz).


You're comparing an Athlon64 to a core 2 duo? Please GET REAL. Try Phenom 2 and you'll have a more realistic comparison.

[Edit: Just a reminder to anyone who wants optimised code for any particular platform, the (original stock) source code is freely available, or our site has Windows ports oriented toward Intel and Generic Compatible CPUs, as always, as a starting point. It is not in our skill or resources base to optimise targets for every possible CPU out there.]
"Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions.
ID: 865345 · Report as offensive
Josef W. Segur
Volunteer developer
Volunteer tester

Send message
Joined: 30 Oct 99
Posts: 4504
Credit: 1,414,761
RAC: 0
United States
Message 865364 - Posted: 14 Feb 2009, 14:32:34 UTC - in response to Message 865341.  

Debs wrote:
Incorrect, It is a Microsoft Visual Studio build, and No, Intel compiler doesn't do that unless you use certain dynamic dispatch features, which we don't.


Then the executable contains a lot of redundant data. I opened it as a plain text file and did a search on "Intel", and got a large number of strings such as:

GenuineIntel (ie the CPUID string that specifically identifies Intel CPUs)
Intel(R) Core(TM) Duo processors and compatible Intel processors with supplemental Streaming SIMD Extensions 3 (SSSE3) instruction support
Intel(R) Pentium(R) 4 and compatible Intel processors with Streaming SIMD Extensions 3 (SSE3) instruction support

and many more. There are no strings which mention AMD or any other CPU manufacturer. It seems strange that it would have strings which identify specific Intel CPUs if it is not specifically looking for them in preference to other CPU manufacturers.
...

The FFTW static library was built with ICC, because the FFTW authors note that builds made with the Microsoft compiler are incorrect. Comparing the speed of using that static library with otherwise identical builds using the FFTW DLL (DevCPP/MinGW build) supplied by the project, the static ICC version outperformed on both Intel and AMD test systems. The application itself is built with Microsoft tools, the Intel strings must come from that library.
                                                              Joe
ID: 865364 · Report as offensive
Profile Debs
Volunteer tester
Avatar

Send message
Joined: 27 Feb 08
Posts: 20
Credit: 265,401
RAC: 0
Message 865367 - Posted: 14 Feb 2009, 14:36:32 UTC

Actually, my point is not whether a C2D is faster than an Athlon 64, that goes without saying (hence why I am going to run the SSE opt app on the C2D for one wu when they are available, in order to compare). My point is that the SSE app contains code which looks very much like it is looking for specific Intel CPUs but not for other specific manufacturer CPUs. As I have seen reports that indicate the Intel compiler includes such tests and uses generic code for non-Intel CPUs, I have suggested that maybe that is happening.

I did not aim to start an argument over whether it is right or wrong. I merely asked in my original post if there is a generic SSE2 app so my slower 64-bit systems can use it, as the SSE opti app seems to be running at about the same speed as the non-optimised app (if I ever manqage to download another ap wu on either of the relevant systems, I will confirm this, as have now removed the opt apps from those systems).
ID: 865367 · Report as offensive
Profile Debs
Volunteer tester
Avatar

Send message
Joined: 27 Feb 08
Posts: 20
Credit: 265,401
RAC: 0
Message 865372 - Posted: 14 Feb 2009, 14:45:03 UTC - in response to Message 865364.  

The FFTW static library was built with ICC, because the FFTW authors note that builds made with the Microsoft compiler are incorrect. Comparing the speed of using that static library with otherwise identical builds using the FFTW DLL (DevCPP/MinGW build) supplied by the project, the static ICC version outperformed on both Intel and AMD test systems. The application itself is built with Microsoft tools, the Intel strings must come from that library.
                                                              Joe


Thanks Joe. That was the most helpful answer I have read.

I will still run tests when ap is back online (probably wait for the new version though) so I can be sure what is ha[ppening between systems and versions :)

Are there any plans for a generic x64 version (I use 64-bit Windows), or is that too much extra work?
ID: 865372 · Report as offensive
Profile jason_gee
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 24 Nov 06
Posts: 7489
Credit: 91,093,184
RAC: 0
Australia
Message 865374 - Posted: 14 Feb 2009, 14:45:52 UTC
Last modified: 14 Feb 2009, 14:48:56 UTC

Ah, thanks for clarifying your position. I can state categorically that the AstroPulse optimised binaries do not use the dynamic CPU dispatch mechanisms offered by Intel compiler.

If you have stock astropulse times, against optimised times that are even roughly equal, I would be very interested to see the stderr output, as nearly all optimisations to date have been generally applicable and boosted performance several fold over stock.

Please note that if you mean AKv8 multibeam binaries then the story is completely different, and we can make more suitable recommendations if you need SSE AMD targetted builds for that.

Jason

[Edit: No immediate plans for x64 specific builds, though they will come when more generally applicable optimisations have been made.]
"Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions.
ID: 865374 · Report as offensive
Profile Debs
Volunteer tester
Avatar

Send message
Joined: 27 Feb 08
Posts: 20
Credit: 265,401
RAC: 0
Message 865379 - Posted: 14 Feb 2009, 15:01:57 UTC - in response to Message 865374.  

Ah, thanks for clarifying your position. I can state categorically that the AstroPulse optimised binaries do not use the dynamic CPU dispatch mechanisms offered by Intel compiler.

If you have stock astropulse times, against optimised times that are even roughly equal, I would be very interested to see the stderr output, as nearly all optimisations to date have been generally applicable and boosted performance several fold.

Please note that if you mean AKv8 multibeam binaries then the story is completely different, and we can make more suitable recommendations if you need SSE AMD targetted builds for that.

Jason


I have used both KWSN_2.4V_SSE2_MB.exe and ap_5.00r103_SSE.exe on my Athlon 64 3200+ (socket 754, Newcastle core, SSE2 only) running XP64. I only ran ap_5.00r103_SSE.exe on the clawhammer socket 754 3600+. In all cases, I can only get about 15 credits/hour, compared to the stock app at cosmology@home which gives around 20/hour. I expected the optimised apps here to give more than stock apps elsewhere (although I am also aware that all projects have different standards). I have checked on boincstats.com, and see that typically cosmology gives approx 1.33 * the credits of seti, but I am sure that is compared to all seti users, not just the optimised apps? On the C2D seti optimised pays about 2.5 times what cosmology pays.

Would I be better using different build on the two AMD64 systems?
ID: 865379 · Report as offensive
Profile jason_gee
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 24 Nov 06
Posts: 7489
Credit: 91,093,184
RAC: 0
Australia
Message 865383 - Posted: 14 Feb 2009, 15:21:15 UTC - in response to Message 865379.  
Last modified: 14 Feb 2009, 15:31:57 UTC

Would I be better using different build on the two AMD64 systems?
Ahh perhaps. For Multibeam you should try (if you haven't) using the 2.4V FFTW build of Crunch3r's, though I don't seem to have a link handy, and am not sure if it is listed on Crunch3r's page anywhere. (Hopefully someone can provide one). I am not certain if he has made a 64 bit variant or not, I think there was some issues with it, but wasn't paying attention as I was busy elsewhere. [EDIT: Sudden thought, Did you try our 32 bit SSE2 AKv8 Build? *slaps own head*]

For AstroPulse, the new release stock 5.03 [BTW: not compatible with our r103] takes even longer, and the performance difference to test optimised builds is even greater (Currently subject to favourable validation test outcomes), and the credit rate is raised to more appropriate levels ~1290 credits / WU . The SSE optimised build should perform roughly 2-3+x stock on minimally blanked tasks.

I did some cosmology work a long time ago, and the credit rate was significantly higher than on S@H at the time (Much more than 33% more). I don't know the status quo now, as I dropped off when they started getting many server problems, But my guess is that since no optimised application is available, the credit rate would correspondingly be high enough to better reward older machines, as here the credit multiplier is scaling against the 'Median Machine' (Whatever that is ;D)

Jason
"Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions.
ID: 865383 · Report as offensive
Profile Debs
Volunteer tester
Avatar

Send message
Joined: 27 Feb 08
Posts: 20
Credit: 265,401
RAC: 0
Message 865399 - Posted: 14 Feb 2009, 16:23:59 UTC - in response to Message 865383.  

[EDIT: Sudden thought, Did you try our 32 bit SSE2 AKv8 Build? *slaps own head*]


I hadn't thought about using the 32-bit SSE2, just assumed the older 64-bit SSE2 would be faster. I will try that at some point.

For AstroPulse, the new release stock 5.03 [BTW: not compatible with our r103] takes even longer, and the performance difference to test optimised builds is even greater (Currently subject to favourable validation test outcomes), and the credit rate is raised to more appropriate levels ~1290 credits / WU . The SSE optimised build should perform roughly 2-3+x stock on minimally blanked tasks.


I've been waiting to see if one of the AMD64 machines will download a stock astropulse app (any of them!) but no joy for now. Hopefully once the optimised app is ready there will also be work for ap :)
ID: 865399 · Report as offensive
Profile jason_gee
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 24 Nov 06
Posts: 7489
Credit: 91,093,184
RAC: 0
Australia
Message 865400 - Posted: 14 Feb 2009, 16:39:43 UTC - in response to Message 865399.  
Last modified: 14 Feb 2009, 16:41:12 UTC

I've been waiting to see if one of the AMD64 machines will download a stock astropulse app (any of them!) but no joy for now. Hopefully once the optimised app is ready there will also be work for ap :)


Yep, in testing we're in the same boat. Though we have completed a few tasks, and are reasonably happy with the indicated performance (at least for initial release), we have no indication of a 5.03 validation yet against stock wingmen or direct time comparisons. Not surprising I suppose given the long processing time and slow rate of issue (my p4 hasn't managed to pick up one at all yet).

In the meantime, It would be helpful if anyone who does manage to get, and process, a stock 5.03 AstroPulse task, could post their machine/task link to indicate some kind of baseline, as we're seeing some WU processing time variability that caught us by surprise [Though we're fairly certain it's the amount of client side blanking that causes it].

Jason
"Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions.
ID: 865400 · Report as offensive
Profile Death2Gnomes
Volunteer tester
Avatar

Send message
Joined: 30 Nov 00
Posts: 48
Credit: 246,481
RAC: 0
United States
Message 865475 - Posted: 14 Feb 2009, 20:57:25 UTC

ok where do I find the stock AP apps to manual install?
"Red Warrior Needs Food Badly"
ID: 865475 · Report as offensive
Fred W
Volunteer tester

Send message
Joined: 13 Jun 99
Posts: 2524
Credit: 11,954,210
RAC: 0
United Kingdom
Message 865478 - Posted: 14 Feb 2009, 21:05:41 UTC - in response to Message 865475.  

ok where do I find the stock AP apps to manual install?

If you have Astropulse selected in your web preferences, then the app will be downloaded automatically when you are assigned one of the new WU's. It's all auto-magic.

F.
ID: 865478 · Report as offensive
Profile Death2Gnomes
Volunteer tester
Avatar

Send message
Joined: 30 Nov 00
Posts: 48
Credit: 246,481
RAC: 0
United States
Message 865479 - Posted: 14 Feb 2009, 21:09:01 UTC
Last modified: 14 Feb 2009, 21:10:59 UTC

Uhhh, ya. I am going from optimized r103 back to stock ...
"Red Warrior Needs Food Badly"
ID: 865479 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 865482 - Posted: 14 Feb 2009, 21:18:52 UTC - in response to Message 865478.  

ok where do I find the stock AP apps to manual install?

If you have Astropulse selected in your web preferences, then the app will be downloaded automatically when you are assigned one of the new WU's. It's all auto-magic.

F.

Since he's using optimised apps with an app_info it won't,
he needs to download the apps manually and add them to his app_info.

When i get home in about 45mins, I'll post the links & my app_info unless someone beats me.

Claggy
ID: 865482 · Report as offensive
Fred W
Volunteer tester

Send message
Joined: 13 Jun 99
Posts: 2524
Credit: 11,954,210
RAC: 0
United Kingdom
Message 865485 - Posted: 14 Feb 2009, 21:24:10 UTC - in response to Message 865482.  

ok where do I find the stock AP apps to manual install?

If you have Astropulse selected in your web preferences, then the app will be downloaded automatically when you are assigned one of the new WU's. It's all auto-magic.

F.

Since he's using optimised apps with an app_info it won't,
he needs to download the apps manually and add them to his app_info.

When i get home in about 45mins, I'll post the links & my app_info unless someone beats me.

Claggy

Oops. Sorry, insufficient investigation. I'm sure Claggy will get back with a location.

F.
ID: 865485 · Report as offensive
Previous · 1 . . . 7 · 8 · 9 · 10 · 11 · 12 · 13 . . . 14 · Next

Message boards : Number crunching : Astropulse FAQ


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.