Why need 5 different stock AMD OpenCL GPU applications?

Message boards : Number crunching : Why need 5 different stock AMD OpenCL GPU applications?
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · Next

AuthorMessage
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 1791277 - Posted: 28 May 2016, 14:32:14 UTC - in response to Message 1790683.  

Richard Haselgrove wrote:
Double-check whether all of the applications you are referring to by plan_class names are actually different binary executables. I think they probably are, this time round, but it's as well to be sure. In earlier incarnations, sometimes the same app appeared in different clothes - e.g. so that an OpenCL app could present and be run as a straight ATI/CAL app for the benefit of a pre-OpenCL client.

Let's see...
SAH is not used on APUs.
CAT132 is not used on APUs.
NOCAL is not used on APUs (although this is strange, it includes _ATI_, but as far as I know there are no pre-ATI5 GPUs that don't have CAL support.)
SOG is not used on _ATI_

According to the Applications page we have the choice of: opencl_ati5_cat132, opencl_ati5_nocal, opencl_ati5_sah, opencl_ati5_SoG, opencl_ati5_SoG_cat132, opencl_ati5_SoG_nocal, opencl_atiapu_sah, opencl_atiapu_SoG, opencl_ati_cat132, opencl_ati_nocal, opencl_ati_sah
Thus far I have seen:
setiathome_8.12_windows_intelx86__opencl_ati_sah.exe
setiathome_8.12_windows_intelx86__opencl_ati5_sah.exe
setiathome_8.12_windows_intelx86__opencl_ati5_SoG.exe
setiathome_8.12_windows_intelx86__opencl_atiapu_sah.exe
setiathome_8.12_windows_intelx86__opencl_atiapu_SoG.exe

What I see is that opencl_ati5_cat132 is run with setiathome_8.12_windows_intelx86__opencl_ati5_sah.exe and opencl_ati5_SoG_cat132 with setiathome_8.12_windows_intelx86__opencl_ati5_SoG.exe

Richard Haselgrove wrote:
I think that the best current answer is more along the lines of "no-one can predict which application will perform best on a given host, given the variety of hardware platforms (GPU + host CPU/motherboard) run by volunteers here - which greatly exceeds the number of platforms run by testers". So Eric lets BOINC work it out by itself.

Then how does a project like Primegrid manage to run 24 different kinds of prime calculations with all just one application per sort of hardware? They must be aces in being able to predict how their applications work on the variety of different sorts of hardware. Or maybe they're just not thinking too difficult about it, and just have the one application fits all methodology. :)

In that sense it really doesn't matter what the science is that's being done, whether it's to find the first tooth paste commercial of the aliens nearest to us, or to find a prime with more than 1,024,921 digits.
ID: 1791277 · Report as offensive
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 1791278 - Posted: 28 May 2016, 14:40:14 UTC - in response to Message 1791272.  

msattler wrote:
Maybe the diversity of hardware requires more apps than some would realize.

Hardware is hardware is hardware. There's a big difference in hardware between a Pentium 4 CPU, an early AMD Athlon and the newest Intels and AMDs. Yet they all run one and the same v8 application. There isn't even a 64bit version of the application, just one plain 32bit version.

As long as it all works, this is a rather stupid question in the first place.

"There are naive questions, tedious questions, ill-phrased questions, questions put after inadequate self-criticism. But every question is a cry to understand the world. There is no such thing as a dumb question". ~ Carl Sagan, The Demon-Haunted World: Science as a Candle in the Dark.
ID: 1791278 · Report as offensive
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34255
Credit: 79,922,639
RAC: 80
Germany
Message 1791280 - Posted: 28 May 2016, 14:44:07 UTC
Last modified: 28 May 2016, 14:44:58 UTC

Then how does a project like Primegrid manage to run 24 different kinds of prime calculations with all just one application per sort of hardware? They must be aces in being able to predict how their applications work on the variety of different sorts of hardware. Or maybe they're just not thinking too difficult about it, and just have the one application fits all methodology. :)

In that sense it really doesn't matter what the science is that's being done, whether it's to find the first tooth paste commercial of the aliens nearest to us, or to find a prime with more than 1,024,921 digits.
____________


That`s not so easy to explain.

It would run on one application but seti members want more.
Especially speed.
So you can blame Raistmer for trying to make most volunteers happy.
Maybe i am a bad tester, who knows.

But to make it clear only Raistmer or myself can answer this.


With each crime and every kindness we birth our future.
ID: 1791280 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1791281 - Posted: 28 May 2016, 14:46:09 UTC - in response to Message 1791278.  


"There are naive questions, tedious questions, ill-phrased questions, questions put after inadequate self-criticism. But every question is a cry to understand the world. There is no such thing as a dumb question". ~ Carl Sagan, The Demon-Haunted World: Science as a Candle in the Dark.

Given....the only stupid question is the one not asked.
I stand down.

Meow.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1791281 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 1791296 - Posted: 28 May 2016, 15:26:09 UTC - in response to Message 1791261.  


Mere users don't really need to dive into the highly technical BOINC documentation to get an answer to their question.

Depends on questions and details they would like to have.Of course if answer is "because" no any details needed :)
ID: 1791296 · Report as offensive
OzzFan Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Apr 02
Posts: 15691
Credit: 84,761,841
RAC: 28
United States
Message 1791318 - Posted: 28 May 2016, 16:29:09 UTC - in response to Message 1791280.  

It would run on one application but seti members want more.
Especially speed.


I think Jord's point is that the stock app should be universal without speed optimizations with the focus on simply just working stably. Then the Lunatics pack should contain all the speed improvements and various applications required to achieve the most speed possible.

I'm not so sure that all SETI members want more, though I'm sure many do. There are some people out there that simply want to contribute and are happier with stability and simplicity over speed. Those that want speed can follow the same advise we've given for the past 10 years: install optimized applications.
ID: 1791318 · Report as offensive
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34255
Credit: 79,922,639
RAC: 80
Germany
Message 1791324 - Posted: 28 May 2016, 16:47:25 UTC - in response to Message 1791318.  
Last modified: 28 May 2016, 16:47:57 UTC

It would run on one application but seti members want more.
Especially speed.


I think Jord's point is that the stock app should be universal without speed optimizations with the focus on simply just working stably. Then the Lunatics pack should contain all the speed improvements and various applications required to achieve the most speed possible.

I'm not so sure that all SETI members want more, though I'm sure many do. There are some people out there that simply want to contribute and are happier with stability and simplicity over speed. Those that want speed can follow the same advise we've given for the past 10 years: install optimized applications.


I totally understand this point of view but that`s Raistmer`s and Eric`s decission and i support it.

We have a similar situation for cuda builds BTW.
cuda 23
cuda 32
cuda 42
cuda 50 as stock apps.

So far i know cuda 23 would also run on all devices but the servers decide which app is best for which host.


With each crime and every kindness we birth our future.
ID: 1791324 · Report as offensive
Profile jason_gee
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 24 Nov 06
Posts: 7489
Credit: 91,093,184
RAC: 0
Australia
Message 1791327 - Posted: 28 May 2016, 16:57:27 UTC - in response to Message 1791324.  


I totally understand this point of view but that`s Raistmer`s and Eric`s decission and i support it.

We have a similar situation for cuda builds BTW.
cuda 23
cuda 32
cuda 42
cuda 50 as stock apps.

So far i know cuda 23 would also run on all devices but the servers decide which app is best for which host.


Think I covered the CudaMB for comparison in a previous post in this thread. There are *some* complications, particularly with Cuda23, because back then Cuda binaries did not embed forward compatible PTX. The project not wanting to cut off working crunchers is another aspect, then throw in that Cuda3.2 breaks on Maxwell...

You get the gist, similar situations (just a different angle)

Eventually the Cuda applications (IIRC started before OpenCL was a thing) will end up fully heterogeneous, though priorities for now are certainly on minimising breakage, accumulating fast code and new techniques.

It's an easy thing to say 'needs to be simpler', and IMO quite right, though at the same time crafting such 'simple' applications has a lot of devils in the details. Something I'm sure Raistmer and Primegrid Developers know too well.
"Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions.
ID: 1791327 · Report as offensive
OzzFan Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Apr 02
Posts: 15691
Credit: 84,761,841
RAC: 28
United States
Message 1791328 - Posted: 28 May 2016, 17:05:28 UTC - in response to Message 1791324.  

I totally understand this point of view but that`s Raistmer`s and Eric`s decission and i support it.


Sure, but minds can change, can't they? It usually starts with open and honest discussion.

We have a similar situation for cuda builds BTW.
cuda 23
cuda 32
cuda 42
cuda 50 as stock apps.

So far i know cuda 23 would also run on all devices but the servers decide which app is best for which host.


I'm not convinced that a valid argument is essentially "well the Green Team's applications are mess too!" It would almost seem that those offering such an argument are taking it personally and so are pointing fingers elsewhere.


I don't think the intent is to make this personal for the ATi/AMD developers and testers, but rather to make development, testing, and supporting easier. Jord mentioned that the OpenCL standard was supposed to simplify our lives with a single unified application, yet it seems everyone here at SETI (Green Team and Red Team) have chosen the opposite path - all with the mindset of getting the most optimized speed available.

I think the question really comes down to: Can we reconsider our path and can we simply this process?
ID: 1791328 · Report as offensive
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34255
Credit: 79,922,639
RAC: 80
Germany
Message 1791329 - Posted: 28 May 2016, 17:05:57 UTC - in response to Message 1791327.  


I totally understand this point of view but that`s Raistmer`s and Eric`s decission and i support it.

We have a similar situation for cuda builds BTW.
cuda 23
cuda 32
cuda 42
cuda 50 as stock apps.

So far i know cuda 23 would also run on all devices but the servers decide which app is best for which host.


Think I covered the CudaMB for comparison in a previous post in this thread. There are *some* complications, particularly with Cuda23, because back then Cuda binaries did not embed forward compatible PTX. The project not wanting to cut off working crunchers is another aspect, then throw in that Cuda3.2 breaks on Maxwell...

You get the gist, similar situations (just a different angle)

Eventually the Cuda applications (IIRC started before OpenCL was a thing) will end up fully heterogeneous, though priorities for now are certainly on minimising breakage, accumulating fast code and new techniques.

It's an easy thing to say 'needs to be simpler', and IMO quite right, though at the same time crafting such 'simple' applications has a lot of devils in the details. Something I'm sure Raistmer and Primegrid Developers know too well.


I know Jason.

I just wanted to keep it as simple as possible.
Its more complicated with OpenCL apps too.


With each crime and every kindness we birth our future.
ID: 1791329 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 1791331 - Posted: 28 May 2016, 17:11:24 UTC - in response to Message 1791318.  

It would run on one application but seti members want more.
Especially speed.


I think Jord's point is that the stock app should be universal without speed optimizations with the focus on simply just working stably. Then the Lunatics pack should contain all the speed improvements and various applications required to achieve the most speed possible.

I'm not so sure that all SETI members want more, though I'm sure many do. There are some people out there that simply want to contribute and are happier with stability and simplicity over speed. Those that want speed can follow the same advise we've given for the past 10 years: install optimized applications.

Users who want simplicity just attach to project and go to own tasks and deeds. No matters 1 app, 10 app or few GB ow initial download as some projects have.
It's too curious ones who wonder. And then, when they got full descriptive answer they wonder why so complex??? Because!
ID: 1791331 · Report as offensive
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34255
Credit: 79,922,639
RAC: 80
Germany
Message 1791332 - Posted: 28 May 2016, 17:11:51 UTC - in response to Message 1791328.  

I totally understand this point of view but that`s Raistmer`s and Eric`s decission and i support it.


Sure, but minds can change, can't they? It usually starts with open and honest discussion.

We have a similar situation for cuda builds BTW.
cuda 23
cuda 32
cuda 42
cuda 50 as stock apps.

So far i know cuda 23 would also run on all devices but the servers decide which app is best for which host.


I'm not convinced that a valid argument is essentially "well the Green Team's applications are mess too!" It would almost seem that those offering such an argument are taking it personally and so are pointing fingers elsewhere.


I don't think the intent is to make this personal for the ATi/AMD developers and testers, but rather to make development, testing, and supporting easier. Jord mentioned that the OpenCL standard was supposed to simplify our lives with a single unified application, yet it seems everyone here at SETI (Green Team and Red Team) have chosen the opposite path - all with the mindset of getting the most optimized speed available.

I think the question really comes down to: Can we reconsider our path and can we simply this process?


Under current cicumstances the answer is no so far i can tell.

We try to keep it as simple as possible.
OTOH the users don`t have to do anything except letting the apps run.
So what`s the real problem in this case ?


With each crime and every kindness we birth our future.
ID: 1791332 · Report as offensive
Profile jason_gee
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 24 Nov 06
Posts: 7489
Credit: 91,093,184
RAC: 0
Australia
Message 1791333 - Posted: 28 May 2016, 17:15:08 UTC - in response to Message 1791332.  

So what`s the real problem in this case ?


Quoting myself, this:
...None of the applications barring stock CPU have very sophisticated dispatch mechanisms as yet. That makes for some pretty costly maintenance of number of builds on each platform.

In the Cuda case, which for now still suffers this build creep, I *try* to limit to builds that make sense, knowing that number of builds costs focus, project resources, confusion, and introduces more points of failure, While in the Cuda case ongoing decreasing performance with newer Cuda versions and baseline code.

"Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions.
ID: 1791333 · Report as offensive
OzzFan Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Apr 02
Posts: 15691
Credit: 84,761,841
RAC: 28
United States
Message 1791334 - Posted: 28 May 2016, 17:15:29 UTC - in response to Message 1791327.  

It's an easy thing to say 'needs to be simpler', and IMO quite right, though at the same time crafting such 'simple' applications has a lot of devils in the details. Something I'm sure Raistmer and Primegrid Developers know too well.


There's no doubt that it is easier said than done, and there's no doubt that it requires a lot of work and development. Hopefully you know that Jord and I and others are intelligent enough to understand this.

Understanding that there's been a large push to get applications and support for hardware out the door, but I suppose a re-phrasing of the question comes down to: Why wasn't there more focus on simplification and unification in the first place?

In the beginning, for CUDA applications it was as simple as selecting an app that corresponded with your CUDA version supported in hardware to get the most speed. Now on the ATi/AMD side of things, it's application for hardware support, app_info.xml files, command line .txt files.... you get the gist, right? It seems things are constantly going toward speed optimization and complicated setups that stability is gone.

Again, I understand that development of these applications isn't easy, but there's definitely a pattern of a problematic approach.
ID: 1791334 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 1791335 - Posted: 28 May 2016, 17:16:13 UTC - in response to Message 1791328.  



I don't think the intent is to make this personal for the ATi/AMD developers and testers, but rather to make development, testing, and supporting easier. Jord mentioned that the OpenCL standard was supposed to simplify our lives with a single unified application, yet it seems everyone here at SETI (Green Team and Red Team) have chosen the opposite path - all with the mindset of getting the most optimized speed available.

I think the question really comes down to: Can we reconsider our path and can we simply this process?

No. If one wants to simplify testing and supporting one can take part in both activities.
ID: 1791335 · Report as offensive
OzzFan Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Apr 02
Posts: 15691
Credit: 84,761,841
RAC: 28
United States
Message 1791336 - Posted: 28 May 2016, 17:20:51 UTC - in response to Message 1791332.  

Under current cicumstances the answer is no so far i can tell.


That's an incredibly discouraging and off-putting response. I suppose if that's the way it is, then it is what it is. Though I must say that doesn't instill me with much confidence in the development teams.

We try to keep it as simple as possible.
OTOH the users don`t have to do anything except letting the apps run.
So what`s the real problem in this case ?


How many times have I seen you assisting ATi/AMD users with a specific problem to modify the command line .txt file or to use a specific driver version or both? Still think users don't have to do anything but let apps run?
ID: 1791336 · Report as offensive
Profile jason_gee
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 24 Nov 06
Posts: 7489
Credit: 91,093,184
RAC: 0
Australia
Message 1791337 - Posted: 28 May 2016, 17:22:55 UTC - in response to Message 1791334.  

It's an easy thing to say 'needs to be simpler', and IMO quite right, though at the same time crafting such 'simple' applications has a lot of devils in the details. Something I'm sure Raistmer and Primegrid Developers know too well.


There's no doubt that it is easier said than done, and there's no doubt that it requires a lot of work and development. Hopefully you know that Jord and I and others are intelligent enough to understand this.

Understanding that there's been a large push to get applications and support for hardware out the door, but I suppose a re-phrasing of the question comes down to: Why wasn't there more focus on simplification and unification in the first place?

In the beginning, for CUDA applications it was as simple as selecting an app that corresponded with your CUDA version supported in hardware to get the most speed. Now on the ATi/AMD side of things, it's application for hardware support, app_info.xml files, command line .txt files.... you get the gist, right? It seems things are constantly going toward speed optimization and complicated setups that stability is gone.

Again, I understand that development of these applications isn't easy, but there's definitely a pattern of a problematic approach.


Oh *I'm* quite clear that yourself and Jord have an excellent understanding of the issues in hand. Just have to present that I'm fully aware of the limitations with respect to my own work, such that it can't be used as excuses :)
"Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions.
ID: 1791337 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 1791339 - Posted: 28 May 2016, 17:23:51 UTC - in response to Message 1791332.  

So what`s the real problem in this case ?

As I already said, the real problem that some want to transfer tech-oriented forum in cafe filial.
Self-imposed issue that actually only in mind of asking.
If one cares about how it will scale to iPhone - iPhone will get just baseline app (though I doubt it ever happened). ATi users will get apps more suitable for particular hardware. Also this will allow to select what path most suitable and allow further improvements.
ID: 1791339 · Report as offensive
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34255
Credit: 79,922,639
RAC: 80
Germany
Message 1791340 - Posted: 28 May 2016, 17:24:19 UTC - in response to Message 1791336.  

Under current cicumstances the answer is no so far i can tell.


That's an incredibly discouraging and off-putting response. I suppose if that's the way it is, then it is what it is. Though I must say that doesn't instill me with much confidence in the development teams.

We try to keep it as simple as possible.
OTOH the users don`t have to do anything except letting the apps run.
So what`s the real problem in this case ?


How many times have I seen you assisting ATi/AMD users with a specific problem to modify the command line .txt file or to use a specific driver version or both? Still think users don't have to do anything but let apps run?


That`s true but merly speed related.
Would you prefer i dont help ?


In the beginning, for CUDA applications it was as simple as selecting an app that corresponded with your CUDA version supported in hardware to get the most speed. Now on the ATi/AMD side of things, it's application for hardware support, app_info.xml files, command line .txt files.... you get the gist, right? It seems things are constantly going toward speed optimization and complicated setups that stability is gone.


Neither me nor Raistmer are happy with this situation but we are not responsible for driver issues etc.
We just try to make the best out of this.


With each crime and every kindness we birth our future.
ID: 1791340 · Report as offensive
OzzFan Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Apr 02
Posts: 15691
Credit: 84,761,841
RAC: 28
United States
Message 1791342 - Posted: 28 May 2016, 17:25:15 UTC - in response to Message 1791335.  



I don't think the intent is to make this personal for the ATi/AMD developers and testers, but rather to make development, testing, and supporting easier. Jord mentioned that the OpenCL standard was supposed to simplify our lives with a single unified application, yet it seems everyone here at SETI (Green Team and Red Team) have chosen the opposite path - all with the mindset of getting the most optimized speed available.

I think the question really comes down to: Can we reconsider our path and can we simply this process?

No. If one wants to simplify testing and supporting one can take part in both activities.


Excellent. Since you're already part of development and testing, I take this as tacit acceptance of the challenge laid out before you.
ID: 1791342 · Report as offensive
Previous · 1 · 2 · 3 · 4 · Next

Message boards : Number crunching : Why need 5 different stock AMD OpenCL GPU applications?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.