OpenCL vs. CUDA, histoical and current situation?

Message boards : Number crunching : OpenCL vs. CUDA, histoical and current situation?
Message board moderation

To post messages, you must log in.

AuthorMessage
Al Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Avatar

Send message
Joined: 3 Apr 99
Posts: 1682
Credit: 477,343,364
RAC: 482
United States
Message 1776855 - Posted: 7 Apr 2016, 14:45:28 UTC

If someone could fill me and everyone else (who hasn't paid close attention to it over the years) in regarding NV products and CUDA, and then/now OpenCL, I would appreciate it. I know that back in the day, Nvidia was a partner (with us directly?) in getting CUDA out as an optimized platform, but if I remember correctly, they pretty much pulled their active support, and this is what drove development of the OpenCL?

Am I correct in thinking that all things considered, CUDA is better than OpenCL when it comes to crunching on Nvidia cards? As I haven't been active for a few years, this may all be old news, but I'd like to know the history of it, and also if there is anything new to know about the situation. Thanks guys!

ID: 1776855 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1776859 - Posted: 7 Apr 2016, 15:10:08 UTC - in response to Message 1776855.  
Last modified: 7 Apr 2016, 15:25:51 UTC

If someone could fill me and everyone else (who hasn't paid close attention to it over the years) in regarding NV products and CUDA, and then/now OpenCL, I would appreciate it. I know that back in the day, Nvidia was a partner (with us directly?) in getting CUDA out as an optimized platform, but if I remember correctly, they pretty much pulled their active support, and this is what drove development of the OpenCL?

Am I correct in thinking that all things considered, CUDA is better than OpenCL when it comes to crunching on Nvidia cards? As I haven't been active for a few years, this may all be old news, but I'd like to know the history of it, and also if there is anything new to know about the situation. Thanks guys!

I think the super quick version of the story is a long the lines of... NVIDIA basically wrote, or did a lot of optimization for, the initial SETI@home CUDA application.

I think NVIDIA may have also donated some GPUs to the project so development could continue. Continued development/optimization of the CUDA applications has largely been, 100%?, from volunteer efforts of the community. I'm not sure if jason_gee was the initial person to jump on it, but they are defiantly one of the CUDA gurus.

ATI/AMD sent SETI@home some GPUs and the SDK & said good luck. So no Brook/Stream apps were developed by the project as they didn't have the time/resources.
So volunteer efforts, by Raistmer, developed the initial Brook/Stream on ATI GPUs for Astropulse. Then later the move to OpenCL as AMD dropped support for ATI's native Brook/Stream.
In time SETI@Home(MB) OpenCL apps were developed for ATI/AMD GPUs. Then came the OpenCL Astropulse app for Nvidia. I think last was the OpenCL MB app for Nvidia.
I'm not sure on the current status of the OpenCL MB Nvidia apps. As I primary use ATI GPUs. However I believe that TBar's MAC Nvidia app is the stock GPU app for MAC. Also petri has or is still developing a CUDA app that is more efficient.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1776859 · Report as offensive
Profile jason_gee
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 24 Nov 06
Posts: 7489
Credit: 91,093,184
RAC: 0
Australia
Message 1776911 - Posted: 7 Apr 2016, 18:52:28 UTC - in response to Message 1776855.  

Have some roadmap adjustments to make for inclusion of Cuda 8 (for which I've received a request for preview), but then can fill in some of the blanks. There is a fair amount of flux coming, both from upcoming technology change, plus pre-existing plans. Taking the time to review the history of the Cuda apps (pre X-Branch and X-branch) will probably be a valuable thing to do going forwards, since the strengths versus the shortcomings come out of the history to a pretty large degree.
"Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions.
ID: 1776911 · Report as offensive
Profile mimo
Volunteer tester
Avatar

Send message
Joined: 7 Feb 03
Posts: 92
Credit: 14,957,404
RAC: 0
Slovakia
Message 1776927 - Posted: 7 Apr 2016, 19:39:11 UTC - in response to Message 1776911.  

Have some roadmap adjustments to make for inclusion of Cuda 8 (for which I've received a request for preview), but then can fill in some of the blanks. There is a fair amount of flux coming, both from upcoming technology change, plus pre-existing plans. Taking the time to review the history of the Cuda apps (pre X-Branch and X-branch) will probably be a valuable thing to do going forwards, since the strengths versus the shortcomings come out of the history to a pretty large degree.


maybe it is time to drop some old cards (1.x) ...

ID: 1776927 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 1776939 - Posted: 7 Apr 2016, 20:17:27 UTC - in response to Message 1776927.  
Last modified: 7 Apr 2016, 20:51:09 UTC

1.Ñ… is where CUDA23/cuFFT outperforms OpenCL/oclFFT with good margin...

And I'm very doubt that any hardware was donated by AMD or NV.
ID: 1776939 · Report as offensive
Profile cov_route
Avatar

Send message
Joined: 13 Sep 12
Posts: 342
Credit: 10,270,618
RAC: 0
Canada
Message 1776999 - Posted: 8 Apr 2016, 1:09:55 UTC - in response to Message 1776939.  

1.Ñ… is where CUDA23/cuFFT outperforms OpenCL/oclFFT with good margin...

And I'm very doubt that any hardware was donated by AMD or NV.

Do you mean OCL 1.x?
ID: 1776999 · Report as offensive
Profile jason_gee
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 24 Nov 06
Posts: 7489
Credit: 91,093,184
RAC: 0
Australia
Message 1777008 - Posted: 8 Apr 2016, 1:37:27 UTC - in response to Message 1776927.  

Have some roadmap adjustments to make for inclusion of Cuda 8 (for which I've received a request for preview), but then can fill in some of the blanks. There is a fair amount of flux coming, both from upcoming technology change, plus pre-existing plans. Taking the time to review the history of the Cuda apps (pre X-Branch and X-branch) will probably be a valuable thing to do going forwards, since the strengths versus the shortcomings come out of the history to a pretty large degree.


maybe it is time to drop some old cards (1.x) ...


A lot of people are bringing that up lately, but at least for me it's more work to remove functional builds than it is to leave them working (at least for stock). I'll be hesitant while the technologies are changing, just because reference is needed to prove new stuff against (even if no-one uses it in production anymore) :) Naturally stock and 3rd party have different things to think about, and it all could change.
"Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions.
ID: 1777008 · Report as offensive
Profile jason_gee
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 24 Nov 06
Posts: 7489
Credit: 91,093,184
RAC: 0
Australia
Message 1777009 - Posted: 8 Apr 2016, 1:44:51 UTC - in response to Message 1776999.  
Last modified: 8 Apr 2016, 1:49:08 UTC

1.Ñ… is where CUDA23/cuFFT outperforms OpenCL/oclFFT with good margin...

And I'm very doubt that any hardware was donated by AMD or NV.

Do you mean OCL 1.x?


I believe he means Pre-Fermi (Compute capability 1.0 to 1.3). After Cuda 2.3 there were significant driver changes to support newer architectures and complex OS Driver models. On these (~DirextX9) GPUs with limited dedicated hardware, the more complex features become emulated from later XP drivers + Cuda 3 onwards, slowing them down (and using more VRAM)

[Edit:] Lol, only thing ever offered to me by nVidia, was an nVidia T-Shirt, as a thankyou for beta testing Cuda 6 and finding several issues. I never received it, lol.
"Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions.
ID: 1777009 · Report as offensive
Profile mimo
Volunteer tester
Avatar

Send message
Joined: 7 Feb 03
Posts: 92
Credit: 14,957,404
RAC: 0
Slovakia
Message 1777160 - Posted: 8 Apr 2016, 14:03:10 UTC - in response to Message 1777009.  

1.Ñ… is where CUDA23/cuFFT outperforms OpenCL/oclFFT with good margin...

And I'm very doubt that any hardware was donated by AMD or NV.

Do you mean OCL 1.x?


I believe he means Pre-Fermi (Compute capability 1.0 to 1.3). After Cuda 2.3 there were significant driver changes to support newer architectures and complex OS Driver models. On these (~DirextX9) GPUs with limited dedicated hardware, the more complex features become emulated from later XP drivers + Cuda 3 onwards, slowing them down (and using more VRAM)

[Edit:] Lol, only thing ever offered to me by nVidia, was an nVidia T-Shirt, as a thankyou for beta testing Cuda 6 and finding several issues. I never received it, lol.


ya i mean prefermi.

not only diffs in the driver but even coding style changed (best practise, memory organisation - coalescing in prefermi archs ... )

ID: 1777160 · Report as offensive
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 1777319 - Posted: 8 Apr 2016, 23:05:41 UTC - in response to Message 1776939.  

And I'm very doubt that any hardware was donated by AMD or NV.

I asked, and it appears that Nvidia did give the development crew one (or more) GPU(s) to work with while converting the Seti source code to CUDA. ATI expressed an interest in GPGPU computing but in the end didn't give any help at all.

I think Einstein@Home managed to get an ATI GPU donated from ATI, but they also had to convert the source code to run with Brook+ - CAL themselves.
ID: 1777319 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1777335 - Posted: 8 Apr 2016, 23:36:34 UTC - in response to Message 1777319.  

And I'm very doubt that any hardware was donated by AMD or NV.

I asked, and it appears that Nvidia did give the development crew one (or more) GPU(s) to work with while converting the Seti source code to CUDA. ATI expressed an interest in GPGPU computing but in the end didn't give any help at all.

I think Einstein@Home managed to get an ATI GPU donated from ATI, but they also had to convert the source code to run with Brook+ - CAL themselves.

I thought Eric or Matt has mentioned ATI sent a dev kit with something like a HD4850 & said "Good luck". Perhaps it was someone mentioning Einstein that my brain has twisted around.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1777335 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1777341 - Posted: 8 Apr 2016, 23:44:35 UTC - in response to Message 1777319.  

So far as I know, the actual CUDA development for SETI@Home was done by:

CUDA development by:
Przemyslaw Zych (NVIDIA)
Scott Le Grand (NVIDIA)
Bob Johnston (NVIDIA)

- those are the authors we credit in the Lunatics AUTHORS file, at any rate: I don't have access to the stock equivalent right now. If anybody has access to a sourced alternative list, please let me know.

I suspect any hardware donated by NVidia will have been used in-lab primarily for testing, rather than original development.

The original Release Announcement has a still-active link to the NVidia Press Release.
ID: 1777341 · Report as offensive
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 1777496 - Posted: 9 Apr 2016, 11:53:50 UTC - in response to Message 1777335.  

I thought Eric or Matt has mentioned ATI sent a dev kit with something like a HD4850 & said "Good luck". Perhaps it was someone mentioning Einstein that my brain has twisted around.

I thought the same thing and spent over an hour searching the forums on it, then just went and asked Matt and Eric in email. Eric answered they never got an ATI GPU but did get an Nvidia and that they wrote the CUDA application.

With development crew I meant the people developing the CUDA application, not BOINC, although there wasn't much difference between them as the BOINC developers heavily developed the CUDA application. I got to be middle man between the forums and some of the Nvidia CUDA developers, still got all those emails. Fun thing for me was that I didn't even have -and still haven't- an Nvidia GPU. ;-)

So back to ATI, and whatever they may have sent in. I also seem to remember it was a HD4850 or HD4870, and that it was a huge card, one they couldn't fit in any of their present computers. But memory wanes as to who it was specifically, Seti or Einstein. And searching the forums for any string with "ATI" in it will return any word with 'ati' in it someplace, like retaliation, location, limitations, occupation, participation, nation, etc. Really frustrating. ;-)
ID: 1777496 · Report as offensive
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34255
Credit: 79,922,639
RAC: 80
Germany
Message 1777501 - Posted: 9 Apr 2016, 12:13:09 UTC
Last modified: 9 Apr 2016, 12:23:55 UTC

One of the reasons i joined Lunatics was the lack of man power for ATI development.
Heck its been a long time now.
I remember when i started we were at r177.
We were just happy to have something to use for.

I`m very happy what has been accomplished in the last years.


With each crime and every kindness we birth our future.
ID: 1777501 · Report as offensive

Message boards : Number crunching : OpenCL vs. CUDA, histoical and current situation?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.