Experience with opt CUDA versus stock CUDA on my ION

Message boards : Number crunching : Experience with opt CUDA versus stock CUDA on my ION
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile SciManStev Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Jun 99
Posts: 6652
Credit: 121,090,076
RAC: 0
United States
Message 995828 - Posted: 12 May 2010, 16:39:31 UTC - in response to Message 995825.  

Except for the hard drive issues, that is exactly what happened to me with my GTX 295. The screen would blank and recover. Hardly any GPU WU's made it without generating an error. I am draining my cache now, and haven't had a GPU WU in about 6 days, and my RAC is just about where it was when I started draining the cache. I figured that my total RAC, which is totally CPU, would be about the same by using standard apps and my GPU. I will have the twin GTX 480's installed this weekend.

Steve
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website
ID: 995828 · Report as offensive
Profile SciManStev Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Jun 99
Posts: 6652
Credit: 121,090,076
RAC: 0
United States
Message 995830 - Posted: 12 May 2010, 16:49:22 UTC - in response to Message 995829.  

Except for the hard drive issues, that is exactly what happened to me with my GTX 295. The screen would blank and recover. Hardly any GPU WU's made it without generating an error. I am draining my cache now, and haven't had a GPU WU in about 6 days, and my RAC is just about where it was when I started draining the cache. I figured that my total RAC, which is totally CPU, would be about the same by using standard apps and my GPU. I will have the twin GTX 480's installed this weekend.

Steve


Yeah, but you can stay opt on MB and AP, and go stock only on CUDA. Opt MB and AP seems OK, and much faster than stock, but opt CUDA is in my humble opinion, not worth using, since it isn't any faster than stock, and seems to be not compatible with the screen driver.

Sten-Arne



I would have tried it out, as the instructions were clear, but with the 480's pending, I just gave up and put all my hopes in the 480's. I think that this has been a very useful bit of knowedge learned, on how to run stock CUDA, and optimized MB and AP. My thanks to Sutaru as well. I think many will benefit from that information. As you stated, it worked perfectly with stock apps.

Steve
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website
ID: 995830 · Report as offensive
Profile BilBg
Volunteer tester
Avatar

Send message
Joined: 27 May 07
Posts: 3720
Credit: 9,385,827
RAC: 0
Bulgaria
Message 995851 - Posted: 12 May 2010, 18:31:58 UTC - in response to Message 995828.  
Last modified: 12 May 2010, 18:37:18 UTC

.

... I will have the twin GTX 480's installed this weekend.

Steve


AFAIK GTX 480 can't be used for CUDA on SETI now - they need to change the app!

http://setiathome.berkeley.edu/forum_thread.php?id=59894

.
 


- ALF - "Find out what you don't do well ..... then don't do it!" :)
 
ID: 995851 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 995858 - Posted: 12 May 2010, 19:21:12 UTC


Sten-Arne, SciManStev, thanks for the flowers! :-)

BilBg, I guess SciManStev know this. ;-)

ID: 995858 · Report as offensive
Profile Jack Shaftoe
Avatar

Send message
Joined: 19 Aug 04
Posts: 44
Credit: 2,343,242
RAC: 0
United States
Message 995984 - Posted: 13 May 2010, 15:52:59 UTC - in response to Message 995962.  
Last modified: 13 May 2010, 15:57:58 UTC

It probably depends on the card, driver and BOINC version you are using. I'm using a GTX 260, and optimized CUDA works quite well for me. The only crashing tasks for me are VLAR's.

However, I have experienced a few "your display driver stopped working" black screens here and there. That said - it doesn't seem to directly correlate with a crashed task - sometimes when I check BOINC after this happens I have no failed CUDA wu's in the list.

Using 197.45 driver. 6.10.43 BOINC
ID: 995984 · Report as offensive
Profile SciManStev Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Jun 99
Posts: 6652
Credit: 121,090,076
RAC: 0
United States
Message 995991 - Posted: 13 May 2010, 16:36:58 UTC

My GTX 295 would crash and send back errors regardless of which driver I used, or which version of BOINC. I tried evey combination I could, multiple times, and nothing worked as long as I used the optimized apps.

Steve
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website
ID: 995991 · Report as offensive
Profile Fred J. Verster
Volunteer tester
Avatar

Send message
Joined: 21 Apr 04
Posts: 3252
Credit: 31,903,643
RAC: 0
Netherlands
Message 996000 - Posted: 13 May 2010, 17:06:14 UTC - in response to Message 995991.  

[OFF TOPIC]
Just started trying the ATI cards (HD5770/HD4850) at MW and desperatly trying one the ATI_AP app.'s at Bêta.
[ON TOPIC]

It ain't easy, programming a GPU, specially when too little info regarding these card's, is provided by the manufacturer.
NVidia has been helpfull, so I heard, but ATI, is a different matter, so it seems.
One should think, ATI has something to gain here also.

Use BOINC 6.10.43 on 3 QUAD's and 1 LT, used the Unified Installer v.035 (X64).
On the Q6600's I run optimized MB (CUDA too). Never changed the driver version, (only back)190.38 NVidida. CAL 1.41
Since the heat problems were solved, they run without error's, if I don't forget to check the REScheduler (1.9) on my XP64(X9650+2xGTS250;4GiG), that is :-)



ID: 996000 · Report as offensive
Profile Jack Shaftoe
Avatar

Send message
Joined: 19 Aug 04
Posts: 44
Credit: 2,343,242
RAC: 0
United States
Message 996007 - Posted: 13 May 2010, 17:21:17 UTC - in response to Message 995987.  

The following are only two examples of many of not VLAR killed task from your CUDA. Typical "opt" crashes:


I don't blame the app for those, I was playing with EVGA Precision several times yesterday when overclocking my card.

Last week Mon > Thu would be a good time to analyze - was travelling for work and it ran the whole time.

Right now, I am looking at my BOINC Manager, 7 Cuda tasks finished. 6 successful, one killed after 4 seconds - most likely VLAR, and one running - 68% complete after 28min.
ID: 996007 · Report as offensive
Profile Jack Shaftoe
Avatar

Send message
Joined: 19 Aug 04
Posts: 44
Credit: 2,343,242
RAC: 0
United States
Message 996013 - Posted: 13 May 2010, 17:51:48 UTC - in response to Message 996007.  

I just set my Network Activity to suspended so I could look at my results tomorrow. My card completes 1 MB every 25-40 minutes so I should have about 40 or 50 complete by midday tomorrow and can update you on number of failures.
ID: 996013 · Report as offensive
Profile Jack Shaftoe
Avatar

Send message
Joined: 19 Aug 04
Posts: 44
Credit: 2,343,242
RAC: 0
United States
Message 996166 - Posted: 14 May 2010, 14:22:20 UTC - in response to Message 996013.  

ID: 996166 · Report as offensive
Profile perryjay
Volunteer tester
Avatar

Send message
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 996171 - Posted: 14 May 2010, 14:53:50 UTC - in response to Message 996166.  

Morning Jack,
I like the to completion times on those you still have waiting to run. :-) Tasks that are taking you a half hour to 45 minutes to run are showing a guesstimate of three hours. Have you tried putting a flops count in your app_info? It should bring your DCF way down and you will get a lot more work once the boys from Berkeley get this upload problem sorted out.


PROUD MEMBER OF Team Starfire World BOINC
ID: 996171 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 996202 - Posted: 14 May 2010, 16:43:05 UTC - in response to Message 996193.  

However, to each his own. I'm not trying to make everyone revert to stock. For me it worked perfect and I reduced my errors to zero %.
Sten-Arne


V12 is possible to summon driver restarts on lowest-end GPUs more often than stock, cause it has one of thresholds changed that governs choice between 2 different kernels (big and faster one and small and slower one) to do pulse search. small kernel was introduced (though it's slower) to fight with too long kernel execution that causes driver restarts.

So definitely your experience can be quite different than mid-range GPU owner.
ID: 996202 · Report as offensive
Profile Jack Shaftoe
Avatar

Send message
Joined: 19 Aug 04
Posts: 44
Credit: 2,343,242
RAC: 0
United States
Message 996234 - Posted: 14 May 2010, 19:31:25 UTC - in response to Message 996202.  
Last modified: 14 May 2010, 19:32:07 UTC

Raistmer, can you confirm - stock app is just as fast as optimized app for cuda?
ID: 996234 · Report as offensive
Profile Jack Shaftoe
Avatar

Send message
Joined: 19 Aug 04
Posts: 44
Credit: 2,343,242
RAC: 0
United States
Message 996237 - Posted: 14 May 2010, 19:35:34 UTC - in response to Message 996171.  

Morning Jack,
Have you tried putting a flops count in your app_info? It should bring your DCF way down and you will get a lot more work once the boys from Berkeley get this upload problem sorted out.


No, not yet. Can you share details on how to do this PJ?
ID: 996237 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 996239 - Posted: 14 May 2010, 19:46:01 UTC - in response to Message 996234.  

Raistmer, can you confirm - stock app is just as fast as optimized app for cuda?


It depends from your hardware & software perhaps.
On my own hosts V12 runs faster.
Also, it has few changes to save some time here and there that missing in stock app. So I don't see why it should be slower. But for some configs it's possible.
As one could know binaries of even slightly different rebuilds can different in speed. Different memory buffers placing and you will get slightly different speed.
Newly rebooted system will give different (sometimes pretty noticeable one, not diminishing results) performance when one that runs for few days already with many processes started stopped due run. All these factors adds to error in any benchmarking.

So it's question of user preference, if he perceives/measures that one binary faster for his PC, why not use what is faster.

Another thing is error rate. As I said already, for very low-end GPUs and ION is such example driver restart rate can be higher than with stock app.
When discussed mod was in beta-testing there was no ION card involved.
So better participation in testing is required, btw...
ID: 996239 · Report as offensive
Profile perryjay
Volunteer tester
Avatar

Send message
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 996255 - Posted: 14 May 2010, 21:37:56 UTC - in response to Message 996237.  

Jack,
The instructions are in this thread http://setiathome.berkeley.edu/forum_thread.php?id=53593 once you get the math taken care of, post back if you have trouble figuring out where it goes.


PROUD MEMBER OF Team Starfire World BOINC
ID: 996255 · Report as offensive

Message boards : Number crunching : Experience with opt CUDA versus stock CUDA on my ION


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.