Message boards :
Number crunching :
Nvidia 397.31 driver available.
Message board moderation
Previous · 1 · 2
Author | Message |
---|---|
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
All it takes is for the ComputeCache to get corrupted and then compute is done for and your cache will just empty out on instant errors. Only solution is to stop BOINC, delete the ComputeCache and reboot. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
Bernie Vine Send message Joined: 26 May 99 Posts: 9954 Credit: 103,452,613 RAC: 328 |
Last night i noticed that my machine with the 397.31 driver had 6 GPU tasks "waiting to run" which means they had started and been "postponed" This is a fairly new machine and has never had that before. So I rolled back to 391.35 and the problem seems to have gone. Also I use this machine for gaming and yesterday evening I was totally unable to get Fallout 4 to start, now however having the old driver back it started with no problems. I think perhaps with 397.31 your mileage may vary. |
JohnDK Send message Joined: 28 May 00 Posts: 1222 Credit: 451,243,443 RAC: 1,127 |
I installed 397.31 yesterday and so far no problems here. |
Kevin Olley Send message Joined: 3 Aug 99 Posts: 906 Credit: 261,085,289 RAC: 572 |
Apparently there are problems with this driver. https://hothardware.com/news/nvidia-fix-recent-buggy-drivers-geforce-gtx-1060 Kevin |
tullio Send message Joined: 9 Apr 04 Posts: 8797 Credit: 2,930,782 RAC: 1 |
I had to reinstall it using GeForce on my Windows 10 PC. After the first installation all Einstein@home GPU tasks crashed after a few seconds. Tullio |
Siran d'Vel'nahr Send message Joined: 23 May 99 Posts: 7379 Credit: 44,181,323 RAC: 238 |
Greetings, The fixed driver can be downloaded here: GeForce Hotfix Driver Version 397.55. Siran CAPT Siran d'Vel'nahr - L L & P _\\// Winders 11 OS? "What a piece of junk!" - L. Skywalker "Logic is the cement of our civilization with which we ascend from chaos using reason as our guide." - T'Plana-hath |
Dave Lewis Send message Joined: 12 Apr 99 Posts: 34 Credit: 53,432,603 RAC: 108 |
Mike, thanks for your response. I've had no issues since reverting to an older Nvidia driver either (approximately 3 days ago). Yes, time will tell. Both of the episodes of errors occurred within 36 hours of installing the ver. 391.35 drivers. I see a driver hotfix is out but I'm going to stick with the current version that I'm using until the official new version is released to test. Best of luck to you! |
Dave Lewis Send message Joined: 12 Apr 99 Posts: 34 Credit: 53,432,603 RAC: 108 |
All it takes is for the ComputeCache to get corrupted and then compute is done for and your cache will just empty out on instant errors. Only solution is to stop BOINC, delete the ComputeCache and reboot. Thanks for your assistance Keith. I want to verify which ComputeCache folder to delete in case there could be more than one that I didn't locate. It would be the one in the path "...Users\username\AppData\Roaming\NVIDIA\ComputeCache" correct? |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
Yes, that is correct. Just delete the folder and all the subfolders when BOINC is not running and you are not doing anything else compute wise with the cards. The Nvidia driver will recreate the folder upon reboot. And the first compute work creates the necessary compute primitives for each task all automatically. The ComputeCache contains all the the CUDA and OpenCL primitives. I have had corruption on both CUDA special app work and SoG OpenCL work. As long as the necessary drivers and capabilities are called out at the beginning of the BOINC event log and the cards are seen, any stderr.txt error message on a failed task alluding to missing capability or failure to initialize CUDA or OpenCL almost always points to ComputeCache corruption or system pointers that have gone missing. Easiest solution is to delete the ComputeCache and reboot. You have to be fast to catch the problem because on fast systems with lots of cards, it only takes a couple of minutes to completely error out your cache as each task start errors out in seconds and then grabs another task to repeat the failure. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
Dave Lewis Send message Joined: 12 Apr 99 Posts: 34 Credit: 53,432,603 RAC: 108 |
|
petri33 Send message Joined: 6 Jun 02 Posts: 1668 Credit: 623,086,772 RAC: 156 |
latest driver and just in time compilation feature can give you a boost. NVIDIA apps in general. OpenCL too. New driver can have a new compiler and new optimisations. OpenCL (SoG) is compiled whenever a driver changes. CUDA 9.1 version has pre-compiled code for existing hardware and new hardware is supported through JIT compilation. Petri To overcome Heisenbergs: "You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
Thanks for the lucid explanation Petri. I got it all in one take. Really appreciate your efforts to maximize our hardware's potentials. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.