Message boards :
Number crunching :
Linux CUDA 'Special' App finally available, featuring Low CPU use
Message board moderation
Previous · 1 . . . 4 · 5 · 6 · 7 · 8 · 9 · 10 . . . 83 · Next
Author | Message |
---|---|
![]() ![]() Send message Joined: 24 Nov 06 Posts: 7489 Credit: 91,093,184 RAC: 0 ![]() |
I have a list of issues, but none of the others prevent wider testing so much as present much rarer annoyances. I'll probably list those once I (or anyone else) resolves the pulsefinding issue, since that's priority. Later tonight, am going to attempt to bring the Mac Pro up on win10 (usb stick is prepared). That currently has el capitan, sierra, and Ubuntu 16.04 LTS. once same machine/device cross platform comparison is doable things get a bit easier. Fingers crossed for at least a sierra 1050ti driver sometime soon. "Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions. |
TBar Send message Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 ![]() ![]() |
Well, I just read the last few weeks at Mac Rumors and it doesn't look good for any Pascal drivers. The final word appears to be here, ...Actually, we did hear from the Nvidia CEO himself and it's not promising. The people in that thread would have it working if it were possible. So, you might be better off to put your 980 in the Mac and forget about the Mac running Pascal for now. The 980 works fine in a Mac, I've run across a few while looking over the Mac nVidia OpenCL App. |
![]() ![]() Send message Joined: 24 Nov 06 Posts: 7489 Credit: 91,093,184 RAC: 0 ![]() |
Ugh, frustrating. Well we'll see. Probably if nothing happens between now and getting Windows working with the Radeon and 1050ti, then I'll rather umbilical in the 780 again for regression testing purposes (relatively easy, but ugly). If the 980 can be flashed to enable the boot screen I'd prefer that, since it'd drop 1 GPU I'm not really using for crunching, and avoid snaking cables in through the back. Failing that, the 680 can be more readily flashed apparently, and is currently sitting idle. Unfortunately alpha code would need some jiggering to work on 680, but then again that's going to have to happen eventually anyway. "Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions. |
![]() Send message Joined: 9 Apr 04 Posts: 8797 Credit: 2,930,782 RAC: 1 ![]() |
I have installed it on my SuSE Leap 42.2 and it works both as CPU and GPU. Tullio |
![]() ![]() ![]() ![]() Send message Joined: 1 Dec 99 Posts: 2786 Credit: 685,657,289 RAC: 835 ![]() ![]() |
I just looked at Petri's times for his stack of 1080's ... WOW 45s for Alerico tasks !! It looks like he is running a 2.75hour GPU cache, that's speedy. |
Stephen "Heretic" ![]() ![]() ![]() ![]() Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628 ![]() ![]() |
I just looked at Petri's times for his stack of 1080's ... WOW 45s for Alerico tasks !! . . Absolutely, but a pain during the weekly outages. Stephen :) |
rob smith ![]() ![]() ![]() Send message Joined: 7 Mar 03 Posts: 22606 Credit: 416,307,556 RAC: 380 ![]() ![]() |
...more like about an hour - remember he only has 100 tasks for each of his GPUs on that rig. (My own "big" cruncher barely manages three hours on the 300 tasks available for the three GPUs, and that one is nowhere as near as fast as Petri's monster) Bob Smith Member of Seti PIPPS (Pluto is a Planet Protest Society) Somewhere in the (un)known Universe? |
Sidewinder ![]() ![]() ![]() ![]() Send message Joined: 15 Nov 09 Posts: 100 Credit: 79,432,465 RAC: 0 ![]() |
My GTX 1050 Ti's arrived today and switched over to Tbar's app. It appears to be running well on the 1050 Ti's and Arch. https://setiathome.berkeley.edu/show_host_detail.php?hostid=8173821 Name : nvidia Version : 375.26-1 Description : NVIDIA drivers for linux Architecture : x86_64 Name : cuda Version : 8.0.44-2 Description : NVIDIA's GPU programming toolkit Architecture : x86_64 GPU stats: +-----------------------------------------------------------------------------+ | NVIDIA-SMI 375.26 Driver Version: 375.26 | |-------------------------------+----------------------+----------------------+ | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | |===============================+======================+======================| | 0 GeForce GTX 105... Off | 0000:02:00.0 Off | N/A | | 39% 62C P0 59W / 75W | 1189MiB / 4036MiB | 87% Default | +-------------------------------+----------------------+----------------------+ | 1 GeForce GTX 105... Off | 0000:06:00.0 Off | N/A | | 35% 45C P0 60W / 75W | 1189MiB / 4038MiB | 92% Default | +-------------------------------+----------------------+----------------------+ +-----------------------------------------------------------------------------+ | Processes: GPU Memory | | GPU PID Type Process name Usage | |=============================================================================| | 0 3651 C ...thome_x41p_zi+_x86_64-pc-linux-gnu_cuda60 1187MiB | | 1 3526 C ...thome_x41p_zi+_x86_64-pc-linux-gnu_cuda60 1187MiB | +-----------------------------------------------------------------------------+ ![]() |
![]() ![]() ![]() ![]() Send message Joined: 1 Dec 99 Posts: 2786 Credit: 685,657,289 RAC: 835 ![]() ![]() |
Hey sidewinder, How do you get those outputs from Ubuntu? |
Sidewinder ![]() ![]() ![]() ![]() Send message Joined: 15 Nov 09 Posts: 100 Credit: 79,432,465 RAC: 0 ![]() |
Hey sidewinder, How do you get those outputs from Ubuntu? I'm on an Arch-based distro so it may be different for debian/ubuntu. The first is just a pacman (arch's package manager) query and the second is nvidia-smi which comes with the nvidia driver package. ![]() |
![]() Send message Joined: 6 Jun 02 Posts: 1668 Credit: 623,086,772 RAC: 156 ![]() ![]() |
Hey sidewinder, How do you get those outputs from Ubuntu? Hi, I run nvidia-smi -l on a separate console window all the time.. |===============================+======================+======================| | 0 GeForce GTX 1080 On | 0000:05:00.0 On | N/A | | 96% 66C P2 173W / 215W | 4258MiB / 8112MiB | 94% Default | +-------------------------------+----------------------+----------------------+ | 1 GeForce GTX 1080 On | 0000:06:00.0 Off | N/A | |100% 70C P2 148W / 215W | 3896MiB / 8113MiB | 94% Default | +-------------------------------+----------------------+----------------------+ | 2 GeForce GTX 1080 On | 0000:09:00.0 Off | N/A | | 96% 60C P2 148W / 215W | 3896MiB / 8113MiB | 90% Default | +-------------------------------+----------------------+----------------------+ | 3 GeForce GTX 1080 On | 0000:0A:00.0 Off | N/A | | 96% 59C P2 140W / 215W | 3896MiB / 8113MiB | 94% Default | +-------------------------------+----------------------+----------------------+ +-----------------------------------------------------------------------------+ | Processes: GPU Memory | | GPU PID Type Process name Usage | |=============================================================================| | 0 898 G /usr/bin/X 214MiB | | 0 1537 G compiz 146MiB | | 0 11565 C ...thome_x41zc_x86_64-pc-linux-gnu_cuda65_v8 3893MiB | | 1 11574 C ...thome_x41zc_x86_64-pc-linux-gnu_cuda65_v8 3893MiB | | 2 12029 C ...thome_x41zc_x86_64-pc-linux-gnu_cuda65_v8 3893MiB | | 3 11947 C ...thome_x41zc_x86_64-pc-linux-gnu_cuda65_v8 3893MiB | +-----------------------------------------------------------------------------+ To overcome Heisenbergs: "You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones |
![]() ![]() ![]() ![]() Send message Joined: 1 Dec 99 Posts: 2786 Credit: 685,657,289 RAC: 835 ![]() ![]() |
Thanks guys, I'm a little surprised to see my 980 using less power than a 1080 +-----------------------------------------------------------------------------+ | NVIDIA-SMI 367.57 Driver Version: 367.57 | |-------------------------------+----------------------+----------------------+ | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | |===============================+======================+======================| | 0 GeForce GTX 980 Off | 0000:01:00.0 Off | N/A | | 26% 37C P2 94W / 180W | 2007MiB / 4037MiB | 48% Default | +-------------------------------+----------------------+----------------------+ | 1 GeForce GTX 1070 Off | 0000:03:00.0 Off | N/A | | 51% 72C P2 72W / 151W | 1810MiB / 8113MiB | 39% Default | +-------------------------------+----------------------+----------------------+ +-----------------------------------------------------------------------------+ | Processes: GPU Memory | | GPU PID Type Process name Usage | |=============================================================================| | 0 1358 G /usr/lib/xorg/Xorg 181MiB | | 0 2465 G compiz 38MiB | | 0 2901 G /usr/lib/firefox/plugin-container 4MiB | | 0 5762 C ...thome_x41p_zi+_x86_64-pc-linux-gnu_cuda60 1779MiB | | 1 5798 C ...thome_x41p_zi+_x86_64-pc-linux-gnu_cuda60 1807MiB | +-----------------------------------------------------------------------------+ |
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13871 Credit: 208,696,464 RAC: 304 ![]() ![]() |
Thanks guys, You need to compare your GPU Utilisation with Petrie's GPU-Util You 48% 39% petrie33 94% 94% 90% 94% And the APR 643 v 1734 GFLOPS. The more work it does, the more power it needs. Still, his cards use less than double your power, but pump out (almost) 3 times as much work. Pascal really is impressive for it's work/watt hour. Grant Darwin NT |
TBar Send message Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 ![]() ![]() |
Thanks guys, That's low for the GPU Utilization. If it's constantly that low you might try raising the Unroll number. Petri is using -unroll 40 on his 1080s, which is twice as high as the compute units. However, it appears to be working well at 40. Most people have a higher GPU-Util number, | NVIDIA-SMI 367.57 Driver Version: 367.57 | |-------------------------------+----------------------+----------------------+ | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | |===============================+======================+======================| | 0 GeForce GTX 750 Ti Off | 0000:01:00.0 On | N/A | | 70% 61C P0 24W / 38W | 1476MiB / 1999MiB | 92% Default | +-------------------------------+----------------------+----------------------+ | 1 GeForce GTX 750 Ti Off | 0000:02:00.0 Off | N/A | | 53% 63C P0 28W / 38W | 1284MiB / 2000MiB | 93% Default | +-------------------------------+----------------------+----------------------+ | 2 GeForce GTX 750 Ti Off | 0000:03:00.0 Off | N/A | | 46% 54C P0 29W / 38W | 1284MiB / 2000MiB | 95% Default | +-------------------------------+----------------------+----------------------+ +-----------------------------------------------------------------------------+ | Processes: GPU Memory | | GPU PID Type Process name Usage | |=============================================================================| | 0 1242 G /usr/lib/xorg/Xorg 137MiB | | 0 1989 G compiz 54MiB | | 0 19463 C ...home_x41p_zi3k_x86_64-pc-linux-gnu_cuda75 1280MiB | | 1 19714 C ...home_x41p_zi3k_x86_64-pc-linux-gnu_cuda75 1280MiB | | 2 19237 C ...home_x41p_zi3k_x86_64-pc-linux-gnu_cuda75 1280MiB | +-----------------------------------------------------------------------------+ Look at that, under 30 watts on all three. |
![]() ![]() ![]() ![]() Send message Joined: 1 Dec 99 Posts: 2786 Credit: 685,657,289 RAC: 835 ![]() ![]() |
Yes I noticed Petri was set at 40 with 20 CU, and also some other comand line options that I'm unsure of (and not mention anywhere that I have found) Using pfb = 8 from command line args Edit: I have 15CU(1070) and 16CU(980) so that adds another twist. |
Sidewinder ![]() ![]() ![]() ![]() Send message Joined: 15 Nov 09 Posts: 100 Credit: 79,432,465 RAC: 0 ![]() |
Yes I noticed Petri was set at 40 with 20 CU, and also some other comand line options that I'm unsure of (and not mention anywhere that I have found) For reference, the 1050 Ti's have 6 CUs and my utilization numbers above are with unroll set to 6. Tbar, do you think setting it any higher will keep the utilization higher on the 1050 Ti's? They typically stay at low 90's to mid-80's. ![]() |
![]() ![]() ![]() ![]() Send message Joined: 1 Dec 99 Posts: 2786 Credit: 685,657,289 RAC: 835 ![]() ![]() |
Well this is certainly interesting, I was running 3 CPU tasks with 2 GPU tasks (total) at ~85% CPU usage, and figured I would run tasks out before restarting. When one CPU task finished GPU temp/Usage went UP When two CPU task finished GPU temp/Usage went UP When three CPU task finished GPU temp/Usage went UP (slightly) I guess it's because all tasks are at VeryLowPriority ... Any one know how to raise that in Linux for GPU? I did try -hp before and it didn't change anything. |
Sidewinder ![]() ![]() ![]() ![]() Send message Joined: 15 Nov 09 Posts: 100 Credit: 79,432,465 RAC: 0 ![]() |
Well this is certainly interesting, I was running 3 CPU tasks with 2 GPU tasks (total) at ~85% CPU usage, and figured I would run tasks out before restarting. It looks like you control that with "<process_priority_special>N</process_priority_special>" in the cc_config.xml file. See: https://boinc.berkeley.edu/wiki/Client_configuration. Looks like it requires BOINC v7.6.14 or higher. ![]() |
TBar Send message Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 ![]() ![]() |
Tbar, do you think setting it any higher will keep the utilization higher on the 1050 Ti's? They typically stay at low 90's to mid-80's. You could try it and see. In my experience raising the unroll on the lower end cards will slow down the VLAR tasks. Of course we don't have any BLC tasks at present, but, the unroll is mainly for the BLC tasks and a setting for Arecibo tasks may not be best for VLARs. It might be better to add the <no_priority_change>1</no_priority_change> line to your cc_config.xml file. That line will set All BOINC tasks to nice 0, but usually will increase GPU usage. I have it on my machines and the machine with three 750Ti is also running 2 CPU tasks on a old Quad core CPU yet the GPU usage is around the low to mid 90s. The pfb & pfp settings are the same from the Windows CUDA Apps and just as with the Windows CUDA Apps they produce little to No advantage. They Can cause increased Inconclusive results on some cards though. On my Mac it Slows down the tasks unless the settings are maxed out and then you get Many Inconclusive results. The last thing this App needs is More Inconclusive results, so, I don't recommend using those settings. |
![]() ![]() ![]() ![]() Send message Joined: 1 Dec 99 Posts: 2786 Credit: 685,657,289 RAC: 835 ![]() ![]() |
It's certainly warming up in here ... |-------------------------------+----------------------+----------------------+ | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | |===============================+======================+======================| | 0 GeForce GTX 980 Off | 0000:01:00.0 Off | N/A | | 26% 40C P2 133W / 180W | 3300MiB / 4037MiB | 92% Default | +-------------------------------+----------------------+----------------------+ | 1 GeForce GTX 1070 Off | 0000:03:00.0 Off | N/A | | 60% 80C P2 123W / 151W | 3106MiB / 8113MiB | 90% Default | +-------------------------------+----------------------+----------------------+ Down about 25% in times to around 3:30 ... more playing required :) |
©2025 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.