Linux CUDA 'Special' App finally available, featuring Low CPU use
Author | Message |
---|---|
TBar Send message Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 |
Where did you place the BOINC folder? If you had placed it in your Home folder, as per the instructions, you wouldn't need any special permissions. https://boinc.berkeley.edu/wiki/Installing_BOINC#The_Berkeley_Installer "Here is an example. The archive is downloaded to the desktop. It is then moved to the home directory (~). Finally BOINC is unpacked and installed. All of this can be done within a regular user account; root privileges are not needed." Actually, those instructions are a little off. You can simply drag the download to your Home folder, and after the dependency is installed just double click on 'The BOINC Manager' to launch it. With the BOINC folder in your Home folder, Ubuntu is about the easiest system to configure BOINC on. No Mac permissions and no Windows hidden folders. |
Brent Norman Send message Joined: 1 Dec 99 Posts: 2786 Credit: 685,657,289 RAC: 835 |
BOINC didn't ask where to go (I don't think). It's living in /var/lib/boinc-client at the moment, with 'BOINC core client' as the owner. I will run out my tasks and try a reinstall and see if I can 'bring it home' :) |
qbit Send message Joined: 19 Sep 04 Posts: 630 Credit: 6,868,528 RAC: 0 |
Brent, you don't have to install it. Just download it from the BOINC site, unpack it and move the whole folder to your home directory (or move it first and then unpack). Then run the manager from a terminal. At least that's how I did it and it runs fine. |
Rockhount Send message Joined: 29 May 00 Posts: 34 Credit: 31,935,954 RAC: 29 |
Hi Folks, update from my machine with the Linux special CUDA 6 app. RAC is up to 8900 and still rising. Everything is running fine with this app. My RAC is still rising too. Regards from northern Germany Roman SETI@home classic workunits 207,059 SETI@home classic CPU time 1,251,095 hours |
Brent Norman Send message Joined: 1 Dec 99 Posts: 2786 Credit: 685,657,289 RAC: 835 |
Good to hear Rock, I have Ubuntu 14.04 running now and just burning off these stock tasks before attempting the special app. Let's make the 1070 and 980 smoke :) Well the 980 is a hybrid, it will be fine :) |
Brent Norman Send message Joined: 1 Dec 99 Posts: 2786 Credit: 685,657,289 RAC: 835 |
Is the CUDA info not showing a driver version a concern before switching to the new apps? I have been running Cuda60 and SoG on stock apps, with 1 new Invalid against SoG. Wed 28 Dec 2016 06:12:52 AM CST | | CUDA: NVIDIA GPU 0: GeForce GTX 1070 (driver version unknown, CUDA version 8.0, compute capability 6.1, 4096MB, 3992MB available, 10204 GFLOPS peak) Heat is certainly a problem in this mid tower case :( |
TBar Send message Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 |
None of the Linux BOINC Apps posted by Berkeley show the CUDA driver version. It hasn't been a problem yet, so I don't expect it ever will be. The CUDA driver version is shown in the later versions of BOINC, which are only available from the Linux repositories. It's the same as the OpenCL version anyway, and that is shown in the earlier BOINC versions. The current Windows OpenCL App on Main has a known problem with Overflows; the Baseline CUDA App doesn't have that problem, so I'd say you were probably Robbed on that Invalid Overflow. Too bad it's too late to grab the task and test it on a CPU. It's an Overflow though, and will Never be used by SETI to look for ET or anything else, so it doesn't really matter. The Newer Windows OpenCL App handles the Overflows better, but as with the New Mac nVidia OpenCL App, it's still stuck at Beta. Maybe soon the Newer Apps will make it to Main. |
qbit Send message Joined: 19 Sep 04 Posts: 630 Credit: 6,868,528 RAC: 0 |
Does anybody know why BOINC reports my VRAM as 1024MB on Windows but only 977MB on Linux? |
TBar Send message Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 |
When you look at NVIDIA Settings in Windows what does it show for Total Memory and Total Dedicated Memory? I'm guessing your Linux NVIDIA Settings is showing 1024 for Total and around 977 for Total Dedicated Memory, that's the way it is on my nVidia cards anyway. BOINC is not showing the Total Memory. |
Brent Norman Send message Joined: 1 Dec 99 Posts: 2786 Credit: 685,657,289 RAC: 835 |
Where can I find the AP application files that the sample app_info in the CUDA60 download refers to? Looking for: astropulse_7.08_x86_64-pc-linux-gnu__opencl_nvidia_100 and AstroPulse_Kernels_r2751.cl. I found the CPU version, but not the GPU one. EDIT: Found them, I should have searched the forum, Duhh |
Brent Norman Send message Joined: 1 Dec 99 Posts: 2786 Credit: 685,657,289 RAC: 835 |
What was I thinking? Why didn't I put my motherboard in the new case, which has much, much better cooling and would have been easier to do? Tomorrow's project. Got the code purring away. GPU temps up 5C: 980@43C, 1070@83C. Runtimes are nice: ~6:20 on the 980, 7:20 on the 1070. Everything really went well other than all the problems with my i7; the i5 works great. And a new motherboard is ordered, likely Tuesday at the earliest for that. Now for more reading to try and find some fine tuning. |
Brent Norman Send message Joined: 1 Dec 99 Posts: 2786 Credit: 685,657,289 RAC: 835 |
Nicer on Arecibo tasks: 4:20 on the 980, 4:40 on the 1070. And I haven't seen the CPU over 7% for 2 tasks. |
qbit Send message Joined: 19 Sep 04 Posts: 630 Credit: 6,868,528 RAC: 0 |
"When you look at NVIDIA Settings in Windows what does it show for Total Memory and Total Dedicated Memory?" Hm, I would have to stop crunching on Linux to boot into Windows. Don't wanna do that right now, but from your answer it looks like it's not uncommon and therefore not a problem. Small update after a few days with the new app: everything's fine, I have crunched 800+ tasks successfully, no Error, no Invalid. RAC was ~5800 in the last days running OpenCL; currently it's ~7700 and still rising. Nice work, TBar & Petri! |
Brent Norman Send message Joined: 1 Dec 99 Posts: 2786 Credit: 685,657,289 RAC: 835 |
My Inconclusives seem just a tad high :( State: All (504) · In progress (271) · Validation pending (59) · Validation inconclusive (14) · Valid (159) · Invalid (0) · Error (1) |
qbit Send message Joined: 19 Sep 04 Posts: 630 Credit: 6,868,528 RAC: 0 |
That's ok, I had 22 for some time, all valid until now. |
Brent Norman Send message Joined: 1 Dec 99 Posts: 2786 Credit: 685,657,289 RAC: 835 |
Yea, it's OK in the end, but if 10-20% of tasks need to be rechecked that does seem a little counterproductive project-wise. |
qbit Send message Joined: 19 Sep 04 Posts: 630 Credit: 6,868,528 RAC: 0 |
Sure, but since our tasks are valid it's not a problem with this app here. I think a problem exists with the Apple Darwin apps; I see those a lot in my inconclusives. |
Raistmer Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
"Sure, but since our tasks are valid it's not a problem with this app here." Don't forget that credit granting and ultimate validation are quite "forgiving". A task will be counted as valid even if some signal it reports isn't, provided there are many other valid signals and the invalid one is in the "best" area. So ultimate validation doesn't mean that all is OK. Actually, we are no longer at the stage of deciding whether the app is OK or not. That stage has already passed. Jason confirmed missing parts in the Pulse signal reduction stages, so we know the app is broken in that part. A low invalid ratio just shows that the bug manifests itself in quite a low number of cases. It will only show up if there are a few reportable Pulses in the same PoT. Because even one such pulse is a rare event, having two in the same PoT, in such an order that the incorrect one is reported, is an even rarer event. SETI apps news We're not gonna fight them. We're gonna transcend them. |
jason_gee Send message Joined: 24 Nov 06 Posts: 7489 Credit: 91,093,184 RAC: 0 |
"Sure, but since our tasks are valid it's not a problem with this app here." A bit deeper digging verified a race on the power-of-2 folds, which run in parallel onto a single result set. So when a reportable pulse is detected in one fold, and a higher score in the other, without synchronisation it's a coin toss as to which pulse is recorded (as observed). The traditional (serial) algorithm treats the higher score as a refinement of the prior detection. It needs a bit more walkthrough, but I'll probably just separate the pulse results and see whether adding the reduction CPU side is too costly or not. My feeling is that they're rare enough to not have much noticeable performance impact, though if the increased bus consumption becomes a problem, then adding a GPU-side reduction step shouldn't be a huge step. Either way I'll probably end up undoing the (long time ago) nvidia-coded triplets-first rearrangement along the way, restoring the CPU serial final order. The spikes, gaussians, and auto-correlations are somewhat paralleled already anyway, so the synch tap points for these needed reductions are already there. It probably just needs a bit of juggling and an intermediate store. "Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions. |
TBar Send message Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 |
Since a new revision is needed, Downloads have been suspended until the next revision is ready. I think 35 is enough testers for now. If you find something other than the Pulsefind Race problem please report it here. |
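
For readers following the Pulsefind race discussed above, here is a minimal Python sketch of the general idea, not the app's actual CUDA code. It assumes two folds each produce one reportable pulse, and compares a shared "last writer wins" result slot against keeping the per-fold results separate and reducing them afterwards. The `Pulse` type, the function names, the score values and the tie-break on fold index are all hypothetical, chosen only for illustration.

```python
from dataclasses import dataclass
from itertools import permutations
from typing import Iterable, Optional


@dataclass(frozen=True)
class Pulse:
    """Hypothetical stand-in for one reportable pulse candidate."""
    fold: int      # which power-of-2 fold produced it (illustrative only)
    score: float   # detection score; a higher score refines a prior detection


def last_writer_wins(arrivals: Iterable[Pulse]) -> Optional[Pulse]:
    """Model of the race: every fold overwrites one shared result slot."""
    slot = None
    for pulse in arrivals:
        slot = pulse  # whichever fold finishes last decides the outcome
    return slot


def reduce_best(arrivals: Iterable[Pulse]) -> Optional[Pulse]:
    """Keep each fold's result separately, then reduce to the highest score."""
    separate = list(arrivals)
    # Assumed tie-break: on equal scores the lower fold index wins,
    # so the result never depends on completion order.
    return max(separate, key=lambda p: (p.score, -p.fold), default=None)


candidates = [Pulse(fold=2, score=1.04), Pulse(fold=4, score=1.31)]
orders = list(permutations(candidates))  # every possible completion order

racy = {last_writer_wins(order) for order in orders}
reduced = {reduce_best(order) for order in orders}

print("distinct outcomes, shared slot:", len(racy))
print("distinct outcomes, separate + reduce:", len(reduced))
```

With the shared slot, the reported pulse changes with completion order; with separate results and a final reduction it does not. Whether that reduction is better done CPU side or GPU side is the cost question raised in the post above, and this sketch says nothing about it.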