Linux CUDA 'Special' App finally available, featuring Low CPU use

Message boards : Number crunching : Linux CUDA 'Special' App finally available, featuring Low CPU use
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 3 · 4 · 5 · 6 · 7 · 8 · 9 . . . 83 · Next

AuthorMessage
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1837608 - Posted: 23 Dec 2016, 23:23:17 UTC - in response to Message 1837601.  
Last modified: 23 Dec 2016, 23:26:02 UTC

Where did you place the BOINC folder? If you would have placed it in your Home folder, as per instructions, you wouldn't need any special permissions.
https://boinc.berkeley.edu/wiki/Installing_BOINC#The_Berkeley_Installer
Here is an example. The archive is downloaded to the desktop. It is then moved to the home directory (~). Finally BOINC is unpacked and installed. All of this can be done within a regular user account; root privileges are not needed.
This creates a directory called BOINC/ under the home directory containing the following files...

Actually, those instructions are a little off. You can simply drag the download to your Home folder, and after the dependency is installed just double click on 'The BOINC Manager' to launch it.
With the BOINC folder in your Home folder, Ubuntu is about the easiest system to configure BOINC on. No Mac permissions, and No Windows hidden folders.
ID: 1837608 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1837618 - Posted: 23 Dec 2016, 23:58:36 UTC - in response to Message 1837608.  

BOINC didn't ask were to go (I don't think) It's living in var/lib/boinc-client at the moment, with 'BOINC core client' as the owner.

I will run out my tasks and try a reinstall and see it I can 'bring it home' :)
ID: 1837618 · Report as offensive
qbit
Volunteer tester
Avatar

Send message
Joined: 19 Sep 04
Posts: 630
Credit: 6,868,528
RAC: 0
Austria
Message 1837628 - Posted: 24 Dec 2016, 1:00:55 UTC
Last modified: 24 Dec 2016, 1:07:24 UTC

Brent, you don't have to install it. Just download from the boinc site, unpack it and move the whole folder to your home directory (or move first and unpack then). Then run the manager from terminal. At least that's how I did it and it runs fine.
ID: 1837628 · Report as offensive
Rockhount
Avatar

Send message
Joined: 29 May 00
Posts: 34
Credit: 31,935,954
RAC: 29
Germany
Message 1838107 - Posted: 27 Dec 2016, 11:22:39 UTC - in response to Message 1837165.  

Hi Folks,
update from my machine with linux special cuda 6 app.
RAC is up to 8900 and still rainsing.
Everything is running fine with this app.

My RAC is still raising too.
Started on 14th dec at 3100cr/day with my i3-2100T now with the 750ti is runs 6800cr/day (reduced to 18h daily)

https://setiathome.berkeley.edu/show_host_detail.php?hostid=1931980

Regards from nothern Germany
Roman

SETI@home classic workunits 207,059
SETI@home classic CPU time 1,251,095 hours

ID: 1838107 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1838112 - Posted: 27 Dec 2016, 12:17:05 UTC - in response to Message 1838107.  

Good to hear Rock, I have Ubuntu 14.04 running now and just burning off these stock tasks before attempting the special app.

Let's make the 1070 and 980 smoke :)

Well the 980 is a hybrid, it will be fine :)
ID: 1838112 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1838265 - Posted: 28 Dec 2016, 15:07:27 UTC

Is the Cuda info not showing a driver version a concern before switching to the new apps?
I have been running Cuda60 and SoG on stock apps with 1 new Invalid against SoG

Invalid

Wed 28 Dec 2016 06:12:52 AM CST | | CUDA: NVIDIA GPU 0: GeForce GTX 1070 (driver version unknown, CUDA version 8.0, compute capability 6.1, 4096MB, 3992MB available, 10204 GFLOPS peak)
Wed 28 Dec 2016 06:12:52 AM CST | | CUDA: NVIDIA GPU 1: GeForce GTX 980 (driver version unknown, CUDA version 8.0, compute capability 5.2, 4038MB, 3898MB available, 8245 GFLOPS peak)
Wed 28 Dec 2016 06:12:52 AM CST | | OpenCL: NVIDIA GPU 0: GeForce GTX 1070 (driver version 367.57, device version OpenCL 1.2 CUDA, 8113MB, 3992MB available, 10204 GFLOPS peak)
Wed 28 Dec 2016 06:12:52 AM CST | | OpenCL: NVIDIA GPU 1: GeForce GTX 980 (driver version 367.57, device version OpenCL 1.2 CUDA, 4038MB, 3898MB available, 8245 GFLOPS peak)


Heat is certainly a problem in this mid tower case :(
ID: 1838265 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1838284 - Posted: 28 Dec 2016, 16:25:58 UTC - in response to Message 1838265.  
Last modified: 28 Dec 2016, 16:32:01 UTC

None of the Linux BOINC Apps posted by Berkeley show the CUDA driver version. It hasn't been a problem yet, so, I don't expect it ever will. The CUDA driver version is shown in the later versions of BOINC, only available from the Linux repositories, it's the same as the OpenCL version anyway and that is shown in the earlier BOINC versions.
The current Windows OpenCL App on Main has a known problem with Overflows, the Baseline CUDA App doesn't have that problem, so, I'd say you were probably Robbed on that Invalid Overflow. Too bad it's too late to grab the task and test it on a CPU. It's an Overflow though, and will Never be used by SETI to look for ET or anything else, so, it doesn't really matter. The Newer Windows OpenCL App handles the Overflows better, but as with the New Mac nVidia OpenCL App, it's still stuck at Beta. Maybe soon the Newer Apps will make it to Main.
ID: 1838284 · Report as offensive
qbit
Volunteer tester
Avatar

Send message
Joined: 19 Sep 04
Posts: 630
Credit: 6,868,528
RAC: 0
Austria
Message 1838297 - Posted: 28 Dec 2016, 17:26:10 UTC

Does anybody know why Boinc reports my VRAM with 1024MB on Windows but only 977MB on Linux?
ID: 1838297 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1838332 - Posted: 28 Dec 2016, 20:39:56 UTC - in response to Message 1838297.  

When you look at NVIDIA Settings in Windows what does it show for Total Memory and Total Dedicated Memory?
I'm guessing your Linux NVIDIA Settings is showing 1024 for Total and around 977 for Total Dedicated Memory, that's the way it is on my nVidia cards anyway. BOINC is not showing the Total Memory.
ID: 1838332 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1838348 - Posted: 28 Dec 2016, 21:46:50 UTC
Last modified: 28 Dec 2016, 22:02:06 UTC

Where can I find the AP application files that the sample app_info in the CUDA60 download refer to?

Looking for:
astropulse_7.08_x86_64-pc-linux-gnu__opencl_nvidia_100
AstroPulse_Kernels_r2751.cl

I found the CPU version, but not for GPU

EDIT: Found them, I should have searched the forum, Duhh
ID: 1838348 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1838434 - Posted: 29 Dec 2016, 5:07:40 UTC

What was I thinking? why didn't I put ny motherboard in the new case which has much much better cooling, and would have been easier to do - tomorrows project.

Got the code purring away GPU temps up 5C, 980@43C, 1070@83C

Runtimes are nice ~ 6:20 on 980, 7:20 on 1070.

Everything really went well other than all the problems with my i7, the i5 works great. And new motherboard ordered, likely Tuesday at the earliest for that.

Now for more reading to try and find some fine tuning.
ID: 1838434 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1838453 - Posted: 29 Dec 2016, 6:41:11 UTC - in response to Message 1838434.  
Last modified: 29 Dec 2016, 6:42:17 UTC

Nicer on Arecibo tasks 4:20 on 980, 4:40 on 1070.

And haven't see the CPU over 7% for 2 tasks.
ID: 1838453 · Report as offensive
qbit
Volunteer tester
Avatar

Send message
Joined: 19 Sep 04
Posts: 630
Credit: 6,868,528
RAC: 0
Austria
Message 1838511 - Posted: 29 Dec 2016, 13:31:16 UTC - in response to Message 1838332.  
Last modified: 29 Dec 2016, 13:31:54 UTC

When you look at NVIDIA Settings in Windows what does it show for Total Memory and Total Dedicated Memory?
I'm guessing your Linux NVIDIA Settings is showing 1024 for Total and around 977 for Total Dedicated Memory, that's the way it is on my nVidia cards anyway. BOINC is not showing the Total Memory.

Hm, I would have to stop crunching on Linux to boot into Windows. Don't wanna do that right now but from your answer it looks like it's not uncommon and therefore not a problem.

Small Update after a few days with the new app: Everything's fine, have crunched 800+ tasks sucessfully, no Error, no Invalid. RAC was ~5800 in the last days running openCL, currently it's ~7700 and still rising. Nice work, TBar & Petri!
ID: 1838511 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1838668 - Posted: 29 Dec 2016, 23:20:37 UTC

My Inconclusives seem just a tad high :(
State: All (504) · In progress (271) · Validation pending (59) · Validation inconclusive (14) · Valid (159) · Invalid (0) · Error (1)
ID: 1838668 · Report as offensive
qbit
Volunteer tester
Avatar

Send message
Joined: 19 Sep 04
Posts: 630
Credit: 6,868,528
RAC: 0
Austria
Message 1838669 - Posted: 29 Dec 2016, 23:30:53 UTC

That's ok, I had 22 for some time, all valid until now. [/url]
ID: 1838669 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1838676 - Posted: 29 Dec 2016, 23:53:02 UTC - in response to Message 1838669.  

Yea it's OK in the end, but if 10-20% of task need to be rechecked that does seem a little counter productive project wise.
ID: 1838676 · Report as offensive
qbit
Volunteer tester
Avatar

Send message
Joined: 19 Sep 04
Posts: 630
Credit: 6,868,528
RAC: 0
Austria
Message 1838685 - Posted: 30 Dec 2016, 0:11:35 UTC
Last modified: 30 Dec 2016, 0:12:28 UTC

Sure, but since our tasks are valid it's not a problem with this app here.

I think a problem exists with the apple darwin apps, I see those a lot in my inconclusives.
ID: 1838685 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 1838792 - Posted: 30 Dec 2016, 12:00:47 UTC - in response to Message 1838685.  

Sure, but since our tasks are valid it's not a problem with this app here.

I think a problem exists with the apple darwin apps, I see those a lot in my inconclusives.

Don't forget that credit granting and ultimate validation is quite "forgiving". Task will be counted as valid even if some signal it reports isn't. Provided there are many other valid signals + invalid one in "best" area.
So, ultimate validation doesn't mean that all is OK.

Actually, we not in the stage to decide if app is OK or not. This stage already passed. Jason confirmed missing parts in Pulse signal reduction stages. So we know app is broken in that part.
Low invalids ratio just ensures that bug manifests itself in quite low number of cases. Actually it will show up only if there are few reportable Pulses in same PoT. Cause even one such pulse is rare event having 2 in same PoT and in such order that incorrect one will be reported - even more rare event.
SETI apps news
We're not gonna fight them. We're gonna transcend them.
ID: 1838792 · Report as offensive
Profile jason_gee
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 24 Nov 06
Posts: 7489
Credit: 91,093,184
RAC: 0
Australia
Message 1838843 - Posted: 30 Dec 2016, 14:44:39 UTC - in response to Message 1838792.  
Last modified: 30 Dec 2016, 14:47:12 UTC

Sure, but since our tasks are valid it's not a problem with this app here.

I think a problem exists with the apple darwin apps, I see those a lot in my inconclusives.

Don't forget that credit granting and ultimate validation is quite "forgiving". Task will be counted as valid even if some signal it reports isn't. Provided there are many other valid signals + invalid one in "best" area.
So, ultimate validation doesn't mean that all is OK.

Actually, we not in the stage to decide if app is OK or not. This stage already passed. Jason confirmed missing parts in Pulse signal reduction stages. So we know app is broken in that part.
Low invalids ratio just ensures that bug manifests itself in quite low number of cases. Actually it will show up only if there are few reportable Pulses in same PoT. Cause even one such pulse is rare event having 2 in same PoT and in such order that incorrect one will be reported - even more rare event.


A bit deeper digging verified a race on the power of 2 folds, which are running in parallel onto a single result set. So when a reportable pulse is detected in one fold, and higher score in the other, without synchronisation it's a coin toss as to which pulse is recorded (as observed).

The traditional (serial) algorithm treats the higher score as a refinement of the prior detection. Needs a bit more walkthrough, but probably i'll just separate the pulse results, and see if adding the reduction CPU side is too costly or not. My feeling is that they're rare enough to not have much noticeable performance impact, though if the increased bus consumption becomes a problem, then adding a GPU side reduction step shouldn't be a huge step.

Either way I'll probably end up undoing the (long time ago) nvidia coded triplets-first rearrangement along the way, restoring to CPU serial final order. The spikes, gaussians, and auto-correlations are somewhat paralleled already anyway, so the synch tap points for
these needed reductions are already there. Just probably need a bit of juggling and an intermediate store.
"Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions.
ID: 1838843 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1839500 - Posted: 1 Jan 2017, 23:43:34 UTC

Since a new revision is needed, Downloads have been suspended until the next revision is ready.
I think 35 is enough testers for now. If you find something other than the Pulsefind Race problem please report it here.
ID: 1839500 · Report as offensive
Previous · 1 . . . 3 · 4 · 5 · 6 · 7 · 8 · 9 . . . 83 · Next

Message boards : Number crunching : Linux CUDA 'Special' App finally available, featuring Low CPU use


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.