Posts by David Anderson (not *that* DA)


log in
1) Message boards : Number crunching : Ubuntu Linux GPU woes nearing end (Message 1665441)
Posted 20 days ago by Profile David Anderson (not *that* DA)Project donor
Starting in late 2014 Ubuntu users were encountering difficulties getting a working nvidia GPU driver. This was crucial for GPU crunching.

Thousands of machines were listed on Ubuntu bug number 1268257.

The problem was finally understood -- as reported in 1431753.

Most were able to work around this, but...it took a
few commands --- after every new kernel update and reboot.

It's fixed in the most recent Ubuntu and hopefully soon in 14.04,
the current LTS release.
The problem was the nvidia driver setup was (for at least driver 331)
in two parts and the minor kernel build that incorporates such
is not prepared for that (A 2 part driver build
violates the rules for DKMS builds). So (with some randomness
depending on timing on a given new-kernel-install) gpu module builds could fail
with various consequences (rarely dire, but sometimes...).

I'm no kernel expert. This affected me.
Today's new Linux kernel 3.13.0-49 #83 installed for me
with none of the scary messages (which is not proof
that the problem is actually squashed yet, but still it's nice...)

I just wanted to wrap up this issue so anyone else
on 14.04 would know there is light at the end of the tunnel.
2) Message boards : Cafe SETI : Moderation April 1. (Message 1665217)
Posted 21 days ago by Profile David Anderson (not *that* DA)Project donor
So I noticed the moderator rules posting dated April 1 and I thought "Oh this is going to be good." But then I noticed it was just an edit. Oh well :-)
3) Message boards : Number crunching : CLOSED** SETI/BOINC Milestones [ v2.0 ] - XXVIII **CLOSED (Message 1663344)
Posted 25 days ago by Profile David Anderson (not *that* DA)Project donor
What can I say?
msattler is my hero.
4) Message boards : Number crunching : Gripes and Kudos IV (Message 1638417)
Posted 7 Feb 2015 by Profile David Anderson (not *that* DA)Project donor
Gripe: donated to Seti in January, no green star.
Unsure if thats really my fault or ?
Unsure who to contact.
No, it's not an earthshaking issue :-)
5) Message boards : Number crunching : CLOSED** SETI/BOINC Milestones [ v2.0 ] - XXVIII **CLOSED (Message 1637849)
Posted 5 Feb 2015 by Profile David Anderson (not *that* DA)Project donor
50 million boinc.
Nearly 24 million Seti.

Not getting much Seti GPU work. Oh well. :-)
6) Message boards : Number crunching : Invalid Host Messaging (Message 1627559)
Posted 14 Jan 2015 by Profile David Anderson (not *that* DA)Project donor
http://setiathome.berkeley.edu/workunit.php?wuid=1672054677
is another in the questionable series, this one is: 19ap11ad.22063.48525.140733193388041.12.99

Most of us processing that wu got invalid, but one task
is now marked inconclusive, so ... that's probably not
a good sign?
7) Message boards : Number crunching : Invalid Host Messaging (Message 1619490)
Posted 27 Dec 2014 by Profile David Anderson (not *that* DA)Project donor
While I don't know about all results fields, I note that
GPUs being bad often give spike count of 30 so
it appears there is a strong correlation between
the invalid-gpu-results.
Sometimes its various counts sum to 30.

Not random, in other words, but with
a fairly strong pattern.
8) Message boards : Number crunching : No GPU Units Under Linux (Message 1619456)
Posted 27 Dec 2014 by Profile David Anderson (not *that* DA)Project donor
Thanks for this thread. I failed to realize I needed opencl for AP!
Now installed opencl and boinc sees CUDA and opencl.
Ubuntu 14.04, driver 311.113
9) Message boards : Number crunching : Ubuntu 14.04 updates break cuda (Message 1617632)
Posted 23 Dec 2014 by Profile David Anderson (not *that* DA)Project donor
(was out of town a week, just now back).
Enabled Trusty-Backports so I could install
nvidia-modprobe. Installed it and ran it (executable
is the same name).
Made no visible difference in the output of
lsmod|grep nv
but after
sudo /etc/init.d/boinc-client restart
now boinc sees my GPUs and nvidia_uvm suddenly
shows up in the output of
lsmod|grep nv
!.

GPUs at work again!
See Ubuntu bug 1361207 for additional details if you care.
10) Message boards : Number crunching : Ubuntu 14.04 updates break cuda (Message 1614000)
Posted 14 Dec 2014 by Profile David Anderson (not *that* DA)Project donor
For now just living with it. No time
to deal with this.
Plus I could make things much worse... :-)

Anyway, thanks for the tips, ML1.
11) Message boards : Number crunching : Ubuntu 14.04 updates break cuda (Message 1613899)
Posted 14 Dec 2014 by Profile David Anderson (not *that* DA)Project donor
Switched to Noveau driver.
reboot
(Aside: the boot screen is much prettier with Noveau driver
than with nvidia driver, noveau giving blue screen with cute
xubuntu image and fancy progress indicator, nvidia drivers
give boring black/white screen with dots progress indicator)
Removed all nvidia (with synaptic).
reboot
installed nvidia-331
reboot

lsmod:

nvidia-uvm did not get loaded. Did modprobe which
loaded it, but its use count is zero...

No GPU seen by boinc. So this is not a case of
the upgrade mechanism getting mixed up by my specific
history on the the machine.

While Seti has had no CUDA work for me for a while
Einstein has a steady CUDA supply (now those are stuck).
Reports on Ubuntu suggest this bug is maybe fixed(?) for
next release but I see no sign it's been backported to 14.04. Yet.
12) Message boards : Number crunching : Ubuntu 14.04 updates break cuda (Message 1613440)
Posted 13 Dec 2014 by Profile David Anderson (not *that* DA)Project donor
Updated kernel etc today. Minor update
supposedly. Ubuntu 14.04. x86_64
nvidia gtx 760SC (2 of them)
Suddenly CUDA ceased working, boinc
cannot find GPUs. Ubuntu bug 1401350.
Non-working driver: 331.113.
13) Message boards : Number crunching : CLOSED*SETI/BOINC Milestones [ v2.0 ] - XXVII*CLOSED (Message 1567582)
Posted 5 Sep 2014 by Profile David Anderson (not *that* DA)Project donor
50 million for boinc, but Seti is not managing to
get as much of the computing as I would expect.
Solar panels providing the electricity!
14) Message boards : Number crunching : Who uses PCIe extenders; do they work? (Message 1523361)
Posted 1 Jun 2014 by Profile David Anderson (not *that* DA)Project donor
Emboldened by this thread I bought an extender from Hong Kong
($25 shipped) and a second GTX 760 SC and
made a holder (had to mod case of that machine
somewhat).

To my astonishment it is working.
Had the machine off most of the
day getting it assemble so it will take a
while to get to stable results.
Still only at about 50% of the PSU 860W rating.

Not getting any Seti GPU work at present, so the GPUs
are going to Einstein. But hopefully soon will get
more Seti GPU tasks.

So thanks to all for bringing this up and providing
advice. Would never have tried this otherwise.
15) Message boards : Number crunching : empty stderr or GPU 30 counts (Message 1522060)
Posted 28 May 2014 by Profile David Anderson (not *that* DA)Project donor
Here I mean in this forum, not this thread!
16) Message boards : Number crunching : empty stderr or GPU 30 counts (Message 1522059)
Posted 28 May 2014 by Profile David Anderson (not *that* DA)Project donor
Ok. I'll suggest they post here. Thanks.
17) Message boards : Number crunching : empty stderr or GPU 30 counts (Message 1521705)
Posted 28 May 2014 by Profile David Anderson (not *that* DA)Project donor
A few contributors have systems with GPUs that generate a
very high percent of Invalid: either
a) generate many empty-stderr tasks or
b) generate counts of 30 (or that add to 30)
Of course I noticed due to Inconclusive results we share.

It's bothering some of them --- they
responded im PMs saying so, but they don't know what to try and
I found it difficult to understand (from previous threads
that mentioned these things)
what to suggest for either of these issues.

So: suggestions for either sort of problem?
How they should proceed?
18) Message boards : Number crunching : CLOSED - SETI/BOINC Milestones [ v2.0 ] - XXVI - CLOSED (Message 1486837)
Posted 10 Mar 2014 by Profile David Anderson (not *that* DA)Project donor
I'm at 20,000,000 Seti and 30,000,000 combined
as of today...
19) Message boards : Number crunching : confused about boinc vs seti vs app_config.xml vs stock vs lunatics (Message 1473928)
Posted 7 Feb 2014 by Profile David Anderson (not *that* DA)Project donor
Advanced->Read Config Files

I get (dropping timestamps)
Suspending computation - user request
Resuming computation
Re-reading cc_config.xml
Not using a proxy
Config: GUI RPCs allowed from:
log flags: file_xfer, sched_ops,task


I don't see any sign of reading app_config.xml

app_config.xml contains stuff I gleaned
from the board earlier and may not be
correct or all I need to run MB and AP:
<app_config>
<app>
<name>astropulse_v6</name>
<gpu_versions>
<gpu_usage>0.5</gpu_usage>
<cpu_usage>0.4</cpu_usage>
</gpu_versions>
</app>
<app>
<name>setiathome_v7</name>
<gpu_versions>
<gpu_usage>0.5</gpu_usage>
<cpu_usage>.4</cpu_usage>
</gpu_versions>
</app>
</app_config>

Anything else I should have in there or anything
I should fix?

Ahhh. Bad access flags which were: rw-r----- (owned by root).
Changed to read-by-anyone and now on rereading config
the event log adds 'Found app_config.xml' to the output!

I don't have any Seti gpu work on hand, at the moment.
Thank you Fred! You got me thinking.
20) Message boards : Number crunching : confused about boinc vs seti vs app_config.xml vs stock vs lunatics (Message 1473904)
Posted 7 Feb 2014 by Profile David Anderson (not *that* DA)Project donor
I run recent Linux (Ubuntu 13.10) which identifies boinc as 7.2.7.

Stock seti (not Lunatics) which is using the nvidia GTX 760
fine. As is Einstein. Perhaps incorrectly (?) I did not delete the files
from the previous Lunatics 41z install (I did drain the
task list to empty before installing stock).
I renamed my Lunatics-based v6 app_info.xml so it is no longer
noticed.

But I was hoping to use app_config.xml in the seti project
under the boinc /var/lib/boinc-client/projects/seti* directory to
get a couple tasks running the gtx760 SC to see how that went.
But apparently app_config.xml is ignored with stock seti?

Sorry if this sounds confused, but I am confused.
I would like to get more
GPU tasks running (two would be nice, GPU is 3GB ram)
and I don't really understand the right way.

Delete old Lunatics and reinstall that?
(I would drain the task queues with 'no new tasks' button
first).

Or don't I need Lunatics for that control of the GPU with Seti?

Any hints would be much appreciated, and detailed instructions
even more appreciated.


Next 20

Copyright © 2015 University of California