Posts by Eric B


1) Message boards : Number crunching : Trouble with GPU freezing machine (Message 1790127)
Posted 5 days ago by Eric B
OK, I got it! Through trial and error, it seems only the cuda60 app will run.
Too bad, I was hoping to use OpenCL, since the GTX 460 supports it according to the BOINC log:
CUDA: NVIDIA GPU 0: GeForce GTX 460 (driver version unknown, CUDA version 8.0, compute capability 2.1, 964MB, 612MB available, 961 GFLOPS peak)
OpenCL: NVIDIA GPU 0: GeForce GTX 460 (driver version 367.18, device version OpenCL 1.1 CUDA, 964MB, 612MB available, 961 GFLOPS peak)
2) Message boards : Number crunching : Trouble with GPU freezing machine (Message 1790123)
Posted 5 days ago by Eric B
When I try that URL with my ID (5023), I get an empty page except for the SETI headers and footers; no host info.
3) Message boards : Number crunching : Trouble with GPU freezing machine (Message 1790071)
Posted 5 days ago by Eric B
I twigged it around and came up with what seems like a workable app_info.xml. Now I get GPU WUs, but they don't run due to this error:
"Waiting to run (0.05 CPUs + 1 NVIDIA GPU)(Scheduler Wait: Cant read CL file)"

Here's my app_info; if anyone can comment on it, it would be appreciated:
<app_info>
    <app>
        <name>setiathome_v8</name>
    </app>
    <file_info>
        <name>MBv8_8.04r3306_sse42_linux64</name>
        <executable/>
    </file_info>
    <app_version>
        <app_name>setiathome_v8</app_name>
        <version_num>804</version_num>
        <platform>x86_64-pc-linux-gnu</platform>
        <file_ref>
            <file_name>MBv8_8.04r3306_sse42_linux64</file_name>
            <main_program/>
        </file_ref>
    </app_version>
    <app>
        <name>setiathome_v8</name>
    </app>
    <file_info>
        <name>setiathome_8.10_x86_64-pc-linux-gnu__opencl_nvidia_sah</name>
        <executable/>
    </file_info>
    <app_version>
        <app_name>setiathome_v8</app_name>
        <version_num>810</version_num>
        <platform>x86_64-pc-linux-gnu</platform>
        <coproc>
            <type>NVIDIA</type>
            <count>1</count>
        </coproc>
        <plan_class>opencl_nvidia_sah</plan_class>
        <avg_ncpus>0.05</avg_ncpus>
        <max_ncpus>0.2</max_ncpus>
        <cmdline></cmdline>
        <file_ref>
            <file_name>setiathome_8.10_x86_64-pc-linux-gnu__opencl_nvidia_sah</file_name>
            <main_program/>
        </file_ref>
    </app_version>
    <app>
        <name>setiathome_v8</name>
    </app>
    <file_info>
        <name>setiathome_8.10_x86_64-pc-linux-gnu__opencl_nvidia_SoG</name>
        <executable/>
    </file_info>
    <app_version>
        <app_name>setiathome_v8</app_name>
        <version_num>810</version_num>
        <platform>x86_64-pc-linux-gnu</platform>
        <coproc>
            <type>NVIDIA</type>
            <count>1</count>
        </coproc>
        <plan_class>opencl_nvidia_SoG</plan_class>
        <avg_ncpus>0.05</avg_ncpus>
        <max_ncpus>0.2</max_ncpus>
        <cmdline></cmdline>
        <file_ref>
            <file_name>setiathome_8.10_x86_64-pc-linux-gnu__opencl_nvidia_SoG</file_name>
            <main_program/>
        </file_ref>
    </app_version>
    <app>
        <name>setiathome_v8</name>
    </app>
    <file_info>
        <name>setiathome_8.01_x86_64-pc-linux-gnu__cuda60</name>
        <executable/>
    </file_info>
    <app_version>
        <app_name>setiathome_v8</app_name>
        <version_num>801</version_num>
        <platform>x86_64-pc-linux-gnu</platform>
        <coproc>
            <type>NVIDIA</type>
            <count>1</count>
        </coproc>
        <plan_class>cuda60</plan_class>
        <avg_ncpus>0.05</avg_ncpus>
        <max_ncpus>0.2</max_ncpus>
        <cmdline></cmdline>
        <file_ref>
            <file_name>setiathome_8.01_x86_64-pc-linux-gnu__cuda60</file_name>
            <main_program/>
        </file_ref>
    </app_version>
</app_info>
4) Message boards : Number crunching : Trouble with GPU freezing machine (Message 1790046)
Posted 5 days ago by Eric B
I guess it isn't right:
Tue 24 May 2016 07:55:11 AM PDT | SETI@home | Found app_info.xml; using anonymous platform
Tue 24 May 2016 07:55:11 AM PDT | SETI@home | [error] State file error: missing application file setiathome_8.10_x86_64-pc-linux-gnu__opencl_nvidia_SoG
Tue 24 May 2016 07:55:11 AM PDT | SETI@home | [error] State file error: missing application file setiathome_8.01_x86_64-pc-linux-gnu__cuda60
Tue 24 May 2016 07:55:11 AM PDT | SETI@home | [error] State file error: missing application file setiathome_8.10_x86_64-pc-linux-gnu__opencl_nvidia_sah

The apps are there in BOINC/projects/setiathome.berkeley.edu
How do I fix this?
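
One thing worth checking, offered as a rough sketch rather than a confirmed diagnosis: the app_info.xml in this thread that triggers the error has <file_ref> entries for the GPU apps but no matching <file_info> blocks, while the later version that does fetch GPU work declares each one. The Python below lists file names that are referenced but never declared, and optionally those absent from the project directory. The function name check_app_info and the sample file names are made up for illustration.

```python
import xml.etree.ElementTree as ET
from pathlib import Path


def check_app_info(xml_text, project_dir=None):
    """Return (undeclared, missing_on_disk) lists of file names.

    undeclared: named in a <file_ref> but with no matching <file_info>.
    missing_on_disk: referenced but not found in project_dir (skipped if None).
    """
    root = ET.fromstring(xml_text)
    declared = {(fi.findtext("name") or "").strip() for fi in root.iter("file_info")}
    referenced = {(fr.findtext("file_name") or "").strip() for fr in root.iter("file_ref")}
    undeclared = sorted(referenced - declared)
    missing_on_disk = []
    if project_dir is not None:
        missing_on_disk = sorted(
            name for name in referenced if not (Path(project_dir) / name).is_file()
        )
    return undeclared, missing_on_disk


# Hypothetical, cut-down app_info.xml: two apps referenced, only one declared.
SAMPLE = """
<app_info>
  <file_info><name>cpu_app</name><executable/></file_info>
  <app_version><file_ref><file_name>cpu_app</file_name><main_program/></file_ref></app_version>
  <app_version><file_ref><file_name>gpu_app</file_name><main_program/></file_ref></app_version>
</app_info>
"""

undeclared, missing = check_app_info(SAMPLE)
print("referenced but not declared:", undeclared)
```

To run it against a real setup, pass the contents of app_info.xml and the path to the project directory (for example BOINC/projects/setiathome.berkeley.edu) as the second argument.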
5) Message boards : Number crunching : Trouble with GPU freezing machine (Message 1790045)
Posted 5 days ago by Eric B
Does this look right?
<app_info>
    <app>
        <name>setiathome_v8</name>
    </app>
    <file_info>
        <name>MBv8_8.04r3306_sse42_linux64</name>
        <executable/>
    </file_info>
    <app_version>
        <app_name>setiathome_v8</app_name>
        <version_num>804</version_num>
        <platform>x86_64-pc-linux-gnu</platform>
        <cmdline></cmdline>
        <file_ref>
            <file_name>MBv8_8.04r3306_sse42_linux64</file_name>
            <main_program/>
        </file_ref>
    </app_version>
    <app_version>
        <app_name>setiathome_v8</app_name>
        <version_num>810</version_num>
        <platform>x86_64-pc-linux-gnu</platform>
        <coproc>
            <type>NVIDIA</type>
            <count>1</count>
        </coproc>
        <plan_class>opencl_nvidia_SoG</plan_class>
        <avg_ncpus>0.05</avg_ncpus>
        <max_ncpus>0.2</max_ncpus>
        <cmdline></cmdline>
        <file_ref>
            <file_name>setiathome_8.10_x86_64-pc-linux-gnu__opencl_nvidia_SoG</file_name>
            <main_program/>
        </file_ref>
    </app_version>
    <app_version>
        <app_name>setiathome_v8</app_name>
        <version_num>801</version_num>
        <platform>x86_64-pc-linux-gnu</platform>
        <coproc>
            <type>NVIDIA</type>
            <count>1</count>
        </coproc>
        <plan_class>cuda60</plan_class>
        <avg_ncpus>0.05</avg_ncpus>
        <max_ncpus>0.2</max_ncpus>
        <cmdline></cmdline>
        <file_ref>
            <file_name>setiathome_8.01_x86_64-pc-linux-gnu__cuda60</file_name>
            <main_program/>
        </file_ref>
    </app_version>
    <app_version>
        <app_name>setiathome_v8</app_name>
        <version_num>810</version_num>
        <platform>x86_64-pc-linux-gnu</platform>
        <coproc>
            <type>NVIDIA</type>
            <count>1</count>
        </coproc>
        <plan_class>opencl_nvidia_sah</plan_class>
        <avg_ncpus>0.05</avg_ncpus>
        <max_ncpus>0.2</max_ncpus>
        <cmdline></cmdline>
        <file_ref>
            <file_name>setiathome_8.10_x86_64-pc-linux-gnu__opencl_nvidia_sah</file_name>
            <main_program/>
        </file_ref>
    </app_version>
</app_info>
6) Message boards : Number crunching : Trouble with GPU freezing machine (Message 1789917)
Posted 6 days ago by Eric B
3.5 hrs and so far so good after upgrading my NVIDIA driver. I'll know better after 24 hrs or so.
My other PC has the same video card and driver but doesn't get any GPU work at all. Any ideas as to why? Do I need to modify my app_info.xml?

<app_info>
    <app>
        <name>setiathome_v8</name>
    </app>
    <file_info>
        <name>MBv8_8.04r3306_sse42_linux64</name>
        <executable/>
    </file_info>
    <app_version>
        <app_name>setiathome_v8</app_name>
        <version_num>804</version_num>
        <platform>x86_64-pc-linux-gnu</platform>
        <cmdline></cmdline>
        <file_ref>
            <file_name>MBv8_8.04r3306_sse42_linux64</file_name>
            <main_program/>
        </file_ref>
    </app_version>
</app_info>
7) Message boards : Number crunching : Trouble with GPU freezing machine (Message 1789877)
Posted 6 days ago by Eric B
OK, I just checked and I do see a cuda60 app there, setiathome_8.01_x86_64-pc-linux-gnu__cuda60, but it doesn't seem to be used at all.
Here is what is actually running after updating my NVIDIA driver and rebooting:
setiathome_8.05_i686-pc-linux-gnu
setiathome_8.00_x86_64-pc-linux-gnu
setiathome_8.00_x86_64-pc-linux-gnu
setiathome_8.00_x86_64-pc-linux-gnu
setiathome_8.00_x86_64-pc-linux-gnu
setiathome_8.00_x86_64-pc-linux-gnu
setiathome_8.00_x86_64-pc-linux-gnu
setiathome_8.05_i686-pc-linux-gnu
setiathome_8.10_x86_64-pc-linux-gnu__opencl_nvidia_SoG --device 0

Why is 8.05 running there? Is that normal?
Quad core w/HT, so yeah, 8 threads plus a GPU thread, so the count is right, but what's 8.05 vs 8.00?
Anyway, I'll let it run a while and see if it freezes up.
8) Message boards : Number crunching : Trouble with GPU freezing machine (Message 1789781)
Posted 6 days ago by Eric B
What does "-allow running the cuda60 app for linux" mean? How do I do that?
I'll try updating the driver today to the latest NVIDIA version, and if that doesn't fix it I'll downgrade to the 3.11 kernel and see what happens.
9) Message boards : Number crunching : Trouble with GPU freezing machine (Message 1789740)
Posted 6 days ago by Eric B
I noticed that BOINC downloaded and started running GPU tasks on my NVIDIA GTX 460. A quick check shows three crunching programs running:
setiathome_8.00_x86_64-pc-linux-gnu
setiathome_8.05_i686-pc-linux-gnu
setiathome_8.10_x86_64-pc-linux-gnu__opencl_nvidia_SoG --device 0

It goes OK for a while, then my PC freezes and I have to power cycle it.
NVIDIA driver is 352.63.
OS: Linux openSUSE 13.1 running x86_64 on a 4.1.2 kernel, quad-core i7 with 8 GB of DRAM.
The PC is dedicated to SETI and some weather station software (low impact).
What's the story on NVIDIA GPU computing under Linux (with the latest SETI)? Is it finalized now, or are the bugs still being worked out?
Should I update the NVIDIA driver?
Any ideas as to how I can prevent this freeze-up? It's happened twice now, so I have GPU computing suspended.
10) Message boards : Number crunching : Need help - Seti v8 MB units running forever (Message 1769874)
Posted 5 Mar 2016 by Eric B
Everything is working fine now with the Lunatics binary, so it appears to be a bug in the official v8 MB software, wouldn't you say? Although I don't understand why it doesn't affect other machines.
Thanks for everyone's help with this issue. It's much appreciated!
11) Message boards : Number crunching : Need help - Seti v8 MB units running forever (Message 1769873)
Posted 5 Mar 2016 by Eric B
I noticed your host 7011730 that was having the issue looks to be running an older OS, Linux 3.11.10, than your two other machines, which have Linux 4.1.2 & Linux 4.4.3.

Possibly an older lib somewhere mucking up the works for the stock app?


The normal kernel for openSUSE 13.1 is 3.11. In my case, I updated the kernel on those machines myself. I have a 4.4 kernel I'm working on as time permits and will be switching to that on erb1 at some point down the road. erb1 got rebuilt/reformatted about 10 months ago because it had been through too many upgrades and things were getting weird there, so I did a fresh 13.1 install. (I tried 13.2 at that time too, but I didn't feel it was really ready for prime time, so I reformatted and went back to 13.1.)
12) Message boards : Number crunching : Need help - Seti v8 MB units running forever (Message 1769871)
Posted 5 Mar 2016 by Eric B
I tried an experiment this afternoon and installed the Lunatics V8 MB executable just to see if that was going to have the same issue, and so far it hasn't. Of course it's going to take several days to see if this is really working, but it looks promising after 6 hours or so on Lunatics. One thing I do notice with Lunatics, though, and maybe it's nothing: the estimated total time per work unit seems high. Most are 5 hours plus, some are 14, and a few are around 2 hours.

Switching to anonymous platform means the server has to relearn how long it takes to complete tasks for the application. It also happens when switching to a new application like the change from SETI@home v7 to SETI@home v8.
After about 10-11 completed tasks you will see the estimated time for newly received tasks will be lower and lower until it is accurate.

I would guess the 5 and 2 hour tasks are probably VHAR tasks and the 14 hour ones are VLAR tasks.


Yup, you are right; after 12 hours or so the estimated times dropped down to the normal 2.5 hrs I used to see. Good to understand that now, thanks.
13) Message boards : Number crunching : Need help - Seti v8 MB units running forever (Message 1769634)
Posted 5 Mar 2016 by Eric B
I think this looks just like the good old stuck in benchmarks bug.

Eric, could you unhide your hosts or give a link to this one. It would be nice to see the stderr from more tasks.


I just unhid them. The troubled machine is erb1; the others seem fine.
14) Message boards : Number crunching : Need help - Seti v8 MB units running forever (Message 1769633)
Posted 5 Mar 2016 by Eric B
SETI@home preferences | Should SETI@home show your computers on its web site?

Got it! Thanks! They are visible now.
15) Message boards : Number crunching : Need help - Seti v8 MB units running forever (Message 1769632)
Posted 5 Mar 2016 by Eric B
I tried an experiment this afternoon and installed the Lunatics V8 MB executable just to see if that was going to have the same issue, and so far it hasn't. Of course it's going to take several days to see if this is really working, but it looks promising after 6 hours or so on Lunatics. One thing I do notice with Lunatics, though, and maybe it's nothing: the estimated total time per work unit seems high. Most are 5 hours plus, some are 14, and a few are around 2 hours.
16) Message boards : Number crunching : Need help - Seti v8 MB units running forever (Message 1769629)
Posted 5 Mar 2016 by Eric B
I think this looks just like the good old stuck in benchmarks bug.

Eric, could you unhide your hosts or give a link to this one. It would be nice to see the stderr from more tasks.

OK, maybe I'm dense, but for the life of me I can't find the place where you allow others to view your computers/tasks. Is it under account? I've seen it before (a long time ago) but can't find it now.
17) Message boards : Number crunching : Need help - Seti v8 MB units running forever (Message 1769563)
Posted 4 Mar 2016 by Eric B
setiathome_8.00_x86_64-pc-linux-gnu is the main program and needs to be executable, but at 98 bytes, that can't be the real file, just a symlink. Let's assume the real file is executing properly, else nothing at all would be written to stderr.txt

I don't recognise boinc_setiathome_8, and at 266080 bytes it should be significant. Anyone?


It's a BOINC link, not a symlink. In other words, it's an ordinary file that contains a reference. Only BOINC does this; it isn't a standard part of Linux at all.

# cat setiathome_8.00_x86_64-pc-linux-gnu
<soft_link>../../projects/setiathome.berkeley.edu/setiathome_8.00_x86_64-pc-linux-gnu</soft_link>
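
As a small illustration of the point above, here is a hedged Python sketch that reads such a link file and works out where it points. It assumes the path inside <soft_link> is relative to the directory holding the link file, which the ../../projects/... form suggests but which is not confirmed here. The helper names parse_soft_link and resolve_soft_link are invented for this example and are not part of BOINC.

```python
import re
from pathlib import Path

# Matches the single element seen in the file shown above.
SOFT_LINK_RE = re.compile(r"<soft_link>(.*?)</soft_link>", re.DOTALL)


def parse_soft_link(text):
    """Return the path stored in a <soft_link> element, or None if there is none."""
    match = SOFT_LINK_RE.search(text)
    return match.group(1).strip() if match else None


def resolve_soft_link(link_file):
    """Resolve a link file to an absolute path.

    Assumption: the stored path is relative to the link file's own directory.
    Returns None when the file does not contain a <soft_link> element.
    """
    link_path = Path(link_file)
    target = parse_soft_link(link_path.read_text())
    if target is None:
        return None
    return (link_path.parent / target).resolve()


# The exact content printed by the cat command above.
example = (
    "<soft_link>../../projects/setiathome.berkeley.edu/"
    "setiathome_8.00_x86_64-pc-linux-gnu</soft_link>"
)
print(parse_soft_link(example))
```

Calling resolve_soft_link on the link file in the task's working directory would then give the location of the real executable, whose size and permissions are the ones worth checking.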
18) Message boards : Number crunching : Need help - Seti v8 MB units running forever (Message 1769561)
Posted 4 Mar 2016 by Eric B
Brent Norman:
No, not running as admin (or root, as we say in the Linux world), just an ordinary user. I'm not sure I understand your question about x86_64, but that's the CPU mode, not a filesystem thing.
19) Message boards : Number crunching : Need help - Seti v8 MB units running forever (Message 1769554)
Posted 4 Mar 2016 by Eric B
Also, do these file sizes and perms look ok (my username has been replaced with x's)?
# ls -l
total 72
-rw-r--r-- 1 xxxxxxxx users      0 Mar  3 06:02 boinc_lockfile
-rw-rw---- 1 xxxxxxxx users 266080 Mar  3 06:02 boinc_setiathome_8
-rw-r--r-- 1 xxxxxxxx users    100 Mar  2 22:39 graphics_app
-rw-r--r-- 1 xxxxxxxx users   8111 Mar  4 12:14 init_data.xml
-rw-r--r-- 1 xxxxxxxx users    104 Mar  2 22:39 result.sah
-rw-r--r-- 1 xxxxxxxx users     86 Mar  2 22:39 setiathome-8.00_AUTHORS
-rw-r--r-- 1 xxxxxxxx users     86 Mar  2 22:39 setiathome-8.00_COPYING
-rw-r--r-- 1 xxxxxxxx users     88 Mar  2 22:39 setiathome-8.00_COPYRIGHT
-rw-r--r-- 1 xxxxxxxx users     85 Mar  2 22:39 setiathome-8.00_README
-rw-r--r-- 1 xxxxxxxx users     98 Mar  2 22:39 setiathome_8.00_x86_64-pc-linux-gnu
-rw-r--r-- 1 xxxxxxxx users     75 Mar  2 22:39 seti_logo
-rw-r--r-- 1 xxxxxxxx users     77 Mar  2 22:39 sponsor_bkg
-rw-r--r-- 1 xxxxxxxx users     73 Mar  2 22:39 sponsor_logo
-rw-r--r-- 1 xxxxxxxx users   1356 Mar  3 06:02 stderr.txt
-rw-r--r-- 1 xxxxxxxx users   6590 Mar  3 06:02 wisdom.sah
-rw-r--r-- 1 xxxxxxxx users    100 Mar  2 22:39 work_unit.sah
20) Message boards : Number crunching : Need help - Seti v8 MB units running forever (Message 1769550)
Posted 4 Mar 2016 by Eric B
The stderr.txt (in full):

setiathome_v8 8.00 Revision: 3290 g++ (GCC) 4.4.7 20120313 (Red Hat 4.4.7-4)
libboinc: BOINC 7.7.0

Work Unit Info:
...............
WU true angle range is : 0.009480
Optimal function choices:
--------------------------------------------------------
name timing error
--------------------------------------------------------
v_BaseLineSmooth (no other)
v_avxGetPowerSpectrum 0.000065 0.00000
setiathome_v8 8.00 Revision: 3290 g++ (GCC) 4.4.7 20120313 (Red Hat 4.4.7-4)
libboinc: BOINC 7.7.0

Work Unit Info:
...............
WU true angle range is : 0.009480
Optimal function choices:
--------------------------------------------------------
name timing error
--------------------------------------------------------
v_BaseLineSmooth (no other)
v_avxGetPowerSpectrum 0.000111 0.00000
setiathome_v8 8.00 Revision: 3290 g++ (GCC) 4.4.7 20120313 (Red Hat 4.4.7-4)
libboinc: BOINC 7.7.0

Work Unit Info:
...............
WU true angle range is : 0.009480
Optimal function choices:
--------------------------------------------------------
name timing error
--------------------------------------------------------
v_BaseLineSmooth (no other)
v_avxGetPowerSpectrum 0.000101 0.00000



Copyright © 2016 University of California