Posts by Woodgie


log in
1) Message boards : Number crunching : Dual GPU, One Crunching, One Not. (Message 1639263)
Posted 17 days ago by Profile WoodgieProject donor
Update:

The Good news: Installed the 7870 drive (14.501.1003.0), BOINC is happy now crunching.

The Bad news: Windows now doesn't like the 4870 and have stopped the device (Windows has stopped this device because it has reported problems. (Code 43))

Same error as before.

So the main issue seems to be installing drivers for both GPU's. As Catalysts 14.***** doesn't recognize the 4870.

Hmmmmmmm


(Standard caveat of "I'm IN NO WAY a Windows person")

Code 43 is what I got with mine and my research said it was either a driver issue or a hardware issue (helpful, huh?)

SO I did a bit of slot swapping etc and determined it was indeed a hardware issue. With you though I have to say it looks like a driver issue where the two different versions required for the two different cards trip over each other.

So far not helpful and nothing you didn't already know.

However maybe the VirtualBox version of BOINC might help here. I say MAYBE as I have no idea HOW it is set up to run. My thinking is along the lines that you might be able to set Windows up to use the card you want (4870?) and then 'pass' the 7870 to VirtualBox/BOINC.

This would depend on a great many things which I have no understanding of, like exactly how BOINC uses VirttualBox, is is a full OS (linux?) a cut down OS, is it even set up to understand hardware itself or does it just ask the host OS what's available and trust it?

All of which is an awful lot of 'maybe'.
2) Message boards : Number crunching : Dual GPU, One Crunching, One Not. (Message 1638898)
Posted 18 days ago by Profile WoodgieProject donor
EDIT: OK, I re-read the thread and the talk about drivers has only just sunk in. This isn't what you asked at all, sorry! Still, I'll leave it here out of interest and it might help someone else.

This stems from the fact that BOINC by default only uses the more powerful card in a multi-gpu system.

Have a look at my thread from about here.

You need to create a cc_config.xml file which lives in your BOINC Data folder. The easiest way to do this, if it doesn't already exit, is to go to Advanced>Event Logs Diagnostic Flags... and select any of the other options, click OK.

Then quit BOINC (actually exit it, don't just close the window) and navigate to the BOINC Data folder where you should now see the cc_config.xml file. Open it in Notepad and scroll until you see the line:
<use_all_gpus>0</use_all_gpus>

Change it to read:
<use_all_gpus>1</use_all_gpus>

And save the file.

You'll now need to reboot your computer for BOINC to pick up the change. Check in the Event Log (Advanced>Event Log) that you see the line:
11-Nov-2014 05:31:12 [---] Config: use all coprocessors

You should also see that BOINC is running GPU tasks on both 'device 0' and 'device 1'

Lastly, you can uncheck the checkbox you checked earlier to reduce the amount of logging BOINC has to do.

That SHOULD be all you need to do.
3) Message boards : Number crunching : Suddenly Missing GPU (nVidia driver update issue?) (Message 1638204)
Posted 20 days ago by Profile WoodgieProject donor
In case anyone is searching the forums, I think it's worth a mention as to what I think has been happening with my 'disappearing' settings in the cc_config.xml file.

I'm pretty certain what was happening was that it was only the <use_all_gpus> line which I added by hand was being removed when using the GUI to update settings.

I'd guess this is because BOINC decided (correctly) that line was superfluous due to the fact there was a <use_all_gpus> line already in the file in the 'proper' place, set to 0, which to me (being blind and not noticing it) made it look as if the key was being removed completely.

Here's the tip, kids: READ and read it again. And if it still doesn't make sense ask the rubber duck (or in my case, the coconut)

I hope this thread helps others as it's helped me. Thanks to everyone who contributed.
4) Message boards : Number crunching : Suddenly Missing GPU (nVidia driver update issue?) (Message 1638176)
Posted 20 days ago by Profile WoodgieProject donor
Well I never, you're right! Let's check the current config on the host...

Yep, for some reason there's a second <use_all_gpus> set to 0!

Changing
...
Rebooting
...

OK, that's now all working properly. I'm embarrassed and should learn to read. I honestly didn't see that there!

Paul, you're a star, thank you very very much. I owe you a
<beverage_of_choice>Beer|Wine|Other</beverage_of_choice>

should you ever be in London and at a loose end.

Now, where DID I put my reading glasses?
5) Message boards : Number crunching : Suddenly Missing GPU (nVidia driver update issue?) (Message 1638159)
Posted 20 days ago by Profile WoodgieProject donor
I've reported these Bugs to the boinc_alpha list, expect it to be fixed in Boinc 7.4.42 or later.

Claggy


Thank you very much. Do you want any more system details? I'm always more than happy to help.
6) Message boards : Number crunching : Suddenly Missing GPU (nVidia driver update issue?) (Message 1638152)
Posted 20 days ago by Profile WoodgieProject donor

The Config: use all coprocessors line isn't showing up, I tried setting it on my i5-3210M/GT650M/Intel_Graphics_HD4000 host, and it isn't displayed eithier,
and using the Diagnostic Flags option deletes the <use_all_gpus>1</use_all_gpus> line from the cc_config.xml:

<snip>

This is what it should look like:

http://setiathome.berkeley.edu/forum_thread.php?id=75931&postid=1599863
11-Nov-2014 05:31:12 [---] CUDA: NVIDIA GPU 0: GeForce GTX 780 (driver version 344.60, CUDA version 6.50, compute capability 3.5, 3072MB, 2937MB available, 4636 GFLOPS peak)
11-Nov-2014 05:31:12 [---] CUDA: NVIDIA GPU 1: GeForce GTX 780 (driver version 344.60, CUDA version 6.50, compute capability 3.5, 4096MB, 4096MB available, 4698 GFLOPS peak)
11-Nov-2014 05:31:12 [---] OpenCL: NVIDIA GPU 0: GeForce GTX 780 (driver version 344.60, device version OpenCL 1.1 CUDA, 3072MB, 2937MB available, 4636 GFLOPS peak)
11-Nov-2014 05:31:12 [---] OpenCL: NVIDIA GPU 1: GeForce GTX 780 (driver version 344.60, device version OpenCL 1.1 CUDA, 6144MB, 4096MB available, 4698 GFLOPS peak)
11-Nov-2014 05:31:12 [SETI@home] Found app_info.xml; using anonymous platform
11-Nov-2014 05:31:12 [---] Config: use all coprocessors


Claggy


Nice to know I'm not going completely mad!

It's truly fantastic that you lot are willing to actually get 'down and dirty' and test on your own machines to help troubleshoot my problems, humbling is what it is.

Thanks.
7) Message boards : Number crunching : Suddenly Missing GPU (nVidia driver update issue?) (Message 1638139)
Posted 20 days ago by Profile WoodgieProject donor
Just in case - what are you using to make changes to cc_config.xml? Although 'typed' as an XML file, it should be treated as a plain ASCII text file and edited as such. Don't use a full-feature XML or Unicode editor - notepad will do.


Notepad is indeed what I'm using to edit the file. Though that is useful info, thanks!
8) Message boards : Number crunching : Suddenly Missing GPU (nVidia driver update issue?) (Message 1638133)
Posted 20 days ago by Profile WoodgieProject donor
(And I did re-boot the host after adding <use_all_gpus>)
9) Message boards : Number crunching : Suddenly Missing GPU (nVidia driver update issue?) (Message 1638132)
Posted 20 days ago by Profile WoodgieProject donor
It is worth noting, however, that once the cc_confix.xml file has been created, any change to Advanced>Event Log Diagnostic Flags... will overwrite the file and therefore nix any hand edits made to it.

It makes sense but just a heads up to the unwary.

Again, not quite true. Using the log flags GUI will re-write the file in its entirety, true - but it should write the current working state of the file, including any hand edits previously read in and acted on at startup. Mine still has an edited option I first set years ago, which has stayed in place through many, many BOINC version upgrades and (more recently) use of the log flag dialog.

Your cc_config looks OK, including the placement of <use_all_gpus>1</use_all_gpus> - but I may not have had enough coffee yet. You are aware, I presume, that <use_all_gpus> is one of the few flags which requires a full client restart to become active - most of the other options (and all the log flags) can be re-read and become active while BOINC is running.

I was going to suggest making a small benign change to cc_config manually, and verifying that it is reflected properly in the startup log: <sched_op_debug>1</sched_op_debug> logging is a handy one, and can be turned off again if you don't need it.


Interesting, maybe it's indicative of something else that when I make changes to Advanced>etc... it does indeed nix the file in favour of a new one.

I will do as you say to work the problem the other way around. Thanks.

Edit to add:
I changed </sched_op_debug> and indeed the change was picked up in the logs and the GUI. Turning the option off (unchecking the now checked box in the GUI) definitely overwrote the <use_all_gpus> key.
10) Message boards : Number crunching : Suddenly Missing GPU (nVidia driver update issue?) (Message 1638108)
Posted 20 days ago by Profile WoodgieProject donor
It is worth noting, however, that once the cc_confix.xml file has been created, any change to Advanced>Event Log Diagnostic Flags... will overwrite the file and therefore nix any hand edits made to it.

It makes sense but just a heads up to the unwary.
11) Message boards : Number crunching : Suddenly Missing GPU (nVidia driver update issue?) (Message 1638107)
Posted 20 days ago by Profile WoodgieProject donor
OK, update:

It's the card, not the slot or power. Which sucks massively as that was not a cheap card, even though I got it second hand.

Which might explain a lot, it's been crunching SETI@Home pretty much 24/7 for a few days short of a whole year!

Well, now I just need to get cc_config.xml to honour <use_all_gpus> and I can use the TITAN and the 750ti together. It's not quite as good as 2 TITANS but not too shoddy :) But there's the thing, It doesn't seem to be honouring it and with the 750ti in it uses it in favour of the TITAN.

So the obvious next step is: What's going on here? I made the cc_config.xml file by using Advanced>Event Log Diagnostic Flags... to add an element. This created cc_config.xml in the right place and with the right format. Then I hand edited the file to add the <use_all_gpus> key.

Here's what my cc_config.xml looks like:
<cc_config> <log_flags> <file_xfer>1</file_xfer> <sched_ops>1</sched_ops> <task>1</task> <android_debug>0</android_debug> <app_msg_receive>0</app_msg_receive> <app_msg_send>0</app_msg_send> <async_file_debug>0</async_file_debug> <benchmark_debug>0</benchmark_debug> <checkpoint_debug>0</checkpoint_debug> <coproc_debug>0</coproc_debug> <cpu_sched>0</cpu_sched> <cpu_sched_debug>0</cpu_sched_debug> <cpu_sched_status>0</cpu_sched_status> <dcf_debug>0</dcf_debug> <disk_usage_debug>0</disk_usage_debug> <file_xfer_debug>0</file_xfer_debug> <gui_rpc_debug>0</gui_rpc_debug> <heartbeat_debug>0</heartbeat_debug> <http_debug>0</http_debug> <http_xfer_debug>0</http_xfer_debug> <mem_usage_debug>0</mem_usage_debug> <network_status_debug>0</network_status_debug> <notice_debug>0</notice_debug> <poll_debug>0</poll_debug> <priority_debug>0</priority_debug> <proxy_debug>0</proxy_debug> <rr_simulation>0</rr_simulation> <rrsim_detail>0</rrsim_detail> <sched_op_debug>0</sched_op_debug> <scrsave_debug>0</scrsave_debug> <slot_debug>0</slot_debug> <state_debug>0</state_debug> <statefile_debug>0</statefile_debug> <suspend_debug>0</suspend_debug> <task_debug>0</task_debug> <time_debug>0</time_debug> <trickle_debug>0</trickle_debug> <unparsed_xml>0</unparsed_xml> <work_fetch_debug>0</work_fetch_debug> </log_flags> <options> <abort_jobs_on_exit>0</abort_jobs_on_exit> <allow_multiple_clients>0</allow_multiple_clients> <allow_remote_gui_rpc>0</allow_remote_gui_rpc> <client_version_check_url>http://boinc.berkeley.edu/download.php?xml=1</client_version_check_url> <client_new_version_text></client_new_version_text> <client_download_url>http://boinc.berkeley.edu/download.php</client_download_url> <disallow_attach>0</disallow_attach> <dont_check_file_sizes>0</dont_check_file_sizes> <dont_contact_ref_site>0</dont_contact_ref_site> <dont_use_vbox>0</dont_use_vbox> <exit_after_finish>0</exit_after_finish> <exit_before_start>0</exit_before_start> <exit_when_idle>0</exit_when_idle> <fetch_minimal_work>0</fetch_minimal_work> <fetch_on_update>0</fetch_on_update> <force_auth>default</force_auth> <http_1_0>0</http_1_0> <http_transfer_timeout>300</http_transfer_timeout> <http_transfer_timeout_bps>10</http_transfer_timeout_bps> <max_event_log_lines>2000</max_event_log_lines> <max_file_xfers>8</max_file_xfers> <max_file_xfers_per_project>2</max_file_xfers_per_project> <max_stderr_file_size>0</max_stderr_file_size> <max_stdout_file_size>0</max_stdout_file_size> <max_tasks_reported>0</max_tasks_reported> <ncpus>-1</ncpus> <network_test_url>http://www.google.com/</network_test_url> <no_alt_platform>0</no_alt_platform> <no_gpus>0</no_gpus> <use_all_gpus>1</use_all_gpus> <no_info_fetch>0</no_info_fetch> <no_priority_change>0</no_priority_change> <os_random_only>0</os_random_only> <proxy_info> <socks_server_name></socks_server_name> <socks_server_port>80</socks_server_port> <http_server_name></http_server_name> <http_server_port>80</http_server_port> <socks5_user_name></socks5_user_name> <socks5_user_passwd></socks5_user_passwd> <http_user_name></http_user_name> <http_user_passwd></http_user_passwd> <no_proxy></no_proxy> </proxy_info> <rec_half_life_days>10.000000</rec_half_life_days> <report_results_immediately>0</report_results_immediately> <run_apps_manually>0</run_apps_manually> <save_stats_days>30</save_stats_days> <skip_cpu_benchmarks>0</skip_cpu_benchmarks> <simple_gui_only>0</simple_gui_only> <start_delay>0.000000</start_delay> <stderr_head>0</stderr_head> <suppress_net_info>0</suppress_net_info> <unsigned_apps_ok>0</unsigned_apps_ok> <use_all_gpus>0</use_all_gpus> <use_certs>0</use_certs> <use_certs_only>0</use_certs_only> <vbox_window>0</vbox_window> </options> </cc_config>


And here's the startup log:
06/02/2015 09:11:22 | | Starting BOINC client version 7.4.36 for windows_x86_64 06/02/2015 09:11:22 | | log flags: file_xfer, sched_ops, task 06/02/2015 09:11:22 | | Libraries: libcurl/7.39.0 OpenSSL/1.0.1j zlib/1.2.8 06/02/2015 09:11:22 | | Data directory: C:\ProgramData\BOINC 06/02/2015 09:11:22 | | Running under account william 06/02/2015 09:11:22 | | CUDA: NVIDIA GPU 0 (not used): GeForce GTX TITAN (driver version 347.25, CUDA version 7.0, compute capability 3.5, 4096MB, 4096MB available, 4707 GFLOPS peak) 06/02/2015 09:11:22 | | CUDA: NVIDIA GPU 1: GeForce GTX 750 Ti (driver version 347.25, CUDA version 7.0, compute capability 5.0, 2048MB, 1947MB available, 1388 GFLOPS peak) 06/02/2015 09:11:22 | | OpenCL: NVIDIA GPU 0 (not used): GeForce GTX TITAN (driver version 347.25, device version OpenCL 1.1 CUDA, 6144MB, 4096MB available, 4707 GFLOPS peak) 06/02/2015 09:11:22 | | OpenCL: NVIDIA GPU 1: GeForce GTX 750 Ti (driver version 347.25, device version OpenCL 1.1 CUDA, 2048MB, 1947MB available, 1388 GFLOPS peak) 06/02/2015 09:11:22 | SETI@home | Found app_info.xml; using anonymous platform 06/02/2015 09:11:22 | | Host name: outlander 06/02/2015 09:11:22 | | Processor: 8 GenuineIntel Intel(R) Core(TM) i7-4770K CPU @ 3.50GHz [Family 6 Model 60 Stepping 3] 06/02/2015 09:11:22 | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 fma cx16 sse4_1 sse4_2 movebe popcnt aes f16c rdrandsyscall nx lm avx avx2 vmx tm2 pbe fsgsbase bmi1 smep bmi2 06/02/2015 09:11:22 | | OS: Microsoft Windows 7: Ultimate x64 Edition, Service Pack 1, (06.01.7601.00) 06/02/2015 09:11:22 | | Memory: 15.94 GB physical, 31.88 GB virtual 06/02/2015 09:11:22 | | Disk: 465.54 GB total, 381.46 GB free 06/02/2015 09:11:22 | | Local time is UTC +0 hours 06/02/2015 09:11:22 | | Config: GUI RPCs allowed from: <snip> 06/02/2015 09:11:22 | SETI@home | URL http://setiathome.berkeley.edu/; Computer ID 7225736; resource share 100 06/02/2015 09:11:22 | SETI@home | General prefs: from SETI@home (last modified 21-Jan-2015 19:19:02) 06/02/2015 09:11:22 | SETI@home | Computer location: home 06/02/2015 09:11:22 | | General prefs: using separate prefs for home 06/02/2015 09:11:22 | | Reading preferences override file 06/02/2015 09:11:22 | | Preferences: 06/02/2015 09:11:22 | | max memory usage when active: 1632.23MB 06/02/2015 09:11:22 | | max memory usage when idle: 13057.86MB 06/02/2015 09:11:22 | | max disk usage: 11.00GB 06/02/2015 09:11:22 | | suspend work if non-BOINC CPU load exceeds 75% 06/02/2015 09:11:22 | | (to change preferences, visit a project web site or select Preferences in the Manager) 06/02/2015 09:11:22 | | Not using a proxy 06/02/2015 09:11:22 | SETI@home | Sending scheduler request: To fetch work. 06/02/2015 09:11:22 | SETI@home | Requesting new tasks for CPU and NVIDIA GPU 06/02/2015 09:11:25 | SETI@home | Scheduler request completed: got 0 new tasks 06/02/2015 09:11:25 | SETI@home | No tasks sent 06/02/2015 09:11:25 | SETI@home | No tasks are available for SETI@home Enhanced 06/02/2015 09:11:25 | SETI@home | No tasks are available for SETI@home v7 06/02/2015 09:11:25 | SETI@home | No tasks are available for AstroPulse v6 06/02/2015 09:11:25 | SETI@home | No tasks are available for AstroPulse v7 06/02/2015 09:11:25 | SETI@home | This computer has reached a limit on tasks in progress


As we can see, the TITAN in slot 0 isn't being used even though cc_config.xml is set to use all.

I really do appreciate the help, people.
12) Message boards : Number crunching : Suddenly Missing GPU (nVidia driver update issue?) (Message 1637814)
Posted 21 days ago by Profile WoodgieProject donor
No one mentioned it yet so I'll go ahead and ask.
Does your cc_config.xml contain this setting? <use_all_gpus>1</use_all_gpus>

If they were both working & then stopped I wouldn't imagine this would be the issue, but it never hurts to ask.


So the first line in the logs after a start up is:
05/02/2015 17:48:39 | | cc_config.xml not found - using defaults

And to be honest I've never seen one so... Time to look into that.


I did not know that and indeed in the logs the one working TITAN was marked as 'Not Used'. So that would explain that. Thank you very much.

Is it possible to make BOINC use all GPUs present? A key in one of the many indecipherable xml files perchance?

Also, I have never seen any documentation regarding BOINCs xml files. Does anyone know anything out there describing the keys and their functions? My Google-fu is weak on this.

What does the Boinc startup say in the Event Log, can you post it please.

All the Docs are in the User manual, you can get to it by going in Boinc Manager: Help>Boinc Help, The user manaual is one of the resources on the left:

http://boinc.berkeley.edu/wiki/User_manual

Claggy


Thanks Claggy, I'll look in the obvious place I should already have looked (maybe I did and my eyes worn't working?)

Here's the full startup:
05/02/2015 17:48:39 | | cc_config.xml not found - using defaults 05/02/2015 17:48:39 | | Starting BOINC client version 7.4.36 for windows_x86_64 05/02/2015 17:48:39 | | log flags: file_xfer, sched_ops, task 05/02/2015 17:48:39 | | Libraries: libcurl/7.39.0 OpenSSL/1.0.1j zlib/1.2.8 05/02/2015 17:48:39 | | Data directory: C:\ProgramData\BOINC 05/02/2015 17:48:39 | | Running under account william 05/02/2015 17:48:39 | | CUDA: NVIDIA GPU 0: GeForce GTX TITAN (driver version 347.25, CUDA version 7.0, compute capability 3.5, 4096MB, 4096MB available, 4707 GFLOPS peak) 05/02/2015 17:48:39 | | OpenCL: NVIDIA GPU 0: GeForce GTX TITAN (driver version 347.25, device version OpenCL 1.1 CUDA, 6144MB, 4096MB available, 4707 GFLOPS peak) 05/02/2015 17:48:39 | SETI@home | Found app_info.xml; using anonymous platform 05/02/2015 17:48:39 | | Host name: outlander 05/02/2015 17:48:39 | | Processor: 8 GenuineIntel Intel(R) Core(TM) i7-4770K CPU @ 3.50GHz [Family 6 Model 60 Stepping 3] 05/02/2015 17:48:39 | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 fma cx16 sse4_1 sse4_2 movebe popcnt aes f16c rdrandsyscall nx lm avx avx2 vmx tm2 pbe fsgsbase bmi1 smep bmi2 05/02/2015 17:48:39 | | OS: Microsoft Windows 7: Ultimate x64 Edition, Service Pack 1, (06.01.7601.00) 05/02/2015 17:48:39 | | Memory: 15.94 GB physical, 31.88 GB virtual 05/02/2015 17:48:39 | | Disk: 465.54 GB total, 381.51 GB free 05/02/2015 17:48:39 | | Local time is UTC +0 hours 05/02/2015 17:48:39 | | Config: GUI RPCs allowed from: <snip> 05/02/2015 17:48:39 | SETI@home | URL http://setiathome.berkeley.edu/; Computer ID 7225736; resource share 100 05/02/2015 17:48:39 | SETI@home | General prefs: from SETI@home (last modified 21-Jan-2015 19:19:02) 05/02/2015 17:48:39 | SETI@home | Computer location: home 05/02/2015 17:48:39 | | General prefs: using separate prefs for home 05/02/2015 17:48:39 | | Reading preferences override file 05/02/2015 17:48:39 | | Preferences: 05/02/2015 17:48:39 | | max memory usage when active: 1632.16MB 05/02/2015 17:48:39 | | max memory usage when idle: 13057.29MB 05/02/2015 17:48:39 | | max disk usage: 10.00GB 05/02/2015 17:48:39 | | suspend work if non-BOINC CPU load exceeds 75% 05/02/2015 17:48:39 | | (to change preferences, visit a project web site or select Preferences in the Manager) 05/02/2015 17:48:39 | | Not using a proxy 05/02/2015 17:48:42 | SETI@home | Sending scheduler request: To fetch work. 05/02/2015 17:48:42 | SETI@home | Requesting new tasks for CPU and NVIDIA GPU 05/02/2015 17:48:44 | SETI@home | Scheduler request completed: got 0 new tasks 05/02/2015 17:48:44 | SETI@home | Not sending work - last request too recent: 291 sec
13) Message boards : Number crunching : Suddenly Missing GPU (nVidia driver update issue?) (Message 1637743)
Posted 21 days ago by Profile WoodgieProject donor
Nope, a roll back to 347.09 didn't help, even when it was a 'clean installation' which removes existing drivers first.

Given that the system sees both cards, I am for the moment discounting a hardware issue.

Well I'm out of ideas, anyone else?

Do you have a Monitor connected to the 2nd GPU?

Often with Multi-vendor GPU hosts, the 2nd GPU isn't invisible to Boinc until either a Monitor is connected, or the desktop is extended onto the 2nd GPU,
What is supposed to happen nowadays with Multiple Nvidia GPUs i'm less sure.

Claggy


By default it has NO monitors attached, it's been running headless in my server room with access over VNC pretty much since I got it and has running happily on both cards until the other day.

I'll find out when I get to pulling the cards, which should be later today.
14) Message boards : Number crunching : Suddenly Missing GPU (nVidia driver update issue?) (Message 1637742)
Posted 21 days ago by Profile WoodgieProject donor
OK, removed the 750ti and the TITAN is seen again.

That is strange in and of itself, I think.

Actually, not - depending on what exactly you saw in the logs.

By default, BOINC will only report and use the 'best' GPU in a system (although it should detect all cards, and mark the lesser one(s) as 'not used').

The GTX 750 Ti has a 'compute capability' of 5.0, against the TITAN's 3.5 - so by BOINC's assessment the newer 750 Ti is the best card (raw speed is also considered, as is usable memory and something else I've forgotten - but the compute capability trumps the others).

So yes, the methodical step-by-step approach is needed to isolate this one, but do pay attention to all information sources - GPU-Z, any speed or fan tuning utility you use, and of course the BOINC startup logs.


I did not know that and indeed in the logs the one working TITAN was marked as 'Not Used'. So that would explain that. Thank you very much.

Is it possible to make BOINC use all GPUs present? A key in one of the many indecipherable xml files perchance?

Also, I have never seen any documentation regarding BOINCs xml files. Does anyone know anything out there describing the keys and their functions? My Google-fu is weak on this.
15) Message boards : Number crunching : Suddenly Missing GPU (nVidia driver update issue?) (Message 1637685)
Posted 21 days ago by Profile WoodgieProject donor
OK, removed the 750ti and the TITAN is seen again.

That is strange in and of itself, I think.
16) Message boards : Number crunching : Suddenly Missing GPU (nVidia driver update issue?) (Message 1637682)
Posted 21 days ago by Profile WoodgieProject donor
That's because I had a 750ti destined for another machine which I put in it and it and have just noticed it myself. Now it's not using EITHER of the TITANs

Damn it.

OK, I think a proper case of methodical troubleshooting is in order to see if it's a card, a slot or what.
17) Message boards : Number crunching : Suddenly Missing GPU (nVidia driver update issue?) (Message 1637663)
Posted 21 days ago by Profile WoodgieProject donor
Just updating in case anyone else has the same issue.

I found this which I am working through now.
18) Message boards : Number crunching : Suddenly Missing GPU (nVidia driver update issue?) (Message 1637661)
Posted 21 days ago by Profile WoodgieProject donor
Great, NOW I see that the system is saying, for one of the cards:

"Windows has stopped this device because it has reported problems. (Code 43)"

Time for Google...
19) Message boards : Number crunching : Suddenly Missing GPU (nVidia driver update issue?) (Message 1637660)
Posted 21 days ago by Profile WoodgieProject donor
Nope, a roll back to 347.09 didn't help, even when it was a 'clean installation' which removes existing drivers first.

Given that the system sees both cards, I am for the moment discounting a hardware issue.

Well I'm out of ideas, anyone else?
20) Message boards : Number crunching : Suddenly Missing GPU (nVidia driver update issue?) (Message 1637647)
Posted 21 days ago by Profile WoodgieProject donor
Let me state from the outset I'm not a Windows person and so forgive me if I'm missing something obvious.

So here's a bit of a poser. I noticed yesterday that BOINC - running SETI@Home only - my machine outlander had suddenly 'lost' visibility of a graphics card. I have 2 nVidia Titans but only one now seems to be seen as per the highlighted lines below. Originally I had NVIDIA GPU 0 and NVIDIA GPU 1

05/02/2015 08:20:41 | | cc_config.xml not found - using defaults
05/02/2015 08:20:41 | | Starting BOINC client version 7.4.36 for windows_x86_64
05/02/2015 08:20:41 | | log flags: file_xfer, sched_ops, task
05/02/2015 08:20:41 | | Libraries: libcurl/7.39.0 OpenSSL/1.0.1j zlib/1.2.8
05/02/2015 08:20:41 | | Data directory: C:\ProgramData\BOINC
05/02/2015 08:20:41 | | Running under account william
05/02/2015 08:20:41 | | CUDA: NVIDIA GPU 0: GeForce GTX TITAN (driver version 347.25, CUDA version 7.0, compute capability 3.5, 4096MB, 4096MB available, 4707 GFLOPS peak)
05/02/2015 08:20:41 | | OpenCL: NVIDIA GPU 0: GeForce GTX TITAN (driver version 347.25, device version OpenCL 1.1 CUDA, 6144MB, 4096MB available, 4707 GFLOPS peak)

05/02/2015 08:20:41 | SETI@home | Found app_info.xml; using anonymous platform
05/02/2015 08:20:41 | | Host name: outlander
05/02/2015 08:20:41 | | Processor: 8 GenuineIntel Intel(R) Core(TM) i7-4770K CPU @ 3.50GHz [Family 6 Model 60 Stepping 3]
05/02/2015 08:20:41 | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 fma cx16 sse4_1 sse4_2 movebe popcnt aes f16c rdrandsyscall nx lm avx avx2 vmx tm2 pbe fsgsbase bmi1 smep bmi2
05/02/2015 08:20:41 | | OS: Microsoft Windows 7: Ultimate x64 Edition, Service Pack 1, (06.01.7601.00)
05/02/2015 08:20:41 | | Memory: 15.94 GB physical, 31.88 GB virtual
05/02/2015 08:20:41 | | Disk: 465.54 GB total, 385.54 GB free
05/02/2015 08:20:41 | | Local time is UTC +0 hours

The system sees the second graphics card, both Piriform Speccy and CPUZ both pick it up and report it as what it is.

I am also running the Lunatics 0.43a apps, so I've done things like:
Remove BOINC
Reinstall BOINC
Reinstall Lunatics (and re-edit the app_info file)

The only thing which has changed is that there was a GEFORCE driver update which happened around the same time I'm currently running 347.25 I'm going to download and re-install the earlier version (if that is possible) and will post if it makes a difference.

In the mean time, if anyone has nay other troubleshooting suggestions I'd be happy to hear them.

Thanks, you wonderful bunch of people you! :-)

~W


Next 20

Copyright © 2015 University of California