Discussion of Invalid Host Messaging |
![]() |
| log in |
Message boards : Number crunching : Discussion of Invalid Host Messaging
Previous · 1 · 2 · 3 · 4 · 5 · Next
| Author | Message |
|---|---|
Good question, I just add new machines to the default profile. I have just started crunching for Einstein@Home and GPU crunching was on by default. The use of the GPU in the project preferences is on by default. However, the default computing preferences for "Suspend GPU work while computer is in use?" is set to yes, so as not to interfere with the use of the computer. In that case, BOINC will only use the GPU when the computer is idle. ____________ Jord - BOINC FAQ Service - BOINC User Wiki Real is just a matter of perception. | |
| ID: 1299572 · | |
I got a rather unusual reply from 1 person via PM today, rob99999_2 ID 129322, the owner of Computer 6797265, which produces nothing but errors from its GT 650M but here's his reply. No, he is not. He is the owner/user of his computer and only he is responsible for what this computer is doing, just like he has to watch, that it's not sending out spam mails or participate in DDoS attacs, so he has to watch what it is doing with the SETI WUs, that it gets assigned. Just like a car driver is responsible for his car, listen to it and watch how it behaves carefully and if you suspect, that something might be wrong, stop and call help, if you can't fix it yourself. ____________ . | |
| ID: 1299733 · | |
I got a rather unusual reply from 1 person via PM today, rob99999_2 ID 129322, the owner of Computer 6797265, which produces nothing but errors from its GT 650M but here's his reply. Nothing is wrong with his car nothing is wrong with his engine he has it serviced correctly, however the manufacturer has failed to tell him that there is a fault that will mean his engine is about to breakdown. Can the manufacturer fix it, no you have to do it yourself. Or stop using the car! ____________ | |
| ID: 1299757 · | |
|
A car is a deadly weapon so there is a definite moral obligation to keep it well maintained, backed most places by legal requirements. Joe | |
| ID: 1299800 · | |
I got a rather unusual reply from 1 person via PM today, rob99999_2 ID 129322, the owner of Computer 6797265, which produces nothing but errors from its GT 650M but here's his reply. I know that car-computer comparisons are crap, but sometimes I have not a better one. Point is: his computer fails, he should be the first who notices it and see if he can fix it or ask for help. I have the similar situation with Milkyway right now: my old ATI HD3850 can only run the older (not really supported anymore) CAL application, I have to watch if new batches of WUs are still compatible with it, if not I have to stop crunching. It was already once the case, I had to stop crunching for about a month, than it worked again. Wether old or new hardware, you have to watch it, something might always not work as expected. Specially after any changes on the system, for example if you buy a new card or install new drivers, you have to first see that it actually works before you let it do it's work without too much attention from your side. And I'm pretty sure that most of the owners of those 560Ti cards have skipped that part. Something like "set and forget" does not exist with computers anyway, even if many think so. In best case it's "set, see that it works and hope it lasts for a while". ____________ . | |
| ID: 1299832 · | |
A car is a deadly weapon so there is a definite moral obligation to keep it well maintained, backed most places by legal requirements. Not all issues with a car make it more dangerous, if it's leaking a drop of oil every now and than, it's still safe to drive but bad for the environment. And so are such hosts for the SETI environment, they waste bandwidth and eventually (if two such hosts validate against eachother like fermi cards did before) even compromise the science. ____________ . | |
| ID: 1299879 · | |
|
Everyone is correct, however it means SETI@Home is not set and forget. It should have instructions posted that explain the problems with GPU crunching and a need to check on a regular basis to see if your results are valid. Also to warn people that if they are not prepared to do this they could return invalid results and it is best they don't crunch using a GPU. | |
| ID: 1299920 · | |
|
GPU computing disabled by default and a red "READ THIS FIRST" link to a page with a short info like the one you posted would be IMO a good solution. CPU-only crunching might be "set and forget", GPU crunching is not. | |
| ID: 1299945 · | |
Hence I don't see it as a fault of the project staff, if their apps don't run properly on a new hardware. No possibly not, but if GPU crunching is not "set and forget" both current and prospective users need to know, otherwise as you say we could end up with errors validating against each other, corrupting the science! Users like rob99999_2 need to know what they are getting into. ____________ | |
| ID: 1299961 · | |
|
After reading NVidia driver problems which cause computation errors by Richard Haselgrove | |
| ID: 1300176 · | |
After reading NVidia driver problems which cause computation errors by Richard Haselgrove Thank You for posting. It will take a few days for the dust to settle on the invalids before you can see for sure whether the changes you made helped. You can keep an eye on your finished tasks in the mean time , .. watch for short run times , those tend to be the -9 error you are experiencing. | |
| ID: 1300272 · | |
After reading NVidia driver problems which cause computation errors by Richard Haselgrove Read again: 1) Sleeping Monitor Bug Drivers affected: 295.51 (BETA), 295.73 and 296.10 You use driver: 306.97 so what you did was not needed. You also don't have 'Kepler' GPU (GT 6xx and GTX 6xx) so the other (CUDA_GRID_SIZE_COMPAT) fix do not apply to you. GTX 560 Ti problems are 'famous' and not related to 'Sleeping Monitor Bug' nor 'Kepler' Read 'a few' threads about GTX 560 Ti problems: http://www.google.com/#hl=en&q=560+Ti+problems+site:setiathome.berkeley.edu ____________ - ALF - "Find out what you don't do well ..... then don't do it!" :) | |
| ID: 1300629 · | |
|
I have told it not to use gpu and set power settings to never turn off monitor. let me know. | |
| ID: 1302137 · | |
I have told it not to use gpu and set power settings to never turn off monitor. let me know. Setting power settings to never turn off your monitor won't help, you're not running 295.xx or 296.xx drivers, Looking at your inconclusive/errored tasks, they are a mixture of CPU and GPU tasks, all the one's i looked at all say 'Restarted at 100.00 percent.' which is strange, then looking at the stderr.txt results, multiple tasks have the same result, eithier: <core_client_version>7.0.28</core_client_version> <![CDATA[ <stderr_txt> Restarted at 100.00 percent. Flopcounter: 48049228806222.281000 Spike count: 1 Pulse count: 1 Triplet count: 8 Gaussian count: 0 called boinc_finish </stderr_txt> ]]> http://setiathome.berkeley.edu/result.php?resultid=2686739603 http://setiathome.berkeley.edu/result.php?resultid=2686762450 http://setiathome.berkeley.edu/result.php?resultid=2686762474 http://setiathome.berkeley.edu/result.php?resultid=2688750025 Or: Spike count: 10 Pulse count: 0 Triplet count: 0 Gaussian count: 3 called boinc_finish http://setiathome.berkeley.edu/result.php?resultid=2687146759 http://setiathome.berkeley.edu/result.php?resultid=2687140765 Or: Spike count: 14 Pulse count: 5 Triplet count: 12 Gaussian count: 0 http://setiathome.berkeley.edu/result.php?resultid=2687140761 http://setiathome.berkeley.edu/result.php?resultid=2686771628 Looks like your slot directories aren't getting cleared for some reason, Please post your Boinc startup messages from the Event Log, the first 30 lines will do. Claggy | |
| ID: 1302158 · | |
|
Looks as if Thndr has fixed his problems with his slot directories, he's now fully completing 6.03, 6.10 (cuda_fermi), AstroPulse v6 v6.01 and AstroPulse v6 v6.04 (cuda_opencl_100) tasks, | |
| ID: 1303041 · | |
Looks as if Thndr has fixed his problems with his slot directories, he's now fully completing 6.03, 6.10 (cuda_fermi), AstroPulse v6 v6.01 and AstroPulse v6 v6.04 (cuda_opencl_100) tasks, Well.... to make a long story short, I scrapped the boinc software and started over... that and I changed power settings and reset the project and environment but!! I'm back to 6.10 errors again! I checked everything and gpu usage was turned back on?? how?? Clearly this is NOT just a driver problem. ____________ | |
| ID: 1305991 · | |
|
I have removed the boinc manager from my machine again. I will watch this thread for further developments. I can not see wasting my efforts and messing up data packets until there is a fix. | |
| ID: 1306124 · | |
Looks as if Thndr has fixed his problems with his slot directories, he's now fully completing 6.03, 6.10 (cuda_fermi), AstroPulse v6 v6.01 and AstroPulse v6 v6.04 (cuda_opencl_100) tasks, What enviromental setting? If it's the one in the 'NVidia driver problems which cause computation errors' thread, please Note that is for 6** Keplar GPUs only and is not required on your GTS 450, and you also don't need to change power settings as you're not running 295.xx or 296.xx drivers, Uninstalling and Reinstalling the Boinc software didn't help as that only installs the program, the Boinc Data directory is left intact, and that is where your problem is, looking at your errored tasks still shows 'Restarted at 100.00 percent.', did you go and empty all the slot directories, did you delete them? or did you not touch them? Please post your Boinc startup messages from the Event Log, the first 20 to 30 lines will do (I've already asked you for it once before) <core_client_version>7.0.28</core_client_version> Claggy | |
| ID: 1306191 · | |
|
didnt delete anything | |
| ID: 1306228 · | |
didnt delete anything program is completely uninstalled. ____________ | |
| ID: 1306230 · | |
Message boards : Number crunching : Discussion of Invalid Host Messaging
| Copyright © 2013 University of California |