Anonumous host throwing only errors, 3223 right now

Message boards : Number crunching : Anonumous host throwing only errors, 3223 right now
Message board moderation

To post messages, you must log in.

1 · 2 · 3 · 4 . . . 5 · Next

AuthorMessage
Profile Fred J. Verster
Volunteer tester
Avatar

Send message
Joined: 21 Apr 04
Posts: 3252
Credit: 31,903,643
RAC: 0
Netherlands
Message 1241310 - Posted: 4 Jun 2012, 20:55:46 UTC
Last modified: 4 Jun 2012, 20:56:34 UTC

While looking at some ATI results, I've come across this host,
5292089, making only GPU (9800GTX)errors.
CPU results are OK.
This host shouldn't get so much tasks, CPU only f.i., IMHO.
ID: 1241310 · Report as offensive
Profile Convergence
Avatar

Send message
Joined: 23 Jun 08
Posts: 117
Credit: 2,928,788
RAC: 0
United States
Message 1241323 - Posted: 4 Jun 2012, 21:16:27 UTC

Wow, that's quite ridiculous.
ID: 1241323 · Report as offensive
Dave

Send message
Joined: 29 Mar 02
Posts: 778
Credit: 25,001,396
RAC: 0
United Kingdom
Message 1241341 - Posted: 4 Jun 2012, 21:49:17 UTC

Resends-alert?
ID: 1241341 · Report as offensive
Josef W. Segur
Volunteer developer
Volunteer tester

Send message
Joined: 30 Oct 99
Posts: 4504
Credit: 1,414,761
RAC: 0
United States
Message 1241404 - Posted: 4 Jun 2012, 23:09:52 UTC

Unfortunately it is not producing only errors, about 1 in 25 are being reported by the CUDA apps as successful, and thereby boosting the quota back up. Later they're being found invalid, but that doesn't help much.

The anonymous user of course can't be contacted, and the 9800 GTX+ has enough memory that both stock 6.08 and 6.09 are being used and have their separate quotas, which sort of doubles the problem.

In the big picture with ~60000 results being reported each hour, one host like that isn't much of a problem, but it is certainly reducing the overall productivity.
                                                                   Joe
ID: 1241404 · Report as offensive
Profile Slavac
Volunteer tester
Avatar

Send message
Joined: 27 Apr 11
Posts: 1932
Credit: 17,952,639
RAC: 0
United States
Message 1241419 - Posted: 4 Jun 2012, 23:33:57 UTC - in response to Message 1241404.  

Don't look at my errors :(

I managed to reset the SETI project during a fit of Boinc lag last week, borking 5K tasks.


Executive Director GPU Users Group Inc. -
brad@gpuug.org
ID: 1241419 · Report as offensive
.clair.

Send message
Joined: 4 Nov 04
Posts: 1300
Credit: 55,390,408
RAC: 69
United Kingdom
Message 1241494 - Posted: 5 Jun 2012, 2:24:44 UTC

OOoo, you, you, badly borked your BOINC.
Sounds nasty,
Can you get tablets to help with that :¬)
ID: 1241494 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 36781
Credit: 261,360,520
RAC: 489
Australia
Message 1241504 - Posted: 5 Jun 2012, 2:52:48 UTC - in response to Message 1241404.  

My pending list is going through the roof again but when you get teamed with these Anonymous users with faulty/unserviced or poorly setup machines that cut large chunks of "inconclusives" through my list its bad,

a few examples,

3378825
6469701
6256705
6589662

but when they double up on you its even worse then throw in a good mix of people using the sleep bug version CUDA drivers (I sent off a lot of PM's today but how many actually get looked at will be another matter) and things get really crazy.

Cheers.
ID: 1241504 · Report as offensive
Profile Ex: "Socialist"
Volunteer tester
Avatar

Send message
Joined: 12 Mar 12
Posts: 3433
Credit: 2,616,158
RAC: 2
United States
Message 1241553 - Posted: 5 Jun 2012, 4:47:36 UTC

Someone on the server side of our project should add some measures, to protect us against people like that. I.e. an error cap, if you produce this many errors, you get cut off of whatever work you are erroring on. Then the person can come here and make themselves known in order to get a "talkin' to" and get their s*** straight.
#resist
ID: 1241553 · Report as offensive
Profile Link
Avatar

Send message
Joined: 18 Sep 03
Posts: 834
Credit: 1,807,369
RAC: 0
Germany
Message 1241609 - Posted: 5 Jun 2012, 8:30:02 UTC - in response to Message 1241404.  

Unfortunately it is not producing only errors, about 1 in 25 are being reported by the CUDA apps as successful, and thereby boosting the quota back up.

That's what I always ask myself, why is the quota not increased by 1 (or 2 at most) for each successful result just like it is decreased by 1 for each error? That would give a GPU 8 new tasks, that should be enough to proof if it's reliable again. I don't think it's not too much to expect, that a host should return at least 50% of his tasks without an error (actually I'd expect at least 95% to be "successful" for a properly working machine, but a machine that has been fixed must have a chance to build up the quota in reasonable time).
ID: 1241609 · Report as offensive
Lionel

Send message
Joined: 25 Mar 00
Posts: 680
Credit: 563,640,304
RAC: 597
Australia
Message 1241621 - Posted: 5 Jun 2012, 9:41:31 UTC - in response to Message 1241419.  

Don't look at my errors :(

I managed to reset the SETI project during a fit of Boinc lag last week, borking 5K tasks.


did that a couple of weeks ago myself ... ~3000 down the tube ... maybe we should have a borking sticky ...

ID: 1241621 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 36781
Credit: 261,360,520
RAC: 489
Australia
Message 1241643 - Posted: 5 Jun 2012, 11:17:07 UTC - in response to Message 1241621.  

Don't look at my errors :(

I managed to reset the SETI project during a fit of Boinc lag last week, borking 5K tasks.


did that a couple of weeks ago myself ... ~3000 down the tube ... maybe we should have a borking sticky ...

Those that aborted work I never sent a PM to as I figured that those who did found their problem just like those who's video cards had disappeared from their setups but this week I've found a lot of bad wingmen (and I mean a great deal but some others had not yet had enough proof of a problem though that also may change next week).

BTW out of about 24 PM's that I sent off around 12hrs ago I've only received 1 reply so far but maybe that will increase by another in another 12hrs (but I may name & shame those who havn't done any corrective action by next week).

Cheers.
ID: 1241643 · Report as offensive
Profile Swordfish
Avatar

Send message
Joined: 5 Aug 06
Posts: 72
Credit: 3,014,493
RAC: 0
United Kingdom
Message 1242110 - Posted: 6 Jun 2012, 10:34:45 UTC - in response to Message 1241504.  

Yep, I noticed that in my pending list over time it contains a wingman called Anonymous, in inconsistants.
I also noticed that Anonymous has a number of host computers associated with this name.

One of which is 3378825, which at the time of this reply had

12795 Pending
88 Valid
1144 Invalid
5 Error

That's one awful lot of Invalids , no wonder we got to rehash the work to get valid results.
ID: 1242110 · Report as offensive
Profile Gundolf Jahn

Send message
Joined: 19 Sep 00
Posts: 3184
Credit: 446,358
RAC: 0
Germany
Message 1242151 - Posted: 6 Jun 2012, 13:13:52 UTC - in response to Message 1242110.  

I also noticed that Anonymous has a number of host computers associated with this name.

Each and every user that has his/her computer(s) hidden is represented as 'Anonymous'!

Gruß,
Gundolf
ID: 1242151 · Report as offensive
Profile Swordfish
Avatar

Send message
Joined: 5 Aug 06
Posts: 72
Credit: 3,014,493
RAC: 0
United Kingdom
Message 1242180 - Posted: 6 Jun 2012, 13:58:41 UTC - in response to Message 1242151.  

Yep, I know that







ID: 1242180 · Report as offensive
Profile Yanivicious
Avatar

Send message
Joined: 29 Mar 12
Posts: 157
Credit: 15,529,301
RAC: 0
United States
Message 1242414 - Posted: 6 Jun 2012, 19:33:12 UTC - in response to Message 1242180.  

so is there really no way to remedy this situation? Can't we email or message the administrators at S@H and report this, and perhaps they can take a few moments to figure out a permanent fix for this (like the error quota mentioned below)?
ID: 1242414 · Report as offensive
Profile Area 51
Avatar

Send message
Joined: 31 Jan 04
Posts: 965
Credit: 42,193,520
RAC: 0
United Kingdom
Message 1242422 - Posted: 6 Jun 2012, 20:14:44 UTC

From previous similar discussions - wasn't there some sort of a 'blacklist' capability within BOINC? Can't remember whether it operated at the host or account level, but I agree - something ought to be done about hosts/users who just spit out rubbish and squander resources!
ID: 1242422 · Report as offensive
Profile Link
Avatar

Send message
Joined: 18 Sep 03
Posts: 834
Credit: 1,807,369
RAC: 0
Germany
Message 1242475 - Posted: 6 Jun 2012, 21:54:56 UTC - in response to Message 1242422.  

From previous similar discussions - wasn't there some sort of a 'blacklist' capability within BOINC? Can't remember whether it operated at the host or account level, but I agree - something ought to be done about hosts/users who just spit out rubbish and squander resources!

We actually have something, that could be able to automatically stop such hosts: quota system. Just the current settings are making it pretty useless, I see that here, I see that on Milkyway and on other projects it won't be much different.
ID: 1242475 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14679
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1242490 - Posted: 6 Jun 2012, 22:13:59 UTC - in response to Message 1242422.  

From previous similar discussions - wasn't there some sort of a 'blacklist' capability within BOINC? Can't remember whether it operated at the host or account level, but I agree - something ought to be done about hosts/users who just spit out rubbish and squander resources!

http://boinc.berkeley.edu/trac/wiki/BlackList#
ID: 1242490 · Report as offensive
Josef W. Segur
Volunteer developer
Volunteer tester

Send message
Joined: 30 Oct 99
Posts: 4504
Credit: 1,414,761
RAC: 0
United States
Message 1242552 - Posted: 7 Jun 2012, 0:38:25 UTC - in response to Message 1242414.  

so is there really no way to remedy this situation? Can't we email or message the administrators at S@H and report this, and perhaps they can take a few moments to figure out a permanent fix for this (like the error quota mentioned below)?

The S@H project can use whatever features BOINC provides, although BOINC is LGPL and they could modify it the time and effort involved would be far more than "a few moments".

There's a <daily_result_quota> project option which is currently at 100. If it were practical to reduce it to 20 or so, the problem would be much reduced. However, the CUDA apps have some builtin "unsupported function" -12 errors which are no fault of the owner or host and occur often enough that a lower base for the quota would impact the productivity of some of the best systems. I can't really judge whether something like a 50 setting might be practical or not, but my guess is not.

There have been several discussions on the boinc_dev mailing list related to the quota logic, but Dr. Anderson has not been persuaded that any major change is needed. If enough projects complained, that could change; the server code is very much intended to target project needs rather than what we external participants see as problematic.
                                                                  Joe
ID: 1242552 · Report as offensive
Profile Fred J. Verster
Volunteer tester
Avatar

Send message
Joined: 21 Apr 04
Posts: 3252
Credit: 31,903,643
RAC: 0
Netherlands
Message 1242725 - Posted: 7 Jun 2012, 11:00:34 UTC - in response to Message 1242552.  
Last modified: 7 Jun 2012, 11:24:09 UTC

Host
6469701
has a lot of overflows or -9 errors.
Yet another anonymous host with an GTX 560 and a GTS8800!
ID: 1242725 · Report as offensive
1 · 2 · 3 · 4 . . . 5 · Next

Message boards : Number crunching : Anonumous host throwing only errors, 3223 right now


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.