Strange result, how is this possible?

Message boards : Number crunching : Strange result, how is this possible?
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · Next

AuthorMessage
Profile perryjay
Volunteer tester
Avatar

Send message
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 1062833 - Posted: 2 Jan 2011, 20:08:06 UTC - in response to Message 1062822.  

Those I found on mine running old apps have been cleared already. I've got five more inconclusives of which three seem to have been fixed. The other two are one half of a 295 bad and the other is a 260. Hopefully they will notice too.


PROUD MEMBER OF Team Starfire World BOINC
ID: 1062833 · Report as offensive
xx
Volunteer tester

Send message
Joined: 23 May 99
Posts: 166
Credit: 3,450,910
RAC: 0
United States
Message 1062838 - Posted: 2 Jan 2011, 20:30:11 UTC

http://setiathome.berkeley.edu/show_host_detail.php?hostid=4126021 is cranking out a lot of error/invalid tasks.

Chuck
ID: 1062838 · Report as offensive
Profile perryjay
Volunteer tester
Avatar

Send message
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 1062842 - Posted: 2 Jan 2011, 20:40:08 UTC - in response to Message 1062838.  

http://setiathome.berkeley.edu/show_host_detail.php?hostid=4126021 is cranking out a lot of error/invalid tasks.

Chuck


Yep, the only "valid" results were against other -9s. Looks like it's time for him to either blow out the dust bunnies or get him a new card.



PROUD MEMBER OF Team Starfire World BOINC
ID: 1062842 · Report as offensive
Josef W. Segur
Volunteer developer
Volunteer tester

Send message
Joined: 30 Oct 99
Posts: 4504
Credit: 1,414,761
RAC: 0
United States
Message 1064377 - Posted: 7 Jan 2011, 20:29:45 UTC

Here's the list of hosts with Fermi cards still running the V12 optimized application from my checking earlier today.
1754767 3099502 4202271 5149058 5177668 5231715 5257703 5293938 5346869 5354400 5357884 5396192 5444296 5448502 5467571 5472266 5478875 5564376 5701024

The Linux host I noted in an earlier post has no in progress tasks and none reported since 31 Dec. 2010, so I'm considering it gone.
                                                               Joe
ID: 1064377 · Report as offensive
Profile Fred J. Verster
Volunteer tester
Avatar

Send message
Joined: 21 Apr 04
Posts: 3252
Credit: 31,903,643
RAC: 0
Netherlands
Message 1084601 - Posted: 7 Mar 2011, 13:02:32 UTC - in response to Message 1062842.  

http://setiathome.berkeley.edu/show_host_detail.php?hostid=4126021 is cranking out a lot of error/invalid tasks.

Chuck


Yep, the only "valid" results were against other -9s. Looks like it's time for him to either blow out the dust bunnies or get him a new card.


After checking, these are not -9, but -177 (0xffffffffffffff4f), but ofcoarse,
invalid!


ID: 1084601 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1084612 - Posted: 7 Mar 2011, 13:32:12 UTC - in response to Message 1084601.  
Last modified: 7 Mar 2011, 13:34:10 UTC

http://setiathome.berkeley.edu/show_host_detail.php?hostid=4126021 is cranking out a lot of error/invalid tasks.

Chuck


Yep, the only "valid" results were against other -9s. Looks like it's time for him to either blow out the dust bunnies or get him a new card.


After checking, these are not -9, but -177 (0xffffffffffffff4f), but ofcoarse,
invalid!


He probably had a card that was throwing -9's, then it started working again, but too late,

so now his host has got an Average processing rate of 2895.8263185242!!!!!!!!,

his GPU is never going to match that, so he'll continue doing -177's,

Claggy
ID: 1084612 · Report as offensive
Profile skildude
Avatar

Send message
Joined: 4 Oct 00
Posts: 9541
Credit: 50,759,529
RAC: 60
Yemen
Message 1084613 - Posted: 7 Mar 2011, 13:36:03 UTC - in response to Message 1084612.  

has anyone considered PMing the folks Whose machines are cranking out crap instead of just complaining about them.


In a rich man's house there is no place to spit but his face.
Diogenes Of Sinope
ID: 1084613 · Report as offensive
Profile Dirk Sadowski
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1084628 - Posted: 7 Mar 2011, 14:07:06 UTC - in response to Message 1061903.  

hostid=5393593 has reverted back to Stock since the list was published.

Claggy

No evidence of participation in these forums, but a member of a non-english-language team. I hope that's a good sign that the message might spread outwards through other means.

Edit - yes, they are discussing this problem on the SETI.Germany forums. Thank you.


I wrote already at July'10 in the S.G forum about the Fermi + CUDA_V12 prob.

S.G : S@h subforum : SETI@home auf einer GTX4xx/5xx GPU

ID: 1084628 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14690
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1084635 - Posted: 7 Mar 2011, 14:28:47 UTC - in response to Message 1084628.  

Edit - yes, they are discussing this problem on the SETI.Germany forums. Thank you.

I wrote already at July'10 in the S.G forum about the Fermi + CUDA_V12 prob.

S.G : S@h subforum : SETI@home auf einer GTX4xx/5xx GPU

Discussion maybe, but the message still doesn't seem to have got through.

From the previous list, the SETI.Germany team members are:

1754767 Tauern- Apotheke Berlin
5177668 Orgasmann
5231715 Christian Buckatz
5396192 micha123
5444296 Vigilante
5701024 Frank-SETI-Reit

Of those, hosts 5396192 and 5444296 seem to have been taken off-line, the other four have all returned -9 results from the V12 app within the last 24 hours. Not one seems to be successfully crunching SETI with their expensive GPUs.
ID: 1084635 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14690
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1084640 - Posted: 7 Mar 2011, 14:36:21 UTC - in response to Message 1084601.  

After checking, these are not -9, but -177 (0xffffffffffffff4f), but of course,
invalid!

And hence kept safely away from the main database of canonical results.
ID: 1084640 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1084642 - Posted: 7 Mar 2011, 14:40:19 UTC - in response to Message 1084613.  
Last modified: 7 Mar 2011, 14:42:26 UTC

has anyone considered PMing the folks Whose machines are cranking out crap instead of just complaining about them.


It would be better if it got reported to the Boinc Devs, haven't thought what to say yet,

things that need to be changed are:

Don't include errored tasks in the Average processing rate calculations, that will fix some hosts too high Average processing rate,
Might make New Credit more concistant,

A better Quota system that limits devices that produce Invalid results, how about setting it to 'One' everytime there's a Invalid result,

Claggy
ID: 1084642 · Report as offensive
Profile Gundolf Jahn

Send message
Joined: 19 Sep 00
Posts: 3184
Credit: 446,358
RAC: 0
Germany
Message 1084678 - Posted: 7 Mar 2011, 16:17:04 UTC - in response to Message 1084642.  

Don't include errored tasks in the Average processing rate calculations, that will fix some hosts too high Average processing rate,
Might make New Credit more concistant,

A better Quota system that limits devices that produce Invalid results, how about setting it to 'One' everytime there's a Invalid result,

Problem with that is: -9 is no error, only a warning.

Gruß,
Gundolf
ID: 1084678 · Report as offensive
Profile Fred J. Verster
Volunteer tester
Avatar

Send message
Joined: 21 Apr 04
Posts: 3252
Credit: 31,903,643
RAC: 0
Netherlands
Message 1084681 - Posted: 7 Mar 2011, 17:06:11 UTC - in response to Message 1084642.  

has anyone considered PMing the folks Whose machines are cranking out crap instead of just complaining about them.


It would be better if it got reported to the Boinc Devs, haven't thought what to say yet,

things that need to be changed are:

Don't include errored tasks in the Average processing rate calculations, that will fix some hosts too high Average processing rate,
Might make New Credit more concistant,

A better Quota system that limits devices that produce Invalid results, how about setting it to 'One' everytime there's a Invalid result,

Claggy


Or the Normal Amount for a host minus the invalid-results and plus
1 for each valid result reported, after the invalid ones.

ID: 1084681 · Report as offensive
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34685
Credit: 79,922,639
RAC: 80
Germany
Message 1084684 - Posted: 7 Mar 2011, 17:20:26 UTC

I dont like the idea.

Hows about someone has to find out whats wrong with new hardware ?

With each crime and every kindness we birth our future.
ID: 1084684 · Report as offensive
Profile James Sotherden
Avatar

Send message
Joined: 16 May 99
Posts: 10436
Credit: 110,373,059
RAC: 54
United States
Message 1084690 - Posted: 7 Mar 2011, 18:09:31 UTC

It seems to me if you are on the boards,and you know what goes on in your own computer, If you do have a card or CPU go bad. You fix the problem. These guys buy big money cards and let them run without checking, Need to be cutoff. You cant PM them, I tried a few. Admins need to just cut them off pronto. If they cant get work, maybe just maybe they might wake up.
[/quote]

Old James
ID: 1084690 · Report as offensive
Profile Dirk Sadowski
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1084858 - Posted: 8 Mar 2011, 3:33:39 UTC - in response to Message 1084635.  

Edit - yes, they are discussing this problem on the SETI.Germany forums. Thank you.

I wrote already at July'10 in the S.G forum about the Fermi + CUDA_V12 prob.

S.G : S@h subforum : SETI@home auf einer GTX4xx/5xx GPU

Discussion maybe, but the message still doesn't seem to have got through.

From the previous list, the SETI.Germany team members are:

1754767 Tauern- Apotheke Berlin
5177668 Orgasmann
5231715 Christian Buckatz
5396192 micha123
5444296 Vigilante
5701024 Frank-SETI-Reit

Of those, hosts 5396192 and 5444296 seem to have been taken off-line, the other four have all returned -9 results from the V12 app within the last 24 hours. Not one seems to be successfully crunching SETI with their expensive GPUs.


We don't need to talk about who or what the problem was/is, that still people use the CUDA_v12 app with their GTX4xx-5xx GPUs. The child is fallen already into the fountain.
We have now the problem, that the CUDA_V12 app is still in use with the Fermi GPUs.

I don't understand that people install software and don't look if it work smoothly.
Normally the BOINC Manager show in the tasks overview 'computation error's, or not? (I guess the people use the currently recommended V6.10.58, so the message overview is easy to find - compared to the currently DEV-Vs (6.12.x) where the message overview is little bit hidden)

There are people which never read the S@h forum, also not the team forum.
We see now, we can't reach the members over the forum. If we send them a PM, it's not sure that the message will be received/read.
How we could reach them?

Computers are smart, why not to let run a script or something on the project server and if a PC make only calculation errors he get a message/warning and don't get tasks until the problem is fixed.

The <stderr_txt> of the app isn't only shown on the hosts overview, or?
The stock and opt. CUDA app display the GPU series. Also BOINC do this.
If the script see GTX4xx-5xx and CUDA_V12 -> no new tasks.

If this would need much/all project server resources, shutdown the project servers for the normal/usual work for one/two days and search for the buggy member PCs. After, all PCs under this account ID don't get new WUs. The members get messages in BOINC, EMail (if in the prefs checked) and over PM - and a warning on the first page of the project site (also if it's a 3rd party software, the project must deal with it).

Or, no new CUDA tasks to the special host IDs.

If a project admin know, hey there is a PC which make only errors because of a bad/wrong installation, why not to touch a swtich and don't send WUs?
This would need time and manpower, but would safe project resources (server/bandwidth) for well running member PCs.

This way or this way..

There must be a way/solution for to reach the members/switch off buggy PCs.

ID: 1084858 · Report as offensive
Profile skildude
Avatar

Send message
Joined: 4 Oct 00
Posts: 9541
Credit: 50,759,529
RAC: 60
Yemen
Message 1084882 - Posted: 8 Mar 2011, 4:26:31 UTC - in response to Message 1084858.  

how exactly would the server know that the repairs are made?


In a rich man's house there is no place to spit but his face.
Diogenes Of Sinope
ID: 1084882 · Report as offensive
Profile Dirk Sadowski
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1084941 - Posted: 8 Mar 2011, 10:07:08 UTC - in response to Message 1084882.  
Last modified: 8 Mar 2011, 10:11:08 UTC

how exactly would the server know that the repairs are made?


Either automatically, 1 WU/day, the server read the <stderr_txt> again if the CUDA_V12 is still there. Still there not more WUs/day. CUDA_V12 away, WU quota again on.
Or manually, the member get a message (PM and EMail, in several languages) and an 'activation-URL' where he must click, if he updated his machine/s. After a short time the project server look again to the <stderr_txt> if it's true.
ID: 1084941 · Report as offensive
Profile Miep
Volunteer moderator
Avatar

Send message
Joined: 23 Jul 99
Posts: 2412
Credit: 351,996
RAC: 0
Message 1084948 - Posted: 8 Mar 2011, 10:36:25 UTC - in response to Message 1084941.  

how exactly would the server know that the repairs are made?


Either automatically, 1 WU/day, the server read the <stderr_txt> again if the CUDA_V12 is still there. Still there not more WUs/day. CUDA_V12 away, WU quota again on.
Or manually, the member get a message (PM and EMail, in several languages) and an 'activation-URL' where he must click, if he updated his machine/s. After a short time the project server look again to the <stderr_txt> if it's true.


Ok, are you going to code the sophisticated AI needed to 'automatically' discover bad hosts, check their app status and cut them down until something changes?
Carola
-------
I'm multilingual - I can misunderstand people in several languages!
ID: 1084948 · Report as offensive
Profile Dirk Sadowski
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1084951 - Posted: 8 Mar 2011, 10:40:18 UTC - in response to Message 1062818.  
Last modified: 8 Mar 2011, 11:13:14 UTC


I´m wondering how this can happen.

http://setiathome.berkeley.edu/workunit.php?wuid=671914757

I can´t find the credit multiplier in the fermi app.

The new credit system doesn't use anything from the app, the elapsed time reported by the BOINC core client (or CPU time from older versions of BOINC) is the basis of credits. That's combined with server averaging which produces what amounts to a benchmark for each application on the host.

The WU you linked has of course been purged from the database, with only about 6 hour purge delay it's best to quote pertinent details here.
                                                               Joe


Just curious..

Which BOINC version is now at least needed for well Cr./MB+AP WU?
In past with the previous Cr.-system it was V5.2.6 .

What will happen if I reschedule GPU/CUDA WUs to CPU (if BOINC have too less DLed and the project servers are down). This will influence the granted Cr. for this WUs (+ or -)?
ID: 1084951 · Report as offensive
Previous · 1 · 2 · 3 · 4 · 5 · Next

Message boards : Number crunching : Strange result, how is this possible?


 
©2026 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.