Confusing results for old AP workunit.

Message boards : Number crunching : Confusing results for old AP workunit.
Message board moderation

To post messages, you must log in.

AuthorMessage
Bill Barto

Send message
Joined: 28 Jun 99
Posts: 864
Credit: 58,712,313
RAC: 91
United States
Message 876918 - Posted: 18 Mar 2009, 17:51:38 UTC

What is up with this work unit

The first two computers went past the deadline so it was issued to two more. Then one of the first two returned a result after the deadline.

Of the second two computers one returned the result before the deadline but the other missed the deadline so it was issued to another computer. Then the result was returned by the one that missed the deadline.

The third re-issue timed out and was re-issued to me.

Computer 4669410 has a validate error and granted credit ---.

Computer 2224122 has "Completed, validation inconclusive" (no concensus) and credit is pending.

Computer 4137258 had "completed and validated" but granted credit zero.

If credit was granted, why was it re-issued to me when computer 4663654 timed out?

Was the science valid or did the validator mess up?

If the science is valid I'll just abort it. However, if credit has been "granted", will the validator look at my result?

Inputs?
ID: 876918 · Report as offensive
Profile Hammeh
Volunteer tester
Avatar

Send message
Joined: 21 May 01
Posts: 135
Credit: 1,143,316
RAC: 0
United Kingdom
Message 876929 - Posted: 18 Mar 2009, 18:37:17 UTC - in response to Message 876918.  

What is up with this work unit

The first two computers went past the deadline so it was issued to two more. Then one of the first two returned a result after the deadline.

Of the second two computers one returned the result before the deadline but the other missed the deadline so it was issued to another computer. Then the result was returned by the one that missed the deadline.

The third re-issue timed out and was re-issued to me.

Computer 4669410 has a validate error and granted credit ---.

Computer 2224122 has "Completed, validation inconclusive" (no concensus) and credit is pending.

Computer 4137258 had "completed and validated" but granted credit zero.

If credit was granted, why was it re-issued to me when computer 4663654 timed out?

Was the science valid or did the validator mess up?

If the science is valid I'll just abort it. However, if credit has been "granted", will the validator look at my result?

Inputs?


Looks to me like a validation mess up!
Computer 4669410 has a validate error and granted credit ---.

Has not received credit for the scan. The task ID page shows it as invalid and has granted 0 credit.

Not sure what has happened to be honest.
ID: 876929 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 876964 - Posted: 18 Mar 2009, 20:21:55 UTC

That is...a very strange WU. Someone should get the attention of one of the admins on that one.. strange.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 876964 · Report as offensive
OzzFan Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Apr 02
Posts: 15691
Credit: 84,761,841
RAC: 28
United States
Message 876988 - Posted: 18 Mar 2009, 21:32:59 UTC

This is a weird one. I've PMed Dr. Korpela to take a look at this thread.
ID: 876988 · Report as offensive
Eric Korpela Project Donor
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 3 Apr 99
Posts: 1382
Credit: 54,506,847
RAC: 60
United States
Message 877226 - Posted: 19 Mar 2009, 17:28:04 UTC - in response to Message 876988.  

Looks like there's bug in the validator that marked an unvalidated result as validated. Fortunately the workunit itself isn't considered validated until a canonical result is identified.

I'm looking for the source of the problem.
@SETIEric@qoto.org (Mastodon)

ID: 877226 · Report as offensive
Bill Barto

Send message
Joined: 28 Jun 99
Posts: 864
Credit: 58,712,313
RAC: 91
United States
Message 877243 - Posted: 19 Mar 2009, 19:52:57 UTC - in response to Message 877226.  

Looks like there's bug in the validator that marked an unvalidated result as validated. Fortunately the workunit itself isn't considered validated until a canonical result is identified.

I'm looking for the source of the problem.


I'll go ahead and process the workunit.
ID: 877243 · Report as offensive
Bill Barto

Send message
Joined: 28 Jun 99
Posts: 864
Credit: 58,712,313
RAC: 91
United States
Message 877442 - Posted: 20 Mar 2009, 12:55:32 UTC - in response to Message 877243.  

OK, I processed the work unit and it validated against the computer that had been granted the zero credit. We both were granted credit and there is now a canonical result. Thanks for the interest in this.
ID: 877442 · Report as offensive
Profile ccappel
Avatar

Send message
Joined: 27 Jan 00
Posts: 362
Credit: 1,516,412
RAC: 0
United States
Message 877464 - Posted: 20 Mar 2009, 15:25:11 UTC - in response to Message 877442.  

Those poor two guys/gals who had the validate error/marked as invalid.
"Life is a tragedy for those who feel, and a comedy for those who think."

"I never get into an argument that I cannot win."
ID: 877464 · Report as offensive
Profile -=SuperG=-
Avatar

Send message
Joined: 3 Apr 99
Posts: 63
Credit: 89,161,651
RAC: 23
Canada
Message 878586 - Posted: 23 Mar 2009, 15:35:02 UTC - in response to Message 877464.  
Last modified: 23 Mar 2009, 15:37:26 UTC

I too am having issues with the "Completed, validation inconclusive" error. Anyhelp will be appreciated.

Also some trouble with a "client detached" error.

Host 3903055

Thanks..
Boinc Wiki




"Great spirits have always encountered violent opposition from mediocre minds." -Albert Einstein
ID: 878586 · Report as offensive
OzzFan Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Apr 02
Posts: 15691
Credit: 84,761,841
RAC: 28
United States
Message 878674 - Posted: 23 Mar 2009, 20:29:58 UTC - in response to Message 878586.  

"Completed, Validation inconclusive" is not an error per se, it is simply the validator letting you know that your result did not match closely enough to that of your quorum partner's so a third must be sent out to find which one is canonical.

The "client detached" errors I'm not entirely certain about. Have you been hitting the "reset project" button, or have you been detaching and re-attaching to the project trying to get more work?
ID: 878674 · Report as offensive
Profile -=SuperG=-
Avatar

Send message
Joined: 3 Apr 99
Posts: 63
Credit: 89,161,651
RAC: 23
Canada
Message 878723 - Posted: 23 Mar 2009, 23:37:58 UTC - in response to Message 878674.  

Hi, Thanks for the response OzzFan,

Problems all started when I checked this machine 1 morning back at the beginning of March.. checked the "Tasks" for their status like every other day and found that everything was marked: "Computation Error" Sadly there were probably 60 WUs. After that I could not get WUs to download and this PC sat idle for about 3 days or so.

I was reading the forums and at 1 point read to detach and reatach and then re-apply the location prefs... so I did. Helped a bit, machine downloaded about 20 WUs and thats it. It would not DL anymore until the next day and then another 20 WUs. Anyways, likely that is my "client detached" error right there. I am surprised that it took 2 weeks for that to clear itself up. Oh well.. we all know the systems have been very busy.

Here is a link to 1 of the failed WUs:

WU 1179033360

and here is a link to 1 of the "Completed, Validation inconclusive" WUs.

WU 1173439548

I will wait for a few more days to see what happens with the validating...


Boinc Wiki




"Great spirits have always encountered violent opposition from mediocre minds." -Albert Einstein
ID: 878723 · Report as offensive
Josef W. Segur
Volunteer developer
Volunteer tester

Send message
Joined: 30 Oct 99
Posts: 4504
Credit: 1,414,761
RAC: 0
United States
Message 878732 - Posted: 24 Mar 2009, 0:21:45 UTC - in response to Message 878723.  
Last modified: 24 Mar 2009, 0:28:18 UTC

Hi, Thanks for the response OzzFan,

Problems all started when I checked this machine 1 morning back at the beginning of March.. checked the "Tasks" for their status like every other day and found that everything was marked: "Computation Error" Sadly there were probably 60 WUs. After that I could not get WUs to download and this PC sat idle for about 3 days or so.

I was reading the forums and at 1 point read to detach and reatach and then re-apply the location prefs... so I did. Helped a bit, machine downloaded about 20 WUs and thats it. It would not DL anymore until the next day and then another 20 WUs. Anyways, likely that is my "client detached" error right there. I am surprised that it took 2 weeks for that to clear itself up. Oh well.. we all know the systems have been very busy.

Here is a link to 1 of the failed WUs:

WU 1179033360

Hmm, I think the "too many exit(0)s" on those was from Raistmer's team mod, at least task 1182105168 indicates you were using that package. I don't have a CUDA capable GPU so haven't followed the features of that package closely, but IIRC its method of trying to get work to the GPU is to do an exit 0, then when BOINC restarts the task it should end up using CUDA. There don't seem to be such errors since March 10, I'd advise not worrying about those unless they start happening again.

and here is a link to 1 of the "Completed, Validation inconclusive" WUs.

WU 1173439548

I will wait for a few more days to see what happens with the validating...

That's a very interesting case where your CPU result with 4 spikes found was compared against 2 CUDA results, one of which got an early result_overflow with 31 pulses but the other found 4 spikes. I expect you'll get credit when the 4th host reports, when work is computed on different hardware there's a higher likelihood of minor computational differences adding up to an imperfect match but it usually only causes one of the signals to be mismatched. Again, unless you get a lot of those you probably shouldn't worry, and I didn't see any in your recently reported work.


Edit: I almost forgot, the recent 'Client detached' indications are from 21 Mar 2009 22:07:38 UTC, so not related to the much earlier case you noted. There have been occasional reports of that happening spontaneously with no user action, and the servers were dealing with recovery from the science database problem at that time. The tasks will have been sent to other hosts, and that's not counted as a host error.
                                                              Joe
ID: 878732 · Report as offensive
Profile -=SuperG=-
Avatar

Send message
Joined: 3 Apr 99
Posts: 63
Credit: 89,161,651
RAC: 23
Canada
Message 878734 - Posted: 24 Mar 2009, 0:47:33 UTC

Heh.. well I had a big long post set but then hit preview and decided to follow 1 of the links.. oops.. clicked back and the page had expired... lol

anyways.. seems when I click the links (BSs and FreeDC) next to the profile, the wrong computer is being listed too.

wow I really screwed this 1 up.. yet it continues to crunch away..

I will keep an eye on it.. hopefully things clear themselves up soon.. :P

Thanks for the help.


Boinc Wiki




"Great spirits have always encountered violent opposition from mediocre minds." -Albert Einstein
ID: 878734 · Report as offensive

Message boards : Number crunching : Confusing results for old AP workunit.


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.