Waiting for Validation

Message boards : Number crunching : Waiting for Validation

Astro
Volunteer tester
Joined: 16 Apr 02
Posts: 8026
Credit: 600,015
RAC: 0
Message 79028 - Posted: 13 Feb 2005, 13:46:06 UTC

Yesterday the number stuck around 143K, give or take, and then this morning I see:

Ready to send 1,279,362
In progress 1,713,462
Waiting for validation 199,788

Given that the new server was reducing this number by 6k/hr it should be down to zero (relative to new incoming units).
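
As a rough check on that expectation, here is a minimal sketch using the figures quoted above (about 143K waiting, draining at about 6K per hour). The function name is made up for illustration, and it assumes the net drain rate stays constant, which the later numbers suggest it did not:

```python
def hours_to_drain(backlog: int, net_drain_per_hour: int) -> float:
    """Hours until the backlog reaches zero at a constant net drain rate."""
    if net_drain_per_hour <= 0:
        raise ValueError("queue is not draining")
    return backlog / net_drain_per_hour


# Figures from the post: ~143K waiting yesterday, shrinking by ~6K per hour.
print(round(hours_to_drain(143_000, 6_000), 1))  # 23.8
```

So at that pace the backlog would have cleared in roughly a day, which is why a jump to almost 200K stands out.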

Hope nothing's broken.

tony

ID: 79028 · Report as offensive
Saimek
Joined: 25 Jan 00
Posts: 121
Credit: 454,423
RAC: 0
Poland
Message 79029 - Posted: 13 Feb 2005, 13:49:29 UTC

Yeah, I noticed that today also =) Strange... :/
ID: 79029 · Report as offensive
Dave Mickey
Joined: 19 Oct 99
Posts: 178
Credit: 11,122,965
RAC: 0
United States
Message 79032 - Posted: 13 Feb 2005, 13:58:09 UTC


Yes, a very clear reversal of fortunes for the Validation Q -
it seems to have spent the last 10 or 12 hours adding several
thousand results per hour instead of clearing at that rate
as it had been since Friday. :(

Possible that an early-Sunday a.m. (CA time) cron job backup
or some such kicked in and took away resources from the DB
or the validator? Hmmm.

Dave
ID: 79032 · Report as offensive
Antjest
Volunteer tester
Joined: 27 Oct 99
Posts: 27
Credit: 19,796,139
RAC: 0
Slovenia
Message 79033 - Posted: 13 Feb 2005, 14:00:33 UTC

They probably removed WUs on the older server when the queue got extremely large and are now inserting them back.
ID: 79033 · Report as offensive
Dave Mickey
Joined: 19 Oct 99
Posts: 178
Credit: 11,122,965
RAC: 0
United States
Message 79038 - Posted: 13 Feb 2005, 14:40:02 UTC


Well, whatever it is continues to happen at a good clip.

The last hour on the stat pages shows that the Q grew by
another 15,000. I think that's much faster growth
than what was occurring when the old machine was
trying (but not able) to keep up. So either the Q is
getting much more input, or having much less validation
done, than was happening before the changeover.

Dave
ID: 79038 · Report as offensive
Astro
Volunteer tester
Joined: 16 Apr 02
Posts: 8026
Credit: 600,015
RAC: 0
Message 79041 - Posted: 13 Feb 2005, 14:50:45 UTC

Another 48k and we're right back to where we started after the migration. Hope this upward trend does not continue.

Ready to send 1,275,094
In progress 706,226
Waiting for validation 210,906
ID: 79041 · Report as offensive
Dunc
Volunteer tester
Joined: 3 Jul 02
Posts: 129
Credit: 2,166,460
RAC: 0
United States
Message 79044 - Posted: 13 Feb 2005, 14:59:41 UTC
Last modified: 13 Feb 2005, 15:01:48 UTC

The validator is still working.

This

[embedded content not preserved]

is a good barometer of what is happening :)

Dunc

ID: 79044 · Report as offensive
Paul D. Buck
Volunteer tester
Joined: 19 Jul 00
Posts: 3898
Credit: 1,158,042
RAC: 0
United States
Message 79049 - Posted: 13 Feb 2005, 15:07:13 UTC

In theory, the average number of WU and Results should be the same each day. But, with the connect every "x" being set differently, and people like me that "flush" our results daily, these numbers are not likely to ever be in balance.

So, minor fluctuations in results returned add turbulence to the process flow and may add to our troubles.
ID: 79049 · Report as offensive
Roks
Joined: 20 Dec 02
Posts: 55
Credit: 137,776
RAC: 0
Slovenia
Message 79054 - Posted: 13 Feb 2005, 15:15:43 UTC

I agree, the validator IS working, but I have no idea why the number keeps growing. The most likely reason is what Paul said. I have no other explanation.
ID: 79054 · Report as offensive
Paul D. Buck
Volunteer tester
Joined: 19 Jul 00
Posts: 3898
Credit: 1,158,042
RAC: 0
United States
Message 79057 - Posted: 13 Feb 2005, 15:24:25 UTC - in response to Message 79054.  

> I agree, the validator IS working, but I have no idea why the number keeps
> growing. The most likely reason is what Paul told. I have no other
> explanation.

Of course, maybe now the new server is spinning on the "garbage" collection spindle ...

ID: 79057 · Report as offensive
Neil Walker
Volunteer tester
Joined: 23 May 99
Posts: 288
Credit: 18,101,056
RAC: 0
United Kingdom
Message 79059 - Posted: 13 Feb 2005, 15:30:25 UTC

Remember that count was out by around 150k to start with. I think it's found those "missing" WUs. A few hours ago, it looked like it was going very slowly yet I was getting credit at a phenomenal rate. ;)


Be lucky

Neil



ID: 79059 · Report as offensive
Astro
Volunteer tester
Joined: 16 Apr 02
Posts: 8026
Credit: 600,015
RAC: 0
Message 79064 - Posted: 13 Feb 2005, 15:40:11 UTC - in response to Message 79059.  

> Remember that count was out by around 150k to start with. I think it's found
> those "missing" WUs. A few hours ago, it looked like it was going very slowly
> yet I was getting credit at a phenominal rate. ;)
>
Yesterday the count hovered around 140K, 143K, 144K for most of the day. So you think that's some "baseline" (for some reason), and that would be the theoretical zero? And today's climb by 70K would be some normal fluctuation?

I think Paul's closer to the truth, and that the DB is undergoing some kind of internal function, and that the 70K is the inbound returns not being validated.

What do you think? Anyone?
ID: 79064 · Report as offensive
Pooh Bear 27
Volunteer tester
Joined: 14 Jul 03
Posts: 3224
Credit: 4,603,826
RAC: 0
United States
Message 79080 - Posted: 13 Feb 2005, 16:17:02 UTC

Could Berkeley have taken some of the WU offline for a while, when the servers were going nuts, and have over the weekend slowly incorporated them back in, and today they dumped the last of the load in?

It's going in the correct direction again, and I have seen a couple of my older WU that had all the correct responses etc., finally get validated.

And I hit 100K today! WOO!



My movie https://vimeo.com/manage/videos/502242
ID: 79080 · Report as offensive
Iztok s52d (and friends)
Joined: 12 Jan 01
Posts: 136
Credit: 393,469,375
RAC: 116
Slovenia
Message 79082 - Posted: 13 Feb 2005, 16:21:15 UTC - in response to Message 79064.  

> > Remember that count was out by around 150k to start with. I think it's found
> > those "missing" WUs. A few hours ago, it looked like it was going very slowly
> > yet I was getting credit at a phenominal rate. ;)
> >
> Yesterday the count hovered around 140K, 143K, 144K, for most the day. so you
> think that's some "Baseline"(for some reason), and then that would be the
> theoretical zero? Todays climb by 70K would be some normal fluctuation?
>
> I think Paul's closer to the truth, and that the DB is undergoing some kind of
> internal function, and that the 70K is the inbound returns not being
> validated.
>
> What do you think? anyone?
>
Hi! It looks like the missing WUs have been found (see the thread about the missing 150k: http://setiweb.ssl.berkeley.edu/forum_thread.php?id=11303 ).
The number in the stats was wrong before.

On the other hand, I have been getting fewer credits per hour over the last few hours,
compared to yesterday.

Might it be related to the validation of 3-result/4-result WUs?
Or to WUs with no proper flag in the DB?

Whatever. A few more days of fast credit flow won't hurt.

73
Iztok


ID: 79082 · Report as offensive
Divide Overflow
Volunteer tester
Joined: 3 Apr 99
Posts: 365
Credit: 131,684
RAC: 0
United States
Message 79105 - Posted: 13 Feb 2005, 17:04:11 UTC - in response to Message 79059.  
Last modified: 13 Feb 2005, 17:04:45 UTC

> Remember that count was out by around 150k to start with. I think it's found
> those "missing" WUs. A few hours ago, it looked like it was going very slowly
> yet I was getting credit at a phenominal rate. ;)

I agree, S-C. The "waiting for validation" queue is an estimate, and that counter was zeroed out a few times over the past week or two. I'm still seeing a steady flow of credits being validated, so I think that the validator is working fine, it's just this counter that's adjusting itself to now account for WU's that it's rediscovered. As I look at results records, many old "stuck" units have now been validated properly or had another WU sent out.

Paul's theory is certainly accurate too. There will always be fluctuations in the volume of WU's returned. With many people out there increasing their cache sizes recently, some of these results are coming home to roost in large numbers!

ID: 79105 · Report as offensive
Astro
Volunteer tester
Joined: 16 Apr 02
Posts: 8026
Credit: 600,015
RAC: 0
Message 79144 - Posted: 13 Feb 2005, 18:51:22 UTC

Here's Rom's answer:

Rom Walton
Forum moderator
Project developer

Joined: Apr 27, 2000
Posts: 283
ID: 85465
Posted: 13 Feb 2005 18:47:17 UTC

The disk array that was holding the log files for the backend components became full and caused all the components to stop.

Jeff went in and manually cycled the log files and moved them to a different array.

----- Rom

BOINC Development Team, U.C. Berkeley

ID: 79144 · Report as offensive
Nuadormrac
Volunteer tester
Joined: 7 Apr 00
Posts: 136
Credit: 1,703,351
RAC: 0
United States
Message 79313 - Posted: 14 Feb 2005, 7:46:11 UTC
Last modified: 14 Feb 2005, 7:48:03 UTC

I noticed the increase in "number to be validated" as it's listed by the counter on the server status page, myself.

http://setiweb.ssl.berkeley.edu/sah_status.html

Waiting for validation 176,821

as of that page right now. It was steadily dropping since the new servers came up. Does make me wonder if another validator (perhaps on another server or CPU) might help with getting these validated, especially as we now have a new server that according to the SETI team is under-utilized and not likely to not be up to the task itself...

Then again, someone does bring up a point with people having perhaps increased their cache sizes due to the difficulty people were having in getting WUs last week or so. How many of those WUs, with a possible increase, might be waiting for the quorum? Honestly couldn't say, and I'm not inclined to take a tabulation on every user's account for how many WUs they have pending, for which a quorum hasn't been reached. The people at Berkeley probably have a better way to query their own database to find such things out anyhow than any such method we could utilize... And yes, I know we're looking at an estimate while following the trend it displays as well.

ID: 79313 · Report as offensive
Toby
Volunteer tester
Joined: 26 Oct 00
Posts: 1005
Credit: 6,366,949
RAC: 0
United States
Message 79321 - Posted: 14 Feb 2005, 8:22:04 UTC - in response to Message 79313.  

> as of that page right now. It was steadily dropping since the new servers
> came up. Does make me wonder if another validator (perhaps on another server
> or CPU) might help with getting these validated, especially as we now have a
> new server that according to the SETI team is under-utilized and not likely to
> not be up to the task itself...

"under-utilized" means it could do more so it is up to the task. The "waiting for validation" queue has been dropping at up to 6,000 per hour from what I have seen. This means that the validator is validating all the results as they come in PLUS 6,000 per hour so there is still room for growth here. Add to that the fact that they can add another 2 CPUs and several GB more RAM to the thing and I think it is more than up to the task. Besides, the validator itself may not be the bottleneck. For some time now it has always been the database that has been the bottleneck in the system so adding another validator wouldn't even help. I'm not sure if this has changed with the new hardware or not.
A member of The Knights Who Say NI!
For rankings, history graphs and more, check out:
My BOINC stats site
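
To make the arithmetic in this post concrete, here is a small sketch of the relationship between arrivals, validations and the queue size. The 40,000 per hour arrival rate is a made-up number for illustration only (the thread gives no arrival figure), and the function name is hypothetical:

```python
def validation_rate(arrivals_per_hour: float, queue_change_per_hour: float) -> float:
    """Results validated per hour, given arrivals and the observed queue change.

    queue_change = arrivals - validated, so validated = arrivals - queue_change.
    """
    return arrivals_per_hour - queue_change_per_hour


# Hypothetical arrival rate of 40,000 results/hour (not a figure from the thread).
# A queue shrinking by 6,000/hour means the validator clears arrivals plus 6,000.
print(validation_rate(40_000, -6_000))  # 46000

# The same formula for a queue growing by 15,000/hour, as reported earlier:
print(validation_rate(40_000, 15_000))  # 25000
```

The same subtraction shows why a growing queue does not by itself say whether input went up or validation went down; only the difference is visible on the status page.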
ID: 79321 · Report as offensive
Astro
Volunteer tester
Joined: 16 Apr 02
Posts: 8026
Credit: 600,015
RAC: 0
Message 79364 - Posted: 14 Feb 2005, 14:18:22 UTC
Last modified: 14 Feb 2005, 14:19:45 UTC

The server status page seems to be stuck. It hasn't updated since 14 Feb 05:10 UTC (12:10 am Eastern, Feb 14).

[As of 14 Feb 2005 5:10:07 UTC]

Ready to send 1,087,643
In progress 920,822
Waiting for validation 176,821

Patiently waiting
ID: 79364 · Report as offensive
virex
Volunteer tester
Joined: 14 Jul 01
Posts: 17
Credit: 294,731
RAC: 0
United States
Message 79366 - Posted: 14 Feb 2005, 14:32:28 UTC - in response to Message 79313.  

> How many of those WUs with a possible increase, might be waiting
> for the quorum? Honestly couldn't say, and not inclined to take a tabulation
> on every user's account for how many WUs they have pending, for which a quorum
> hasn't been reached.

The waiting for validation queue only shows WUs that are ready for validation. The transitioner only sends WUs that have a quorum to the validator for validation, so WUs that are waiting on results would not be in that list.
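
A toy illustration of that rule, in case it helps. This is not BOINC's actual schema or code: the `WorkUnit` class, its fields and the quorum size of 3 are all assumptions made up for the sketch.

```python
from dataclasses import dataclass

MIN_QUORUM = 3  # assumed quorum size, for illustration only


@dataclass
class WorkUnit:
    name: str
    results_returned: int
    validated: bool = False


def waiting_for_validation(workunits, min_quorum=MIN_QUORUM):
    """Work units that have a quorum of returned results but are not yet validated."""
    return [
        wu for wu in workunits
        if wu.results_returned >= min_quorum and not wu.validated
    ]


wus = [
    WorkUnit("wu_a", 3),                   # quorum reached, counted as waiting
    WorkUnit("wu_b", 1),                   # still waiting on results, not counted
    WorkUnit("wu_c", 4, validated=True),   # already validated, not counted
]
print([wu.name for wu in waiting_for_validation(wus)])  # ['wu_a']
```

Under this model, bigger caches delay when results come back but do not inflate the counter until a quorum is actually in.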
ID: 79366 · Report as offensive


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.