(why is) ap_validate (synergy) not running

Message boards : Number crunching : (why is) ap_validate (synergy) not running


terencewee*

Joined: 10 Oct 09
Posts: 53
Credit: 7,022,510
RAC: 0
Malaysia
Message 1136855 - Posted: 6 Aug 2011, 17:07:14 UTC
Last modified: 6 Aug 2011, 17:27:00 UTC

Completed AP WUs are piling up (11k+ at present). Any reason why ap_validate (1/2/3) is not running?

Could it be because all the AP "tapes" are completed?

I see AP assimilators are running, but the queue is 0.

Just trying to understand better.

Thanks in advance.

terencewee*
Sic itur ad astra.

Richard Haselgrove
Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 11140
Credit: 83,766,343
RAC: 46,195
United Kingdom
Message 1136883 - Posted: 6 Aug 2011, 17:50:54 UTC

I imagine that the short answer is "because it's the weekend".

Matt Lebofsky did post recently (message 1112205):

There are some broken astropulse results clogging one of the validators (which is why it shows up on red on the status page). We'll have to figure out an automated way to detect these results and push them through (it's a real pain to do by hand).

We might be suffering a recurrence of that - I don't know if they had any luck in working out what exactly was 'broken' about the results and where they were coming from - or even if they had time to look.

And BTW - I think that Joe Segur explained recently that only ap_validate3 was active at the moment - 1 and 2 are left over from earlier Astropulse runs, and shouldn't have any work to do this long after the event.

arkayn
Project Donor
Volunteer tester

Joined: 14 May 99
Posts: 4097
Credit: 51,576,341
RAC: 968
United States
Message 1136891 - Posted: 6 Aug 2011, 18:03:02 UTC
Last modified: 6 Aug 2011, 18:03:42 UTC

And don't forget that Synergy is running on half of its normal RAM right now as well.

The short story is we just plucked 48GB of memory out of synergy (back-end compute server) and added it to oscar (the main science database server).


IFRS
Volunteer tester

Joined: 21 May 99
Posts: 1731
Credit: 258,892,465
RAC: 0
Brazil
Message 1137442 - Posted: 7 Aug 2011, 23:24:57 UTC

Bah, that sux. I run only AP on CPU (I thought it's better paid and keeps fewer units in the cache), and when this happens my RAC just stalls.


terencewee*

Joined: 10 Oct 09
Posts: 53
Credit: 7,022,510
RAC: 0
Malaysia
Message 1137487 - Posted: 8 Aug 2011, 2:57:43 UTC

Thanks, guys. Now I know what to expect during "the weekend". :D


@Firehawk:
No kidding. I lost 7th place in the recent SETI Challenge over at BOINCstats because ap_validate was not running.

terencewee*
Sic itur ad astra.

rob smith
Project Donor
Volunteer tester

Joined: 7 Mar 03
Posts: 13336
Credit: 154,759,972
RAC: 117,849
United Kingdom
Message 1137551 - Posted: 8 Aug 2011, 7:24:40 UTC - in response to Message 1137442.  

Bah, that sux. I run only AP on CPU (I thought it's better paid and keeps fewer units in the cache), and when this happens my RAC just stalls.


Don't forget that, even if the assimilators are running, you won't get credit until your wingman has completed his processing of that WU. And that assumes he gets it back in time and the two results validate against each other. If either of these conditions "fails", you have to wait for it to be sent out to someone else to process, return...

(Is there a limit on how often a WU can be sent out before it's declared "dead"?)
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?

Mike
Volunteer tester

Joined: 17 Feb 01
Posts: 29579
Credit: 49,101,342
RAC: 17,148
Germany
Message 1137559 - Posted: 8 Aug 2011, 8:30:52 UTC - in response to Message 1137551.  

Bah, that sux. I run only AP on CPU (I thought it's better paid and keeps fewer units in the cache), and when this happens my RAC just stalls.


Don't forget that, even if the assimilators are running, you won't get credit until your wingman has completed his processing of that WU. And that assumes he gets it back in time and the two results validate against each other. If either of these conditions "fails", you have to wait for it to be sent out to someone else to process, return...

(Is there a limit on how often a WU can be sent out before it's declared "dead"?)


Yes, it's 5/10/10

Max 5 errors and 10 in total.

With each crime and every kindness we birth our future.

kittyman
Project Donor
Volunteer tester

Joined: 9 Jul 00
Posts: 45918
Credit: 815,237,509
RAC: 124,954
United States
Message 1137627 - Posted: 8 Aug 2011, 16:30:11 UTC - in response to Message 1137551.  



(Is there a limit on how often a WU can be sent out before it's declared "dead"?)

Yes....

max # of error/total/success tasks 5, 10, 5

It is listed in the WU details for every WU sent out.
Cats.....what more does one need?

Have made friends in this life.
Most were cats.

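For anyone wondering how the "error/total/success 5, 10, 5" limits from the last post might play out, here is a minimal sketch in Python. It is only an illustration, not the project's server code: the names `WorkunitLimits` and `can_send_another` are hypothetical, the defaults are the values quoted in the thread, and treating a limit as hit once a count reaches it (rather than exceeds it) is an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WorkunitLimits:
    """Per-WU limits as quoted in the thread (error / total / success)."""
    max_error: int = 5
    max_total: int = 10
    max_success: int = 5


def can_send_another(errors: int, successes: int, total: int,
                     validated: bool,
                     limits: WorkunitLimits = WorkunitLimits()) -> bool:
    """Return True if the WU could still go out to one more host.

    Assumption: the WU is given up ("dead") as soon as any count
    reaches its limit; the real server may compare differently.
    """
    if validated:
        # Two results already agreed, so nothing more needs sending.
        return False
    if errors >= limits.max_error:
        return False
    if total >= limits.max_total:
        return False
    if successes >= limits.max_success:
        return False
    return True


# Two wingmen returned results that did not match: a third copy can go out.
print(can_send_another(errors=0, successes=2, total=2, validated=False))
# Five errored tasks: under the assumption above, the WU is dead.
print(can_send_another(errors=5, successes=0, total=5, validated=False))
```

The first call corresponds to the case described earlier in the thread, where two results fail to validate against each other and the WU has to wait for someone else to process it.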



 
©2016 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.