Do we want to believe the SSP status is real?

Message boards : Number crunching : Do we want to believe the SSP status is real?

Profile Keith Myers Special Project $250 donor
Volunteer tester
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1831948 - Posted: 23 Nov 2016, 1:06:51 UTC

Just noticed the SSP looks current and that Carolyn, the replica database server, is shown as running. However, the status is still shown as offline. Do we think that things might be returning to something more normal?
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1831948 · Report as offensive
Profile HAL9000
Volunteer tester
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1831955 - Posted: 23 Nov 2016, 1:39:51 UTC - in response to Message 1831948.  

Just noticed the SSP looks current and that Carolyn, the replica database server, is shown as running. However, the status is still shown as offline. Do we think that things might be returning to something more normal?

You are free to believe whatever you like. The Server Status is often just a small glimpse of what may, or may not, be the true status of the servers at the given timestamp.
I imagine it is mostly SNMP data, which, while highly reliable, still doesn't qualify as a 100% accurate status, especially given the number of systems involved.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[/url]
ID: 1831955 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1831969 - Posted: 23 Nov 2016, 2:22:09 UTC - in response to Message 1831955.  

Well, I was hoping that since today was a maintenance Tuesday, that they might have fixed Carolyn's problems and put the replica back into play. I was thinking that with it working they would have had time to get the server back in sync with the main database. I have been waiting on the replica and the AP validators to get around to processing two AP CPU tasks that have been hanging around since the first of the month. The tasks have been finished by me and my wingmen, just haven't been cleared for some reason. Other, newer tasks have been cleared already for some reason. It seems like these have been forgotten somehow. I need them validated to get my APR for one machine to be set finally for realistic estimated completion dates and times.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1831969 · Report as offensive
Profile HAL9000
Volunteer tester
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1831972 - Posted: 23 Nov 2016, 2:34:34 UTC - in response to Message 1831969.  
Last modified: 23 Nov 2016, 2:38:54 UTC

Well, I was hoping that since today was a maintenance Tuesday, that they might have fixed Carolyn's problems and put the replica back into play. I was thinking that with it working they would have had time to get the server back in sync with the main database. I have been waiting on the replica and the AP validators to get around to processing two AP CPU tasks that have been hanging around since the first of the month. The tasks have been finished by me and my wingmen, just haven't been cleared for some reason. Other, newer tasks have been cleared already for some reason. It seems like these have been forgotten somehow. I need them validated to get my APR for one machine to be set finally for realistic estimated completion dates and times.

We can always hope that getting parts or swapping bits around has happened to keep the project as stable as possible, but no news is no news.
Sometimes things take a few tries to get it all ironed out and we don't know how busy they are. With the holiday this week they might technically be off all week. I'm really only aware of the campus schedule when someone mentions a semester starting or ending.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[/url]
ID: 1831972 · Report as offensive
Profile Jeff Buck Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Joined: 11 Feb 00
Posts: 1441
Credit: 148,764,870
RAC: 0
United States
Message 1831973 - Posted: 23 Nov 2016, 2:43:54 UTC - in response to Message 1831969.  

I have been waiting on the replica and the AP validators to get around to processing two AP CPU tasks that have been hanging around since the first of the month. The tasks have been finished by me and my wingmen, just haven't been cleared for some reason. Other, newer tasks have been cleared already for some reason. It seems like these have been forgotten somehow. I need them validated to get my APR for one machine to be set finally for realistic estimated completion dates and times.

I don't think that's related to the replica DB, although there were some validation hiccups that have seemed to occur each time the system has gone down. I'm pretty sure that once a WU gets bypassed by the validator when the second task reports, it then has to wait for a "second chance" when the original WU reporting deadline rolls around. In the case of your 2 AP WUs, it looks like that window should roll around sometime tomorrow, one in the morning (PT) and one in early afternoon. Keep your fingers crossed! :^)
ID: 1831973 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1831976 - Posted: 23 Nov 2016, 3:02:07 UTC - in response to Message 1831973.  

Thanks for the further information about "second chance" validation, Jeff. Wasn't aware that that mechanism existed. Yes, keeping my fingers crossed for good things to happen tomorrow with the original deadlines passing.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1831976 · Report as offensive
Profile Jeff Buck Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Joined: 11 Feb 00
Posts: 1441
Credit: 148,764,870
RAC: 0
United States
Message 1831982 - Posted: 23 Nov 2016, 3:39:44 UTC - in response to Message 1831976.  

Thanks for the further information about "second chance" validation, Jeff. Wasn't aware that that mechanism existed. Yes, keeping my fingers crossed for good things to happen tomorrow with the original deadlines passing.

I originally ran into this problem a couple of years ago (see Message 1586344 and earlier messages for details). I had found myself with 99 MB and 2 AP tasks that had reported but not validated in about a 10-minute black hole following a system outage. Richard Haselgrove was the one who brought up the second chance validation window, and he was proved correct for all of those WUs.

I think it's just one of those things that sometimes happens when the various server apps like the validator are all coming back up to speed following an outage, and massive numbers of tasks are being reported. In fact, it probably happens to some small percentage of tasks during every restart, but only when it happens to one of us who's paying close attention does it ever get brought up here in the forums.
ID: 1831982 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1832034 - Posted: 23 Nov 2016, 17:11:11 UTC

Looks like the replica is back and online. Only 65,000 seconds behind currently.

Jeff, you were right, one of my tardy AP CPU tasks has validated. Now at 10, just the one more today and I will be official at 11. (yes the knobs go to eleven!)
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1832034 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1832035 - Posted: 23 Nov 2016, 17:13:23 UTC

And the kitties meow for carolyn!
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1832035 · Report as offensive
Profile Rune Bjørge

Joined: 5 Feb 00
Posts: 45
Credit: 30,508,204
RAC: 5
Norway
Message 1832049 - Posted: 23 Nov 2016, 18:43:49 UTC - in response to Message 1832034.  

Seems like Carolyn is catching up too. Now it is 59,106 seconds behind.
ID: 1832049 · Report as offensive
Profile Jeff Buck Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Joined: 11 Feb 00
Posts: 1441
Credit: 148,764,870
RAC: 0
United States
Message 1832078 - Posted: 23 Nov 2016, 22:26:35 UTC - in response to Message 1832034.  
Last modified: 23 Nov 2016, 22:27:05 UTC

Jeff, you were right, one of my tardy AP CPU tasks has validated. Now at 10, just the one more today and I will be official at 11. (yes the knobs go to eleven!)

Well, it looks like your second one validated, but it also looks like there was too much blanking for it to count toward your graduation. It seems to me that either Joe Segur or Richard once mentioned what the maximum blanking percentage is that would allow those to count, but I've forgotten what the cutoff is.
ID: 1832078 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1832080 - Posted: 23 Nov 2016, 22:42:54 UTC - in response to Message 1832078.  

Crap! Thanks again for pointing out things I don't know about. I figured since it didn't finish in 3 seconds or whatever like the 100% blanked ones, in fact it took almost a third of the normal CPU time on that machine, that it was going to fly. I didn't know that the task wouldn't count if it had that much blanking. I knew the 100% ones count as valid results but not completed. It looks like I will still have a very long wait to get to graduation. Especially with the dearth of AP tasks we have seen for the last 3 months or so.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1832080 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1832086 - Posted: 23 Nov 2016, 23:08:11 UTC - in response to Message 1832078.  

Jeff, you were right, one of my tardy AP CPU tasks has validated. Now at 10, just the one more today and I will be official at 11. (yes the knobs go to eleven!)

Well, it looks like your second one validated, but it also looks like there was too much blanking for it to count toward your graduation. It seems to me that either Joe Segur or Richard once mentioned what the maximum blanking percentage is that would allow those to count, but I've forgotten what the cutoff is.

@ https://setisvn.ssl.berkeley.edu/trac/browser/astropulse/server/validate/ap_result.cpp?desc=1#L127:

127	    if (this->fraction_blanked > 0.1) {
128	        log_messages.printf(SCHED_MSG_LOG::MSG_CRITICAL,
129	                            "[RESULT#%ld] is an OVERFLOW result (%f%% blanked)\n",
130	                            result.id, this->fraction_blanked*100);
131	        result.runtime_outlier=true;

Anything above 10% blanked doesn't count towards the '11 for APR'.
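
To make the cutoff concrete, here is a minimal sketch of the same comparison in standalone form. The helper names (`is_runtime_outlier`, `count_toward_apr`) and the constant are hypothetical and are not taken from the Astropulse source; the sketch only assumes, as described above, that a result flagged `runtime_outlier` is left out of the count toward 11.

```cpp
#include <cstddef>
#include <vector>

// Cutoff copied from the quoted validator line: fraction_blanked > 0.1.
constexpr double kBlankingCutoff = 0.1;

// Hypothetical helper: true when the quoted check would flag the result
// as a runtime outlier (strictly more than 10% of the data blanked).
bool is_runtime_outlier(double fraction_blanked) {
    return fraction_blanked > kBlankingCutoff;
}

// Hypothetical helper: counts how many results in a list would NOT be
// flagged, i.e. the ones assumed to count toward the 11 needed for APR.
std::size_t count_toward_apr(const std::vector<double>& blanked_fractions) {
    std::size_t counted = 0;
    for (double fraction : blanked_fractions) {
        if (!is_runtime_outlier(fraction)) {
            ++counted;
        }
    }
    return counted;
}
```

Note that the comparison is strict, so a task at exactly 10% blanked would not be flagged by this check, while a fully blanked task (1.0) always is.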
ID: 1832086 · Report as offensive
Profile Jeff Buck Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Joined: 11 Feb 00
Posts: 1441
Credit: 148,764,870
RAC: 0
United States
Message 1832089 - Posted: 23 Nov 2016, 23:18:26 UTC - in response to Message 1832080.  

Well, if it makes you feel any better, I have an old IBM ThinkPad that only averages about 1 AP a month. In the 2+ years since AP v7 came along, the "Consecutive valid tasks" total has reached 23, but only 7 of those count toward "Number of tasks completed", so I figure it'll be at least another year before I hit the magic 11 number. The silver lining, though, is that until it reaches 11, almost every one of those 100% blanked tasks still gets 400+ credits, just about the same for a minute's work as for 48 hours of crunching. A bonus for my wingmen, too. :^)
ID: 1832089 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1832094 - Posted: 24 Nov 2016, 0:00:18 UTC - in response to Message 1832086.  



127	    if (this->fraction_blanked > 0.1) {
128	        log_messages.printf(SCHED_MSG_LOG::MSG_CRITICAL,
129	                            "[RESULT#%ld] is an OVERFLOW result (%f%% blanked)\n",
130	                            result.id, this->fraction_blanked*100);
131	        result.runtime_outlier=true;

Anything above 10% blanked doesn't count towards the '11 for APR'.

Thanks for chiming in, Richard. Now both Jeff and I know the threshold cutoff for validated, completed AP tasks.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1832094 · Report as offensive



 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.