Do we want to believe the SSP status is real?
Author | Message |
---|---|
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
Just noticed the SSP looks current and that Carolyn, the replica database server, is shown running. However, the status is still shown as offline. Do we think that things might be returning to something more normal? Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
HAL9000 Send message Joined: 11 Sep 99 Posts: 6534 Credit: 196,805,888 RAC: 57 |
> Just noticed the SSP looks current and that Carolyn, the replica database server, is shown running. However, the status is still shown as offline. Do we think that things might be returning to something more normal?

You are free to believe in whatever you like. The Server Status is often just a small glimpse of what may, or may not, be the true status of the servers at the given timestamp. I imagine it is mostly SNMP data, which, while highly reliable, still doesn't guarantee a 100% accurate status, especially given the number of systems involved. SETI@home classic workunits: 93,865 CPU time: 863,447 hours Join the BP6/VP6 User Group: http://tinyurl.com/8y46zvu |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
Well, I was hoping that since today was a maintenance Tuesday, that they might have fixed Carolyn's problems and put the replica back into play. I was thinking that with it working they would have had time to get the server back in sync with the main database. I have been waiting on the replica and the AP validators to get around to processing two AP CPU tasks that have been hanging around since the first of the month. The tasks have been finished by me and my wingmen, just haven't been cleared for some reason. Other, newer tasks have been cleared already for some reason. It seems like these have been forgotten somehow. I need them validated to get my APR for one machine to be set finally for realistic estimated completion dates and times. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
HAL9000 Send message Joined: 11 Sep 99 Posts: 6534 Credit: 196,805,888 RAC: 57 |
> Well, I was hoping that since today was a maintenance Tuesday, that they might have fixed Carolyn's problems and put the replica back into play. I was thinking that with it working they would have had time to get the server back in sync with the main database. I have been waiting on the replica and the AP validators to get around to processing two AP CPU tasks that have been hanging around since the first of the month. The tasks have been finished by me and my wingmen, just haven't been cleared for some reason. Other, newer tasks have been cleared already for some reason. It seems like these have been forgotten somehow. I need them validated to get my APR for one machine to be set finally for realistic estimated completion dates and times.

We can always hope that getting parts or swapping bits around has happened to keep the project as stable as possible, but no news is no news. Sometimes it takes a few tries to get everything ironed out, and we don't know how busy they are. With the holiday this week they might technically be off all week. I'm really only aware of the campus schedule when someone mentions a semester starting or ending. SETI@home classic workunits: 93,865 CPU time: 863,447 hours Join the BP6/VP6 User Group: http://tinyurl.com/8y46zvu |
Jeff Buck Send message Joined: 11 Feb 00 Posts: 1441 Credit: 148,764,870 RAC: 0 |
> I have been waiting on the replica and the AP validators to get around to processing two AP CPU tasks that have been hanging around since the first of the month. The tasks have been finished by me and my wingmen, just haven't been cleared for some reason. Other, newer tasks have been cleared already for some reason. It seems like these have been forgotten somehow. I need them validated to get my APR for one machine to be set finally for realistic estimated completion dates and times.

I don't think that's related to the replica DB, although there have been some validation hiccups that seem to occur each time the system has gone down. I'm pretty sure that once a WU gets bypassed by the validator when the second task reports, it then has to wait for a "second chance" when the original WU reporting deadline rolls around. In the case of your 2 AP WUs, it looks like that window should roll around sometime tomorrow, one in the morning (PT) and one in early afternoon. Keep your fingers crossed! :^) |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
Thanks for the further information about "second chance" validation, Jeff. Wasn't aware that that mechanism existed. Yes, keeping my fingers crossed for good things to happen tomorrow with the original deadlines passing. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
Jeff Buck Send message Joined: 11 Feb 00 Posts: 1441 Credit: 148,764,870 RAC: 0 |
> Thanks for the further information about "second chance" validation, Jeff. Wasn't aware that that mechanism existed. Yes, keeping my fingers crossed for good things to happen tomorrow with the original deadlines passing.

I originally ran into this problem a couple of years ago (see Message 1586344 and earlier messages for details). I had found myself with 99 MB and 2 AP tasks that had reported but not validated in about a 10 minute black hole following a system outage. Richard Haselgrove was the one who brought up the second chance validation window, and he was proved correct for all of those WUs. I think it's just one of those things that sometimes happens when the various server apps like the validator are all coming back up to speed following an outage, and massive numbers of tasks are being reported. In fact, it probably happens to some small percentage of tasks during every restart, but only when it happens to one of us who's paying close attention does it ever get brought up here in the forums. |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
Looks like the replica is back and online. Only 65,000 seconds behind currently. Jeff, you were right, one of my tardy AP CPU tasks has validated. Now at 10, just the one more today and I will be official at 11. (yes the knobs go to eleven!) Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
kittyman Send message Joined: 9 Jul 00 Posts: 51468 Credit: 1,018,363,574 RAC: 1,004 |
And the kitties meow for carolyn! "Freedom is just Chaos, with better lighting." Alan Dean Foster |
Rune Bjørge Send message Joined: 5 Feb 00 Posts: 45 Credit: 30,508,204 RAC: 5 |
Seems like Carolyn is catching up too. Now it is 59,106 seconds behind. |
Jeff Buck Send message Joined: 11 Feb 00 Posts: 1441 Credit: 148,764,870 RAC: 0 |
> Jeff, you were right, one of my tardy AP CPU tasks has validated. Now at 10, just the one more today and I will be official at 11. (yes the knobs go to eleven!)

Well, it looks like your second one validated, but it also looks like there was too much blanking for it to count toward your graduation. It seems to me that either Joe Segur or Richard once mentioned what the maximum blanking percentage is that would allow those to count, but I've forgotten what the cutoff is. |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
Crap! Thanks again for pointing out things I don't know about. I figured since it didn't finish in 3 seconds or whatever like the 100% blanked ones, in fact it took almost a third of the normal CPU time on that machine, that it was going to fly. I didn't know that the task wouldn't count if it had that much blanking. I knew the 100% ones count as valid results but not completed. It looks like I will still have a very long wait to get to graduation. Especially with the dearth of AP tasks we have seen for the last 3 months or so. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14650 Credit: 200,643,578 RAC: 874 |
> Jeff, you were right, one of my tardy AP CPU tasks has validated. Now at 10, just the one more today and I will be official at 11. (yes the knobs go to eleven!)

@ https://setisvn.ssl.berkeley.edu/trac/browser/astropulse/server/validate/ap_result.cpp?desc=1#L127:

    if (this->fraction_blanked > 0.1) {
        log_messages.printf(SCHED_MSG_LOG::MSG_CRITICAL,
            "[RESULT#%ld] is an OVERFLOW result (%f%% blanked)\n",
            result.id, this->fraction_blanked*100);
        result.runtime_outlier=true;

Anything above 10% blanked doesn't count towards the '11 for APR'. |
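For anyone who wants to see what that cutoff means for their own task list, here is a minimal sketch of the rule as Richard describes it. Only the `0.1` threshold comes from the quoted `ap_result.cpp` lines. The names `kMaxBlankedFraction`, `is_runtime_outlier`, and `count_toward_apr` are made up for this illustration and are not part of the SETI@home or BOINC source, and the sketch assumes that a result flagged as a runtime outlier is exactly the kind that doesn't count toward the 11.

```cpp
#include <cassert>
#include <vector>

// Assumption: mirrors the 0.1 cutoff from the ap_result.cpp excerpt quoted in the thread.
constexpr double kMaxBlankedFraction = 0.1;

// Hypothetical helper: true when a result with this blanked fraction (0.0 to 1.0)
// would be flagged as a runtime outlier, i.e. strictly more than 10% blanked.
bool is_runtime_outlier(double fraction_blanked) {
    assert(fraction_blanked >= 0.0 && fraction_blanked <= 1.0);
    return fraction_blanked > kMaxBlankedFraction;
}

// Hypothetical helper: given the blanked fractions of validated AP results,
// counts how many would still count toward the "11 for APR" total.
int count_toward_apr(const std::vector<double>& blanked_fractions) {
    int counted = 0;
    for (double fraction : blanked_fractions) {
        if (!is_runtime_outlier(fraction)) {
            ++counted;
        }
    }
    return counted;
}
```

Note that the comparison is strict, so a task at exactly 10% blanked would still count under this reading, while a partially blanked task that ran for a third of the normal CPU time could easily be over the line.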
Jeff Buck Send message Joined: 11 Feb 00 Posts: 1441 Credit: 148,764,870 RAC: 0 |
Well, if it makes you feel any better, I have an old IBM ThinkPad that only averages about 1 AP a month. In the 2+ years since AP v7 came along, the "Consecutive valid tasks" total has reached 23, but only 7 of those count toward "Number of tasks completed", so I figure it'll be at least another year before I hit the magic 11 number. The silver lining, though, is that until it reaches 11, almost every one of those 100% blanked tasks still gets 400+ credits, just about the same for a minute's work as for 48 hours of crunching. A bonus for my wingmen, too. :^) |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
Thanks for chiming in, Richard. Now both Jeff and I know the threshold cutoff for validated, completed AP tasks. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.