Panic Mode On (113) Server Problems?

Message boards : Number crunching : Panic Mode On (113) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 11 · 12 · 13 · 14 · 15 · 16 · 17 . . . 37 · Next

AuthorMessage
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13855
Credit: 208,696,464
RAC: 304
Australia
Message 1958003 - Posted: 1 Oct 2018, 7:07:25 UTC

Ready-to-send = 0 so that's it till at least opening time.
Only resends till then.
Grant
Darwin NT
ID: 1958003 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 36864
Credit: 261,360,520
RAC: 489
Australia
Message 1958005 - Posted: 1 Oct 2018, 7:17:06 UTC
Last modified: 1 Oct 2018, 7:17:23 UTC

*Prepare for a crash landing Mr. Worf."
ID: 1958005 · Report as offensive
Profile Stargate (SA)
Volunteer tester
Avatar

Send message
Joined: 4 Mar 10
Posts: 1854
Credit: 2,258,721
RAC: 0
Australia
Message 1958006 - Posted: 1 Oct 2018, 7:29:49 UTC

"Saucer separated"
ID: 1958006 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1958017 - Posted: 1 Oct 2018, 11:55:45 UTC - in response to Message 1958005.  

*Prepare for a crash landing Mr. Worf."


. . It must be nearly time for the maintenance outage, the system is in limbo ...

. . Lately every week just before the maintenance period there is a problem getting work ...

Stephen

:(
ID: 1958017 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51478
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1958021 - Posted: 1 Oct 2018, 12:27:04 UTC

I just sent the word out about the splitters, but of course it's only 5:27AM in Berkeley.
Meow.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1958021 · Report as offensive
Profile Unixchick Project Donor
Avatar

Send message
Joined: 5 Mar 12
Posts: 815
Credit: 2,361,516
RAC: 22
United States
Message 1958039 - Posted: 1 Oct 2018, 14:20:38 UTC

it is after 7am in California now.

On the bright side the db purge numbers are nice and low, and there are over 600K resends ready for assimilation.
ID: 1958039 · Report as offensive
BetelgeuseFive Project Donor
Volunteer tester

Send message
Joined: 6 Jul 99
Posts: 158
Credit: 17,117,787
RAC: 19
Netherlands
Message 1958042 - Posted: 1 Oct 2018, 14:40:35 UTC - in response to Message 1958039.  

it is after 7am in California now.

On the bright side the db purge numbers are nice and low, and there are over 600K resends ready for assimilation.


I don't think these 600K are resends. Validators still seem to be working and resends are taking place (received 4 of them a little while ago ...).

Tom
ID: 1958042 · Report as offensive
Profile Unixchick Project Donor
Avatar

Send message
Joined: 5 Mar 12
Posts: 815
Credit: 2,361,516
RAC: 22
United States
Message 1958045 - Posted: 1 Oct 2018, 14:55:18 UTC - in response to Message 1958042.  
Last modified: 1 Oct 2018, 15:27:48 UTC



I don't think these 600K are resends. Validators still seem to be working and resends are taking place (received 4 of them a little while ago ...).

Tom


interesting. My guess had been that AP resends were still happening but that s@h resends were not. The files being split hadn't moved (assuming we are seeing current info), and the "Results returned and awaiting validation" had increased (and I can see that my completed WUs are being validated just fine), so I assumed that resends were getting stuck in the assimilation phase and that Vader or Georgem had decided to take a vacation.

The other option is that splitting has been happening at a VERY slow rate and getting stuck at assimilation.

Thanks for the extra info. It gives me something to ponder while I wait...

edit to add more thoughts. Since some WUs are making it to the RTS . I'm going to guess Georgem crapped out and Vader can manage a small amount of assimilation to give us some WUs, but not enough to keep the system running well.

Strapping in and getting ready to crash now, wondering if some "Scotty" will come in to work and save us at last minute.
ID: 1958045 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51478
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1958058 - Posted: 1 Oct 2018, 16:42:03 UTC

Eric looked into things, and thinks he has things working again.
We shall see.
Meowmeowmeow!
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1958058 · Report as offensive
Profile betreger Project Donor
Avatar

Send message
Joined: 29 Jun 99
Posts: 11416
Credit: 29,581,041
RAC: 66
United States
Message 1958061 - Posted: 1 Oct 2018, 16:50:45 UTC - in response to Message 1958058.  

SSP says not yet
ID: 1958061 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51478
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1958062 - Posted: 1 Oct 2018, 16:53:19 UTC - in response to Message 1958061.  

SSP says not yet

Now it does.
Meow!
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1958062 · Report as offensive
Profile betreger Project Donor
Avatar

Send message
Joined: 29 Jun 99
Posts: 11416
Credit: 29,581,041
RAC: 66
United States
Message 1958063 - Posted: 1 Oct 2018, 16:54:30 UTC - in response to Message 1958062.  

Yep
ID: 1958063 · Report as offensive
Profile Bill G Special Project $75 donor
Avatar

Send message
Joined: 1 Jun 01
Posts: 1282
Credit: 187,688,550
RAC: 182
United States
Message 1958065 - Posted: 1 Oct 2018, 16:56:17 UTC - in response to Message 1958062.  

Just got 74 new task on one computer.

SETI@home classic workunits 4,019
SETI@home classic CPU time 34,348 hours
ID: 1958065 · Report as offensive
Ian&Steve C.
Avatar

Send message
Joined: 28 Sep 99
Posts: 4267
Credit: 1,282,604,591
RAC: 6,640
United States
Message 1958097 - Posted: 2 Oct 2018, 0:01:47 UTC

would this explain why i have one computer that has run dry? i rebooted it, but it is not downloading anything new :(
Seti@Home classic workunits: 29,492 CPU time: 134,419 hours

ID: 1958097 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1958098 - Posted: 2 Oct 2018, 0:07:25 UTC - in response to Message 1958097.  

The system was just in Maintenance mode for a few minutes, but it is back now.
Check your retry times, and try a manual update.
ID: 1958098 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1958106 - Posted: 2 Oct 2018, 0:41:51 UTC - in response to Message 1958098.  

All my hosts must have hit the servers while they were in Maintenance mode. They got forced into a 50 minute backoff. Now they are reporting the work they have finished but unfortunately getting nothing in return.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1958106 · Report as offensive
Profile Unixchick Project Donor
Avatar

Send message
Joined: 5 Mar 12
Posts: 815
Credit: 2,361,516
RAC: 22
United States
Message 1958125 - Posted: 2 Oct 2018, 4:56:16 UTC
Last modified: 2 Oct 2018, 5:29:49 UTC

I'm having some weirdness in my personal "remaining time".

usually the blc22 are estimated to take 3:08:53

but I got 10 that are blc22_2bit_guppi_58340_48827_HIP4845_.... that have an estimated time of 20:28:09

seems a bit ridiculous as not even APs take that long. anyone else notice this??
fills up my cache level, so I can't get anymore WUs.

edit - nevermind - I aborted all but one of them so I wasn't short on WUs before tomorrow's outage. Not sure why they gave such a bad estimate.

2nd edit - looks like mac got a new version of s@h today.
ID: 1958125 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13855
Credit: 208,696,464
RAC: 304
Australia
Message 1958128 - Posted: 2 Oct 2018, 6:22:37 UTC - in response to Message 1958125.  

edit - nevermind - I aborted all but one of them so I wasn't short on WUs before tomorrow's outage. Not sure why they gave such a bad estimate.

2nd edit - looks like mac got a new version of s@h today.

And that was why they had a high estimate for the compute time- The new application needs to run to completion several WUs for the estimates to become more accurate, then as it processes more WUs it will refine the estimate further till it's pretty close to accurate.
Aborting them won't have helped that along.

Best bet when something like that occurs in the future- let the WUs run, and see how quickly the estimated time reduces.
Grant
Darwin NT
ID: 1958128 · Report as offensive
Profile Unixchick Project Donor
Avatar

Send message
Joined: 5 Mar 12
Posts: 815
Credit: 2,361,516
RAC: 22
United States
Message 1958130 - Posted: 2 Oct 2018, 6:33:04 UTC - in response to Message 1958128.  

edit - nevermind - I aborted all but one of them so I wasn't short on WUs before tomorrow's outage. Not sure why they gave such a bad estimate.

2nd edit - looks like mac got a new version of s@h today.


And that was why they had a high estimate for the compute time- The new application needs to run to completion several WUs for the estimates to become more accurate, then as it processes more WUs it will refine the estimate further till it's pretty close to accurate.
Aborting them won't have helped that along.

Best bet when something like that occurs in the future- let the WUs run, and see how quickly the estimated time reduces.


After I aborted most of them, I got new WUs with reasonable times (around 4 hours and not 20 hours), so all is ok. I kept one of the WUs since I was the 4th person, and I wanted to make sure it got a good result without passing it to a 5th person.
ID: 1958130 · Report as offensive
Profile Unixchick Project Donor
Avatar

Send message
Joined: 5 Mar 12
Posts: 815
Credit: 2,361,516
RAC: 22
United States
Message 1958158 - Posted: 2 Oct 2018, 14:00:57 UTC

Grant was right. I was wrong. I didn't know that over time that the times would change. Thanks for all those who continue to respond to my silly posts and teach me how seti works.

anyone else noticing that db purge doesn't seem to be working?? I have a longer than usual list of valid tasks that goes back longer than the usual 24ish hours. I think we are in for an outRage as the system has had hiccups this week.
ID: 1958158 · Report as offensive
Previous · 1 . . . 11 · 12 · 13 · 14 · 15 · 16 · 17 . . . 37 · Next

Message boards : Number crunching : Panic Mode On (113) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.