Panic Mode On (115) Server Problems?

Message boards : Number crunching : Panic Mode On (115) Server Problems?

Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1982007 - Posted: 24 Feb 2019, 13:55:37 UTC - in response to Message 1981993.  

Haha Steven, It may be gone for now but not forgotten.
It will be among the 300 ghosts and will be coming back for round 2 :D


. . There were only 299 ghosts (239 now) because that annoying task actually no longer exists and there will be NO resend for it ...

. . It only appeared in my flawed client_state file, now that is gone (the hard way) ...

phew ...

Stephen

:)
ID: 1982007 · Report as offensive
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13370
Credit: 208,696,464
RAC: 304
Australia
Message 1982106 - Posted: 25 Feb 2019, 8:53:07 UTC

Looks like the splitters are taking a break again. Ready-to-send is making its way to zero once again.
Grant
Darwin NT
ID: 1982106 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14505
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1982112 - Posted: 25 Feb 2019, 10:02:37 UTC - in response to Message 1982106.  

I was getting worried that they were all stuck in a cluster-f over 05jl06ef, but they seemed to have worked themselves free again.
ID: 1982112 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1982144 - Posted: 25 Feb 2019, 15:05:22 UTC - in response to Message 1982112.  
Last modified: 25 Feb 2019, 15:33:01 UTC

I think you are right there Richard, that file has been sitting on 11/7 splitters for over an hour now.
There is a small trickle from the AP side received here, but hard to say with MB since they would be getting queued.
EDIT: It has now moved up 1 MB channel, and down to 10 splitters.
ID: 1982144 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Joined: 29 Apr 01
Posts: 13157
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1982166 - Posted: 25 Feb 2019, 16:55:06 UTC - in response to Message 1982144.  

Downloads are sticking.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1982166 · Report as offensive
Profile Unixchick Project Donor
Joined: 5 Mar 12
Posts: 813
Credit: 2,361,516
RAC: 22
United States
Message 1982179 - Posted: 25 Feb 2019, 18:03:06 UTC - in response to Message 1982112.  

I was getting worried that they were all stuck in a cluster-f over 05jl06ef, but they seemed to have worked themselves free again.


Look at the size of the file
05jl06ef 272.67 GB

It might be weird to split that one considering the size. Usually the files are around 50 GB.
ID: 1982179 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Joined: 29 Apr 01
Posts: 13157
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1982201 - Posted: 25 Feb 2019, 20:59:41 UTC
Last modified: 25 Feb 2019, 21:01:05 UTC

Getting "project is down for maintenance" messages on all hosts now. SSP does not show maintenance.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1982201 · Report as offensive
Profile Bill G Special Project $75 donor
Joined: 1 Jun 01
Posts: 1282
Credit: 187,688,550
RAC: 182
United States
Message 1982202 - Posted: 25 Feb 2019, 21:03:17 UTC - in response to Message 1982201.  

Getting "project is down for maintenance" messages on all hosts now. SSP does not show maintenance.

Same here.

SETI@home classic workunits 4,019
SETI@home classic CPU time 34,348 hours
ID: 1982202 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1982204 - Posted: 25 Feb 2019, 21:43:57 UTC

. . I only got the maintenance message once or twice but then it went to "no tasks". Getting work again now ... at least some work.

Stephen

<shrug>
ID: 1982204 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Joined: 29 Apr 01
Posts: 13157
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1982208 - Posted: 25 Feb 2019, 22:08:35 UTC - in response to Message 1982204.  
Last modified: 25 Feb 2019, 22:16:03 UTC

I got put into a 1 hour backoff while the project was down for maintenance. Came back to the computer an hour later and, as all the hosts finally reconnected after their backoff, they started replacing work in my caches.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1982208 · Report as offensive
Speedy
Volunteer tester
Joined: 26 Jun 04
Posts: 1635
Credit: 12,921,799
RAC: 89
New Zealand
Message 1982273 - Posted: 26 Feb 2019, 8:18:14 UTC - in response to Message 1982179.  

I was getting worried that they were all stuck in a cluster-f over 05jl06ef, but they seemed to have worked themselves free again.


Look at the size of the file
05jl06ef 272.67 GB

It might be weird to split that one considering the size. Usually the files are around 50 GB.

Must have been removed. I went to have a look at it but could not see it. It would have been impressive to see it munch through a file that big.
ID: 1982273 · Report as offensive
Profile Jimbocous Project Donor
Volunteer tester
Joined: 1 Apr 13
Posts: 1849
Credit: 268,616,081
RAC: 1,349
United States
Message 1982274 - Posted: 26 Feb 2019, 8:52:34 UTC - in response to Message 1982273.  

I was getting worried that they were all stuck in a cluster-f over 05jl06ef, but they seemed to have worked themselves free again.


Look at the size of the file
05jl06ef 272.67 GB

It might be weird to split that one considering the size. Usually the files are around 50 GB.

Must have been removed. I went to have a look at it but could not see it. It would have been impressive to see it munch through a file that big.

Yeah, it finally wrapped up a bit ago. I think all told it took over 24 hours to fully process for both MB and AP.
ID: 1982274 · Report as offensive
Profile Ghan-buri-Ghan Mike

Joined: 27 Dec 15
Posts: 123
Credit: 92,602,985
RAC: 172
United States
Message 1982349 - Posted: 27 Feb 2019, 0:02:03 UTC

We're back, and the infamous "project has no tasks available" is on all 17 of my machines....
ID: 1982349 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1982354 - Posted: 27 Feb 2019, 0:19:32 UTC - in response to Message 1982349.  

We're back, and the infamous "project has no tasks available" is on all 17 of my machines....


. . I wish I could get that far. I'm still trying to report work, getting the usual post-outage 'http internal error'.

. . The two slower machines with not a lot of work to report have managed to get through eventually but the faster machines are getting nowhere ... fast ...

Stephen

:(
ID: 1982354 · Report as offensive
Profile Tom M
Volunteer tester

Joined: 28 Nov 02
Posts: 5124
Credit: 276,046,078
RAC: 462
Message 1982355 - Posted: 27 Feb 2019, 0:20:41 UTC - in response to Message 1982354.  

We're back, and the infamous "project has no tasks available" is on all 17 of my machines....


. . I wish I could get that far. I'm still trying to report work, getting the usual post-outage 'http internal error'.

. . The two slower machines with not a lot of work to report have managed to get through eventually but the faster machines are getting nowhere ... fast ...

Stephen

:(


Ditto :(

Tom
A proud member of the OFA (Old Farts Association).
ID: 1982355 · Report as offensive
Sirius B Project Donor
Volunteer tester
Joined: 26 Dec 00
Posts: 24554
Credit: 3,081,182
RAC: 7
Ireland
Message 1982358 - Posted: 27 Feb 2019, 0:27:08 UTC

No tasks available for main, but did get cache for Beta topped up. :-)
ID: 1982358 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1982359 - Posted: 27 Feb 2019, 0:27:51 UTC - in response to Message 1982355.  

. . I wish I could get that far. I'm still trying to report work, getting the usual post-outage 'http internal error'.
. . The two slower machines with not a lot of work to report have managed to get through eventually but the faster machines are getting nowhere ... fast ...
Stephen
:(


Ditto :(

Tom


. . Because I have tried playing 'kick the servers' to report the work I am now in 4 hour plus backoffs .... to hell with it, I'm going to lunch ...

Stephen

:(
ID: 1982359 · Report as offensive
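
For anyone wondering why 'kick the servers' makes the wait longer: below is a rough sketch of capped exponential backoff, where every failed attempt doubles the delay before the next one. The 60 second base, the 4 hour cap and the name `backoff_seconds` are illustrative assumptions only; this is not the BOINC client's actual algorithm or its real values.

```python
def backoff_seconds(failures: int, base: float = 60.0, cap: float = 4 * 3600.0) -> float:
    """Delay before the next retry after `failures` consecutive failed attempts.

    Assumed policy: start at `base` seconds, double on each further failure,
    and never wait longer than `cap` seconds.
    """
    if failures < 1:
        return 0.0  # nothing has failed yet, so no wait
    return min(cap, base * 2 ** (failures - 1))


# Delays for the first ten consecutive failures, converted to minutes.
delays_minutes = [backoff_seconds(n) / 60 for n in range(1, 11)]
print(delays_minutes)
```

Under these assumed numbers, a handful of manual retries in a row is enough to push the delay from minutes into hours, which matches the experience described above.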
Profile Keith Myers Special Project $250 donor
Volunteer tester
Joined: 29 Apr 01
Posts: 13157
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1982361 - Posted: 27 Feb 2019, 0:37:52 UTC - in response to Message 1982359.  

If I try to report more than 100 tasks at a time, I will get the comms error and a backoff. 100 seems to be my magic number. I'm 95% successful every time.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1982361 · Report as offensive
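
A minimal sketch of the batching idea in the post above: cap each report at 100 tasks and send the backlog in several requests instead of one. This is not BOINC's own code; `batch_reports`, `MAX_PER_REQUEST` and the task names are hypothetical. If memory serves, the BOINC client offers a cap of this kind as a `max_tasks_reported` option in cc_config.xml, but treat that name as an assumption and check the client configuration documentation before relying on it.

```python
from typing import Iterator, List

MAX_PER_REQUEST = 100  # assumed cap, taken from the "magic number" in the post above


def batch_reports(tasks: List[str], limit: int = MAX_PER_REQUEST) -> Iterator[List[str]]:
    """Yield consecutive slices of `tasks`, each holding at most `limit` items."""
    if limit < 1:
        raise ValueError("limit must be at least 1")
    for start in range(0, len(tasks), limit):
        yield tasks[start:start + limit]


# Hypothetical backlog of 250 finished tasks waiting to be reported.
backlog = [f"task_{i}" for i in range(250)]
batch_sizes = [len(batch) for batch in batch_reports(backlog)]
print(batch_sizes)  # prints [100, 100, 50]
```

With a cap like this, a host that finished 250 tasks during the outage would need three requests to clear its backlog rather than one oversized one.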
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1982362 - Posted: 27 Feb 2019, 0:48:47 UTC - in response to Message 1982361.  

If I try to report more than 100 tasks at a time, I will get the comms error and a backoff. 100 seems to be my magic number. I'm 95% successful every time.


. . which file is that setting in again? I am sure mine is still over 100 (might be 128) but until now I could not see anyone stating a number which was consistently working ...

. . I might try one of my favourite numbers ... 99 (think Get Smart)

Stephen

<shrug>
ID: 1982362 · Report as offensive
Ian&Steve C.
Joined: 28 Sep 99
Posts: 4262
Credit: 1,282,604,591
RAC: 6,640
United States
Message 1982364 - Posted: 27 Feb 2019, 0:55:12 UTC - in response to Message 1982354.  

Same. This happens to me every time. The slow Windows machines pick up right away and the fast Linux machines take a long time to start comms again. None of the usual tricks (NNT, etc.) are working.
Seti@Home classic workunits: 29,492 CPU time: 134,419 hours

ID: 1982364 · Report as offensive

 
©2022 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.