Panic Mode On (109) Server Problems?

Message boards : Number crunching : Panic Mode On (109) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 . . . 38 · Next

AuthorMessage
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 11991
Credit: 118,702,027
RAC: 40,536
United Kingdom
Message 1906831 - Posted: 13 Dec 2017, 20:12:43 UTC

Well, something is working right - all the 'ready to send' have gone. Splitters have started to ramp up - I wonder how far they'll get.
ID: 1906831 · Report as offensive
JohnDK Crowdfunding Project Donor*Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 28 May 00
Posts: 1027
Credit: 154,108,233
RAC: 195,141
Denmark
Message 1906832 - Posted: 13 Dec 2017, 20:12:54 UTC
Last modified: 13 Dec 2017, 20:13:14 UTC

Well DLs are OK now, all one have to do is write about it in here.
ID: 1906832 · Report as offensive
Profile Advent42
Avatar

Send message
Joined: 23 Mar 17
Posts: 175
Credit: 3,990,912
RAC: 12,510
Ireland
Message 1906839 - Posted: 13 Dec 2017, 20:56:28 UTC - in response to Message 1906832.  

So....there is the glimmers of life after all...:-)
ID: 1906839 · Report as offensive
Stephen "Heretic" Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 3363
Credit: 69,788,343
RAC: 90,254
Australia
Message 1906842 - Posted: 13 Dec 2017, 21:14:23 UTC - in response to Message 1906839.  

So....there is the glimmers of life after all...:-)


. . Getting work now but 90% GBT ...

Stephen

:(
ID: 1906842 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 4486
Credit: 279,585,984
RAC: 626,703
United States
Message 1906846 - Posted: 13 Dec 2017, 21:22:00 UTC

Wish I had gotten work when there was some available. Nothing could be retrieved because of stalled downloads. Finally cleared those through the 3 hour backoffs and retries. Hope the splitters can get going again.
Seti@Home classic workunits:20,676 CPU time:74,226 hours
ID: 1906846 · Report as offensive
rob smith Special Project $250 donor
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 16155
Credit: 312,827,029
RAC: 255,996
United Kingdom
Message 1906956 - Posted: 14 Dec 2017, 6:18:01 UTC

...and the SSP is now hiding in a corner with the lights turned off :-(
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1906956 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 9900
Credit: 128,541,440
RAC: 81,381
Australia
Message 1906958 - Posted: 14 Dec 2017, 6:29:29 UTC
Last modified: 14 Dec 2017, 6:41:42 UTC

Looks like it took about 7 hours, but the splitters have finally come good. Managing to pump out plenty of work as required. The assimilation backlog is clearing.
Unfortunately the WU awaiting deletion backlog continues to grow (it was falling while the assimilator backlog was growing), and the replica continues to fall further & further behind.


EDIT- and the forums are very random as to whether they will load, or take a minute or 2 to do it... (same for looking at accounts and tasks/systems).
Grant
Darwin NT
ID: 1906958 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 9900
Credit: 128,541,440
RAC: 81,381
Australia
Message 1906968 - Posted: 14 Dec 2017, 7:06:49 UTC
Last modified: 14 Dec 2017, 7:18:55 UTC

Anyone had successful contact with the Scheduler in the last 25min?
14/12/2017 16:19:35 | SETI@home | Scheduler request failed: Couldn't connect to server
14/12/2017 16:27:54 | SETI@home | Scheduler request failed: Couldn't connect to server
14/12/2017 16:29:53 | SETI@home | Scheduler request failed: Couldn't connect to server
14/12/2017 16:35:17 | SETI@home | Scheduler request failed: HTTP internal server error
14/12/2017 16:45:49 | SETI@home | Scheduler request failed: HTTP service unavailable
Grant
Darwin NT
ID: 1906968 · Report as offensive
Profile David@home
Volunteer tester
Avatar

Send message
Joined: 16 Jan 03
Posts: 738
Credit: 4,142,924
RAC: 960
United Kingdom
Message 1906972 - Posted: 14 Dec 2017, 7:21:31 UTC - in response to Message 1906968.  

Anyone had successful contact with the Scheduler in the last 25min?


No, same hear. Communication deferred for 20 minutes. Hopefully not a sign of database issues.

Forum also very slow to respond.
ID: 1906972 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 11991
Credit: 118,702,027
RAC: 40,536
United Kingdom
Message 1906978 - Posted: 14 Dec 2017, 7:44:43 UTC - in response to Message 1906968.  

Anyone had successful contact with the Scheduler in the last 25min?
Yup.

14/12/2017 07:40:35 | SETI@home | Sending scheduler request: To fetch work.
14/12/2017 07:40:35 | SETI@home | Reporting 1 completed tasks
14/12/2017 07:40:35 | SETI@home | Requesting new tasks for NVIDIA GPU
14/12/2017 07:40:39 | SETI@home | Scheduler request completed: got 77 new tasks
One of my other machines had been failing for a little while, but got through just as I was starting my morning checks.
ID: 1906978 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 4486
Credit: 279,585,984
RAC: 626,703
United States
Message 1906980 - Posted: 14 Dec 2017, 7:48:38 UTC

Just looked at my machines. They had a few instances of no communication a half hour ago. All machines were able to get work in the last few minutes and are fully cached. Don't like the fact the SSP has gone missing again. I had hoped they would fix the ever growing task load on the deleters during the extended outage. Things are NOT running normally.
Seti@Home classic workunits:20,676 CPU time:74,226 hours
ID: 1906980 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 9900
Credit: 128,541,440
RAC: 81,381
Australia
Message 1906981 - Posted: 14 Dec 2017, 7:50:27 UTC - in response to Message 1906978.  

Anyone had successful contact with the Scheduler in the last 25min?
Yup.

14/12/2017 07:40:35 | SETI@home | Sending scheduler request: To fetch work.
14/12/2017 07:40:35 | SETI@home | Reporting 1 completed tasks
14/12/2017 07:40:35 | SETI@home | Requesting new tasks for NVIDIA GPU
14/12/2017 07:40:39 | SETI@home | Scheduler request completed: got 77 new tasks
One of my other machines had been failing for a little while, but got through just as I was starting my morning checks.

Next request, it managed to find signs of life.
14/12/2017 17:01:19 | SETI@home | Scheduler request completed: got 26 new tasks.

Hopefully it'll hang in there till morning when Eric can have another look at what's going on.
Grant
Darwin NT
ID: 1906981 · Report as offensive
rob smith Special Project $250 donor
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 16155
Credit: 312,827,029
RAC: 255,996
United Kingdom
Message 1906988 - Posted: 14 Dec 2017, 8:14:04 UTC

Add to the SSP being broken. the forum is slow, and a few random profile pages are "broken".
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1906988 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 11991
Credit: 118,702,027
RAC: 40,536
United Kingdom
Message 1906992 - Posted: 14 Dec 2017, 8:33:53 UTC - in response to Message 1906988.  

And the fundraiser icons are still coming and going like yo-yos.
ID: 1906992 · Report as offensive
Tutankhamon
Volunteer tester
Avatar

Send message
Joined: 1 Nov 08
Posts: 7114
Credit: 44,225,527
RAC: 3,737
Sweden
Message 1907015 - Posted: 14 Dec 2017, 10:36:25 UTC

Oh well, I'm at work, and my computer at home is shut down.
So, whatever happens with the project for the coming 6 hours, doesn't really matter to me :-)
Up, down, whatever.....
ID: 1907015 · Report as offensive
Profile Brent Norman Special Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2132
Credit: 204,410,236
RAC: 495,947
Canada
Message 1907019 - Posted: 14 Dec 2017, 11:31:25 UTC

The haveland graphs (and my RAC) look more like a blueprint design for a new LEGO build than a stable system.
ID: 1907019 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 4045
Credit: 233,604,048
RAC: 200,703
United States
Message 1907025 - Posted: 14 Dec 2017, 13:24:37 UTC
Last modified: 14 Dec 2017, 13:53:35 UTC

Looks to be a number of problems from Pages just being Slow to pages not displaying.
Missing;
https://setiathome.berkeley.edu/show_server_status.php
https://setiathome.berkeley.edu/top_hosts.php?sort_by=expavg_credit&offset=20
ID: 1907025 · Report as offensive
rob smith Special Project $250 donor
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 16155
Credit: 312,827,029
RAC: 255,996
United Kingdom
Message 1907030 - Posted: 14 Dec 2017, 14:46:27 UTC

It's somewhat random which pages vanish, as some of them re-appear. However it does appear to be getting more widespread and persistent. I think a common thread is that they are all data-driven pages, so that might be a clue (or it might be a read herring)
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1907030 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 11991
Credit: 118,702,027
RAC: 40,536
United Kingdom
Message 1907034 - Posted: 14 Dec 2017, 15:06:48 UTC - in response to Message 1907030.  

But on the other hand, I had problems a few minutes ago refreshing these boards, with the browser pausing at 'establishing secure connection'. Surely that would happen a long way down into the web server hierarchy, long before it got to binding data into a driven web page?
ID: 1907034 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 4486
Credit: 279,585,984
RAC: 626,703
United States
Message 1907053 - Posted: 14 Dec 2017, 21:00:25 UTC

Wish we had some news about the difficulties the project is having. The SSP numbers are still whack, especially the db purging. I haven't been able to get any work on the Linux cruncher since the project came back this morning. I can't view any host because the request times out. I wish they would tell us what the issue is and what they need to do and then just take the project down for however long it takes to fix things correctly.
Seti@Home classic workunits:20,676 CPU time:74,226 hours
ID: 1907053 · Report as offensive
Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 . . . 38 · Next

Message boards : Number crunching : Panic Mode On (109) Server Problems?


 
©2018 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.