The Server Issues / Outages Thread - Panic Mode On! (117)

Grant (SSSF)
Volunteer tester
Joined: 19 Aug 99
Posts: 13368
Credit: 208,696,464
RAC: 304
Australia
Message 2023940 - Posted: 20 Dec 2019, 23:00:26 UTC
Last modified: 20 Dec 2019, 23:02:57 UTC

Unable to contact the Scheduler again:
21/12/2019 08:29:56 | SETI@home | Scheduler request failed: HTTP internal server error
and: Timeout was reached.

The large number of shorties going through at the moment won't be helping matters either.
Grant
Darwin NT
ID: 2023940 · Report as offensive
Tom M
Volunteer tester
Joined: 28 Nov 02
Posts: 5124
Credit: 276,046,078
RAC: 462
Message 2023943 - Posted: 20 Dec 2019, 23:12:05 UTC

I just switched one of my Weekend Warriors to "all cpu" with a dinky little gpu on there that has no hope of processing most anything.

What do I get?

"Project down" try again in an hour.

Reminds me of "all dressed up and no place to go".

Tom
A proud member of the OFA (Old Farts Association).
ID: 2023943 · Report as offensive
betreger Project Donor
Joined: 29 Jun 99
Posts: 11145
Credit: 29,581,041
RAC: 66
United States
Message 2023950 - Posted: 20 Dec 2019, 23:33:52 UTC

Well, it won't let me report completed tasks. Sometimes it says the project may be down, and sometimes it says it can't connect to the server.
ID: 2023950 · Report as offensive
juan BFP · Crowdfunding Project Donor · Special Project $75 donor · Special Project $250 donor
Volunteer tester
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 2023963 - Posted: 20 Dec 2019, 23:54:30 UTC - in response to Message 2023950.  

Well, it won't let me report completed tasks. Sometimes it says the project may be down, and sometimes it says it can't connect to the server.

I offer to make the sacrifice... again.
ID: 2023963 · Report as offensive
wujj123456
Joined: 5 Sep 04
Posts: 40
Credit: 20,877,975
RAC: 219
China
Message 2023968 - Posted: 21 Dec 2019, 0:53:03 UTC - in response to Message 2023939.  

I see that they brought the missing validators back up again after they were down. Don't see any improvement on validation tasks outstanding. They really need to whittle that down to something manageable. I haven't been able to pull up any task web page on any of my hosts since the outage. Don't know if the hosts are doing OK or not.

Hmm, I couldn't get tasks either, but I have no problem visiting my host task page like this one: https://setiathome.berkeley.edu/results.php?hostid=8846357. I don't have as many tasks as you guys do though.
ID: 2023968 · Report as offensive
Grant (SSSF)
Volunteer tester
Joined: 19 Aug 99
Posts: 13368
Credit: 208,696,464
RAC: 304
Australia
Message 2023970 - Posted: 21 Dec 2019, 1:18:00 UTC - in response to Message 2023940.  
Last modified: 21 Dec 2019, 1:18:26 UTC

The large number of shorties going through at the moment won't be helping matters either.
Results received in last hour: 186,386. Much more than 140k often causes issues, but that many on top of the existing problems? No wonder the system has choked up.
At least the return rate will drop off with people unable to contact the Scheduler, then of course go through the roof again when they can, but hopefully by then some of the other backlogs will have cleared.
Grant
Darwin NT
ID: 2023970 · Report as offensive
Speedy
Volunteer tester
Joined: 26 Jun 04
Posts: 1635
Credit: 12,921,799
RAC: 89
New Zealand
Message 2023976 - Posted: 21 Dec 2019, 1:42:09 UTC

If you want to help this situation the best thing to do would be to have a look through your task list and see if you have any ending with _2 or higher and return these so that "work waiting to be administered" can drop. I am aware this is not possible for everybody. Just my 2 cents.
ID: 2023976 · Report as offensive
Jimbocous Project Donor
Volunteer tester
Joined: 1 Apr 13
Posts: 1849
Credit: 268,616,081
RAC: 1,349
United States
Message 2023983 - Posted: 21 Dec 2019, 2:16:33 UTC - in response to Message 2023976.  

If you want to help this situation the best thing to do would be to have a look through your task list and see if you have any ending with _2 or higher and return these so that "work waiting to be administered" can drop. I am aware this is not possible for everybody. Just my 2 cents.
No clue here how I could select tasks to prioritize reporting...
ID: 2023983 · Report as offensive
Speedy
Volunteer tester
Joined: 26 Jun 04
Posts: 1635
Credit: 12,921,799
RAC: 89
New Zealand
Message 2023986 - Posted: 21 Dec 2019, 2:23:56 UTC - in response to Message 2023983.  

If you want to help this situation the best thing to do would be to have a look through your task list and see if you have any ending with _2 or higher and return these so that "work waiting to be administered" can drop. I am aware this is not possible for everybody. Just my 2 cents.
No clue here how I could select tasks to prioritize reporting...

On Windows, select the tasks and you will see a Suspend option on the left-hand side (the Suspend/Resume button is located under the graphics button). Suspend all tasks, then click each task ending in _2 or higher and click Resume. I imagine it would be a similar process using the graphical interface on Linux. Of course, after they are all processed you will need to resume the other work units.
ID: 2023986 · Report as offensive
Wiggo
Joined: 24 Jan 00
Posts: 27447
Credit: 261,360,520
RAC: 489
Australia
Message 2023990 - Posted: 21 Dec 2019, 2:32:02 UTC - in response to Message 2023976.  

If you want to help this situation the best thing to do would be to have a look through your task list and see if you have any ending with _2 or higher and return these so that "work waiting to be administered" can drop. I am aware this is not possible for everybody. Just my 2 cents.
Now that's easier said than done for some of us, Speedy. Seeing as I go through my GPU cache in 9-10 hrs anyway, the time required for me to go through all those tasks just isn't worth it. Even just doing the CPU tasks I consider a waste of time when they'll be returned in 2-3 days anyway ;-)

Though I guess that'll work OK for those with slower rigs using a minimal cache, or for those like yourself who share their time around other projects as well, rather than running straight SETI only with the 3/0.1 day setting that I'm currently using. Once I get my personal experiments done here I will initiate my "back up only projects" again, but not until then.

Cheers.
ID: 2023990 · Report as offensive
Stephen "Heretic" · Crowdfunding Project Donor · Special Project $75 donor · Special Project $250 donor
Volunteer tester
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 2023992 - Posted: 21 Dec 2019, 2:34:37 UTC - in response to Message 2023983.  

If you want to help this situation the best thing to do would be to have a look through your task list and see if you have any ending with _2 or higher and return these so that "work waiting to be administered" can drop. I am aware this is not possible for everybody. Just my 2 cents.
No clue here how I could select tasks to prioritize reporting...


. . Select all tasks with _0 or _1 at the end and suspend them, then when the _2 and above WUs are crunched unfreeze the others and let it report the completed units. If you want to go to that trouble. It should help clear some older tasks hanging around though.
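
For anyone who would rather script the selection than eyeball a long task list, here is a minimal sketch of the filtering step only. It assumes task names end in an underscore followed by a number, as with the _0, _1 and _2 suffixes discussed above. The helper name `resend_tasks` and the sample task names are made up for illustration and are not part of BOINC.

```shell
#!/bin/sh
# resend_tasks: hypothetical helper, not part of BOINC.
# Reads task names on stdin (one per line) and prints only those whose
# final _N suffix is 2 or higher, i.e. the resends to crunch first.
resend_tasks() {
    awk -F_ '$NF ~ /^[0-9]+$/ && $NF + 0 >= 2'
}

# Demo with made-up task names; only the _2 and _3 names are printed.
printf '%s\n' example_wu_a_0 example_wu_b_1 example_wu_c_2 example_wu_d_3 |
    resend_tasks
```

The names it prints are the ones to leave running; everything else is what gets suspended. Feeding it real task names from the client is left out on purpose, since the exact format of the client's task listing is not shown anywhere in this thread.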

Stephen

. .
ID: 2023992 · Report as offensive
Speedy
Volunteer tester
Joined: 26 Jun 04
Posts: 1635
Credit: 12,921,799
RAC: 89
New Zealand
Message 2023994 - Posted: 21 Dec 2019, 2:37:43 UTC - in response to Message 2023990.  

Thanks Wiggo, I completely understand. All the best with your goals.
ID: 2023994 · Report as offensive
Grant (SSSF)
Volunteer tester
Joined: 19 Aug 99
Posts: 13368
Credit: 208,696,464
RAC: 304
Australia
Message 2023998 - Posted: 21 Dec 2019, 2:56:45 UTC

Linux system out of GPU work.
Occasionally contacting the Scheduler, but "Project has no tasks available" is the response.
Grant
Darwin NT
ID: 2023998 · Report as offensive
Stephen "Heretic" · Crowdfunding Project Donor · Special Project $75 donor · Special Project $250 donor
Volunteer tester
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 2024001 - Posted: 21 Dec 2019, 3:00:49 UTC - in response to Message 2023998.  

Linux system out of GPU work.
Occasionally contacting the Scheduler, but "Project has no tasks available" is the response.


. . I notice you have not implemented the spoofed client ...

Stephen

? ?
ID: 2024001 · Report as offensive
Grant (SSSF)
Volunteer tester
Joined: 19 Aug 99
Posts: 13368
Credit: 208,696,464
RAC: 304
Australia
Message 2024002 - Posted: 21 Dec 2019, 3:03:46 UTC - in response to Message 2024001.  

I notice you have not implemented the spoofed client
Gave it a go, couldn't get it to run, went back to what worked. Probably needed to set the permissions for the new files.
Grant
Darwin NT
ID: 2024002 · Report as offensive
Stephen "Heretic" · Crowdfunding Project Donor · Special Project $75 donor · Special Project $250 donor
Volunteer tester
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 2024006 - Posted: 21 Dec 2019, 3:11:45 UTC - in response to Message 2024002.  

I notice you have not implemented the spoofed client
Gave it a go, couldn't get it to run, went back to what worked. Probably needed to set the permissions for the new files.


. . Maybe it was a permissions thing but maybe it was the dependencies. The reason that I only have it running on one of my major systems is that it was the only one already running libcurl4.0 (I think that is the dependency missing on the other 3 rigs).

. . If you go to the directory in which the spoof version files are stored and type sudo ldd and the filename (boinc, boinccmd, boincmgr) it will list all the dependencies and show any that are missing. There was only the same one missing on all of the other three rigs so I hesitated updating that file in case I needed to roll back. With the new limits the 2 slower rigs are AOK now anyway but I really should get it working on the highest performer to ensure its continued operation.
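
The ldd check described above can be wrapped in a small loop, as a rough sketch. The function name `missing_libs` is invented for this example. It assumes a Linux system where ldd marks unresolved libraries with "not found". ldd normally runs fine without sudo, since it only needs to read the file.

```shell
#!/bin/sh
# missing_libs: hypothetical helper, not part of BOINC.
# For each file given, print "<file>: <library>" for every shared library
# that ldd reports as "not found". Prints nothing if all are resolved.
missing_libs() {
    for bin in "$@"; do
        ldd "$bin" 2>/dev/null |
            awk -v b="$bin" '/not found/ { print b ": " $1 }'
    done
}

# Run from the directory holding the client files named in the post:
# missing_libs ./boinc ./boinccmd ./boincmgr
```

A line such as `./boinc: libcurl.so.4` would point at the libcurl dependency mentioned above. As usual with ldd, only run it on binaries you trust.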

Stephen

. .
ID: 2024006 · Report as offensive
Jimbocous Project Donor
Volunteer tester
Joined: 1 Apr 13
Posts: 1849
Credit: 268,616,081
RAC: 1,349
United States
Message 2024008 - Posted: 21 Dec 2019, 3:17:32 UTC - in response to Message 2024006.  
Last modified: 21 Dec 2019, 3:18:08 UTC

. . If you go to the directory in which the spoof version files are stored and type sudo ldd and the filename (boinc, boinccmd, boincmgr) it will list all the dependencies and show any that are missing. There was only the same one missing on all of the other three rigs so I hesitated updating that file in case I needed to roll back. With the new limits the 2 slower rigs are AOK now anyway but I really should get it working on the highest performer to ensure its continued operation.

Stephen

. .
Probably so. I did need a sudo apt install libcurl4 on mine.
ID: 2024008 · Report as offensive
halfempty
Joined: 2 Jun 99
Posts: 97
Credit: 35,236,901
RAC: 114
United States
Message 2024035 - Posted: 21 Dec 2019, 7:43:23 UTC

First I was getting these:

Scheduler request completed: got 0 new tasks
Project has no tasks available


Now it's this:

Scheduler request failed: Couldn't connect to server
Project communication failed: attempting access to reference site
Internet access OK - project servers may be temporarily down.


But the rest of the site is working fine.

Is there something I can do? (Other than PANIC)
ID: 2024035 · Report as offensive
Tom M
Volunteer tester
Joined: 28 Nov 02
Posts: 5124
Credit: 276,046,078
RAC: 462
Message 2024043 - Posted: 21 Dec 2019, 9:21:18 UTC - in response to Message 2024035.  

First I was getting these:

Scheduler request completed: got 0 new tasks
Project has no tasks available


Now it's this:

Scheduler request failed: Couldn't connect to server
Project communication failed: attempting access to reference site
Internet access OK - project servers may be temporarily down.


But the rest of the site is working fine.

Is there something I can do? (Other than PANIC)

A proud member of the OFA (Old Farts Association).
ID: 2024043 · Report as offensive
Grant (SSSF)
Volunteer tester
Joined: 19 Aug 99
Posts: 13368
Credit: 208,696,464
RAC: 304
Australia
Message 2024047 - Posted: 21 Dec 2019, 9:45:40 UTC

Not too sure about the server status page numbers. It shows a return rate of 144k, but it's been over 4 hours since either of my systems was able to contact the Scheduler and get a response that wasn't one type of error or another.
Grant
Darwin NT
ID: 2024047 · Report as offensive


 