Message boards :
Number crunching :
The Server Issues / Outages Thread - Panic Mode On! (117)
Message board moderation
Previous · 1 . . . 47 · 48 · 49 · 50 · 51 · 52 · Next
Author | Message |
---|---|
JohnDK Send message Joined: 28 May 00 Posts: 1222 Credit: 451,243,443 RAC: 1,127 |
Look at the 'Application' column in BOINC Manager, advanced view. If it starts with the word 'Local:', BOINC is using Anonymous Platform. If the word 'Local:' is missing, it's running stock. The task page is not up to date due the server is some 40.000 secs behind, so what you see is the status for 40.000 secs ago. And it's just getting worse, yesterday it was some 17.000 secs behind. |
wujj123456 Send message Joined: 5 Sep 04 Posts: 40 Credit: 20,877,975 RAC: 219 |
The task page is not up to date due the server is some 40.000 secs behind, so what you see is the status for 40.000 secs ago. The task page is updated from the Replica, which as of now is 43961 seconds behind (more than 12 hours) Ah, thanks. It makes sense that these non-critical stats are from querying replica. It just means I need to check on hosts directly for now, which I am kinda already doing anyway... |
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13854 Credit: 208,696,464 RAC: 304 |
Anonymous Platform here. Interestingly, my Windows system is still getting mostly Scheduler errors (30sec to 3min wait for a response), my Linux system is getting "Project has no tasks available" responses (30-50sec response time). Grant Darwin NT |
Siran d'Vel'nahr Send message Joined: 23 May 99 Posts: 7379 Credit: 44,181,323 RAC: 238 |
Greetings, Ok, now I'm running stock again. I got cuda60 and sah WUs. This, on my main. My other Linux PC still has almost 16 hours of crunching to do before I work with it. Instead of archiving my anonymous SETI directory, I just renamed it before restarting BOINC and resetting the project. Works for me. :) Have a great day! :) Siran CAPT Siran d'Vel'nahr - L L & P _\\// Winders 11 OS? "What a piece of junk!" - L. Skywalker "Logic is the cement of our civilization with which we ascend from chaos using reason as our guide." - T'Plana-hath |
TBar Send message Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 |
I was just sent two more groups of "Lost" tasks, I think that was all of them. So it appears 'Resend Lost Tasks' is turned on by default just as it was on BETA. Hopefully it will help cut down on Database bloat. I also found you can download tasks while running Stock with 'Suspend GPU' set, that helps when the Server insists on sending you Apps that crash 5 seconds after they start. Once you have a few hundred tasks you can 'reschedule' the crashing tasks to an App that doesn't crash....such as CUDA90. |
juan BFP Send message Joined: 16 Mar 07 Posts: 9786 Credit: 572,710,851 RAC: 3,799 |
One observation, when we ask for new work it returns: Sat 21 Dec 2019 05:39:10 PM EST | SETI@home | Scheduler request failed: HTTP internal server error Maybe somebody forget to update the address of the server on the task started when the anonymous host ask for new work... |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14679 Credit: 200,643,578 RAC: 874 |
So it appears 'Resend Lost Tasks' is turned on by default just as it was on BETA.That probably accounts for the database sluggishness, all by itself - it was implicated in the November 2013 database event. That CAN'T have been deliberate, surely? Have you told Eric yet? I haven't had any reply from Eric yet, but I won't pester him - but I will pass on what we know by the beginning of tomorrow's Berkeley daylight. |
Mr. Kevvy Send message Joined: 15 May 99 Posts: 3806 Credit: 1,114,826,392 RAC: 3,319 |
One observation, when we ask for new work it returns: Sat 21 Dec 2019 05:39:10 PM EST | SETI@home | Scheduler request failed: HTTP internal server error This is, I think, further evidence that something went very wrong in the disconnection noted in News and we're actually connecting to the Beta scheduler. |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14679 Credit: 200,643,578 RAC: 874 |
One observation, when we ask for new work it returns: Sat 21 Dec 2019 05:39:10 PM EST | SETI@home | Scheduler request failed: HTTP internal server errorIt's been doing that at intervals all day - among all the other grotty responses it's capable of sending out. I don't think it's anything to do with a bad address - it just uses up all the available time or memory and then crashes. Possibly because of all the extra work and database querying it's trying to do for a 'lost tasks' check. |
Cherokee150 Send message Joined: 11 Nov 99 Posts: 192 Credit: 58,513,758 RAC: 74 |
It would seem to me that the current problems might be related to Eric's "BOINC Notice" he posted yesterday: _____________________________________________________________________________________________________________________________ SETI@home: Some server issues today... It's the Friday before a holiday week and the servers know it. The file system containing the beta project uploads directory is having problems, so beta is down until further notice. This problem may be affecting the rate at which the main project can handle results, so the validation and assimilation queues are getting large, which may affect the rate of work generation. 12/20/2019 17:10:04 _____________________________________________________________________________________________________________________________ |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14679 Credit: 200,643,578 RAC: 874 |
I'm sure they're related, but I'm not exactly sure how, yet. I thought they might have upgraded the server software in a misguided attempt to rectify the database problems. Now I'm not so sure. Recent posts have suggested that they might simply have got the wires crossed (well, not quite as simple as that ...) and set the Beta server up to process the Main project - but with the Beta settings still in place. Too soon to tell, until we set some feedback from inside the project. Our scheduler here is still running on Synergy, though - what was Beta's scheduler running on? |
TBar Send message Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 |
Have you told Eric yet? I haven't had any reply from Eric yet, but I won't pester him - but I will pass on what we know by the beginning of tomorrow's Berkeley daylight.I told him about it a couple months ago, back when I was running Apps on BETA. I dunno, maybe remind him... |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14679 Credit: 200,643,578 RAC: 874 |
I was asking about the new observation about Resend Lost Tasks being active on Main - that's nearer two hours ago than two months ago!Have you told Eric yet? I haven't had any reply from Eric yet, but I won't pester him - but I will pass on what we know by the beginning of tomorrow's Berkeley daylight.I told him about it a couple months ago, back when I was running Apps on BETA. I dunno, maybe remind him... Don't worry about it - Mr. Kevvy has passed the new information on. And I'm going to bed. |
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13854 Credit: 208,696,464 RAC: 304 |
It certainly doesn't like Anonymous Platform. On my WIndows system I exited BOINC, backed up my Seti project folder, removed app_info.xml and restarted BOINC. First couple of Scheduler requests failed, then got work. 22/12/2019 09:09:52 | SETI@home | Scheduler request failed: Failure when receiving data from the peer 22/12/2019 09:14:37 | SETI@home | Scheduler request failed: Couldn't connect to server 22/12/2019 09:15:58 | SETI@home | Scheduler request completed: got 68 new tasks 22/12/2019 09:21:25 | SETI@home | Scheduler request failed: Couldn't connect to server 22/12/2019 09:23:26 | SETI@home | Scheduler request completed: got 20 new tasks With anonymous platform, any successful Scheduler request results in "Project has no tasks available", and there were very, very, very few successful requests. Edit- and even running stock, successful Scheduler requests are in the minority. So far 9 failures, 4 successful (3 successful and getting work in a sequence of 4 requests), and one "project has no tasks available response." It is very, very broken. Grant Darwin NT |
Eric Korpela Send message Joined: 3 Apr 99 Posts: 1382 Credit: 54,506,847 RAC: 60 |
|
juan BFP Send message Joined: 16 Mar 07 Posts: 9786 Credit: 572,710,851 RAC: 3,799 |
I'm looking into the problem. Grrrrr..... |
betreger Send message Joined: 29 Jun 99 Posts: 11416 Credit: 29,581,041 RAC: 66 |
We are doing all we can do to help solve the problem. https://boinc.berkeley.edu/dev/forum_thread.php?id=8105&postid=94433 https://boinc.berkeley.edu/dev/forum_thread.php?id=8105&postid=94439 Howling at the moon and seting our hair on fire often helps. |
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13854 Credit: 208,696,464 RAC: 304 |
Well, running stock isn't helping that much. So far I've picked up a dozen CUDA50 WUs, I did get 100 SoG WUs, but the downloads errored out for some reason. And I've picked up around 300 CUDA42 WUs, which take 30min to process (instead of the 9min or less with SoG). I'll give it a while longer & see if i can get some SoG work that downloads OK, otherwise I might as well just set it for No New Tasks and wait till it's fixed. Grant Darwin NT |
Freewill Send message Joined: 19 May 99 Posts: 766 Credit: 354,398,348 RAC: 11,693 |
We are doing all we can do to help solve the problem. I'm out of hair, but I can still drink. |
betreger Send message Joined: 29 Jun 99 Posts: 11416 Credit: 29,581,041 RAC: 66 |
Juan maybe is waiting for you to arrive. |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.