Message boards :
Number crunching :
The Server Issues / Outages Thread - Panic Mode On! (119)
Message board moderation
Previous · 1 . . . 46 · 47 · 48 · 49 · 50 · 51 · 52 . . . 107 · Next
Author | Message |
---|---|
AllgoodGuy Send message Joined: 29 May 01 Posts: 293 Credit: 16,348,499 RAC: 266 |
I realize it is nearly an hour old, but: Workunits waiting for db purging 0 67,703 737,741 58m Results waiting for db purging 0 140,726 1,576,766 58m I'm I wrong in thinking that this should be low lying fruit to help clean up the system? What exactly has to happen to trigger this cleanup. |
AllgoodGuy Send message Joined: 29 May 01 Posts: 293 Credit: 16,348,499 RAC: 266 |
Nevermind, that was a dumb question. Still waiting on wingmen. |
juan BFP Send message Joined: 16 Mar 07 Posts: 9786 Credit: 572,710,851 RAC: 3,799 |
This will going to be another long sunday.... due the lockdown every day is sunday. And S@H is not helping us. 29-Mar-2020 07:06:12 [SETI@home] Sending scheduler request: To fetch work. 29-Mar-2020 07:06:12 [SETI@home] Reporting 57 completed tasks 29-Mar-2020 07:06:12 [SETI@home] Requesting new tasks for CPU and NVIDIA GPU 29-Mar-2020 07:06:36 [SETI@home] Started upload of 06ap10ab.29609.7429.6.33.58_0_r1137296510_0 29-Mar-2020 07:06:36 [SETI@home] Scheduler request completed: got 0 new tasks 29-Mar-2020 07:06:36 [SETI@home] [sched_op] Server version 709 29-Mar-2020 07:06:36 [SETI@home] Project has no tasks available 29-Mar-2020 07:06:36 [SETI@home] Project requested delay of 303 seconds Draining 57 Wu on each scheduled call and not getting anything will deplete the cache very soon. Since there are 14 Arecibo tapes and the pfb splitters are running (at least 6 of them) I just wondering where all those arecibo new splitted WU are going? |
Link Send message Joined: 18 Sep 03 Posts: 834 Credit: 1,807,369 RAC: 0 |
According to the SSP they are not spliting at all or not very much at least. Guess this has something to do with the splitter_throttle_sah process. This is a good thing, the replica is chatching up constantly and the other processes should follow too. They should have slowed down the splitters long time ago. |
AllgoodGuy Send message Joined: 29 May 01 Posts: 293 Credit: 16,348,499 RAC: 266 |
29-Mar-2020 04:12:12 [SETI@home] Scheduler request completed: got 0 new tasks 29-Mar-2020 04:17:21 [SETI@home] Scheduler request completed: got 1 new tasks 29-Mar-2020 04:22:28 [SETI@home] Scheduler request completed: got 1 new tasks 29-Mar-2020 04:32:46 [SETI@home] Scheduler request completed: got 1 new tasks 29-Mar-2020 04:37:11 [SETI@home] Scheduler request completed: got 0 new tasks 29-Mar-2020 04:42:17 [SETI@home] Scheduler request completed: got 0 new tasks 29-Mar-2020 04:47:29 [SETI@home] Scheduler request completed: got 0 new tasks 29-Mar-2020 04:52:36 [SETI@home] Scheduler request completed: got 0 new tasks 29-Mar-2020 04:57:43 [SETI@home] Scheduler request completed: got 3 new tasks 29-Mar-2020 05:02:50 [SETI@home] Scheduler request completed: got 0 new tasks 29-Mar-2020 05:15:04 [SETI@home] Scheduler request completed: got 0 new tasks 29-Mar-2020 05:20:10 [SETI@home] Scheduler request completed: got 2 new tasks 29-Mar-2020 05:36:26 [SETI@home] Scheduler request completed: got 0 new tasks 29-Mar-2020 05:41:34 [SETI@home] Scheduler request completed: got 0 new tasks I'm getting a trickle. |
juan BFP Send message Joined: 16 Mar 07 Posts: 9786 Credit: 572,710,851 RAC: 3,799 |
Probably what you get are resends. Look at task name: 19mr10af.13048.12751.14.41.3_2 on your history file. if there is a _2 , _3 .... etc. on it they are resends. Not new splitted WU |
Richard Haselgrove Send message Joined: 4 Jul 99 Posts: 14667 Credit: 200,643,578 RAC: 874 |
For me, it's the same as yesterday - a very short burst of new work, then nothing except the occasional resend until the next burst. These were all the same machine: 29/03/2020 12:58:44 | SETI@home | Scheduler request completed: got 4 new tasks 29/03/2020 13:03:54 | SETI@home | Scheduler request completed: got 8 new tasks 29/03/2020 13:09:04 | SETI@home | Scheduler request completed: got 6 new tasksOnly one of those was a resend. Nothing but big fat zeroes before and after. |
Siran d'Vel'nahr Send message Joined: 23 May 99 Posts: 7379 Credit: 44,181,323 RAC: 238 |
Greetings, This website is running at a sluggish snails pace. Stats haven't updated, at least mine, since yesterday March 28th, midday or earlier. G o i n g d o w n f a s t e r a n d f a s t e r . . . . . [edit] And my only hosts that are still working are my Pis and laptop. My main and my other Linux PC are idle. Perhaps I should just shut down the other Linux PC. Yeah, sounds like a good idea. [/edit] Have a great day! :) Siran CAPT Siran d'Vel'nahr - L L & P _\\// Winders 11 OS? "What a piece of junk!" - L. Skywalker "Logic is the cement of our civilization with which we ascend from chaos using reason as our guide." - T'Plana-hath |
Ville Saari Send message Joined: 30 Nov 00 Posts: 1158 Credit: 49,177,052 RAC: 82,530 |
According to the SSP they are not spliting at all or not very much at least. Guess this has something to do with the splitter_throttle_sah process. This is a good thing, the replica is chatching up constantly and the other processes should follow too. They should have slowed down the splitters long time ago.Replica is catching about 0.41 seconds per second. If it keeps doing it, It'll take almost two weeks for it to catch up. All the other numbers still look very bad. The result count in the database is still going up, the assimilation queue stays extremely high without going either way and validation that had been running without problems for a long time despite all the other problems is now falling behind fast. And all this is happening when the return rate is at half the normal value due to most of the big crunchers having run out of tasks to crunch. |
AllgoodGuy Send message Joined: 29 May 01 Posts: 293 Credit: 16,348,499 RAC: 266 |
[quote] Actually no resends came through after I purged them last night. Edit: I have two left one one system that I marked last night, and they're about 30% complete, then I have no resends. |
juan BFP Send message Joined: 16 Mar 07 Posts: 9786 Credit: 572,710,851 RAC: 3,799 |
Actually no resends came through after I purged them last night. All the ones i received (very few BTW) where resends. Who i could know for sure? Because my host is programmed to start the resend AFAP so they are DL and returned in the following scheduler call. That makes easy to track. |
Freewill Send message Joined: 19 May 99 Posts: 766 Credit: 354,398,348 RAC: 11,693 |
|
Link Send message Joined: 18 Sep 03 Posts: 834 Credit: 1,807,369 RAC: 0 |
Replica is catching about 0.41 seconds per second. If it keeps doing it, It'll take almost two weeks for it to catch up. Still better than falling behind all the time. And all this is happening when the return rate is at half the normal value due to most of the big crunchers having run out of tasks to crunch. Well, like I said, they should have slowed down the splitters long time ago. They know how many results the database can contain before swapping to disk starts and everything becomes slow. Don't get why after running this project for over 20 years they still let such things happen again and again. |
AllgoodGuy Send message Joined: 29 May 01 Posts: 293 Credit: 16,348,499 RAC: 266 |
I haven't received any tasks in 90 minutes now. I was lucky to be in the splitters as they were splitting a lot of files, and one AstroPulse file. I haven't received any GPU tasks since then. Got about 8-10 AP GPU tasks, and 6-7 AP CPU tasks among all three hosts. About the same number of SETI GPU tasks. Was nice while it lasted. |
TBar Send message Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 |
It looks like a repeat of yesterday. Machines out of work, not receiving any new work, most times on the SSP over 4 Hours old, Results received in last hour down to 70k. I guess it's time to go out and wander around the yard again... |
ML1 Send message Joined: 25 Nov 01 Posts: 20850 Credit: 7,508,002 RAC: 20 |
Windows 10 Outperforming Linux On A ~$5000 LaptopUsing a brand new top end laptop is a bad way to compare the performances of different operating systems. What you really end up comparing is how fast the support for the latest proprietary hardware quirks gets added to different operating systems. Laptop power saving features are a notoriously fast moving target... Indeed that is the suspicion: There is some new/custom ACPI/'power-saving' that has kept that particular laptop in power-saving mode throughout the Linux tests. Regardless, pretty good all round for the results seen! IT is what we make it... Martin See new freedom: Mageia Linux Take a look for yourself: Linux Format The Future is what We all make IT (GPLv3) |
Ville Saari Send message Joined: 30 Nov 00 Posts: 1158 Credit: 49,177,052 RAC: 82,530 |
Looks like the validators have stopped validating. SSP hasn't updated those values for hours but my RAC is dropping fast. |
Ville Saari Send message Joined: 30 Nov 00 Posts: 1158 Credit: 49,177,052 RAC: 82,530 |
And now my stats have stopped updating at all. Credits and RAC displayed by the manager don't change when the client does scheduler requests. The numbers have stayed the same for almost an hour. Both numbers staying unchanged is a clear indication that the data doesn't update. If credits stayed the same, then RAC should drop fast and if RAC stayed the same, then credits should grow at a steady rate. |
juan BFP Send message Joined: 16 Mar 07 Posts: 9786 Credit: 572,710,851 RAC: 3,799 |
The end is closing. |
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13824 Credit: 208,696,464 RAC: 304 |
Getting very slow Scheduler responses, and getting errors on some responses. Grant Darwin NT |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.