Panic Mode On (105) Server Problems?
Author | Message |
---|---|
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
LOL, I'm sympathizing. I learned a few years back that BOINC simply doesn't have enough fine-grained tools to handle projects like Einstein. I have tried all tricks mentioned. Nothing worked. My simple solution is NNT on Einstein until I'm low on work. Then allow tasks for two download cycles, which takes all of two minutes, and shut her off again. That gets me around 2-3 days of work before I have to run the process again. Oh well..... everyone that has an abundance of Einstein work is going to see a huge bump in their overall BOINC RAC. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
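The top-up routine described above (NNT on, then allow tasks for two download cycles, then NNT again) could be scripted roughly like this. It is only a sketch under assumptions: it uses the `boinccmd --project <URL> <op>` form with the `allowmorework`, `update` and `nomorework` operations, the Einstein@Home URL is assumed to match what the client shows for the attached project, and the helper names `project_cmd`, `top_up_plan` and `run` are made up for illustration.

```python
import subprocess

# Assumption: the project URL exactly as the BOINC client lists it for Einstein@Home.
EINSTEIN_URL = "https://einsteinathome.org/"

# Project operations used here; boinccmd accepts others that this sketch ignores.
ALLOWED_OPS = {"nomorework", "allowmorework", "update"}


def project_cmd(op, url=EINSTEIN_URL):
    """Build the boinccmd argument list for one project operation."""
    if op not in ALLOWED_OPS:
        raise ValueError(f"unsupported operation: {op}")
    return ["boinccmd", "--project", url, op]


def top_up_plan(cycles=2):
    """Allow new work, request it `cycles` times, then set NNT again."""
    plan = [project_cmd("allowmorework")]
    plan += [project_cmd("update") for _ in range(cycles)]
    plan.append(project_cmd("nomorework"))
    return plan


def run(plan, dry_run=True):
    """Print the commands (dry run) or execute them one after another."""
    for cmd in plan:
        if dry_run:
            print(" ".join(cmd))
        else:
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    run(top_up_plan())  # dry run: only prints the four commands
```

In real use the update requests would need a pause between them, since the client and the project both enforce a delay between scheduler contacts; the sketch leaves that timing out.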
Stephen "Heretic" Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628 |
. . Did you set the E@H resource priority to 0%? That advice proved successful for me. . . And I thought I had screwed up by getting about 90 on one machine and 30 to 50 on another. 5600 is a hell of a lot of tasks .... Stephen ? |
Zalster Send message Joined: 27 May 99 Posts: 5517 Credit: 528,817,460 RAC: 242 |
LOL, I'm sympathizing. I learned a few years back that BOINC simply doesn't have enough fine-grained tools to handle projects like Einstein. I have tried all tricks mentioned. Nothing worked. My simple solution is NNT on Einstein until I'm low on work. Then allow tasks for two download cycles, which takes all of two minutes, and shut her off again. That gets me around 2-3 days of work before I have to run the process again. Oh well..... everyone that has an abundance of Einstein work is going to see a huge bump in their overall BOINC RAC. With mine set to 0, I only get enough to keep the GPUs busy and 1 more. They will continue to crunch Einstein until I get Seti work, then they finish the Einstein and start up on the Seti work... |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
That is, for me, a really common onboard task count for Einstein. That is about 2-3 days' work at my 12.5% resource share with SETI. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
But I don't use Einstein just as a backup project. I actively crunch at 12.5% usage and 7.5% usage for MilkyWay. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
Jimbocous Send message Joined: 1 Apr 13 Posts: 1853 Credit: 268,616,081 RAC: 1,349 |
. . And I thought I had screwed up by getting about 90 on one machine and 30 to 50 on another. 5600 is a hell of a lot of tasks .... Yeah, I screwed the pooch so bad on that one they heard it blocks down the road!:) And somehow I managed that with tasks set to 1+1 days, and priority at 1. Could be wrong, but I think the deal is that initial hit to the web site gets you downloads before the preferences are read and accepted. So it goes ... Simple lesson I know well and always ignore is to make changes on just one box, then wait a day or two to see and deal with any fallout. But, being a charge-ahead kinda guy, .... |
Ghia Send message Joined: 7 Feb 17 Posts: 238 Credit: 28,911,438 RAC: 50 |
I also got a heap of E@H WUs before I had the sense to set NNT. Saving them for Tuesday now. Setting resource share to 0 seems to work well enough, the active tasks just sit there 'Waiting to run'. Humans may rule the world...but bacteria run it... |
Stephen "Heretic" Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628 |
LOL, I'm sympathizing. I learned a few years back that BOINC simply doesn't have enough fine-grained tools to handle projects like Einstein. I have tried all tricks mentioned. Nothing worked. My simple solution is NNT on Einstein until I'm low on work. Then allow tasks for two download cycles, which takes all of two minutes, and shut her off again. That gets me around 2-3 days of work before I have to run the process again. Oh well..... everyone that has an abundance of Einstein work is going to see a huge bump in their overall BOINC RAC. . . Yep, it works pretty much the same for me. When one E@H task finishes I get another to run. Just enough to keep the mill grinding, and no more. So when Seti work does start to flow there is only one E@H task to finish on each GPU. And that doesn't take that long either. Stephen :) |
Stephen "Heretic" Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628 |
. . And I thought I had screwed up by getting about 90 on one machine and 30 to 50 on another. 5600 is a hell of a lot of tasks .... . . Damn the torpedoes, full speed a...... OOPS! Stephen :) |
Jimbocous Send message Joined: 1 Apr 13 Posts: 1853 Credit: 268,616,081 RAC: 1,349 |
But, being a charge-ahead kinda guy, .... Yep. I think I code that way too ! :) |
TBar Send message Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 |
Seems there are a few WUs apparently Hung. I was looking for a particular AR and found one in Suspended Animation. Then I found a few more just sitting there waiting for the Validator. They have been waiting a couple of days, some a few weeks. Any way to resurrect these dead WUs and get them moving again? http://setiathome.berkeley.edu/workunit.php?wuid=2377405378 http://setiathome.berkeley.edu/workunit.php?wuid=2485675271 http://setiathome.berkeley.edu/workunit.php?wuid=2485884961 http://setiathome.berkeley.edu/workunit.php?wuid=2485926712 It looks as though most of these WUs were started before the Outage, and the completed tasks since the Outage are not being acknowledged by the Validator as needing Validation. |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
I've had similar occur in the past. Nothing we can do on our end as far as I know. You just have to wait for the assimilators to process them or wait for Eric to run a script to clear out the deadwood. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
TBar Send message Joined: 22 May 99 Posts: 5204 Credit: 840,779,836 RAC: 2,768 |
Yes, and now you have a few more "Dead" tasks, https://setiathome.berkeley.edu/results.php?hostid=8030022&offset=420&state=2 In fact, I'd say it's safe to assume Anyone that had Pending Tasks at the start of the Outage has 'Dead' tasks. Hundreds of thousands...or more ;-) I have quite a few across Four machines. As the scenario suggests, probably Everyone does. |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
Yuck. So maybe there are still some crossed wires in the servers because of the data corruption remedy. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
Looks like the SETI WU and results awaiting purge flatlined right around 00:00 UTC Monday. Haveland Stats Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
Stephen "Heretic" Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628 |
Yes, and now you have a few more "Dead" tasks, https://setiathome.berkeley.edu/results.php?hostid=8030022&offset=420&state=2 . . Ouch, that would explain the big drop in my acknowledged returns since the unscheduled outage :( . . Oh well.. Stephen <shrug> |
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13750 Credit: 208,696,464 RAC: 304 |
Web site and forums behaving badly at the moment. Slow to respond, and other times not even responding at all. EDIT- just had a look in my Manager's Event log & a few Scheduler errors (Couldn't connect to server) are showing there. Grant Darwin NT |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
Yes, I've been seeing the same slowness and no server response too. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
Stephen "Heretic" Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628 |
Yes, I've been seeing the same slowness and no server response too. . . That was probably me writing a long message .. :) <joke> Stephen :) |
HAL9000 Send message Joined: 11 Sep 99 Posts: 6534 Credit: 196,805,888 RAC: 57 |
Web site and forums behaving badly at the moment. In scanning my stdoutdae.txt I do see a slight increase in the number of scheduler failures over the course of this past week. Project details for: SETI@home including all dates SETI@home classic workunits: 93,865 CPU time: 863,447 hours Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[ |
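For anyone wanting to do the same scan of stdoutdae.txt, here is a rough sketch that counts scheduler failures per day. It assumes each log line looks like `22-Jan-2018 10:15:32 [SETI@home] Scheduler request failed: Couldn't connect to server`; if your client writes a different date or message format, the pattern needs adjusting. The function name `failures_per_day` and the sample lines are made up for illustration.

```python
import re
from collections import Counter

# Assumed line shape: "DD-Mon-YYYY HH:MM:SS [project] message"
LINE_RE = re.compile(
    r"^(\d{2}-[A-Za-z]{3}-\d{4}) \d{2}:\d{2}:\d{2} \[(.*?)\] (.*)$"
)


def failures_per_day(lines, project="SETI@home"):
    """Count 'Scheduler request failed' messages per calendar day for one project."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line.strip())
        if not match:
            continue  # skip lines that do not fit the assumed format
        day, proj, message = match.groups()
        if proj == project and message.startswith("Scheduler request failed"):
            counts[day] += 1
    return counts


if __name__ == "__main__":
    # Hypothetical sample lines, not taken from a real log.
    sample = [
        "22-Jan-2018 10:15:32 [SETI@home] Scheduler request failed: Couldn't connect to server",
        "22-Jan-2018 10:20:40 [SETI@home] Scheduler request completed: got 5 new tasks",
        "22-Jan-2018 11:02:01 [SETI@home] Scheduler request failed: Couldn't connect to server",
        "23-Jan-2018 09:00:00 [Einstein@Home] Scheduler request failed: Timeout was reached",
    ]
    for day, count in failures_per_day(sample).items():
        print(day, count)
```

To run it against a real log, pass an open file instead of the sample list, e.g. `failures_per_day(open("stdoutdae.txt", errors="replace"))`, with the path adjusted to your BOINC data directory.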
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.