Panic Mode On (105) Server Problems?

Message boards : Number crunching : Panic Mode On (105) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 20 · 21 · 22 · 23 · 24 · 25 · 26 . . . 34 · Next

AuthorMessage
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1859090 - Posted: 2 Apr 2017, 1:20:39 UTC

LOL, I'm sympathizing. I learned a few years back that BOINC simply doesn't have enough fine-grained tools to handle projects like Einstein. I have tried all tricks mentioned. Nothing worked. My simple solution is NNT on Einstein until I'm low on work. Then allow tasks for two download cycles, which takes all of two minutes, and shut her off again. That gets me around 2-3 days of work before I have to run the process again. Oh well..... everyone that has an abundance of Einstein work is going to see a huge bump in their overall BOINC RAC.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1859090 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1859091 - Posted: 2 Apr 2017, 1:21:24 UTC - in response to Message 1859053.  

. . Did you set the E@H resource priority to 0%? That advice proved successful for me.

I also added E@H, but couldn't find where to set resource proiority....?

I too wasn't able to find the setting in BOINC Manager to change the priority on projects. And I made the mistake of downloading a crap load of tasks from 2 other projects so it might be a couple days before my Windows machine is able to start the Seti tasks it has. But I'm glad to wake up and see new Seti tasks have been downloaded to my 2 best computers!

To set resource priority to other than the default 100, on the E@H web page, under tabs:
Account>Preferences>Project
it's the first field, Resource Share.

Like others, it may be a while before I'm back to pumping SETI, as I ended up with like 5600 tasks between 5 machines before I could get it choked off.


. . And I thought I had screwed up by getting about 90 on one machine and 30 to 50 on another. 5600 is a hell of lot of tasks ....

Stephen

?
ID: 1859091 · Report as offensive
Profile Zalster Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 27 May 99
Posts: 5517
Credit: 528,817,460
RAC: 242
United States
Message 1859095 - Posted: 2 Apr 2017, 1:43:00 UTC - in response to Message 1859090.  

LOL, I'm sympathizing. I learned a few years back that BOINC simply doesn't have enough fine-grained tools to handle projects like Einstein. I have tried all tricks mentioned. Nothing worked. My simple solution is NNT on Einstein until I'm low on work. Then allow tasks for two download cycles, which takes all of two minutes, and shut her off again. That gets me around 2-3 days of work before I have to run the process again. Oh well..... everyone that has an abundance of Einstein work is going to see a huge bump in their overall BOINC RAC.


With mine set to 0, I only get enough to keep the GPUs busy and 1 more. They will continue to crunch Einstein until I get Seti work, then they finish the Einstein and start up on the Seti work...
ID: 1859095 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1859099 - Posted: 2 Apr 2017, 2:00:12 UTC - in response to Message 1859091.  



. . And I thought I had screwed up by getting about 90 on one machine and 30 to 50 on another. 5600 is a hell of lot of tasks ....

Stephen

?

That is for me is a really common onboard task count for Einstein. That is about 2-3 days work at my 12.5% resource share with SETI.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1859099 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1859101 - Posted: 2 Apr 2017, 2:01:42 UTC - in response to Message 1859095.  



With mine set to 0, I only get enough to keep the GPUs busy and 1 more. They will continue to crunch Einstein until I get Seti work, then they finish the Einstein and start up on the Seti work...

But I don't use Einstein just as a backup project. I actively crunch at 12.5% usage and 7.5% usage for MilkyWay.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1859101 · Report as offensive
Profile Jimbocous Project Donor
Volunteer tester
Avatar

Send message
Joined: 1 Apr 13
Posts: 1853
Credit: 268,616,081
RAC: 1,349
United States
Message 1859111 - Posted: 2 Apr 2017, 3:15:49 UTC - in response to Message 1859091.  

. . And I thought I had screwed up by getting about 90 on one machine and 30 to 50 on another. 5600 is a hell of lot of tasks ....
Stephen
?

Yeah, I screwed the pooch so bad on that one they heard it blocks down the road!:)
And somehow I managed that with tasks set to 1+1 days, and priority at 1.
Could be wrong, but I think the deal is that initial hit to the web site gets you downloads before the preferences are read and accepted.
So it goes ...
Simple lesson I know well and always ignore is to make changes on just one box, then wait a day or two to see and deal with any fallout.
But, being a charge-ahead kinda guy, ....
ID: 1859111 · Report as offensive
Ghia
Avatar

Send message
Joined: 7 Feb 17
Posts: 238
Credit: 28,911,438
RAC: 50
Norway
Message 1859152 - Posted: 2 Apr 2017, 7:40:29 UTC - in response to Message 1859111.  

I also got a heap of E@H WUs before I had the sense to set NNT. Saving them for Tuesday now. Setting resource share to 0 seems to work well enough, the active tasks just sit there 'Waining to run'.
Humans may rule the world...but bacteria run it...
ID: 1859152 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1859171 - Posted: 2 Apr 2017, 11:39:21 UTC - in response to Message 1859095.  

LOL, I'm sympathizing. I learned a few years back that BOINC simply doesn't have enough fine-grained tools to handle projects like Einstein. I have tried all tricks mentioned. Nothing worked. My simple solution is NNT on Einstein until I'm low on work. Then allow tasks for two download cycles, which takes all of two minutes, and shut her off again. That gets me around 2-3 days of work before I have to run the process again. Oh well..... everyone that has an abundance of Einstein work is going to see a huge bump in their overall BOINC RAC.


With mine set to 0, I only get enough to keep the GPUs busy and 1 more. They will continue to crunch Einstein until I get Seti work, then they finish the Einstein and start up on the Seti work...


. . Yep, it works pretty much the same for me. When one E@H task finishes I get another to run. Just enough to keep the mill grinding, and no more. So when Seti work does start to flow there is only one E@H tasks to finish on each GPU. And that doesn't take that long either.

Stephen

:)
ID: 1859171 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1859173 - Posted: 2 Apr 2017, 11:41:31 UTC - in response to Message 1859111.  

. . And I thought I had screwed up by getting about 90 on one machine and 30 to 50 on another. 5600 is a hell of lot of tasks ....
Stephen
?

Yeah, I screwed the pooch so bad on that one they heard it blocks down the road!:)
And somehow I managed that with tasks set to 1+1 days, and priority at 1.
Could be wrong, but I think the deal is that initial hit to the web site gets you downloads before the preferences are read and accepted.
So it goes ...
Simple lesson I know well and always ignore is to make changes on just one box, then wait a day or two to see and deal with any fallout.
But, being a charge-ahead kinda guy, ....


. . Damn the torpedoes, full speed a...... OOPS!

Stephen

:)
ID: 1859173 · Report as offensive
Profile Jimbocous Project Donor
Volunteer tester
Avatar

Send message
Joined: 1 Apr 13
Posts: 1853
Credit: 268,616,081
RAC: 1,349
United States
Message 1859185 - Posted: 2 Apr 2017, 13:36:31 UTC - in response to Message 1859173.  

But, being a charge-ahead kinda guy, ....


. . Damn the torpedoes, full speed a...... OOPS!

Stephen

:)

Yep. I think I code that way too ! :)
ID: 1859185 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1859291 - Posted: 3 Apr 2017, 3:42:55 UTC

Seems there are a few WUs apparently Hung. I was looking for a particular AR and found one in Suspended Animation. Then I found a few more just sitting there waiting for the Validator. They have been waiting a couple of days, some a few weeks. Anyway to resurrect these dead WUs and get them moving again?
http://setiathome.berkeley.edu/workunit.php?wuid=2377405378
http://setiathome.berkeley.edu/workunit.php?wuid=2485675271
http://setiathome.berkeley.edu/workunit.php?wuid=2485884961
http://setiathome.berkeley.edu/workunit.php?wuid=2485926712

It looks as though most of these WUs were started before the Outage, and the competed tasks since the Outage are not being acknowledged by the Validator as needing Validation.
ID: 1859291 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1859294 - Posted: 3 Apr 2017, 4:04:35 UTC - in response to Message 1859291.  

I've had similar occur in the past. Nothing we can do on our end as far as I know. You just have to wait for the assimilators to process them or wait for Eric to run a script to clear out the deadwood.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1859294 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1859311 - Posted: 3 Apr 2017, 5:37:04 UTC - in response to Message 1859294.  
Last modified: 3 Apr 2017, 5:41:35 UTC

Yes, and now you have a few more "Dead" tasks, https://setiathome.berkeley.edu/results.php?hostid=8030022&offset=420&state=2
In fact, I'd say it's safe to assume Anyone that had Pending Tasks at the start of the Outage has 'Dead' tasks.
Hundreds of thousands...or more ;-)
I have quite a few across Four machines. As the scenario suggests, probably Everyone does.
ID: 1859311 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1859312 - Posted: 3 Apr 2017, 5:45:06 UTC - in response to Message 1859311.  

Yuck. So maybe there are still some crossed wires in the servers because of the data corruption remedy.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1859312 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1859313 - Posted: 3 Apr 2017, 5:49:03 UTC

Looks like the SETI WU and results awaiting purge flatlined right around 00:00 UTC Monday. Haveland Stats
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1859313 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1859399 - Posted: 3 Apr 2017, 22:08:01 UTC - in response to Message 1859311.  

Yes, and now you have a few more "Dead" tasks, https://setiathome.berkeley.edu/results.php?hostid=8030022&offset=420&state=2
In fact, I'd say it's safe to assume Anyone that had Pending Tasks at the start of the Outage has 'Dead' tasks.
Hundreds of thousands...or more ;-)
I have quite a few across Four machines. As the scenario suggests, probably Everyone does.


. . Ouch, that would explain the big drop in my acknowledged returns since the unscheduled outage :(

. . Oh well..

Stephen

<shrug>
ID: 1859399 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13736
Credit: 208,696,464
RAC: 304
Australia
Message 1860268 - Posted: 8 Apr 2017, 5:22:37 UTC
Last modified: 8 Apr 2017, 5:25:54 UTC

Web site and forums behaving badly at the moment.
Slow to respond, and other times not even responding at all.

EDIT- just had a look in my Manager's Event log & a few Scheduler errors (Couldn't connect to server) are showing there.
Grant
Darwin NT
ID: 1860268 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1860274 - Posted: 8 Apr 2017, 6:43:12 UTC - in response to Message 1860268.  

Yes, I've been seeing the same slowness and no server response too.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1860274 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1860295 - Posted: 8 Apr 2017, 8:53:21 UTC - in response to Message 1860274.  

Yes, I've been seeing the same slowness and no server response too.


. . That was probably me writing a long message .. :) <joke>

Stephen

:)
ID: 1860295 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1860346 - Posted: 8 Apr 2017, 15:05:05 UTC - in response to Message 1860268.  

Web site and forums behaving badly at the moment.
Slow to respond, and other times not even responding at all.

EDIT- just had a look in my Manager's Event log & a few Scheduler errors (Couldn't connect to server) are showing there.

In scanning my stdoutdae.txt I do see a slight increase in the number of scheduler failures over the course of this past week.
Project details for: SETI@home including all dates
Scheduler Requests: 4090
Scheduler Success: 99 %, Count: 4062
Scheduler Failure: 0 %, Count: 28 (Total)
Scheduler Failure: 0 % of total, Count: 24 (Couldn't connect to server)
Scheduler Failure: 0 % of total, Count: 3 (HTTP service unavailable)
Scheduler Failure: 0 % of total, Count: 0 (HTTP internal server error)
Scheduler Failure: 0 % of total, Count: 1 (Failure when receiving data from the peer)
Scheduler Failure: 0 % of total, Count: 0 (Timeout was reached)
Scheduler Timeout: 0 % of failures

Project details for: SETI@home including 08-Apr-2017
Scheduler Requests: 217
Scheduler Success: 98 %, Count: 214
Scheduler Failure: 1 %, Count: 3 (Total)
Scheduler Failure: 1 % of total, Count: 3 (Couldn't connect to server)

Project details for: SETI@home including 07-Apr-2017
Scheduler Requests: 480
Scheduler Success: 99 %, Count: 479
Scheduler Failure: 0 %, Count: 1 (Total)
Scheduler Failure: 0 % of total, Count: 1 (Couldn't connect to server)

Project details for: SETI@home including 06-Apr-2017
Scheduler Requests: 479
Scheduler Success: 99 %, Count: 476
Scheduler Failure: 0 %, Count: 3 (Total)
Scheduler Failure: 0 % of total, Count: 2 (Couldn't connect to server)
Scheduler Failure: 0 % of total, Count: 1 (HTTP service unavailable)

Project details for: SETI@home including 05-Apr-2017
Scheduler Requests: 478
Scheduler Success: 99 %, Count: 474
Scheduler Failure: 0 %, Count: 4 (Total)
Scheduler Failure: 0 % of total, Count: 2 (Couldn't connect to server)
Scheduler Failure: 0 % of total, Count: 2 (HTTP service unavailable)

Project details for: SETI@home including 04-Apr-2017
Scheduler Requests: 381
Scheduler Success: 99 %, Count: 379
Scheduler Failure: 0 %, Count: 2 (Total)
Scheduler Failure: 0 % of total, Count: 2 (Couldn't connect to server)

Project details for: SETI@home including 03-Apr-2017
Scheduler Requests: 480
Scheduler Success: 99 %, Count: 476
Scheduler Failure: 0 %, Count: 4 (Total)
Scheduler Failure: 0 % of total, Count: 4 (Couldn't connect to server)

Project details for: SETI@home including 02-Apr-2017
Scheduler Requests: 480
Scheduler Success: 100 %, Count: 480
Scheduler Failure: 0 %, Count: 0 (Total)

SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1860346 · Report as offensive
Previous · 1 . . . 20 · 21 · 22 · 23 · 24 · 25 · 26 . . . 34 · Next

Message boards : Number crunching : Panic Mode On (105) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.