Panic Mode On (107) Server Problems?

Message boards : Number crunching : Panic Mode On (107) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 4 · 5 · 6 · 7 · 8 · 9 · 10 . . . 29 · Next

AuthorMessage
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13854
Credit: 208,696,464
RAC: 304
Australia
Message 1883265 - Posted: 11 Aug 2017, 22:08:20 UTC - in response to Message 1883261.  

running out of work...and on a non-maintenance day...

And what work there is, is almost all Arecibo. Very little GBT at all.

Won't be much of a WOW event if they can't get any work.
Grant
Darwin NT
ID: 1883265 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 36765
Credit: 261,360,520
RAC: 489
Australia
Message 1883267 - Posted: 11 Aug 2017, 22:26:04 UTC

I just went through my logs here and I havn't had any issues getting work since our extended outrage ended and I also have plenty of GBT onboard so it must be that quirky problem you fellows have been battling this year.

Cheers.
ID: 1883267 · Report as offensive
Profile Bernie Vine
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 26 May 99
Posts: 9958
Credit: 103,452,613
RAC: 328
United Kingdom
Message 1883271 - Posted: 11 Aug 2017, 22:34:00 UTC - in response to Message 1883267.  

I just went through my logs here and I havn't had any issues getting work since our extended outrage ended and I also have plenty of GBT onboard so it must be that quirky problem you fellows have been battling this year.

Cheers.

Yes same here, my machines are not in the same league as some here but every time I see complaints about not getting work my old machines are usually full as they are now.

Just watched my 970 finish two tasks and get two more
ID: 1883271 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1883273 - Posted: 11 Aug 2017, 22:34:48 UTC - in response to Message 1883202.  

Seems like there is problem with work being sent out...


. . LOL, when is there not?

Stephen

:)
ID: 1883273 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1883275 - Posted: 11 Aug 2017, 22:37:31 UTC - in response to Message 1883265.  
Last modified: 11 Aug 2017, 22:38:47 UTC

running out of work...and on a non-maintenance day...

And what work there is, is almost all Arecibo. Very little GBT at all.

Won't be much of a WOW event if they can't get any work.


. . If it cheers you up, 66% of the tasks I have are GBT work.

Stephen

:(
ID: 1883275 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13854
Credit: 208,696,464
RAC: 304
Australia
Message 1883277 - Posted: 11 Aug 2017, 22:38:04 UTC - in response to Message 1883209.  
Last modified: 11 Aug 2017, 22:51:21 UTC

I have tried the flipping of applications and TBar method of hitting update 3 times rapidly just as the computer requests work and nothing has persuaded the scheduler to send me work.

I just tried TBars' repeated requests, then picked up 49, and then 12 WUs on the following requests.
Seem ridiculous, but it's worked every time I've tried it so far.


EDIT-
And the even weirder thing is that once you get that initial dump of outstanding work, the Scheduler will then continue to send replacement work as you return it. At least until the next time it gets stupid.
Grant
Darwin NT
ID: 1883277 · Report as offensive
Profile Zalster Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 27 May 99
Posts: 5517
Credit: 528,817,460
RAC: 242
United States
Message 1883280 - Posted: 11 Aug 2017, 22:45:43 UTC - in response to Message 1883275.  



. . If it cheers you up, 66% of the tasks I have are GBT work.

Stephen

:(


Since I don't reschedule, it makes no difference to me. I just want work, lol... Was down to 12 work units before it finally decided to give me some more...

Yeah, if it keeps this up, the WOW isn't going to be very much fun...
ID: 1883280 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1883282 - Posted: 11 Aug 2017, 22:59:29 UTC - in response to Message 1883277.  

I have tried the flipping of applications and TBar method of hitting update 3 times rapidly just as the computer requests work and nothing has persuaded the scheduler to send me work.

I just tried TBars' repeated requests, then picked up 49, and then 12 WUs on the following requests.
Seem ridiculous, but it's worked every time I've tried it so far.


EDIT-
And the even weirder thing is that once you get that initial dump of outstanding work, the Scheduler will then continue to send replacement work as you return it. At least until the next time it gets stupid.

I think there is a timing issue with TBar's method. You have to hit the sequence just right. I managed to get it work on my 5th or 6th try and started getting work. The Linux machine is back to a full cache.

Now the two Windows 7 machines are down in work with no tasks available. Was away for an hour and will have to try TBar's technique on those machines.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1883282 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13854
Credit: 208,696,464
RAC: 304
Australia
Message 1883283 - Posted: 11 Aug 2017, 23:02:18 UTC - in response to Message 1883282.  

I think there is a timing issue with TBar's method. You have to hit the sequence just right.

Definitely.
Grant
Darwin NT
ID: 1883283 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1883342 - Posted: 12 Aug 2017, 4:48:01 UTC - in response to Message 1883283.  
Last modified: 12 Aug 2017, 4:50:09 UTC

I think there is a timing issue with TBar's method. You have to hit the sequence just right.
Definitely.
What I have found is there needs to be 2 'empty' reports where no tasks are sent back. If you have tasks to report, it takes 3 updates. And wait for updates to complete before the next one.
EDIT: And sometimes flipping prefs, and double reporting doesn't work, but a restart resets things - grrr!
ID: 1883342 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13854
Credit: 208,696,464
RAC: 304
Australia
Message 1883353 - Posted: 12 Aug 2017, 6:31:53 UTC - in response to Message 1883342.  

What I have found is there needs to be 2 'empty' reports where no tasks are sent back.

Possibly.
I just do as Tbar posted.
Update, then as soon as it says "Scheduler request in progress", Update again, then once the request is finished, Update again.
So far, so good.

And to add to the usual Scheduler woes, I notice my Pendings are gradually going through the roof. Usually around 750, now up over 1,000.
Grant
Darwin NT
ID: 1883353 · Report as offensive
Profile Bernie Vine
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 26 May 99
Posts: 9958
Credit: 103,452,613
RAC: 328
United Kingdom
Message 1883354 - Posted: 12 Aug 2017, 6:46:28 UTC

And to add to the usual Scheduler woes, I notice my Pendings are gradually going through the roof. Usually around 750, now up over 1,000.


Very likely due to "bunkering" for the WOW event.
ID: 1883354 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1883357 - Posted: 12 Aug 2017, 7:25:33 UTC - in response to Message 1883342.  

I think there is a timing issue with TBar's method. You have to hit the sequence just right.
Definitely.
What I have found is there needs to be 2 'empty' reports where no tasks are sent back. If you have tasks to report, it takes 3 updates. And wait for updates to complete before the next one.
EDIT: And sometimes flipping prefs, and double reporting doesn't work, but a restart resets things - grrr!

I think there is some truth to this observation now that I think about it. It helps to have those "empty" reports where nothing is sent back.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1883357 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51478
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1883364 - Posted: 12 Aug 2017, 11:21:15 UTC

I wish I had some advice for everybody here, but alas I do not.
I don't wish to jinx myself, but I have never had any problem getting work except when RTS was empty and nobody was getting much work.

I am using an old tried and true version of Boinc. I can't imagine that the work requests from the old Boinc are different than from a recent version, or that the scheduler would treat them any differently.
I am using XP on 4 rigs, and 7 on my daily driver. No difference there.
I am using the most recent version of the Lunatics installer.

I dunno. Maybe the scheduler just says 'Make way, it's the kittyman calling'...............LOL.

Meow meow meow.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1883364 · Report as offensive
Profile Jeff Buck Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 11 Feb 00
Posts: 1441
Credit: 148,764,870
RAC: 0
United States
Message 1883384 - Posted: 12 Aug 2017, 16:12:20 UTC - in response to Message 1883364.  

I, too, have tried to avoid "jinxing" myself by commenting on this issue, as I've never had a problem keeping the work buffers full (or close to it) except when the feeder was clogged with Arecibo VLARs. However, yesterday my #1 cruncher actually ran out of GPU work late in the afternoon. Fortunately, it reached that point about 5 minutes before its scheduled weekday afternoon shutdown (avoiding peak period electric rates). When it came back up 5 hours later, it filled the buffer in 2 requests, 176 tasks in the first and 127 in the second. I tend to believe that Arecibo VLARs were the primary culprit again yesterday, but then that probably wouldn't explain the responses that said the limit of tasks in progress had been reached when, in fact, the work buffer was steadily shrinking to zero.

Since refraining from comment ultimately didn't avoid the jinx, I'll go ahead and mention the following, for what it's worth. I haven't touched my settings in a long, long time, probably a couple years. My application preferences are "Yes, Yes, No" and my work buffer is 1.8 days plus 0.2 days. Also, assuming that the project shares might somehow play into the scheduler's decision-making, I have no active backup projects that might possibly muddy the waters.

That's about it, for whatever bit of insight it might provide.
ID: 1883384 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1883387 - Posted: 12 Aug 2017, 16:35:25 UTC - in response to Message 1883384.  


Since refraining from comment ultimately didn't avoid the jinx, I'll go ahead and mention the following, for what it's worth. I haven't touched my settings in a long, long time, probably a couple years. My application preferences are "Yes, Yes, No" and my work buffer is 1.8 days plus 0.2 days. Also, assuming that the project shares might somehow play into the scheduler's decision-making, I have no active backup projects that might possibly muddy the waters.

That's about it, for whatever bit of insight it might provide.


What I wonder based on your comments and kittyman's is that you both have no backup projects muddying the waters as you say with shared resources to other projects. I have always had my resources split between SETI, Einstein and MilkyWay. I would need to look at all the others profiles that have this issue to see if it is a common factor.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1883387 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1883388 - Posted: 12 Aug 2017, 16:36:07 UTC - in response to Message 1883364.  

I wish I had some advice for everybody here, but alas I do not.
I don't wish to jinx myself, but I have never had any problem getting work except when RTS was empty and nobody was getting much work.

I am using an old tried and true version of Boinc. I can't imagine that the work requests from the old Boinc are different than from a recent version, or that the scheduler would treat them any differently.
I am using XP on 4 rigs, and 7 on my daily driver. No difference there.
I am using the most recent version of the Lunatics installer.

I dunno. Maybe the scheduler just says 'Make way, it's the kittyman calling'...............LOL.

Meow meow meow.

Pretty early on things like BOINC and OS versions were discussed to try and pinpoint their issue.
Nothing has really made sense as to why only a few users are having issues.
The only thing I can think of at this point is that there is some weirdness along their route connecting to Berkley.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1883388 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1883389 - Posted: 12 Aug 2017, 16:44:28 UTC - in response to Message 1883388.  


Pretty early on things like BOINC and OS versions were discussed to try and pinpoint their issue.
Nothing has really made sense as to why only a few users are having issues.
The only thing I can think of at this point is that there is some weirdness along their route connecting to Berkley.

You allude to connectivity issues being the problem. But if you don't connect to Berkeley, then you obviously couldn't get the responses we receive that you have reached a limit of tasks in progress or that there are no tasks to send.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1883389 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51478
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1883390 - Posted: 12 Aug 2017, 16:45:02 UTC - in response to Message 1883387.  

That is true for me.
I have backup projects loaded if Seti were to go down for an extended period of time.
However, none is active. For now, if Seti runs dry, the rigs just take a rest.

Meow.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1883390 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1883393 - Posted: 12 Aug 2017, 17:06:36 UTC - in response to Message 1883389.  


Pretty early on things like BOINC and OS versions were discussed to try and pinpoint their issue.
Nothing has really made sense as to why only a few users are having issues.
The only thing I can think of at this point is that there is some weirdness along their route connecting to Berkley.

You allude to connectivity issues being the problem. But if you don't connect to Berkeley, then you obviously couldn't get the responses we receive that you have reached a limit of tasks in progress or that there are no tasks to send.

Connectivity issues can be a lot more complicated than simply connection vs no connection.
If the data is malformed or truncated you can still get a response, but it likely won't be what is expected.
Not receiving a "Not sending work - last request too recent:" response after performing several updates would indicate something is being lost.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1883393 · Report as offensive
Previous · 1 . . . 4 · 5 · 6 · 7 · 8 · 9 · 10 . . . 29 · Next

Message boards : Number crunching : Panic Mode On (107) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.