Panic Mode On (109) Server Problems?

Message boards : Number crunching : Panic Mode On (109) Server Problems?

Iona
Joined: 12 Jul 07
Posts: 790
Credit: 22,438,118
RAC: 0
United Kingdom
Message 1907888 - Posted: 18 Dec 2017, 18:49:27 UTC

I know it is a week away, but will there still be the usual 'Maintenance' on the 26th, or will it be sooner/later to accommodate holidays etc. over the Christmas period? TBar... I frequently get instances where I have to manually 'update', and there are also quite a few times when I can (as an example) report 3 tasks and get no tasks in return, without any mention of 'no tasks available'. I'm now at 184 tasks in progress, down from the 200 of 2 days ago. For what it is worth, I still feel that a number of apparent issues need resolving and preparatory work needs doing for the Parkes upgrades, so it is best to do it sooner rather than later.
Don't take life too seriously, as you'll never come out of it alive!
ID: 1907888
Cruncher-American Crowdfunding Project Donor*, Special Project $75 donor, Special Project $250 donor
Joined: 25 Mar 02
Posts: 1513
Credit: 370,893,186
RAC: 340
United States
Message 1907939 - Posted: 19 Dec 2017, 1:27:16 UTC

Hey - it seems the file 13ja07ad has been stuck for a while. Has anyone else noticed this? Or am I crazy?
ID: 1907939
Grant (SSSF)
Volunteer tester
Joined: 19 Aug 99
Posts: 13351
Credit: 208,696,464
RAC: 304
Australia
Message 1908000 - Posted: 19 Dec 2017, 6:39:23 UTC - in response to Message 1907849.  
Last modified: 19 Dec 2017, 7:25:17 UTC

Why would it Ignore one Host and just miss a single beat on the other Host sitting next to it?

Just the general perversity of nature & technology.

Edit-
The present work mix is a great example of that.
Here we are, just before the weekly outage, and most of the work I'm getting is Arecibo. And most of that is shorties.
Grant
Darwin NT
ID: 1908000
Wiggo
Joined: 24 Jan 00
Posts: 26943
Credit: 261,360,520
RAC: 489
Australia
Message 1908001 - Posted: 19 Dec 2017, 6:47:41 UTC - in response to Message 1907939.  

Hey - it seems the file 13ja07ad has been stuck for a while. Has anyone else noticed this? Or am I crazy?

It's working, but just maybe a little slower than usual as AP's as well as MB's are being split off that file at the same time.

Cheers.
ID: 1908001
Sirius B Project Donor
Volunteer tester
Joined: 26 Dec 00
Posts: 24501
Credit: 3,081,182
RAC: 7
Ireland
Message 1908043 - Posted: 19 Dec 2017, 23:44:46 UTC

Not too long this week :-)
ID: 1908043
Cruncher-American Crowdfunding Project Donor*, Special Project $75 donor, Special Project $250 donor
Joined: 25 Mar 02
Posts: 1513
Credit: 370,893,186
RAC: 340
United States
Message 1908052 - Posted: 20 Dec 2017, 0:56:08 UTC - in response to Message 1908001.  

Hey - it seems the file 13ja07ad has been stuck for a while. Has anyone else noticed this? Or am I crazy?

It's working, but just maybe a little slower than usual as AP's as well as MB's are being split off that file at the same time.

Cheers.


Maybe so, but it has been there for at least a day (or more), I believe. Shouldn't take that long, should it?
ID: 1908052
Grant (SSSF)
Volunteer tester
Joined: 19 Aug 99
Posts: 13351
Credit: 208,696,464
RAC: 304
Australia
Message 1908110 - Posted: 20 Dec 2017, 7:49:10 UTC

The replica is finally starting to catch up.
However, the AP & MB file deleters are having issues again.
Grant
Darwin NT
ID: 1908110
Zalster Special Project $250 donor
Volunteer tester
Joined: 27 May 99
Posts: 5516
Credit: 528,817,460
RAC: 242
United States
Message 1908363 - Posted: 21 Dec 2017, 22:17:32 UTC

Something is going on. The supply of available work units is slowing down.
ID: 1908363
Wiggo
Joined: 24 Jan 00
Posts: 26943
Credit: 261,360,520
RAC: 489
Australia
Message 1908364 - Posted: 21 Dec 2017, 22:33:04 UTC - in response to Message 1908363.  

Something is going on. The supply of available work units is slowing down.

I think that you jumped the gun there. ;-)

Ready to send 609,973 and a creation rate of 56.4247/sec is currently showing.

Cheers.
ID: 1908364
Stephen "Heretic" Crowdfunding Project Donor*, Special Project $75 donor, Special Project $250 donor
Volunteer tester
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1908365 - Posted: 21 Dec 2017, 22:51:33 UTC - in response to Message 1908364.  

Something is going on. The supply of available work units is slowing down.

I think that you jumped the gun there. ;-)

Ready to send 609,973 and a creation rate of 56.4247/sec is currently showing.

Cheers.


. . But he is still right: I am getting "no tasks available" and simply "no tasks sent" again. This issue is independent of whether work is actually available ... and shows up mostly on the most productive machines. It seems the schedulers have a limitation that prevents them from keeping work supplied above some unspecified rate, at least in the long term. Richard speculated that it was the transfer limit from the RTS queue to the download buffer (what I call the hopper), but I would think that would mostly show up as shortfalls in downloads rather than none at all.

Stephen
ID: 1908365
Wiggo
Joined: 24 Jan 00
Posts: 26943
Credit: 261,360,520
RAC: 489
Australia
Message 1908368 - Posted: 21 Dec 2017, 22:55:46 UTC - in response to Message 1908365.  

Something is going on. The supply of available work units is slowing down.

I think that you jumped the gun there. ;-)

Ready to send 609,973 and a creation rate of 56.4247/sec is currently showing.

Cheers.


. . But he is still right: I am getting "no tasks available" and simply "no tasks sent" again. This issue is independent of whether work is actually available ... and shows up mostly on the most productive machines. It seems the schedulers have a limitation that prevents them from keeping work supplied above some unspecified rate, at least in the long term. Richard speculated that it was the transfer limit from the RTS queue to the download buffer (what I call the hopper), but I would think that would mostly show up as shortfalls in downloads rather than none at all.

Stephen

Well that is not happening here at all.

Cheers.
ID: 1908368
Zalster Special Project $250 donor
Volunteer tester
Joined: 27 May 99
Posts: 5516
Credit: 528,817,460
RAC: 242
United States
Message 1908379 - Posted: 21 Dec 2017, 23:44:09 UTC - in response to Message 1908368.  

"Not supposed to" and "is" are always 2 separate things, lol...

Number in progress is dropping like a rock; I see Einstein@home getting a boost in work done within the next 3 hours...
ID: 1908379
Stephen "Heretic" Crowdfunding Project Donor*, Special Project $75 donor, Special Project $250 donor
Volunteer tester
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1908381 - Posted: 21 Dec 2017, 23:55:20 UTC - in response to Message 1908368.  


. . But he is still right: I am getting "no tasks available" and simply "no tasks sent" again.
Stephen

Well that is not happening here at all.

Cheers.


. . Yes, I have noticed your zone of immunity :)

Stephen

:)
ID: 1908381
Zalster Special Project $250 donor
Volunteer tester
Joined: 27 May 99
Posts: 5516
Credit: 528,817,460
RAC: 242
United States
Message 1908384 - Posted: 22 Dec 2017, 0:05:34 UTC

Keith is right, B@tch about the servers and they reward you with work... Was down to about 25 and then the flood gates opened.....
ID: 1908384
Keith Myers Special Project $250 donor
Volunteer tester
Joined: 29 Apr 01
Posts: 13141
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1908385 - Posted: 22 Dec 2017, 0:09:02 UTC

Yes, I am having issues getting work too, with lots of "no tasks are available" replies or only one task every 10-20 minutes after returning 5-10 tasks. The Triple Update gets some success for a while, but caches ultimately keep falling.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1908385
Keith Myers Special Project $250 donor
Volunteer tester
Joined: 29 Apr 01
Posts: 13141
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1908386 - Posted: 22 Dec 2017, 0:09:37 UTC - in response to Message 1908384.  

Keith is right, B@tch about the servers and they reward you with work... Was down to about 25 and then the flood gates opened.....

Hence, my post ;-}
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1908386
Keith Myers Special Project $250 donor
Volunteer tester
Joined: 29 Apr 01
Posts: 13141
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1908387 - Posted: 22 Dec 2017, 0:11:28 UTC - in response to Message 1908381.  


. . But he is still right: I am getting "no tasks available" and simply "no tasks sent" again.
Stephen

Well that is not happening here at all.

Cheers.


. . Yes, I have noticed your zone of immunity :)

Stephen

:)

Wiggo's zone of immunity is BOINC 6.10.60.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1908387
TBar
Volunteer tester
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1908388 - Posted: 22 Dec 2017, 0:13:15 UTC

One of my machines is currently being starved. It's currently down by about 100 tasks and seems to have been holding there for a while. All four machines are connected to the same Ethernet switch, but only one is currently having trouble. It reminds me of a rolling blackout: one or two machines will stop receiving work, drop by 40 or 100 tasks, and then recover by themselves. Then a bit later it will happen to another machine. The fastest machine seems to be almost unaffected.
ID: 1908388
Grant (SSSF)
Volunteer tester
Joined: 19 Aug 99
Posts: 13351
Credit: 208,696,464
RAC: 304
Australia
Message 1908412 - Posted: 22 Dec 2017, 5:53:40 UTC - in response to Message 1908388.  

Got home to find my cache running down as well. The triple update got it flowing again (at least for now).
Grant
Darwin NT
ID: 1908412
Wiggo
Joined: 24 Jan 00
Posts: 26943
Credit: 261,360,520
RAC: 489
Australia
Message 1908415 - Posted: 22 Dec 2017, 7:17:45 UTC

I still have full caches here and I get 1 for 1 at every request so far today.

Yes, as Stephen mentioned, I do use 6.10.60, and the only time I suffer is if there's a real problem. ;-)

Cheers.
ID: 1908415


 