Panic Mode On (115) Server Problems?

Message boards : Number crunching : Panic Mode On (115) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 4 · 5 · 6 · 7 · 8 · 9 · 10 . . . 31 · Next

AuthorMessage
Profile Tom M
Volunteer tester

Send message
Joined: 28 Nov 02
Posts: 5126
Credit: 276,046,078
RAC: 462
Message 1982673 - Posted: 28 Feb 2019, 22:59:25 UTC - in response to Message 1982671.  

Scroll to the bottom task in the list on the download page in the Manager. Select it with the mouse and click the Retry button. I can usually get a dozen or so tasks that way cleared from the list before the download server begins to ignore me and give me a increased backoff. Then I move to another host and try there until it too craps out. Then move to another host etc.


Ahh, the "bottom" has it. TY!
A proud member of the OFA (Old Farts Association).
ID: 1982673 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1982674 - Posted: 28 Feb 2019, 23:02:11 UTC - in response to Message 1982664.  

My caches are both back to normal again.

Cheers.


. . Are you still using Vader? If so what is the address? If you don't mind?

Stephen

? ?
ID: 1982674 · Report as offensive
Profile Tom M
Volunteer tester

Send message
Joined: 28 Nov 02
Posts: 5126
Credit: 276,046,078
RAC: 462
Message 1982675 - Posted: 28 Feb 2019, 23:02:31 UTC - in response to Message 1982673.  

Scroll to the bottom task in the list on the download page in the Manager. Select it with the mouse and click the Retry button. I can usually get a dozen or so tasks that way cleared from the list before the download server begins to ignore me and give me a increased backoff. Then I move to another host and try there until it too craps out. Then move to another host etc.


Ahh, the "bottom" has it. TY!


Honest, I have no idea. Bumped one and suddenly tons of CUDA90 tasks are downloading.

Tom
A proud member of the OFA (Old Farts Association).
ID: 1982675 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1982676 - Posted: 28 Feb 2019, 23:03:55 UTC - in response to Message 1982674.  

My caches are both back to normal again.

Cheers.


. . Are you still using Vader? If so what is the address? If you don't mind?

Stephen

? ?

If you want to modify your hosts list, then these are the current IP addresses.

208.68.240.118 setiboincdata.ssl.berkeley.edu # upload server Oct 2016
208.68.240.119 boinc2.ssl.berkeley.edu # Georgem download server Oct 2016
208.68.240.126 setiboinc.ssl.berkeley.edu # scheduler Oct 2016
208.68.240.127 vader.ssl.berkeley.edu # Vader download server Oct 2016
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1982676 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 36343
Credit: 261,360,520
RAC: 489
Australia
Message 1982677 - Posted: 28 Feb 2019, 23:03:58 UTC - in response to Message 1982674.  

My caches are both back to normal again.

Cheers.
. . Are you still using Vader? If so what is the address? If you don't mind?

Stephen

? ?
Take a look at the sticky in our forum Stephen. ;-)

Cheers.
ID: 1982677 · Report as offensive
Profile Tom M
Volunteer tester

Send message
Joined: 28 Nov 02
Posts: 5126
Credit: 276,046,078
RAC: 462
Message 1982678 - Posted: 28 Feb 2019, 23:05:23 UTC - in response to Message 1982675.  

Scroll to the bottom task in the list on the download page in the Manager. Select it with the mouse and click the Retry button. I can usually get a dozen or so tasks that way cleared from the list before the download server begins to ignore me and give me a increased backoff. Then I move to another host and try there until it too craps out. Then move to another host etc.


Ahh, the "bottom" has it. TY!


Honest, I have no idea. Bumped one and suddenly tons of CUDA90 tasks are downloading.

Tom


And when I bumped it again, the whole backup starts downloading. Hmmmmmm....... (Vodoo?)

Tom
A proud member of the OFA (Old Farts Association).
ID: 1982678 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1982679 - Posted: 28 Feb 2019, 23:14:29 UTC - in response to Message 1982678.  

Scroll to the bottom task in the list on the download page in the Manager. Select it with the mouse and click the Retry button. I can usually get a dozen or so tasks that way cleared from the list before the download server begins to ignore me and give me a increased backoff. Then I move to another host and try there until it too craps out. Then move to another host etc.


Ahh, the "bottom" has it. TY!


Honest, I have no idea. Bumped one and suddenly tons of CUDA90 tasks are downloading.

Tom


And when I bumped it again, the whole backup starts downloading. Hmmmmmm....... (Vodoo?)

Tom

You are having better luck than I am. I think the more attempts on the download counter, the harder it is to get the task. I find if I always try to get the smallest backoff task in the list, the easier it is to get it to download. The tasks that are 10 or more attempts are permanently stuck it seems.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1982679 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14674
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1982681 - Posted: 28 Feb 2019, 23:16:07 UTC - in response to Message 1982676.  

If you want to modify your hosts list, then these are the current IP addresses.

208.68.240.118 setiboincdata.ssl.berkeley.edu # upload server Oct 2016
208.68.240.119 boinc2.ssl.berkeley.edu # Georgem download server Oct 2016
208.68.240.126 setiboinc.ssl.berkeley.edu # scheduler Oct 2016
208.68.240.127 vader.ssl.berkeley.edu # Vader download server Oct 2016
Well, they were current at the dates stated. My local reference set has the dates updated to August 2017, the last time we had to dust them off.

But you should not modify the final line like that. The purpose of the hosts file is to replace the DNS service when that fails (which is not the case in this outage).

So, when a program calls for a URL in the second column, the hosts file returns the IP address in the first column. BOINC will never try to access Vader by name: our downloads all come from boinc2.ssl.berkeley.edu. Only the IP part changes.
ID: 1982681 · Report as offensive
Profile Tom M
Volunteer tester

Send message
Joined: 28 Nov 02
Posts: 5126
Credit: 276,046,078
RAC: 462
Message 1982684 - Posted: 28 Feb 2019, 23:23:31 UTC - in response to Message 1982679.  


You are having better luck than I am. I think the more attempts on the download counter, the harder it is to get the task. I find if I always try to get the smallest backoff task in the list, the easier it is to get it to download. The tasks that are 10 or more attempts are permanently stuck it seems.


They are all gpu tasks though, so I am chewing through them at the usual very high speeds.

Tom
A proud member of the OFA (Old Farts Association).
ID: 1982684 · Report as offensive
Profile Jimbocous Project Donor
Volunteer tester
Avatar

Send message
Joined: 1 Apr 13
Posts: 1856
Credit: 268,616,081
RAC: 1,349
United States
Message 1982688 - Posted: 28 Feb 2019, 23:47:41 UTC
Last modified: 28 Feb 2019, 23:48:37 UTC

Would be interesting to know how the dual download servers actually work.

If the scheduler directs you to one or the other server based on where files were actually placed at the time of the download request, there would be no workaround and files previously placed on the down server would be inaccessible where those placed on the other server would be fine. Assuming this is how it works, and it's the scheduler that's doing the load balancing.

If (and I think this is very doubtful) both download servers are using a common storage place or are otherwise truly redundant, then twiddling IP addresses might have some function if one died.

I didn't have any success getting a download by changing the IP address of boinc2 to .127.
ID: 1982688 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1982689 - Posted: 1 Mar 2019, 0:03:19 UTC - in response to Message 1982681.  

If you want to modify your hosts list, then these are the current IP addresses.

208.68.240.118 setiboincdata.ssl.berkeley.edu # upload server Oct 2016
208.68.240.119 boinc2.ssl.berkeley.edu # Georgem download server Oct 2016
208.68.240.126 setiboinc.ssl.berkeley.edu # scheduler Oct 2016
208.68.240.127 vader.ssl.berkeley.edu # Vader download server Oct 2016
Well, they were current at the dates stated. My local reference set has the dates updated to August 2017, the last time we had to dust them off.

But you should not modify the final line like that. The purpose of the hosts file is to replace the DNS service when that fails (which is not the case in this outage).

So, when a program calls for a URL in the second column, the hosts file returns the IP address in the first column. BOINC will never try to access Vader by name: our downloads all come from boinc2.ssl.berkeley.edu. Only the IP part changes.

True. But 208.68.240.127 resolves to vader.ssl.berkeley.edu in nslookup. I just posted the IP addresses and who they belonged to. Up to the user to know enough about the hosts file to use the information.

I DID NOT say in anyway to put that text into your hosts file. Somebody asked for vader's address, I simply copied my reference text file from documents folder.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1982689 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1982691 - Posted: 1 Mar 2019, 0:07:18 UTC - in response to Message 1982688.  

I didn't have any success getting a download by changing the IP address of boinc2 to .127.

. . Nor I. The restarting stalled downloads one at a time has worked on 2 rigs which now have filled caches but not on the fastest rig which has a totally empty cache :( ... Murphy's Law.

Stephen

:(
ID: 1982691 · Report as offensive
Ian&Steve C.
Avatar

Send message
Joined: 28 Sep 99
Posts: 4267
Credit: 1,282,604,591
RAC: 6,640
United States
Message 1982692 - Posted: 1 Mar 2019, 0:10:57 UTC

yeah i'm getting nothing. still uploading, but no downloads.
Seti@Home classic workunits: 29,492 CPU time: 134,419 hours

ID: 1982692 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 36343
Credit: 261,360,520
RAC: 489
Australia
Message 1982694 - Posted: 1 Mar 2019, 0:16:59 UTC

Well I can't complain about all the AP's that I'm picking up ATM. :-D

Cheers.
ID: 1982694 · Report as offensive
Profile Chris904395093209d Project Donor
Volunteer tester

Send message
Joined: 1 Jan 01
Posts: 112
Credit: 29,923,129
RAC: 6
United States
Message 1982695 - Posted: 1 Mar 2019, 0:18:38 UTC

I was able to download tasks one at a time on 2 Linux machines, 2 other machines still have stuck downloads even after trying to retry downloading 1 at a time.
~Chris

ID: 1982695 · Report as offensive
Profile Cliff Harding
Volunteer tester
Avatar

Send message
Joined: 18 Aug 99
Posts: 1432
Credit: 110,967,840
RAC: 67
United States
Message 1982698 - Posted: 1 Mar 2019, 0:34:04 UTC

It's been quite some time that I've used the host.txt file, where do I stick it?


I don't buy computers, I build them!!
ID: 1982698 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 36343
Credit: 261,360,520
RAC: 489
Australia
Message 1982699 - Posted: 1 Mar 2019, 0:37:53 UTC - in response to Message 1982698.  

It's been quite some time that I've used the host.txt file, where do I stick it?
How to modify your hosts file.

Cheers.
ID: 1982699 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1982703 - Posted: 1 Mar 2019, 0:52:17 UTC

But you shouldn't be modifying your hosts file in the first place as Richard says. We are not having a DNS issue as can be seen with http_debug. We simply have a single, weaker download server running that resolves via a perfectly working DNS just fine. Adding the vader server to the hosts file accomplishes nothing.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1982703 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1982705 - Posted: 1 Mar 2019, 1:06:22 UTC - in response to Message 1982703.  

But you shouldn't be modifying your hosts file in the first place as Richard says. We are not having a DNS issue as can be seen with http_debug. We simply have a single, weaker download server running that resolves via a perfectly working DNS just fine. Adding the vader server to the hosts file accomplishes nothing.


. . So it seems :(

Stephen

:(
ID: 1982705 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 36343
Credit: 261,360,520
RAC: 489
Australia
Message 1982706 - Posted: 1 Mar 2019, 1:10:07 UTC - in response to Message 1982703.  

But you shouldn't be modifying your hosts file in the first place as Richard says. We are not having a DNS issue as can be seen with http_debug. We simply have a single, weaker download server running that resolves via a perfectly working DNS just fine. Adding the vader server to the hosts file accomplishes nothing.
But if you are running a host file and Vader was being ignored then it needs changing. ;-)

Cheers.
ID: 1982706 · Report as offensive
Previous · 1 . . . 4 · 5 · 6 · 7 · 8 · 9 · 10 . . . 31 · Next

Message boards : Number crunching : Panic Mode On (115) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.