Posts by Oz


1) Message boards : Number crunching : Panic Mode On (103) Server Problems? (Message 1811956)
Posted 24 Aug 2016 by Profile Oz
Thank you.
2) Message boards : Number crunching : Panic Mode On (103) Server Problems? (Message 1811804)
Posted 23 Aug 2016 by Profile Oz
I am back for the Wow! event. Unfortunately, I have no budget for the electricity and new hardware (sigh). Zalster and Mr. Kevvy among others will pass me in a month or two.
I am going to try to find the SoG and privy apps (no luck so far...) and test them so I can maybe do next year's Wow! as well. Good luck all.
Oz
3) Message boards : Number crunching : Panic Mode On (100) Server Problems? (Message 1731461)
Posted 3 Oct 2015 by Profile Oz
03-10-15 4:11:45 AM SETI@home Scheduler request failed: Couldn't connect to server
03-10-15 4:23:04 AM SETI@home Scheduler request failed: Couldn't connect to server
03-10-15 6:05:23 AM SETI@home Scheduler request failed: Couldn't connect to server
03-10-15 6:06:53 AM SETI@home Scheduler request failed: Couldn't connect to server
03-10-15 6:08:16 AM SETI@home Scheduler request failed: Couldn't connect to server
03-10-15 6:09:40 AM SETI@home Scheduler request failed: Couldn't connect to server
03-10-15 6:11:04 AM SETI@home Scheduler request failed: Couldn't connect to server
03-10-15 6:12:26 AM SETI@home Scheduler request failed: Couldn't connect to server
03-10-15 6:14:39 AM SETI@home Scheduler request failed: Couldn't connect to server
03-10-15 6:19:33 AM SETI@home Scheduler request failed: Couldn't connect to server
03-10-15 6:21:52 AM SETI@home Scheduler request failed: Couldn't connect to server
03-10-15 6:42:15 AM SETI@home Scheduler request failed: Couldn't connect to server

Heading into Day 5...
4) Message boards : Number crunching : Panic Mode On (100) Server Problems? (Message 1731456)
Posted 3 Oct 2015 by Profile Oz
03-10-15 6:42:15 AM Project communication failed: attempting access to reference site
03-10-15 6:42:15 AM SETI@home Scheduler request failed: Couldn't connect to server
03-10-15 6:42:17 AM Internet access OK - project servers may be temporarily down.
5) Message boards : Number crunching : Panic Mode On (100) Server Problems? (Message 1731434)
Posted 3 Oct 2015 by Profile Oz
OK, I am sorry to have suggested what I thought might be a helpful solution. Since there is no mention of any issues involving ssl.berkeley.edu on the Berkeley IST status (trouble) page, they are either unaware of the problem or have assigned it a priority below low.
6) Message boards : Number crunching : Panic Mode On (100) Server Problems? (Message 1731348)
Posted 3 Oct 2015 by Profile Oz
Along with posting here, might I suggest that people send a "problem with service" email to:

itcsshelp@berkeley.edu

A few dozen (or hundred or thousand) emails might help Berkeley IST grasp the scope of the problem.

(Be polite!)
7) Message boards : Number crunching : Panic Mode On (100) Server Problems? (Message 1730970)
Posted 2 Oct 2015 by Profile Oz
It seems like the folks who can't connect at all are mostly those who have built up large numbers of tasks waiting to report. I suspect that means that the scheduler requests are large and perhaps end up getting fragmented on the way to Berkeley, increasing the likelihood of failure. Most of my requests have been to report fewer than 5 tasks at a time and, although about half of those fail, the next attempt usually succeeds.

Looking at some old threads regarding connection problems, I noticed that there's an option available in cc_config.xml for <max_tasks_reported>xx</max_tasks_reported> which essentially cuts the scheduler requests into smaller chunks. Perhaps that's something that would help here. Or perhaps not (but I think it might be worth a try). ;^)


It may help some folks, but I am sitting on a laptop with ONE task to report, and it has not managed to connect since 30/9/15 at 14:05 UTC...
I don't think Berkeley IT is aware of the problem as there is no mention of it on their Service Status page (http://systemstatus.berkeley.edu/) which begins with:

The page will be updated whenever there is a change in system status that will affect users for more than 30 minutes. If you need assistance with a system or network problem, call Campus Shared Services IT at 510-664-9000 Option 1, 1, 1 - All Other Technology Requests.
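
For anyone who wants to try the <max_tasks_reported> suggestion quoted above, here is a minimal Python sketch that generates a cc_config.xml fragment. It assumes the option belongs inside the <options> section of cc_config.xml, and the value 5 is only an illustration (chosen because small reports of fewer than 5 tasks were said to get through more often); check the BOINC client configuration documentation before relying on either. The function name build_cc_config is made up for this example.

```python
import xml.etree.ElementTree as ET


def build_cc_config(max_tasks_reported):
    """Return cc_config.xml text that caps tasks reported per scheduler request.

    Assumption: <max_tasks_reported> sits under <options>; verify this
    against the BOINC client configuration docs.
    """
    if max_tasks_reported < 0:
        raise ValueError("max_tasks_reported must be zero or positive")
    root = ET.Element("cc_config")
    options = ET.SubElement(root, "options")
    limit = ET.SubElement(options, "max_tasks_reported")
    limit.text = str(max_tasks_reported)
    # Serialise to a plain string; the caller decides where to save it.
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # 5 is an illustrative value, not a recommendation from the thread.
    print(build_cc_config(5))
```

The output would still need to be merged by hand into any existing cc_config.xml in the BOINC data directory, and the client told to re-read its config files.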
8) Message boards : Number crunching : Panic Mode On (100) Server Problems? (Message 1730966)
Posted 2 Oct 2015 by Profile Oz
If you think it will help.

itcsshelp@berkeley.edu
510-664-9000, ext. 1
9) Message boards : Number crunching : Panic Mode On (100) Server Problems? (Message 1730914)
Posted 1 Oct 2015 by Profile Oz
yeah same traceroute as everyone else

Sorry Zombu2, I should have made it simpler:

Here is a tracert from one of my boxes that has NOT been connecting...

Trace complete.

You're tracing to the website in both cases, which has no bearing on whether you can "connect" a cruncher to a server. You would need to trace to one of the other data servers (upload, download, scheduling - wherever the problem is) to get any insight into that.


Yes, I noticed that in the preceding discussions and was about to try it, but just wanted to clarify that what I had previously posted was not "the same"
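
As the quoted reply points out, a trace to the website says little about whether a cruncher can reach the scheduler, upload or download servers. A rough way to test that directly is to attempt a TCP connection to each server. The Python sketch below does this; the host names and ports in the commented usage are placeholders (the real server names are not given in this thread), and can_connect and check_servers are names invented for this example.

```python
import socket


def can_connect(host, port, timeout=5.0):
    """Return True if a TCP connection to host:port opens within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and DNS lookup failures.
        return False


def check_servers(servers, timeout=5.0):
    """Map 'host:port' to True/False for each (host, port) pair given."""
    return {
        "%s:%d" % (host, port): can_connect(host, port, timeout)
        for host, port in servers
    }


# Example usage (placeholder names, substitute the real servers):
# print(check_servers([("scheduler.example.org", 80),
#                      ("upload.example.org", 80)]))
```

A False result only shows that the connection could not be opened from this box; it does not say where along the path the failure occurs.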
10) Message boards : Number crunching : Panic Mode On (100) Server Problems? (Message 1730902)
Posted 1 Oct 2015 by Profile Oz
yeah same traceroute as everyone else


Sorry Zombu2, I should have made it simpler:

Here is a tracert from one of my boxes that has NOT been connecting...

>ipconfig /flushdns

Windows IP Configuration

Successfully flushed the DNS Resolver Cache.


>tracert setiathome.berkeley.edu

Tracing route to setiathome.berkeley.edu [208.68.240.110]
over a maximum of 30 hops:

1 1 ms 1 ms 1 ms xxx
2 * * * Request timed out.
3 22 ms 31 ms 31 ms xxx
4 12 ms 12 ms 10 ms xxx
5 14 ms 13 ms 15 ms xxx
6 28 ms 29 ms 29 ms bu-ether15.chcgildt87w-bcr00.tbone.rr.com [66.109.6.72]
7 33 ms 26 ms 28 ms 0.ae1.pr1.chi10.tbone.rr.com [107.14.17.194]
8 36 ms 27 ms 28 ms xe-4-2-0.0.chic0.tr-cps.internet2.edu [64.57.20.53]
9 26 ms 26 ms 27 ms ae-5.80.rtr.chic.net.internet2.edu [64.57.20.150]
10 39 ms 37 ms 36 ms ae-0.80.rtr.kans.net.internet2.edu [64.57.20.148]
11 76 ms 93 ms 78 ms ae-0.80.rtr.salt.net.internet2.edu [64.57.20.146]
12 72 ms 78 ms 73 ms ae-2.80.rtr.losa.net.internet2.edu [64.57.20.144]
13 79 ms 78 ms 77 ms xe-0-0-0.80.rtr.paix.net.internet2.edu [64.57.20.125]
14 78 ms 78 ms 78 ms 64.57.21.7
15 79 ms 81 ms 81 ms dc-oak-agg4--svl-agg4-100ge.cenic.net [137.164.46.144]
16 80 ms 82 ms 82 ms ucb--oak-agg4-10g.cenic.net [137.164.50.31]
17 81 ms 83 ms 81 ms t2-3.inr-202-reccev.Berkeley.EDU [128.32.0.39]
18 82 ms 82 ms 87 ms e3-47.inr-310-ewdc.Berkeley.EDU [128.32.0.99]
19 * * * Request timed out.
20 * * * Request timed out.
21 * * * Request timed out.
22 * * * Request timed out.
23 * * * Request timed out.
24 * * * Request timed out.
25 * * * Request timed out.
26 * * * Request timed out.
27 * * * Request timed out.
28 * * * Request timed out.
29 * * * Request timed out.
30 * * * Request timed out.

Trace complete.


And here is a tracert from one of my boxes that has been connecting...

Microsoft Windows [Version 6.1.7601]
Copyright (c) 2009 Microsoft Corporation. All rights reserved.

>tracert -h 20 -w 15001 setiathome.berkeley.edu

Tracing route to setiathome.berkeley.edu [208.68.240.110]
over a maximum of 20 hops:

1 <1 ms <1 ms <1 ms xxx
2 3 ms 4 ms 4 ms xxx
3 7 ms 7 ms 7 ms xxx
4 * * * Request timed out.
5 * * * Request timed out.
6 16 ms 16 ms 15 ms 0.ae2.BR3.NYC4.ALTER.NET [140.222.229.99]
7 * * * Request timed out.
8 75 ms 74 ms 74 ms ae-3-80.ear1.SanJose1.Level3.net [4.69.152.150]
9 * * * Request timed out.
10 73 ms 73 ms 73 ms CENIC.ear1.SanJose1.Level3.net [4.15.122.46]
11 74 ms 75 ms 74 ms dc-oak-agg4--svl-agg4-100ge.cenic.net [137.164.46.144]
12 75 ms 81 ms 76 ms ucb--oak-agg4-10g.cenic.net [137.164.50.31]
13 75 ms 74 ms 74 ms t2-3.inr-201-sut.Berkeley.EDU [128.32.0.37]
14 75 ms 75 ms 74 ms et3-48.inr-311-ewdc.Berkeley.EDU [128.32.0.101]
15 * * * Request timed out.
16 * * * Request timed out.
17 * * * Request timed out.
18 * * * Request timed out.
19 * * * Request timed out.
20 * * * Request timed out.

Trace complete.

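Note that the two routes are completely different until they reach CENIC: only the two CENIC hops (137.164.46.144 and 137.164.50.31) are shared, and the traces enter Berkeley through different routers.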
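
Comparing two traces by eye is error-prone, so here is a small Python sketch that pulls the hop addresses out of Windows tracert output and lists the ones two traces share. It assumes each hop sits on a single line (console-wrapped addresses have to be joined first) and only looks at IPv4 addresses; hop_addresses and shared_hops are names invented for this example. The sample lines are excerpts from the two traces in this post.

```python
import re

# A hop line starts with the hop number and contains at least one "ms" timing.
HOP_LINE = re.compile(r"^\s*\d+\s+.*\bms\b")
IPV4 = re.compile(r"(?:\d{1,3}\.){3}\d{1,3}")


def hop_addresses(trace_text):
    """Return the IPv4 address of every responding hop, in order."""
    hops = []
    for line in trace_text.splitlines():
        if not HOP_LINE.match(line):
            continue  # header lines, blank lines, "Request timed out."
        found = IPV4.findall(line)
        if found:  # masked hops such as "xxx" have no address to extract
            hops.append(found[-1])
    return hops


def shared_hops(trace_a, trace_b):
    """Return addresses from trace_a that also appear in trace_b."""
    in_b = set(hop_addresses(trace_b))
    return [ip for ip in hop_addresses(trace_a) if ip in in_b]


NOT_CONNECTING = """
14 78 ms 78 ms 78 ms 64.57.21.7
15 79 ms 81 ms 81 ms dc-oak-agg4--svl-agg4-100ge.cenic.net [137.164.46.144]
16 80 ms 82 ms 82 ms ucb--oak-agg4-10g.cenic.net [137.164.50.31]
17 81 ms 83 ms 81 ms t2-3.inr-202-reccev.Berkeley.EDU [128.32.0.39]
19 * * * Request timed out.
"""

CONNECTING = """
10 73 ms 73 ms 73 ms CENIC.ear1.SanJose1.Level3.net [4.15.122.46]
11 74 ms 75 ms 74 ms dc-oak-agg4--svl-agg4-100ge.cenic.net [137.164.46.144]
12 75 ms 81 ms 76 ms ucb--oak-agg4-10g.cenic.net [137.164.50.31]
13 75 ms 74 ms 74 ms t2-3.inr-201-sut.Berkeley.EDU [128.32.0.37]
"""

if __name__ == "__main__":
    print(shared_hops(NOT_CONNECTING, CONNECTING))
```

On these excerpts the shared addresses are the two CENIC hops, 137.164.46.144 and 137.164.50.31; the Berkeley routers that follow them differ between the two traces.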
11) Message boards : Number crunching : Panic Mode On (100) Server Problems? (Message 1730763)
Posted 1 Oct 2015 by Profile Oz
Here is a tracert from one of my boxes that has been connecting...

Microsoft Windows [Version 6.1.7601]
Copyright (c) 2009 Microsoft Corporation. All rights reserved.

>tracert -h 20 -w 15001 setiathome.berkeley.edu

Tracing route to setiathome.berkeley.edu [208.68.240.110]
over a maximum of 20 hops:

1 <1 ms <1 ms <1 ms xxx
2 3 ms 4 ms 4 ms xxx
3 7 ms 7 ms 7 ms xxx
4 * * * Request timed out.
5 * * * Request timed out.
6 16 ms 16 ms 15 ms 0.ae2.BR3.NYC4.ALTER.NET [140.222.229.99]
7 * * * Request timed out.
8 75 ms 74 ms 74 ms ae-3-80.ear1.SanJose1.Level3.net [4.69.152.150]
9 * * * Request timed out.
10 73 ms 73 ms 73 ms CENIC.ear1.SanJose1.Level3.net [4.15.122.46]
11 74 ms 75 ms 74 ms dc-oak-agg4--svl-agg4-100ge.cenic.net [137.164.46.144]
12 75 ms 81 ms 76 ms ucb--oak-agg4-10g.cenic.net [137.164.50.31]
13 75 ms 74 ms 74 ms t2-3.inr-201-sut.Berkeley.EDU [128.32.0.37]
14 75 ms 75 ms 74 ms et3-48.inr-311-ewdc.Berkeley.EDU [128.32.0.101]
15 * * * Request timed out.
16 * * * Request timed out.
17 * * * Request timed out.
18 * * * Request timed out.
19 * * * Request timed out.
20 * * * Request timed out.

Trace complete.

So... WTF!?!
12) Message boards : Number crunching : Panic Mode On (100) Server Problems? (Message 1730707)
Posted 1 Oct 2015 by Profile Oz
I find it interesting that the received WUs dropped about 23k when all this started. No spikes or dips, just lower.

It makes me think that there may be a limit on the number of connections being enforced.


I noticed the same thing. It raises a more interesting question: why are ~70% of SETI clients apparently still able to connect?
13) Message boards : Number crunching : Panic Mode On (100) Server Problems? (Message 1730627)
Posted 1 Oct 2015 by Profile Oz
It's the Pope's, Daesh's, or Obama's fault. No doubt about that, whatsoever.

Not blaming the Russians or the Chinese, then? :P


No, I think Grumpy is right - it's Pope Obama in cahoots with the North Koreans and Isis. Now that McDonald's is in Russia and China, they are all fat and stupid like we Americans.
14) Message boards : Number crunching : Panic Mode On (100) Server Problems? (Message 1730514)
Posted 1 Oct 2015 by Profile Oz
Nope the problem is at Berkeley...



>ipconfig /flushdns

Windows IP Configuration

Successfully flushed the DNS Resolver Cache.


>tracert setiathome.berkeley.edu

Tracing route to setiathome.berkeley.edu [208.68.240.110]
over a maximum of 30 hops:

1 1 ms 1 ms 1 ms xxx
2 * * * Request timed out.
3 22 ms 31 ms 31 ms xxx
4 12 ms 12 ms 10 ms xxx
5 14 ms 13 ms 15 ms xxx
6 28 ms 29 ms 29 ms bu-ether15.chcgildt87w-bcr00.tbone.rr.com [66.109.6.72]
7 33 ms 26 ms 28 ms 0.ae1.pr1.chi10.tbone.rr.com [107.14.17.194]
8 36 ms 27 ms 28 ms xe-4-2-0.0.chic0.tr-cps.internet2.edu [64.57.20.53]
9 26 ms 26 ms 27 ms ae-5.80.rtr.chic.net.internet2.edu [64.57.20.150]
10 39 ms 37 ms 36 ms ae-0.80.rtr.kans.net.internet2.edu [64.57.20.148]
11 76 ms 93 ms 78 ms ae-0.80.rtr.salt.net.internet2.edu [64.57.20.146]
12 72 ms 78 ms 73 ms ae-2.80.rtr.losa.net.internet2.edu [64.57.20.144]
13 79 ms 78 ms 77 ms xe-0-0-0.80.rtr.paix.net.internet2.edu [64.57.20.125]
14 78 ms 78 ms 78 ms 64.57.21.7
15 79 ms 81 ms 81 ms dc-oak-agg4--svl-agg4-100ge.cenic.net [137.164.46.144]
16 80 ms 82 ms 82 ms ucb--oak-agg4-10g.cenic.net [137.164.50.31]
17 81 ms 83 ms 81 ms t2-3.inr-202-reccev.Berkeley.EDU [128.32.0.39]
18 82 ms 82 ms 87 ms e3-47.inr-310-ewdc.Berkeley.EDU [128.32.0.99]
19 * * * Request timed out.
20 * * * Request timed out.
21 * * * Request timed out.
22 * * * Request timed out.
23 * * * Request timed out.
24 * * * Request timed out.
25 * * * Request timed out.
26 * * * Request timed out.
27 * * * Request timed out.
28 * * * Request timed out.
29 * * * Request timed out.
30 * * * Request timed out.

Trace complete.
15) Message boards : Number crunching : Panic Mode On (100) Server Problems? (Message 1730507)
Posted 1 Oct 2015 by Profile Oz
My boxes connecting through ver###n, both DSL and fibre, have no issues. The boxes connecting through Ti##-War### can upload but can't report, connect or download.
Also I can ping berkeley.edu but not setiathome.berkeley.edu through T-W.
It's possible it may not be on the Berkeley end...


If you do an nslookup of setiathome.berkeley.edu on a box connected to ver###n and do the same for a box connected to Ti##-War###, are the reported IP addresses the same or different?


I will check tomorrow; right now I am several miles from either location, connecting through V. However, I did note that pinging berkeley.edu on TW yields an IPv6 address, while pinging setiathome.berkeley.edu shows an IPv4 address and total packet loss.

Microsoft Windows [Version 6.1.7601]
Copyright (c) 2009 Microsoft Corporation. All rights reserved.

>ping berkeley.edu

Pinging berkeley.edu [2607:f140:0:81::f] with 32 bytes of data:
Reply from 2607:f140:0:81::f: time=92ms
Reply from 2607:f140:0:81::f: time=108ms
Reply from 2607:f140:0:81::f: time=94ms
Reply from 2607:f140:0:81::f: time=93ms

Ping statistics for 2607:f140:0:81::f:
Packets: Sent = 4, Received = 4, Lost = 0 (0% loss),
Approximate round trip times in milli-seconds:
Minimum = 92ms, Maximum = 108ms, Average = 96ms

>ping setiathome.berkeley.edu

Pinging setiathome.berkeley.edu [208.68.240.110] with 32 bytes of data:
Request timed out.
Request timed out.
Request timed out.
Request timed out.

Ping statistics for 208.68.240.110:
Packets: Sent = 4, Received = 0, Lost = 4 (100% loss),
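
The nslookup comparison asked for above, and the IPv6 versus IPv4 difference seen in the pings, can also be checked from a script. The Python sketch below resolves a host name with the system resolver and groups the results by address family, so the output from a box on each ISP can be compared side by side. The function names resolve and split_by_family are invented for this example, and the resolver may return different results than nslookup if a hosts file or local cache is involved.

```python
import ipaddress
import socket


def resolve(host):
    """Return the sorted unique addresses the system resolver gives for host."""
    infos = socket.getaddrinfo(host, None)
    # info[4][0] is the address string; drop any IPv6 zone suffix ("%eth0").
    return sorted({info[4][0].split("%")[0] for info in infos})


def split_by_family(addresses):
    """Group address strings into IPv4 and IPv6 lists."""
    families = {"IPv4": [], "IPv6": []}
    for addr in addresses:
        version = ipaddress.ip_address(addr).version
        families["IPv%d" % version].append(addr)
    return families


# Example usage (needs network access, so it is left commented out):
# for name in ("berkeley.edu", "setiathome.berkeley.edu"):
#     print(name, split_by_family(resolve(name)))
```

Run on the two addresses from the ping output above, split_by_family puts 2607:f140:0:81::f under IPv6 and 208.68.240.110 under IPv4.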
16) Message boards : Number crunching : Panic Mode On (100) Server Problems? (Message 1730460)
Posted 30 Sep 2015 by Profile Oz
My boxes connecting through ver###n, both DSL and fibre, have no issues. The boxes connecting through Ti##-War### can upload but can't report, connect or download.
Also I can ping berkeley.edu but not setiathome.berkeley.edu through T-W.
It's possible it may not be on the Berkeley end...
17) Message boards : Number crunching : Not getting WU's on Intel GPU? (Message 1729605)
Posted 27 Sep 2015 by Profile Oz
Intel HD Graphics does not show up on my 4790. It is enabled in the BIOS, and I downloaded and installed OpenCL 1.2. What did I miss?
18) Message boards : Number crunching : Panic Mode On (100) Server Problems? (Message 1719161)
Posted 27 Aug 2015 by Profile Oz
I got one! I got an AP! I haven't seen one in a long time.

Edit: I seem to have passed 2 Million credits recently. Yay!



Congratulations on your milestone! Keep on crunching.
19) Message boards : Number crunching : Panic Mode On (100) Server Problems? (Message 1719159)
Posted 27 Aug 2015 by Profile Oz
I agree that that seems to be true. What I have noticed is that high MB production and having the replica running don't seem to coexist, one can either have one or the other but not both. At least it would appear that way to me, of late.
20) Message boards : Number crunching : Panic Mode On (100) Server Problems? (Message 1716930)
Posted 22 Aug 2015 by Profile Oz
Looking at the Munin graphs for MBs reminds me of driving a four cylinder pickup in West Virginia, one starts up a hill, down-shifts to fourth, down-shifts to third - foot pressed to the firewall... and it just... won't... go... any... faster...



Copyright © 2016 University of California