CLOSED - Can someone tell me what's going on...?

Message boards : Number crunching : CLOSED - Can someone tell me what's going on...?
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
Profile PT

Send message
Joined: 19 May 99
Posts: 231
Credit: 902,910
RAC: 0
United Kingdom
Message 87937 - Posted: 19 Mar 2005, 14:16:45 UTC
Last modified: 19 Mar 2005, 14:47:20 UTC

Can someone tell me what's going on...?
When I look at the server status pages all servers seems to be up and running, no backlogs, and there are obviously available WU’s . However, I am not able to upload my WU’s and I am not able to download new ones. Am I wasting my time here, or what?

I do not belong to the whining guys normally but now I have to complain a bit, just a little bit! What is going on here…?

Crunching 100% at cp.net now!
ID: 87937 · Report as offensive
Jan Inge

Send message
Joined: 24 Sep 02
Posts: 21
Credit: 1,655,076
RAC: 0
Norway
Message 87938 - Posted: 19 Mar 2005, 14:19:21 UTC - in response to Message 87937.  

> Can someone tell me what's going on...?
> When I look at the server status pages all servers seems to be up and running,
> no backlogs, and there are obviously available WU’s . However, I am not able
> to upload my WU’s and I am not able to download new ones. Am I wasting my time
> here, or what?
>
> I do not belong to the whining guys normally but now I have to complain a bit,
> just a little bit! What is going on here…?
>
> Crunching 100% at cp.net now!
>
You can find out more about the problem at
http://setiweb.ssl.berkeley.edu/forum_thread.php?id=12589
In short, the problem is that a 100mbit connection is down.
ID: 87938 · Report as offensive
Matt

Send message
Joined: 23 May 99
Posts: 19
Credit: 11,063,595
RAC: 77
United Kingdom
Message 87939 - Posted: 19 Mar 2005, 14:20:51 UTC

Hi,
The uploads failing thread explains this pretty well, but basically

SETI classic has some problems a couple of years ago using too much of berkeleys bandwith so bought a new network link from cogent.

The web servers still run on UCB's link, so everything is reporting fine, but the data servers run on the cogent link, and this is currently down for unknown reasons. No data will be transmitted until the cogent link is restored, hopefully in the near fututre.


Matt
ID: 87939 · Report as offensive
Profile PT

Send message
Joined: 19 May 99
Posts: 231
Credit: 902,910
RAC: 0
United Kingdom
Message 87945 - Posted: 19 Mar 2005, 14:25:44 UTC - in response to Message 87939.  

Thank you Matt and Jan Inge,
Appreciate your fast response guys! It is however a bit too much failures in this project at the moment! I am starting to get very frustrated….

Once again, thanks:-D

Happy crunching
ID: 87945 · Report as offensive
Matt

Send message
Joined: 23 May 99
Posts: 19
Credit: 11,063,595
RAC: 77
United Kingdom
Message 87946 - Posted: 19 Mar 2005, 14:27:27 UTC

It's not really their fault, especially this time. This weekend, it seems every boinc project is having problems!!


Matt
ID: 87946 · Report as offensive
Profile Siran d'Vel'nahr
Volunteer tester
Avatar

Send message
Joined: 23 May 99
Posts: 7379
Credit: 44,181,323
RAC: 238
United States
Message 87948 - Posted: 19 Mar 2005, 14:29:45 UTC
Last modified: 19 Mar 2005, 14:39:23 UTC

This link will explain what's going on. inr-668 Cogent (SSL) is down.

L8R....

T'Khasi Time: Saturday, 19 March 2005 - 06:29 AM --800 (Pacific Standard Time)

CAPT Siran d'Vel'nahr - L L & P _\\//
Winders 11 OS? "What a piece of junk!" - L. Skywalker
"Logic is the cement of our civilization with which we ascend from chaos using reason as our guide." - T'Plana-hath
ID: 87948 · Report as offensive
Profile PT

Send message
Joined: 19 May 99
Posts: 231
Credit: 902,910
RAC: 0
United Kingdom
Message 87958 - Posted: 19 Mar 2005, 14:48:52 UTC - in response to Message 87948.  


Thank you for the information.


This link can now be considered closed!

Happy crunching
ID: 87958 · Report as offensive
Profile Skip Da Shu
Volunteer tester
Avatar

Send message
Joined: 28 Jun 04
Posts: 233
Credit: 431,047
RAC: 0
Message 87977 - Posted: 19 Mar 2005, 15:31:58 UTC - in response to Message 87958.  
Last modified: 19 Mar 2005, 15:46:45 UTC

>
> Thank you for the information.

Yo, Dog/Gou,

http://setiweb.ssl.berkeley.edu/user_profile/images/36628_avatar.jpg

- da shu @ HeliOS,
"A child's exposure to technology should never be predicated on an ability to afford it."
ID: 87977 · Report as offensive
Profile Raymon Purgason

Send message
Joined: 24 Oct 03
Posts: 27
Credit: 14,251,492
RAC: 0
United States
Message 87988 - Posted: 19 Mar 2005, 15:48:00 UTC

Has anyone noticed that when you select "goto web" from inside Bonic that it directs you to the Classic homepage. Perhaps this is why we cant upload.
ID: 87988 · Report as offensive
Profile Skip Da Shu
Volunteer tester
Avatar

Send message
Joined: 28 Jun 04
Posts: 233
Credit: 431,047
RAC: 0
Message 87993 - Posted: 19 Mar 2005, 15:55:34 UTC - in response to Message 87988.  

> Has anyone noticed that when you select "goto web" from inside Bonic that
> it directs you to the Classic homepage. Perhaps this is why we cant upload.

No, this has always been the case. The URL for boinc goes to the classic site.
ID: 87993 · Report as offensive
Profile Siran d'Vel'nahr
Volunteer tester
Avatar

Send message
Joined: 23 May 99
Posts: 7379
Credit: 44,181,323
RAC: 238
United States
Message 88005 - Posted: 19 Mar 2005, 16:12:17 UTC
Last modified: 19 Mar 2005, 16:15:26 UTC

The fact that the Classic site is linked in the BOINC manager is not the problem. Read through this thread to find out what the real problem is. Here's the link to the Communications and Network Services page that is referenced. Hope this helps. >:-/

Oh, BTW, the original author has closed this thread. Just an observation.

L8R....

T'Khasi Time: Saturday, 19 March 2005 - 08:12 AM --800 (Pacific Standard Time)

CAPT Siran d'Vel'nahr - L L & P _\\//
Winders 11 OS? "What a piece of junk!" - L. Skywalker
"Logic is the cement of our civilization with which we ascend from chaos using reason as our guide." - T'Plana-hath
ID: 88005 · Report as offensive
Profile Prognatus

Send message
Joined: 6 Jul 99
Posts: 1600
Credit: 391,546
RAC: 0
Norway
Message 88183 - Posted: 20 Mar 2005, 1:42:38 UTC
Last modified: 20 Mar 2005, 1:43:51 UTC

Yes, and this shows when they cut the power; at 16.00 hours.

ID: 88183 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 88216 - Posted: 20 Mar 2005, 2:58:02 UTC - in response to Message 88183.  

> Yes, and <a> href="http://fragment1.berkeley.edu/~cricket/inr-668-interfaces.html">
> this[/url] shows when they cut the power; at 16.00 hours.

If this was a power problem, the servers would be down.

The servers are up, it's the circuit that's down.
ID: 88216 · Report as offensive
Profile Prognatus

Send message
Joined: 6 Jul 99
Posts: 1600
Credit: 391,546
RAC: 0
Norway
Message 88250 - Posted: 20 Mar 2005, 4:23:46 UTC
Last modified: 20 Mar 2005, 4:26:07 UTC

They're updating their software "Cricket" on the routers, so they've pulled the plug / cut the power to the ROUTERS - not the servers. Hence, this is not a power problem, Ludd. The servers cannot ulpoad/download until the routers get the coonection to Internet online again.

ID: 88250 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 88266 - Posted: 20 Mar 2005, 4:53:36 UTC - in response to Message 88250.  

Cricket doesn't run on the routers.

You can read about Cricket here: http://cricket.sourceforge.net.

It uses SNMP (Simple Network Management Protocol, RFC-1157).

The routers keep count of bytes and packets going in and out of the various interfaces.

Another machine runs Cricket, and it queries the various routers (via SNMP) periodically (typically every five minutes).

Compare the current counts to the previous count, and you know what happened during the last five minutes.

The routers just answer an SNMP query every five minutes.

> They're updating their software "Cricket" on the routers, so they've pulled
> the plug / cut the power to the ROUTERS - not the servers. Hence, this is not
> a power problem, Ludd. The servers cannot ulpoad/download until the routers
> get the coonection to Internet online again.

ID: 88266 · Report as offensive
Profile Prognatus

Send message
Joined: 6 Jul 99
Posts: 1600
Credit: 391,546
RAC: 0
Norway
Message 88268 - Posted: 20 Mar 2005, 4:57:19 UTC
Last modified: 20 Mar 2005, 5:03:14 UTC

As Siran said, this list of ucb routers page states that they're updating Cricket. But I'll take your word for it that Cricket doesn't have anything to do with running the routers. However, it does seem that the routers are down, and that THAT's the reason we're experiencing upload/download problems.


ID: 88268 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 88289 - Posted: 20 Mar 2005, 6:03:03 UTC - in response to Message 88268.  
Last modified: 20 Mar 2005, 6:15:11 UTC

> As Siran said, this list of
> ucb routers
page states that they're updating Cricket. But I'll take
> your word for it that Cricket doesn't have anything to do with running the
> routers. However, it does seem that the routers are down, and that THAT's the
> reason we're experiencing upload/download problems.

You don't have to take my word for it, you can read the Cricket documentation on Sourceforge. :-)

However, the graphs appear to be current -- it's 22:00 now in California, so it looks like they're working, and they'd probably be stale if Cricket was down.

It could be a router failure, but the only way it'd take this long is if they were waiting for parts. I'd expect CNS at Berkeley to be a 24/7 operation, if only to prevent riots in the Residence halls when the 'net goes down.

That's why I think the home page is accurate: that it's a broken wire between Cogent's SFO pop and the campus.
ID: 88289 · Report as offensive
Profile Siran d'Vel'nahr
Volunteer tester
Avatar

Send message
Joined: 23 May 99
Posts: 7379
Credit: 44,181,323
RAC: 238
United States
Message 88297 - Posted: 20 Mar 2005, 6:20:12 UTC

Dammit, I waited too long to edit my previous post. I should have added this link for further explaination. It shows that the inr-668 Cogent (SSL) went down about 16:00 today. Sorry....

As an aside, does the "renovation" of Cricket have anything to do with the lack of Internet access for the SETI data servers? Don't know. Ned may just be correct in that they may have a broken wire. He "sounds" like he would know better than I. I just wanted to point out that with a little research, the problem could be found not to be of BOINC or the SETI servers. Biggest clue: status page still works and shows that WUs are piling up to be sent out.

L8R....

T'Khasi Time: Saturday, 19 March 2005 - 10:15 PM --800 (Pacific Standard Time)

CAPT Siran d'Vel'nahr - L L & P _\\//
Winders 11 OS? "What a piece of junk!" - L. Skywalker
"Logic is the cement of our civilization with which we ascend from chaos using reason as our guide." - T'Plana-hath
ID: 88297 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 88312 - Posted: 20 Mar 2005, 6:46:28 UTC - in response to Message 88297.  


> As an aside, does the "renovation" of Cricket have anything to do with the
> lack of Internet access for the SETI data servers? Don't know. Ned may just
> be correct in that they may have a broken wire. He "sounds" like he would
> know better than I. I just wanted to point out that with a little research,
> the problem could be found not to be of BOINC or the SETI servers. Biggest
> clue: status page still works and shows that WUs are piling up to be sent
> out.

I'm doing a lot of reading between the lines.

I don't use Cricket, I use MRTG, but the techniques are the same. MRTG is slow when you have LOTS of graphs, Cricket (and RRDtool) try to get around that.

The routers run great without a tool like this, but having something like MRTG sure makes it easy to see problems.

I don't know how Berkeley labels routers and interfaces, so I can't look at other interfaces on the same router and say for sure that the router is up.

When I traceroute to setiboincdata.ssl.berkeley.edu, the last router I can reach is at Cogent.

So, it might be the router, or the CSU/DSU, or the wire.

I know that it'd take me less than an hour to swap out a bad router, or a bad interface, or a bad CSU/DSU. My operation is smaller than theirs so I'd expect them to have more spares than I do.

I know that it'd take hours for Verizon to dispatch if the wire breaks somewhere between here and their Central Office.

... and the time to fix a major fiber break depends on where the splicing trailer is located at any given time -- and how long it takes to get the splicers and the trailer to the break, if they can find it quickly.
ID: 88312 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 88315 - Posted: 20 Mar 2005, 6:50:26 UTC - in response to Message 88297.  

> Dammit, I waited too long to edit my previous post. I should have added <a> href="http://fragment1.berkeley.edu/~cricket/inr-668-interfaces.html">this
> link[/url] for further explaination. It shows that the inr-668 Cogent (SSL)
> went down about 16:00 today. Sorry....

One flaw in MRTG and RRDtool is that the "daily" graph is really about 40 hours long -- so it was 1600 yesterday.

... and I just looked at the graph from Siran's link and it looks like the link is back up -- 10 megabits/sec. inbound and 20 megabits/sec. outbound.
ID: 88315 · Report as offensive
1 · 2 · Next

Message boards : Number crunching : CLOSED - Can someone tell me what's going on...?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.