Panic Mode On (20) Server problems

Profile HAL9000
Volunteer tester
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 917927 - Posted: 15 Jul 2009, 10:41:27 UTC - in response to Message 917862.  

Get them more coffee!! Can't hurt, might help.


hey..... when did they get the money for coffee? :D

SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the BP6/VP6 User Group: http://tinyurl.com/8y46zvu
ID: 917927 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 917931 - Posted: 15 Jul 2009, 10:52:54 UTC - in response to Message 917927.  

Get them more coffee!! Can't hurt, might help.

hey..... when did they get the money for coffee? :D

They didn't - that's the problem.

We supply the coffee, they supply the WUs - fair swap?
ID: 917931 · Report as offensive
Profile Vipin Palazhi
Joined: 29 Feb 08
Posts: 286
Credit: 167,386,578
RAC: 0
India
Message 917935 - Posted: 15 Jul 2009, 10:58:47 UTC - in response to Message 917931.  

They didn't - that's the problem.

We supply the coffee, they supply the WUs - fair swap?

It seems that coffee alone might not solve the problem. We might have to toss in a few doughnuts as well. Maybe a tuna sandwich or two.
______________

ID: 917935 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 917936 - Posted: 15 Jul 2009, 11:01:53 UTC - in response to Message 917935.  

They didn't - that's the problem.

We supply the coffee, they supply the WUs - fair swap?

It seems that coffee alone might not solve the problem. We might have to toss in a few doughnuts as well. Maybe a tuna sandwich or two.

There is a rumour that beer works best in Matt's case. Still have to research his taste in sandwiches.
ID: 917936 · Report as offensive
Profile Oz
Joined: 6 Jun 99
Posts: 233
Credit: 200,655,462
RAC: 212
United States
Message 917959 - Posted: 15 Jul 2009, 11:50:02 UTC

I have been looking at this:

http://setiathome.berkeley.edu/sah_status.html

and this:

http://bluenorthernsoftware.com/scarecrow/sahstats/graphs.php?t=48

and this:

http://fragment1.berkeley.edu/newcricket/grapher.cgi?target=%2Frouter-interfaces%2Finr-250%2Fgigabitethernet2_3;view=Octets;ranges=d%3Aw%3Am%3Ay

If the bandwidth was being used by CUDA, the turnaround time would not be rising. If the bandwidth was being used by so-called older, slower machines, would the "bytes out" not be higher? If it were database queries, the "queries/sec" would be much higher.

So, is it possible that "ACK" (hey, other computer, are you there and can we talk) is accounting for 30-40Mbits/sec of bandwidth?!?! If so, it certainly would explain the congestion and dataflow problems of late.

BTW - I am a 10 year member and have gone through a lot of "bumpy" periods. No WUs gets me going, too. But how would you like to be Matt? Everyone screaming about the equivalent of not being able to live in a mansion on a pauper's wages. And he is still nice enough to do the tech updates so we all can have an idea of what's going on at the other end (even if sometimes it's after the fact).

Oz

Member of the 20 Year Club



ID: 917959 · Report as offensive
Profile Steven Meyer
Joined: 24 Mar 08
Posts: 2333
Credit: 3,428,296
RAC: 0
United States
Message 917972 - Posted: 15 Jul 2009, 12:27:28 UTC

Is there any prognosis on poor Bruno's condition?
Any estimate on when he will recover?

181 tasks in "Uploading - Retry in xx:xx:xx" at this time.
ID: 917972 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 917974 - Posted: 15 Jul 2009, 12:31:54 UTC
Last modified: 15 Jul 2009, 12:36:12 UTC


Please be patient..

I'm patient too.. as a SETI@home member I've learned that virtue.. ;-)

I have also run out of work several times.


AFAIK, ONLY the UL server is offline.
O.K., some splitters too, but they are currently not so important.
http://setiathome.berkeley.edu/sah_status.html

I don't know why and don't know how long it will last..
But the Berkeley crew know why. They have a good reason for this.

O.K., it's sad that Matt hasn't found time to update the TNews [http://setiathome.berkeley.edu/forum_forum.php?id=21].. but he will surely post there soon.

Yes.. I now have 1 1/2 days of results waiting for UL and they can't go home.
BOINC is nearly unusable with so many results in the UL overview.. I guess it must be ~ 1,200 WUs or more..
The GPU cruncher makes ~ 800 WUs / day.

I have a ~ 4 day cache, so I hope I can UL in the next one or two days.
With more than 'CPUs x 2' ULs pending, BOINC makes no work request.
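
As a rough illustration of the 'CPUs x 2' rule just mentioned, here is a minimal Python sketch. The function name, the `factor` parameter, and the strict greater-than comparison are assumptions made for illustration, based only on how the behaviour is described in this thread; this is not BOINC's actual client code.

```python
def may_request_work(pending_uploads: int, n_cpus: int, factor: int = 2) -> bool:
    """Return True if a client would still ask the project for new work.

    Assumption (from the thread, not from BOINC source): work requests stop
    once the number of pending uploads exceeds factor * n_cpus.
    """
    if n_cpus < 1:
        raise ValueError("n_cpus must be at least 1")
    return pending_uploads <= factor * n_cpus


if __name__ == "__main__":
    # Hypothetical hosts: (pending uploads, number of CPUs)
    for uploads, cpus in [(7, 2), (4, 2), (181, 4)]:
        state = "requests work" if may_request_work(uploads, cpus) else "no work request"
        print(f"{uploads} uploads pending on {cpus} CPUs: {state}")
```

Under this assumed rule, a host with a few hundred stuck uploads stays blocked until the upload server returns, however much work is ready to send.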


O.K., SETI@home members, please don't condemn the project and don't be angry..
The Berkeley crew are doing their best to get back online.


BTW. AFAIK, SETI@home has ~ 50 % or more of all BOINC members, right?
So SETI@home has the most server traffic of all BOINC projects.
Other projects with fewer members sometimes have server problems too.

Please be patient and soon we will all have new WUs again.. :-)



BTW.
SETI@home is a nonprofit educational and research organization that relies significantly on donations to continue operations.
http://setiathome.berkeley.edu/sah_donate.php

ID: 917974 · Report as offensive
Profile [B^S] madmac
Volunteer tester
Joined: 9 Feb 04
Posts: 1175
Credit: 4,754,897
RAC: 0
United Kingdom
Message 917988 - Posted: 15 Jul 2009, 13:03:11 UTC

I am down to my last one. I must have had a slight problem with my internet, as BOINC said to check the internet or proxy, and a few seconds later it tried to upload an old WU. Has anyone any idea what is causing this slight problem? Could there have been a problem with my ISP that corrected itself after a few seconds? This is twice I have had this since updating to 6.6.20.
ID: 917988 · Report as offensive
IFRS
Volunteer tester
Joined: 21 May 99
Posts: 1736
Credit: 259,180,282
RAC: 0
Brazil
Message 917990 - Posted: 15 Jul 2009, 13:04:51 UTC

I don't mind if I can't upload my results for a week or more. But having idle machines that can't download more work because the out pile is full, while the work to distribute and process increases, doesn't make any sense to me.
Is there any workaround for this "feature"?


ID: 917990 · Report as offensive
OzzFan · Crowdfunding Project Donor · Special Project $75 donor · Special Project $250 donor
Volunteer tester
Joined: 9 Apr 02
Posts: 15691
Credit: 84,761,841
RAC: 28
United States
Message 917991 - Posted: 15 Jul 2009, 13:06:32 UTC - in response to Message 917827.  

Hello.
For more than 24 hours it has been impossible to transfer completed work.
I do not understand, since I continue to receive work.
20 WUs are finished and waiting to transfer.
The BOINC log keeps growing.
Always the same message: "Internet access OK - project servers may be temporarily down". Thank you very much for passing this message on to the server admins so that this problem can be solved....


Uploads have been disabled temporarily. The SETI Admins are working towards a solution as best they can.
ID: 917991 · Report as offensive
Zebra3
Joined: 22 Oct 01
Posts: 186
Credit: 13,658,148
RAC: 0
Canada
Message 918002 - Posted: 15 Jul 2009, 13:18:28 UTC - in response to Message 917990.  

I don't mind if I can't upload my results for a week or more. But having idle machines that can't download more work because the out pile is full, while the work to distribute and process increases, doesn't make any sense to me.
Is there any workaround for this "feature"?



There are lots of other projects out there that would love to have your main crunchers do work for them, even if only temporarily. I put as much work as I can into S@H, but when it runs out, like it has on my 2 CUDA enabled machines, I load up a WU from GPUGRID and my card is happy for hours. Yes, I'm not crunching for that day or 2 on Seti, but I am still contributing to a BOINC project just the same and it all adds up in the combined BOINC totals.

If you are unwilling to attach to other projects then I guess the big red power button is the way to go. The guys in the lab will get things up and going when they can.
http://www.novascotia.com
ID: 918002 · Report as offensive
Chris Weaver

Joined: 3 Apr 99
Posts: 6
Credit: 151,343
RAC: 0
United States
Message 918037 - Posted: 15 Jul 2009, 14:41:45 UTC

I see that the status of the upload server has been disabled now since at least yesterday.

Anybody know why, and when it will be back up?
ID: 918037 · Report as offensive
Profile Jord
Volunteer tester
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 918038 - Posted: 15 Jul 2009, 14:47:18 UTC - in response to Message 918037.  

Yes, it's disabled.
No, we don't know why.
No, we don't know when it will be back.

Plenty of threads about that around here. :-)
ID: 918038 · Report as offensive
Profile Boomer
Volunteer tester

Joined: 1 Jun 09
Posts: 43
Credit: 21,892,319
RAC: 10
United States
Message 918039 - Posted: 15 Jul 2009, 14:50:32 UTC - in response to Message 918037.  

This is due to bandwidth/server issues and very high traffic from clients trying to upload. This seems to be a chronic recurring problem for SETI@HOME. The project is not managed very well and there appear to be more volunteers and number crunching power than they can handle. More details are in the following threads:

http://setiathome.berkeley.edu/tech_news.php
http://setiathome.berkeley.edu/forum_thread.php?id=54631
ID: 918039 · Report as offensive
Profile craggyislander
Volunteer tester
Joined: 28 Oct 01
Posts: 100
Credit: 206,709
RAC: 0
United Kingdom
Message 918045 - Posted: 15 Jul 2009, 15:19:49 UTC

<<<<<<< Hope Bruno is better soon :-)
"The longest journey begins with a single step" Confucius

ID: 918045 · Report as offensive
Profile Samdani
Joined: 21 Oct 00
Posts: 85
Credit: 13,480,553
RAC: 0
Pakistan
Message 918051 - Posted: 15 Jul 2009, 15:36:15 UTC

Yes, the upload server has been disabled since Monday. My computer has a few hundred results waiting to be uploaded and it is running out of work fast. But I don’t understand the logic of linking downloads with uploads. What is the harm in allowing downloads if the work is available?
ID: 918051 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65745
Credit: 55,293,173
RAC: 49
United States
Message 918055 - Posted: 15 Jul 2009, 15:41:37 UTC - in response to Message 918051.  

Yes, the upload server has been disabled since Monday. My computer has a few hundred results waiting to be uploaded and it is running out of work fast. But I don’t understand the logic of linking downloads with uploads. What is the harm in allowing downloads if the work is available?

They're linked for good reason: to keep a potential flood of reported WUs from getting worse. S@H has a very small budget and only 3 people to run it. The servers work, but the bandwidth is at the moment in need of an upgrade.
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 918055 · Report as offensive
Profile James Sotherden
Joined: 16 May 99
Posts: 10436
Credit: 110,373,059
RAC: 54
United States
Message 918056 - Posted: 15 Jul 2009, 15:42:40 UTC

No panic here. The old P4 ran out of work and won't get any, I'm sure, with 7 trying to upload, so I think I'll turn it off and let that old CPU cool down and take a well-deserved rest. The Mac has only a .25 day cache, but I still have at least 2 or 3 days of crunching. Yep, a ton of uploads, but they will get going when they can.
So in the meantime the Mac will have free run of doing Milkyway. I really want to crack that 100.000 mark.

Old James
ID: 918056 · Report as offensive
Profile Bill Walker
Joined: 4 Sep 99
Posts: 3868
Credit: 2,697,267
RAC: 0
Canada
Message 918057 - Posted: 15 Jul 2009, 15:45:47 UTC - in response to Message 918051.  

As I understand it, the "linking" is done to cover lots of other possibilities, like problems in the user's computer or in the connections between the user and the project. You don't want the project sending units to a computer that won't receive them or can't process them. This would waste the project's resources, and could result in work units timing out at the user's end.

Right now, the uploads have probably been disabled while the SETI guys try to set up some new configuration on their end to work around our bandwidth problems. Or, it may be a hardware problem at SETI. In either case, it will eventually be resolved. In the meantime, try another BOINC project and (as I am about to do) have a cup of tea.

ID: 918057 · Report as offensive
Profile Byron S Goodgame
Volunteer tester
Joined: 16 Jan 06
Posts: 1145
Credit: 3,936,993
RAC: 0
United States
Message 918065 - Posted: 15 Jul 2009, 16:01:06 UTC - in response to Message 917988.  

I am down to my last one. I must have had a slight problem with my internet, as BOINC said to check the internet or proxy, and a few seconds later it tried to upload an old WU. Has anyone any idea what is causing this slight problem? Could there have been a problem with my ISP that corrected itself after a few seconds? This is twice I have had this since updating to 6.6.20.

I'm using 6.6.36 on one of my PCs and it's been happening to that one too. The rest of the PCs, which use other versions, don't seem to have that problem, but on the one that does, it seems to be happening more lately. I thought I'd read about someone else having the same problem, but I was leaving at the time, didn't read all of it, and can't find it now. There is nothing wrong with the connection, and it only seems to happen when there are server problems.
ID: 918065 · Report as offensive


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.