Panic Mode On (81) Server Problems?

Message boards : Number crunching : Panic Mode On (81) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 15 · 16 · 17 · 18 · 19 · 20 · 21 · Next

AuthorMessage
Profile BMaytum
Volunteer tester
Avatar

Send message
Joined: 3 Apr 99
Posts: 104
Credit: 4,382,041
RAC: 2
United States
Message 1338911 - Posted: 16 Feb 2013, 17:31:20 UTC - in response to Message 1335768.  
Last modified: 16 Feb 2013, 17:48:04 UTC

Bandwidth ... than useless server chatter with no results.

Yes, we need more bandwidth to fully sort comms difficulties, but Matt has proven that proper server configuration can also go a long ways toward fully utilizing what we now have.


Whatever tweaks or adjustments Matt applied recently have really IMPROVED the DL /UL /Report throughput (and now few stalls) for me, just over the hill from SSL. Prior to those tweaks, I seriously contemplated pulling my plug on SAH from so much communications frustration. Now greatly improved, I'll stick around to atleast reach my 14th SAH Anniversary in April.

Kudos MATT!
Sabertooth Z77, i7-3770K@4.2GHz, GTX680, W8.1Pro x64
P5N32-E SLI, C2D E8400@3Ghz, GTX580, Win7SP1Pro x64 & PCLinuxOS2015 x64
ID: 1338911 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1338921 - Posted: 16 Feb 2013, 18:21:35 UTC

My comms are doing fairly well, too. Every now and then I need to use a proxy to push the AP downloads along, but usually just 3-5 re-tries without a proxy and they'll go all the way through.

My single-core machine oddly doesn't have that issue at all. In BOINC Land, nothing ever makes sense, and nothing is predictable.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1338921 · Report as offensive
Profile ivan
Volunteer tester
Avatar

Send message
Joined: 5 Mar 01
Posts: 783
Credit: 348,560,338
RAC: 223
United Kingdom
Message 1338976 - Posted: 16 Feb 2013, 23:01:28 UTC - in response to Message 1338921.  

I thought I had a way to fix the bandwidth problem this morning -- National Lottery sent me a message saying, "We've good news about your ticket in last night's Euromillions draw!" Alas, it was only £5, not £500,000...
ID: 1338976 · Report as offensive
Profile KWSN Ekky Ekky Ekky
Avatar

Send message
Joined: 25 May 99
Posts: 944
Credit: 52,956,491
RAC: 67
United Kingdom
Message 1339044 - Posted: 17 Feb 2013, 1:27:10 UTC
Last modified: 17 Feb 2013, 1:30:27 UTC

The curious Cricket blue line hiccups continue:
http://fragment1.berkeley.edu/newcricket/grapher.cgi?target=%2Frouter-interfaces%2Finr-250%2Fgigabitethernet2_3;view=Octets;ranges=d%3Aw
I wonder if this a symptom of a problem or whatever was done to cure the greater difficulties?
Looks like another downward cycle is starting at present.

ID: 1339044 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1339079 - Posted: 17 Feb 2013, 5:40:18 UTC - in response to Message 1339044.  

The curious Cricket blue line hiccups continue:
http://fragment1.berkeley.edu/newcricket/grapher.cgi?target=%2Frouter-interfaces%2Finr-250%2Fgigabitethernet2_3;view=Octets;ranges=d%3Aw
I wonder if this a symptom of a problem or whatever was done to cure the greater difficulties?
Looks like another downward cycle is starting at present.

Might be a bit wonky, but so far the rigs seem to be able to keep the work flowing.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1339079 · Report as offensive
ExchangeMan
Volunteer tester

Send message
Joined: 9 Jan 00
Posts: 115
Credit: 157,719,104
RAC: 0
United States
Message 1339083 - Posted: 17 Feb 2013, 5:58:22 UTC - in response to Message 1339079.  

The curious Cricket blue line hiccups continue:
http://fragment1.berkeley.edu/newcricket/grapher.cgi?target=%2Frouter-interfaces%2Finr-250%2Fgigabitethernet2_3;view=Octets;ranges=d%3Aw
I wonder if this a symptom of a problem or whatever was done to cure the greater difficulties?
Looks like another downward cycle is starting at present.

Might be a bit wonky, but so far the rigs seem to be able to keep the work flowing.

Acting a little wierd here too. Been able to keep going with some manual intervention.

ID: 1339083 · Report as offensive
mramakers

Send message
Joined: 20 Jul 04
Posts: 42
Credit: 3,694,335
RAC: 0
Netherlands
Message 1339092 - Posted: 17 Feb 2013, 8:23:53 UTC - in response to Message 1338844.  

And it's weekend again. Scheduler request failing time after time.

I guess I was a bit quick to start whining yersterday, because it seems it was only a glitch. Everything is working perfect ever since.
-Keeping fingers crossed-
ID: 1339092 · Report as offensive
Profile TRuEQ & TuVaLu
Volunteer tester
Avatar

Send message
Joined: 4 Oct 99
Posts: 505
Credit: 69,523,653
RAC: 10
Sweden
Message 1339095 - Posted: 17 Feb 2013, 8:34:37 UTC

Well, my problem seems to be solved.

I have uppgraded to BM beta 7.0.4+ and now getting a funny answer in messages when requesting new work.

2013-02-17 09:29:52 | Moo! Wrapper | Not requesting tasks: don't need
The same goes for SETI tasks and if tasks are in transfer it doesn't ask for more either.... and with alot of stalled transfers...


And work cache is set to 3+1 days and i only have a cue of 2-3 hours....

I hope they are working on this.
I made a thread in Boinc Forum about it

http://boinc.berkeley.edu/dev/forum_thread.php?id=8196&sort=6

I hope they can sort it out.
ID: 1339095 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1339386 - Posted: 19 Feb 2013, 16:08:52 UTC

Well, here we are back after a most unusual outage. Even though the web server went down, the project servers kept going and work continued to flow. Sometimes, it's quite the opposite, LOL.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1339386 · Report as offensive
Profile Fred E.
Volunteer tester

Send message
Joined: 22 Jul 99
Posts: 768
Credit: 24,140,697
RAC: 0
United States
Message 1339387 - Posted: 19 Feb 2013, 16:17:57 UTC

Yes, the project ran well without our gripes and panic posts. Wonder if.... Guess it is time for the usual maintenance window, and I got so many astropulse tasks during the blackout I can make it thru maintenance.

Hope to hear more about the cause of the web server downtime.
Another Fred
Support SETI@home when you search the Web with GoodSearch or shop online with GoodShop.
ID: 1339387 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1339390 - Posted: 19 Feb 2013, 16:20:57 UTC - in response to Message 1339387.  

Yes, the project ran well without our gripes and panic posts. Wonder if.... Guess it is time for the usual maintenance window, and I got so many astropulse tasks during the blackout I can make it thru maintenance.

Hope to hear more about the cause of the web server downtime.

Don't know the exact reason it happened, but thinman, the web server, crashed and could not be rebooted by remote. Eric tried. And since yesterday was a campus holiday due to Presidents' Day, nobody was in the lab to restart it.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1339390 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1339502 - Posted: 20 Feb 2013, 1:17:30 UTC
Last modified: 20 Feb 2013, 1:21:33 UTC

Maybe if you didn't saw it already on the 1st page: http://setiathome.berkeley.edu.


Lab-wide Power Outage This Weekend (Feb 22-24)
Once again we are going offline due to electrical repairs in our entire building. We plan to come down Friday afternoon (15:00 local time) and be back up by Sunday afternoon (12:00 local time). All SETI servers will be unreachable during that span. 19 Feb 2013, 23:58:59 UTC


IIRC, the SAH lab is UTC -08:00.
This mean currently Germany is 9 hours before ..
This would mean Saturday 00:00 to Sunday 21:00 (MET).


* Best regards! :-) * Philip J. Fry formerly Sutaru Tsureku, team seti.international founder. * Optimize your PC for higher RAC. * SETI@home needs your help. *
ID: 1339502 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1339822 - Posted: 20 Feb 2013, 21:03:52 UTC

That's cool. My first time ever getting this..

2013-02-20 13:20:06 SETI@home Message from server: This computer has reached a limit on tasks in progress


I was stock-piling for this weekend's outage and I found a proxy that works pretty well for AP DLs.. so I changed from 50% of CPU cores to 100% for an hour or so and then switched back when I got to the limit. I'm good for about 17 days now.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1339822 · Report as offensive
Rolf

Send message
Joined: 16 Jun 09
Posts: 114
Credit: 7,817,146
RAC: 0
Switzerland
Message 1339858 - Posted: 20 Feb 2013, 22:18:05 UTC - in response to Message 1339502.  

Maybe if you didn't saw it already on the 1st page: http://setiathome.berkeley.edu.


Lab-wide Power Outage This Weekend (Feb 22-24)
Once again we are going offline due to electrical repairs in our entire building. We plan to come down Friday afternoon (15:00 local time) and be back up by Sunday afternoon (12:00 local time). All SETI servers will be unreachable during that span. 19 Feb 2013, 23:58:59 UTC


IIRC, the SAH lab is UTC -08:00.
This mean currently Germany is 9 hours before ..
This would mean Saturday 00:00 to Sunday 21:00 (MET).


Why not increase the limits for this special occasion? Double?
ID: 1339858 · Report as offensive
bill

Send message
Joined: 16 Jun 99
Posts: 861
Credit: 29,352,955
RAC: 0
United States
Message 1339865 - Posted: 20 Feb 2013, 22:55:26 UTC - in response to Message 1339858.  

"Why not increase the limits for this special occasion?"

For the same reason they were installed in the first place.
ID: 1339865 · Report as offensive
David S
Volunteer tester
Avatar

Send message
Joined: 4 Oct 99
Posts: 18352
Credit: 27,761,924
RAC: 12
United States
Message 1339907 - Posted: 21 Feb 2013, 17:28:49 UTC

Well, it looks like we're back from whatever partial outage we had last night.

Anybody get any details of what happened from one of the guys?

David
Sitting on my butt while others boldly go,
Waiting for a message from a small furry creature from Alpha Centauri.

ID: 1339907 · Report as offensive
Profile arkayn
Volunteer tester
Avatar

Send message
Joined: 14 May 99
Posts: 4438
Credit: 55,006,323
RAC: 0
United States
Message 1339911 - Posted: 21 Feb 2013, 17:40:05 UTC - in response to Message 1339907.  

Well, it looks like we're back from whatever partial outage we had last night.

Anybody get any details of what happened from one of the guys?


Korpela wrote:
Yep, it looks like thinman is down and bruno is hung. Unfortunately
this happened about half an hour after I left work.


ID: 1339911 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1339917 - Posted: 21 Feb 2013, 17:59:29 UTC
Last modified: 21 Feb 2013, 18:02:52 UTC

So far, the servers have come back live and hot. All 9 rigs have reported their uploads and are receiving fresh work!
The kitties are happy campers again, and I can shut the furnace back off.

I do wish the limits could be raised to allow some increased caching for the coming weekend's scheduled outage, but the kitties are not gonna hold their breath on that one.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1339917 · Report as offensive
Profile Link
Avatar

Send message
Joined: 18 Sep 03
Posts: 834
Credit: 1,807,369
RAC: 0
Germany
Message 1339921 - Posted: 21 Feb 2013, 18:10:44 UTC - in response to Message 1339865.  

"Why not increase the limits for this special occasion?"

For the same reason they were installed in the first place.

Also it would not make sense to increase it, I'm pretty sure the bandwidth will be maxed out until the planned outage, i.e. they will send out as many WUs as they can either way (and also probaly assign a lot of WUs, which can't be downloaded before they shut down).
ID: 1339921 · Report as offensive
Profile Michael W.F. Miles
Avatar

Send message
Joined: 24 Mar 07
Posts: 268
Credit: 34,410,870
RAC: 0
Canada
Message 1339941 - Posted: 21 Feb 2013, 18:54:05 UTC

It would be nice to get some work before they shut down. With the limit of 200 that will give my meager system work for one 12 hour period.
Last night everything was down and now that there is a planned outage nobody gets work. All servers are now down.
Insanity is doing the same thing over and over but expecting different results each time.


Oh, please, oh, please allow us to crunch for you seti

Michael Miles
ID: 1339941 · Report as offensive
Previous · 1 . . . 15 · 16 · 17 · 18 · 19 · 20 · 21 · Next

Message boards : Number crunching : Panic Mode On (81) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.