Panic Mode On (8) Server problems

Message boards : Number crunching : Panic Mode On (8) Server problems
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 5 · 6 · 7 · 8 · 9 · 10 · 11 . . . 15 · Next

AuthorMessage
BarryAZ

Send message
Joined: 1 Apr 01
Posts: 2580
Credit: 16,982,517
RAC: 0
United States
Message 796697 - Posted: 12 Aug 2008, 7:14:10 UTC - in response to Message 796694.  

Well, the problem is in some ways similar to the problem encountered over the weekend. It seems the upload server is constipated. That doesn't stop access to the database pages (including the message boards and user stats), but it does leave SETI about as functional as the UN Security Council.


Well I got this after I uploaded a WU;
12/08/2008 06:17:01|SETI@home|Scheduler request failed: Couldn't connect to server
Yet I am sending this message, weird or what?


ID: 796697 · Report as offensive
Profile [B^S] madmac
Volunteer tester
Avatar

Send message
Joined: 9 Feb 04
Posts: 1175
Credit: 4,754,897
RAC: 0
United Kingdom
Message 796699 - Posted: 12 Aug 2008, 7:16:34 UTC

Thnak you is that why I am now getting
12/08/2008 08:13:20|SETI@home|Scheduler request failed: Failed sending data to the peer
after I got internet access ok, project may be down
ID: 796699 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13746
Credit: 208,696,464
RAC: 304
Australia
Message 796710 - Posted: 12 Aug 2008, 7:34:37 UTC - in response to Message 796694.  

Well I got this after I uploaded a WU;
12/08/2008 06:17:01|SETI@home|Scheduler request failed: Couldn't connect to server
Yet I am sending this message, weird or what?

The server being used for the forums/web pages, and the server that handles uploads/downloads/scheduler requests etc are not one & the same.
From memory they also use different network connections.
Grant
Darwin NT
ID: 796710 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13746
Credit: 208,696,464
RAC: 304
Australia
Message 796733 - Posted: 12 Aug 2008, 9:02:08 UTC


Ah, children...
It would appear the log jam may have cleared.
Outbound network traffic 91Mb/s
Inbound network traffic 10Mb/s
Result creation rate 32.5/s
Grant
Darwin NT
ID: 796733 · Report as offensive
eluk

Send message
Joined: 10 May 08
Posts: 42
Credit: 634,661
RAC: 0
United Kingdom
Message 796756 - Posted: 12 Aug 2008, 11:03:09 UTC - in response to Message 796733.  


Ah, children...
It would appear the log jam may have cleared.
Outbound network traffic 91Mb/s
Inbound network traffic 10Mb/s
Result creation rate 32.5/s

That looks like a blip.
Ready to send is at 0. As of 40 mins.
Result creation rate 1.2/sec.

CPDN & Rosetta are benefiting from this.
ID: 796756 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13746
Credit: 208,696,464
RAC: 304
Australia
Message 796763 - Posted: 12 Aug 2008, 11:39:42 UTC - in response to Message 796733.  

It would appear the log jam may have cleared.

Bugger!
Looks like it was just a minor brach, the dam is still intact.
:-/

Grant
Darwin NT
ID: 796763 · Report as offensive
Rudy
Volunteer tester

Send message
Joined: 23 Jun 99
Posts: 189
Credit: 794,998
RAC: 0
Canada
Message 797167 - Posted: 13 Aug 2008, 10:05:20 UTC

Not looking good. Something seems to have crashed (again).

Server status page numbers have flatlined for about 5 hours and cricket bits out are almost non existent.
ID: 797167 · Report as offensive
Josef W. Segur
Volunteer developer
Volunteer tester

Send message
Joined: 30 Oct 99
Posts: 4504
Credit: 1,414,761
RAC: 0
United States
Message 797259 - Posted: 13 Aug 2008, 14:53:32 UTC - in response to Message 797167.  

Not looking good. Something seems to have crashed (again).

Server status page numbers have flatlined for about 5 hours and cricket bits out are almost non existent.

Server status page hasn't updated for over 10 hours now, hence the flatlining. Combined with Cricket showing almost no data, it looks like the staff will need to kick a few things when they get into the lab this morning.
                                                                Joe
ID: 797259 · Report as offensive
PhonAcq

Send message
Joined: 14 Apr 01
Posts: 1656
Credit: 30,658,217
RAC: 1
United States
Message 797264 - Posted: 13 Aug 2008, 15:04:11 UTC - in response to Message 796697.  

Seti shines compared to the UNSC, considering what the ruskies are doing right now in georgia. But that is off topic.

I say turn off AP wu production now and clean up the ghost wu's. When normal seti production (non-ap) is stabilized again, then turn on only one ap splitter to gauge the effect on the overall system. That's my soap box and I'm going to stay on it.

Well, the problem is in some ways similar to the problem encountered over the weekend. It seems the upload server is constipated. That doesn't stop access to the database pages (including the message boards and user stats), but it does leave SETI about as functional as the UN Security Council.


Well I got this after I uploaded a WU;
12/08/2008 06:17:01|SETI@home|Scheduler request failed: Couldn't connect to server
Yet I am sending this message, weird or what?


ID: 797264 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 797287 - Posted: 13 Aug 2008, 15:54:42 UTC - in response to Message 788622.  


Uhh.. I think it's time again for this one..



All people around, press this button to feel better..! ;-D
I pressed him few times.. many times.. hope he's not broken now.. ;-D


My 24/7 rig is nearly running dry.. I start to be nervous.. hope the server will send soon new work..

(BOINC update button ;-)

ID: 797287 · Report as offensive
PhonAcq

Send message
Joined: 14 Apr 01
Posts: 1656
Credit: 30,658,217
RAC: 1
United States
Message 797298 - Posted: 13 Aug 2008, 16:27:26 UTC

Hey, I just noticed the Cricket is chirping again!
ID: 797298 · Report as offensive
QSilver

Send message
Joined: 26 May 99
Posts: 232
Credit: 6,452,764
RAC: 0
United States
Message 797307 - Posted: 13 Aug 2008, 16:43:13 UTC - in response to Message 797298.  

Hey, I just noticed the Cricket is chirping again!


I got a few shortie WUs right before I noticed that chirping Cricket!

QS
ID: 797307 · Report as offensive
Profile Dr. C.E.T.I.
Avatar

Send message
Joined: 29 Feb 00
Posts: 16019
Credit: 794,685
RAC: 0
United States
Message 797310 - Posted: 13 Aug 2008, 16:49:17 UTC


. . . All's Well - that ends Well ;)

plenty of crunchin' here on mi box



. . . runs like hell to escape the baCkfirEs ;))

BOINC Wiki . . .

Science Status Page . . .
ID: 797310 · Report as offensive
Josef W. Segur
Volunteer developer
Volunteer tester

Send message
Joined: 30 Oct 99
Posts: 4504
Credit: 1,414,761
RAC: 0
United States
Message 797315 - Posted: 13 Aug 2008, 17:16:58 UTC - in response to Message 797307.  

Hey, I just noticed the Cricket is chirping again!


I got a few shortie WUs right before I noticed that chirping Cricket!

QS

When the server stats stopped updating, 6 of the 9 mb_splitter processes were working on data within the 2 June through 20 June recorded time range. The Arecibo schedule shows that only the GALFACTS project was scheduled to use ALFA during that period, so a very high proportion of shorty work is likely from those recordings. ALFA of course might be running during some maintenance times, and the set up and tear down for GALFACTS observations may also generate some work which isn't Very High Angle Range shorties.

I'm still convinced that those many shorties are a major contributor to the server difficulties. If a normal mix requires 60 MBits/sec, a mix with many shorties can easily need 120 MBits/sec which simply isn't available. And those WUs are the same size as lower angle ranges so shorties greatly increase workunit storage requirements. The addition of AstroPulse may have been "the straw which broke the camel's back", but I think the servers would have been badly stressed even without that.
                                                                  Joe
ID: 797315 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14653
Credit: 200,643,578
RAC: 874
United Kingdom
Message 797333 - Posted: 13 Aug 2008, 18:05:33 UTC

Woo Hoo! Current result creation rate 32.82/sec [As of 13 Aug 2008 18:00:14 UTC]

We have lift off.
ID: 797333 · Report as offensive
QSilver

Send message
Joined: 26 May 99
Posts: 232
Credit: 6,452,764
RAC: 0
United States
Message 797339 - Posted: 13 Aug 2008, 18:27:17 UTC

...and Cricket's green graph is showing 90+MB for a while now!

QS
ID: 797339 · Report as offensive
Profile arkayn
Volunteer tester
Avatar

Send message
Joined: 14 May 99
Posts: 4438
Credit: 55,006,323
RAC: 0
United States
Message 797352 - Posted: 13 Aug 2008, 18:46:31 UTC

Of course the stats are not generating again so BoincStats is showing nothing for the last couple of days.

ID: 797352 · Report as offensive
Steve Bergman
Volunteer tester

Send message
Joined: 20 May 07
Posts: 48
Credit: 292,679
RAC: 0
United States
Message 797357 - Posted: 13 Aug 2008, 18:58:51 UTC - in response to Message 795028.  
Last modified: 13 Aug 2008, 19:11:37 UTC

I would just like to see things stabilize and see the optimizers be able to get the code to the point that credits would eqaulize between MB an AP.......
'Course, my MB credits are not what they used to be........

AP is the first seti@home app that does more than just waste time and electricity; It can actually do real science while it's looking for spooks. I say to heck with credit considerations and full speed ahead with AstroPulse!

A lot has been said lately about seti@home's shoestring budget. Well, AstroPulse is exactly the sort of respectable app to just possibly rectify that. Fact is, it's hard to get grants for chasing ghosts. Especially when you've been at it for years and have never even seen one, let alone caught one. It lacks credibility. The smart money doesn't go for it. But after we discover a few pulsars, and maybe even something really unexpected (fingers crossed)... I would imagine that both seti@home and Arecibo would get more respect.
ID: 797357 · Report as offensive
Profile Blurf
Volunteer tester

Send message
Joined: 2 Sep 06
Posts: 8962
Credit: 12,678,685
RAC: 0
United States
Message 797404 - Posted: 13 Aug 2008, 20:35:15 UTC - in response to Message 797352.  

Of course the stats are not generating again so BoincStats is showing nothing for the last couple of days.


Arkayn--please be patient. The stats will catch up


ID: 797404 · Report as offensive
PhonAcq

Send message
Joined: 14 Apr 01
Posts: 1656
Credit: 30,658,217
RAC: 1
United States
Message 797512 - Posted: 13 Aug 2008, 23:46:35 UTC - in response to Message 797357.  



A lot has been said lately about seti@home's shoestring budget. Well, AstroPulse is exactly the sort of respectable app to just possibly rectify that. Fact is, it's hard to get grants for chasing ghosts. Especially when you've been at it for years and have never even seen one, let alone caught one. It lacks credibility. The smart money doesn't go for it. But after we discover a few pulsars, and maybe even something really unexpected (fingers crossed)... I would imagine that both seti@home and Arecibo would get more respect.


I agree most fully. So this is the reason the servers and the procedures at centralcommand should be as professionally operated as possible. Nobody is going to drop $1M for new servers. But a rock solid operational history that funding agents can 'trust' would go along way to justifying a post-doc and tech sized grant. That would be a start. The operations for the last week or so since turning on AP is disappointing, however.
ID: 797512 · Report as offensive
Previous · 1 . . . 5 · 6 · 7 · 8 · 9 · 10 · 11 . . . 15 · Next

Message boards : Number crunching : Panic Mode On (8) Server problems


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.