Panic Mode On (81) Server Problems?

Message boards : Number crunching : Panic Mode On (81) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 14 · 15 · 16 · 17 · 18 · 19 · 20 . . . 21 · Next

AuthorMessage
Tom*

Send message
Joined: 12 Aug 11
Posts: 127
Credit: 20,769,223
RAC: 9
United States
Message 1338054 - Posted: 14 Feb 2013, 10:11:53 UTC - in response to Message 1335768.  
Last modified: 14 Feb 2013, 10:12:36 UTC

Yes, we need more bandwidth to fully sort comms difficulties, but Matt has proven that proper server configuration can also go a long ways toward fully utilizing what we now have.


I totally agree whatever tweaks Matt made has made a huge difference in schedular comms, I still need a Proxy to download AP's but can usually do without for Sched and upload/download MB's No need to switch back and forth either as Schedular now answers fairly reliably with the shorter timeouts of my
Proxy.

Can someone please give Matt a pad of post-it notes (any color but canary yellow)
so he can leave tips in case he goes on tour :-)

Thank you again Matt
ID: 1338054 · Report as offensive
David S
Volunteer tester
Avatar

Send message
Joined: 4 Oct 99
Posts: 18352
Credit: 27,761,924
RAC: 12
United States
Message 1338084 - Posted: 14 Feb 2013, 14:30:37 UTC

Yesterday, the UPS that runs one of my crunchers, the Uverse modem/router, and the radios for my radioreference.com feed, suddenly decided to take a nap. The UPS log shows nothing. It had some event 3-4 weeks ago, then it shows program start yesterday when I turned everything on. I have no idea what happened.

Anyway, the other cruncher, not on that UPS, built up quite a pile of uploads in six hours. Once it was able to reach the internet again, (and here's the point of this post) it sent them all in pretty fast. Then it reported them and asked for more, was resent 14 ghosts, and downloaded those almost as fast as the uploads. So everything was good around 1730 CST. (However, a couple hours later, I looked again and it had a couple of new downloads that were stuck.)

David
Sitting on my butt while others boldly go,
Waiting for a message from a small furry creature from Alpha Centauri.

ID: 1338084 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1338109 - Posted: 14 Feb 2013, 16:04:55 UTC
Last modified: 14 Feb 2013, 16:05:52 UTC

Looks like Seti is not the only one that has panic modes.
Seems like Boincstats was down sometime yesterday or last night.
Was still down early this morning, but they're back now.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1338109 · Report as offensive
Gone

Send message
Joined: 31 May 99
Posts: 150
Credit: 125,779,206
RAC: 0
United Kingdom
Message 1338656 - Posted: 15 Feb 2013, 23:15:36 UTC

ID: 1338656 · Report as offensive
Profile KWSN Ekky Ekky Ekky
Avatar

Send message
Joined: 25 May 99
Posts: 944
Credit: 52,956,491
RAC: 67
United Kingdom
Message 1338660 - Posted: 15 Feb 2013, 23:27:10 UTC - in response to Message 1338656.  

[quote]Is it me or is the little blue line having it's traditional weekend dive ?

Yep - don't let the team head for them thar hills until it comes back up again!




ID: 1338660 · Report as offensive
Gone

Send message
Joined: 31 May 99
Posts: 150
Credit: 125,779,206
RAC: 0
United Kingdom
Message 1338665 - Posted: 15 Feb 2013, 23:39:40 UTC - in response to Message 1338660.  

It seems to have done a "Dead Cat Bounce" and is on the rise again......
ID: 1338665 · Report as offensive
Iona
Avatar

Send message
Joined: 12 Jul 07
Posts: 790
Credit: 22,438,118
RAC: 0
United Kingdom
Message 1338745 - Posted: 16 Feb 2013, 4:25:12 UTC

Uh-oh.....35 minutes trying to report, so far.



Don't take life too seriously, as you'll never come out of it alive!
ID: 1338745 · Report as offensive
ExchangeMan
Volunteer tester

Send message
Joined: 9 Jan 00
Posts: 115
Credit: 157,719,104
RAC: 0
United States
Message 1338751 - Posted: 16 Feb 2013, 4:32:05 UTC - in response to Message 1338745.  

Uh-oh.....35 minutes trying to report, so far.



I've noticed some problems too...

ID: 1338751 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1338767 - Posted: 16 Feb 2013, 5:16:35 UTC
Last modified: 16 Feb 2013, 5:17:10 UTC

It dipped, but came back up.

I still have almost 1800 WUs stored......all I can get. And that's with 9 crunchers online.

S U X
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1338767 · Report as offensive
Profile KWSN Ekky Ekky Ekky
Avatar

Send message
Joined: 25 May 99
Posts: 944
Credit: 52,956,491
RAC: 67
United Kingdom
Message 1338819 - Posted: 16 Feb 2013, 8:42:35 UTC - in response to Message 1338767.  
Last modified: 16 Feb 2013, 8:48:56 UTC

It dipped, but came back up.

Interesting, these dips.
In fact the blue line seems to be suffering from hiccups, judging by the last few hours.
http://fragment1.berkeley.edu/newcricket/grapher.cgi?target=%2Frouter-interfaces%2Finr-250%2Fgigabitethernet2_3;view=Octets;ranges=d%3Aw[/quote]
[Edit]Yep, there it goes down again. Question is, will it come back up as well?[End Edit]

ID: 1338819 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22189
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1338825 - Posted: 16 Feb 2013, 9:03:41 UTC

Yellow fluff alert!!!!

One of those dips is active....
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1338825 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1338827 - Posted: 16 Feb 2013, 9:10:06 UTC
Last modified: 16 Feb 2013, 9:30:52 UTC

The the blue line of the cricket is falling again...

Prepare for the new "normal weekend msg": project servers may be temporarily down.
ID: 1338827 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1338833 - Posted: 16 Feb 2013, 9:25:54 UTC - in response to Message 1338827.  

The teh blue line of the cricket is falling again...

Prepare for the new "normal weekend msg": project servers may be temporarily down.

Agreed. One of my hosts got through about ten minutes ago, with a few to report, and got 15 new ones in return. So far, so normal.

But the 'sent' time in the database looks like this:

9:08:15
9:08:13
9:08:11
9:08:10
9:08:07
9:08:05
9:08:04
9:08:02
9:07:59
9:07:57
9:07:55
9:07:55
9:07:54
9:07:52
9:07:49

Normally, all the tasks in a batch have the same timestamp, or maybe there's a difference of one second. Now, it seems to be taking two seconds per task. That's one sick database.

All tasks for computer 6910484
ID: 1338833 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1338836 - Posted: 16 Feb 2013, 9:34:13 UTC
Last modified: 16 Feb 2013, 9:37:35 UTC

Looks like Blue line rise again, Maybe a servers hipcup? or the watchdog awake?

It´s working now. But if you see the graph with atention, on the last 12 hours, each some time the line drops for few minutes and rise again, that´s is not normal.
ID: 1338836 · Report as offensive
mramakers

Send message
Joined: 20 Jul 04
Posts: 42
Credit: 3,694,335
RAC: 0
Netherlands
Message 1338844 - Posted: 16 Feb 2013, 11:13:45 UTC
Last modified: 16 Feb 2013, 11:19:33 UTC

And it's weekend again. Scheduler request failing time after time.

-Offtopic- How is it possible that some people have a 200 WU cache, while mine is 100 max? I,ve tried different numbers for work and additional work cache, but it doesn't seem to make a difference. I,ve searched the forum, but i could not find any topic about the subject. Is it because I do GPU work only?
ID: 1338844 · Report as offensive
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34255
Credit: 79,922,639
RAC: 80
Germany
Message 1338846 - Posted: 16 Feb 2013, 11:24:38 UTC - in response to Message 1338844.  

And it's weekend again. Scheduler request failing time after time.

-Offtopic- How is it possible that some people have a 200 WU cache, while mine is 100 max? I,ve tried different numbers for work and additional work cache, but it doesn't seem to make a difference. I,ve searched the forum, but i could not find any topic about the subject. Is it because I do GPU work only?


Yes.

You can get 100 for CPU plus 100 for GPU.



With each crime and every kindness we birth our future.
ID: 1338846 · Report as offensive
mramakers

Send message
Joined: 20 Jul 04
Posts: 42
Credit: 3,694,335
RAC: 0
Netherlands
Message 1338847 - Posted: 16 Feb 2013, 11:31:41 UTC - in response to Message 1338846.  
Last modified: 16 Feb 2013, 11:31:54 UTC

Thank you.
ID: 1338847 · Report as offensive
Profile TRuEQ & TuVaLu
Volunteer tester
Avatar

Send message
Joined: 4 Oct 99
Posts: 505
Credit: 69,523,653
RAC: 10
Sweden
Message 1338858 - Posted: 16 Feb 2013, 12:38:18 UTC
Last modified: 16 Feb 2013, 12:43:29 UTC

I've got me a few ap wu's yesterday and today.
A bit of better speed now...about 6-8Kb when it downloads.
Still stalled transfers and server backoffs but not as frequent as before.

And woops we have 16Kb a sec again.

and raising to 24KB a sec

cricket please.....don't move
ID: 1338858 · Report as offensive
Profile TRuEQ & TuVaLu
Volunteer tester
Avatar

Send message
Joined: 4 Oct 99
Posts: 505
Credit: 69,523,653
RAC: 10
Sweden
Message 1338881 - Posted: 16 Feb 2013, 15:18:29 UTC
Last modified: 16 Feb 2013, 15:18:53 UTC

Now I got me some intresting respons.

2013-02-16 16:16:25 | SETI@home | update requested by user
2013-02-16 16:16:25 | SETI@home | Temporarily failed download of ap_08dc12af_B6_P1_00376_20130216_14751.wu: transient HTTP error
2013-02-16 16:16:25 | SETI@home | Backing off 29 min 29 sec on download of ap_08dc12af_B6_P1_00376_20130216_14751.wu
2013-02-16 16:16:25 | SETI@home | Started download of ap_03ja13ad_B6_P0_00340_20130210_27913.wu
2013-02-16 16:16:27 | SETI@home | Sending scheduler request: Requested by user.
2013-02-16 16:16:27 | SETI@home | Not requesting tasks: some download is stalled
2013-02-16 16:16:36 | SETI@home | Scheduler request completed
2013-02-16 16:16:39 | | Project communication failed: attempting access to reference site
2013-02-16 16:16:41 | | Internet access OK - project servers may be temporarily down.


I am not even try to figure this out....
ID: 1338881 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1338884 - Posted: 16 Feb 2013, 15:28:35 UTC
Last modified: 16 Feb 2013, 15:31:22 UTC

No problemos here.
I got all 1800 tasks I can get. The kitties have been sniffing 'em out and crunching 'em down.

The chinks in the upload bandwidth on the Cricket are a little worrisome though.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1338884 · Report as offensive
Previous · 1 . . . 14 · 15 · 16 · 17 · 18 · 19 · 20 . . . 21 · Next

Message boards : Number crunching : Panic Mode On (81) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.