Panic Mode On (7) Server Problems! Closed for Renovation

Message boards : Number crunching : Panic Mode On (7) Server Problems! Closed for Renovation
kittyman (Crowdfunding Project Donor, Special Project $75 donor, Special Project $250 donor)
Volunteer tester
Joined: 9 Jul 00
Posts: 51492
Credit: 1,018,363,574
RAC: 1,004
United States
Message 738866 - Posted: 14 Apr 2008, 9:39:56 UTC
Last modified: 14 Apr 2008, 9:42:26 UTC

Yup.....Cricket Graph shows the bandwidth has gone flatter than road kill.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 738866 · Report as offensive
John Clark
Volunteer tester
Joined: 29 Sep 99
Posts: 16515
Credit: 4,418,829
RAC: 0
United Kingdom
Message 738867 - Posted: 14 Apr 2008, 9:46:52 UTC

Was OK at 7.15 BST (UTC+1) when I looked at the Cricket Graphs. So, the server in question must have fallen over about 06.30 UTC.

At least crunched WU results are being returned, and the handshaking that acknowledges the reported figures is working OK.

Living on your caches, then?
It's good to be back amongst friends and colleagues



ID: 738867 · Report as offensive
kittyman (Crowdfunding Project Donor, Special Project $75 donor, Special Project $250 donor)
Volunteer tester
Joined: 9 Jul 00
Posts: 51492
Credit: 1,018,363,574
RAC: 1,004
United States
Message 738872 - Posted: 14 Apr 2008, 10:01:02 UTC - in response to Message 738867.  

Was OK at 7.15 BST (UTC+1) when I looked at the Cricket Graphs. So, the server in question must have fallen over about 06.30 UTC.

At least crunched WU results are being returned, and the handshaking that acknowledges the reported figures is working OK.

Living on your caches, then?

Yup.....except for the 'kitty sniffer' rig, which has about another hour and a half of work left, then it falls back to Einstein and Rosetta 'till the smoke clears.......

"Time is simply the mechanism that keeps everything from happening all at once."

ID: 738872 · Report as offensive
Fred J. Verster
Volunteer tester
Joined: 21 Apr 04
Posts: 3252
Credit: 31,903,643
RAC: 0
Netherlands
Message 738879 - Posted: 14 Apr 2008, 10:21:38 UTC - in response to Message 738867.  

Was OK at 7.15 BST (UTC+1) when I looked at the Cricket Graphs. So, the server in question must have fallen over about 06.30 UTC.

At least crunched WU results are being returned, and the handshaking that acknowledges the reported figures is working OK.

Living on your caches, then?


I've set my cache to 5 days.
Here are the last messages from the S@H servers:

14-4-2008 3:45:30|SETI@home|Sending scheduler request: To fetch work. Requesting 145 seconds of work, reporting 0 completed tasks
14-4-2008 3:45:31|SETI@home|Started upload of 14fe08ab.23100.3344.6.8.156_0_0
14-4-2008 3:45:35|SETI@home|Scheduler request succeeded: got 1 new tasks
14-4-2008 3:45:36|SETI@home|Finished upload of 14fe08ab.23100.3344.6.8.156_0_0
14-4-2008 3:45:37|SETI@home|Started download of 27mr08ao.17264.18886.12.8.129
14-4-2008 3:45:52|SETI@home|Finished download of 27mr08ao.17264.18886.12.8.129
14-4-2008 3:56:53|SETI@home|Sending scheduler request: To fetch work. Requesting 54 seconds of work, reporting 1 completed tasks
14-4-2008 3:56:58|SETI@home|Scheduler request succeeded: got 1 new tasks
14-4-2008 3:57:00|SETI@home|Started download of 28mr08ab.30454.24885.13.8.23
14-4-2008 3:57:02|SETI@home|Temporarily failed download of 28mr08ab.30454.24885.13.8.23: http error
14-4-2008 3:57:02|SETI@home|Backing off 1 min 0 sec on download of 28mr08ab.30454.24885.13.8.23

Then communication works until:

14-4-2008 10:46:48|SETI@home|Started upload of 27mr08ak.15220.4571.7.8.224_0_0
14-4-2008 10:46:52|SETI@home|Finished upload of 27mr08ak.15220.4571.7.8.224_0_0
14-4-2008 11:13:02|SETI@home|Sending scheduler request: To fetch work. Requesting 11 seconds of work, reporting 14 completed tasks
14-4-2008 11:13:08|SETI@home|Scheduler request succeeded: got 1 new tasks
14-4-2008 11:13:10|SETI@home|Started download of 28mr08ac.12359.29955.10.8.139
14-4-2008 11:13:32||Project communication failed: attempting access to reference site
14-4-2008 11:13:32|SETI@home|Temporarily failed download of 28mr08ac.12359.29955.10.8.139: http error
14-4-2008 11:13:32|SETI@home|Backing off 1 min 0 sec on download of 28mr08ac.12359.29955.10.8.139

And after a while:

14-4-2008 11:49:17|SETI@home|Backing off 2 hr 14 min 17 sec on download of 28mr08ab.21349.12342.15.8.85
14-4-2008 12:10:01|SETI@home|Computation for task 27mr07ag.7880.15614.14.7.164_2 finished
14-4-2008 12:10:01|SETI@home|Starting 27mr08am.15190.16023.5.8.109_0
14-4-2008 12:10:01|SETI@home|Starting task 27mr08am.15190.16023.5.8.109_0 using setiathome_enhanced version 527
14-4-2008 12:10:03|SETI@home|Started upload of 27mr07ag.7880.15614.14.7.164_2_0
14-4-2008 12:10:08|SETI@home|Finished upload of 27mr07ag.7880.15614.14.7.164_2_0

As of this moment it appears to be OK.
Looks like the DOWNLOAD server gets overloaded from time to time.
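
For anyone who wants to check their own message log for the same pattern, here is a minimal sketch in Python. It assumes the log lines look like the ones pasted above (`timestamp|project|message`, with a day-month-year date); the function names `parse_line` and `summarize` are made up for this example and are not part of BOINC.

```python
from collections import Counter
from datetime import datetime

# A few lines copied from the log excerpts above.
SAMPLE = [
    "14-4-2008 3:45:36|SETI@home|Finished upload of 14fe08ab.23100.3344.6.8.156_0_0",
    "14-4-2008 3:45:52|SETI@home|Finished download of 27mr08ao.17264.18886.12.8.129",
    "14-4-2008 3:57:02|SETI@home|Temporarily failed download of 28mr08ab.30454.24885.13.8.23: http error",
    "14-4-2008 11:13:32||Project communication failed: attempting access to reference site",
    "14-4-2008 11:13:32|SETI@home|Temporarily failed download of 28mr08ac.12359.29955.10.8.139: http error",
]


def parse_line(line):
    """Split one log line into (timestamp, project, message).

    Assumes the 'D-M-YYYY H:MM:SS|project|message' layout seen above.
    The project field may be empty, as in the 'Project communication failed' line.
    """
    stamp, project, message = line.split("|", 2)
    when = datetime.strptime(stamp, "%d-%m-%Y %H:%M:%S")
    return when, project, message


def summarize(lines):
    """Count finished and temporarily failed transfers in a list of log lines."""
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue
        _, _, message = parse_line(line)
        if message.startswith("Temporarily failed download"):
            counts["failed_downloads"] += 1
        elif message.startswith("Finished download"):
            counts["finished_downloads"] += 1
        elif message.startswith("Finished upload"):
            counts["finished_uploads"] += 1
    return counts


if __name__ == "__main__":
    for kind, n in sorted(summarize(SAMPLE).items()):
        print(kind, n)
```

If failed downloads pile up while uploads keep finishing, that would fit the guess that only the download side is struggling.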


ID: 738879 · Report as offensive
Dennis
Joined: 26 Jun 07
Posts: 153
Credit: 15,826,319
RAC: 0
United States
Message 738891 - Posted: 14 Apr 2008, 11:08:03 UTC

I see we are unable to download at the moment. No problem, this happens. But all weekend I seem to have had an ever-increasing pending total, from about 30k on Saturday to almost 48k today, and a slowly declining RAC. I see some WUs are being validated, but I'm wondering if there is a slowdown here or if my wingmen are goofing off, lol. Maybe just Mark, Satan and a few others are slacking a bit. Does anyone else have a pending list increasing more than seems normal over the last few days?
ID: 738891 · Report as offensive
kittyman (Crowdfunding Project Donor, Special Project $75 donor, Special Project $250 donor)
Volunteer tester
Joined: 9 Jul 00
Posts: 51492
Credit: 1,018,363,574
RAC: 1,004
United States
Message 738894 - Posted: 14 Apr 2008, 11:10:53 UTC - in response to Message 738891.  

I see we are unable to download at the moment. No problem, this happens. But all weekend I seem to have had an ever-increasing pending total, from about 30k on Saturday to almost 48k today, and a slowly declining RAC. I see some WUs are being validated, but I'm wondering if there is a slowdown here or if my wingmen are goofing off, lol. Maybe just Mark, Satan and a few others are slacking a bit. Does anyone else have a pending list increasing more than seems normal over the last few days?

I think pendings always tend to go up over the weekend because of users crunching at work: their machines may crunch all weekend but not connect again to report until Monday morning.....
But that is just my theory......

"Time is simply the mechanism that keeps everything from happening all at once."

ID: 738894 · Report as offensive
David
Volunteer tester
Joined: 19 May 99
Posts: 411
Credit: 1,426,457
RAC: 0
Australia
Message 738906 - Posted: 14 Apr 2008, 11:52:29 UTC - in response to Message 738867.  

Living on your caches, then?


Yeah 4 day cache here now, and that should be more than enough to ride out all but the most extreme outages
ID: 738906 · Report as offensive
LeCelte44
Joined: 29 Sep 99
Posts: 3
Credit: 294,368
RAC: 0
France
Message 738935 - Posted: 14 Apr 2008, 14:00:33 UTC - in response to Message 738906.  

Hello all,
I installed SETI on a second PC and it can't download certain parts.
Is there a problem with the server?

ID: 738935 · Report as offensive
KB7RZF
Volunteer tester
Joined: 15 Aug 99
Posts: 9549
Credit: 3,308,926
RAC: 2
United States
Message 738937 - Posted: 14 Apr 2008, 14:01:27 UTC - in response to Message 738935.  

Hello all,
I installed SETI on a second PC and it can't download certain parts.
Is there a problem with the server?

Short answer, Yes. :-)
ID: 738937 · Report as offensive
LeCelte44
Joined: 29 Sep 99
Posts: 3
Credit: 294,368
RAC: 0
France
Message 738940 - Posted: 14 Apr 2008, 14:04:40 UTC - in response to Message 738937.  


Short answer, Yes. :-)
Thanks a lot

pierre
ID: 738940 · Report as offensive
SATAN
Joined: 27 Aug 06
Posts: 835
Credit: 2,129,006
RAC: 0
United Kingdom
Message 738951 - Posted: 14 Apr 2008, 14:36:51 UTC

I must confess I am slacking, only reporting every 6 hours or so at the minute.

I do apologise from the heart of my bottom to anyone who is waiting for me to report a work unit.

I guess having a large cache is a good idea at the minute. I feel sorry for the guys: they work their a**es off, yet it seems to be 1 step forward, 2 steps back at the minute. They must get so frustrated in that office.
ID: 738951 · Report as offensive
zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 66445
Credit: 55,293,173
RAC: 49
United States
Message 738970 - Posted: 14 Apr 2008, 15:44:51 UTC - in response to Message 738951.  
Last modified: 14 Apr 2008, 15:45:46 UTC

I must confess I am slacking, only reporting every 6 hours or so at the minute.

I do apologize from the heart of my bottom to anyone who is waiting for me to report a work unit.

I guess having a large cache is a good idea at the minute. I feel sorry for the guys: they work their a**es off, yet it seems to be 1 step forward, 2 steps back at the minute. They must get so frustrated in that office.

Nah, don't sweat it, Satan :). After all, you're the Demon w/ the Pitchfork. ;)

Nice dog there too!
Savoir-Faire is everywhere!
The T1 Trust, T1 Class 4-4-4-4 #5550, America's First HST

ID: 738970 · Report as offensive
SATAN
Joined: 27 Aug 06
Posts: 835
Credit: 2,129,006
RAC: 0
United Kingdom
Message 738986 - Posted: 14 Apr 2008, 16:02:50 UTC

Cheers joker, that's mother dog Tessa.

I've swapped the pitchfork for a Jack Knife, a bit more modern.
ID: 738986 · Report as offensive
Keith T.
Volunteer tester
Joined: 23 Aug 99
Posts: 962
Credit: 537,293
RAC: 9
United Kingdom
Message 739009 - Posted: 14 Apr 2008, 17:09:31 UTC

Front Page News

April 14, 2008
We continue to have workunit storage server problems but hope to have the machine replaced today (but may not have it on line until tomorrow). Until this new system is in place workunit downloads will be disabled.


Good thing I have plenty of WUs on my other projects, as I am down to my last WU on SETI and SETI Beta.

I'm still hoping to reach 50k for SETI before the end of the month!
Sir Arthur C Clarke 1917-2008
ID: 739009 · Report as offensive
Dr Who Fan
Volunteer tester
Joined: 8 Jan 01
Posts: 3373
Credit: 715,342
RAC: 4
United States
Message 739062 - Posted: 14 Apr 2008, 19:32:21 UTC

Also see Matt's post titled "Moribund Monday (Apr 14 2008)" in the Technical News forum:
Continuing problems with the workunit storage server... There were more resets over the weekend, ultimately resulting in one that caused the server to think enough drives have failed to call the entire RAID dead. We are confident we can trick the server into thinking otherwise - we actually have some helpful techs logged in doing that as I type. We still want to replace the whole box, which we'll hopefully do today, and then the drives will have to resync again. Chances are we'll be down until tomorrow (Tuesday).

So while we are down we'll try to catch up on several things. Moving servers around the closet, incorporating the new drive enclosure that arrived today, getting more stuff on the new KVM, etc.

- Matt

ID: 739062 · Report as offensive
hiamps
Volunteer tester
Joined: 23 May 99
Posts: 4292
Credit: 72,971,319
RAC: 0
United States
Message 739070 - Posted: 14 Apr 2008, 19:56:33 UTC

I seem to have 1 upload and 1 download that don't want to work right now...Hope it doesn't become long term...
Official Abuser of Boinc Buttons...
And no good credit hound!
ID: 739070 · Report as offensive
John Clark
Volunteer tester
Joined: 29 Sep 99
Posts: 16515
Credit: 4,418,829
RAC: 0
United Kingdom
Message 739093 - Posted: 14 Apr 2008, 20:57:27 UTC - in response to Message 739070.  

I seem to have 1 upload and 1 download that don't want to work right now...Hope it doesn't become long term...



It will continue until at least Tuesday (see the quote below, lifted from the post Matt L made):

Continuing problems with the workunit storage server... There were more resets over the weekend, ultimately resulting in one that caused the server to think enough drives have failed to call the entire RAID dead. We are confident we can trick the server into thinking otherwise - we actually have some helpful techs logged in doing that as I type. We still want to replace the whole box, which we'll hopefully do today, and then the drives will have to resync again. Chances are we'll be down until tomorrow (Tuesday).

So while we are down we'll try to catch up on several things. Moving servers around the closet, incorporating the new drive enclosure that arrived today, getting more stuff on the new KVM, etc.

- Matt

It's good to be back amongst friends and colleagues



ID: 739093 · Report as offensive
littlegreenmanfrommars
Volunteer tester
Joined: 28 Jan 06
Posts: 1410
Credit: 934,158
RAC: 0
Australia
Message 739098 - Posted: 14 Apr 2008, 21:17:12 UTC

OH,
So it's a server problem?

*panic off*
ID: 739098 · Report as offensive
hiamps
Volunteer tester
Joined: 23 May 99
Posts: 4292
Credit: 72,971,319
RAC: 0
United States
Message 739100 - Posted: 14 Apr 2008, 21:25:07 UTC - in response to Message 739093.  

I seem to have 1 upload and 1 download that don't want to work right now...Hope it doesn't become long term...



It will continue until at least Tuesday (see the quote below, lifted from the post Matt L made):

Continuing problems with the workunit storage server... There were more resets over the weekend, ultimately resulting in one that caused the server to think enough drives have failed to call the entire RAID dead. We are confident we can trick the server into thinking otherwise - we actually have some helpful techs logged in doing that as I type. We still want to replace the whole box, which we'll hopefully do today, and then the drives will have to resync again. Chances are we'll be down until tomorrow (Tuesday).

So while we are down we'll try to catch up on several things. Moving servers around the closet, incorporating the new drive enclosure that arrived today, getting more stuff on the new KVM, etc.

- Matt


Sorry folks, but it is my fault. Got one of my problem machines fixed and was going to get it online today...This seems to happen every time I build a new machine...or fix one....
Official Abuser of Boinc Buttons...
And no good credit hound!
ID: 739100 · Report as offensive
Fred J. Verster
Volunteer tester
Joined: 21 Apr 04
Posts: 3252
Credit: 31,903,643
RAC: 0
Netherlands
Message 739101 - Posted: 14 Apr 2008, 21:27:17 UTC - in response to Message 739093.  

I seem to have 1 upload and 1 download that don't want to work right now...Hope it doesn't become long term...



It will continue until at least Tuesday (see the quote below, lifted from the post Matt L made):

Continuing problems with the workunit storage server... There were more resets over the weekend, ultimately resulting in one that caused the server to think enough drives have failed to call the entire RAID dead. We are confident we can trick the server into thinking otherwise - we actually have some helpful techs logged in doing that as I type. We still want to replace the whole box, which we'll hopefully do today, and then the drives will have to resync again. Chances are we'll be down until tomorrow (Tuesday).

So while we are down we'll try to catch up on several things. Moving servers around the closet, incorporating the new drive enclosure that arrived today, getting more stuff on the new KVM, etc.

- Matt


Hello Matt, thanks for the INput, hope you get server OUTput without too much hassle. ;)
We will 'survive' the weekly outage. Looks like a lot of work.


ID: 739101 · Report as offensive


 