Panic Mode On (28) Server problems

Message boards : Number crunching : Panic Mode On (28) Server problems
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 8 · 9 · 10 · 11 · 12 · 13 · 14 . . . 16 · Next

AuthorMessage
Profile perryjay
Volunteer tester
Avatar

Send message
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 970992 - Posted: 17 Feb 2010, 22:59:34 UTC - in response to Message 970990.  

Matt posted an explaination http://setiathome.berkeley.edu/forum_thread.php?id=58816


PROUD MEMBER OF Team Starfire World BOINC
ID: 970992 · Report as offensive
Fred W
Volunteer tester

Send message
Joined: 13 Jun 99
Posts: 2524
Credit: 11,954,210
RAC: 0
United Kingdom
Message 970995 - Posted: 17 Feb 2010, 23:07:53 UTC

The curious thing is that, although the servers are all back on the cricket graphs have not gone through the roof as one would expect. And uploads are still being bounced immediately although there is no sign of traffic...

F.
ID: 970995 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13832
Credit: 208,696,464
RAC: 304
Australia
Message 970997 - Posted: 17 Feb 2010, 23:13:31 UTC - in response to Message 970995.  

The curious thing is that, although the servers are all back on the cricket graphs have not gone through the roof as one would expect. And uploads are still being bounced immediately although there is no sign of traffic...

In the Tech News it appears a few things got scrambled in the power down; RAID & Database resync/rebuilding going on. Once that clears up, then the rest of it should pick up with a bit of luck.

Grant
Darwin NT
ID: 970997 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 66194
Credit: 55,293,173
RAC: 49
United States
Message 970999 - Posted: 17 Feb 2010, 23:16:33 UTC - in response to Message 970995.  

The curious thing is that, although the servers are all back on the cricket graphs have not gone through the roof as one would expect. And uploads are still being bounced immediately although there is no sign of traffic...

F.

Yeah and that happened before the incident yesterday and before the outage, I would try and upload before the incident and before the outage and I'd get halfway and then nothing. So I wait as I've read what Matt said already. Still It's strange that the A/C just needed to be reset.
Savoir-Faire is everywhere!
The T1 Trust, T1 Class 4-4-4-4 #5550, America's First HST

ID: 970999 · Report as offensive
Rasputin
Volunteer tester

Send message
Joined: 13 Jun 02
Posts: 1764
Credit: 6,132,221
RAC: 0
Russia
Message 971004 - Posted: 17 Feb 2010, 23:39:16 UTC

I haven't been able to upload or download since Monday. Finished all CUDA tasks several hours ago and have about one days worth of 6.03 left. Can't reschedule because there all VLAR's.

Guess I'll use all four cores of the CPU to help offset my idle GPU's. I only use two cores normally.
ID: 971004 · Report as offensive
Rick
Avatar

Send message
Joined: 3 Dec 99
Posts: 79
Credit: 11,486,227
RAC: 0
United States
Message 971006 - Posted: 17 Feb 2010, 23:43:37 UTC

I gave up on SETI when I ran out of tasks and turned everything over to Collatz. I normally only run Collatz in the GPU on my iMac since SETI won't provide GPU tasks. I just shut the HP down until I see transfers working again.
ID: 971006 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 66194
Credit: 55,293,173
RAC: 49
United States
Message 971009 - Posted: 17 Feb 2010, 23:52:01 UTC - in response to Message 971004.  

I haven't been able to upload or download since Monday. Finished all CUDA tasks several hours ago and have about one days worth of 6.03 left. Can't reschedule because there all VLAR's.

Guess I'll use all four cores of the CPU to help offset my idle GPU's. I only use two cores normally.

Me neither, I just get HTTP errors when I try to upload to Seti, I'm waiting until sometime on Thursday before trying to upload again, As somehow I think Seti has not been receiving anything from US like usual lately.
Savoir-Faire is everywhere!
The T1 Trust, T1 Class 4-4-4-4 #5550, America's First HST

ID: 971009 · Report as offensive
John G

Send message
Joined: 29 Dec 01
Posts: 68
Credit: 10,932,850
RAC: 0
Canada
Message 971019 - Posted: 18 Feb 2010, 0:42:38 UTC

I really love this ---- yher astropulse servers are going full tilt with new wu's the seti one are down for the count ----- lol
ID: 971019 · Report as offensive
Roundel

Send message
Joined: 1 Feb 06
Posts: 21
Credit: 6,850,211
RAC: 0
United States
Message 971026 - Posted: 18 Feb 2010, 1:04:13 UTC

I was actually just able to upload 1 (just one) WU
I've been getting Http errors for everything else on every other system.
ID: 971026 · Report as offensive
nemesis
Avatar

Send message
Joined: 12 Oct 99
Posts: 1408
Credit: 35,074,350
RAC: 0
Message 971029 - Posted: 18 Feb 2010, 1:13:21 UTC

it'll probably be friday or early saturday before all the dust settles...

ID: 971029 · Report as offensive
Profile John Fluth

Send message
Joined: 6 Oct 99
Posts: 22
Credit: 164,030,648
RAC: 153
United States
Message 971042 - Posted: 18 Feb 2010, 1:57:47 UTC

Able to submit some tasks about five minutes ago. Did not receive any tasks.
ID: 971042 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 66194
Credit: 55,293,173
RAC: 49
United States
Message 971054 - Posted: 18 Feb 2010, 2:34:33 UTC - in response to Message 971042.  

Able to submit some tasks about five minutes ago. Did not receive any tasks.

I'm glad You could, I couldn't.

2/17/2010 6:32:06 PM		Resuming network activity
2/17/2010 6:32:06 PM	SETI@home	Started upload of 28mr07ac.2932.45510.12.10.91_0_0
2/17/2010 6:32:06 PM	SETI@home	Started upload of 28mr07ac.2932.45510.12.10.139_0_0
2/17/2010 6:32:07 PM	SETI@home	Temporarily failed upload of 28mr07ac.2932.45510.12.10.91_0_0: HTTP error
2/17/2010 6:32:07 PM	SETI@home	Backing off 1 hr 33 min 41 sec on upload of 28mr07ac.2932.45510.12.10.91_0_0
2/17/2010 6:32:11 PM	SETI@home	Temporarily failed upload of 28mr07ac.2932.45510.12.10.139_0_0: HTTP error
2/17/2010 6:32:11 PM	SETI@home	Backing off 3 hr 50 min 31 sec on upload of 28mr07ac.2932.45510.12.10.139_0_0
2/17/2010 6:32:13 PM		Suspending network activity - user request

Savoir-Faire is everywhere!
The T1 Trust, T1 Class 4-4-4-4 #5550, America's First HST

ID: 971054 · Report as offensive
Profile Pappa
Volunteer tester
Avatar

Send message
Joined: 9 Jan 00
Posts: 2562
Credit: 12,301,681
RAC: 0
United States
Message 971068 - Posted: 18 Feb 2010, 3:26:50 UTC

This recovery will take a while! Go read the Tech News!

All things considered it is doing okay.

Regards


Please consider a Donation to the Seti Project.

ID: 971068 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 66194
Credit: 55,293,173
RAC: 49
United States
Message 971071 - Posted: 18 Feb 2010, 3:36:01 UTC - in response to Message 971068.  

This recovery will take a while! Go read the Tech News!

All things considered it is doing okay.

Regards


Yes Oh fearless leader, Er Pappa. I was just posting, Besides I already read It.
Savoir-Faire is everywhere!
The T1 Trust, T1 Class 4-4-4-4 #5550, America's First HST

ID: 971071 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 971084 - Posted: 18 Feb 2010, 4:26:40 UTC - in response to Message 971006.  

I gave up on SETI when I ran out of tasks and turned everything over to Collatz. I normally only run Collatz in the GPU on my iMac since SETI won't provide GPU tasks. I just shut the HP down until I see transfers working again.

There are lots of things we can complain about, like a lack of funding, servers running at high loads, and while I don't think that's justified (you may be upset about a lack of uploads, but BOINC doesn't care), I can understand it.

... but if you have to blame someone, the A/C is maintained by campus facilities, not SETI@Home.

Besides, if you add a project (instead of "giving up") and just let BOINC manage it through resource shares, life will be good.
ID: 971084 · Report as offensive
Profile Pappa
Volunteer tester
Avatar

Send message
Joined: 9 Jan 00
Posts: 2562
Credit: 12,301,681
RAC: 0
United States
Message 971092 - Posted: 18 Feb 2010, 4:51:51 UTC
Last modified: 18 Feb 2010, 4:52:50 UTC

As near as I can "GUESS" there are roughly 1.4+ million results to upload (28+ hours of outage for MB and AP). Or try stuffing about 200 pounds of cooked spagetti in your mouth and swallowing it in a couple of hours (that could cause a PANIC).

Looking at my Logs, the scheduler is still having issues. So while it appears there is work. "Little" is getting out. I did have one machine that "did" manage to get a request through and it got work. That was while I was out in the yard doing things. Go figure, I did not have to push buttons. Boinc worked just as it supposed to.

I suspect that before someone goes to bed to night something will turn on (be restarted) and in the morning things will look brighter.

So yes, "offically" my one day cache failed me. Although I do have other projects that are keeping things warm. This messes up the numbers I have been collecting for a months now for Pendings and RAC. Over the next week or so, they will show the dip and increase...

Patience, as things are sorted.

Regards

I forgot, somewhere in all this I am not Panic'd (I know I am sipposed to be). I am conecerned...

Regards
Please consider a Donation to the Seti Project.

ID: 971092 · Report as offensive
Rasputin
Volunteer tester

Send message
Joined: 13 Jun 02
Posts: 1764
Credit: 6,132,221
RAC: 0
Russia
Message 971101 - Posted: 18 Feb 2010, 5:32:28 UTC

I've read the technical news and it still doesn't explain why some of us haven't been able to connect to the servers since early monday.

Ok, that's not totally correct. I managed to upload half of two different wu's today. LOL Only several hundred more to go.

I think (based on what some people have said here) that there is a problem with Berkeley's internet service.




ID: 971101 · Report as offensive
Rick
Avatar

Send message
Joined: 3 Dec 99
Posts: 79
Credit: 11,486,227
RAC: 0
United States
Message 971111 - Posted: 18 Feb 2010, 6:51:50 UTC

Seem to be getting connected to the scheduler now but just getting "Project has no jobs available"
ID: 971111 · Report as offensive
Profile [B^S] madmac
Volunteer tester
Avatar

Send message
Joined: 9 Feb 04
Posts: 1175
Credit: 4,754,897
RAC: 0
United Kingdom
Message 971118 - Posted: 18 Feb 2010, 7:41:13 UTC

I am getting http errors which means a server going wrong I think this morning. So I will just sit and wait and see if my uploads go up on both my machines. Just got a new one a couple of months ago, so that has a few to upload this one I am typing on has 6 waiting to upload
ID: 971118 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51477
Credit: 1,018,363,574
RAC: 1,004
United States
Message 971119 - Posted: 18 Feb 2010, 7:41:22 UTC
Last modified: 18 Feb 2010, 7:43:43 UTC

I am getting almost nothing but HTTP errors on upload attempts...a few do manage to make it, but very few.
And 'scheduler request failed, couldn't connect to server' when attempting to report those uploads that did squeak through.

Something is obviously still very wrong here....and this problem is happening on all of my rigs.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 971119 · Report as offensive
Previous · 1 . . . 8 · 9 · 10 · 11 · 12 · 13 · 14 . . . 16 · Next

Message boards : Number crunching : Panic Mode On (28) Server problems


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.