Panic Mode On (10) Server problems

Message boards : Number crunching : Panic Mode On (10) Server problems
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · 6 . . . 13 · Next

AuthorMessage
Profile Allie in Vancouver
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 3949
Credit: 1,604,668
RAC: 0
Canada
Message 818963 - Posted: 15 Oct 2008, 22:53:58 UTC - in response to Message 818957.  

Remember that there are more than 800,000 participants, and less than 4,300 who have posted at least one message this year.


so everyone here has to do the panicing for 186 silent crunchers...

:)


Pure mathematics is, in its way, the poetry of logical ideas.

Albert Einstein
ID: 818963 · Report as offensive
BarryAZ

Send message
Joined: 1 Apr 01
Posts: 2580
Credit: 16,982,517
RAC: 0
United States
Message 818992 - Posted: 15 Oct 2008, 23:33:19 UTC - in response to Message 818928.  

Indeed we've gone thru worse and no doubt will have worse problems in the future -- that is part of the concept behind multiple project processing. There was a time when SETI was my only project --- I've joined 10 over the years, with two of them being defunct (BBC Climate which shut down with good warning and explanation, and Predictor which has been a very bad citizen in the BOINC world). These days I still do processing with the early set (SETI, then Einstein, then Climate, then World Grid), but when I started encountering multiple project outages of one form or another, I added Rosetta, Spinhenge, Malaria and most recently MilkyWay).

So when SETI demonstrates a bit of ongoing crankiness, it is a relatively simple task to adjust the cycles.

ID: 818992 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 818994 - Posted: 15 Oct 2008, 23:39:59 UTC - in response to Message 818992.  

Indeed we've gone thru worse and no doubt will have worse problems in the future -- that is part of the concept behind multiple project processing.

True, but that isn't the only tool in the BOINC arsenal that is designed to deal with outages.

If you're worried about an outage, just carry a little bit of extra cache, 3 or 4 days max. will cover all but the worst outages.

I quit changing settings to deal with outages years ago. I sit back and watch as everyone gets all excited, suspends network activity, shuts down, and even talks about finding projects that "have their act together" and sure enough, when the servers come back, BOINC does its' job, reports work, and continues on as if nothing unusual happened.

... and you know something: what happened isn't unusual. BOINC is designed so that a project can have less than 99.999% reliability and still crunch along perfectly.

I crunch SETI and SETI Beta -- and that's all.
ID: 818994 · Report as offensive
Profile Vipin Palazhi
Avatar

Send message
Joined: 29 Feb 08
Posts: 286
Credit: 167,386,578
RAC: 0
India
Message 819154 - Posted: 16 Oct 2008, 5:12:24 UTC
Last modified: 16 Oct 2008, 5:18:29 UTC

Since early this morning, I have been experiencing trouble accessing the seti@home webpages. Whenever I tried to open my account page, it came up with a page with just the heading "Your Account" and the logo. The rest of the page would be blank. The home page still returns HTTP 500 error.

I finally accessed the account page through google and I now notice that the link for the account page is Your_Account (with an underscore). The workunit uploads and downloads are normal.

Since I am not able to access most of the other pages, I have no clue whether this is an isolated incident or if others are facing this too. Does anyone have any information?

Edit: Finally managed to get into the webpage and this is what I get. You will notice that all the links are different now.

Vipin
ID: 819154 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51477
Credit: 1,018,363,574
RAC: 1,004
United States
Message 819157 - Posted: 16 Oct 2008, 5:21:50 UTC - in response to Message 819154.  

Since early this morning, I have been experiencing trouble accessing the seti@home webpages. Whenever I tried to open my account page, it came up with a page with just the heading "Your Account" and the logo. The rest of the page would be blank. The home page still returns HTTP 500 error.

I finally accessed the account page through google and I now notice that the link for the account page is Your_Account (with an underscore). The workunit uploads and downloads are normal.

Since I am not able to access most of the other pages, I have no clue whether this is an isolated incident or if others are facing this too. Does anyone have any information?

Edit: Finally managed to get into the webpage and this is what I get. You will notice that all the links are different now.

Vipin

Not just you.......the webpage code is being worked on............

"Time is simply the mechanism that keeps everything from happening all at once."

ID: 819157 · Report as offensive
BarryAZ

Send message
Joined: 1 Apr 01
Posts: 2580
Credit: 16,982,517
RAC: 0
United States
Message 819160 - Posted: 16 Oct 2008, 5:26:18 UTC - in response to Message 818994.  

I agree that there are multiple approaches here -- the thing with running longer caches -- some projects don't like that, others do. If you are only running SETI, than the longer caches and the don't bother with changes is a reasonable approach.

I run a mix of projects, which originally was to cope with outages that often enough extended thru weekends or longer -- with multiple projects I had active coverage. The downside there is that doing that requires a bit more attending to things. I do have a bunch of offsite workstations -- for them, I have them set up for 'low maintenance' projects (Climate, Spinhenge for example). SETI is not a low maintenance project for me, so I've got it loaded on workstations I can readily get a hold of.




True, but that isn't the only tool in the BOINC arsenal that is designed to deal with outages.

If you're worried about an outage, just carry a little bit of extra cache, 3 or 4 days max. will cover all but the worst outages.

... and you know something: what happened isn't unusual. BOINC is designed so that a project can have less than 99.999% reliability and still crunch along perfectly.

I crunch SETI and SETI Beta -- and that's all.


ID: 819160 · Report as offensive
Profile Dr. C.E.T.I.
Avatar

Send message
Joined: 29 Feb 00
Posts: 16019
Credit: 794,685
RAC: 0
United States
Message 819162 - Posted: 16 Oct 2008, 5:28:00 UTC


. . . Nice work on THAT fix Berkeley - Keep up the great work guys -

iT is Sincerely Appreciated . . .


BOINC Wiki . . .

Science Status Page . . .
ID: 819162 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 66202
Credit: 55,293,173
RAC: 49
United States
Message 819188 - Posted: 16 Oct 2008, 6:02:34 UTC - in response to Message 819157.  

Since early this morning, I have been experiencing trouble accessing the seti@home webpages. Whenever I tried to open my account page, it came up with a page with just the heading "Your Account" and the logo. The rest of the page would be blank. The home page still returns HTTP 500 error.

I finally accessed the account page through Google and I now notice that the link for the account page is Your_Account (with an underscore). The workunit uploads and downloads are normal.

Since I am not able to access most of the other pages, I have no clue whether this is an isolated incident or if others are facing this too. Does anyone have any information?

Edit: Finally managed to get into the webpage and this is what I get. You will notice that all the links are different now.

Vipin

Not just you.......the webpage code is being worked on............

Oh so that's what It was. I was afraid that Seti@Home had gone beyond the event horizon. ;)
Savoir-Faire is everywhere!
The T1 Trust, T1 Class 4-4-4-4 #5550, America's First HST

ID: 819188 · Report as offensive
Profile [B^S] madmac
Volunteer tester
Avatar

Send message
Joined: 9 Feb 04
Posts: 1175
Credit: 4,754,897
RAC: 0
United Kingdom
Message 819191 - Posted: 16 Oct 2008, 6:32:54 UTC

I wondered what was going on thank you for the info, I am still having trouble uploading two old Wu's. It is now two days and they still have not uploaded yet the ones after them have, still getting either connect failed or http errors. What is the longest time any one has to wait for WU's to upload?
ID: 819191 · Report as offensive
Profile Uli
Volunteer tester
Avatar

Send message
Joined: 6 Feb 00
Posts: 10923
Credit: 5,996,015
RAC: 1
Germany
Message 819192 - Posted: 16 Oct 2008, 6:35:27 UTC - in response to Message 819191.  

I wondered what was going on thank you for the info, I am still having trouble uploading two old Wu's. It is now two days and they still have not uploaded yet the ones after them have, still getting either connect failed or http errors. What is the longest time any one has to wait for WU's to upload?



Try to push them thru manually. I had to do that a couple of times.
Pluto will always be a planet to me.

Seti Ambassador
Not to late to order an Anni Shirt
ID: 819192 · Report as offensive
Profile [B^S] madmac
Volunteer tester
Avatar

Send message
Joined: 9 Feb 04
Posts: 1175
Credit: 4,754,897
RAC: 0
United Kingdom
Message 819202 - Posted: 16 Oct 2008, 7:21:24 UTC

Just left them alone and they uploaded by themselves after a couple of days.
ID: 819202 · Report as offensive
Profile RandyC
Avatar

Send message
Joined: 20 Oct 99
Posts: 714
Credit: 1,704,345
RAC: 0
United States
Message 819271 - Posted: 16 Oct 2008, 14:52:52 UTC

Anyone noticed that the links at the top of the page are now numbers instead of names (at least as of this posting)?

That's something to really PANIC over!!!!

:^)
ID: 819271 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51477
Credit: 1,018,363,574
RAC: 1,004
United States
Message 819275 - Posted: 16 Oct 2008, 15:00:15 UTC - in response to Message 819271.  

Anyone noticed that the links at the top of the page are now numbers instead of names (at least as of this posting)?

That's something to really PANIC over!!!!

:^)

LOL...not really a panic, just a little pain in the backside......

The Boinc webpage code is being worked on......hopefully it will be fixed during the course of the day........
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 819275 · Report as offensive
Profile zoom3+1=4
Volunteer tester
Avatar

Send message
Joined: 30 Nov 03
Posts: 66202
Credit: 55,293,173
RAC: 49
United States
Message 819311 - Posted: 16 Oct 2008, 16:21:59 UTC - in response to Message 819275.  
Last modified: 16 Oct 2008, 16:22:29 UTC

Anyone noticed that the links at the top of the page are now numbers instead of names (at least as of this posting)?

That's something to really PANIC over!!!!

:^)

LOL...not really a panic, just a little pain in the backside......

The Boinc webpage code is being worked on......hopefully it will be fixed during the course of the day........

I've heard of Kitties being "fixed", I didn't know webpages needed to be fixed so they won't breed out of control.;)

Pretty soon It'll be "Please fix Your webpage to control the webpage population". ;)
Savoir-Faire is everywhere!
The T1 Trust, T1 Class 4-4-4-4 #5550, America's First HST

ID: 819311 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51477
Credit: 1,018,363,574
RAC: 1,004
United States
Message 819314 - Posted: 16 Oct 2008, 16:24:16 UTC - in response to Message 819311.  

Anyone noticed that the links at the top of the page are now numbers instead of names (at least as of this posting)?

That's something to really PANIC over!!!!

:^)

LOL...not really a panic, just a little pain in the backside......

The Boinc webpage code is being worked on......hopefully it will be fixed during the course of the day........

I've heard of Kitties being "fixed", I didn't know webpages needed to be fixed so they won't breed out of control.;)

Pretty soon It'll be "Please fix Your webpage to control the webpage population". ;)

Webpages only need to be fixed when they become 'bad kitties'.....
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 819314 · Report as offensive
BarryAZ

Send message
Joined: 1 Apr 01
Posts: 2580
Credit: 16,982,517
RAC: 0
United States
Message 819452 - Posted: 16 Oct 2008, 22:22:06 UTC - in response to Message 819191.  

You are definitely not alone -- SETI has been having this problem on and off for a couple of days (and has had this problem on and off in the past as well). I still wonder if the issue is really a scaling problem -- SETI is by far the largest of the BOINC projects and has some very able tech folks and reasonably solid hardware -- so perhaps, as I suggested before, it is simply a problem of too much activity. For me, SETI is now in my second tier set of projects and might even drop back from there as I find it a project that (if you have multiple projects) tends to want additional attention on a regular basis. So what I do is move it off to higher maintenance workstations and focus on other projects which are more robust.


I wondered what was going on thank you for the info, I am still having trouble uploading two old Wu's. It is now two days and they still have not uploaded yet the ones after them have, still getting either connect failed or http errors. What is the longest time any one has to wait for WU's to upload?


ID: 819452 · Report as offensive
Profile Byron S Goodgame
Volunteer tester
Avatar

Send message
Joined: 16 Jan 06
Posts: 1145
Credit: 3,936,993
RAC: 0
United States
Message 819623 - Posted: 17 Oct 2008, 10:06:54 UTC

Looks like bruno has been disabled again.
ID: 819623 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14674
Credit: 200,643,578
RAC: 874
United Kingdom
Message 819624 - Posted: 17 Oct 2008, 10:21:55 UTC - in response to Message 819623.  

Looks like bruno has been disabled again.

Hmmmm. I'm beginning to distrust that flag on the Server Status page. It's changed to "disabled" in the last hour, so between 2 a.m. and 3 a.m. Berkeley time. I know the crew are dedicated, but that seems out of tolerance, even for them.

We saw the same flag last weekend, and Matt's comment was:

Hours later the upload server needed a kick as well. Eric discovered that in the morning and got it working again.

I think Bruno has decided to take the weekend off, and set its own 'Gone fishing' flag.
ID: 819624 · Report as offensive
Profile Byron S Goodgame
Volunteer tester
Avatar

Send message
Joined: 16 Jan 06
Posts: 1145
Credit: 3,936,993
RAC: 0
United States
Message 819627 - Posted: 17 Oct 2008, 10:57:47 UTC - in response to Message 819624.  

Looks like bruno has been disabled again.

Hmmmm. I'm beginning to distrust that flag on the Server Status page. It's changed to "disabled" in the last hour, so between 2 a.m. and 3 a.m. Berkeley time. I know the crew are dedicated, but that seems out of tolerance, even for them.[/quote]

I think during those hours the gremlins are on watch, you know how they can be.
ID: 819627 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51477
Credit: 1,018,363,574
RAC: 1,004
United States
Message 819639 - Posted: 17 Oct 2008, 11:41:33 UTC - in response to Message 819627.  
Last modified: 17 Oct 2008, 11:44:08 UTC


I think during those hours the gremlins are on watch, you know how they can be.

Well........
That's because the first thing that seems to go down is the server status pages, so if they are at a standstill, further crap is bound to happen........

They are the canary in the coal mine........so to speak....or squeek.......
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 819639 · Report as offensive
Previous · 1 · 2 · 3 · 4 · 5 · 6 . . . 13 · Next

Message boards : Number crunching : Panic Mode On (10) Server problems


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.