Panic Mode On (102) Server Problems?

Message boards : Number crunching : Panic Mode On (102) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 11 · 12 · 13 · 14 · 15 · 16 · 17 . . . 25 · Next

AuthorMessage
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1775123 - Posted: 30 Mar 2016, 22:57:14 UTC - in response to Message 1775119.  
Last modified: 30 Mar 2016, 23:15:18 UTC

I am a little confused here. In the past when the tasks page for my computer was not indicating what I actually had for tasks, I thought the reason was that the replica database was “X” number of seconds behind the master data base and the information for the status page came from the replica. Now my computer’s task page seems current and the replica database on Carolyn appears to be down. Am I mistaken on the source of data for the status page or has there been a change of the source from Carolyn to Oscar?

They've probably re-jigged it that way as a temporary measure, because I read a suggestion that Carolyn had suffered a hardware failure and they may be giving themselves more time to ship in spare parts. The servers can work through a backlog by themselves overnight: I don't think they've learned how to fix their own hardware yet.

Edit - or maybe they had a spare in stock. Carolyn is back up and running, though the replica database is still offline.
ID: 1775123 · Report as offensive
Profile jason_gee
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 24 Nov 06
Posts: 7489
Credit: 91,093,184
RAC: 0
Australia
Message 1775128 - Posted: 31 Mar 2016, 0:02:45 UTC - in response to Message 1775123.  

LoL, has been hard to tell what isn't broken, with my ISP being really sketchy over the last week. Fingers crossed the fires at Berkeley are all out, before I get mine put out here...
"Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions.
ID: 1775128 · Report as offensive
Profile Jimbocous Project Donor
Volunteer tester
Avatar

Send message
Joined: 1 Apr 13
Posts: 1853
Credit: 268,616,081
RAC: 1,349
United States
Message 1775130 - Posted: 31 Mar 2016, 0:13:23 UTC - in response to Message 1775123.  

- or maybe they had a spare in stock. Carolyn is back up and running, though the replica database is still offline.

Or maybe they took the replica and made it prime until Carolyn can be repaired?
ID: 1775130 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1775173 - Posted: 31 Mar 2016, 3:33:07 UTC - in response to Message 1775130.  

- or maybe they had a spare in stock. Carolyn is back up and running, though the replica database is still offline.

Or maybe they took the replica and made it prime until Carolyn can be repaired?

They may have before they left for the day over there as I'm now getting updated task pages (that usually come off Carolyn), but that will usually over stress Oscar (though Oscar only running at about 200 queries/sec above normal ATM).

Cheers.
ID: 1775173 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1775216 - Posted: 31 Mar 2016, 6:46:07 UTC

I wonder if:

centurion: Intel Server (2 x hexa-core 3.4GHz Xeon, 512 GB RAM)

might not be a replacement master database in the making to get NTPKR underway?

Cheers.
ID: 1775216 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1775304 - Posted: 31 Mar 2016, 14:57:46 UTC - in response to Message 1775216.  
Last modified: 31 Mar 2016, 14:58:52 UTC

I wonder if:

centurion: Intel Server (2 x hexa-core 3.4GHz Xeon, 512 GB RAM)

might not be a replacement master database in the making to get NTPKR underway?

Cheers.

http://setiathome.berkeley.edu/forum_thread.php?id=78899&postid=1774702#1774702

They previously indicated, in one of the youtube videos, that they are working on doing NTPKR via cloud computing using amazons services.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1775304 · Report as offensive
OTS
Volunteer tester

Send message
Joined: 6 Jan 08
Posts: 369
Credit: 20,533,537
RAC: 0
United States
Message 1775389 - Posted: 31 Mar 2016, 19:37:10 UTC

Replica DB is back on line, but 45+ hours behind the master at the moment.
ID: 1775389 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1775570 - Posted: 1 Apr 2016, 15:36:48 UTC - in response to Message 1768816.  

Monthly update.

Feels like a quiet month. We saw 19 new tapes from 2015, and seemingly finished off the December shipment from Arecibo. But we knocked off 102 reprocessing tapes from 2010: here's the state of play.

Recorded	TOTAL		Processed with		Processed with 
				SaH v7/8 (since		Sah v6 only
				launch June 2013)	(derived)

2007		 350		   4			 346
2008		 916		 874			  42
2009		 548		 456			  92
2010		 762		 261			 501
2011		1148		1082			  66
2012		 846		 819			  27
2013		 590		 585			   5
2014		 260		 260			 n/a
2015		 311		 311			 n/a

Grand total	5731		4652			1079
ID: 1775570 · Report as offensive
Filipe

Send message
Joined: 12 Aug 00
Posts: 218
Credit: 21,281,677
RAC: 20
Portugal
Message 1775572 - Posted: 1 Apr 2016, 15:42:03 UTC

Thank you for the update Richard ;)
ID: 1775572 · Report as offensive
Profile morpheus
Avatar

Send message
Joined: 5 Jun 99
Posts: 71
Credit: 52,480,762
RAC: 33
Germany
Message 1776763 - Posted: 7 Apr 2016, 1:47:57 UTC
Last modified: 7 Apr 2016, 1:55:17 UTC

Oh oh... Server status page doesn't look so well.
At least for me.

PS: Hmm... Okay, looks like it's better now.
.:morpheus:.
ID: 1776763 · Report as offensive
Profile KWSN THE Holy Hand Grenade!
Volunteer tester
Avatar

Send message
Joined: 20 Dec 05
Posts: 3187
Credit: 57,163,290
RAC: 0
United States
Message 1777201 - Posted: 8 Apr 2016, 17:05:21 UTC

Not sure if this is related, but the stats export has been down for 4 days...
.

Hello, from Albany, CA!...
ID: 1777201 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1777204 - Posted: 8 Apr 2016, 17:10:27 UTC - in response to Message 1777201.  

Not sure if this is related, but the stats export has been down for 4 days...

No, it has not....
Stats dump log...
And Boincstats has been reporting my daily progress with no problems.
Must be the site you are looking at, not Seti's lack of doing the stats dump.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1777204 · Report as offensive
Speedy
Volunteer tester
Avatar

Send message
Joined: 26 Jun 04
Posts: 1643
Credit: 12,921,799
RAC: 89
New Zealand
Message 1777381 - Posted: 9 Apr 2016, 1:04:20 UTC

There are currently 7 channels in progress. Am I correct in saying that As of 9 Apr 2016, 1:00:05 UTC There are 4 splitters working on 28no10aa (9)?The reason why I come to this conclusion is because there is only 4 channels been split all together.
03au10aa(1)
28no10aa(9) tape in question
30au10ab(5)
30jn10ad(3)
ID: 1777381 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13732
Credit: 208,696,464
RAC: 304
Australia
Message 1777392 - Posted: 9 Apr 2016, 1:39:12 UTC - in response to Message 1777381.  

There are currently 7 channels in progress. Am I correct in saying that As of 9 Apr 2016, 1:00:05 UTC There are 4 splitters working on 28no10aa (9)?The reason why I come to this conclusion is because there is only 4 channels been split all together.
03au10aa(1)
28no10aa(9) tape in question
30au10ab(5)
30jn10ad(3)

It's been that way, on and off, for several months now.
For some reason several splitters will end up on one channel instead of one splitter per channel.
Grant
Darwin NT
ID: 1777392 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22190
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1777468 - Posted: 9 Apr 2016, 7:21:59 UTC

...one group of splitters get lonely when working on their own, so they gather together on one tape for a bit of networking time ;-)
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1777468 · Report as offensive
Speedy
Volunteer tester
Avatar

Send message
Joined: 26 Jun 04
Posts: 1643
Credit: 12,921,799
RAC: 89
New Zealand
Message 1777481 - Posted: 9 Apr 2016, 8:44:08 UTC - in response to Message 1777392.  

There are currently 7 channels in progress. Am I correct in saying that As of 9 Apr 2016, 1:00:05 UTC There are 4 splitters working on 28no10aa (9)?The reason why I come to this conclusion is because there is only 4 channels been split all together.
03au10aa(1)
28no10aa(9) tape in question
30au10ab(5)
30jn10ad(3)

It's been that way, on and off, for several months now.
For some reason several splitters will end up on one channel instead of one splitter per channel.

Thanks Grant for explaining. Looks like things are getting a bit better. I wonder if the tape actually get split quicker with 2 splitters on it
ID: 1777481 · Report as offensive
Speedy
Volunteer tester
Avatar

Send message
Joined: 26 Jun 04
Posts: 1643
Credit: 12,921,799
RAC: 89
New Zealand
Message 1777482 - Posted: 9 Apr 2016, 8:44:56 UTC - in response to Message 1777468.  

...one group of splitters get lonely when working on their own, so they gather together on one tape for a bit of networking time ;-)

I like the way you think Rob :)
ID: 1777482 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1777573 - Posted: 9 Apr 2016, 19:33:17 UTC - in response to Message 1777468.  

...one group of splitters get lonely when working on their own, so they gather together on one tape for a bit of networking time ;-)

That does have some merit (even if in a joking manner). I suppose I should actually read the book for I, Robot one of these days.. but at least in the movie, there was that brief mention that there are ghosts in the coding, whereby if you leave a group of bots alone in isolation, you'll find that some of them will cluster together in a corner, rather than being uniformly-spaced apart.

So splitters getting lonely and deciding to work together one one tape instead of their own tape has at least some merit...
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1777573 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13732
Credit: 208,696,464
RAC: 304
Australia
Message 1777605 - Posted: 9 Apr 2016, 21:27:44 UTC - in response to Message 1777573.  

So splitters getting lonely and deciding to work together one one tape instead of their own tape has at least some merit...

Only problems with that being
1 multiple spitters on the 1 tape tended to bring down the splitting rate. With the reduced load of v8, it's no longer an issue, but give it time with new classes of GPU on the way...
2 with several splitters on the one file, you tend to get more of the same type of work, ie shorties or VLARs. With 1 splitter to a file you tend to get less of one type of WU and a better spread of work so VLAR or shortie storms are much less likely. And when they do occur, not as severe.
Grant
Darwin NT
ID: 1777605 · Report as offensive
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 1778185 - Posted: 11 Apr 2016, 20:59:03 UTC

Monday must be the new Tuesday.

Weekly Outage and Initial Catch Up
Every Monday morning (Pacific time) we begin a four hour data distribution outage for database and systems maintenance. The upload/download servers will be offline during this time. Afterwards you may experience connectivity issues for several more hours as the servers catch up with demand. 11 Apr 2016, 15:34:47 UTC

ID: 1778185 · Report as offensive
Previous · 1 . . . 11 · 12 · 13 · 14 · 15 · 16 · 17 . . . 25 · Next

Message boards : Number crunching : Panic Mode On (102) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.