Panic Mode On (85) Server Problems?

Message boards : Number crunching : Panic Mode On (85) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 3 · 4 · 5 · 6 · 7 · 8 · 9 . . . 23 · Next

AuthorMessage
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14654
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1406812 - Posted: 23 Aug 2013, 9:40:52 UTC - in response to Message 1406809.  

Hi folks,

Usually the servers perform a resend for an inconclusive result within minutes of it being detected. This one was marked as inconclusive but has not been sent out again after 4 hours:

http://setiathome.berkeley.edu/workunit.php?wuid=1304318565

Are we running into problems here ?

Tom

No, I don't think so. When a new task is created - which happens pretty much instantly when a validation fails like that - it goes to the back of the queue for sending out.

Your replacement 3127660152 was created at 4:59:31 UTC, 14 seconds after the quorum was complete.

My most recently allocated task was 3127602041, created 3:59:39 UTC, so the queue is currently about 5.5 hours long. So there's about an hour, and 58,111 task ID numbers, between your task and mine. We should reach it soon.

I think the queue is just moving slowly today because there are very few shorties in the mix.
ID: 1406812 · Report as offensive
Profile Uli
Volunteer tester
Avatar

Send message
Joined: 6 Feb 00
Posts: 10923
Credit: 5,996,015
RAC: 1
Germany
Message 1407125 - Posted: 24 Aug 2013, 0:16:06 UTC

About an hour and a half ago, one of my computers spewed 142 WUs into the abandonment. Sorry wingman. Those puppies are still on my crunch list. What should I do?
Pluto will always be a planet to me.

Seti Ambassador
Not to late to order an Anni Shirt
ID: 1407125 · Report as offensive
Andre Howard
Volunteer tester
Avatar

Send message
Joined: 16 May 99
Posts: 124
Credit: 217,463,217
RAC: 0
United States
Message 1407130 - Posted: 24 Aug 2013, 0:42:09 UTC - in response to Message 1407125.  

Reset the project, no credit will be granted if you crunch them.

ID: 1407130 · Report as offensive
Profile Uli
Volunteer tester
Avatar

Send message
Joined: 6 Feb 00
Posts: 10923
Credit: 5,996,015
RAC: 1
Germany
Message 1407163 - Posted: 24 Aug 2013, 3:51:29 UTC

Thanks Andre, that is just what I did. I was rewarded with twosies and threesis, so it turned out well. So far so good.
Pluto will always be a planet to me.

Seti Ambassador
Not to late to order an Anni Shirt
ID: 1407163 · Report as offensive
Speedy
Volunteer tester
Avatar

Send message
Joined: 26 Jun 04
Posts: 1643
Credit: 12,921,799
RAC: 89
New Zealand
Message 1407212 - Posted: 24 Aug 2013, 8:29:52 UTC - in response to Message 1406809.  

Hi folks,

Usually the servers perform a resend for an inconclusive result within minutes of it being detected. This one was marked as inconclusive but has not been sent out again after 4 hours:

http://setiathome.berkeley.edu/workunit.php?wuid=1304318565

Are we running into problems here ?

Tom

Task re-issued at 16 Oct 2013, 7:59:54 UTC
ID: 1407212 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14654
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1407229 - Posted: 24 Aug 2013, 9:31:47 UTC - in response to Message 1407212.  

Hi folks,

Usually the servers perform a resend for an inconclusive result within minutes of it being detected. This one was marked as inconclusive but has not been sent out again after 4 hours:

http://setiathome.berkeley.edu/workunit.php?wuid=1304318565

Are we running into problems here ?

Tom

Task re-issued at 16 Oct 2013, 7:59:54 UTC

October ?? !! I must have overslept...
ID: 1407229 · Report as offensive
Speedy
Volunteer tester
Avatar

Send message
Joined: 26 Jun 04
Posts: 1643
Credit: 12,921,799
RAC: 89
New Zealand
Message 1407253 - Posted: 24 Aug 2013, 10:58:17 UTC - in response to Message 1407229.  

Hi folks,

Usually the servers perform a resend for an inconclusive result within minutes of it being detected. This one was marked as inconclusive but has not been sent out again after 4 hours:

http://setiathome.berkeley.edu/workunit.php?wuid=1304318565

Are we running into problems here ?

Tom

Task re-issued at 16 Oct 2013, 7:59:54 UTC

October ?? !! I must have overslept...

Yes just the little bit Richard :) that was the report date and time the reissue date and time is 23 Aug 2013, 10:33:10 UTC
ID: 1407253 · Report as offensive
David S
Volunteer tester
Avatar

Send message
Joined: 4 Oct 99
Posts: 18352
Credit: 27,761,924
RAC: 12
United States
Message 1407972 - Posted: 26 Aug 2013, 16:16:53 UTC

The SSP server list shows the Boinc replica database on carolyn to be running, but to the right it shows the replica offline. Is this a problem worth worrying about?

David
Sitting on my butt while others boldly go,
Waiting for a message from a small furry creature from Alpha Centauri.

ID: 1407972 · Report as offensive
Speedy
Volunteer tester
Avatar

Send message
Joined: 26 Jun 04
Posts: 1643
Credit: 12,921,799
RAC: 89
New Zealand
Message 1408163 - Posted: 27 Aug 2013, 0:27:33 UTC - in response to Message 1407972.  

The SSP server list shows the Boinc replica database on carolyn to be running, but to the right it shows the replica offline. Is this a problem worth worrying about?

The database has been turned back on when I looked at was 157,000 seconds behind. In answer to your question I don't believe this of anything to worry about
ID: 1408163 · Report as offensive
Profile Fred E.
Volunteer tester

Send message
Joined: 22 Jul 99
Posts: 768
Credit: 24,140,697
RAC: 0
United States
Message 1408424 - Posted: 27 Aug 2013, 19:18:00 UTC
Last modified: 27 Aug 2013, 19:26:28 UTC

Brings back some unpleasant menories:

SETI@home 8/27/2013 1:52:38 PM Scheduler request failed: HTTP internal server error
and
SETI@home 8/27/2013 2:04:42 PM Scheduler request failed: Timeout was reached

(times are UTC-5)

Looks like my completions were reported and I have some ghosts. Have to dig out my old playbook for scheduler problems. But it will probably clear up after the rush.

Edit: connected 20 minutes later and got the first batch of ghosts, so it was a temporary problem.
Another Fred
Support SETI@home when you search the Web with GoodSearch or shop online with GoodShop.
ID: 1408424 · Report as offensive
.clair.

Send message
Joined: 4 Nov 04
Posts: 1300
Credit: 55,390,408
RAC: 69
United Kingdom
Message 1408996 - Posted: 28 Aug 2013, 20:31:21 UTC

The ntpckr`s are taking over the SSP
there`s 24 of them !!
Hope they dont turn into zombies or the colo staff have had it :)
ID: 1408996 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22235
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1409003 - Posted: 28 Aug 2013, 20:39:39 UTC

And to think that it wasn't too long ago that folks were observing that the ntpckrs weren't doing anything!
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1409003 · Report as offensive
tbret
Volunteer tester
Avatar

Send message
Joined: 28 May 99
Posts: 3380
Credit: 296,162,071
RAC: 40
United States
Message 1409021 - Posted: 28 Aug 2013, 21:32:29 UTC - in response to Message 1409003.  
Last modified: 28 Aug 2013, 21:33:54 UTC

And to think that it wasn't too long ago that folks were observing that the ntpckrs weren't doing anything!


Yes, and I am loving this. Absolutely loving this.

<EDIT: This is better news than anything I can think-of INCLUDING the great reliability of the colocation's working and fatter data pipe. THIS is what we've been crunching-for.>
ID: 1409021 · Report as offensive
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 1409027 - Posted: 28 Aug 2013, 22:02:39 UTC - in response to Message 1409021.  

THIS is what we've been crunching-for.

Contact with the Mu Araens, you mean? Referencing the server names. :-)
ID: 1409027 · Report as offensive
Profile Uli
Volunteer tester
Avatar

Send message
Joined: 6 Feb 00
Posts: 10923
Credit: 5,996,015
RAC: 1
Germany
Message 1409118 - Posted: 29 Aug 2013, 1:58:56 UTC

LOL the 15th anni of Seti is coming up. So if we want a chance at the Nobel price we better get chugging before the crew reaches retirement age. As that is for most over 15 yrs in the future. Hey some of us are on limited time.
Just glad to see progress. Now if someone could enlighten us, would be a +1.
Pluto will always be a planet to me.

Seti Ambassador
Not to late to order an Anni Shirt
ID: 1409118 · Report as offensive
tbret
Volunteer tester
Avatar

Send message
Joined: 28 May 99
Posts: 3380
Credit: 296,162,071
RAC: 40
United States
Message 1409131 - Posted: 29 Aug 2013, 2:30:38 UTC - in response to Message 1409027.  

THIS is what we've been crunching-for.

Contact with the Mu Araens, you mean? Referencing the server names. :-)


Clever. I like it. Do you think the name will stick?
ID: 1409131 · Report as offensive
Thomas
Volunteer tester

Send message
Joined: 9 Dec 11
Posts: 1499
Credit: 1,345,576
RAC: 0
France
Message 1409183 - Posted: 29 Aug 2013, 5:43:02 UTC - in response to Message 1409118.  

Yes uli.
Maybe Matt we will post information in Technical News.
Fingers crossed ;)
ID: 1409183 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34980
Credit: 261,360,520
RAC: 489
Australia
Message 1409223 - Posted: 29 Aug 2013, 7:46:56 UTC - in response to Message 1409183.  

Yes uli.
Maybe Matt we will post information in Technical News.
Fingers crossed ;)


Matt posted about the "muarae" servers back here,

Server-news-wise, we did acquire another donated box - a 3U monster that actually contains four motherboards, each with 2 hexa-core Xeon CPUs and 72GB of memory, and 3 SATA drives. Despite being in one box, they are four distinct machines: muarae1, muarae2, muarae3, and muarae4. You may have noticed (or not) that muarae1 has already been employed to replace thinman as the main SETI@home web site server. We hope to retire thinman soon, if only because it is physically too large by today's standards (3U, 4 cpus, 28GB) and thus costing us too much money (as the colocation facility charges us by the rack space unit). It is also too deep for its current rack by a couple inches and hindering air flow. The plans for the remaining muaraes are still being debated. Eric is already using another as a GALFA compute server. By the way, as I write this thinman is still around and getting web hits from the few people/robots out there that have IP addresses hard wired or really stubborn DNS caches.


Obviously we can see what decision was made with muarae2 and muarae3 to do now. ;-)

Cheers.
ID: 1409223 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1409608 - Posted: 30 Aug 2013, 1:23:40 UTC

Lost power in my neighborhood for a whole 12 seconds this morning. All of my UPSes are without batteries and has been that way for 8 months. *sigh* Main rig came back up fine, but the splash screen for win7 was up for a really long time (almost four minutes compared to about 20 seconds for a normal boot). And of course along with that, it forgot where all of my icons were located on the desktop. Good thing I randomly thought to take a screenshot of my desktop 3 months ago.. I was able to put everything back where it belongs.

Went into the other room and hit the power button on the single core machine and went back to bed. Couple hours later I heard the CPU fan was still at max speed and turned the monitor on.. "no signal." Press NumLock on the keyboard.. nothing. Flipped the rocker switch on the PSU and tried again.. nothing. Doesn't POST.

Hm. I'll poke at it more tomorrow and figure out what died. HDD light comes on and then goes off ~30 seconds later, but no lights/response from the keyboard, and no video output, and no beep. Hope it's not the board, but it could possibly be. At least it doesn't have hundreds of MBs being tied up.. just 14 of them. I know it doesn't much matter, but if I can't get anything to respond, I'll figure out how to release them back into the wild.

That machine doesn't really do much at all. Just kind of crunches away slowly in the background and is a WSUS server. That's about it.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1409608 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13755
Credit: 208,696,464
RAC: 304
Australia
Message 1409656 - Posted: 30 Aug 2013, 5:13:21 UTC - in response to Message 1409608.  

All of my UPSes are without batteries and has been that way for 8 months. *sigh*

I you've got the funds, just do what i did & replace them with some cheap car batteries. Much longer up time.

Grant
Darwin NT
ID: 1409656 · Report as offensive
Previous · 1 . . . 3 · 4 · 5 · 6 · 7 · 8 · 9 . . . 23 · Next

Message boards : Number crunching : Panic Mode On (85) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.