Panic Mode On (86) Server Problems?

Message boards : Number crunching : Panic Mode On (86) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 16 · 17 · 18 · 19 · 20 · 21 · 22 . . . 24 · Next

AuthorMessage
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13736
Credit: 208,696,464
RAC: 304
Australia
Message 1480581 - Posted: 22 Feb 2014, 9:51:04 UTC - in response to Message 1480574.  

Here's a new message-

22/02/2014 18:49:44 | SETI@home | Not requesting tasks: some download is stalled

That's a very old message - it's been standard in BOINC for years. We just haven't seen it for a while, since the transfer to the co-lo.

Even prior to that when we had all the download issues, I'd never seen it. When the network traffic finally cleared all the downloads that had accumulated with each scheduler request for work would finally come through.
This is the first time I've ever seen a stalled download block requests for work.
Grant
Darwin NT
ID: 1480581 · Report as offensive
Profile Belthazor
Volunteer tester
Avatar

Send message
Joined: 6 Apr 00
Posts: 219
Credit: 10,373,795
RAC: 13
Russia
Message 1480596 - Posted: 22 Feb 2014, 11:24:50 UTC - in response to Message 1480581.  

yeah, it looks like JBOD is crashed, no data to download or even splitting...
ID: 1480596 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1480599 - Posted: 22 Feb 2014, 11:43:19 UTC - in response to Message 1480596.  

yeah, it looks like JBOD is crashed, no data to download or even splitting...

I find it interesting that it appears to have crashed at very close to midnight, local time.

Looking across all my machines, the work allocations show

7:58:54 UTC - downloaded successfully
8:02:03 UTC - still stuck downloading

That sounds to me like a software problem - a cron job gone wrong - rather than a random hardware failure.

There are no reported failures (yet) at IST.
ID: 1480599 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1480607 - Posted: 22 Feb 2014, 12:16:01 UTC - in response to Message 1480581.  

Here's a new message-

22/02/2014 18:49:44 | SETI@home | Not requesting tasks: some download is stalled

That's a very old message - it's been standard in BOINC for years. We just haven't seen it for a while, since the transfer to the co-lo.

Even prior to that when we had all the download issues, I'd never seen it. When the network traffic finally cleared all the downloads that had accumulated with each scheduler request for work would finally come through.
This is the first time I've ever seen a stalled download block requests for work.

OK, I owe you a fuller explanation than that.

commit 89578050f7a523588031656526e537fc99dab08d
Author: David Anderson <davea@ssl.berkeley.edu> Mon Jul 2 04:43:05 2012
Committer: David Anderson <davea@ssl.berkeley.edu> Mon Jul 2 04:43:05 2012

- When the client makes a scheduler RPC without requesting work,
and there's a simple reason
(e.g. the project is suspended, no-new-tasks, downloads stalled, etc.)
show it in the event lot.
If the reason is more complex, don't try to explain.

(I think he means the event log)

July 2012 means somewhere round about v7.0.31 or .32 - so you won't have seen the message until you upgraded BOINC, probably to your current v7.0.64 - but the behaviour has been in BOINC for years.

It's one of the few - but very welcome - cases where BOINC now gives better information about what's going on than it used to.
ID: 1480607 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1480641 - Posted: 22 Feb 2014, 15:09:45 UTC

Some hosts are now running on SETI GPU WU dryland (i hate the @#!!! 100GPUWU limit), backup project thanks, hope they fix that soon.
ID: 1480641 · Report as offensive
Miklos M.

Send message
Joined: 5 May 99
Posts: 955
Credit: 136,115,648
RAC: 73
Hungary
Message 1480645 - Posted: 22 Feb 2014, 15:47:03 UTC

Thanks to everyone's help, especially Richard, I am baaaaaaaack crunching.
ID: 1480645 · Report as offensive
JohnDK Crowdfunding Project Donor*Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 28 May 00
Posts: 1222
Credit: 451,243,443
RAC: 1,127
Denmark
Message 1480656 - Posted: 22 Feb 2014, 16:22:16 UTC

Is there even anybody in to do anything on a Saturday?
ID: 1480656 · Report as offensive
Profile KWSN THE Holy Hand Grenade!
Volunteer tester
Avatar

Send message
Joined: 20 Dec 05
Posts: 3187
Credit: 57,163,290
RAC: 0
United States
Message 1480662 - Posted: 22 Feb 2014, 16:27:00 UTC

SETI Beta is completely down...
.

Hello, from Albany, CA!...
ID: 1480662 · Report as offensive
Profile Fred E.
Volunteer tester

Send message
Joined: 22 Jul 99
Posts: 768
Credit: 24,140,697
RAC: 0
United States
Message 1480663 - Posted: 22 Feb 2014, 16:31:36 UTC

Is there even anybody in to do anything on a Saturday?
Not sure they can do anything at the lab they can't do from home, and they've been checking in on weekends. But this one may require a trip to the colo site if the disk array is messed up.

Thanks to everyone's help, especially Richard, I am baaaaaaaack crunching.
- Miklos M.

Welcome back!
Another Fred
Support SETI@home when you search the Web with GoodSearch or shop online with GoodShop.
ID: 1480663 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1480670 - Posted: 22 Feb 2014, 16:47:57 UTC
Last modified: 22 Feb 2014, 16:48:48 UTC

Seems like Murphys pass on the lab at friday´s midnight and do it´s work on the disk array. Hope the problem is something easy to fix or we will be out of work for a long time.
ID: 1480670 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1480675 - Posted: 22 Feb 2014, 17:08:16 UTC

I got excited when one of my machines had the message:
2/22/2014 10:40:48 AM (UTC -5) SETI@home Scheduler request completed: got 1 new tasks
Then I realized it was a _3 & stuck in download limbo.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1480675 · Report as offensive
Profile Fred E.
Volunteer tester

Send message
Joined: 22 Jul 99
Posts: 768
Credit: 24,140,697
RAC: 0
United States
Message 1480679 - Posted: 22 Feb 2014, 17:14:29 UTC

Work may be underway - noted some different behavior - retry #17 did not go to retry immediately like the first 16. It tried for a while before giving up.
Another Fred
Support SETI@home when you search the Web with GoodSearch or shop online with GoodShop.
ID: 1480679 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22202
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1480681 - Posted: 22 Feb 2014, 17:17:14 UTC

Certainly looks as if the tyre kicker has arrived on site....
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1480681 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1480685 - Posted: 22 Feb 2014, 17:25:45 UTC - in response to Message 1480681.  

Certainly looks as if the tyre kicker has arrived on site....

And the Server Status page clocked him in at 16:50 UTC... :P
ID: 1480685 · Report as offensive
Ulrich Metzner
Volunteer tester
Avatar

Send message
Joined: 3 Jul 02
Posts: 1256
Credit: 13,565,513
RAC: 13
Germany
Message 1480686 - Posted: 22 Feb 2014, 17:32:01 UTC

It's dead, Jim...
Aloha, Uli

ID: 1480686 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1480688 - Posted: 22 Feb 2014, 17:38:14 UTC - in response to Message 1480662.  

SETI Beta is completely down...

Except the website is up:

http://setiweb.ssl.berkeley.edu/beta/

Except the Forums are up:

http://setiweb.ssl.berkeley.edu/beta/forum_index.php

Except scheduler works as your hosts (and my hosts) can contact it:

http://setiweb.ssl.berkeley.edu/beta/hosts_user.php?userid=6133

I'd say that's slightly less than completely down!!!

Claggy
ID: 1480688 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1480689 - Posted: 22 Feb 2014, 17:40:54 UTC
Last modified: 22 Feb 2014, 17:41:51 UTC

Agree... if you could post/read anything on the forums it´s not totaly dead... just in the ICU. We still have some hope.
ID: 1480689 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1480699 - Posted: 22 Feb 2014, 18:13:40 UTC
Last modified: 22 Feb 2014, 18:17:33 UTC

Downloads are fixed, here and at Seti Beta, and the Status page now shows tapes.

Claggy
ID: 1480699 · Report as offensive
Profile RottenMutt
Avatar

Send message
Joined: 15 Mar 01
Posts: 1011
Credit: 230,314,058
RAC: 0
United States
Message 1480702 - Posted: 22 Feb 2014, 18:21:06 UTC

life may be returning!
ID: 1480702 · Report as offensive
Previous · 1 . . . 16 · 17 · 18 · 19 · 20 · 21 · 22 . . . 24 · Next

Message boards : Number crunching : Panic Mode On (86) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.