Panic Mode On (74) Server problems?



Message boards : Number crunching : Panic Mode On (74) Server problems?

msattler
Project donor
Volunteer tester
Joined: 9 Jul 00
Posts: 38861
Credit: 577,163,086
RAC: 522,606
United States
Message 1221732 - Posted: 22 Apr 2012, 16:27:53 UTC - in response to Message 1221730.

Hmmmm...
Da Cricket thingy gone kaput.

Numbers seem to be updating -- maybe the plotter pen has run out of ink? (A perennial problem back in the days...)

That little yellow fellow will have to find another place to play.


Oh no, he mentioned the yellow fellow....
Now we're in the danger zone.


I think we're safe....I didn't use the 'd' word.
____________
*********************************************
Embrace your inner kitty...ya know ya wanna!

I have met a few friends in my life.
Most were cats.

Sten-Arne
Volunteer tester
Joined: 1 Nov 08
Posts: 3403
Credit: 19,564,253
RAC: 18,850
Sweden
Message 1221734 - Posted: 22 Apr 2012, 16:42:00 UTC - in response to Message 1221732.

Hmmmm...
Da Cricket thingy gone kaput.

Numbers seem to be updating -- maybe the plotter pen has run out of ink? (A perennial problem back in the days...)

That little yellow fellow will have to find another place to play.


Oh no, he mentioned the yellow fellow....
Now we're in the danger zone.


I think we're safe....I didn't use the 'd' word.


Well, let's hope so. If you're wrong, we're going down like never before...

LOL
____________

Cosmic_Ocean
Joined: 23 Dec 00
Posts: 2245
Credit: 8,582,702
RAC: 4,235
United States
Message 1221744 - Posted: 22 Apr 2012, 17:51:38 UTC - in response to Message 1221730.

Oh no, he mentioned the yellow fellow....
Now we're in the danger zone.

Why did you have to say 'danger zone'? The first thing that instantly popped into my mind was http://www.youtube.com/watch?v=s2MeX45Kk6Q.
____________

Linux laptop uptime: 1484d 22h 42m
Ended due to UPS failure, found 14 hours after the fact

Grant (SSSF)
Joined: 19 Aug 99
Posts: 5789
Credit: 57,880,951
RAC: 48,134
Australia
Message 1221756 - Posted: 22 Apr 2012, 18:29:22 UTC - in response to Message 1221744.


Cricket's borked & downloads are taking several attempts to complete, for the first time in a while.
Looks like a lot of shorties coming through.
____________
Grant
Darwin NT.

Josef W. Segur
Project donor
Volunteer developer
Volunteer tester
Joined: 30 Oct 99
Posts: 4227
Credit: 1,042,262
RAC: 350
United States
Message 1221797 - Posted: 22 Apr 2012, 20:18:51 UTC - in response to Message 1221727.

Hmmmm...
Da Cricket thingy gone kaput.

Numbers seem to be updating -- maybe the plotter pen has run out of ink? (A perennial problem back in the days...)

I think "Cur: nan bits/sec" argues that there is no data coming in from the inr-250 router, and the script which presents those numbers is smart enough not to include those "Not A Number" values in the averages. There's an overall page for the inr-250 router at http://fragment1.berkeley.edu/newcricket/grapher.cgi?target=%2Frouter-interfaces%2Fsum-tier2%2Fsum-tier2-inr-250;ranges=d;view=Octets which is also blank for the same period, and there the averages are shown as zero.
Joe
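
To illustrate the behaviour Joe describes, here is a minimal sketch of an average that skips "Not A Number" samples. It is not the Cricket script itself: the function name and the sample readings are hypothetical, and the only assumption is that a poll with no data from the router is recorded as NaN.

```python
import math


def average_ignoring_nan(samples):
    """Return the mean of the samples that are real numbers, or NaN if none are."""
    valid = [s for s in samples if not math.isnan(s)]
    if not valid:
        # Nothing usable came in, so there is no meaningful average.
        return math.nan
    return sum(valid) / len(valid)


# Hypothetical bits/sec readings; NaN marks a poll that returned no data.
readings = [80e6, math.nan, 90e6, math.nan, 100e6]
print(average_ignoring_nan(readings))
```

Skipping the NaN entries keeps the average meaningful during a gap, whereas counting a missing sample as zero would drag the figure down.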

arkayn
Project donor
Volunteer tester
Joined: 14 May 99
Posts: 3619
Credit: 48,525,313
RAC: 38,224
United States
Message 1221802 - Posted: 22 Apr 2012, 20:33:11 UTC - in response to Message 1221797.

Hmmmm...
Da Cricket thingy gone kaput.

Numbers seem to be updating -- maybe the plotter pen has run out of ink? (A perennial problem back in the days...)

I think "Cur: nan bits/sec" argues that there is no data coming in from the inr-250 router, and the script which presents those numbers is smart enough not to include those "Not A Number" values in the averages. There's an overall page for the inr-250 router at http://fragment1.berkeley.edu/newcricket/grapher.cgi?target=%2Frouter-interfaces%2Fsum-tier2%2Fsum-tier2-inr-250;ranges=d;view=Octets which is also blank for the same period, and there the averages are shown as zero.
Joe


Looks like they fixed it as I see ink once again.
____________

Mark Wyzenbeek
Project donor
Joined: 28 Jun 99
Posts: 89
Credit: 1,599,605
RAC: 1,114
United States
Message 1222387 - Posted: 23 Apr 2012, 21:47:14 UTC

I'm getting:
4/23/2012 2:45:07 PM | SETI@home | Scheduler request failed: Couldn't resolve host name

____________
The Universe is not only stranger than you imagine, it's stranger than you can imagine.

SETI@home classic workunits 1,405 CPU time 57,318 hours

Mark Wyzenbeek
Project donor
Joined: 28 Jun 99
Posts: 89
Credit: 1,599,605
RAC: 1,114
United States
Message 1222400 - Posted: 23 Apr 2012, 22:23:24 UTC

And half an hour later it's working again. Never mind.
____________
The Universe is not only stranger than you imagine, it's stranger than you can imagine.

SETI@home classic workunits 1,405 CPU time 57,318 hours

cliff
Joined: 16 Dec 07
Posts: 322
Credit: 2,509,590
RAC: 0
United Kingdom
Message 1222828 - Posted: 24 Apr 2012, 19:23:38 UTC
Last modified: 24 Apr 2012, 19:27:34 UTC

And here we go again. The status page says everything is back up after the weekly outage, but...

24/04/2012 20:17:19 | SETI@home | Scheduler request failed: Failure when receiving data from the peer
24/04/2012 20:17:22 | | Project communication failed: attempting access to reference site
24/04/2012 20:17:24 | | Internet access OK - project servers may be temporarily down.
24/04/2012 20:17:31 | SETI@home | update requested by user
24/04/2012 20:17:34 | SETI@home | Sending scheduler request: Requested by user.
24/04/2012 20:17:34 | SETI@home | Reporting 13 completed tasks, requesting new tasks for CPU and NVIDIA
24/04/2012 20:17:56 | SETI@home | Scheduler request failed: Couldn't connect to server
24/04/2012 20:17:59 | SETI@home | update requested by user
24/04/2012 20:17:59 | | Project communication failed: attempting access to reference site
24/04/2012 20:18:00 | | Internet access OK - project servers may be temporarily down.
24/04/2012 20:18:01 | SETI@home | Sending scheduler request: Requested by user.
24/04/2012 20:18:01 | SETI@home | Reporting 13 completed tasks, requesting new tasks for CPU and NVIDIA
24/04/2012 20:18:23 | SETI@home | Scheduler request failed: Couldn't connect to server
24/04/2012 20:18:28 | | Project communication failed: attempting access to reference site
24/04/2012 20:18:29 | | Internet access OK - project servers may be temporarily down.

Wheeee.. much fun is had..
1st time I've seen a failure receiving data from the peer though..
And the hopping bugs are in severe distress once again... lotsa white and very little green.
Also no access to the tasks page:-/ Wanted to catch up on my last AP6..
Cheers,
____________
Cliff,
Been there, Done that, Still no damm T shirt!
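
For anyone who wants to tally failures like these from a saved event log, here is a minimal sketch. It assumes each line has the three pipe-separated fields seen in the log pasted above (timestamp, project, message); the function name is hypothetical and this is not part of the BOINC client.

```python
from collections import Counter

PREFIX = "Scheduler request failed: "


def count_scheduler_failures(log_text):
    """Count scheduler failures by reason in pipe-separated event log text."""
    reasons = Counter()
    for line in log_text.splitlines():
        # Split into timestamp, project (may be empty) and message.
        parts = [p.strip() for p in line.split("|", 2)]
        if len(parts) != 3:
            continue  # not an event log line
        _stamp, _project, message = parts
        if message.startswith(PREFIX):
            reasons[message[len(PREFIX):]] += 1
    return reasons


# A few lines copied from the log above.
sample = """\
24/04/2012 20:17:19 | SETI@home | Scheduler request failed: Failure when receiving data from the peer
24/04/2012 20:17:22 | | Project communication failed: attempting access to reference site
24/04/2012 20:17:56 | SETI@home | Scheduler request failed: Couldn't connect to server
24/04/2012 20:18:23 | SETI@home | Scheduler request failed: Couldn't connect to server
"""

for reason, count in count_scheduler_failures(sample).most_common():
    print(count, reason)
```

Splitting on the pipe character rather than on a fixed timestamp layout means the same sketch should also cope with the US-style timestamp in Mark's log line earlier in the thread.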

zoom314
Project donor
Joined: 30 Nov 03
Posts: 46078
Credit: 36,577,375
RAC: 5,303
Message 1222834 - Posted: 24 Apr 2012, 19:48:42 UTC

I've got the same problem as Cliff does, can upload, but not report, as to downloading I have no idea yet.
____________
My Facebook, War Commander, 2015

cliff
Joined: 16 Dec 07
Posts: 322
Credit: 2,509,590
RAC: 0
United Kingdom
Message 1222836 - Posted: 24 Apr 2012, 19:50:12 UTC - in response to Message 1222834.

I've got the same problem as Cliff does, can upload, but not report, as to downloading I have no idea yet.


No real worries, it's nothing a large hammer and some chewing gum won't cure :-)

Cheers,
____________
Cliff,
Been there, Done that, Still no damm T shirt!

Sten-Arne
Volunteer tester
Joined: 1 Nov 08
Posts: 3403
Credit: 19,564,253
RAC: 18,850
Sweden
Message 1222845 - Posted: 24 Apr 2012, 20:08:32 UTC

Everything works perfectly here. Upload, reporting, getting new work, and downloading. All swift and with a download speed seldom seen.



No wait, that was yesterday. Today after the outage, it's SNAFU as usual. All will be back to normal, one day before the next outage though.

Until then, whine or wine...your choice....
____________

Grant (SSSF)
Joined: 19 Aug 99
Posts: 5789
Credit: 57,880,951
RAC: 48,134
Australia
Message 1223052 - Posted: 25 Apr 2012, 4:34:20 UTC

I wonder what happened here?
http://setiathome.berkeley.edu/workunit.php?wuid=974196689
____________
Grant
Darwin NT.

Area 51
Joined: 31 Jan 04
Posts: 965
Credit: 42,193,520
RAC: 0
United Kingdom
Message 1223120 - Posted: 25 Apr 2012, 10:42:56 UTC - in response to Message 1223052.

I wonder what happened here?
http://setiathome.berkeley.edu/workunit.php?wuid=974196689



....whilst avoiding knocking the power plug out of the wall outlet and thus avoiding an unplanned outage, the cleaner accidentally sucked up a few wu's that were lying around. It'll be in the dust bag ... ;-)


More seriously, I can't recall seeing something like this before! Is (was) it one of yours?
____________

Josef W. Segur
Project donor
Volunteer developer
Volunteer tester
Joined: 30 Oct 99
Posts: 4227
Credit: 1,042,262
RAC: 350
United States
Message 1223156 - Posted: 25 Apr 2012, 14:48:52 UTC - in response to Message 1223052.

I wonder what happened here?
http://setiathome.berkeley.edu/workunit.php?wuid=974196689

The WU was finished, the record was purged.

From the number I judge it was created April 18 or 19. With an average turnaround around 3 days, many WUs will be finished and assimilated in one or two days. The records are purged a day after that.

If your host is still doing a task for that WU or something, none of that should have happened yet. What prompted the question?
Joe

Grant (SSSF)
Joined: 19 Aug 99
Posts: 5789
Credit: 57,880,951
RAC: 48,134
Australia
Message 1223234 - Posted: 25 Apr 2012, 18:18:44 UTC - in response to Message 1223156.

If your host is still doing a task for that WU or something, none of that should have happened yet. What prompted the question?

My GPU processed the WU (only a few hundred seconds of run time), all the other GPUs errored out (similar runtimes).
Just curious as to why one was able to run it, all the others errored out.

____________
Grant
Darwin NT.

Josef W. Segur
Project donor
Volunteer developer
Volunteer tester
Joined: 30 Oct 99
Posts: 4227
Credit: 1,042,262
RAC: 350
United States
Message 1223245 - Posted: 25 Apr 2012, 19:06:51 UTC - in response to Message 1223234.

If your host is still doing a task for that WU or something, none of that should have happened yet. What prompted the question?

My GPU processed the WU (only a few hundred seconds of run time), all the other GPUs errored out (similar runtimes).
Just curious as to why one was able to run it, all the others errored out.

If the others were running stock and got -12 errors, it's probably because the optimized GPU app you're running can handle two triplets in the same array and stock can't. Whatever, I assume the WU has been discarded for too many errors and probably was afflicted with RFI so that's not unreasonable.
Joe

Dave
Joined: 29 Mar 02
Posts: 774
Credit: 23,193,139
RAC: 0
United Kingdom
Message 1223582 - Posted: 26 Apr 2012, 15:21:22 UTC

Are we not validating much atm? My RAC's not rising much.

Richard Haselgrove
Project donor
Volunteer tester
Joined: 4 Jul 99
Posts: 8460
Credit: 48,780,839
RAC: 83,726
United Kingdom
Message 1223585 - Posted: 26 Apr 2012, 15:44:59 UTC - in response to Message 1223582.

Are we not validating much atm? My RAC's not rising much.

We had a second round of re-estimation this week, which tends to result in over-fetching and tasks being crunched out of order/returned slowly. That leads to increased pending tasks.

But RAC is always a blunt, crude, slow measure for assessing performance. Just keep an eye on the counts of the various categories at the top of your task lists, and the ratio between them.

N9JFE David S
Project donor
Volunteer tester
Joined: 4 Oct 99
Posts: 11144
Credit: 13,921,412
RAC: 13,073
United States
Message 1224069 - Posted: 27 Apr 2012, 20:31:41 UTC

Holy bovine, Batman! It's been over 28-3/4 hours since the last post in the Panic Mode thread. Can things possibly be that right with the world???

____________
David
Sitting on my butt while others boldly go,
Waiting for a message from a small furry creature from Alpha Centauri.



Copyright © 2014 University of California