Panic Mode On (65) Server problems?

Message boards : Number crunching : Panic Mode On (65) Server problems?
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 . . . 9 · Next

AuthorMessage
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1187283 - Posted: 21 Jan 2012, 22:34:54 UTC - in response to Message 1187279.  

Boring is good. Just want the Gods of reliability now.

LOl....and higher limits would be a joy too.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1187283 · Report as offensive
Profile SciManStev Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Jun 99
Posts: 6652
Credit: 121,090,076
RAC: 0
United States
Message 1187293 - Posted: 21 Jan 2012, 22:57:30 UTC - in response to Message 1187283.  
Last modified: 21 Jan 2012, 22:58:24 UTC

Boring is good. Just want the Gods of reliability now.

LOl....and higher limits would be a joy too.

+10^100

Steve
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website
ID: 1187293 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13732
Credit: 208,696,464
RAC: 304
Australia
Message 1187304 - Posted: 21 Jan 2012, 23:26:18 UTC - in response to Message 1187283.  


And sort out the DCF problem.
Grant
Darwin NT
ID: 1187304 · Report as offensive
Profile john3760
Avatar

Send message
Joined: 9 Feb 11
Posts: 334
Credit: 3,400,979
RAC: 0
United Kingdom
Message 1187322 - Posted: 22 Jan 2012, 0:25:02 UTC
Last modified: 22 Jan 2012, 1:01:04 UTC

Crickets maxed out again!!
Not sure if it's due to AP's,or
Marks return to the message boards.;)

Welcome back Mark.

john 3760
ID: 1187322 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1187369 - Posted: 22 Jan 2012, 6:37:46 UTC - in response to Message 1187322.  

Crickets maxed out again!!
Not sure if it's due to AP's,or
Marks return to the message boards.;)

Welcome back Mark.

john 3760

LOL....I don't think even I post quite enough to boost the ol' Crickets.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1187369 · Report as offensive
B-Man
Volunteer tester

Send message
Joined: 11 Feb 01
Posts: 253
Credit: 147,366
RAC: 0
United States
Message 1187899 - Posted: 24 Jan 2012, 3:28:20 UTC
Last modified: 24 Jan 2012, 3:29:53 UTC

Looks like the stuck tape is still blocking the splitters after I reported it in the last problem thread before this one. Looks like it has been blocking 2-3 MB splitters for over one week. It needs to be fixed.
ID: 1187899 · Report as offensive
Profile Khangollo
Avatar

Send message
Joined: 1 Aug 00
Posts: 245
Credit: 36,410,524
RAC: 0
Slovenia
Message 1188005 - Posted: 24 Jan 2012, 13:07:25 UTC

Scheduler fail.

Just when there were hundreds of APs accumulating :@
ID: 1188005 · Report as offensive
Terror Australis
Volunteer tester

Send message
Joined: 14 Feb 04
Posts: 1817
Credit: 262,693,308
RAC: 44
Australia
Message 1188006 - Posted: 24 Jan 2012, 13:08:57 UTC

Scheduler requests from all 5 boxes are failing with http errors.

Uploads are still going through Ok.

The blue line on the Crickets has taken a nose dive too.

T.A.
ID: 1188006 · Report as offensive
Profile ivan
Volunteer tester
Avatar

Send message
Joined: 5 Mar 01
Posts: 783
Credit: 348,560,338
RAC: 223
United Kingdom
Message 1188028 - Posted: 24 Jan 2012, 14:17:50 UTC - in response to Message 1188006.  

Scheduler requests from all 5 boxes are failing with http errors.

Uploads are still going through Ok.

The blue line on the Crickets has taken a nose dive too.

T.A.

Up again now...
ID: 1188028 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1188102 - Posted: 24 Jan 2012, 22:00:02 UTC

For those who haven't noticed.. you should have a non-zero "number of completed tasks" on your application details for each host now. I asked a question through the back channels and it ended up being a simple PHP fix. Many thanks to those involved on that one.

I was somewhat surprised at how many tasks my single-core machine has done. Then I got a crazy idea.. let's go look at mark's computers.. pick the one with the highest RAC.. wow that's a lot of completed tasks.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1188102 · Report as offensive
Profile john3760
Avatar

Send message
Joined: 9 Feb 11
Posts: 334
Credit: 3,400,979
RAC: 0
United Kingdom
Message 1188111 - Posted: 24 Jan 2012, 22:49:21 UTC

The replica databsse server is disabled,yet it is 0 seconds

behind the master . How can that be ?


Something is up

john3760


ID: 1188111 · Report as offensive
Profile Lint trap

Send message
Joined: 30 May 03
Posts: 871
Credit: 28,092,319
RAC: 0
United States
Message 1188183 - Posted: 25 Jan 2012, 4:12:34 UTC - in response to Message 1188111.  

The replica databsse server is disabled,yet it is 0 seconds

behind the master . How can that be ?


Something is up




Last status was '0' when it was disabled (by staff)??

Did you happen to notice the time since last update (not the status page time, the "As of" 'm' or 'h' displayed to the right of the '0') ??



But, what a diff 5 hrs makes...

I see the replica 50k secs behind now and losing ground steadily.

Maybe Jocelyn has been assigned some extra duties??

Lt


ID: 1188183 · Report as offensive
Speedy
Volunteer tester
Avatar

Send message
Joined: 26 Jun 04
Posts: 1643
Credit: 12,921,799
RAC: 89
New Zealand
Message 1188355 - Posted: 25 Jan 2012, 20:45:31 UTC - in response to Message 1187899.  

Looks like the stuck tape is still blocking the splitters after I reported it in the last problem thread before this one. Looks like it has been blocking 2-3 MB splitters for over one week. It needs to be fixed.

Has anyone messaged DA or Jeff to notify them that tape 01oc11aa is stuck?
ID: 1188355 · Report as offensive
Profile Belthazor
Volunteer tester
Avatar

Send message
Joined: 6 Apr 00
Posts: 219
Credit: 10,373,795
RAC: 13
Russia
Message 1188456 - Posted: 26 Jan 2012, 5:45:22 UTC - in response to Message 1188355.  


Has anyone messaged DA or Jeff to notify them that tape 01oc11aa is stuck?


You think they dosen't watching status page? :) I'm sure they well known of the problem, but no wonder of it because of no serious happened and three of MB-splitters are enough, even for shortie storm...

ID: 1188456 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1188468 - Posted: 26 Jan 2012, 7:25:16 UTC

Well, I don't care HOW many splitters are running....I can tell you this.
My top rig is losing cache steadily. And if I am losing cache, a lot of other rigs are as well. Even though server status shows over 200k WUs ready to send, the scheduler/feeder combo is not getting the job done. Otherwise, bandwidth would be saturated right now.

Something is not happy in serverland right now.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1188468 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1188479 - Posted: 26 Jan 2012, 9:28:46 UTC - in response to Message 1188468.  

I'm still bouncing off the limits here. :(

Cheers.
ID: 1188479 · Report as offensive
Kevin Olley

Send message
Joined: 3 Aug 99
Posts: 906
Credit: 261,085,289
RAC: 572
United Kingdom
Message 1188557 - Posted: 26 Jan 2012, 16:03:09 UTC - in response to Message 1188468.  

Well, I don't care HOW many splitters are running....I can tell you this.
My top rig is losing cache steadily. And if I am losing cache, a lot of other rigs are as well. Even though server status shows over 200k WUs ready to send, the scheduler/feeder combo is not getting the job done. Otherwise, bandwidth would be saturated right now.

Something is not happy in serverland right now.



Mine is a bit slower than yours, I am just about holding cache when running on regular WU's but when I give it a nudge and it starts spitting out any shorties it has accumilated in the last 24 hrs cache levels will drop and only very slowly raise when its finnished shorty bashing.


Kevin


ID: 1188557 · Report as offensive
LadyL
Volunteer tester
Avatar

Send message
Joined: 14 Sep 11
Posts: 1679
Credit: 5,230,097
RAC: 0
Message 1188561 - Posted: 26 Jan 2012, 16:19:23 UTC - in response to Message 1188481.  

Even though server status shows over 200k WUs ready to send, the scheduler/feeder combo is not getting the job done. Otherwise, bandwidth would be saturated right now.



if the server status page is saying :

Data Distribution State SETI@home #

Results ready to send : 200,711


and my BOINC Manager says:

2012-01-26 05:08:10 | SETI@home | Sending scheduler request: To fetch work.
2012-01-26 05:08:10 | SETI@home | Reporting 1 completed tasks, requesting new tasks for CPU
2012-01-26 05:08:12 | SETI@home | Scheduler request completed: got 0 new tasks
2012-01-26 05:08:12 | SETI@home | Project has no tasks available


then certainly one of these 2 or a server between both has a very long nose ^^


That's just that the feeder (up to 100 tasks) was empty when the request got in. A few seconds later it would have restocked with fresh 100 tasks and you'd have gotten some.
If we are still sending out a lot of VHAR, each request needs more tasks to get the same amount of preocessing time as if it were mid AR, so the feeder is empty most of the time, making it harder to get tasks assigned. and because everybody is getting a lot of tasks (when you get them) they are horribly hard to get down as well.
Shorties storms don't make for happy servers.
ID: 1188561 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1188566 - Posted: 26 Jan 2012, 16:26:04 UTC - in response to Message 1188561.  

Even though server status shows over 200k WUs ready to send, the scheduler/feeder combo is not getting the job done. Otherwise, bandwidth would be saturated right now.



if the server status page is saying :

Data Distribution State SETI@home #

Results ready to send : 200,711


and my BOINC Manager says:

2012-01-26 05:08:10 | SETI@home | Sending scheduler request: To fetch work.
2012-01-26 05:08:10 | SETI@home | Reporting 1 completed tasks, requesting new tasks for CPU
2012-01-26 05:08:12 | SETI@home | Scheduler request completed: got 0 new tasks
2012-01-26 05:08:12 | SETI@home | Project has no tasks available


then certainly one of these 2 or a server between both has a very long nose ^^


That's just that the feeder (up to 100 tasks) was empty when the request got in. A few seconds later it would have restocked with fresh 100 tasks and you'd have gotten some.
If we are still sending out a lot of VHAR, each request needs more tasks to get the same amount of preocessing time as if it were mid AR, so the feeder is empty most of the time, making it harder to get tasks assigned. and because everybody is getting a lot of tasks (when you get them) they are horribly hard to get down as well.
Shorties storms don't make for happy servers.

You are missing my original point.....(now obscured since AP is filling the pipe again).
If the feeder and scheduler are working together properly, they are fully capable of saturating the bandwidth with MB tasks only, and that was nowhere near happening. And current MB work being issued does not appear to be a shorty storm anyway.

"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1188566 · Report as offensive
LadyL
Volunteer tester
Avatar

Send message
Joined: 14 Sep 11
Posts: 1679
Credit: 5,230,097
RAC: 0
Message 1188574 - Posted: 26 Jan 2012, 16:35:43 UTC - in response to Message 1188566.  

You are missing my original point.....(now obscured since AP is filling the pipe again).
If the feeder and scheduler are working together properly, they are fully capable of saturating the bandwidth with MB tasks only, and that was nowhere near happening. And current MB work being issued does not appear to be a shorty storm anyway.


Oh, I'm sorry, I just wanted to put across that 'no tasks available' with ready to send high is empty feeder, I didn't check SSP or cricket.

Why the feeder is empty is a different issue and yes, if people have trouble getting tasks when the bandwidth is not maxxed that points at more severe problems than just being unable to keep up with demand.
ID: 1188574 · Report as offensive
Previous · 1 · 2 · 3 · 4 · 5 . . . 9 · Next

Message boards : Number crunching : Panic Mode On (65) Server problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.