Panic Mode On (83) Server Problems?


log in

Advanced search

Message boards : Number crunching : Panic Mode On (83) Server Problems?

Previous · 1 . . . 7 · 8 · 9 · 10 · 11 · 12 · 13 . . . 22 · Next
Author Message
Profile Fred E.Project donor
Volunteer tester
Send message
Joined: 22 Jul 99
Posts: 768
Credit: 24,139,004
RAC: 1
United States
Message 1362872 - Posted: 30 Apr 2013, 11:51:03 UTC - in response to Message 1362867.

I was running low until I hit the right moment, and got 69 new tasks. Apart from one solitary mid-AR resend, every single one was a shorty. They just get snapped up too darn quickly during a shorty storm - we need mid-AR or AP to damp things down a bit.

Had a similar result with a lucky fetch. I'm out of AP and have mostly shorties, so my 83 gpu tasks sum to 2 hours of crunch time per BoincTasks. Have set up my zero resource share project for the maintenance window.
____________
Another Fred
Support SETI@home when you search the Web with GoodSearch or shop online with GoodShop.

ExchangeMan
Volunteer tester
Send message
Joined: 9 Jan 00
Posts: 113
Credit: 143,323,473
RAC: 201,909
United States
Message 1362877 - Posted: 30 Apr 2013, 12:09:37 UTC - in response to Message 1362867.

I was running low until I hit the right moment, and got 69 new tasks. Apart from one solitary mid-AR resend, every single one was a shorty. They just get snapped up too darn quickly during a shorty storm - we need mid-AR or AP to damp things down a bit.

It seemed that for a few days there was sort of a nice mix of shorties, mid-ARs and AP work units. I don't know if anyone planned it that way or it was just random chance. With a decent mix, the system can almost maintain itself except until someone needs to change tapes - then it's pot luck.

____________

Profile HAL9000
Volunteer tester
Avatar
Send message
Joined: 11 Sep 99
Posts: 4443
Credit: 119,121,640
RAC: 140,441
United States
Message 1362900 - Posted: 30 Apr 2013, 14:11:04 UTC

It is a game of hungry hungry hippos right now it would seem. With my 24 core box reporting a 3 hour queue it looks like it will be doing some PG work today once maintenance starts.
____________
SETI@home classic workunits: 93,865 CPU time: 863,447 hours

Join the BP6/VP6 User Group today!

Grant (SSSF)
Send message
Joined: 19 Aug 99
Posts: 5868
Credit: 60,593,214
RAC: 47,519
Australia
Message 1362909 - Posted: 30 Apr 2013, 18:34:28 UTC - in response to Message 1362860.

+ 1, the splitters our next bottleneck?

It would appear so.
Just coming back from the outage, there is only 1 splitter running, and it is only producing 6 Wus/sec. With shorties we need at least 55/s. So we really need 10 PFB splitters, and there are only 6, and it's been a while since they've all been running. No wonder we keep running out of work, even while it is being split.
____________
Grant
Darwin NT.

Grant (SSSF)
Send message
Joined: 19 Aug 99
Posts: 5868
Credit: 60,593,214
RAC: 47,519
Australia
Message 1362925 - Posted: 30 Apr 2013, 19:38:09 UTC - in response to Message 1362909.

+ 1, the splitters our next bottleneck?

It would appear so.
Just coming back from the outage, there is only 1 splitter running, and it is only producing 6 Wus/sec. With shorties we need at least 55/s. So we really need 10 PFB splitters, and there are only 6, and it's been a while since they've all been running. No wonder we keep running out of work, even while it is being split.


All the splitters are now running, and they're producing less than half the rate of what is needed.
____________
Grant
Darwin NT.

andybutt
Volunteer tester
Avatar
Send message
Joined: 18 Mar 03
Posts: 252
Credit: 118,114,179
RAC: 65,395
United Kingdom
Message 1362958 - Posted: 30 Apr 2013, 21:14:15 UTC - in response to Message 1362925.

Hungry GPU's sitting here doing nothing! Ho Hum!
____________

Profile Fred E.Project donor
Volunteer tester
Send message
Joined: 22 Jul 99
Posts: 768
Credit: 24,139,004
RAC: 1
United States
Message 1362966 - Posted: 30 Apr 2013, 21:32:34 UTC

Just got a "Project down for maintenance" and a one hour backoff, so something's underway.

____________
Another Fred
Support SETI@home when you search the Web with GoodSearch or shop online with GoodShop.

Sirius B
Volunteer tester
Avatar
Send message
Joined: 26 Dec 00
Posts: 11547
Credit: 1,727,794
RAC: 1,663
Israel
Message 1362967 - Posted: 30 Apr 2013, 21:33:17 UTC

Hmmn, I get this....

30/04/2013 22:30:55 | SETI@home | Requesting new tasks for CPU
30/04/2013 22:30:57 | SETI@home | Scheduler request completed: got 0 new tasks
30/04/2013 22:30:57 | SETI@home | Project is temporarily shut down for maintenance

____________

zoom314Project donor
Avatar
Send message
Joined: 30 Nov 03
Posts: 46519
Credit: 36,861,048
RAC: 4,982
United States
Message 1362977 - Posted: 30 Apr 2013, 21:42:17 UTC - in response to Message 1362967.

Yeah, same here.
____________
My Facebook, War Commander, 2015

Profile Bernie Vine
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 26 May 99
Posts: 7081
Credit: 27,532,089
RAC: 36,105
United Kingdom
Message 1362986 - Posted: 30 Apr 2013, 22:35:48 UTC

My main cruncher just downloaded 98 GPU tasks, but my second one still can't get any.
____________


Today is life, the only life we're sure of. Make the most of today.

zoom314Project donor
Avatar
Send message
Joined: 30 Nov 03
Posts: 46519
Credit: 36,861,048
RAC: 4,982
United States
Message 1362987 - Posted: 30 Apr 2013, 22:38:07 UTC

I got 12 for the gpu, yay!
____________
My Facebook, War Commander, 2015

Sakletare
Avatar
Send message
Joined: 18 May 99
Posts: 131
Credit: 20,952,242
RAC: 1,018
Sweden
Message 1362994 - Posted: 30 Apr 2013, 22:42:51 UTC

And what is this 90 Mbits/sec upload that's running 24/7, even when the project is down for maintenance?

http://fragment1.berkeley.edu/newcricket/grapher.cgi?target=/router-interfaces/inr-211/gigabitethernet6_17&ranges=d%3Aw&view=Octets

Profile ivan
Volunteer tester
Avatar
Send message
Joined: 5 Mar 01
Posts: 624
Credit: 143,488,407
RAC: 149,122
United Kingdom
Message 1362998 - Posted: 30 Apr 2013, 22:45:54 UTC - in response to Message 1362994.

And what is this 90 Mbits/sec upload that's running 24/7, even when the project is down for maintenance?

http://fragment1.berkeley.edu/newcricket/grapher.cgi?target=/router-interfaces/inr-211/gigabitethernet6_17&ranges=d%3Aw&view=Octets

I don't think anyone ever guaranteed that that link was exclusively for seti@home.
____________

Profile Wiggo
Avatar
Send message
Joined: 24 Jan 00
Posts: 7357
Credit: 96,896,295
RAC: 66,608
Australia
Message 1363013 - Posted: 30 Apr 2013, 23:08:57 UTC - in response to Message 1362998.

The MB splitters are certainly having a hard time getting up to their usual speed.

Cheers.

Josef W. SegurProject donor
Volunteer developer
Volunteer tester
Send message
Joined: 30 Oct 99
Posts: 4305
Credit: 1,073,917
RAC: 1,233
United States
Message 1363019 - Posted: 30 Apr 2013, 23:38:30 UTC - in response to Message 1362998.

And what is this 90 Mbits/sec upload that's running 24/7, even when the project is down for maintenance?

http://fragment1.berkeley.edu/newcricket/grapher.cgi?target=/router-interfaces/inr-211/gigabitethernet6_17&ranges=d%3Aw&view=Octets

I don't think anyone ever guaranteed that that link was exclusively for seti@home.

By the nature of a gigabit router interface, that link must be for SETI@home exclusively. My guess is the backup operation which is the reason for the Tuesday outage is sensibly being done to someplace outside the colocation facility.
Joe

zoom314Project donor
Avatar
Send message
Joined: 30 Nov 03
Posts: 46519
Credit: 36,861,048
RAC: 4,982
United States
Message 1363032 - Posted: 1 May 2013, 0:11:28 UTC

Still no work, but then I thought the outrage was over?
____________
My Facebook, War Commander, 2015

zoom314Project donor
Avatar
Send message
Joined: 30 Nov 03
Posts: 46519
Credit: 36,861,048
RAC: 4,982
United States
Message 1363059 - Posted: 1 May 2013, 1:46:33 UTC - in response to Message 1362998.

And what is this 90 Mbits/sec upload that's running 24/7, even when the project is down for maintenance?

http://fragment1.berkeley.edu/newcricket/grapher.cgi?target=/router-interfaces/inr-211/gigabitethernet6_17&ranges=d%3Aw&view=Octets

I don't think anyone ever guaranteed that that link was exclusively for seti@home.

No, still I'm seeing no work, the server says no work is available...
____________
My Facebook, War Commander, 2015

Tom*
Send message
Joined: 12 Aug 11
Posts: 114
Credit: 4,815,274
RAC: 248
United States
Message 1363060 - Posted: 1 May 2013, 1:46:36 UTC
Last modified: 1 May 2013, 1:56:32 UTC

Never thought I'd say this -

Please Split some AP's to slow down the feeder, Link and Client systems.

Profile RottenMutt
Avatar
Send message
Joined: 15 Mar 01
Posts: 992
Credit: 207,654,737
RAC: 0
United States
Message 1363061 - Posted: 1 May 2013, 2:53:07 UTC - in response to Message 1362998.

And what is this 90 Mbits/sec upload that's running 24/7, even when the project is down for maintenance?

http://fragment1.berkeley.edu/newcricket/grapher.cgi?target=/router-interfaces/inr-211/gigabitethernet6_17&ranges=d%3Aw&view=Octets

I don't think anyone ever guaranteed that that link was exclusively for seti@home.


then lets program the upload and download servers to report data rates and output on status page.
____________

Keith White
Avatar
Send message
Joined: 29 May 99
Posts: 370
Credit: 2,896,563
RAC: 2,442
United States
Message 1363062 - Posted: 1 May 2013, 3:05:59 UTC
Last modified: 1 May 2013, 3:12:09 UTC

What screams at me looking at the Munin graphs is that only when we ran out of AstroPulse units did the MB units start to plummet from 300K to 0. This suggests to me that a lot of GPU AstroPulse is being done and when that ran out the MB reserve was quickly devoured by hungry hungry GPUs.

It's interesting that 30 units/s creation rate isn't enough, at least during a shorty storm.
____________
"Life is just nature's way of keeping meat fresh." - The Doctor

Previous · 1 . . . 7 · 8 · 9 · 10 · 11 · 12 · 13 . . . 22 · Next

Message boards : Number crunching : Panic Mode On (83) Server Problems?

Copyright © 2014 University of California