Panic Mode On (103) Server Problems?

Message boards : Number crunching : Panic Mode On (103) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 14 · 15 · 16 · 17 · 18 · 19 · 20 . . . 34 · Next

AuthorMessage
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34258
Credit: 79,922,639
RAC: 80
Germany
Message 1812436 - Posted: 25 Aug 2016, 10:23:03 UTC

I still have 44 AP`s left and some Guppies.
Lets wait and see.


With each crime and every kindness we birth our future.
ID: 1812436 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1812454 - Posted: 25 Aug 2016, 11:53:23 UTC

Work is again being split and sent out, but my backup projects are armed in case things go pear shaped during the night here.

Cheers.
ID: 1812454 · Report as offensive
I3APR

Send message
Joined: 23 Apr 16
Posts: 99
Credit: 70,717,488
RAC: 0
Italy
Message 1812458 - Posted: 25 Aug 2016, 12:15:28 UTC
Last modified: 25 Aug 2016, 12:16:44 UTC

Gentlemen, help me understand.

I read on Server status page :

"mb_splitter/ap_splitter: Reads tapes (or tape images on disk) containing raw telescope data and creates SETI@home (multi-beam) or Astropulse workunits for the BOINC/SETI@home clients. At least one needs to be running to produce work, and that's usually enough."

Now I see :

ALL ap's splitters down, so I believe we should not get any AstroPulse WU
4 pfb splitter on 8 disabled, so 4 left running
1 gbt splitter on 7 offline so 6 left running

How come I have now half of my machine with no WU to crunch ( Project has no task available )?
What is an acceptable ( and usual ) creation rate ?

Thanks
A.
ID: 1812458 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22200
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1812461 - Posted: 25 Aug 2016, 12:30:17 UTC

Again, on the Server Status Page, if you look at the second table down on the right, under "progress" you will see a column titled "AstoPulse". This column shows the AstroPulse splitting status for each "tape" loaded, there are three states:
Blank = ready to be split;
Green bar = being split
(done) = splitting has been completed

As I type, all the "Arecibo" tapes loaded are showing "(done)", so the AstroPulse splitters will be sleeping ready for the next batch of tapes to be loaded.

As for your question about why no work arriving on some of your computers - after a splitter crash it takes time for the servers to catch up with demand. There is only a very small "ready to send" pool, and that obviously gets hit very hard during the recovery period, which is typically a few hours.
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1812461 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1812465 - Posted: 25 Aug 2016, 12:43:04 UTC - in response to Message 1812458.  

At least one needs to be running to produce work, and that's usually enough.

I'm sure one was enough when that phrase was written - at least eleven years ago, according to the Wayback Machine

People have started putting greater demands on the system since then - one of the other 149,999 active users probably got there first.
ID: 1812465 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1812486 - Posted: 25 Aug 2016, 14:23:18 UTC - in response to Message 1812461.  

Again, on the Server Status Page, if you look at the second table down on the right, under "progress" you will see a column titled "AstoPulse". This column shows the AstroPulse splitting status for each "tape" loaded, there are three states:
Blank = ready to be split;
Green bar = being split
(done) = splitting has been completed

As I type, all the "Arecibo" tapes loaded are showing "(done)", so the AstroPulse splitters will be sleeping ready for the next batch of tapes to be loaded.

As for your question about why no work arriving on some of your computers - after a splitter crash it takes time for the servers to catch up with demand. There is only a very small "ready to send" pool, and that obviously gets hit very hard during the recovery period, which is typically a few hours.



. . I am going to have to seek out this "Server page". I am in the dark here :)

.
ID: 1812486 · Report as offensive
Profile John Neale
Volunteer tester
Avatar

Send message
Joined: 16 Mar 00
Posts: 634
Credit: 7,246,513
RAC: 9
South Africa
Message 1812488 - Posted: 25 Aug 2016, 14:29:36 UTC - in response to Message 1812486.  

. . I am going to have to seek out this "Server page". I am in the dark here :)

Server status page. There's a link to it on the home page of SETI@home, under "ABOUT".
ID: 1812488 · Report as offensive
Profile SciManStev Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Jun 99
Posts: 6652
Credit: 121,090,076
RAC: 0
United States
Message 1812765 - Posted: 26 Aug 2016, 13:36:48 UTC - in response to Message 1810390.  

I was lucky to be able to reach Jeff so quickly, and that it was an easy problem to fix.

Not so lucky in getting my little green star to reignite....

Meow!

My star hasn't reignited either, and I made a donation the day before you did.

Steve

I guess the green star is really broke. Still nothing is even showing up on my account page.

Steve
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website
ID: 1812765 · Report as offensive
I3APR

Send message
Joined: 23 Apr 16
Posts: 99
Credit: 70,717,488
RAC: 0
Italy
Message 1812772 - Posted: 26 Aug 2016, 14:27:42 UTC - in response to Message 1812765.  


I guess the green star is really broke. Still nothing is even showing up on my account page.


Mine did shine only for 24h after the donation then it went off and never went back on again.. :-(

A.
ID: 1812772 · Report as offensive
WezH
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 576
Credit: 67,033,957
RAC: 95
Finland
Message 1813846 - Posted: 30 Aug 2016, 16:53:50 UTC

Hmmm, should it be maintenance now?

Anyway, Project has no tasks available -messages in event log...
ID: 1813846 · Report as offensive
Profile betreger Project Donor
Avatar

Send message
Joined: 29 Jun 99
Posts: 11361
Credit: 29,581,041
RAC: 66
United States
Message 1813993 - Posted: 31 Aug 2016, 1:25:42 UTC

Since we did not have the pleasure of the Tuesday outrage it would be interesting to know what happened.
ID: 1813993 · Report as offensive
Profile Qui-Gon
Volunteer tester
Avatar

Send message
Joined: 15 May 99
Posts: 2940
Credit: 19,199,902
RAC: 11
United States
Message 1813996 - Posted: 31 Aug 2016, 1:41:47 UTC - in response to Message 1813993.  

I don't know if the outage is automatic or manual but if it's the latter, I hope the guy who does it got to sleep in, at least.
ID: 1813996 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1814033 - Posted: 31 Aug 2016, 3:18:54 UTC

I'm guessing the maintenance was skipped because we're in a high-priority to check the DB for signals coming from near HD 164595. Eric seems confident there won't be anything in the DB, but you won't know until you check, and that means doing a bit of manual nitpicker on a specific target. I suspect that there probably won't be maintenance this week because of that.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1814033 · Report as offensive
Profile betreger Project Donor
Avatar

Send message
Joined: 29 Jun 99
Posts: 11361
Credit: 29,581,041
RAC: 66
United States
Message 1814036 - Posted: 31 Aug 2016, 3:35:24 UTC - in response to Message 1814033.  

Good guess.
ID: 1814036 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1814088 - Posted: 31 Aug 2016, 7:12:01 UTC

Maintenance outage for Wednesday 31 August has been announced on the front page.
ID: 1814088 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1814091 - Posted: 31 Aug 2016, 7:20:40 UTC - in response to Message 1814088.  

Maintenance outage for Wednesday 31 August has been announced on the front page.

Indeed it has. Was guessing we might skip it this week, but I guess it got delayed by a day.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1814091 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22200
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1814113 - Posted: 31 Aug 2016, 9:42:26 UTC

...or is Wednesday the new Tuesday??
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1814113 · Report as offensive
Kieron Walsh

Send message
Joined: 2 Mar 00
Posts: 74
Credit: 43,502,325
RAC: 112
United Kingdom
Message 1814140 - Posted: 31 Aug 2016, 13:15:45 UTC

Shouldn't ithe outage have started by now?
I'm hoping to snag some more productive APs than I got last week!



PS
We're All Wednesday Aren't We
ID: 1814140 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1814142 - Posted: 31 Aug 2016, 13:25:45 UTC - in response to Message 1814140.  

Not usually this early - the normal start time is between 07:30 and 09:30 Berkeley time (14:30-16:30 UTC, 15:30-17:30 UK) - depending which staff member is on duty this week.
ID: 1814142 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1814246 - Posted: 1 Sep 2016, 4:32:13 UTC
Last modified: 1 Sep 2016, 4:40:19 UTC

Well. My friend and I were just talking about how promising Zen looks, and that it is purported to hit the market in October. That's enough time for about a month for reviews before Black Friday comes around, so that's going to work out pretty well for me (unless Zen also flops on its face like Bulldozer did).

We were discussing that earlier in the evening, and then I came home, and I'm sitting here, playing a game, and I heard what sounded like a pile of burnt CDs falling off the top of the desk. I looked around like 'wtf?' and grabbed a flashlight to investigate..and then the rig shut off like I pulled the plug.

Got the under-desk lamp turned on (ancient fluorescent bulb and a geriatric start capacitor) and saw the problem.



Well that's not good. Took the panel off, took the heatsink out, looked at the mounting point, and the tab had busted off.



I grabbed my JB PlasticWeld kit and mixed up a small glob of it and epoxied the bracket back together and crimped it using vise-grips and let it sit for two hours. Installed it back on the mobo, went to lock the tension strap for the heatsink down and got about halfway through and POP. The epoxy let go.

Hmm. Needs more engineering. Got an idea, it didn't work as well as I envisioned, but you know what? It's working. Drilled a hole through the bracket, put a short screw through it so that the tension strap would have something to hook onto, and it seems to be holding.



I went ahead and used a strand of cat-5 to put a check-strap on the heatsink onto the PSU support brace so that at least if it lets go again, the 800g heatsink isn't going to plummet down onto the GPU again.

I've got ideas for how to improve that fix.... but I ran out of motivation. Hopefully, it holds until I can upgrade to Zen (or maybe even Intel, if it comes to that), at which point I'll take that plastic bracket off, take some very good measurements of it, and whip up a replacement out of steel using a mill that I have access to. I think the PCB would break before a steel bracket would.

[edit: I did worry that the CPU fried, because the heatsink fell off a good 20 seconds before the rig shut off. I've got all the throttling features disabled in UEFI, but I do have it set to shut down on CPU fan fail, or if it reaches 65*C. One or both of those happened. It's still good though, as I'm posting this from the machine in question.]
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1814246 · Report as offensive
Previous · 1 . . . 14 · 15 · 16 · 17 · 18 · 19 · 20 . . . 34 · Next

Message boards : Number crunching : Panic Mode On (103) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.