Panic Mode On (91) Server Problems?

Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1599929 - Posted: 11 Nov 2014, 17:53:51 UTC - in response to Message 1599917.  

I'm puzzled though. I haven't been able to get any WUs for quite a while, in common with most people. But the SSP reports there are 667 units ready to send, yet I'm getting "project has no tasks available"??

P.

The SSP figures are a snapshot; by the time you see them they can only be a rough indicator. MB splitting is very bursty: the splitter loads up enough data for 256 WUs, processes it, then writes all 256 very quickly. Soon thereafter, a Transitioner comes along and adds 512 tasks to the ready-to-send queue. Within a few seconds there will be multiple work requests, which reduce ready to send...

At the moment, there seem to be six splitters actively producing work (one seems to be having indigestion over 07oc11af). Presumably each splitter dumps its periodic load of 256 WUs / 512 tasks in its own time.
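To make the burstiness concrete, here is a toy simulation, not a model of the real servers. It uses the 256 WUs / 512 tasks per burst described above; the burst interval, request rate, and tasks per request are made-up numbers, and `simulate_ready_to_send` is a hypothetical name chosen for this sketch.

```python
WUS_PER_BURST = 256   # from the post: one splitter burst
TASKS_PER_WU = 2      # from the post: 256 WUs become 512 tasks


def simulate_ready_to_send(seconds, burst_every, requests_per_second,
                           tasks_per_request):
    """Toy model of the ready-to-send queue.

    Returns (history, empty_replies): the queue length at the end of each
    second, and how many requests were answered with zero tasks.
    All rates are illustrative assumptions, not measured values.
    """
    ready = 0
    empty_replies = 0
    history = []
    for t in range(seconds):
        # A splitter burst lands all at once.
        if t % burst_every == 0:
            ready += WUS_PER_BURST * TASKS_PER_WU
        # Hosts drain the queue request by request.
        for _ in range(requests_per_second):
            granted = min(ready, tasks_per_request)
            if granted == 0:
                empty_replies += 1  # "project has no tasks available"
            ready -= granted
        history.append(ready)
    return history, empty_replies


if __name__ == "__main__":
    history, empty = simulate_ready_to_send(60, 30, 8, 10)
    zero_seconds = history.count(0)
    print(f"queue empty in {zero_seconds} of {len(history)} seconds, "
          f"{empty} requests got nothing")
```

With these assumed rates each burst is gone within a few seconds, so a status page sampled at the right moment can show hundreds of tasks ready while most requests around that time still come back empty.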

Three of the six seem to be working on 'shorties' at the moment - 03se14ae, 06se14ab, and 07se14ab. Those batches will be swept up very quickly, because each successful request (for a certain amount of computing time) will be allocated more of these short tasks than it would of slower-running tasks.
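The arithmetic behind that can be sketched as follows. This is only an illustration of "a request for a fixed amount of computing time takes more short tasks than long ones"; the real scheduler weighs many other things, and the function name and all runtimes here are assumptions.

```python
import math


def tasks_for_request(requested_seconds, est_seconds_per_task, available):
    """Number of tasks needed to cover a work request, capped by supply.

    requested_seconds: computing time the host asks for (assumed value).
    est_seconds_per_task: estimated runtime of one task (assumed value).
    available: tasks currently ready to send.
    """
    if est_seconds_per_task <= 0:
        raise ValueError("estimated runtime must be positive")
    wanted = math.ceil(requested_seconds / est_seconds_per_task)
    return min(wanted, available)


if __name__ == "__main__":
    # Same one-hour request, two hypothetical task lengths.
    print("shorties:", tasks_for_request(3600, 300, 512))
    print("normal:  ", tasks_for_request(3600, 1800, 512))
```

Under these assumed runtimes the same request removes six times as many shorties from the queue, which is why a batch of them disappears so fast.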

Another tape might be producing exclusively VLARs (in principle - I have no evidence that is happening at the moment). Those batches might hang around for a while, and clog the feeder - again resulting in no tasks being available for allocation to GPUs.
ID: 1599929
Sirius B Project Donor
Volunteer tester
Joined: 26 Dec 00
Posts: 24879
Credit: 3,081,182
RAC: 7
Ireland
Message 1599950 - Posted: 11 Nov 2014, 23:47:40 UTC

That was a nice short outage & uploads/downloads are OK, minus APs of course :-(
ID: 1599950
David S
Volunteer tester
Joined: 4 Oct 99
Posts: 18352
Credit: 27,761,924
RAC: 12
United States
Message 1599982 - Posted: 12 Nov 2014, 0:51:25 UTC

Well, Beta's back. However, I think the database may have been recovered from a backup. My account page shows my phone's last contact was last Friday, and I know I had it do an update yesterday.
David
Sitting on my butt while others boldly go,
Waiting for a message from a small furry creature from Alpha Centauri.

ID: 1599982
betreger Project Donor
Joined: 29 Jun 99
Posts: 11361
Credit: 29,581,041
RAC: 66
United States
Message 1600032 - Posted: 12 Nov 2014, 2:35:14 UTC

total AP channels to do: 775

That's the most I can recall seeing.
ID: 1600032
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13736
Credit: 208,696,464
RAC: 304
Australia
Message 1600140 - Posted: 12 Nov 2014, 6:53:12 UTC - in response to Message 1600032.  

There still appear to be issues with the MB splitters. They appear to be (mostly) keeping up with demand, but are unable to refill the ready-to-send buffer, which remains empty.
Grant
Darwin NT
ID: 1600140
HAL9000
Volunteer tester
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1600209 - Posted: 12 Nov 2014, 13:20:13 UTC - in response to Message 1600032.  

total AP channels to do: 775

That's the most I can recall seeing.

I wonder if it will go even higher or if they calculated that enough data was added to keep MB going until they could get all of the AP services working again.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[/url]
ID: 1600209
juan BFP Crowdfunding Project Donor, Special Project $75 donor, Special Project $250 donor
Volunteer tester
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1600237 - Posted: 12 Nov 2014, 14:58:37 UTC
Last modified: 12 Nov 2014, 15:00:38 UTC

Apparently the 07oc11af tape is stuck. I don't understand why the splitter program did not recognize that and automatically skip the problem.

Has anyone reported that to the staff?
ID: 1600237
David S
Volunteer tester
Joined: 4 Oct 99
Posts: 18352
Credit: 27,761,924
RAC: 12
United States
Message 1600254 - Posted: 12 Nov 2014, 15:44:30 UTC - in response to Message 1599566.  

I find an interesting situation, presumably due to all the AP processes being down.

I was the _1 on http://setiathome.berkeley.edu/workunit.php?wuid=1602258824 and came up inconclusive. Okay, fine, it went out to a _2. That host timed out and it went to a _3. That host has now reported in, and shows as waiting for validation. The scheduler is not sending it out again because it's still waiting to hear whether the third completion is valid, which can't happen with the validators down.

I wonder how many other examples of this there are.

This one has not changed.

I also note that I have a whole bunch of other AP6s that have validated, some more than a month ago now, but have still not been purged. I do hope they get the whole AP mess fixed soon.
David
Sitting on my butt while others boldly go,
Waiting for a message from a small furry creature from Alpha Centauri.

ID: 1600254
Bernie Vine
Volunteer moderator
Volunteer tester
Joined: 26 May 99
Posts: 9954
Credit: 103,452,613
RAC: 328
United Kingdom
Message 1600278 - Posted: 12 Nov 2014, 16:13:02 UTC

ID: 1600278


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.