Panic Mode On (87) Server Problems?

Message boards : Number crunching : Panic Mode On (87) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 16 · 17 · 18 · 19 · 20 · 21 · 22 . . . 24 · Next

AuthorMessage
Batter Up
Avatar

Send message
Joined: 5 May 99
Posts: 1946
Credit: 24,860,347
RAC: 0
United States
Message 1490754 - Posted: 18 Mar 2014, 18:01:30 UTC

What happen to the Tuesday shutdown?
ID: 1490754 · Report as offensive
Profile Fred E.
Volunteer tester

Send message
Joined: 22 Jul 99
Posts: 768
Credit: 24,140,697
RAC: 0
United States
Message 1490760 - Posted: 18 Mar 2014, 18:09:09 UTC

What happen to the Tuesday shutdown?

Rain delay.
Joe had an explanation for the 'result creation rate' a few days ago. The splitters make WUs, and put them in the database: but transitioners are needed to turn WUs into results (tasks). Making a WU is a slow, steady process that involves mathematics and data shuffling from disk to disk: making a result from a WU is just a bit of database record-keeping, and takes barely any time. Did you see the database queries/second go over 4,500?

Thanks for the explanation - I must have missed that one. Yes on the database queries / second. Guess they have more horsepower/capacity than I thought. V7 creation rate has dropped back to near-idle now, and pending validation is starting to drop.
Another Fred
Support SETI@home when you search the Web with GoodSearch or shop online with GoodShop.
ID: 1490760 · Report as offensive
Profile Bernie Vine
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 26 May 99
Posts: 9954
Credit: 103,452,613
RAC: 328
United Kingdom
Message 1490764 - Posted: 18 Mar 2014, 18:12:14 UTC

Comment from Eric on Beta

Matt and Jeff are out of town


Might have something to do with it.
ID: 1490764 · Report as offensive
David S
Volunteer tester
Avatar

Send message
Joined: 4 Oct 99
Posts: 18352
Credit: 27,761,924
RAC: 12
United States
Message 1490795 - Posted: 18 Mar 2014, 18:56:08 UTC - in response to Message 1490715.  

Here's the big bulge in ready to send that Richard predicted earlier in this thread:

[As of 18 Mar 2014, 17:00:05 UTC]
Data Distribution State SETI@home # Astropulse # As of*
Results ready to send 602,189 26,288 9m

The highest I happened to see RTS was 698K. Now it's down to 648K. Result creation rate is nearly nothing and what little there is is probably accounted for by resends being added to the queue.
David
Sitting on my butt while others boldly go,
Waiting for a message from a small furry creature from Alpha Centauri.

ID: 1490795 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22189
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1490802 - Posted: 18 Mar 2014, 19:04:26 UTC

With well over 300k tasks sitting in the ready to send buffer the production rate should be very low.
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1490802 · Report as offensive
Josef W. Segur
Volunteer developer
Volunteer tester

Send message
Joined: 30 Oct 99
Posts: 4504
Credit: 1,414,761
RAC: 0
United States
Message 1490892 - Posted: 18 Mar 2014, 21:42:17 UTC - in response to Message 1490644.  


...
Actually the stuck SSP shows more than 300k to send - we do know that a stuck SSP can lead to runaway splitters, when stuck below the highwater mark. I guess if it got stuck _above_ the highwater mark now, splitters are probably not gettting the signal to fire up or at least not working flat out.

Either way, I doubt it will get sorted before maintenance.

edit: small correction, my beta UPload seems to be stuck too.

Well, there's a clue - they were splitting at 15/sec when the page locked. That means they were not at high water mark - so they probably went on, and on, and on...

15 * 3600 * 20 (hours) is over a million tasks split since then. We've been drawing them down, of course, but not so many shorties as in recent weeks. I reckon we'll be bloated.

(and of course they had lots of raw material to work on)

Dr. Anderson's Changeset 2e4d561 is related to this issue. It changed BOINC's sample work generator so it wouldn't go on creating work if the Transitioner got behind or stuck.

Unfortunately, if applied to the splitters here it would mean workunit production would stop if any one of the 6 transitioners got in trouble. That would be a case of the cure being worse than the ailment, I guess.
                                                                  Joe
ID: 1490892 · Report as offensive
Profile Julie
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 28 Oct 09
Posts: 34053
Credit: 18,883,157
RAC: 18
Belgium
Message 1490897 - Posted: 18 Mar 2014, 21:44:02 UTC

Half of the tasks on this computer are ap. Luv it!
rOZZ
Music
Pictures
ID: 1490897 · Report as offensive
Ulrich Metzner
Volunteer tester
Avatar

Send message
Joined: 3 Jul 02
Posts: 1256
Credit: 13,565,513
RAC: 13
Germany
Message 1490924 - Posted: 18 Mar 2014, 21:55:54 UTC

No outage today - work flows - everything's fine! :)
Aloha, Uli

ID: 1490924 · Report as offensive
Cruncher-American Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor

Send message
Joined: 25 Mar 02
Posts: 1513
Credit: 370,893,186
RAC: 340
United States
Message 1491130 - Posted: 19 Mar 2014, 9:00:30 UTC

CreditNew Strikes Again:

3444282044 1455105331 18 Mar 2014, 6:32:51 UTC 19 Mar 2014, 1:47:14 UTC Completed and validated 2.12 0.50 609.32 AstroPulse v6
Anonymous platform (NVIDIA GPU)

Not bad, for 2 seconds of clock time...EAT YOUR HEART OUT!!!!
ID: 1491130 · Report as offensive
Arivald Ha'gel

Send message
Joined: 9 May 03
Posts: 14
Credit: 16,623,619
RAC: 2
Poland
Message 1491136 - Posted: 19 Mar 2014, 9:14:52 UTC

There seems to be a different problem visible:

Replica seconds behind master 0 31h
Results received in last hour 32h
Result turnaround time (last hour average) hours hours 32h


Replica out of sync?
ID: 1491136 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1491142 - Posted: 19 Mar 2014, 9:24:18 UTC - in response to Message 1491130.  
Last modified: 19 Mar 2014, 9:25:10 UTC

CreditNew Strikes Again:

3444282044 1455105331 18 Mar 2014, 6:32:51 UTC 19 Mar 2014, 1:47:14 UTC Completed and validated 2.12 0.50 609.32 AstroPulse v6
Anonymous platform (NVIDIA GPU)

Not bad, for 2 seconds of clock time...EAT YOUR HEART OUT!!!!

You win the Random Number Generator Credit Lottery!!!

And still there are someones (very few at this time i belive) who continue to say: nothing is wrong with creditscrew...
ID: 1491142 · Report as offensive
Jesse Viviano

Send message
Joined: 27 Feb 00
Posts: 100
Credit: 3,949,583
RAC: 0
United States
Message 1491455 - Posted: 19 Mar 2014, 21:29:02 UTC

One of the Astropulse assimilators has failed according to the server stats page, and therefore a queue of validated Astropulse work units waiting for assimilation has grown.
ID: 1491455 · Report as offensive
Profile Oz
Avatar

Send message
Joined: 6 Jun 99
Posts: 233
Credit: 200,655,462
RAC: 212
United States
Message 1491681 - Posted: 20 Mar 2014, 5:09:19 UTC

I like credit_new - today, at least...

3442864585 1454394478 17 Mar 2014, 10:18:44 UTC 19 Mar 2014, 15:15:16 UTC Completed and validated 2.14 0.75 621.99 AstroPulse v6 Anonymous platform (CPU)
Member of the 20 Year Club



ID: 1491681 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1492357 - Posted: 20 Mar 2014, 23:49:30 UTC

Well AP splitting is almost finished so get ready for a steady flow of MB work soon.

Cheers.
ID: 1492357 · Report as offensive
David S
Volunteer tester
Avatar

Send message
Joined: 4 Oct 99
Posts: 18352
Credit: 27,761,924
RAC: 12
United States
Message 1492700 - Posted: 21 Mar 2014, 14:01:49 UTC - in response to Message 1492357.  

Well AP splitting is almost finished so get ready for a steady flow of MB work soon.

Cheers.

Wow. Over 10 hours later and no one is wailing over the APs running out.
David
Sitting on my butt while others boldly go,
Waiting for a message from a small furry creature from Alpha Centauri.

ID: 1492700 · Report as offensive
Batter Up
Avatar

Send message
Joined: 5 May 99
Posts: 1946
Credit: 24,860,347
RAC: 0
United States
Message 1492726 - Posted: 21 Mar 2014, 15:07:00 UTC - in response to Message 1492700.  


Wow. Over 10 hours later and no one is wailing over the APs running out.

Many are on double secret probation. Let's all join hands and sing kumbaya.
ID: 1492726 · Report as offensive
Profile Jimbocous Project Donor
Volunteer tester
Avatar

Send message
Joined: 1 Apr 13
Posts: 1853
Credit: 268,616,081
RAC: 1,349
United States
Message 1492879 - Posted: 21 Mar 2014, 21:09:22 UTC - in response to Message 1492700.  

Well AP splitting is almost finished so get ready for a steady flow of MB work soon.

Cheers.

Wow. Over 10 hours later and no one is wailing over the APs running out.


Ah, but the caches are still full. When the bellies get empty, the grumbling will commence :)
ID: 1492879 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1492886 - Posted: 21 Mar 2014, 21:36:17 UTC - in response to Message 1492700.  

Well AP splitting is almost finished so get ready for a steady flow of MB work soon.

Cheers.

Wow. Over 10 hours later and no one is wailing over the APs running out.

Well if we could burn thought the MB faster then more AP would get loaded quicker, yes?
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1492886 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1492891 - Posted: 21 Mar 2014, 21:45:45 UTC - in response to Message 1492886.  

Well AP splitting is almost finished so get ready for a steady flow of MB work soon.

Cheers.

Wow. Over 10 hours later and no one is wailing over the APs running out.

Well if we could burn thought the MB faster then more AP would get loaded quicker, yes?

Yes and I'm trying to do my part.

Cheers.
ID: 1492891 · Report as offensive
JohnDK Crowdfunding Project Donor*Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 28 May 00
Posts: 1222
Credit: 451,243,443
RAC: 1,127
Denmark
Message 1492903 - Posted: 21 Mar 2014, 22:19:43 UTC - in response to Message 1492886.  

Well AP splitting is almost finished so get ready for a steady flow of MB work soon.

Cheers.

Wow. Over 10 hours later and no one is wailing over the APs running out.

Well if we could burn thought the MB faster then more AP would get loaded quicker, yes?

As I see it they load way too much data at once, but the real culprit is really the credit flaw.
ID: 1492903 · Report as offensive
Previous · 1 . . . 16 · 17 · 18 · 19 · 20 · 21 · 22 . . . 24 · Next

Message boards : Number crunching : Panic Mode On (87) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.