Panic Mode On (97) Server Problems?

Message boards : Number crunching : Panic Mode On (97) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 5 · 6 · 7 · 8 · 9 · 10 · 11 . . . 33 · Next

AuthorMessage
Profile JBird Project Donor
Avatar

Send message
Joined: 3 Sep 02
Posts: 297
Credit: 325,260,309
RAC: 549
United States
Message 1667897 - Posted: 21 Apr 2015, 3:10:35 UTC - in response to Message 1667877.  

4/20/2015 9:57:48 PM | SETI@home | Finished upload of 02se12ab.23690.11523.438086664197.12.66_0_0
=
Whole bunch of em with that extra _0

But my point is, Boinc is jumping on these with June 6+ deadlines before it runs the remaining older May 9+ stuff already in my queue.

160+ tasks piling up in Pending just last 2 days - on top of the 160+ that were already in there before workfetch lit up again.

Just a puzzle is all.

ID: 1667897 · Report as offensive
OTS
Volunteer tester

Send message
Joined: 6 Jan 08
Posts: 369
Credit: 20,533,537
RAC: 0
United States
Message 1667906 - Posted: 21 Apr 2015, 3:27:47 UTC - in response to Message 1667897.  

4/20/2015 9:57:48 PM | SETI@home | Finished upload of 02se12ab.23690.11523.438086664197.12.66_0_0
=
Whole bunch of em with that extra _0

But my point is, Boinc is jumping on these with June 6+ deadlines before it runs the remaining older May 9+ stuff already in my queue.

160+ tasks piling up in Pending just last 2 days - on top of the 160+ that were already in there before workfetch lit up again.

Just a puzzle is all.


Okay, I was confused. I see now you were talking about work "In Progress" and I have often wondered the same thing and also why a bunch of tasks that have that same "Sent" time have different deadlines. Perhaps someone more knowledgeable will stop by.
ID: 1667906 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1667912 - Posted: 21 Apr 2015, 3:41:20 UTC

Don't bother looking at what the recorded date was, they load files that they want to process, it might be from 2011 or 2015, and they don't come in order.
ID: 1667912 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1668014 - Posted: 21 Apr 2015, 11:39:29 UTC - in response to Message 1667906.  

4/20/2015 9:57:48 PM | SETI@home | Finished upload of 02se12ab.23690.11523.438086664197.12.66_0_0
=
Whole bunch of em with that extra _0

But my point is, Boinc is jumping on these with June 6+ deadlines before it runs the remaining older May 9+ stuff already in my queue.

160+ tasks piling up in Pending just last 2 days - on top of the 160+ that were already in there before workfetch lit up again.

Just a puzzle is all.


Okay, I was confused. I see now you were talking about work "In Progress" and I have often wondered the same thing and also why a bunch of tasks that have that same "Sent" time have different deadlines. Perhaps someone more knowledgeable will stop by.

The deadline is based on the workeunit angle range. A "standard" workunit, with an angle range centered around 0.40, had a deadline of about 6 weeks or so. Workunits with angle range below the standard angle range take a bit longer and have a deadline of about 8 weeks. You will see them flagged with .vlar in their name, for Very Low Angle Range. On the other side are workunits above the standard range that take less time to process & therefor have a shorter deadline of about 3 weeks. Also called VHARs or "shorties".

BOINC will process tasks in FIFO mode unless it thinks there will be a deadline issue. Then it will process tasks in order by deadline. You will also see the Status as Running, high priority. This often happen when we get a load of shorties from the data sets.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1668014 · Report as offensive
OTS
Volunteer tester

Send message
Joined: 6 Jan 08
Posts: 369
Credit: 20,533,537
RAC: 0
United States
Message 1668037 - Posted: 21 Apr 2015, 12:52:40 UTC - in response to Message 1668014.  

4/20/2015 9:57:48 PM | SETI@home | Finished upload of 02se12ab.23690.11523.438086664197.12.66_0_0
=
Whole bunch of em with that extra _0

But my point is, Boinc is jumping on these with June 6+ deadlines before it runs the remaining older May 9+ stuff already in my queue.

160+ tasks piling up in Pending just last 2 days - on top of the 160+ that were already in there before workfetch lit up again.

Just a puzzle is all.


Okay, I was confused. I see now you were talking about work "In Progress" and I have often wondered the same thing and also why a bunch of tasks that have that same "Sent" time have different deadlines. Perhaps someone more knowledgeable will stop by.

The deadline is based on the workeunit angle range. A "standard" workunit, with an angle range centered around 0.40, had a deadline of about 6 weeks or so. Workunits with angle range below the standard angle range take a bit longer and have a deadline of about 8 weeks. You will see them flagged with .vlar in their name, for Very Low Angle Range. On the other side are workunits above the standard range that take less time to process & therefor have a shorter deadline of about 3 weeks. Also called VHARs or "shorties".

BOINC will process tasks in FIFO mode unless it thinks there will be a deadline issue. Then it will process tasks in order by deadline. You will also see the Status as Running, high priority. This often happen when we get a load of shorties from the data sets.



That was very informative and answered a lot of questions. Thanks.
ID: 1668037 · Report as offensive
Profile JBird Project Donor
Avatar

Send message
Joined: 3 Sep 02
Posts: 297
Credit: 325,260,309
RAC: 549
United States
Message 1668084 - Posted: 21 Apr 2015, 15:01:39 UTC - in response to Message 1668014.  
Last modified: 21 Apr 2015, 15:47:48 UTC

Thanks HAL. That was unexpected. Had no idea that much went into it; nor that even such parameters were in play.
Just Nouveau Riche here celebrating my first Million marks on my single Host.
Still figuring out how things work and what if anything I can do about it (tweaking, tuning, troubleshooting)
All of you have been just Grand.
I appreciate it.
=
I'm *guessing then that the Pending units buildup is more related to the wingman background shuffle; and the possibility that I may have been an early recipient of this batch (after Outage reload) - somebody's gotta be first - wingmen will follow soon enough.

ID: 1668084 · Report as offensive
WezH
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 576
Credit: 67,033,957
RAC: 95
Finland
Message 1668118 - Posted: 21 Apr 2015, 16:15:36 UTC

What, no Tuesday Outrage yet?

And Panic Mode On after that yet :)
"Please keep Your signature under four lines so Internet traffic doesn't go up too much"

- In 1992 when I had my first e-mail address -
ID: 1668118 · Report as offensive
Profile Zalster Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 27 May 99
Posts: 5517
Credit: 528,817,460
RAC: 242
United States
Message 1668119 - Posted: 21 Apr 2015, 16:16:42 UTC - in response to Message 1668118.  

Still early over there. Only 9:16 in the morning

Give them time to get some coffee before they start
ID: 1668119 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1668122 - Posted: 21 Apr 2015, 16:19:08 UTC

Hope it goes well today.
We are doing shorties, and those GPU caches aren't going to last long.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1668122 · Report as offensive
WezH
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 576
Credit: 67,033,957
RAC: 95
Finland
Message 1668127 - Posted: 21 Apr 2015, 16:26:40 UTC

Time to update BIOS.

I Hope when I come back it is TO (Tuesday Outrage).

If not, then I succeeded to update it fast.

If not, then I failed to update it.
"Please keep Your signature under four lines so Internet traffic doesn't go up too much"

- In 1992 when I had my first e-mail address -
ID: 1668127 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1668128 - Posted: 21 Apr 2015, 16:29:00 UTC - in response to Message 1668127.  

Time to update BIOS.

I Hope when I come back it is TO (Tuesday Outrage).

If not, then I succeeded to update it fast.

If not, then I failed to update it.

Best of luck then.
My stuff is so old they don't do bios updates anymore...LOL.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1668128 · Report as offensive
Profile Donald L. Johnson
Avatar

Send message
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1668133 - Posted: 21 Apr 2015, 16:36:43 UTC - in response to Message 1668084.  

I'm *guessing then that the Pending units buildup is more related to the wingman background shuffle; and the possibility that I may have been an early recipient of this batch (after Outage reload) - somebody's gotta be first - wingmen will follow soon enough.

Yep, that's pretty much how it works. A lot of your Tasks, you are paired with a slower wingmate. They will get done eventually, and you will get credit when the Task validates.
Donald
Infernal Optimist / Submariner, retired
ID: 1668133 · Report as offensive
Profile JBird Project Donor
Avatar

Send message
Joined: 3 Sep 02
Posts: 297
Credit: 325,260,309
RAC: 549
United States
Message 1668201 - Posted: 21 Apr 2015, 23:11:10 UTC - in response to Message 1668133.  

Thanks Donald. Just a quick look as we reload and update post Outage here, 110 out of 120 (the first 6 pages of the Pendings) are my CUDA MBs received and ran over the last 3 days.
I usually only *investigate wingmen on much older lagging tasks and of course have seen what I've seen with those - abandons, timeouts, errors while and whatnot.
Only been doing this right at 4 months.
I imagine I haven't seen *everything yet!
Plenty to learn about these rhythms and patterns.
Trying to work *with it.

ID: 1668201 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1668230 - Posted: 22 Apr 2015, 1:41:39 UTC

Way to go on this weeks maintenance!

RTS never hit 0 and graphs indicate every thing is returning to normal.

Job well done Matt and the team!
ID: 1668230 · Report as offensive
Profile betreger Project Donor
Avatar

Send message
Joined: 29 Jun 99
Posts: 11360
Credit: 29,581,041
RAC: 66
United States
Message 1668256 - Posted: 22 Apr 2015, 3:01:25 UTC - in response to Message 1668230.  

Way to go on this weeks maintenance!

RTS never hit 0 and graphs indicate every thing is returning to normal.

Job well done Matt and the team!

And yet no APs, sigh.
ID: 1668256 · Report as offensive
Wild6-NJ
Volunteer tester

Send message
Joined: 4 Aug 99
Posts: 43
Credit: 100,336,791
RAC: 140
Message 1668445 - Posted: 22 Apr 2015, 14:08:53 UTC

SAH v7 assimilators are offline and the assimilation queue is building.
Hope we don't have database issues again.
ID: 1668445 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1668451 - Posted: 22 Apr 2015, 14:16:10 UTC - in response to Message 1668445.  

SAH v7 assimilators are offline and the assimilation queue is building.
Hope we don't have database issues again.



Yea its a bit strange to see that they recovered nicely after maintenance then shut down.
ID: 1668451 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1668453 - Posted: 22 Apr 2015, 14:18:17 UTC - in response to Message 1668451.  

SAH v7 assimilators are offline and the assimilation queue is building.
Hope we don't have database issues again.



Yea its a bit strange to see that they recovered nicely after maintenance then shut down.

Never a dull moment around here....LOL.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1668453 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1668457 - Posted: 22 Apr 2015, 14:33:54 UTC

So there are 12 sah assimilator (v7) processes now? I don't recall there being so many previously. I wonder if that is related.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1668457 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1668458 - Posted: 22 Apr 2015, 14:35:45 UTC - in response to Message 1668457.  
Last modified: 22 Apr 2015, 14:40:40 UTC

So there are 12 sah assimilator (v7) processes now? I don't recall there being so many previously. I wonder if that is related.

Maybe Matt fired some extra ones up when working on that huge backlog we had....

Meanwhile, the fact that they are all shut down could point to another DB problem, as they are on both vader and georgem, so it's not just one server locking up.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1668458 · Report as offensive
Previous · 1 . . . 5 · 6 · 7 · 8 · 9 · 10 · 11 . . . 33 · Next

Message boards : Number crunching : Panic Mode On (97) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.