Panic Mode On (85) Server Problems?

Message boards : Number crunching : Panic Mode On (85) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 . . . 23 · Next

AuthorMessage
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1394714 - Posted: 25 Jul 2013, 15:22:19 UTC - in response to Message 1394703.  

There's a certain type of tasks that I haven't seen for a couple of days now. I refuse to whine and bitch though.

Are you sure?


Almost....:-)

LOL, Sten-Arne.

I have about 240 of your precious still working on my CPUs.

That's between 9 rigs.

They are coming along.....

Meow meow.

Well it can't be AP tasks he hasn't seen. Since 2 of his machines have been getting them everyday for the past week or so.
I think I may have to wait until I finish processing all 267 of the ones I have before I return any of them. :)

What kills me is what Boinc does with AP.

My daily driver has about 10 started. One of which has about 18 hours invested and 98% completed. And yet it goes off on a lark and starts another. Due 8/8.
I don't even want to look at my underachievers....LOL.

Mine were doing that, but I changed my queue vales back to how they were for BOINC 6. So I have Maintain 0 additional 10. With the task limits this is fine since some of my machines are to slow to get 100 task for a 10 day cache.
However they don't have the issue of thinking they will run into a deadline and start other tasks. As the queue "Maintain" value is still used in the BOINC client scheduler as it was when the value was labeled "connect every xx days".

Last I recall, my Boincs were set to grab everything, at any time.

I have not looked at it in ages, since the server side limits have made most cache settings moot. A hundred tasks on a multi GPU rig do not make a 10 day cache. LOL, or not. Maybe a few hours.

"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1394714 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1394757 - Posted: 25 Jul 2013, 16:16:50 UTC - in response to Message 1394714.  

A hundred tasks on a multi GPU rig do not make a 10 day cache. LOL, or not. Maybe a few hours.

A 100 MB WU cache not even least for 6 hrs in a 2x690 host... even less if they are shorties...

ID: 1394757 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1394775 - Posted: 25 Jul 2013, 17:08:29 UTC - in response to Message 1394757.  

A hundred tasks on a multi GPU rig do not make a 10 day cache. LOL, or not. Maybe a few hours.

A 100 MB WU cache not even least for 6 hrs in a 2x690 host... even less if they are shorties...

Yah.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1394775 · Report as offensive
.clair.

Send message
Joined: 4 Nov 04
Posts: 1300
Credit: 55,390,408
RAC: 69
United Kingdom
Message 1394937 - Posted: 25 Jul 2013, 23:14:25 UTC - in response to Message 1394638.  

There's a certain type of tasks that I haven't seen for a couple of days now. I refuse to whine and bitch though.

In that case i will have a whine and bitch about not geting any vlar for my 7970 rig for months,
Oh, better not, you will think i have gone mad,
did i forget to take my tablets today ?
one, two, three, ten !! whatever . . . .
ID: 1394937 · Report as offensive
Profile betreger Project Donor
Avatar

Send message
Joined: 29 Jun 99
Posts: 11361
Credit: 29,581,041
RAC: 66
United States
Message 1395003 - Posted: 26 Jul 2013, 1:27:02 UTC

Much more important than bitching about Credit New, which we can do nothing about, is where are the APs? It has been days and we can do nothing about that either.
ID: 1395003 · Report as offensive
bill

Send message
Joined: 16 Jun 99
Posts: 861
Credit: 29,352,955
RAC: 0
United States
Message 1395004 - Posted: 26 Jul 2013, 1:55:23 UTC - in response to Message 1395003.  

Doesn't the backup on the MBs have
to be done first?
ID: 1395004 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1395095 - Posted: 26 Jul 2013, 7:54:32 UTC - in response to Message 1395004.  

Doesn't the backup on the MBs have
to be done first?

Indeed. It is unusual to have that many data sets (known as "tapes") loaded at one time, and since AP blasts through much faster than MB.. we will probably have to wait until there are only 10-15 tapes left before more get loaded.

It might be a week or two.

A lot of it also depends on how much storage space is available on the servers, too. I mean, 50gb per tape times...42 = 2150 GB. Sure, that may not sound like too terribly much, but then you have the WUs themselves that get split out of it, and those add up, as well. For every WU that has not validated yet, there is a file on disk for it until it validates and moves on to deletion.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1395095 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1395505 - Posted: 27 Jul 2013, 5:53:50 UTC - in response to Message 1395095.  
Last modified: 27 Jul 2013, 5:58:55 UTC

Doesn't the backup on the MBs have
to be done first?

Indeed. It is unusual to have that many data sets (known as "tapes") loaded at one time, and since AP blasts through much faster than MB.. we will probably have to wait until there are only 10-15 tapes left before more get loaded.

It might be a week or two.

A lot of it also depends on how much storage space is available on the servers, too. I mean, 50gb per tape times...42 = 2150 GB. Sure, that may not sound like too terribly much, but then you have the WUs themselves that get split out of it, and those add up, as well. For every WU that has not validated yet, there is a file on disk for it until it validates and moves on to deletion.

I have now about 161 APs in play.

They are dropping down, but not too rapidly, as most are assinged to CPUs.

And on CPUs, they take anywhere up to 20 hours to complete.

I do them all, kids. Whatever it takes. The kitties do them all.
No favorites here.
The servers send it, we do it.
That is what the project asks, that is what we do.
I have not abandoned or otherwise rejected a task in more years than I can tell you. It's just something I do not do.
Even during the vlar attack after the v7 changeover.
I did not take out even one.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1395505 · Report as offensive
Profile ivan
Volunteer tester
Avatar

Send message
Joined: 5 Mar 01
Posts: 783
Credit: 348,560,338
RAC: 223
United Kingdom
Message 1395582 - Posted: 27 Jul 2013, 14:27:53 UTC - in response to Message 1394937.  

There's a certain type of tasks that I haven't seen for a couple of days now. I refuse to whine and bitch though.

In that case i will have a whine and bitch about not geting any vlar for my 7970 rig for months,
Oh, better not, you will think i have gone mad,
did i forget to take my tablets today ?
one, two, three, ten !! whatever . . . .

Breakfast! :-)
ID: 1395582 · Report as offensive
Profile betreger Project Donor
Avatar

Send message
Joined: 29 Jun 99
Posts: 11361
Credit: 29,581,041
RAC: 66
United States
Message 1396683 - Posted: 30 Jul 2013, 22:06:26 UTC

As near as I can tell it has been a week with no APs being split. I hunger for an AP.
ID: 1396683 · Report as offensive
bill

Send message
Joined: 16 Jun 99
Posts: 861
Credit: 29,352,955
RAC: 0
United States
Message 1396686 - Posted: 30 Jul 2013, 22:08:47 UTC - in response to Message 1396683.  

Probably Friday at the earliest.
ID: 1396686 · Report as offensive
Profile betreger Project Donor
Avatar

Send message
Joined: 29 Jun 99
Posts: 11361
Credit: 29,581,041
RAC: 66
United States
Message 1396698 - Posted: 30 Jul 2013, 22:22:40 UTC - in response to Message 1396694.  

OK, it's worse than I thought.
ID: 1396698 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1396745 - Posted: 30 Jul 2013, 23:52:47 UTC

My cache is definitely slowly dwindling, but I'm hanging on with occasional re-sends. In about 24 hours, I will have run through the last of my _0 and _1 APs and start getting into the cache of re-sends. From there, if I get no more, 5 days or so. So realistically.. I'm empty and idle in about 7 days from now, but I'm sure I'll get a couple more re-sends before then, extending it out another day or two, depending on how many I get.

No panic yet, and no complaining ever. It is my choice to go AP-only, and you have to take the good with the bad.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1396745 · Report as offensive
Thomas
Volunteer tester

Send message
Joined: 9 Dec 11
Posts: 1499
Credit: 1,345,576
RAC: 0
France
Message 1396875 - Posted: 31 Jul 2013, 5:57:51 UTC

Can't you crunching MB's instead of AP's during these lean times ? It's always better to crunching all WU's.... There's WU's for everyone and we would never fall down... Maybe ;)
ID: 1396875 · Report as offensive
Speedy
Volunteer tester
Avatar

Send message
Joined: 26 Jun 04
Posts: 1643
Credit: 12,921,799
RAC: 89
New Zealand
Message 1396913 - Posted: 31 Jul 2013, 8:17:25 UTC

I'm not having a go at anyone. I thought the idea of this thread was to talk about server related problems? For example if download/upload speeds are slow. How come at some point in the thread does the topic of APs come up? This topic was also reasonably prominent in panic mode thread 84. Or am I missing something somewhere is this related to a server problem?
If you would like to talk about that that is fine but can I suggest you start a thread dedicated to it?
Thank you for taking the time to read this [/quote]
ID: 1396913 · Report as offensive
Profile arkayn
Volunteer tester
Avatar

Send message
Joined: 14 May 99
Posts: 4438
Credit: 55,006,323
RAC: 0
United States
Message 1396998 - Posted: 31 Jul 2013, 15:23:23 UTC - in response to Message 1396913.  

I'm not having a go at anyone. I thought the idea of this thread was to talk about server related problems? For example if download/upload speeds are slow. How come at some point in the thread does the topic of APs come up? This topic was also reasonably prominent in panic mode thread 84. Or am I missing something somewhere is this related to a server problem?
If you would like to talk about that that is fine but can I suggest you start a thread dedicated to it?
Thank you for taking the time to read this
[/quote]

Not really, it is more of a central Bitching & Moaning thread and can be used for other things as well.

As such there is not all that much off topic in this thread except for politics.

ID: 1396998 · Report as offensive
Profile Dimly Lit Lightbulb 😀
Volunteer tester
Avatar

Send message
Joined: 30 Aug 08
Posts: 15399
Credit: 7,423,413
RAC: 1
United Kingdom
Message 1397060 - Posted: 31 Jul 2013, 17:16:12 UTC - in response to Message 1396875.  

Can't you crunching MB's instead of AP's during these lean times ? It's always better to crunching all WU's.... There's WU's for everyone and we would never fall down... Maybe ;)

Personally I crunch astropulse only due to the length of them, one task and that's 14-16 hrs of crunching, excluding the B3_P1 & B6_P0 "shorties". Currently my cache has dwindled to a solitary task :(

Member of the People Encouraging Niceness In Society club.

ID: 1397060 · Report as offensive
Speedy
Volunteer tester
Avatar

Send message
Joined: 26 Jun 04
Posts: 1643
Credit: 12,921,799
RAC: 89
New Zealand
Message 1397254 - Posted: 31 Jul 2013, 21:05:34 UTC - in response to Message 1396998.  
Last modified: 31 Jul 2013, 21:07:29 UTC

I'm not having a go at anyone. I thought the idea of this thread was to talk about server related problems? For example if download/upload speeds are slow. How come at some point in the thread does the topic of APs come up? This topic was also reasonably prominent in panic mode thread 84. Or am I missing something somewhere is this related to a server problem?
If you would like to talk about that that is fine but can I suggest you start a thread dedicated to it?
Thank you for taking the time to read this


Thanks Atkayn for clarifying what this thread can be used for.
ID: 1397254 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1397459 - Posted: 1 Aug 2013, 7:11:48 UTC - in response to Message 1397060.  

Can't you crunching MB's instead of AP's during these lean times ? It's always better to crunching all WU's.... There's WU's for everyone and we would never fall down... Maybe ;)

Personally I crunch astropulse only due to the length of them, one task and that's 14-16 hrs of crunching, excluding the B3_P1 & B6_P0 "shorties". Currently my cache has dwindled to a solitary task :(

Same here. It's easier to keep up with burning through 6 tasks per 24 hours, and keeps the client_state file relatively small. Of course with the bigger pipe to contact the servers, that's not much of an issue anymore as it was in the past. The mega crunchers (before the limits) would have client_state's that were upwards of 10MiB! That has to get sent to the server on every scheduler contact (was my understanding of it back then..maybe it's different now).

So even though the data files for APs are much larger and take up more disk space and use a lot more bandwidth to transfer, it keeps the client_state smaller, and also helps to keep your tasks page on the website (and more importantly, for the database) shorter, and the tasks list in BOINC Manager shorter, too.

The database performance issues are the reason for the limits anyway, and the way I see it, having fewer tasks is me doing my part to help the database just that little tiny bit... and also because of CPU-only and 6/day, I can hold a 10-day cache and not be affected by the limits, so that's a plus.

Even though there is no downtime, I'm still able to keep my CPUs busy during this dark period of no new APs being split, all thanks to a 10-day cache that has turned into 15-18 thanks to the re-sends I've been able to scrounge up.

That's how I see it.

There.. see? Did my B&M-ing as well as threw legitimate server-related facts into it. :p
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1397459 · Report as offensive
Profile Link
Avatar

Send message
Joined: 18 Sep 03
Posts: 834
Credit: 1,807,369
RAC: 0
Germany
Message 1397587 - Posted: 1 Aug 2013, 16:14:32 UTC - in response to Message 1397459.  

The mega crunchers (before the limits) would have client_state's that were upwards of 10MiB! That has to get sent to the server on every scheduler contact (was my understanding of it back then..maybe it's different now).

The client_state.xml was never sent to the servers, it has always been the sched_request_[projectURL].xml, which contains much less information than the client_state and therefore is also much smaller.
ID: 1397587 · Report as offensive
Previous · 1 · 2 · 3 · 4 · 5 . . . 23 · Next

Message boards : Number crunching : Panic Mode On (85) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.