Panic Mode On (84) Server Problems?

JohnDK Crowdfunding Project Donor*Special Project $250 donor
Volunteer tester
Joined: 28 May 00
Posts: 1222
Credit: 451,243,443
RAC: 1,127
Denmark
Message 1386431 - Posted: 1 Jul 2013, 13:39:57 UTC - in response to Message 1386421.  

No new APs being split for a long time now. I'm beginning to panic, getting nervous.

Please, please.....


Same near panic here... If I read the status page right, there seems to be a lot of MBs being split before we get APs again.
ID: 1386431 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Joined: 7 Mar 03
Posts: 22540
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1386547 - Posted: 1 Jul 2013, 18:43:04 UTC

Quite simple - it takes a lot less time to split the APs off a tape than to split the MBs off the same tape.
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1386547 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 9 Jul 00
Posts: 51478
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1386573 - Posted: 1 Jul 2013, 19:45:46 UTC - in response to Message 1386547.  
Last modified: 1 Jul 2013, 19:46:26 UTC

Quite simple - it takes a lot less time to split the APs off a tape than to split the MBs off the same tape.

There are also a lot more folks doing AP on GPU since the v7 and Lunatics installer rollouts. So the APs get crunched and returned faster than they used to be, hence they do not last as long when they are available.

The kitties were not doing them on GPU until the Lunatics release.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1386573 · Report as offensive
Wiggo
Joined: 24 Jan 00
Posts: 36875
Credit: 261,360,520
RAC: 489
Australia
Message 1386600 - Posted: 1 Jul 2013, 21:16:30 UTC - in response to Message 1386573.  

More like more people are doing APs so that they can get a better RAC happening.

Don't worry Sten, I'll keep trying my best to chew through those MBs so more files can be loaded.

Cheers.
ID: 1386600 · Report as offensive
Lionel

Joined: 25 Mar 00
Posts: 680
Credit: 563,640,304
RAC: 597
Australia
Message 1386611 - Posted: 1 Jul 2013, 21:35:33 UTC - in response to Message 1386600.  

Wiggo, you are dead right there.

There is another side effect as well. For example, assuming I only do AP, if I only get AP for half the time, then I can maintain a RAC equivalent to crunching MB only (on a monthly basis), but with circa half the power usage due to idle GPUs and CPUs.
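
To put rough numbers on that, here is a minimal sketch. The 2:1 AP-to-MB credit ratio and the 1,000 credits/day figure are assumptions chosen to match the claim, not measured values, and idle power draw is ignored.

```python
def monthly_credit(rate_per_day: float, active_fraction: float, days: int = 30) -> float:
    """Credit earned over `days` when crunching only `active_fraction` of the time."""
    return rate_per_day * active_fraction * days


# Hypothetical figures: AP is assumed to pay twice the daily credit of MB.
MB_RATE = 1000.0
AP_RATE = 2.0 * MB_RATE

mb_only = monthly_credit(MB_RATE, active_fraction=1.0)   # MB work always available
ap_half = monthly_credit(AP_RATE, active_fraction=0.5)   # AP available half the time

# Energy relative to running flat out, ignoring what idle hardware still draws.
relative_energy = 0.5 / 1.0

print(mb_only, ap_half, relative_energy)
```

Under those assumptions both setups land on the same monthly credit, while the AP-only host is busy for half the hours.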




ID: 1386611 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 9 Jul 00
Posts: 51478
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1386617 - Posted: 1 Jul 2013, 21:57:51 UTC - in response to Message 1386600.  

More like more people are doing APs so that they can get a better RAC happening.

Don't worry Sten, I'll keep trying my best to chew through those MBs so more files can be loaded.

Cheers.

Don't worry indeed.

The kitties are of course munching their way through a ton of MB work as well.
I do not have any of my rigs set up AP-only.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1386617 · Report as offensive
Cosmic_Ocean
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1386619 - Posted: 1 Jul 2013, 22:01:54 UTC

I've been getting lots of re-sends for APs to clean up that ATI fiasco. So far, re-sends are enough to satisfy my cache's demands. In fact, I just got a _7 task earlier.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving up)
ID: 1386619 · Report as offensive
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13855
Credit: 208,696,464
RAC: 304
Australia
Message 1386767 - Posted: 2 Jul 2013, 8:05:03 UTC - in response to Message 1386619.  


Been scratching my head lately over the download network traffic. There have been some large surges, and now some sustained traffic levels, even though there is no AP work going out, no increase in the amount of MB work in progress, nor a decline in the average turnaround time.
Grant
Darwin NT
ID: 1386767 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 9 Jul 00
Posts: 51478
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1386788 - Posted: 2 Jul 2013, 10:25:10 UTC - in response to Message 1386767.  


Been scratching my head lately over the download network traffic. There have been some large surges, and now some sustained traffic levels, even though there is no AP work going out, no increase in the amount of MB work in progress, nor a decline in the average turnaround time.

I have noticed that myself, and was just as curious as you are. With the way v7 runs, I don't know if we shall really see another true 'shorty storm'. And even if we did have a minor one, the effects on the bandwidth would not have such a marked start and stop.

Only guess would be retrieving some information from the servers back to the lab for archiving purposes? Dunno.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1386788 · Report as offensive
HAL9000
Volunteer tester
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1386820 - Posted: 2 Jul 2013, 14:19:12 UTC - in response to Message 1386788.  


Been scratching my head lately over the download network traffic. There have been some large surges, and now some sustained traffic levels, even though there is no AP work going out, no increase in the amount of MB work in progress, nor a decline in the average turnaround time.

I have noticed that myself, and was just as curious as you are. With the way v7 runs, I don't know if we shall really see another true 'shorty storm'. And even if we did have a minor one, the effects on the bandwidth would not have such a marked start and stop.

Only guess would be retrieving some information from the servers back to the lab for archiving purposes? Dunno.

Looking at my results, shorties are taking about 45% as long to run as normal AR tasks. I expect that if we get a bunch of 'tapes' that make nothing but shorties, we will be able to tell on the b/w graph.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[/url]
ID: 1386820 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 9 Jul 00
Posts: 51478
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1386845 - Posted: 2 Jul 2013, 18:43:24 UTC - in response to Message 1386840.  

I wonder if the creation rate could be related to file 20jn12ac as it has been in its current state for way longer than is usual.

Cheers.


And 12 days later, 20jn12ac is still in the same sorry state. Someone is asleep at the helm.

As far as I know, that stuck dataset holds up 1 MB splitter. And since MB splitting has been keeping up with demand, maybe that's why it's not been dealt with. Although the rationale for leaving it in its stuck state evades me...LOL.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1386845 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Joined: 7 Mar 03
Posts: 22540
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1386854 - Posted: 2 Jul 2013, 19:17:24 UTC

I think Mark's right - as it's not causing any delays in the MB production, it's never reached the top of someone's tuit list.
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1386854 · Report as offensive
Donald L. Johnson
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1386893 - Posted: 2 Jul 2013, 21:33:53 UTC - in response to Message 1386845.  

I wonder if the creation rate could be related to file 20jn12ac as it has been in its current state for way longer than is usual.

Cheers.


And 12 days later, 20jn12ac is still in the same sorry state. Someone is asleep at the helm.

As far as I know, that stuck dataset holds up 1 MB splitter. And since MB splitting has been keeping up with demand, maybe that's why it's not been dealt with. Although the rationale for leaving it in its stuck state evades me...LOL.

No it isn't. No splitter has run on that file since sometime last week. They blocked it somehow, and just have not dismounted it yet.
Donald
Infernal Optimist / Submariner, retired
ID: 1386893 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 9 Jul 00
Posts: 51478
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1386895 - Posted: 2 Jul 2013, 21:41:25 UTC - in response to Message 1386893.  

I wonder if the creation rate could be related to file 20jn12ac as it has been in its current state for way longer than is usual.

Cheers.


And 12 days later, 20jn12ac is still in the same sorry state. Someone is asleep at the helm.

As far as I know, that stuck dataset holds up 1 MB splitter. And since MB splitting has been keeping up with demand, maybe that's why it's not been dealt with. Although the rationale for leaving it in its stuck state evades me...LOL.

No it isn't. No splitter has run on that file since sometime last week. They blocked it somehow, and just have not dismounted it yet.

How do you know that if it's stuck, Donald?
I was told by Eric that if a dataset gets stuck, it ties up the splitter working on it. I was later told that Matt had restarted it. But it would appear that it got stuck again. Just curious why you think the splitters are all dancing around it and that it's not still stuck tying up a splitter.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1386895 · Report as offensive
Jean Labrecque

Joined: 4 Jun 03
Posts: 77
Credit: 14,518,927
RAC: 0
Canada
Message 1386936 - Posted: 2 Jul 2013, 23:33:23 UTC

Look at the color chart: no channel on that file is shown as in progress, so it is not being split at the moment.
ID: 1386936 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22540
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1387024 - Posted: 3 Jul 2013, 4:46:33 UTC

While not being split it is tying up a splitter doing nothing.
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1387024 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 9 Jul 00
Posts: 51478
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1387029 - Posted: 3 Jul 2013, 4:53:23 UTC - in response to Message 1387024.  

While not being split it is tying up a splitter doing nothing.

Or IS it at the moment?
Not really sure.
JEAN makes a good point...
We have 6 active splitters, and the SSP shows 6 datasets color coded as 'in progress'. The stuck dataset does not show that.

Although I know everything on the SSP is not always as it appears.
And don't have any idea if one splitter only works on one dataset at a time by itself.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1387029 · Report as offensive
Donald L. Johnson
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1387054 - Posted: 3 Jul 2013, 6:02:01 UTC - in response to Message 1386936.  
Last modified: 3 Jul 2013, 6:04:29 UTC

Look at the color chart: no channel on that file is shown as in progress, so it is not being split at the moment.

Exactly. There are no dark green "in progress" bands on that file, and have not been since about Wednesday last week. There was one when it was stuck earlier. That's why I think Eric or Matt "blocked" that file from being split.

And don't have any idea if one splitter only works on one dataset at a time by itself.

I have seen as many as 9 channels "in progress" on MB, so at least some of the splitters can do 2 channels at once.
Donald
Infernal Optimist / Submariner, retired
ID: 1387054 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Joined: 9 Jul 00
Posts: 51478
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1387055 - Posted: 3 Jul 2013, 6:03:32 UTC - in response to Message 1387054.  

Look at the color chart: no channel on that file is shown as in progress, so it is not being split at the moment.

Exactly. There are no dark green "in progress" bands on that file, and have not been since about Wednesday last week. There was one when it was stuck earlier. That's why I think Eric or Matt "blocked" that file from being split.

And ya just could be correct, my friend.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 1387055 · Report as offensive
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13855
Credit: 208,696,464
RAC: 304
Australia
Message 1387298 - Posted: 3 Jul 2013, 18:04:00 UTC - in response to Message 1387283.  


I'm thinking with the network bandwidth we have now, and no end in sight for the server-side limits, they could probably tweak the AP/MB ratio of work being split & pump out more AP these days.
Grant
Darwin NT
ID: 1387298 · Report as offensive


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.