Panic Mode On (16) Server problems

Message boards : Number crunching : Panic Mode On (16) Server problems
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 . . . 10 · Next

AuthorMessage
FiveHamlet
Avatar

Send message
Joined: 5 Oct 99
Posts: 783
Credit: 32,638,578
RAC: 0
United Kingdom
Message 902099 - Posted: 31 May 2009, 17:24:33 UTC
Last modified: 31 May 2009, 17:53:11 UTC

Sorry to my wingmen.
I lost a lot of Cuda tasks when windows updated.
Normally have auto update off don't know what happened.
Once again sorry.

EDIT: Checked some of the WU's and they were VLAR's 200 were auto killed.
They should be sent as 6.03
ID: 902099 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51522
Credit: 1,018,363,574
RAC: 1,004
United States
Message 902101 - Posted: 31 May 2009, 17:28:25 UTC - in response to Message 902095.  

When all else fails...........and the kitties cannot get what they want for their kibble bowl............you will have lost one cruncher............

Mark,

I know the kitties expect the bowl to be kept completely full at all time, but as long as the bowl is not empty, they'll be able to eat.

It's only a problem when the kitties start to starve.

Also, please realize that a big part of this discussion is how to best keep food in the bowl -- and in every other bowl -- as much as possible.

-- Ned

Ned......don't try to coddle me..........
I only look after my kitties' best interests.........not yours or anybody else's/

Got it???

Sorry Ned.......that was rather rude.
But to the point.......I did not mean to be so brash.

The project should feed all the kitties, not just the ones who yeowl at the bowl. They can't cater to some "fat cats" and let the rest starve.

Got it?
The kitties that yeowl at the bowl speak for those who can or will not.......

I am the voice of kitties far and wide.....and speak for those who do not stand up for themselves. I am not always right, for sure. But at least I stand up and speak my peace.

Got it?

At the beginning of this thread you said:
When all else fails...........and the kitties cannot get what they want for their kibble bowl............you will have lost one cruncher............

In other words, if you don't get yours, you will quit.

In this post, you said:
I am the voice of kitties far and wide.....and speak for those who do not stand up for themselves. I am not always right, for sure. But at least I stand up and speak my peace.

Either you are demanding that your cache stay full and to heck with everyone else, or you are in favor of equitable distribution of work. Can't be both.

When we started a technical discussion of how to best keep everyone working, that was when you threatened to quit. You said you only care about your kitties, no one elses.

Maybe I missed something, but it sounds like you don't care how many kitties go hungry as long as your "fat cats" have bowls completely full at all time.

Then you say "I speak for those who can't speak for themselves" -- which seems to say that you're for equitable distribution -- kibble for everyone.

Please pick one position.

In the meantime, the technical discussion is about the best way to allocate work: how to efficiently and quickly keep at least some work in every queue without anyone going without.

OK......I'll back down....
I am a bit adamant that the kitties get what they want, but I guess this should be about what is best for the project.
I am very proud of what I have accomplished here, and would like it to remain so..
But I guess it's not all about me, now is it?

I have to stand down and realize that if the kitties are having trouble getting their kibble bowls full, it surely is affecting every other participant in the project as well.

Sorry I got so verbose.....
The kitties will share.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 902101 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 902110 - Posted: 31 May 2009, 18:13:04 UTC - in response to Message 902101.  


In the meantime, the technical discussion is about the best way to allocate work: how to efficiently and quickly keep at least some work in every queue without anyone going without.

OK......I'll back down....
I am a bit adamant that the kitties get what they want, but I guess this should be about what is best for the project.
I am very proud of what I have accomplished here, and would like it to remain so..
But I guess it's not all about me, now is it?

I have to stand down and realize that if the kitties are having trouble getting their kibble bowls full, it surely is affecting every other participant in the project as well.

Sorry I got so verbose.....
The kitties will share.

My thinking is that, whenever possible, no kitty should ever have an empty bowl.

Now, they may want the bowl full, and that is probably possible most of the time.

If we work from a "no queue is ever left empty" instead of "top up a queue at every opportunity" that should work better when things are slow, and end up with full queues most of the time.

At least it seems like that'd work.
ID: 902110 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51522
Credit: 1,018,363,574
RAC: 1,004
United States
Message 902114 - Posted: 31 May 2009, 18:22:54 UTC - in response to Message 902110.  


In the meantime, the technical discussion is about the best way to allocate work: how to efficiently and quickly keep at least some work in every queue without anyone going without.

OK......I'll back down....
I am a bit adamant that the kitties get what they want, but I guess this should be about what is best for the project.
I am very proud of what I have accomplished here, and would like it to remain so..
But I guess it's not all about me, now is it?

I have to stand down and realize that if the kitties are having trouble getting their kibble bowls full, it surely is affecting every other participant in the project as well.

Sorry I got so verbose.....
The kitties will share.

My thinking is that, whenever possible, no kitty should ever have an empty bowl.

Now, they may want the bowl full, and that is probably possible most of the time.

If we work from a "no queue is ever left empty" instead of "top up a queue at every opportunity" that should work better when things are slow, and end up with full queues most of the time.

At least it seems like that'd work.

Well...although the AP splitters have run out of data to split again, the kitties managed to get enough kibble in their bowls over the last couple of days to keep them happy for a bit. Maybe they can snag a few resends here and there to help a little.

If it really comes down to it, I'll just have to adjust their diet to include a portion of MB work rather than switch to another project...

And, as the situation is not likely to change as far as I can tell, I would suggest that new users default to MB only unless they manually choose to add AP work to their preferences. Leave the AP work for those who want it and know that their rigs are fast enough to process it effectively.

"Time is simply the mechanism that keeps everything from happening all at once."

ID: 902114 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 902121 - Posted: 31 May 2009, 18:36:09 UTC - in response to Message 902114.  


My thinking is that, whenever possible, no kitty should ever have an empty bowl.

Now, they may want the bowl full, and that is probably possible most of the time.

If we work from a "no queue is ever left empty" instead of "top up a queue at every opportunity" that should work better when things are slow, and end up with full queues most of the time.

At least it seems like that'd work.

Well...although the AP splitters have run out of data to split again, the kitties managed to get enough kibble in their bowls over the last couple of days to keep them happy for a bit. Maybe they can snag a few resends here and there to help a little.

If it really comes down to it, I'll just have to adjust their diet to include a portion of MB work rather than switch to another project...

And, as the situation is not likely to change as far as I can tell, I would suggest that new users default to MB only unless they manually choose to add AP work to their preferences. Leave the AP work for those who want it and know that their rigs are fast enough to process it effectively.

The Lunatics unified installer is pretty cool. I'm using it to set up my crunchers here, and they're set up for AP and Multibeam.

My preferences are for Astropulse -- and to take whatever is available.

So my main cruncher has nearly all Multibeam right now, but hey, all work needs to be done, and the project never promised us there would always be work.
ID: 902121 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51522
Credit: 1,018,363,574
RAC: 1,004
United States
Message 902124 - Posted: 31 May 2009, 18:43:35 UTC - in response to Message 902121.  


My thinking is that, whenever possible, no kitty should ever have an empty bowl.

Now, they may want the bowl full, and that is probably possible most of the time.

If we work from a "no queue is ever left empty" instead of "top up a queue at every opportunity" that should work better when things are slow, and end up with full queues most of the time.

At least it seems like that'd work.

Well...although the AP splitters have run out of data to split again, the kitties managed to get enough kibble in their bowls over the last couple of days to keep them happy for a bit. Maybe they can snag a few resends here and there to help a little.

If it really comes down to it, I'll just have to adjust their diet to include a portion of MB work rather than switch to another project...

And, as the situation is not likely to change as far as I can tell, I would suggest that new users default to MB only unless they manually choose to add AP work to their preferences. Leave the AP work for those who want it and know that their rigs are fast enough to process it effectively.

The Lunatics unified installer is pretty cool. I'm using it to set up my crunchers here, and they're set up for AP and Multibeam.

My preferences are for Astropulse -- and to take whatever is available.

So my main cruncher has nearly all Multibeam right now, but hey, all work needs to be done, and the project never promised us there would always be work.

My rigs still have the opti MB app installed, so if I have to do some MB work, all I have to do is edit my preferences to allow Boinc to fetch it.....
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 902124 · Report as offensive
Profile Blurf
Volunteer tester

Send message
Joined: 2 Sep 06
Posts: 8964
Credit: 12,678,685
RAC: 0
United States
Message 902135 - Posted: 31 May 2009, 19:30:36 UTC

FYI Eric has been questioned on raising MB credit values and last I remember said no, so it is a moot point.


ID: 902135 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51522
Credit: 1,018,363,574
RAC: 1,004
United States
Message 902136 - Posted: 31 May 2009, 19:35:10 UTC - in response to Message 902135.  

FYI Eric has been questioned on raising MB credit values and last I remember said no, so it is a moot point.

Not surprising that he would not want to rock the boat....
It was just a thought.
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 902136 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 902140 - Posted: 31 May 2009, 19:43:04 UTC - in response to Message 902135.  

FYI Eric has been questioned on raising MB credit values and last I remember said no, so it is a moot point.

I think we're forgetting Eric's server-side credit multiplier.

The problem is that credit is defined in terms of benchmark * time, but calculated based on FLOPs.

His script samples recently returned work, finds a median work unit, calculates the FLOPs credit and the Benchmark * Time credit and adjusts a multiplier based on a 30 day average.

That should tie credit back to the definition of a cobblestone, and it should be about right -- eventually.

It will never be perfect because some processors do some instructions faster than others -- and it's different with each new chip.
ID: 902140 · Report as offensive
-Bert-

Send message
Joined: 23 Mar 02
Posts: 152
Credit: 412,754
RAC: 0
Netherlands
Message 903104 - Posted: 3 Jun 2009, 4:12:34 UTC - in response to Message 902342.  

Not possible to upload at the moment.
ID: 903104 · Report as offensive
-Bert-

Send message
Joined: 23 Mar 02
Posts: 152
Credit: 412,754
RAC: 0
Netherlands
Message 903117 - Posted: 3 Jun 2009, 5:14:32 UTC - in response to Message 903112.  

I was just 'panicing' :)
ID: 903117 · Report as offensive
Profile [KWSN]John Galt 007
Volunteer tester
Avatar

Send message
Joined: 9 Nov 99
Posts: 2444
Credit: 25,086,197
RAC: 0
United States
Message 903800 - Posted: 5 Jun 2009, 0:59:43 UTC

ID: 903800 · Report as offensive
Profile Byron S Goodgame
Volunteer tester
Avatar

Send message
Joined: 16 Jan 06
Posts: 1145
Credit: 3,936,993
RAC: 0
United States
Message 903810 - Posted: 5 Jun 2009, 1:36:54 UTC
Last modified: 5 Jun 2009, 1:44:52 UTC

Two AP splitters up and running. Yea

Data Distribution State ......SETI@home # Astropulse # As of*
Results ready to send.......... 86,113 .............5 .........0m

Edit" totals
Edit 2: oops, one dropped off, now only one.
ID: 903810 · Report as offensive
Profile cliff west

Send message
Joined: 7 May 01
Posts: 211
Credit: 16,180,728
RAC: 15
United States
Message 903849 - Posted: 5 Jun 2009, 3:31:46 UTC

well i just update software (.31), new hardware (gtx 260) for cuda, and up my cashe to 4 days. so far

6/4/2009 8:29:08 PM SETI@home Reporting 1 completed tasks, requesting new tasks
6/4/2009 8:29:13 PM SETI@home Scheduler request completed: got 0 new tasks
6/4/2009 8:29:13 PM SETI@home Message from server: No work sent
6/4/2009 8:29:28 PM SETI@home Sending scheduler request: To fetch work.
6/4/2009 8:29:28 PM SETI@home Requesting new tasks
6/4/2009 8:29:33 PM SETI@home Scheduler request completed: got 0 new tasks
6/4/2009 8:29:33 PM SETI@home Message from server: (Project has no jobs available)
6/4/2009 8:30:48 PM SETI@home Sending scheduler request: To fetch work.
6/4/2009 8:30:48 PM SETI@home Requesting new tasks
6/4/2009 8:30:53 PM SETI@home Scheduler request completed: got 0 new tasks
6/4/2009 8:30:53 PM SETI@home Message from server: (Project has no jobs available)

ID: 903849 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13904
Credit: 208,696,464
RAC: 304
Australia
Message 903857 - Posted: 5 Jun 2009, 4:01:41 UTC - in response to Message 903849.  


Network traffic has slumped again, and getting plenty of
5/06/2009 13:07:42 SETI@home Message from server: (Project has no jobs available)
messages and no work.

Grant
Darwin NT
ID: 903857 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13904
Credit: 208,696,464
RAC: 304
Australia
Message 903860 - Posted: 5 Jun 2009, 4:28:52 UTC - in response to Message 903857.  
Last modified: 5 Jun 2009, 4:33:43 UTC


Network traffic has slumped again, and getting plenty of
5/06/2009 13:07:42 SETI@home Message from server: (Project has no jobs available)
messages and no work.


Finally got some work after all the "no jobs available messages"
Now getting
5/06/2009 13:54:28 SETI@home Scheduler request failed: Couldn't connect to server


EDIT- now getting work again & no Scheduler connection failure messages. Just checked the network taffic again. Downloads were still dead but inbound traffic had surged (hence the problems contacting the scheduler).
Outbound traffic has now picked up again, inbound reducing. Looks like things have settled down again.
Grant
Darwin NT
ID: 903860 · Report as offensive
Chelski
Avatar

Send message
Joined: 3 Jan 00
Posts: 121
Credit: 8,979,050
RAC: 0
Malaysia
Message 903924 - Posted: 5 Jun 2009, 11:26:08 UTC

It is very clear the last couple of weeks, when AP work becomes available, the bandwidth becomes almost maxed out. And when no more AP work left, it drops almost immediately.

The end effect is that the AP clients are getting starved. When the work are availabe, most of the time my crunchers can't even upload finished work, or didn't get through to ask for work. And when connection is restored, the AP WUs are gone.

Seems like there are too many "AP only" crunchers who are not accepting MBs? If the project thinks that we need to temporarily convert all clients to MB to clear the backlog, please tell us so.
ID: 903924 · Report as offensive
MonorailPilot

Send message
Joined: 12 May 99
Posts: 8
Credit: 4,422,543
RAC: 0
United States
Message 903952 - Posted: 5 Jun 2009, 13:50:13 UTC - in response to Message 901411.  


I will agree with AP being an opt-in sort of deal, but after looking at one of the causes of all of my pending credits/tasks.. AP should not be allowed on hosts that have less than a 250 RAC. That requirement makes sure they're going to actually try to do some crunching. 90% of the wingmen I get have less than 5 tasks, are brand new rigs, or have RACs of less than 10..because of one/both of the previous two reasons.


Which will eliminate at least 1 of my rigs crunching AP permanently.

Now, if you want to tell me that someone needs to have returned at least 10 MB's or something, I don't have an issue with that. But there are plenty of people out there with rigs in the 100-200 rac that are MORE then capable of crunching AP.
ID: 903952 · Report as offensive
Profile perryjay
Volunteer tester
Avatar

Send message
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 903957 - Posted: 5 Jun 2009, 14:09:16 UTC - in response to Message 903952.  

Now, if you want to tell me that someone needs to have returned at least 10 MB's or something, I don't have an issue with that. But there are plenty of people out there with rigs in the 100-200 rac that are MORE then capable of crunching AP.



Even with a 10 Multibeam limit there is going to be a problem with new people getting hit with APs for the first time. With the estimated time to completion inflated so badly at first the new cruncher takes one look at that and figures he will never be able to complete them so he just deletes BOINC and is never heard from again leaving whatever work was on their machine to time out.

It seems to me about the best way to handle this is to make Astropulse work strictly opt-in so that you actually have to come into the forums and find out what it is all about before you get any. This way, any of the set and forget crowd will be happily crunching away on MBs and wont even know what they are missing. Those that actually look at their results will eventually start to wonder why some of their wingmen are getting so much more credit and come into the forums to find out.


PROUD MEMBER OF Team Starfire World BOINC
ID: 903957 · Report as offensive
Josef W. Segur
Volunteer developer
Volunteer tester

Send message
Joined: 30 Oct 99
Posts: 4504
Credit: 1,414,761
RAC: 0
United States
Message 904016 - Posted: 5 Jun 2009, 17:21:34 UTC - in response to Message 903952.  

...there are plenty of people out there with rigs in the 100-200 rac that are MORE then capable of crunching AP.

And there are hosts out there with S@H RAC of 3 or less which are more than capable of crunching AP, low RAC may simply mean actively working for many projects. If such a host gets an AP WU it finishes within deadline, but will build LTD and not ask for work here for quite some time.
                                                                 Joe
ID: 904016 · Report as offensive
Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 . . . 10 · Next

Message boards : Number crunching : Panic Mode On (16) Server problems


 
©2025 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.