Panic Mode On (93) Server Problems?

Message boards : Number crunching : Panic Mode On (93) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 9 · 10 · 11 · 12 · 13 · 14 · 15 . . . 24 · Next

AuthorMessage
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1616780 - Posted: 20 Dec 2014, 23:26:57 UTC - in response to Message 1616775.  
Last modified: 20 Dec 2014, 23:45:11 UTC

That's nice. My main machine is completely out of GPU tasks even though the server has seen fit to recently send me 23 CPU APs that I can't use. It continues to send CPU tasks even though the GPUs are idle;

Sat Dec 20 18:17:48 2014 | SETI@home | [sched_op] Starting scheduler request
Sat Dec 20 18:17:48 2014 | SETI@home | Sending scheduler request: To fetch work.
Sat Dec 20 18:17:48 2014 | SETI@home | Requesting new tasks for CPU and ATI
Sat Dec 20 18:17:48 2014 | SETI@home | [sched_op] CPU work request: 417213.09 seconds; 0.00 devices
Sat Dec 20 18:17:48 2014 | SETI@home | [sched_op] ATI work request: 2604960.00 seconds; 3.00 devices
Sat Dec 20 18:17:50 2014 | SETI@home | Scheduler request completed: got 1 new tasks
Sat Dec 20 18:17:50 2014 | SETI@home | [sched_op] estimated total CPU task duration: 39576 seconds
Sat Dec 20 18:17:50 2014 | SETI@home | [sched_op] estimated total ATI task duration: 0 seconds


I suppose the only alternative is to have the server send them back as GPU tasks and watch it keep sending CPU tasks that aren't needed. Seems if it can't send CPU tasks it will send Nothing.

BTW, the oldest AP file finished and those 3 splitters immediately jumped onto 03se14ad. Now there are 4 splitters working 03se14ad with (7) completed and "Current result creation rate: 0.5886/sec" showing...
ID: 1616780 · Report as offensive
Profile betreger Project Donor
Avatar

Send message
Joined: 29 Jun 99
Posts: 11361
Credit: 29,581,041
RAC: 66
United States
Message 1616790 - Posted: 21 Dec 2014, 0:02:12 UTC - in response to Message 1616786.  

There's still something not right with the AP part of SETI.

Lets blame it on Bruno's replacement.
ID: 1616790 · Report as offensive
Cruncher-American Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor

Send message
Joined: 25 Mar 02
Posts: 1513
Credit: 370,893,186
RAC: 340
United States
Message 1616794 - Posted: 21 Dec 2014, 0:04:49 UTC

Maybe we again need that old program we all used to use - I can't remember the name - that would change WUs from CPU to GPU (and vice versa).
ID: 1616794 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13727
Credit: 208,696,464
RAC: 304
Australia
Message 1616797 - Posted: 21 Dec 2014, 0:15:03 UTC - in response to Message 1616786.  

There's still something not right with the AP part of SETI.

Take a look at the weekly MB Ready-to-send & Result creation rate graphs and you'll see that the MB splitters have been hampered in their output for the last couple of days.
It used to be the splitters would run at their maximum rate, pump out the work till the ready-to-send buffer had reached it's high water mark, then shut down. Lately the maximum rate has been less than it was & the splitters rarely shut down at all now.


Multiple splitters on the one file was occurring before the AP database issues, but previously it was only a couple of splitters on the one file.
With MB, that still occurs but now we have up to 4 splitters at a time on the AP files. I suspect that having more than one splitter trying to work on one file is resulting in Input/Output contention & hence bogging down the output of the splitters. The more splitters on the one file, the greater the contention & slowdown.
Grant
Darwin NT
ID: 1616797 · Report as offensive
Darth Beaver Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Avatar

Send message
Joined: 20 Aug 99
Posts: 6728
Credit: 21,443,075
RAC: 3
Australia
Message 1616800 - Posted: 21 Dec 2014, 0:30:46 UTC

All i know is i still have not got even 1 Freaking GPU unit since they came back online and i am now getting realy peed off with the b/s excuses .

It's a vlar storm
It's to many splitters on one tape
It's this it's that b/s b/s b/s

There is a major problem so put the dam project down fix it before coming back online

Oh it's now to close to christmas will be the next freaking excuse

1 freaking month and still no joy

And what do is see on the server page the Astropulse science database server is disabled
but the splitter are are not WTF
ID: 1616800 · Report as offensive
Profile betreger Project Donor
Avatar

Send message
Joined: 29 Jun 99
Posts: 11361
Credit: 29,581,041
RAC: 66
United States
Message 1616806 - Posted: 21 Dec 2014, 0:51:51 UTC - in response to Message 1616804.  
Last modified: 21 Dec 2014, 0:52:03 UTC

Sten you have to understand Glenn and the fact he is upside down.
ID: 1616806 · Report as offensive
Darth Beaver Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Avatar

Send message
Joined: 20 Aug 99
Posts: 6728
Credit: 21,443,075
RAC: 3
Australia
Message 1616808 - Posted: 21 Dec 2014, 1:00:13 UTC - in response to Message 1616804.  
Last modified: 21 Dec 2014, 1:01:58 UTC

stern i have not got any AP GPU not one since 31 of the 10 not one AP on the GPU i am not talking about MB twit

try looking at the right stats http://setiathome.berkeley.edu/results.php?userid=8889212&offset=40&show_names=0&state=4&appid=20
ID: 1616808 · Report as offensive
Darth Beaver Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Avatar

Send message
Joined: 20 Aug 99
Posts: 6728
Credit: 21,443,075
RAC: 3
Australia
Message 1616818 - Posted: 21 Dec 2014, 1:12:22 UTC
Last modified: 21 Dec 2014, 1:13:58 UTC

I'm not the one whom comes here only a few hours after the project is up and whinge about not getting work . Yes i have blown a gasket but it has taken a few day for me to blow it .

I've been patient but now i'm just freaking peed right off . So i have now aborted all the CPU AP's resfet setting to only get GPU AP's so until i start getting them i won't do any other and just start shutting the machine down at the end of the day and start saving money by using less power F it

Untill i start getting GPU AP's i will consider the project still stuffed
ID: 1616818 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1616823 - Posted: 21 Dec 2014, 1:25:16 UTC - in response to Message 1616808.  

stern i have not got any AP GPU not one since 31 of the 10 not one AP on the GPU i am not talking about MB twit

try looking at the right stats http://setiathome.berkeley.edu/results.php?userid=8889212&offset=40&show_names=0&state=4&appid=20

Dude, you've been sent 28 APs and then aborted them. Don't do that! Since you have so few tasks on that machine, just set your preferences for GPU APs and hit the reset button. They will be sent back as GPU APs. Once you abort them they are Gone. Just make sure to hit the Update button after changing the preferences so the server will see you want GPU APs.

As I have said before, the Server will try to fill your AP CPU Cache First, then it will worry about sending the GPU tasks. You can try to lower your cache setting so there are fewer CPU tasks to fill, but, from my experience the server knows that trick and still wants to send you 100 CPU tasks before it worries about GPU tasks.

Good luck.
;-)
ID: 1616823 · Report as offensive
Profile cliff
Avatar

Send message
Joined: 16 Dec 07
Posts: 625
Credit: 3,590,440
RAC: 0
United Kingdom
Message 1616827 - Posted: 21 Dec 2014, 1:33:17 UTC - in response to Message 1616818.  

Hi Glen,
Its that season of the year again.. When the world and everything in it goes crackers:-/

Have a shot of your favourite tipple, look at the cat sideways and hiss a few times:-) Then the Cat can be confused about the state of things like the rest of us..

Eventually things will get sorted.. Just not sure what year that will be, of if I will still be around to see it:-)

Regards,
Cliff,
Been there, Done that, Still no damm T shirt!
ID: 1616827 · Report as offensive
Darth Beaver Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Avatar

Send message
Joined: 20 Aug 99
Posts: 6728
Credit: 21,443,075
RAC: 3
Australia
Message 1616829 - Posted: 21 Dec 2014, 1:38:10 UTC - in response to Message 1616823.  

Tbar mate i have done that the setting are set to only receive GPU AP's

I have aborted those units as one last chance to see what will happen . That way there should be no limits as i have now only got 4 units going on the CPU NO gpu units so the GPU's are just sitting there doing sweet f all

Once those units finish i will reset the project and if i still do not receive any units then F it i will consider the project is still stuffed

Sorry Tbar not peed off with you i know your only trying to help . But the other twit whom whinges to much to then say something to me because after 5 days i whinge F you
ID: 1616829 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1616835 - Posted: 21 Dec 2014, 1:45:31 UTC - in response to Message 1616823.  
Last modified: 21 Dec 2014, 1:47:14 UTC

As I have said before, the Server will try to fill your AP CPU Cache First, then it will worry about sending the GPU tasks. You can try to lower your cache setting so there are fewer CPU tasks to fill, but, from my experience the server knows that trick and still wants to send you 100 CPU tasks before it worries about GPU tasks.

TBar, Here's a couple of work arounds for you to try:

Prior to upping your cache level, set 'Use CPU' to 'No', update Boinc, then up your cache level, Boinc now can't ask for CPU work, so AP can't be sent to the CPU, only GPUs,
you're got four locations/venues available, you can always keep one venue available for doing this, and just move the host to that venue until you're filled up with GPU work.

Or remove the CPU Astropulse app from your app_info, Restart Boinc, Now the scheduler will first predomitly sent VLARs to the CPU, then send any Astropulse and non-VLARs to the GPUs,
you can always put it back in once your GPU cache is full of Astropulse work.

Try Boinc 7.4.36 at least on one of your hosts (at least with the 2nd work around), it has this changeset:

http://boinc.berkeley.edu/gitweb/?p=boinc-v2.git;a=commit;h=789637f637753c4e06f7ca58ce2de285d1491cc8
client: request work from backed-off resources if doing RPC anyway

It will mean if a task completes (say CPU), the uploading of the CPU result will reset the CPU back-off, and because Boinc is asking for work it'll also ask for work from the other (backed off) resources,
so you'll get more chances of grabbing work.

Claggy
ID: 1616835 · Report as offensive
Darth Beaver Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Avatar

Send message
Joined: 20 Aug 99
Posts: 6728
Credit: 21,443,075
RAC: 3
Australia
Message 1616836 - Posted: 21 Dec 2014, 1:46:40 UTC - in response to Message 1616833.  
Last modified: 21 Dec 2014, 1:47:17 UTC

I can't stand on my head and i haven't had a smoke now for over 10 days maybe that's the problem na i'm genuinely getting peed off
ID: 1616836 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1616839 - Posted: 21 Dec 2014, 1:58:12 UTC - in response to Message 1616829.  
Last modified: 21 Dec 2014, 2:07:30 UTC

Hmmm, I'll try it again.

Hitting the reset button without any tasks will accomplish Nothing. The point of hitting the reset is to change 28 CPU tasks to 28 GPU tasks, too late for that now.

The AP server is Fixated on CPU tasks. It Wants to fill your CPU cache. It will not worry about any GPU tasks Until it fills your CPU cache. Your best bet is to let it fill the CPU cache. Right now that is a problem due to low creation rates. So, change your cache setting to about 1 or 2 days and ask for both CPU & GPU APs. It will send the CPU tasks first...Let it. Do not abort them. After it sends the asked for cpu work it will send the GPU work...when it gets around to it. If it balks on the GPU work, Then you can change your preferences to GPU only and hit the reset button. It will then resend the CPU work as GPU work.
Then repeat the above.
ID: 1616839 · Report as offensive
Darth Beaver Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Avatar

Send message
Joined: 20 Aug 99
Posts: 6728
Credit: 21,443,075
RAC: 3
Australia
Message 1616843 - Posted: 21 Dec 2014, 2:19:59 UTC - in response to Message 1616839.  

I've had the buffer set to 0.5 days now for the last 2 days and i've been letting the buffer run down but i'm getting to peed off so if no joy then i'll just stop for a few weeks maybe then things mite settle .
I had something like 55 units on the cpu when the buffer was set to max on wednesday and i'm only able to do 4 units at a time (overheats) and i just aborted the rest 28 units which would have taken another 3 and a bit days so the buffer should have been at max 2 days ago but still didn't get any thing and i was not allowing V7 MB so why didn't i start getting any .....

After the GPU's just sat there for day doing nothing i did some MB with the buffer set to 0.5 days . BUt now i'm just getting grrrrr
ID: 1616843 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1616845 - Posted: 21 Dec 2014, 2:25:12 UTC - in response to Message 1616835.  

As I have said before, the Server will try to fill your AP CPU Cache First, then it will worry about sending the GPU tasks. You can try to lower your cache setting so there are fewer CPU tasks to fill, but, from my experience the server knows that trick and still wants to send you 100 CPU tasks before it worries about GPU tasks.

TBar, Here's a couple of work arounds for you to try:

Prior to upping your cache level, set 'Use CPU' to 'No', update Boinc, then up your cache level, Boinc now can't ask for CPU work, so AP can't be sent to the CPU, only GPUs...

Claggy

I've tried that. The results are I receive very few GPU tasks, not enough to run the machine. I've found the best route is to just let the CPU cache fill, then you receive GPU tasks without any problem. Unfortunately AP tasks are hard to come by at this time, so filling the cache is a problem. In fact, it looks as though all APs have dried up at present. Hopefully we'll be able to fill the caches soon.
ID: 1616845 · Report as offensive
Darth Beaver Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Avatar

Send message
Joined: 20 Aug 99
Posts: 6728
Credit: 21,443,075
RAC: 3
Australia
Message 1616848 - Posted: 21 Dec 2014, 2:38:10 UTC - in response to Message 1616845.  

I noticed this morning the Astropulse science database server was disabled maybe it's crashed again if no body else is getting them .

GGGGGGGGGGrrrrrrrrrrrrrrrr

oh well guess it's time to just leave the project till after new years and get off my bum and do some of the things i've been wishing to try with this new system like trying to get the Mac Vm working like i had with the Intel Quaddy or hit the torrent sites and catch up on some of the shows i've missed .

Not going to let it get me down so just let it finish what i got reset project then set tings to no new tasks till after New year some time .

But thanks for trying Tbar have a good X-mas everyone if i don't see ya before new years .
ID: 1616848 · Report as offensive
Profile betreger Project Donor
Avatar

Send message
Joined: 29 Jun 99
Posts: 11361
Credit: 29,581,041
RAC: 66
United States
Message 1616851 - Posted: 21 Dec 2014, 3:06:53 UTC - in response to Message 1616833.  

stop smoking that crap you're smoking, it's not good for you.

Not true
ID: 1616851 · Report as offensive
Profile betreger Project Donor
Avatar

Send message
Joined: 29 Jun 99
Posts: 11361
Credit: 29,581,041
RAC: 66
United States
Message 1616852 - Posted: 21 Dec 2014, 3:10:03 UTC - in response to Message 1616836.  

i haven't had a smoke now for over 10 days

I'm sorry.
ID: 1616852 · Report as offensive
Profile JaundicedEye
Avatar

Send message
Joined: 14 Mar 12
Posts: 5375
Credit: 30,870,693
RAC: 1
United States
Message 1616853 - Posted: 21 Dec 2014, 3:27:48 UTC - in response to Message 1616851.  

stop smoking that crap you're smoking, it's not good for you.

Not true


++1

"Sour Grapes make a bitter Whine." <(0)>
ID: 1616853 · Report as offensive
Previous · 1 . . . 9 · 10 · 11 · 12 · 13 · 14 · 15 . . . 24 · Next

Message boards : Number crunching : Panic Mode On (93) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.