problem with PC shutdown


log in

Advanced search

Message boards : Number crunching : problem with PC shutdown

Author Message
MikeN
Send message
Joined: 24 Jan 11
Posts: 301
Credit: 31,940,620
RAC: 39,547
United Kingdom
Message 1362508 - Posted: 29 Apr 2013, 13:54:59 UTC

One of my PCs (5774790 running windows 7) has developed an annoying habit. Everytime I logout, restart or shut down the PC with Boinc running, all current tasks and about 6 new ones crash with computational errors. Of my 4 PCs this is the only one behaving this way. If I suspend BOINC first then no tasks crash, but I do not always remember to do this, not am I always here to do it. Can anyone suggest a solution?
____________

Sirius B
Volunteer tester
Avatar
Send message
Joined: 26 Dec 00
Posts: 11276
Credit: 1,686,661
RAC: 3,687
Israel
Message 1362509 - Posted: 29 Apr 2013, 13:56:19 UTC - in response to Message 1362508.

Can't answer for your memory, but how about using remote access if possible that is.
____________

MikeN
Send message
Joined: 24 Jan 11
Posts: 301
Credit: 31,940,620
RAC: 39,547
United Kingdom
Message 1362511 - Posted: 29 Apr 2013, 14:02:40 UTC - in response to Message 1362509.

Can't answer for your memory, but how about using remote access if possible that is.


Yes I already use logmein to access it remotely, but I really want to know if there is something I can do to stop it trashing WUs on shutdown as none of my other PCs have this problem, so it seems there is something wrong.
____________

Glenn savill
Send message
Joined: 20 Aug 99
Posts: 2364
Credit: 3,334,275
RAC: 33,464
Australia
Message 1362517 - Posted: 29 Apr 2013, 14:18:19 UTC

witch puter the 1 with the gtx 460 ?

notice driver Nvdia is 310 I would update to at least 314 from nvida
____________

MikeN
Send message
Joined: 24 Jan 11
Posts: 301
Credit: 31,940,620
RAC: 39,547
United Kingdom
Message 1362519 - Posted: 29 Apr 2013, 14:21:24 UTC - in response to Message 1362517.

witch puter the 1 with the gtx 460 ?

notice driver Nvdia is 310 I would update to at least 314 from nvida


No the GenuineIntel Intel(R) Core(TM)2 Duo CPU E7500 @ 2.93GHz [Family 6 Model 23 Stepping 10] (2 processors) which is a CPU only machine.

I have had problems before with GPU driver updates, including the most recent microsoft Nvidea update, so I now take the view if it ain't bust don't fix it.

____________

Glenn savill
Send message
Joined: 20 Aug 99
Posts: 2364
Credit: 3,334,275
RAC: 33,464
Australia
Message 1362523 - Posted: 29 Apr 2013, 14:29:50 UTC - in response to Message 1362519.

The one from Microsoft NO never let Microsoft do it it all ways causes trouble

the 314 from Nvida is stable I use it on a gigabyte 9600 gpu not probs and my GTX220 and GTX650

personally I supend before I shut down even remote wont help if you don't supend

somebody else mite have had your prob
____________

Profile WilliamProject donor
Volunteer tester
Avatar
Send message
Joined: 14 Feb 13
Posts: 1602
Credit: 9,469,424
RAC: 265
Message 1362539 - Posted: 29 Apr 2013, 15:38:37 UTC

First, upgrade BOINC to 7.0.64.
As it's not a GPU machine, the caution on emptying cache don't applies.
See if that cures it - 7.0.25 is a bad version to be on anyway.

boinc should get the shutdown signal, amd gracefully shutdown the apps - the error suggests to me that boinc isn't quite getting there and tasks are being killed by the OS before boinc can get to them.

Anything that might slow the system down? You've excluded boinc dirs from AV?
____________
A person who won't read has no advantage over one who can't read. (Mark Twain)

Richard HaselgroveProject donor
Volunteer tester
Send message
Joined: 4 Jul 99
Posts: 8551
Credit: 50,429,269
RAC: 51,194
United Kingdom
Message 1362542 - Posted: 29 Apr 2013, 15:45:12 UTC

See task 2956740262

boinc_exit(): requesting safe worker shutdown ->
boinc_exit(): worker didn't respond to exit request within 2 seconds, exiting anyway.

Why would it take longer than 2 seconds for the application to write its final checkpoint/status and quit cleanly? Hard disk in power-down state?

Glenn savill
Send message
Joined: 20 Aug 99
Posts: 2364
Credit: 3,334,275
RAC: 33,464
Australia
Message 1362546 - Posted: 29 Apr 2013, 15:52:16 UTC

Unless he has a prob with bionic maybe you never shut it down properly at some stage ?

Try restart in SAFE MODE WITH NETWORKING let it start and when it finishes shut it down don't restart shut down ?????

start up and then try it
____________

MikeN
Send message
Joined: 24 Jan 11
Posts: 301
Credit: 31,940,620
RAC: 39,547
United Kingdom
Message 1362566 - Posted: 29 Apr 2013, 16:23:02 UTC - in response to Message 1362542.

See task 2956740262

boinc_exit(): requesting safe worker shutdown ->
boinc_exit(): worker didn't respond to exit request within 2 seconds, exiting anyway.

Why would it take longer than 2 seconds for the application to write its final checkpoint/status and quit cleanly? Hard disk in power-down state?


Don't think so, today I was actually using the PC at the time and needed to do a manual shut down / restart to set up some new user parameters. The PC was working normally at the time.
____________

MikeN
Send message
Joined: 24 Jan 11
Posts: 301
Credit: 31,940,620
RAC: 39,547
United Kingdom
Message 1362567 - Posted: 29 Apr 2013, 16:25:30 UTC - in response to Message 1362543.

See task 2956740262

boinc_exit(): requesting safe worker shutdown ->
boinc_exit(): worker didn't respond to exit request within 2 seconds, exiting anyway.

Why would it take longer than 2 seconds for the application to write its final checkpoint/status and quit cleanly? Hard disk in power-down state?

CPU only rig....maybe check task manager CPU usage and see if there is a CPU hogging task running?


Just checked task manager and no applications running with more than 3% of the CPU, most are 0-1%. Typically the two SETI tasks are each getting 47-50% of the CPU and the next highest CPU usage is by task manager!
____________

Profile BilBg
Volunteer tester
Avatar
Send message
Joined: 27 May 07
Posts: 2713
Credit: 6,148,437
RAC: 5,720
Bulgaria
Message 1362622 - Posted: 29 Apr 2013, 17:45:54 UTC - in response to Message 1362508.

One of my PCs (5774790 running windows 7) has developed an annoying habit. Everytime I logout, restart or shut down the PC with Boinc running, all current tasks and about 6 new ones crash with computational errors.
Can anyone suggest a solution?


Try if this (WaitToKillAppTimeout, WaitToKillServiceTimeout) will help:
http://setiathome.berkeley.edu/forum_thread.php?id=68120&postid=1326359#1326359


____________



- ALF - "Find out what you don't do well ..... then don't do it!" :)

MikeN
Send message
Joined: 24 Jan 11
Posts: 301
Credit: 31,940,620
RAC: 39,547
United Kingdom
Message 1364501 - Posted: 4 May 2013, 18:50:32 UTC

This weekend is a bank holiday weekend in the UK. I Checked my office PCs this morning using logmein and found both were offline. This usually means there has been a power cut at work, so I decided to drive in to reboot the PCs to avoid them being offline until Tuesday. 20 minutes later I got to my office only to find that I had left me office key at home! Drove back home, got key, drove back to work and got into office. Turned out no power cut, but the rooter that handles the wired Internet access was not working. I have no access to the rooter, so nothing I could do except drive home again.

When I got home, I realised that I had a wireless dongle attached to my home PC and that PC could manage without Internet access for 3 days. I took it and the driver disk back into work only to find that the CD drive on my main cruncher (the one with the GTX460) would not read the driver mini disk, However, it did work on my other office PC (the one which is the topic of this thread) and after two reboots I got it back online using the university wireless system. This did require two reboots and fortunately I had followed the advice from Glenn and William and last week had both cycled the PC through safe mode and updated BOINC. As a result, no WUs were trashed in this process and the problem appears to have been solved.

I eventually got my main cruncher back online by physically moving it to my research lab which is on a different wired router. Unfortunately, whilst it was uploading the 100 finished seti WUs, it decided to download 9 hours of Einstein WUs from my backup project.

Overall that took 6 hours of travelling and problem solving to get the two PCs back seti crunching for the whole bank holiday weekend.
____________

Sirius B
Volunteer tester
Avatar
Send message
Joined: 26 Dec 00
Posts: 11276
Credit: 1,686,661
RAC: 3,687
Israel
Message 1364513 - Posted: 4 May 2013, 19:13:35 UTC - in response to Message 1364501.

Wow, now that should be nominated for the cruncher of the year 2013 award. :)

Nice job well done btw.
____________

MikeN
Send message
Joined: 24 Jan 11
Posts: 301
Credit: 31,940,620
RAC: 39,547
United Kingdom
Message 1364544 - Posted: 4 May 2013, 20:14:38 UTC

The most annoying thing is that today could have been avoided if the limits were raised. 100 shorties on a GTX460 only last about 3 hours. With a 10 day stash I could just have left the PCs crunching offline until Tuesday when the university staff would fix or replace the router. This would have saved at least half the time it took to fix the problems today.
____________

Grant (SSSF)
Send message
Joined: 19 Aug 99
Posts: 5831
Credit: 59,465,240
RAC: 47,876
Australia
Message 1364548 - Posted: 4 May 2013, 20:20:23 UTC - in response to Message 1364544.

The most annoying thing is that today could have been avoided if the limits were raised. 100 shorties on a GTX460 only last about 3 hours. With a 10 day stash I could just have left the PCs crunching offline until Tuesday when the university staff would fix or replace the router. This would have saved at least half the time it took to fix the problems today.

Given that the system doesn't fall over as often as it used to, i'd settle for a 1 day cache instead of my previous 4. It would be nice to get through the weekly outage without running out of work.
____________
Grant
Darwin NT.

MikeN
Send message
Joined: 24 Jan 11
Posts: 301
Credit: 31,940,620
RAC: 39,547
United Kingdom
Message 1364552 - Posted: 4 May 2013, 20:22:49 UTC - in response to Message 1364548.

The most annoying thing is that today could have been avoided if the limits were raised. 100 shorties on a GTX460 only last about 3 hours. With a 10 day stash I could just have left the PCs crunching offline until Tuesday when the university staff would fix or replace the router. This would have saved at least half the time it took to fix the problems today.

Given that the system doesn't fall over as often as it used to, i'd settle for a 1 day cache instead of my previous 4. It would be nice to get through the weekly outage without running out of work.

I would still want at least 4 days to get through problems which occur during long weekends. Between the UK and USA there are a lot of those.
____________

Glenn savill
Send message
Joined: 20 Aug 99
Posts: 2364
Credit: 3,334,275
RAC: 33,464
Australia
Message 1364594 - Posted: 4 May 2013, 21:29:30 UTC - in response to Message 1364552.

Mike well done I agree with Sirus you deserve the cruncher of the year for your effert .
____________

Profile ivan
Volunteer tester
Avatar
Send message
Joined: 5 Mar 01
Posts: 612
Credit: 140,133,344
RAC: 151,940
United Kingdom
Message 1364633 - Posted: 4 May 2013, 23:42:45 UTC - in response to Message 1364501.

Overall that took 6 hours of travelling and problem solving to get the two PCs back seti crunching for the whole bank holiday weekend.

Known in some circles as a "no more sheepdogs" day...
____________

Message boards : Number crunching : problem with PC shutdown

Copyright © 2014 University of California