problem with PC shutdown

Message boards : Number crunching : problem with PC shutdown
Message board moderation

To post messages, you must log in.

AuthorMessage
MikeN

Send message
Joined: 24 Jan 11
Posts: 319
Credit: 64,719,409
RAC: 85
United Kingdom
Message 1362508 - Posted: 29 Apr 2013, 13:54:59 UTC

One of my PCs (5774790 running windows 7) has developed an annoying habit. Everytime I logout, restart or shut down the PC with Boinc running, all current tasks and about 6 new ones crash with computational errors. Of my 4 PCs this is the only one behaving this way. If I suspend BOINC first then no tasks crash, but I do not always remember to do this, not am I always here to do it. Can anyone suggest a solution?
ID: 1362508 · Report as offensive
Sirius B Project Donor
Volunteer tester
Avatar

Send message
Joined: 26 Dec 00
Posts: 24877
Credit: 3,081,182
RAC: 7
Ireland
Message 1362509 - Posted: 29 Apr 2013, 13:56:19 UTC - in response to Message 1362508.  

Can't answer for your memory, but how about using remote access if possible that is.
ID: 1362509 · Report as offensive
MikeN

Send message
Joined: 24 Jan 11
Posts: 319
Credit: 64,719,409
RAC: 85
United Kingdom
Message 1362511 - Posted: 29 Apr 2013, 14:02:40 UTC - in response to Message 1362509.  

Can't answer for your memory, but how about using remote access if possible that is.


Yes I already use logmein to access it remotely, but I really want to know if there is something I can do to stop it trashing WUs on shutdown as none of my other PCs have this problem, so it seems there is something wrong.
ID: 1362511 · Report as offensive
Darth Beaver Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Avatar

Send message
Joined: 20 Aug 99
Posts: 6728
Credit: 21,443,075
RAC: 3
Australia
Message 1362517 - Posted: 29 Apr 2013, 14:18:19 UTC

witch puter the 1 with the gtx 460 ?

notice driver Nvdia is 310 I would update to at least 314 from nvida
ID: 1362517 · Report as offensive
MikeN

Send message
Joined: 24 Jan 11
Posts: 319
Credit: 64,719,409
RAC: 85
United Kingdom
Message 1362519 - Posted: 29 Apr 2013, 14:21:24 UTC - in response to Message 1362517.  

witch puter the 1 with the gtx 460 ?

notice driver Nvdia is 310 I would update to at least 314 from nvida


No the GenuineIntel Intel(R) Core(TM)2 Duo CPU E7500 @ 2.93GHz [Family 6 Model 23 Stepping 10] (2 processors) which is a CPU only machine.

I have had problems before with GPU driver updates, including the most recent microsoft Nvidea update, so I now take the view if it ain't bust don't fix it.

ID: 1362519 · Report as offensive
Darth Beaver Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Avatar

Send message
Joined: 20 Aug 99
Posts: 6728
Credit: 21,443,075
RAC: 3
Australia
Message 1362523 - Posted: 29 Apr 2013, 14:29:50 UTC - in response to Message 1362519.  

The one from Microsoft NO never let Microsoft do it it all ways causes trouble

the 314 from Nvida is stable I use it on a gigabyte 9600 gpu not probs and my GTX220 and GTX650

personally I supend before I shut down even remote wont help if you don't supend

somebody else mite have had your prob
ID: 1362523 · Report as offensive
Profile William
Volunteer tester
Avatar

Send message
Joined: 14 Feb 13
Posts: 2037
Credit: 17,689,662
RAC: 0
Message 1362539 - Posted: 29 Apr 2013, 15:38:37 UTC

First, upgrade BOINC to 7.0.64.
As it's not a GPU machine, the caution on emptying cache don't applies.
See if that cures it - 7.0.25 is a bad version to be on anyway.

boinc should get the shutdown signal, amd gracefully shutdown the apps - the error suggests to me that boinc isn't quite getting there and tasks are being killed by the OS before boinc can get to them.

Anything that might slow the system down? You've excluded boinc dirs from AV?
A person who won't read has no advantage over one who can't read. (Mark Twain)
ID: 1362539 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1362542 - Posted: 29 Apr 2013, 15:45:12 UTC

See task 2956740262

boinc_exit(): requesting safe worker shutdown ->
boinc_exit(): worker didn't respond to exit request within 2 seconds, exiting anyway.

Why would it take longer than 2 seconds for the application to write its final checkpoint/status and quit cleanly? Hard disk in power-down state?
ID: 1362542 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1362543 - Posted: 29 Apr 2013, 15:49:35 UTC - in response to Message 1362542.  

See task 2956740262

boinc_exit(): requesting safe worker shutdown ->
boinc_exit(): worker didn't respond to exit request within 2 seconds, exiting anyway.

Why would it take longer than 2 seconds for the application to write its final checkpoint/status and quit cleanly? Hard disk in power-down state?

CPU only rig....maybe check task manager CPU usage and see if there is a CPU hogging task running?
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1362543 · Report as offensive
Darth Beaver Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Avatar

Send message
Joined: 20 Aug 99
Posts: 6728
Credit: 21,443,075
RAC: 3
Australia
Message 1362546 - Posted: 29 Apr 2013, 15:52:16 UTC

Unless he has a prob with bionic maybe you never shut it down properly at some stage ?

Try restart in SAFE MODE WITH NETWORKING let it start and when it finishes shut it down don't restart shut down ?????

start up and then try it
ID: 1362546 · Report as offensive
MikeN

Send message
Joined: 24 Jan 11
Posts: 319
Credit: 64,719,409
RAC: 85
United Kingdom
Message 1362566 - Posted: 29 Apr 2013, 16:23:02 UTC - in response to Message 1362542.  

See task 2956740262

boinc_exit(): requesting safe worker shutdown ->
boinc_exit(): worker didn't respond to exit request within 2 seconds, exiting anyway.

Why would it take longer than 2 seconds for the application to write its final checkpoint/status and quit cleanly? Hard disk in power-down state?


Don't think so, today I was actually using the PC at the time and needed to do a manual shut down / restart to set up some new user parameters. The PC was working normally at the time.
ID: 1362566 · Report as offensive
MikeN

Send message
Joined: 24 Jan 11
Posts: 319
Credit: 64,719,409
RAC: 85
United Kingdom
Message 1362567 - Posted: 29 Apr 2013, 16:25:30 UTC - in response to Message 1362543.  

See task 2956740262

boinc_exit(): requesting safe worker shutdown ->
boinc_exit(): worker didn't respond to exit request within 2 seconds, exiting anyway.

Why would it take longer than 2 seconds for the application to write its final checkpoint/status and quit cleanly? Hard disk in power-down state?

CPU only rig....maybe check task manager CPU usage and see if there is a CPU hogging task running?


Just checked task manager and no applications running with more than 3% of the CPU, most are 0-1%. Typically the two SETI tasks are each getting 47-50% of the CPU and the next highest CPU usage is by task manager!
ID: 1362567 · Report as offensive
Profile BilBg
Volunteer tester
Avatar

Send message
Joined: 27 May 07
Posts: 3720
Credit: 9,385,827
RAC: 0
Bulgaria
Message 1362622 - Posted: 29 Apr 2013, 17:45:54 UTC - in response to Message 1362508.  

One of my PCs (5774790 running windows 7) has developed an annoying habit. Everytime I logout, restart or shut down the PC with Boinc running, all current tasks and about 6 new ones crash with computational errors.
Can anyone suggest a solution?


Try if this (WaitToKillAppTimeout, WaitToKillServiceTimeout) will help:
http://setiathome.berkeley.edu/forum_thread.php?id=68120&postid=1326359#1326359


 


- ALF - "Find out what you don't do well ..... then don't do it!" :)
 
ID: 1362622 · Report as offensive
MikeN

Send message
Joined: 24 Jan 11
Posts: 319
Credit: 64,719,409
RAC: 85
United Kingdom
Message 1364501 - Posted: 4 May 2013, 18:50:32 UTC

This weekend is a bank holiday weekend in the UK. I Checked my office PCs this morning using logmein and found both were offline. This usually means there has been a power cut at work, so I decided to drive in to reboot the PCs to avoid them being offline until Tuesday. 20 minutes later I got to my office only to find that I had left me office key at home! Drove back home, got key, drove back to work and got into office. Turned out no power cut, but the rooter that handles the wired Internet access was not working. I have no access to the rooter, so nothing I could do except drive home again.

When I got home, I realised that I had a wireless dongle attached to my home PC and that PC could manage without Internet access for 3 days. I took it and the driver disk back into work only to find that the CD drive on my main cruncher (the one with the GTX460) would not read the driver mini disk, However, it did work on my other office PC (the one which is the topic of this thread) and after two reboots I got it back online using the university wireless system. This did require two reboots and fortunately I had followed the advice from Glenn and William and last week had both cycled the PC through safe mode and updated BOINC. As a result, no WUs were trashed in this process and the problem appears to have been solved.

I eventually got my main cruncher back online by physically moving it to my research lab which is on a different wired router. Unfortunately, whilst it was uploading the 100 finished seti WUs, it decided to download 9 hours of Einstein WUs from my backup project.

Overall that took 6 hours of travelling and problem solving to get the two PCs back seti crunching for the whole bank holiday weekend.
ID: 1364501 · Report as offensive
Sirius B Project Donor
Volunteer tester
Avatar

Send message
Joined: 26 Dec 00
Posts: 24877
Credit: 3,081,182
RAC: 7
Ireland
Message 1364513 - Posted: 4 May 2013, 19:13:35 UTC - in response to Message 1364501.  

Wow, now that should be nominated for the cruncher of the year 2013 award. :)

Nice job well done btw.
ID: 1364513 · Report as offensive
MikeN

Send message
Joined: 24 Jan 11
Posts: 319
Credit: 64,719,409
RAC: 85
United Kingdom
Message 1364544 - Posted: 4 May 2013, 20:14:38 UTC

The most annoying thing is that today could have been avoided if the limits were raised. 100 shorties on a GTX460 only last about 3 hours. With a 10 day stash I could just have left the PCs crunching offline until Tuesday when the university staff would fix or replace the router. This would have saved at least half the time it took to fix the problems today.
ID: 1364544 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13722
Credit: 208,696,464
RAC: 304
Australia
Message 1364548 - Posted: 4 May 2013, 20:20:23 UTC - in response to Message 1364544.  

The most annoying thing is that today could have been avoided if the limits were raised. 100 shorties on a GTX460 only last about 3 hours. With a 10 day stash I could just have left the PCs crunching offline until Tuesday when the university staff would fix or replace the router. This would have saved at least half the time it took to fix the problems today.

Given that the system doesn't fall over as often as it used to, i'd settle for a 1 day cache instead of my previous 4. It would be nice to get through the weekly outage without running out of work.
Grant
Darwin NT
ID: 1364548 · Report as offensive
MikeN

Send message
Joined: 24 Jan 11
Posts: 319
Credit: 64,719,409
RAC: 85
United Kingdom
Message 1364552 - Posted: 4 May 2013, 20:22:49 UTC - in response to Message 1364548.  

The most annoying thing is that today could have been avoided if the limits were raised. 100 shorties on a GTX460 only last about 3 hours. With a 10 day stash I could just have left the PCs crunching offline until Tuesday when the university staff would fix or replace the router. This would have saved at least half the time it took to fix the problems today.

Given that the system doesn't fall over as often as it used to, i'd settle for a 1 day cache instead of my previous 4. It would be nice to get through the weekly outage without running out of work.

I would still want at least 4 days to get through problems which occur during long weekends. Between the UK and USA there are a lot of those.
ID: 1364552 · Report as offensive
Darth Beaver Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Avatar

Send message
Joined: 20 Aug 99
Posts: 6728
Credit: 21,443,075
RAC: 3
Australia
Message 1364594 - Posted: 4 May 2013, 21:29:30 UTC - in response to Message 1364552.  

Mike well done I agree with Sirus you deserve the cruncher of the year for your effert .
ID: 1364594 · Report as offensive
Profile ivan
Volunteer tester
Avatar

Send message
Joined: 5 Mar 01
Posts: 783
Credit: 348,560,338
RAC: 223
United Kingdom
Message 1364633 - Posted: 4 May 2013, 23:42:45 UTC - in response to Message 1364501.  

Overall that took 6 hours of travelling and problem solving to get the two PCs back seti crunching for the whole bank holiday weekend.

Known in some circles as a "no more sheepdogs" day...
ID: 1364633 · Report as offensive

Message boards : Number crunching : problem with PC shutdown


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.