Process_____Not Found, Help Please!


log in

Advanced search

Message boards : Number crunching : Process_____Not Found, Help Please!

1 · 2 · Next
Author Message
Profile David
Send message
Joined: 20 May 99
Posts: 10
Credit: 2,556,200
RAC: 115
United States
Message 546222 - Posted: 14 Apr 2007, 21:09:51 UTC

Over the past few weeks I cannot complete a Task. Progress actually goes backwards over a period of several days. In BOINC Manager under Messages, I get the following:

Thu Apr 12 22:09:21 2007||Starting BOINC client version 5.8.15 for powerpc-apple-darwin
Thu Apr 12 22:09:21 2007||log flags: task, file_xfer, sched_ops
Thu Apr 12 22:09:21 2007||Libraries: libcurl/7.15.5 OpenSSL/0.9.7l zlib/1.2.3
Thu Apr 12 22:09:21 2007||Data directory: /Library/Application Support/BOINC Data
Thu Apr 12 22:09:21 2007||Processor: 1 Power Macintosh Power Macintosh [Power Macintosh Model PowerMac3,5] [AltiVec]
Thu Apr 12 22:09:21 2007||Memory: 1.25 GB physical, 283.53 GB virtual
Thu Apr 12 22:09:21 2007||Disk: 372.49 GB total, 283.29 GB free
Thu Apr 12 22:09:21 2007|SETI@home|URL: http://setiathome.berkeley.edu/; Computer ID: 2507670; location: home; project prefs: default
Thu Apr 12 22:09:21 2007||General prefs: from SETI@home (last modified 2007-04-06 04:37:53)
Thu Apr 12 22:09:21 2007||Host location: home
Thu Apr 12 22:09:21 2007||General prefs: no separate prefs for home; using your defaults
Thu Apr 12 22:09:21 2007||Reading preferences override file
Thu Apr 12 22:13:46 2007|SETI@home|Restarting task 05se04aa.21397.2945.1034646.3.32_2 using setiathome_enhanced version 513
Thu Apr 12 22:14:01 2007||Suspending computation - user is active
Thu Apr 12 23:10:42 2007||Resuming computation
Fri Apr 13 00:26:12 2007||Restarting 05se04aa.21397.2945.1034646.3.32_2 - message timeout
Fri Apr 13 00:26:13 2007||[error] Process 414 not found
Fri Apr 13 00:54:12 2007||Restarting 05se04aa.21397.2945.1034646.3.32_2 - message timeout
Fri Apr 13 00:54:13 2007||[error] Process 1232 not found
Fri Apr 13 01:38:17 2007||Restarting 05se04aa.21397.2945.1034646.3.32_2 - message timeout
Fri Apr 13 01:38:18 2007||[error] Process 1401 not found
Fri Apr 13 01:56:33 2007||Restarting 05se04aa.21397.2945.1034646.3.32_2 - message timeout
Fri Apr 13 01:56:33 2007|SETI@home|Restarting task 05se04aa.21397.2945.1034646.3.32_2 using setiathome_enhanced version 513
Fri Apr 13 01:56:34 2007||[error] Process 1669 not found
Fri Apr 13 03:29:07 2007||Restarting 05se04aa.21397.2945.1034646.3.32_2 - message timeout
Fri Apr 13 03:29:08 2007||[error] Process 1780 not found
Fri Apr 13 03:35:22 2007||Suspending computation - user is active
Fri Apr 13 04:26:38 2007||Resuming computation
Fri Apr 13 04:27:16 2007||Suspending computation - user is active
Fri Apr 13 04:28:51 2007||Resuming computation
Fri Apr 13 06:16:12 2007||Restarting 05se04aa.21397.2945.1034646.3.32_2 - message timeout
Fri Apr 13 06:16:13 2007||[error] Process 2340 not found
Fri Apr 13 07:20:15 2007||Restarting 05se04aa.21397.2945.1034646.3.32_2 - message timeout
Fri Apr 13 07:20:16 2007||[error] Process 3426 not found
Fri Apr 13 09:27:20 2007||Restarting 05se04aa.21397.2945.1034646.3.32_2 - message timeout
Fri Apr 13 09:27:21 2007||[error] Process 3813 not found
Fri Apr 13 10:05:47 2007||Restarting 05se04aa.21397.2945.1034646.3.32_2 - message timeout
Fri Apr 13 10:05:48 2007||[error] Process 4664 not found
Fri Apr 13 13:34:31 2007||Restarting 05se04aa.21397.2945.1034646.3.32_2 - message timeout
Fri Apr 13 13:34:32 2007||[error] Process 4898 not found
Fri Apr 13 13:53:48 2007||Restarting 05se04aa.21397.2945.1034646.3.32_2 - message timeout
Fri Apr 13 13:53:49 2007||[error] Process 6201 not found
Fri Apr 13 14:29:11 2007||Suspending computation - user is active
Fri Apr 13 14:34:47 2007||Resuming computation
Fri Apr 13 14:34:51 2007||Suspending computation - user is active
Fri Apr 13 14:49:31 2007||Resuming computation
Fri Apr 13 15:04:13 2007||Suspending computation - user is active
Fri Apr 13 15:32:37 2007||Resuming computation
Fri Apr 13 17:18:24 2007||Restarting 05se04aa.21397.2945.1034646.3.32_2 - message timeout
Fri Apr 13 17:18:25 2007||[error] Process 6319 not found
Fri Apr 13 17:46:04 2007||Restarting 05se04aa.21397.2945.1034646.3.32_2 - message timeout
Fri Apr 13 17:46:04 2007|SETI@home|Restarting task 05se04aa.21397.2945.1034646.3.32_2 using setiathome_enhanced version 513
Fri Apr 13 17:46:05 2007||[error] Process 7571 not found
Fri Apr 13 18:01:11 2007||Restarting 05se04aa.21397.2945.1034646.3.32_2 - message timeout
Fri Apr 13 18:01:12 2007||[error] Process 7739 not found
Fri Apr 13 19:22:00 2007||Suspending computation - user is active
Fri Apr 13 19:24:24 2007||Resuming computation
Fri Apr 13 21:07:07 2007||Suspending computation - user is active
Fri Apr 13 21:15:12 2007||Resuming computation
Fri Apr 13 21:34:00 2007||Suspending computation - user is active
Fri Apr 13 21:57:14 2007||Resuming computation
Fri Apr 13 21:57:17 2007||Suspending computation - user is active
Fri Apr 13 22:00:16 2007||Restarting 05se04aa.21397.2945.1034646.3.32_2 - message timeout
Fri Apr 13 22:00:17 2007|SETI@home|Task 05se04aa.21397.2945.1034646.3.32_2 exited with zero status but no 'finished' file
Fri Apr 13 22:00:17 2007|SETI@home|If this happens repeatedly you may need to reset the project.
Fri Apr 13 22:08:02 2007||Resuming computation
Fri Apr 13 22:14:32 2007||Restarting 05se04aa.21397.2945.1034646.3.32_2 - message timeout
Fri Apr 13 22:14:32 2007|SETI@home|Restarting task 05se04aa.21397.2945.1034646.3.32_2 using setiathome_enhanced version 513
Fri Apr 13 22:14:34 2007||[error] Process 9334 not found
Fri Apr 13 22:20:04 2007||Restarting 05se04aa.21397.2945.1034646.3.32_2 - message timeout
Fri Apr 13 22:20:04 2007|SETI@home|Restarting task 05se04aa.21397.2945.1034646.3.32_2 using setiathome_enhanced version 513
Fri Apr 13 22:20:06 2007||[error] Process 9374 not found
Fri Apr 13 22:22:25 2007||Suspending computation - user is active
Fri Apr 13 22:24:48 2007||Resuming computation
Fri Apr 13 22:25:20 2007||Suspending computation - user is active
Fri Apr 13 22:28:53 2007||Resuming computation
Fri Apr 13 22:29:07 2007||Suspending computation - user is active
Fri Apr 13 22:34:32 2007||Resuming computation
Fri Apr 13 22:58:15 2007||Suspending computation - user is active
Fri Apr 13 23:04:32 2007||Resuming computation
Sat Apr 14 00:26:54 2007||Suspending computation - user is active
Sat Apr 14 00:29:28 2007||Resuming computation
Sat Apr 14 00:40:55 2007||Suspending computation - user is active
Sat Apr 14 00:41:55 2007||Resuming computation
Sat Apr 14 00:59:02 2007||Suspending computation - user is active
Sat Apr 14 01:04:15 2007||Resuming computation
Sat Apr 14 02:29:07 2007||Suspending computation - user is active
Sat Apr 14 02:32:03 2007||Resuming computation
Sat Apr 14 05:05:28 2007||Suspending computation - user is active
Sat Apr 14 05:51:21 2007||Resuming computation
Sat Apr 14 05:51:45 2007||Suspending computation - user is active
Sat Apr 14 05:54:31 2007||Resuming computation
Sat Apr 14 09:44:42 2007||Restarting 05se04aa.21397.2945.1034646.3.32_2 - message timeout
Sat Apr 14 09:44:42 2007|SETI@home|Restarting task 05se04aa.21397.2945.1034646.3.32_2 using setiathome_enhanced version 513
Sat Apr 14 09:44:43 2007||[error] Process 9408 not found
Sat Apr 14 11:19:00 2007||Restarting 05se04aa.21397.2945.1034646.3.32_2 - message timeout
Sat Apr 14 11:19:00 2007|SETI@home|Restarting task 05se04aa.21397.2945.1034646.3.32_2 using setiathome_enhanced version 513
Sat Apr 14 11:19:01 2007||[error] Process 13639 not found
Sat Apr 14 12:49:43 2007||Restarting 05se04aa.21397.2945.1034646.3.32_2 - message timeout
Sat Apr 14 12:49:43 2007|SETI@home|Restarting task 05se04aa.21397.2945.1034646.3.32_2 using setiathome_enhanced version 513
Sat Apr 14 12:49:44 2007||[error] Process 14214 not found

____________

Alinator
Volunteer tester
Send message
Joined: 19 Apr 05
Posts: 4178
Credit: 4,647,982
RAC: 0
United States
Message 546225 - Posted: 14 Apr 2007, 21:16:33 UTC

I'm not a Mac'er so don't hold me to this, but I seem to recall this was the way the Mac responded to some of the issues the stock app was having on Win boxes.

A quick and dirty test would be to give Alex Kan's optimized app a try and see if that helps. Generally the optimized app of all flavors have had fewer problems like this lately.

HTH,

Alinator

Profile Keith
Send message
Joined: 19 May 99
Posts: 483
Credit: 938,268
RAC: 0
United Kingdom
Message 546254 - Posted: 14 Apr 2007, 22:20:40 UTC

David
Due to the repeated message "Suspending computation - user is active", is it possible you have set the "Do work while computer is in use?" to "no" under "your preferences/general"?
Keith

Alinator
Volunteer tester
Send message
Joined: 19 Apr 05
Posts: 4178
Credit: 4,647,982
RAC: 0
United States
Message 546330 - Posted: 15 Apr 2007, 0:24:51 UTC

That was it, thanks Keith!

The issue was when the app exits because of a loss of heartbeat, when BOINC tried to restart it, it must be trying to use the old PID. So the Mac comes back and says; "HUH... What are you talking about!"

Alinator

Profile Jakob Creutzfeld
Volunteer tester
Avatar
Send message
Joined: 13 Oct 00
Posts: 611
Credit: 1,558,324
RAC: 283
Germany
Message 546362 - Posted: 15 Apr 2007, 1:31:15 UTC

David,

you may want to try to set "Leave applications in memory while suspended?" to "Yes" in your general preferences (if it isn't already set). Save your changes and don't forget the press Update button in your BOINC manager (advanced view).

Hope this helps
Andy

Profile David
Send message
Joined: 20 May 99
Posts: 10
Credit: 2,556,200
RAC: 115
United States
Message 546412 - Posted: 15 Apr 2007, 2:58:16 UTC - in response to Message 546254.

David
Due to the repeated message "Suspending computation - user is active", is it possible you have set the "Do work while computer is in use?" to "no" under "your preferences/general"?
Keith

That's correct. I don't want my processing power going to BOINC when I'm doing things on my computer.
____________

Profile Philadelphia
Volunteer tester
Avatar
Send message
Joined: 12 Feb 07
Posts: 1590
Credit: 399,688
RAC: 0
United States
Message 546413 - Posted: 15 Apr 2007, 3:00:48 UTC - in response to Message 546330.

That was it, thanks Keith!

The issue was when the app exits because of a loss of heartbeat, when BOINC tried to restart it, it must be trying to use the old PID. So the Mac comes back and says; "HUH... What are you talking about!"

Alinator


Another case solved for "CSI - SETI" :)


____________

Profile David
Send message
Joined: 20 May 99
Posts: 10
Credit: 2,556,200
RAC: 115
United States
Message 546417 - Posted: 15 Apr 2007, 3:04:35 UTC - in response to Message 546330.

That was it, thanks Keith!

The issue was when the app exits because of a loss of heartbeat, when BOINC tried to restart it, it must be trying to use the old PID. So the Mac comes back and says; "HUH... What are you talking about!"

Alinator

I don't understand why that should happen. I've been running BOINC for, what 2-3 years or whatever, and never had a problem before. Been running SETI At Home since it began. If there's a bad configuration for preferences, wouldn't the BOINC team be aware of it and warn us? By the way, is there no way to report a problem to them? I have donated many 10s of thousands of computer hous to the project. How about a little help from the developers!

"Leave applications in memory while suspended" IS set to "Yes".

____________

Alinator
Volunteer tester
Send message
Joined: 19 Apr 05
Posts: 4178
Credit: 4,647,982
RAC: 0
United States
Message 546430 - Posted: 15 Apr 2007, 3:24:46 UTC

I agree, this didn't use to be such a wide ranging problem before.

I can only conclude that it has something to do with the changes the BOINC Dev Team is making to resolve the other problems the transition to using CURL or other third party modules in BOINC has created.

You know, the old story of fix one problem and make three new ones! ;-)

Alinator

OzzFan
Volunteer tester
Avatar
Send message
Joined: 9 Apr 02
Posts: 13625
Credit: 30,995,385
RAC: 20,641
United States
Message 546460 - Posted: 15 Apr 2007, 3:58:54 UTC - in response to Message 546417.

By the way, is there no way to report a problem to them? I have donated many 10s of thousands of computer hous to the project. How about a little help from the developers!


If they answered every request for help from the thousands of users out there, they would never get any of their own work done. There's only about five people in all trying to keep the servers running smoothly, analyze the science, do the paperwork, give speeches at the University, have a personal life, etc.

BOINC is open source, and many users in the community understand the workings of BOINC (or have even contributed to some of it's programming) and are just as good to help people along.

You shouldn't feel like you're getting second rate help just because one of the official developers aren't here responding to your questions. Some of the volunteers are very knowledgeable, and probably more so in some aspects of some things related to computers. That's why these boards were created: to allow users to help each other and to allow the community to grow on it's own, knowing they wouldn't have the manpower to do it all themselves.
____________

Profile David
Send message
Joined: 20 May 99
Posts: 10
Credit: 2,556,200
RAC: 115
United States
Message 546600 - Posted: 15 Apr 2007, 11:30:25 UTC - in response to Message 546460.

By the way, is there no way to report a problem to them? I have donated many 10s of thousands of computer hous to the project. How about a little help from the developers!


If they answered every request for help from the thousands of users out there, they would never get any of their own work done. There's only about five people in all trying to keep the servers running smoothly, analyze the science, do the paperwork, give speeches at the University, have a personal life, etc.

BOINC is open source, and many users in the community understand the workings of BOINC (or have even contributed to some of it's programming) and are just as good to help people along.

You shouldn't feel like you're getting second rate help just because one of the official developers aren't here responding to your questions. Some of the volunteers are very knowledgeable, and probably more so in some aspects of some things related to computers. That's why these boards were created: to allow users to help each other and to allow the community to grow on it's own, knowing they wouldn't have the manpower to do it all themselves.


Well I don't feel that way. I'm just frustrated that after many years of running SETI and racking up a good score that I now have to shut it down--at least until there is a solution to this problem. Yes, after looking at my progress a few minutes ago, the progress of the task above had shrunk to just over 1%. I had turned back on "Do work while computer is in use?" to "yes" under "your preferences/general". This obviously was NOT the problem. I wonder if doing a complete uninstall and then re-installing will help? Anyway I don't see an "uninstall" option on the BOINC installer. Thanks anyway!

____________

Astro
Volunteer tester
Avatar
Send message
Joined: 16 Apr 02
Posts: 8026
Credit: 600,015
RAC: 0
Message 546601 - Posted: 15 Apr 2007, 11:39:55 UTC
Last modified: 15 Apr 2007, 11:40:58 UTC

David, exit boinc with "file" then "exit". Go to the windows "control panel" pick "add/remove programs", select boinc, then uninstall. There will still be a Boinc directory located at C:\\programfiles\\ that contains all your account and wu information. Then install a different version if you choose. NO work should be lost.

If you want to get rid of old work then a "Reset", or worst case "detachment" is the ticket.

tony

Profile Philadelphia
Volunteer tester
Avatar
Send message
Joined: 12 Feb 07
Posts: 1590
Credit: 399,688
RAC: 0
United States
Message 546604 - Posted: 15 Apr 2007, 11:59:52 UTC - in response to Message 546460.
Last modified: 15 Apr 2007, 12:00:30 UTC

By the way, is there no way to report a problem to them? I have donated many 10s of thousands of computer hous to the project. How about a little help from the developers!


If they answered every request for help from the thousands of users out there, they would never get any of their own work done. There's only about five people in all trying to keep the servers running smoothly, analyze the science, do the paperwork, give speeches at the University, have a personal life, etc.



Amen.

They would need to hire a hoard of people to manage a help desk.

They're doing a hell of a job doing their work and helping us.

5 people vs how many of us? Some crazy ratio for sure.

____________

OzzFan
Volunteer tester
Avatar
Send message
Joined: 9 Apr 02
Posts: 13625
Credit: 30,995,385
RAC: 20,641
United States
Message 546629 - Posted: 15 Apr 2007, 14:07:03 UTC - in response to Message 546600.

Well I don't feel that way. I'm just frustrated that after many years of running SETI and racking up a good score that I now have to shut it down.


Well, I can definitely understand the frustration. Unfortunately, I don't use Macs either, so I can't really help that well. Perhaps someone from our Mac community can help out with this one - or maybe even the uninstall/re-install will help (but I don't think it will - this has to be a configuration problem somewhere).
____________

Alinator
Volunteer tester
Send message
Joined: 19 Apr 05
Posts: 4178
Credit: 4,647,982
RAC: 0
United States
Message 546637 - Posted: 15 Apr 2007, 14:35:42 UTC - in response to Message 546600.
Last modified: 15 Apr 2007, 14:39:22 UTC

Well I don't feel that way. I'm just frustrated that after many years of running SETI and racking up a good score that I now have to shut it down--at least until there is a solution to this problem. Yes, after looking at my progress a few minutes ago, the progress of the task above had shrunk to just over 1%. I had turned back on "Do work while computer is in use?" to "yes" under "your preferences/general". This obviously was NOT the problem. I wonder if doing a complete uninstall and then re-installing will help? Anyway I don't see an "uninstall" option on the BOINC installer. Thanks anyway!


OK, another preference related item that might be coming into play is if you have increased the time between disk writes. If you have left it at the default though, then this is probably not the problem.

FWIW, I have observed similar behaviour on Windows with the stock app when running other very intensive apps (like games and so forth), but the effect you're seeing on net progress is much more severe than anything I've seen.

The only other thing I can think of would be to try to limit the frequent restarts during your activity sessions by setting the idle time timeout higher.

I certainly can understand your frustration over this, and it is annoying especially if you've been running the project without issue previously. The only other comment at this point is you're not alone on this, it has been observed and reported on other platforms and efforts are underway to determine the root cause and correct it. My theory on it is because the comm link between BOINC and the science is being interrupted "abnormally", the checkpoint files don't get written back to disk when the app exits from the no heartbeat condition. If it happpens frequently enough and close together enough it can result in virtually zero progress being made on the task.

Alinator

OzzFan
Volunteer tester
Avatar
Send message
Joined: 9 Apr 02
Posts: 13625
Credit: 30,995,385
RAC: 20,641
United States
Message 546643 - Posted: 15 Apr 2007, 14:56:37 UTC - in response to Message 546637.

My theory on it is because the comm link between BOINC and the science is being interrupted "abnormally", the checkpoint files don't get written back to disk when the app exits from the no heartbeat condition. If it happpens frequently enough and close together enough it can result in virtually zero progress being made on the task.


This actually makes a lot of sense. Any way to test the theory? Any way to fix it if it is correct?
____________

Profile peanutProject donor
Avatar
Send message
Joined: 1 Feb 07
Posts: 369
Credit: 1,651,948
RAC: 1,411
United States
Message 546658 - Posted: 15 Apr 2007, 15:17:15 UTC
Last modified: 15 Apr 2007, 15:31:57 UTC

Let me say first that I am in no way working for Seti and don't claim to be an expert.

I am a fairly new to Seti@Home and have 2 macs and have worked a kink or two out though.

The one thing I see that is not in my computers messages after start up is the second line below.

Thu Apr 12 22:09:21 2007||General prefs: no separate prefs for home; using your defaults
Thu Apr 12 22:09:21 2007||Reading preferences override file

I wonder why he is getting an override of preferences with no custom app, I have custom apps (Alex Kans) and I don't see a preference override message.

I had problems with BOINC 5.8.15 on my G5 power Pc iMac. I frequently lost connection to localhost and had other problems as well. My solution to that whole issue was pretty low tech; I went back to BOINC 5.4.9. BOINC 5.8.15 works well on my Intel processor Mac Mini, but it gave my G5 iMac fits.

Would losing connection to localhost be related to comm link between BOINC and the science ? At the bottom right part of the BOINC manager screen it usually says "Connected to localhost", at least on my macs, that message would change when I had 5.8.15 on my power pc mac.

Josef W. SegurProject donor
Volunteer developer
Volunteer tester
Send message
Joined: 30 Oct 99
Posts: 4299
Credit: 1,068,438
RAC: 1,005
United States
Message 546733 - Posted: 15 Apr 2007, 17:50:55 UTC - in response to Message 546658.

...
Would losing connection to localhost be related to comm link between BOINC and the science ?
...

No, the localhost connection is used for BOINC Manager to talk to the BOINC core client. The science apps communicate with the core client through shared memory.

The OP is running BOINC 5.8.15 but the recommended version for Mac OS X is 5.8.17. The upgrade could help.
Joe

OzzFan
Volunteer tester
Avatar
Send message
Joined: 9 Apr 02
Posts: 13625
Credit: 30,995,385
RAC: 20,641
United States
Message 546737 - Posted: 15 Apr 2007, 17:57:37 UTC - in response to Message 546658.

I wonder why he is getting an override of preferences with no custom app, I have custom apps (Alex Kans) and I don't see a preference override message.


This one is an easy one to answer, as it is the same on Windows PCs too. The message for preferences override appears when one goes through the BSG (BOINC Simple GUI) and presses the Preferences button and sets their preferences locally. The OP is indeed using local preferences, as he stated he doesn't want BOINC to run while he's using the computer.
____________

Alinator
Volunteer tester
Send message
Joined: 19 Apr 05
Posts: 4178
Credit: 4,647,982
RAC: 0
United States
Message 546743 - Posted: 15 Apr 2007, 18:07:45 UTC
Last modified: 15 Apr 2007, 18:09:13 UTC

To cover the other part of your question, the second informational message is just telling you there was a local preferences override file found and was being read in to replace the ones BOINC would normally use. The normal ones are those you set on the project web pages and get updated every time BOINC contacts the project schedulers.

Custom apps are defined by the app_info.xml file.

HTH,

Alinator

LOL, beat me to it there Ozz.

1 · 2 · Next

Message boards : Number crunching : Process_____Not Found, Help Please!

Copyright © 2014 University of California