Stalled tasks.

Questions and Answers : Unix/Linux : Stalled tasks.

To post messages, you must log in.

AuthorMessage
Tangerineboy

Send message
Joined: 2 Jan 09
Posts: 33
Credit: 1,511,482
RAC: 0
United Kingdom
Message 980237 - Posted: 17 Mar 2010, 21:55:19 UTC

I am getting quite a few of these across three of my machines. The task will be sat a percentage in to its computation but will be stuck, closing and restarting boinc has no effect on this "stuck" status. I also get the occasional task that just fails to start. Anyone else come across this ?

The machines are all on FreeBSD 8.0 Release amd64

example http://www.tangerine-army.co.uk/seti.jpg
ID: 980237 · Report as offensive
Profile Ageless
Avatar

Send message
Joined: 9 Jun 99
Posts: 13822
Credit: 3,269,733
RAC: 0
Netherlands
Message 980398 - Posted: 18 Mar 2010, 5:52:58 UTC - in response to Message 980237.  

Someone else got that as well, but on the Mac.

What happens when you exit BOINC fully and restart it?
Jord

Ancient Astronaut Theorists suggest that in many ways, you can be considered an alien conspiracy!
ID: 980398 · Report as offensive
Tangerineboy

Send message
Joined: 2 Jan 09
Posts: 33
Credit: 1,511,482
RAC: 0
United Kingdom
Message 980472 - Posted: 18 Mar 2010, 12:35:07 UTC

Well I did try killing the boinc_client process and then restarting it. The stuck units were still stuck, aborting them gets the computer moving again. I will double check next time I see a stuck wu that all seti stuff is shutdown and restarted and see if that makes a difference.
ID: 980472 · Report as offensive
Profile ignorance is no excuse
Avatar

Send message
Joined: 4 Oct 00
Posts: 9529
Credit: 44,433,321
RAC: 0
Korea, North
Message 980493 - Posted: 18 Mar 2010, 13:40:31 UTC - in response to Message 980472.  

Wondering if this is the AMD hang? Do you run optimized apps? the generic app from seti can cause AMD CPU's to hang. I can't remember if this is a windows problem or if it hits all OSes
In a rich man's house there is no place to spit but his face.
Diogenes Of Sinope

End terrorism by building a school

ID: 980493 · Report as offensive
Tangerineboy

Send message
Joined: 2 Jan 09
Posts: 33
Credit: 1,511,482
RAC: 0
United Kingdom
Message 980551 - Posted: 18 Mar 2010, 16:45:51 UTC

I installed the latest version from the FreeBSD ports collection, I also run seti on Phenom II x4 955 using windows 7 Pro 64bit and I have not seen the problem arise on this system.
ID: 980551 · Report as offensive
Profile Leopoldo
Volunteer tester
Avatar

Send message
Joined: 4 Aug 99
Posts: 102
Credit: 2,953,761
RAC: 202
Russia
Message 981011 - Posted: 19 Mar 2010, 16:54:24 UTC - in response to Message 980551.  
Last modified: 19 Mar 2010, 16:55:57 UTC

IMHO, such stuck looks as a result of using binary, compiled from initially-working source which assumes any compilator defaults - with the other version of compilator (and other defaults).

For example, initial variant of 64-bit openSUSE kernel in the 11.3 milestone 3 works if compiled by GCC 4.4.3 and doesn't with 4.4.4 - this GCC versions used different default aligning of "struct"s, 8 and 32, coordingly.

Such tasks (endlessly looping after writing baseline smoothing into the stderr.txt), are common for the stock 6.03 application of the SETI@home Beta project with 64-bit versions of Ubuntu 9.10 and openSUSE 11.2

Tangerineboy, with such binary I see no other way as abort them.
ID: 981011 · Report as offensive
Tangerineboy

Send message
Joined: 2 Jan 09
Posts: 33
Credit: 1,511,482
RAC: 0
United Kingdom
Message 984175 - Posted: 27 Mar 2010, 0:42:34 UTC
Last modified: 27 Mar 2010, 1:05:51 UTC

I have been looking at optimised apps,

http://lunatics.kwsn.net/index.php?module=Downloads;sa=dlview;id=91

I will give this a whirl and see if it makes a difference
ID: 984175 · Report as offensive

Questions and Answers : Unix/Linux : Stalled tasks.


 
©2016 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.