BOINC keeps restarting result (Solaris 8)

Questions and Answers : Unix/Linux : BOINC keeps restarting result (Solaris 8)
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile ozono28 [Pablo Ibanez]

Send message
Joined: 22 Mar 00
Posts: 4
Credit: 81,529
RAC: 0
Chile
Message 187624 - Posted: 10 Nov 2005, 19:24:34 UTC

Hi:

I have reinstalled software, detached proyect, restarted the proyect but nothing seems to work.

..any ideas ??

2005-11-10 16:17:36 [SETI@home] Restarting result 14ja04aa.23090.2288.390916.116_3 using setiathome version 4.02
2005-11-10 16:17:37 [SETI@home] Result 14ja04aa.23090.2288.390916.116_3 exited with zero status but no 'finished' file
2005-11-10 16:17:37 [SETI@home] If this happens repeatedly you may need to reset the project.
2005-11-10 16:17:37 [---] request_reschedule_cpus: process exited
2005-11-10 16:17:37 [---] schedule_cpus: must schedule
2005-11-10 16:17:37 [SETI@home] Restarting result 14ja04aa.23090.2288.390916.116_3 using setiathome version 4.02
2005-11-10 16:17:38 [SETI@home] Result 14ja04aa.23090.2288.390916.116_3 exited with zero status but no 'finished' file
2005-11-10 16:17:38 [SETI@home] If this happens repeatedly you may need to reset the project.
2005-11-10 16:17:38 [---] request_reschedule_cpus: process exited
2005-11-10 16:17:38 [---] schedule_cpus: must schedule
2005-11-10 16:17:38 [SETI@home] Restarting result 14ja04aa.23090.2288.390916.116_3 using setiathome version 4.02
2005-11-10 16:17:40 [SETI@home] Result 14ja04aa.23090.2288.390916.116_3 exited with zero status but no 'finished' file
2005-11-10 16:17:40 [SETI@home] If this happens repeatedly you may need to reset the project.
2005-11-10 16:17:40 [---] request_reschedule_cpus: process exited
2005-11-10 16:17:40 [---] schedule_cpus: must schedule

ID: 187624 · Report as offensive
Dotsch
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 2422
Credit: 919,393
RAC: 0
Germany
Message 188151 - Posted: 12 Nov 2005, 8:07:51 UTC

Is this restart happens permanetly ?
How long will the app run, if this happen ?
Which boinc version do you use ?

You can try to detach and attach the project again.


ID: 188151 · Report as offensive
Profile ozono28 [Pablo Ibanez]

Send message
Joined: 22 Mar 00
Posts: 4
Credit: 81,529
RAC: 0
Chile
Message 188887 - Posted: 14 Nov 2005, 14:13:13 UTC - in response to Message 188151.  

Dotsch:

I have downloaded Boinc 4.43 (Which is the only one available for solaris).

At this point my concerns are:

1 - I have Solaris 8 but the downloaded file seems to be for solaris 7.
(As far as I understand this package is for solaris 2.x)
2 - My Boinc version is 4.43 but the downloaded file from the server is
seems to be for Boinc 4.02
3 - I have 4 CPUs installed (In the log there is a message about CPUs),
in my WEB Preferences I have also set the cpu usage to "Use One CPUs"
but the problem still persist.

I would appreciate any help on this, I have also detached the project but my workunit is still restarting over and over again.

Thanks!!


-------------
Life Is Like Water
ID: 188887 · Report as offensive
Dotsch
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 2422
Credit: 919,393
RAC: 0
Germany
Message 188917 - Posted: 14 Nov 2005, 17:13:45 UTC - in response to Message 188887.  
Last modified: 14 Nov 2005, 17:29:03 UTC


I have downloaded Boinc 4.43 (Which is the only one available for solaris).

4.43 is a good and stable version.


At this point my concerns are:

1 - I have Solaris 8 but the downloaded file seems to be for solaris 7.
(As far as I understand this package is for solaris 2.x)

It will work on all Solaris 2.x


2 - My Boinc version is 4.43 but the downloaded file from the server is
seems to be for Boinc 4.02
[quote]
You have downloaded the setiathome app version 4.02 from the project server.


[quote]
3 - I have 4 CPUs installed (In the log there is a message about CPUs),
in my WEB Preferences I have also set the cpu usage to "Use One CPUs"
but the problem still persist.

You have got to configure it first.
Look at point 6.1 of this manual : http://boinc.berkeley.edu/hpux.html

But, if you wan't to use more than one CPU on solaris, you must modify your shared memory, described like here : http://boinc.berkeley.edu/solaris.php


I would appreciate any help on this, I have also detached the project but my workunit is still restarting over and over again.

Can you please verify, if your permitions on your boinc directory and all the files and subdirectories are OK. The user under which you run boinc and seti, must have write permissions. For the setiathome app in projects/setiathome.berkeley.edu/setiathome... you need read and execute rights.

Can you please also verify, if your can start the setiathome app in the standalone mode...
Copy a workunit in projects/setiathome/ to work_unit.sah, and start the setiathome application within this directory. Let it run for 1 or 2 minutes, so we can eliminate the cause of a general problem with the download of the app.

The output of the file stderr.txt is also of interest, if there is one there.

If this all will fail, can you please post me the last 100 lines of the output from the command "truss -f ./boinc > out 2>&1 "

ID: 188917 · Report as offensive
Profile Lord Vampyr

Send message
Joined: 27 May 00
Posts: 5
Credit: 888,697
RAC: 0
United States
Message 189315 - Posted: 15 Nov 2005, 20:46:07 UTC - in response to Message 188917.  

I am having the same issue..
here is the last 100 lines of truss

9121: stat64("projects/setiathome.berkeley.edu/setiathome_4.02_sparc-sun-solaris2.7", 0xFFBEF670) = 0
9121: poll(0xFFBEDA20, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED938, 1, 100) = 1
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: read(6, " W * Q M ! ^ J 6 - F 9 \\".., 16384) = 5840
9121: time() = 1132087382
9121: write(7, " U + 3 ? N * Q 6 & ?".., 8192) = 8192
9121: time() = 1132087382
9121: poll(0xFFBED8B8, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED928, 1, 0) = 0
9121: time() = 1132087382
9121: stat64("projects/setiathome.berkeley.edu/setiathome_4.02_sparc-sun-solaris2.7", 0xFFBEF670) = 0
9121: poll(0xFFBEDA20, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED938, 1, 100) = 1
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: read(6, " - = O ) & , [ - 0 9 < Y".., 16384) = 4380
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED8B8, 1, 0) = 1
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: read(6, " . E _ ' H _ ] F U > U +".., 16384) = 1460
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED8B8, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED928, 1, 0) = 1
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: read(6, " ^ H - E Y + M 3 L H G ;".., 16384) = 2920
9121: time() = 1132087382
9121: write(7, " Q - < \\ ? F ( ^ N S #".., 8192) = 8192
9121: time() = 1132087382
9121: poll(0xFFBED928, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: stat64("projects/setiathome.berkeley.edu/setiathome_4.02_sparc-sun-solaris2.7", 0xFFBEF670) = 0
9121: poll(0xFFBEDA20, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED928, 1, 0) = 0
9121: time() = 1132087382
9121: stat64("projects/setiathome.berkeley.edu/setiathome_4.02_sparc-sun-solaris2.7", 0xFFBEF670) = 0
9121: poll(0xFFBEDA20, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED938, 1, 100) = 1
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: read(6, " 5 U # H M K ? < < 4 Y "".., 16384) = 2920
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED8B8, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED928, 1, 0) = 0
9121: time() = 1132087382
9121: stat64("projects/setiathome.berkeley.edu/setiathome_4.02_sparc-sun-solaris2.7", 0xFFBEF670) = 0
9121: poll(0xFFBEDA20, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED938, 1, 100) = 1
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: read(6, " ^ N %\\n Q K & + $ S 8 "".., 16384) = 2920
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED8B8, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED928, 1, 0) = 0
9121: time() = 1132087382
9121: stat64("projects/setiathome.berkeley.edu/setiathome_4.02_sparc-sun-solaris2.7", 0xFFBEF670) = 0
9121: poll(0xFFBEDA20, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED938, 1, 100) = 1
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: read(6, " N 3 / ] H J - \\\\n " ^ J".., 16384) = 2920
9121: time() = 1132087382
9121: write(7, " / R G S 9 M J ] W ! M C".., 8192) = 8192
9121: time() = 1132087382
9121: poll(0xFFBED8B8, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED928, 1, 0) = 0
9121: time() = 1132087382
9121: stat64("projects/setiathome.berkeley.edu/setiathome_4.02_sparc-sun-solaris2.7", 0xFFBEF670) = 0
9121: poll(0xFFBEDA20, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED938, 1, 100) = 1
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: read(6, " - " M ' A ! H 0 @ # 4 >".., 16384) = 2920
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED8B8, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED928, 1, 0) = 1
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: read(6, " + \\ ( # 5 U Q ! D Y # C".., 16384) = 2920
9121: time() = 1132087382
9121: write(7, " ) V 4 K , \\ 5 P = > N B".., 8192) = 8192
9121: time() = 1132087382
9121: poll(0xFFBED928, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: stat64("projects/setiathome.berkeley.edu/setiathome_4.02_sparc-sun-solaris2.7", 0xFFBEF670) = 0
9121: poll(0xFFBEDA20, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED928, 1, 0) = 1
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: read(6, " T 9 / N J 6 < % A 6 O )".., 16384) = 4380
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED928, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: stat64("projects/setiathome.berkeley.edu/setiathome_4.02_sparc-sun-solaris2.7", 0xFFBEF670) = 0
9121: poll(0xFFBEDA20, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED928, 1, 0) = 1
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: read(6, " 8 6 - S D [ 8 " - V 1 A".., 16384) = 2920
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED928, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: stat64("projects/setiathome.berkeley.edu/setiathome_4.02_sparc-sun-solaris2.7", 0xFFBEF670) = 0
9121: poll(0xFFBEDA20, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED928, 1, 0) = 1
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: read(6, " A / >\\n . I U Z " 9 @".., 16384) = 1460
9121: time() = 1132087382
9121: write(7, " [ ^ > J V : A 8 J M 3 1".., 8192) = 8192
9121: time() = 1132087382
9121: poll(0xFFBED928, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: stat64("projects/setiathome.berkeley.edu/setiathome_4.02_sparc-sun-solaris2.7", 0xFFBEF670) = 0
9121: poll(0xFFBEDA20, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: poll(0xFFBED928, 1, 0) = 0
9121: time() = 1132087382
9121: stat64("projects/setiathome.berkeley.edu/setiathome_4.02_sparc-sun-solaris2.7", 0xFFBEF670) = 0
9121: poll(0xFFBEDA20, 1, 0) = 0
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: signotifywait() = 2
9121: lwp_sigredirect(1, SIGINT, 0xFF00FC4C) = 0
9121: Received signal #2, SIGINT, in poll() [caught]
9121: poll(0xFFBED938, 1, 100) Err#4 EINTR
9121: sigprocmask(SIG_SETMASK, 0xFF13EFE8, 0x00000000) = 0
9121: time() = 1132087382
2005-11-15 20:43:02 [---] Received signal 2
9121: write(1, " 2 0 0 5 - 1 1 - 1 5 2".., 44) = 44
9121: sigprocmask(SIG_SETMASK, 0xFF14ADB8, 0x00000000) = 0
9121: setcontext(0xFFBED620)
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
9121: time() = 1132087382
2005-11-15 20:43:02 [---] Exit requested by user
9121: write(1, " 2 0 0 5 - 1 1 - 1 5 2".., 49) = 49
9121: times(0xFFBEF9B0) = 829594131
9121: waitid(P_PID, 9136, 0xFFBEF9C0, WEXITED|WTRAPPED|WNOHANG) = 0
9121: times(0xFFBEF9A0) = 829594131
9121: shmdt(0xFF0E0000) = 0
9121: shmget(123905, 0, 0) = 3
9121: shmctl(3, 12, 0xFFBEF950) = 0
9121: shmctl(3, 10, 0) = 0
9121: time() = 1132087382
2005-11-15 20:43:02 [---] request_reschedule_cpus: exit_tasks
9121: write(1, " 2 0 0 5 - 1 1 - 1 5 2".., 62) = 62
9121: open64("client_state_next.xml", O_WRONLY|O_CREAT|O_TRUNC, 0666) = 8
9121: time() = 1132087382
9121: fstat64(8, 0xFFBEED50) = 0
9121: ioctl(8, TCGETA, 0xFFBEECDC) Err#25 ENOTTY
9121: write(8, " < c l i e n t _ s t a t".., 8192) = 8192
9121: write(8, " _ s t a t u s > 0 < / e".., 1661) = 1661
9121: close(8) = 0
9121: rename("client_state.xml", "client_state_prev.xml") = 0
9121: rename("client_state_next.xml", "client_state.xml") = 0
9121: time() = 1132087382
9121: llseek(0, 0, SEEK_CUR) = 95126
9121: llseek(0, 0, SEEK_CUR) = 95126
9121: write(7, " 6 # @ F I H N > D I 7".., 1017) = 1017
9121: _exit(0)

ID: 189315 · Report as offensive
Dotsch
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 2422
Credit: 919,393
RAC: 0
Germany
Message 189362 - Posted: 15 Nov 2005, 23:16:08 UTC - in response to Message 189315.  
Last modified: 15 Nov 2005, 23:17:35 UTC

I am having the same issue..
here is the last 100 lines of truss

Thanks for the truss. Can you please email me (seti_boinc at dotsch.de) the complete truss from "truss -f ./boinc_client" - please gzip it. The truss should run about 5 minutes.

Have you verified the permisions for the directories and files, like described in my other posting ?

ID: 189362 · Report as offensive
Profile ozono28 [Pablo Ibanez]

Send message
Joined: 22 Mar 00
Posts: 4
Credit: 81,529
RAC: 0
Chile
Message 189530 - Posted: 16 Nov 2005, 14:38:51 UTC - in response to Message 188917.  
Last modified: 16 Nov 2005, 14:41:37 UTC

Dotsch:

Permisions should be fine since I'm running as super user.
I gave all permissions to everything anyway (chmod -R 777 *) but the problem was the same.

After copying the workunit to work_unit.sah the application got stuck in the following message "Insufficient work requesting more".

I did set everything back to normal and I have also sent you by mail the 'truss' output (5 min log).

Cheers,

ID: 189530 · Report as offensive
Dotsch
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 2422
Credit: 919,393
RAC: 0
Germany
Message 189746 - Posted: 17 Nov 2005, 7:37:52 UTC - in response to Message 189530.  

Dotsch:

Permisions should be fine since I'm running as super user.
I gave all permissions to everything anyway (chmod -R 777 *) but the problem was the same.

After copying the workunit to work_unit.sah the application got stuck in the following message "Insufficient work requesting more".

I did set everything back to normal and I have also sent you by mail the 'truss' output (5 min log).

Cheers,

Thank you for the complete truss.

Your output looks, like the libgcc_s.so.1 is not available.
Can you please post me the output from "ldd ./boinc_client" and "ldd projects/setiathome.berkeley.edu/setiathome_4.02_sparc-sun-solaris2.7".

If ldd reports, that the libgcc_s or libstdc++ installed, please download the package "libgcc" from www.sunfreeware.com.

If you have the libs installed, do a "LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/$PATH_TO_THE_LIBGCC ; export LD_LIBRARY_PATH"

@Mystic_Vampyr : Could you try this, too.

ID: 189746 · Report as offensive
Profile ozono28 [Pablo Ibanez]

Send message
Joined: 22 Mar 00
Posts: 4
Credit: 81,529
RAC: 0
Chile
Message 189884 - Posted: 17 Nov 2005, 20:28:14 UTC - in response to Message 189746.  

Dotsch:

I thought that GCC was fine because:

l> pkginfo |grep -i gcc
utility GNUgcc GNU gcc 2.95.2 SPARC 32bit Solaris 8

But after running your recomendation it was clear that libgcc link and file were not present:

> ldd ./projects/setiathome.berkeley.edu/setiathome_4.02_sparc-sun-solaris2.7
libsocket.so.1 => /usr/lib/libsocket.so.1
libnsl.so.1 => /usr/lib/libnsl.so.1
libelf.so.1 => /usr/lib/libelf.so.1
libdl.so.1 => /usr/lib/libdl.so.1
libaio.so.1 => /usr/lib/libaio.so.1
libm.so.1 => /usr/lib/libm.so.1
libgcc_s.so.1 => (file not found)
libc.so.1 => /usr/lib/libc.so.1
libmp.so.2 => /usr/lib/libmp.so.2
/usr/platform/SUNW,Ultra-4/lib/libc_psr.so.1

I did download libgcc from from inernet but I also had to create a simbolik link to /usr/lib/ where the application was expecting the library to be:

> ln -s /usr/local/lib/libgcc_s.so.1 /usr/lib/libgcc_s.so.1


Now BOIC is working fine.

THANKS A LOT FOR YOUR HELP !!!

Cheers !!
ID: 189884 · Report as offensive
Profile Lord Vampyr

Send message
Joined: 27 May 00
Posts: 5
Credit: 888,697
RAC: 0
United States
Message 190264 - Posted: 18 Nov 2005, 15:56:01 UTC - in response to Message 189884.  

This got mine working too!

Thanks
ID: 190264 · Report as offensive

Questions and Answers : Unix/Linux : BOINC keeps restarting result (Solaris 8)


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.