RenderBoxx 10100 and Linux BOINC/SETI computational errors

Message boards : Number crunching : RenderBoxx 10100 and Linux BOINC/SETI computational errors
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile ab1vl-pi
Avatar

Send message
Joined: 11 Jun 18
Posts: 4
Credit: 68,014
RAC: 0
United States
Message 1955253 - Posted: 13 Sep 2018, 21:13:58 UTC

I recently acquired some SuperMicro -based video rendering boxes that feature 2 x Xeon 4 core chips which operate in tandem as one 8 core cpu, 16GB ECC RAM, running at various default clocks between 2.20GHz and 2.6Ghz. These run other projects *fast* and without error. I have 100 concurrent hours of MW@Home and Asteroids@home with no glitches,( Lubuntu 18.04 with latest updates). Note that these have a basic VGA port but no GPU, thus No Usable GPUs is found in the log as expected.

When I add seti as a project the WUs download but immediately terminate with computational errors. One hunch is that the jobs downloaded are based on the assumption I have a GPU, perhaps because the benchmark completes so quickly. Also, I have seen some seti WUs that show up in Status as using 2 cores , (Windows dual core machines) so could it be that the app (or boinc) may be getting WUs that are restricted to 2 cores? That are aborting because they can't cope with my 8 core CPU setup?

Is there a document that explains seti wu naming conventions so I can further investigate?

I have 10 of these 8 core machines and it would be a shame if I could not figure out a way to throw them at seti, since they run WUs pretty fast. Unfortunately they seem to have no usable PCIe slot for a GPU, but I have only looked into one of them so far.

tnx in advance
ID: 1955253 · Report as offensive
Ian&Steve C.
Avatar

Send message
Joined: 28 Sep 99
Posts: 4267
Credit: 1,282,604,591
RAC: 6,640
United States
Message 1955261 - Posted: 13 Sep 2018, 21:49:12 UTC

looking at your system list i'm guessing it's this one: https://setiathome.berkeley.edu/results.php?hostid=8576128

as you can see it is not sending you GPU jobs, but rather normal CPU jobs. I can see they are exiting immediately, but i dont see a clear reason why. maybe someone else more well versed can take a look.

here is a stderr.txt file from a job of yours

<core_client_version>7.9.3</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)</message>
<stderr_txt>
boinc-app-seti is invoked with the following arguments: 
--start
--end.
I: boinc_parse_init_data_file
I: boinc_get_init_data
I: boinc_init
I: worker
I: worker() - common_init
I: worker() - read_wu_state
SIGSEGV: segmentation violation
Stack trace (10 frames):
/usr/lib/x86_64-linux-gnu/libboinc.so.7(boinc_catch_signal+0x1d8)[0x7f3ec4f811ab]
/lib/x86_64-linux-gnu/libc.so.6(+0x3ef20)[0x7f3ec5446f20]
../../projects/setiathome.berkeley.edu/setiathome_v8(+0x23b31)[0x55d7e3b4db31]
../../projects/setiathome.berkeley.edu/setiathome_v8(+0x2bee6)[0x55d7e3b55ee6]
../../projects/setiathome.berkeley.edu/setiathome_v8(+0x5521e)[0x55d7e3b7f21e]
../../projects/setiathome.berkeley.edu/setiathome_v8(+0x2e9bc)[0x55d7e3b589bc]
../../projects/setiathome.berkeley.edu/setiathome_v8(+0x2ec38)[0x55d7e3b58c38]
../../projects/setiathome.berkeley.edu/setiathome_v8(+0xea68)[0x55d7e3b38a68]
/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xe7)[0x7f3ec5429b97]
../../projects/setiathome.berkeley.edu/setiathome_v8(+0xf4fa)[0x55d7e3b394fa]

Exiting...

</stderr_txt>
]]>


perhaps you dont have all the required dependencies for boinc? how did you install it? are you using a custom app from Lunatics or something?
Seti@Home classic workunits: 29,492 CPU time: 134,419 hours

ID: 1955261 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1955262 - Posted: 13 Sep 2018, 21:56:39 UTC - in response to Message 1955253.  

I only see 1 of those computers that have processed 3 tasks and they errored with:
<core_client_version>7.9.3</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)</message>
<stderr_txt>
boinc-app-seti is invoked with the following arguments: 
--start
--end.
I: boinc_parse_init_data_file
I: boinc_get_init_data
I: boinc_init
I: worker
I: worker() - common_init
I: worker() - read_wu_state
SIGSEGV: segmentation violation
Stack trace (10 frames):
/usr/lib/x86_64-linux-gnu/libboinc.so.7(boinc_catch_signal+0x1d8)[0x7ff6b01151ab]
/lib/x86_64-linux-gnu/libc.so.6(+0x3ef20)[0x7ff6b05daf20]
../../projects/setiathome.berkeley.edu/setiathome_v8(+0x23b31)[0x56113d3d2b31]
../../projects/setiathome.berkeley.edu/setiathome_v8(+0x2bee6)[0x56113d3daee6]
../../projects/setiathome.berkeley.edu/setiathome_v8(+0x5521e)[0x56113d40421e]
../../projects/setiathome.berkeley.edu/setiathome_v8(+0x2e9bc)[0x56113d3dd9bc]
../../projects/setiathome.berkeley.edu/setiathome_v8(+0x2ec38)[0x56113d3ddc38]
../../projects/setiathome.berkeley.edu/setiathome_v8(+0xea68)[0x56113d3bda68]
/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xe7)[0x7ff6b05bdb97]
../../projects/setiathome.berkeley.edu/setiathome_v8(+0xf4fa)[0x56113d3be4fa]

Exiting...

</stderr_txt>
]]>

One other computer has downloaded a handful of tasks but not returned anything yet.

I would check for dependencies for the Anonymous app you installed as well as the execute attribute for the app.
And of course if you're trying to run a AVX app on a non-AVX CPU.

If you don't have any GPUs in them, there isn't really any need for the Anonymous platform for non AVX CPUs, just delete your app_info.xml and let the server give you an app.
ID: 1955262 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1955264 - Posted: 13 Sep 2018, 22:09:29 UTC - in response to Message 1955262.  
Last modified: 13 Sep 2018, 22:10:53 UTC

This;
boinc-app-seti
and this;
libboinc.so.7

Those files look like something you would get from the Repository install. I've noticed some repositories are providing their own CPU Apps. From what I've seen, they don't work very well.
I would suggest he download the SSE41 version from Lunatics and try that. Or he could download the BOINC-All-In-One package from C.A. and dig the Intel SSE41 App out of that package, it would be a little more difficult.

The Lunatics CPU App is here, http://lunatics.kwsn.info/index.php?action=downloads;sa=view;down=482
ID: 1955264 · Report as offensive

Message boards : Number crunching : RenderBoxx 10100 and Linux BOINC/SETI computational errors


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.