Wonder if anyone has tried this, if so, how dit it turn out?

Message boards : Number crunching : Wonder if anyone has tried this, if so, how dit it turn out?
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
Profile Chiana
Avatar

Send message
Joined: 2 Feb 05
Posts: 19
Credit: 165,472
RAC: 0
Sweden
Message 88336 - Posted: 20 Mar 2005, 8:19:51 UTC

Was thinking of setting up a clustered computer using the "spare power" of an entire web-hotel using Linux and OpenMosix, my guess is that Boinc would think that every attached computer is a CPU and use it as such, but what will happen when one such CPU "dissapears" because it has been rented by a customer or become used otherwise? I don't have the hardware in place yet, that's why asking this question.


ID: 88336 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13959
Credit: 208,696,464
RAC: 304
Australia
Message 88397 - Posted: 20 Mar 2005, 11:06:14 UTC - in response to Message 88336.  

> but what will happen when one such CPU "dissapears" because it has been rented by a customer or become used otherwise?

I expect nothing much- no real difference compared to a multiple CPU unit releasing it for use by an appliaction as required.
Grant
Darwin NT
ID: 88397 · Report as offensive
Profile rattelschneck
Avatar

Send message
Joined: 14 Apr 01
Posts: 435
Credit: 842,179
RAC: 0
Germany
Message 88411 - Posted: 20 Mar 2005, 12:35:40 UTC - in response to Message 88336.  
Last modified: 20 Mar 2005, 12:36:35 UTC

> Was thinking of setting up a clustered computer using the "spare power" of an
> entire web-hotel using Linux and OpenMosix, my guess is that Boinc would think
> that every attached computer is a CPU and use it as such, but what will happen
> when one such CPU "dissapears" because it has been rented by a customer or
> become used otherwise? I don't have the hardware in place yet, that's why
> asking this question.
>
>
I have tested boinc on a 4 node OpenMosix cluster system end of last year. It was very disappointing to me, because the core client processes didn't migrate. I expectet the system's behavior as a 8 cpu system ( 4 x P4 hyper threading). But there was only 2 cc processes, just like a standard P4 with ht. The other 3 boxes didn't nothing at all!

BTW, the classic seti@home client is running just fine on a OpenMosix cluster. All processes are migrating straight away, and it looks like 1 node is doing all the work.


ID: 88411 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13959
Credit: 208,696,464
RAC: 304
Australia
Message 88641 - Posted: 20 Mar 2005, 23:18:47 UTC - in response to Message 88411.  

> I have tested boinc on a 4 node OpenMosix cluster system end of last year. It
> was very disappointing to me, because the core client processes didn't
> migrate. I expectet the system's behavior as a 8 cpu system ( 4 x P4 hyper
> threading). But there was only 2 cc processes, just like a standard P4 with
> ht. The other 3 boxes didn't nothing at all!

Use more than 2 CPUs enabled in your settings?
Grant
Darwin NT
ID: 88641 · Report as offensive
Profile rattelschneck
Avatar

Send message
Joined: 14 Apr 01
Posts: 435
Credit: 842,179
RAC: 0
Germany
Message 88658 - Posted: 20 Mar 2005, 23:38:25 UTC - in response to Message 88641.  


> Use more than 2 CPUs enabled in your settings?

You know what? I'm not pretty sure if i enabled the settings using more than 2 cpus. You caught me on the wrong feet, as we say in germany.
I guess i will have to test again.

rattelschneck
ID: 88658 · Report as offensive
Profile rattelschneck
Avatar

Send message
Joined: 14 Apr 01
Posts: 435
Credit: 842,179
RAC: 0
Germany
Message 90980 - Posted: 25 Mar 2005, 19:41:56 UTC


I did another test run on a really small OpenMosix cluster setup, just 2 hosts. This time i set the preferences to use 8 cpus, to be on the save side ;)

After downloading the official boinc client for linx and starting it, i've got the same result again. The core client process won't migrate. I did a snapshot, so if somebody is interested in this, please take a look here:
Boinc snapshot

A closer look at the processes exposed them as 'monkeys'.

The documentation says:
There are several reasons, why the process may not be eligible for migration in OpenMosix. If the process cannot be migrated, the reason is listed in the file /proc/pid/cantmove. Here is the description of some of the possible reasons:
* monkey: the application is using files as shared memory

For comparison purposes i did a another test run with the seti classic client on the same setup. Two command line clients has been started. As expected the second process migrated straight away. I did another snapshot as you can see here:

Seti classic snapshot

At the end of the day it looks to me that OpenMosix is not the way to go if it comes to Boinc.
If there are some other opinions on this issue, i would very much appreciate to hear about it.


ID: 90980 · Report as offensive
Profile Chiana
Avatar

Send message
Joined: 2 Feb 05
Posts: 19
Credit: 165,472
RAC: 0
Sweden
Message 96269 - Posted: 8 Apr 2005, 11:28:25 UTC

Hmmm... Perhaps the common filesystem built into OpenMosix can be used, haven't experimented with it yet thou...
I imagine it can be somehow used to allow the processes to migrate, but the docus for it is somewhat vague...

ID: 96269 · Report as offensive
Profile Zerjillo

Send message
Joined: 15 Feb 03
Posts: 27
Credit: 461,381
RAC: 0
Spain
Message 96311 - Posted: 8 Apr 2005, 14:57:12 UTC - in response to Message 90980.  

I've been thinking about this same issue during the last days and here are more points about it:

- Does an openMosix cluster gives information about the number of cpus by adding all available cpus in the cluster? I mean BOINC uses some kind of system call to know how many cpus does the system have. Does an openmosix cluster patch this system call to report a different number of cpus? I think the answer will be no, and that's why boinc only launches 2 processes in your cluster.

- Another problem would be what happens with a disconnecting node. Of course it would be beautiful that boinc notices when some new cpus are added / removed, but I really don't think that the program is prepared to do such thing right now.

- Maybe is only the seti client that is using files as shared memory and that's why those 2 processes are not migrating to other cpus? Maybe with another projects it would work? Could this "files as shared memory" be the way that the boinc and seti applications communicate?

I'm not shure about it, but patching the boinc client (it is open source, you can do it! ;-) ) and allow it to launch more tasks than the number of cpus that it sees could be more or less straightforward.

How to deal with nodes that "leave" the cluster would be a more complicated issue. Of course, if the node is gratefully shouted down the crunching process would migrate into another working node, but then that node would had 2 crunching processes in it, which would not be appropiate due to competition for resources. Maybe the boinc core should examine the system every minute or so to know how many available computing cpus we have... Of course, that could be difficult if the system does not provide the exact number of processors with the system call I already mentioned. Maybe some more patches would be neccessary to check the real and actual number of processors on the cluster.

And of course if the communication between boinc and applications is always made by "files as shared memory" the problems would increase as they cannot directly migrate. Then, we would need to also patch the applications for boinc, which is by far more complicated, and even impossible, since not all of them are open source.

More thoughts about it? Would it be interesting to release a "special openMosix edition" for BOINC? Note that for "farmers" could be just a great idea. With cds like ClusterKnoppix make a BOINC farm would be a straightforward issue. And think about mainteneance... just one BOINC application. Just upgrading in the "master" node would be wonderful!

Greetings

Zerjillo

ID: 96311 · Report as offensive
Profile Paul D. Buck
Volunteer tester

Send message
Joined: 19 Jul 00
Posts: 3898
Credit: 1,158,042
RAC: 0
United States
Message 96330 - Posted: 8 Apr 2005, 16:35:27 UTC

BOINC uses shared memory ...
ID: 96330 · Report as offensive
Profile Chiana
Avatar

Send message
Joined: 2 Feb 05
Posts: 19
Credit: 165,472
RAC: 0
Sweden
Message 96657 - Posted: 9 Apr 2005, 21:24:32 UTC - in response to Message 96311.  

> I've been thinking about this same issue during the last days and here are
> more points about it:
>
> - Does an openMosix cluster gives information about the number of cpus by
> adding all available cpus in the cluster? I mean BOINC uses some kind of
> system call to know how many cpus does the system have. Does an openmosix
> cluster patch this system call to report a different number of cpus? I think
> the answer will be no, and that's why boinc only launches 2 processes in your
> cluster.
>
> - Another problem would be what happens with a disconnecting node. Of course
> it would be beautiful that boinc notices when some new cpus are added /
> removed, but I really don't think that the program is prepared to do such
> thing right now.
>
> - Maybe is only the seti client that is using files as shared memory and
> that's why those 2 processes are not migrating to other cpus? Maybe with
> another projects it would work? Could this "files as shared memory" be the way
> that the boinc and seti applications communicate?
>
> I'm not shure about it, but patching the boinc client (it is open source, you
> can do it! ;-) ) and allow it to launch more tasks than the number of cpus
> that it sees could be more or less straightforward.
>
> How to deal with nodes that "leave" the cluster would be a more complicated
> issue. Of course, if the node is gratefully shouted down the crunching process
> would migrate into another working node, but then that node would had 2
> crunching processes in it, which would not be appropiate due to competition
> for resources. Maybe the boinc core should examine the system every minute or
> so to know how many available computing cpus we have... Of course, that could
> be difficult if the system does not provide the exact number of processors
> with the system call I already mentioned. Maybe some more patches would be
> neccessary to check the real and actual number of processors on the cluster.
>
> And of course if the communication between boinc and applications is always
> made by "files as shared memory" the problems would increase as they cannot
> directly migrate. Then, we would need to also patch the applications for
> boinc, which is by far more complicated, and even impossible, since not all of
> them are open source.
>
> More thoughts about it? Would it be interesting to release a "special
> openMosix edition" for BOINC? Note that for "farmers" could be just a great
> idea. With cds like ClusterKnoppix make a BOINC farm would be a
> straightforward issue. And think about mainteneance... just one BOINC
> application. Just upgrading in the "master" node would be wonderful!
>

A lot of things to think of I agree, have to take a good peek into the docus of both boinc and Omosix... Have a 5-system cluster at our computers-club, perhaps I should utilize it for a weekend or so to do some real "field-testing"...

If I do remember correctly the docs for Mos said something like that all those common filesystems were mirrored to all nodes, this means that if the common memory is on such filesystem it should be able to migrate to another node since it should have the same name, content et.c.


ID: 96657 · Report as offensive
Profile Chiana
Avatar

Send message
Joined: 2 Feb 05
Posts: 19
Credit: 165,472
RAC: 0
Sweden
Message 97267 - Posted: 11 Apr 2005, 0:18:29 UTC
Last modified: 11 Apr 2005, 0:19:24 UTC

I wonder if this is something worth looking into in this matter...


ID: 97267 · Report as offensive
Profile Zerjillo

Send message
Joined: 15 Feb 03
Posts: 27
Credit: 461,381
RAC: 0
Spain
Message 97418 - Posted: 11 Apr 2005, 6:55:39 UTC - in response to Message 97267.  

> I wonder if this is something worth
> looking into in this matter...

The last day I posted a message here I found that patch, but I'm not really shure if it is what we are looking for. If I found time I'll try to check it.

Zerjillo

ID: 97418 · Report as offensive
Alex

Send message
Joined: 26 Sep 01
Posts: 260
Credit: 2,327
RAC: 0
Canada
Message 97462 - Posted: 11 Apr 2005, 9:13:35 UTC - in response to Message 96311.  
Last modified: 11 Apr 2005, 9:16:49 UTC

>
> More thoughts about it? Would it be interesting to release a "special
> openMosix edition" for BOINC? Note that for "farmers" could be just a great
> idea. With cds like ClusterKnoppix make a BOINC farm would be a
> straightforward issue. And think about mainteneance... just one BOINC
> application. Just upgrading in the "master" node would be wonderful!
>


Overclockix has boinc and folding on it.
http://overclockix.octeams.com/

I think at one time it ran openmosix. Not sure what it runs now.
Last time I used it, the folding@home client was set up for someone else's account, not sure what they have it set up for now but the config file is editable.


ID: 97462 · Report as offensive
Profile rattelschneck
Avatar

Send message
Joined: 14 Apr 01
Posts: 435
Credit: 842,179
RAC: 0
Germany
Message 97499 - Posted: 11 Apr 2005, 13:48:03 UTC - in response to Message 97462.  


> Overclockix has boinc and folding on it.
> http://overclockix.octeams.com/
>

Better go for this.
This is OpenMosix ready.
Installation of the boinc client isn't a big issue.


ID: 97499 · Report as offensive
Profile rattelschneck
Avatar

Send message
Joined: 14 Apr 01
Posts: 435
Credit: 842,179
RAC: 0
Germany
Message 97502 - Posted: 11 Apr 2005, 13:50:06 UTC - in response to Message 97267.  
Last modified: 11 Apr 2005, 13:54:12 UTC

> I wonder if this is something worth
> looking into in this matter...
>

Hmm, it could be worth looking deeper into it. I will have a try asap.

BTW, is there a way to rename your thread? Giving it a name like "Running BOINC on a linux cluster" could possibly catch more interested users.
What do you think?


ID: 97502 · Report as offensive
Profile rattelschneck
Avatar

Send message
Joined: 14 Apr 01
Posts: 435
Credit: 842,179
RAC: 0
Germany
Message 97503 - Posted: 11 Apr 2005, 14:02:18 UTC - in response to Message 96311.  

> I've been thinking about this same issue during the last days and here are
> more points about it:

Very interesting thoughts, really. I'm in a bit of a hurry right now.
I will come back to it, later on.


ID: 97503 · Report as offensive
Profile Zerjillo

Send message
Joined: 15 Feb 03
Posts: 27
Credit: 461,381
RAC: 0
Spain
Message 97530 - Posted: 11 Apr 2005, 15:02:43 UTC - in response to Message 97499.  
Last modified: 11 Apr 2005, 15:05:19 UTC

> Better go for this.
> This is OpenMosix ready.
> Installation of the boinc client isn't a big issue.

Yep, I downloaded it last friday but I have no chance to test anything. By the way, I'm not sure if clusterknoppix directly implements "Migshm" (if with migshm the processes migrate, the changes to the boinc core would be easy, I think). Anyway, we could think about creating a derived clusterknoppix distro with just the minimum requirements to run boinc (delete some software that would not be useful, add migshm (if possible) & add modified boinc core).

If you have the chance to test cluster knoppix and patch it with migshm I will be very interested in your results.

Greetings

Zerjillo


ID: 97530 · Report as offensive
Profile Chiana
Avatar

Send message
Joined: 2 Feb 05
Posts: 19
Credit: 165,472
RAC: 0
Sweden
Message 98804 - Posted: 14 Apr 2005, 19:35:40 UTC - in response to Message 97502.  

>
> BTW, is there a way to rename your thread? Giving it a name like "Running
> BOINC on a linux cluster" could possibly catch more interested users.
> What do you think?
>

Sounds like a good idea to me, I can't do that myself however, but I'll bet there is an admin or two that can...


ID: 98804 · Report as offensive
BonnyFused

Send message
Joined: 14 Jun 00
Posts: 6
Credit: 325,626
RAC: 0
Italy
Message 157455 - Posted: 26 Aug 2005, 13:11:03 UTC

So, guys... any news about running BOINC on a Linux cluster?
I'd like to give it a try...

Thanks...
ID: 157455 · Report as offensive
Astro
Volunteer tester
Avatar

Send message
Joined: 16 Apr 02
Posts: 8026
Credit: 600,015
RAC: 0
Message 157499 - Posted: 26 Aug 2005, 14:57:02 UTC - in response to Message 98804.  

>
> BTW, is there a way to rename your thread? Giving it a name like "Running
> BOINC on a linux cluster" could possibly catch more interested users.
> What do you think?
>

Sounds like a good idea to me, I can't do that myself however, but I'll bet there is an admin or two that can...

Chiana, Pilot says, "you are the only one (except devs/moderators) that CAN change the title of your thread. You need to make a post, then "Edit" the post, to the left will be the title of the thread in the box. Edit the title and click submit and you're done.

Signed
the crew of Moya,

You trollup ;)
ID: 157499 · Report as offensive
1 · 2 · Next

Message boards : Number crunching : Wonder if anyone has tried this, if so, how dit it turn out?


 
©2025 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.