[Beowulf] Re: [Bioclusters] FW: cluster newbie (fwd from dag at sonsorol.org)

Wed Jan 5 12:12:27 PST 2005

----- Forwarded message from Chris Dagdigian <dag at sonsorol.org> -----

From: Chris Dagdigian <dag at sonsorol.org>
Date: Wed, 05 Jan 2005 14:43:55 -0500
To: "Clustering,  compute farming & distributed computing in life science informatics" <bioclusters at bioinformatics.org>
Subject: Re: [Bioclusters] FW: cluster newbie
Organization: Bioteam Inc. 
User-Agent: Mozilla/5.0 (Macintosh; U; PPC Mac OS X Mach-O; en-US;
	rv:1.7.5) Gecko/20041217
Reply-To: "Clustering,  compute farming & distributed computing in life science informatics" <bioclusters at bioinformatics.org>

Hi Nick,

I've written about this in the past; you can find stuff in the list 
archives.

Some of the articles and presentations I've done on "bioclusters"  are 
linked off this URL: http://bioteam.net/dag/ - something there may be of 
use.

In general I'd go with BioBrew or the ROCKS cluster kit ("roll") that 
comes with Grid Engine as the scheduler if you are starting out.

If you want to roll your own cluster to have maximum flexibility and 
really learn the behind the scenes stuff just pick the "best" (or your 
favorite) componants to match the following requirements:

1. A Linux distribution

2. A Resource manager & scheduler software (Grid Engine, etc.)

3. Software for doing unattended "bare metal" installs and incremental 
updates (SystemImager, etc.)

4. Management & monitoring packages (ganglia, nagios, bigbrother, etc.)

Take what you chose for #1-4 and put all the compute nodes on a private 
gigabit ethernet switch. There should be a common NFS share for user 
home directories and maybe for the scheduler system. Pick one node to be 
the "master" node and connect one of its NICs to the "private" cluster 
network and the 2nd NIC to the company/department network. This way you 
and your users only have to login to and deal with the one single 
"master" node.

-Chris

Nick D'Angelo wrote:

>
>All,
> 
>I am sure this has been asked many times before, but what is the preferred
>method or perhaps 'best' method of clustering a few 3-5 RedHat or other
>Linux flavours to best suit our Bioinfo R and D group?
> 
>I have come across biobrew with their own cd distribution install and also
>this group.
>
>I was going to originally look at Fedora core 2, but that appeared to be
>painful due to the kernel re-compile and to be honest, the documentation
>appeared to be quite poor, at least what I found.
> 
> Any suggestions?
> 
> Thanks,

-- 
Chris Dagdigian, <dag at sonsorol.org>
BioTeam  - Independent life science IT & informatics consulting
Office: 617-665-6088, Mobile: 617-877-5498, Fax: 425-699-0193
PGP KeyID: 83D4310E iChat/AIM: bioteamdag  Web: http://bioteam.net
_______________________________________________
Bioclusters maillist  -  Bioclusters at bioinformatics.org
https://bioinformatics.org/mailman/listinfo/bioclusters

----- End forwarded message -----
-- 
Eugen* Leitl <a href="http://leitl.org">leitl</a>
______________________________________________________________
ICBM: 48.07078, 11.61144            http://www.leitl.org
8B29F6BE: 099D 78BA 2FD3 B014 B08A  7779 75B0 2443 8B29 F6BE
http://moleculardevices.org         http://nanomachines.net
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 198 bytes
Desc: not available
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20050105/6dd9af58/attachment.sig>