From csamuel at vpac.org  Mon Jan  1 18:06:54 2007
From: csamuel at vpac.org (Chris Samuel)
Date: Tue, 2 Jan 2007 13:06:54 +1100
Subject: [Beowulf] Which distro for the cluster?
In-Reply-To: <Pine.LNX.4.64.0612291022340.9199@lilith.rgb.private.net>
References: <1d151d3b0612270946m2b09039ct538339e487cad6e8@mail.gmail.com>
	<4594F3E6.5010803@gmail.com>
	<Pine.LNX.4.64.0612291022340.9199@lilith.rgb.private.net>
Message-ID: <200701021306.57800.csamuel@vpac.org>

On Saturday 30 December 2006 04:24, Robert G. Brown wrote:

> On Fri, 29 Dec 2006, Geoff Jacobs wrote:
>
> > What I'd like to see is an interested party which would implement a
> > good, long term security management program for FC(2n+b) releases. RH
> > obviously won't do this.
>
> I thought there was such a party, but I'm too lazy to google for it.

Fedora Legacy.  It's pretty much dead these days. :-(

http://fedoralegacy.org/

 Important Notice: December 12, 2006 

 The current model for supporting maintenance distributions is being 
re-examined. In the meantime, we are unable to extend support to older Fedora 
Core releases as we had planned. As of now, Fedora Core 4 and earlier 
distributions are no longer being maintained. 

-- 
 Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
 Victorian Partnership for Advanced Computing http://www.vpac.org/
 Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070102/5c891c9d/attachment.sig>

From csamuel at vpac.org  Mon Jan  1 18:10:48 2007
From: csamuel at vpac.org (Chris Samuel)
Date: Tue, 2 Jan 2007 13:10:48 +1100
Subject: [Beowulf] Which distro for the cluster?
In-Reply-To: <4594E874.9060905@gmail.com>
References: <1d151d3b0612270946m2b09039ct538339e487cad6e8@mail.gmail.com>
	<200612290939.59593.csamuel@vpac.org> <4594E874.9060905@gmail.com>
Message-ID: <200701021310.48609.csamuel@vpac.org>

On Friday 29 December 2006 21:05, Geoff Jacobs wrote:

> Here's a bare bones kickstart method (not Kickstart[tm] per se):
> http://linuxmafia.com/faq/Debian/kickstart.html

Good old Rick, he crops up everywhere & is a mine of information. ;-)

> Regarding kickstart, among choices for pre-scripted installers it is one
> of many. I personally favor the likes of SystemImager, even though it's
> not quite in the same category (FAI is though, IMO). Even dd with netcat
> is pretty powerful for homogeneous nodes.

FAI is the one I've heard of before, but never had the chance to play with it 
yet.   I hear tell that Warewulf is distro neutral and will deploy J.Random 
Distro onto hardware (and maybe even 'doze, shudder).

> Once you've chosen your distro based on experience/need, there are
> usually a few ways to put it on your spindles.

Oh indeed!

cheers,
Chris
-- 
 Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
 Victorian Partnership for Advanced Computing http://www.vpac.org/
 Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070102/2ae2b284/attachment.sig>

From dag at sonsorol.org  Tue Jan  2 13:06:08 2007
From: dag at sonsorol.org (Chris Dagdigian)
Date: Tue, 2 Jan 2007 16:06:08 -0500
Subject: [Beowulf] picking out a job scheduler
In-Reply-To: <58106678-33A8-40D7-BA1E-4DF128F1A7FC@gmail.com>
References: <58106678-33A8-40D7-BA1E-4DF128F1A7FC@gmail.com>
Message-ID: <DD7E1CEA-0993-430F-954A-9D936293EC64@sonsorol.org>


For what it's worth I'm a biased Grid Engine and Platform LSF user  ...

On Dec 29, 2006, at 11:40 AM, Nathan Moore wrote:

> I've presently set up a cluster of 5 AMD  dual-core linux boxes for  
> my students (at a small college).  I've got  MPICH running, shared  
> NIS/NFS home directories etc.  After reading the MPICH installation  
> guide and manual, I can't say I understand how to deploy MPICH for  
> my students to use.  So far as I can tell, there no load balancing  
> or migration of processes in the library, and so now I'm trying to  
> figure out what piece of software to add to the  cluster to (for  
> example) prevent the starting of an MPI job when there's already  
> another job running.
>
> (1) Is openPBS or gridengine the appropriate tool to use for a  
> multi-user system where mpich is available?  Are there better  
> scheduling options?
>

Both should be fine although if you are considering *PBS you should  
look at both Torque (a fork of OpenPBS I think) and PBSPro  
(commercial but last time I checked they had very good options for  
academic sites).  I can't speak intelligently about the PBS variants  
these days... it's been too long since I've been hands on.

Lots of people use Grid Engine with MPICH using both loose and tight  
integration methods. The mailing list  
(users at gridengine.sunsource.net) has a very helpful community with an  
excellent signal to noise ratio.

Despite being an SGE zealot there are times when I can make both a  
technical and business argument for why Platform LSF is the "best"  
solution for a particular project or problem -- you may want to add  
this to your evaluation plate if you are considering (at all)  
commercial options. If not, don't sweat it.  For a small cluster in  
an academic environment LSF may be hard to justify but if you can get  
good academic pricing it is often worthwhile to crunch the numbers --  
LSF in some cases can 'win' from a features,  lower-administrative- 
burden and support perspective but this a case-by-case thing.


> (1.5) Can mortals install and configure Gridengine?  Thus far it  
> seems too wonderful for me to understand.

Grid Engine is easy to install. I've posted an article here that  
covers the stuff I wish someone had told me beforehand about SGE:

"Things to think about before installing Grid Engine"

http://gridengine.info/articles/2005/09/29/things-to-think-about- 
before-installing

... it boils down to the fact that during installation SGE is  
unusually sensitive to issues regarding hostnames and forward/reverse  
DNS resolution.


>
> (2) Also, if my cluster is made up of a mix of single and dual  
> processor machines, what's the proper way to tell mpd about that  
> topology?

Depends on which MPI implementation and which of the many available  
methods you are using to bootstrap the process.

>
> (3) Its likely that in the future I'll have part-time access to  
> another cluster of dual-boot (XP/linux) machines.  The machines  
> will default to booting to Linux, but will occasionally (5-20 hours  
> a week) be used as windows workstations by a console user (when a  
> user is finished, they'll restart the machine and it will boot back  
> to linux).  If cluster nodes are available in this sort of  
> unpredictable and intermittent way, can they be used as compute  
> nodes in some fashion? Wil gridengine/PBS /??? take care of this  
> sort of process migration?
>

Grid Engine will not transparently preserve and migrate running jobs  
off of machines that get bounced suddenly.  This sort of transparent  
and automatic checkpointing and migration is actually pretty hard to  
do in practice.  If you know in advance which machines are going to  
be shut down and rebooted into windows then there are tools in all  
the common scheduling packages for "draining" a particular machine or  
queue.  You can also "kill and reschedule" jobs that are running on  
any one queue instance or cluster queue. One can even do this on a  
calendar basis when the  "need windows" schedule is predictable (does  
not seem possible in your case).  If the running cluster jobs are  
short lived so that you don't have a big runtime investment then you  
can bounce machines whenever you want - Grid Engine can be told to  
reschedule failed jobs automatically to a different available host --  
the hard case to deal with is the very long running jobs that (a)  
can't be reliably checkpoint or (b) are difficult to suspend/resume/ 
migrate due to the parallel application itself.

The answer may be application specific in your case.

Regards,
Chris


> best regards,
>
> Nathan
>
>
>
> - - - - - - - - - - - - - - - - - - - - - - -
> Nathan Moore
> Physics
> Winona State University
> nmoore at winona.edu
> AIM:nmoorewsu
>
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit  
> http://www.beowulf.org/mailman/listinfo/beowulf


From rgb at phy.duke.edu  Tue Jan  2 15:32:09 2007
From: rgb at phy.duke.edu (Robert G. Brown)
Date: Tue, 2 Jan 2007 18:32:09 -0500 (EST)
Subject: FW: [Beowulf] Which distro for the cluster?
In-Reply-To: <3D92CA467E530B4E8295214868F840FE0A317F81@emss01m12.us.lmco.com>
References: <3D92CA467E530B4E8295214868F840FE0A317F81@emss01m12.us.lmco.com>
Message-ID: <Pine.LNX.4.64.0701021826050.9199@lilith.rgb.private.net>

On Thu, 28 Dec 2006, Cunningham, Dave wrote:

> I notice that Scyld is notable by it's absence from this discussion.  Is
> that due to cost, or bad/no experience, or other factors?  There is a
> lot of interest in it around my company lately.

Scyld is a fine choice for a cluster, but not usually for a first time
learning cluster for non-professionals.  This is in part because it
costs money, and in part because it is designed to encapsulate a lot of
what one has to do to "make a cluster" to the point where it is nearly
entirely hidden from the user/administrator.  This is desireable from a
corporate point of view (although I personally think that one needs a
certain amount of actual cluster experience to get the most out of even
Scyld) but not so good for poor people seeking to learn.  It also limits
you at least somewhat to the particular parallel computing model that
Scyld itself embraces.

A good friend of mine at Duke uses Scyld for his biochemistry cluster,
and although he's been doing cluster computing for a rather long time
(close to 10 years at a guess, maybe even more) and COULD and HAS IN THE
PAST done it all himself, he really likes Scyld's general cluster
administration and encapsulation features.  Of course the grants that
fund the research are deep-pocketed enough to afford it, as well.  That
isn't always the case in academe, and it really isn't the case at home.

However, Don Becker is on the list and you've given him an open
invitation to present Scyld, who it is really designed and intended for,
and maybe even an overview of how it (currently) works.  Don?

    rgb

>
>  Dave Cunningham
>
> -----Original Message-----
> From: beowulf-bounces at beowulf.org [mailto:beowulf-bounces at beowulf.org]
> On Behalf Of Andrew M.A. Cater
> Sent: Thursday, December 28, 2006 8:40 AM
> To: beowulf at beowulf.org
> Subject: Re: [Beowulf] Which distro for the cluster?
>
> On Wed, Dec 27, 2006 at 06:46:25PM +0100, Chetoo Valux wrote:
>> Dear all,
>>
>> As a Linux user I've worked with several distros as RedHat, SuSE,
> Debian and
>> derivatives, and recently Gentoo.
>>
>> Now I face the challenge of building a HPC for scientific
> calculations, and
>> I wonder which distro would suit me best. As a Gentoo user, I've
> recognised
>> the power of customisation, optimisation and lightweight system, for
>> instance my 4 years old laptop flies like a youngster, and some
> desktops
>> too. So I thought about building the HPC nodes (8+1 master) with
> Gentoo ....
>>
>
> Don't use Gentoo unless you've a full, fast connection to the internet
> _AND_ you're prepared for your cluster to be internet connected while
> you build it. This IMHO.
>
> Scientific calculations: Quantian? Debian. Debian for the number of math
>
> and other packages and the ease of install. Over 8 nodes, it should be
> relatively easy to set up. But it depends what you want to do, what
> other users want to do etc. etc.
>
>> But then it comes the administration and maintenance burden, which for
> me it
>> should be the less, since my main task here is research ... so
> browsing the
>> net I found Rocks Linux with plenty of clustering docs and
> administration
>> tools & guidelines. I feel this should be the choice in my case, even
> if I
>> sacrifice some computation efficiency.
>
> Rocks / Warewulf perhaps. If you just want something you can
> build/update/maintain in your sleep, I'd still suggest Debian - if only
> because a _minimal_ install on the nodes is as small as you want it to
> be - and because it's fairly consistent. Your cluster - your choice but
> you may have to justify it to your co-workers.
>
> Andy
>>
>> Any advice on this will be appreciated.
>>
>> Chetoo.
>
>> _______________________________________________
>> Beowulf mailing list, Beowulf at beowulf.org
>> To change your subscription (digest mode or unsubscribe) visit
> http://www.beowulf.org/mailman/listinfo/beowulf
>
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit
> http://www.beowulf.org/mailman/listinfo/beowulf
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf
>

-- 
Robert G. Brown	                       http://www.phy.duke.edu/~rgb/
Duke University Dept. of Physics, Box 90305
Durham, N.C. 27708-0305
Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu


From rgb at phy.duke.edu  Tue Jan  2 15:44:50 2007
From: rgb at phy.duke.edu (Robert G. Brown)
Date: Tue, 2 Jan 2007 18:44:50 -0500 (EST)
Subject: [Beowulf] picking out a job scheduler
In-Reply-To: <DD7E1CEA-0993-430F-954A-9D936293EC64@sonsorol.org>
References: <58106678-33A8-40D7-BA1E-4DF128F1A7FC@gmail.com>
	<DD7E1CEA-0993-430F-954A-9D936293EC64@sonsorol.org>
Message-ID: <Pine.LNX.4.64.0701021834051.9199@lilith.rgb.private.net>

On Tue, 2 Jan 2007, Chris Dagdigian wrote:

>> (3) Its likely that in the future I'll have part-time access to another 
>> cluster of dual-boot (XP/linux) machines.  The machines will default to 
>> booting to Linux, but will occasionally (5-20 hours a week) be used as 
>> windows workstations by a console user (when a user is finished, they'll 
>> restart the machine and it will boot back to linux).  If cluster nodes are 
>> available in this sort of unpredictable and intermittent way, can they be 
>> used as compute nodes in some fashion? Wil gridengine/PBS /??? take care of 
>> this sort of process migration?
>> 
>
> Grid Engine will not transparently preserve and migrate running jobs off of 
> machines that get bounced suddenly.  This sort of transparent and automatic 
> checkpointing and migration is actually pretty hard to do in practice.  If 
> you know in advance which machines are going to be shut down and rebooted 
> into windows then there are tools in all the common scheduling packages for 
> "draining" a particular machine or queue.  You can also "kill and reschedule"

For what it is worth, the current generation of Condor can, for some
code and linked with its own migration library, permit transparent
checkpointing and code migration, and it also has a very complex
"policy" engine that lets one specify in great deal how to turn jobs on
and off as user/owners use the systems in the pool.  It has recently
become "true open source" although the download website is still a PITA
to navigate and requires a kind of "registration" and its license is
still not a straight GPL.

This is kind of funny because as I read it, the toolset can now be
wrapped up in source RPMs and distributed as a standard component of
e.g. FC in extras or elsewise without violating any aspect of its
license agreement.  Doing this (for Duke, but if it is in one of Duke's
public repos it is pretty public) is on my list of things to do this
week or next.

One of the bitches that I and many others have about all of the
alternatives is that they are too damn complicated.  Many sites -- I
won't say most but many -- have very, very simple needs for a
scheduler/queuing system.  Needs that could be met without requiring the
admin to read a 1000 page manual, join a mailing list, work through a
really complicated build, and try to figure out several distinct
security models and policy models.  What is really needed is a fully
open source "scheduler lite" that pretty much sets up a simple queue for
a simple list of machines with a simple cron-like policy statement,
maybe all defined with an XMLish config file that permitted classes of
machines (like a bunch that belong to user A) to share a policy.

Some people on list (Mark Hahn, e.g.) have IIRC even written their own
lightweight schedulers out of sheer pique with this situation.  However,
I don't know if any of them have been developed to where they are
moderately portable and packagable for general use.

   rgb

-- 
Robert G. Brown	                       http://www.phy.duke.edu/~rgb/
Duke University Dept. of Physics, Box 90305
Durham, N.C. 27708-0305
Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu


From dsimas at imageworks.com  Tue Jan  2 16:03:51 2007
From: dsimas at imageworks.com (David Simas)
Date: Tue, 2 Jan 2007 16:03:51 -0800
Subject: [Beowulf] picking out a job scheduler
In-Reply-To: <Pine.LNX.4.64.0701021834051.9199@lilith.rgb.private.net>
References: <58106678-33A8-40D7-BA1E-4DF128F1A7FC@gmail.com>
	<DD7E1CEA-0993-430F-954A-9D936293EC64@sonsorol.org>
	<Pine.LNX.4.64.0701021834051.9199@lilith.rgb.private.net>
Message-ID: <20070103000350.GB19664@kadee.spimageworks.com>

On Tue, Jan 02, 2007 at 06:44:50PM -0500, Robert G. Brown wrote:
> 
> One of the bitches that I and many others have about all of the
> alternatives is that they are too damn complicated.  Many sites -- I
> won't say most but many -- have very, very simple needs for a
> scheduler/queuing system.  Needs that could be met without requiring the
> admin to read a 1000 page manual, join a mailing list, work through a
> really complicated build, and try to figure out several distinct
> security models and policy models.  What is really needed is a fully
> open source "scheduler lite" that pretty much sets up a simple queue for
> a simple list of machines with a simple cron-like policy statement,
> maybe all defined with an XMLish config file that permitted classes of
> machines (like a bunch that belong to user A) to share a policy.

Ruby Queue?

	http://raa.ruby-lang.org/project/rq/
	http://www.artima.com/rubycs/articles/rubyqueue.html

DGS

> 


From csamuel at vpac.org  Tue Jan  2 17:23:43 2007
From: csamuel at vpac.org (Chris Samuel)
Date: Wed, 3 Jan 2007 12:23:43 +1100
Subject: [Beowulf] picking out a job scheduler
In-Reply-To: <DD7E1CEA-0993-430F-954A-9D936293EC64@sonsorol.org>
References: <58106678-33A8-40D7-BA1E-4DF128F1A7FC@gmail.com>
	<DD7E1CEA-0993-430F-954A-9D936293EC64@sonsorol.org>
Message-ID: <200701031223.46333.csamuel@vpac.org>

On Wednesday 03 January 2007 08:06, Chris Dagdigian wrote:

> Both should be fine although if you are considering *PBS you should ?
> look at both Torque (a fork of OpenPBS I think)

That's correct, it (and ANU-PBS, another fork) seem to be the defacto queuing 
systems in the state and national HPC centers down here.

Torque is just *so* much better than OpenPBS used to be (not that it was 
particularly hard).

cheers,
Chris
-- 
 Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
 Victorian Partnership for Advanced Computing http://www.vpac.org/
 Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070103/27ed5e14/attachment.sig>

From landman at scalableinformatics.com  Tue Jan  2 20:54:03 2007
From: landman at scalableinformatics.com (Joe Landman)
Date: Tue, 02 Jan 2007 23:54:03 -0500
Subject: [Beowulf] OT: Announcing MPI-HMMER
Message-ID: <459B36EB.1060509@scalableinformatics.com>

Hi folks:

  Short OT break.  http://code.google.com/p/mpihmmer/  an MPI
implementation of HMMer 2.3.2.

  Back to your regularly scheduled cluster.

Joe


-- 
Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics LLC,
email: landman at scalableinformatics.com
web  : http://www.scalableinformatics.com
phone: +1 734 786 8423
fax  : +1 734 786 8452 or +1 866 888 3112
cell : +1 734 612 4615


From nixon at nsc.liu.se  Wed Jan  3 02:54:03 2007
From: nixon at nsc.liu.se (Leif Nixon)
Date: Wed, 03 Jan 2007 11:54:03 +0100
Subject: [Beowulf] Which distro for the cluster?
In-Reply-To: <Pine.LNX.4.64.0612290247590.9199@lilith.rgb.private.net> (Robert
	G. Brown's message of "Fri, 29 Dec 2006 02:48:04 -0500 (EST)")
References: <1d151d3b0612270946m2b09039ct538339e487cad6e8@mail.gmail.com>
	<Pine.LNX.4.64.0612281156120.25888@lilith.rgb.private.net>
	<200612290939.59593.csamuel@vpac.org>
	<20061229005749.GA13471@galactic.demon.co.uk>
	<Pine.LNX.4.64.0612290247590.9199@lilith.rgb.private.net>
Message-ID: <m3hcv8z8hg.fsf@unna.nsc.liu.se>

"Robert G. Brown" <rgb at phy.duke.edu> writes:

> Also, plenty of folks on this list have done just fine running "frozen"
> linux distros "as is" for years on cluster nodes.  If they aren't broke,
> and live behind a firewall so security fixes aren't terribly important,
> why fix them? 

Because your users will get their passwords stolen.

If your cluster is accessible remotely, that firewall doesn't really
help you very much. The attacker can simply login as a legitimate user
and proceed to walk through your wide-open local security holes.

But you know this already.

-- 
Leif Nixon                       -            Systems expert
------------------------------------------------------------
National Supercomputer Centre    -      Linkoping University
------------------------------------------------------------


From reuti at staff.uni-marburg.de  Wed Jan  3 04:01:26 2007
From: reuti at staff.uni-marburg.de (Reuti)
Date: Wed, 3 Jan 2007 13:01:26 +0100
Subject: [Beowulf] picking out a job scheduler
In-Reply-To: <200701031223.46333.csamuel@vpac.org>
References: <58106678-33A8-40D7-BA1E-4DF128F1A7FC@gmail.com>
	<DD7E1CEA-0993-430F-954A-9D936293EC64@sonsorol.org>
	<200701031223.46333.csamuel@vpac.org>
Message-ID: <E33E3B97-CCCA-41E8-BF14-981D598D4AAA@staff.uni-marburg.de>

Hi,

Am 03.01.2007 um 02:23 schrieb Chris Samuel:

> On Wednesday 03 January 2007 08:06, Chris Dagdigian wrote:
>
>> Both should be fine although if you are considering *PBS you should
>> look at both Torque (a fork of OpenPBS I think)

although I'm somehow biased to suggest SGE, I also check from time to  
time the Torque mailing list.

> That's correct, it (and ANU-PBS, another fork) seem to be the  
> defacto queuing
> systems in the state and national HPC centers down here.

Whether any queuing system is a standard might not matter. More  
important for chosing one, may be the technical points. To compare  
SGE and Torque e.g.:

- Do you need support for Tight Integrated Linda (I think this will  
most often mean Gaussian) (and PVM) parallel jobs: use SGE

- Do you have some special nodes inside your cluster, and you need to  
specify your resource requests for a parallel job (i.e. combination  
of different types of machines you need for it) in a fine granulated  
manner: use Torque

It's of course impossible to know already in advance a) the needs of  
all the applications, and b) all the features of the queuingsystems,  
if you just start to look into queuing systems.

And I must admit: some years ago I was also shocked by the many pages  
of the manuals of the queuing systems - we only wanted to submit some  
jobs at that point in time. Nowadays I see many possible  
enhancements, which would make the manuals even thicker.

-- Reuti


From glen.beane at jax.org  Wed Jan  3 05:24:31 2007
From: glen.beane at jax.org (Glen Beane)
Date: Wed, 3 Jan 2007 08:24:31 -0500 (EST)
Subject: [Beowulf] picking out a job scheduler
Message-ID: <24120637.1167830671803.JavaMail.ocsadmin@jcs-mid-prod.jax.org>


If you are doing mostly MPI, I would strongly reccoment TORQUE  (a free, open source, OpenPBS fork with *many* enhancements).  I would not reccoment OpenPBS, as Altair no longer updates it and hasn't for quite some time.  TORQUE has great integration with mpich by using mpiexec from Pete at  (http://www.osc.edu/~pw/mpiexec/index.php).  LAM and OpenMPI have native PBS (and TORQUE) support as well. 


Glen L. Beane
The Jackson Laboratory
Software Engineer II
Phone 207-288-6153


From rgb at phy.duke.edu  Wed Jan  3 06:51:44 2007
From: rgb at phy.duke.edu (Robert G. Brown)
Date: Wed, 3 Jan 2007 09:51:44 -0500 (EST)
Subject: [Beowulf] Which distro for the cluster?
In-Reply-To: <m3hcv8z8hg.fsf@unna.nsc.liu.se>
References: <1d151d3b0612270946m2b09039ct538339e487cad6e8@mail.gmail.com>
	<Pine.LNX.4.64.0612281156120.25888@lilith.rgb.private.net>
	<200612290939.59593.csamuel@vpac.org>
	<20061229005749.GA13471@galactic.demon.co.uk>
	<Pine.LNX.4.64.0612290247590.9199@lilith.rgb.private.net>
	<m3hcv8z8hg.fsf@unna.nsc.liu.se>
Message-ID: <Pine.LNX.4.64.0701030828480.9199@lilith.rgb.private.net>

On Wed, 3 Jan 2007, Leif Nixon wrote:

> "Robert G. Brown" <rgb at phy.duke.edu> writes:
>
>> Also, plenty of folks on this list have done just fine running "frozen"
>> linux distros "as is" for years on cluster nodes.  If they aren't broke,
>> and live behind a firewall so security fixes aren't terribly important,
>> why fix them?
>
> Because your users will get their passwords stolen.
>
> If your cluster is accessible remotely, that firewall doesn't really
> help you very much. The attacker can simply login as a legitimate user
> and proceed to walk through your wide-open local security holes.

So:

   a) Our cluster wasn't remotely accessible.  In fact, it was on a
192.168 network and in order to even touch it, one had to login to an up
to date, carefully defended desktop workstation login server in the
department.

   b) If an attacker has compromised a user account on one of these
workstations, IMO the security battle is already largely lost.  They
have a choice of things to attack or further evil they can try to wreak.
Attacking the cluster is one of them, and as discussed if the cluster is
doing real parallel code it is likely to be quite vulnerable regardless
of whether or not its software is up to date because network security is
more or less orthogonal to fine-grained code network performance.

Still, a cluster is paradoxically one of the best monitored parts of a
network.  Although it would make a gangbusters DoS platform, network
traffic on the cluster, cpu consumption on the cluster, user access to
the cluster are all relatively carefully monitored.  The cluster
installation is likely to be different enough and "odd" enough to make
standard rootkit encapsulations fail for anyone but the legendary
Ubercracker (who can always do whatever they want anyway, right?;-) In
an organization that tightly monitors everything all the time on general
security principles (first line of defense, really, as one can NEVER be
sure all exploitable holes are closed even with a yum-updated, stable,
currently supported distro and human eyes are better at picking up
anomalies in system operation than any automated tool) I think it is
pretty likely that any attempt to take over a cluster and use it for
diabolical ends would be almost instantly detected.

BTW, the cluster's servers were not (and I would not advise that servers
ever be) running the old distro -- we use a castle keep security model
where servers have extremely limited access, are the most tightly
monitored, and are kept aggressively up to date on a fully supported
distro like Centos.  The idea is to give humans more time to detect
intruders that have successfully compromised an account at the
workstation LAN level and squash them like the nasty little dung beetles
that they are.

FWIW, our department is entirely linux at the server level, and almost
entirely linux at the workstation level.  A very few experimental groups
and individuals run either Windows boxes (usually to be able to use some
particular software package) or Macs (because they are, umm, "that kind
of user":-).  I'm guessing that the ratio is something like 4:1 linux to
Win at the workstation level (Macs down there in the noise) and maybe
10:1 linux to win if you include cluster nodes, whatever OS they might
be running.

Since Seth introduced yup on top of RH (maybe 7-8 years ago?  How time
flies...), and then proceeded to write yum to replace yup for RPM
distros in general, we haven't had a single successful promotion to root
in the department.  Nothing done locally can prevent some grad student's
password from being trapped as they login from some compromised
win-based system in their hometown over fall break, but the very few of
these that have occurred have been quickly detected and quickly squashed
without further compromise.

In that same interval, we had a WinXX system compromised and turned into
a pile of festering warez rot something like twice a year.  Pretty
amazing given that they are kept up to date as best as possible and they
make up only 10-20% of our total system count.

> But you know this already.

Oh yeah;-)

And we didn't do this "willingly" and aren't that likely to repeat it
ourselves.  We had some pretty specific reasons to freeze the node
distro -- the cluster nodes in question were the damnable Tyan dual
Athlon systems that were an incredible PITA to stabilize in the first
place (they had multiple firmware bugs and load-based stability issues
under the best of circumstances).  Once we FINALLY got them set up with
a functional kernel and library set so that they wouldn't crash, we were
extremely loathe to mess with it.  So we basically froze it and locked
down the nodes so they weren't easily accessible except from inside the
department, and then monitored them with xmlsysd and wulfstat in
addition to the usual syslog-ng and friends admin tools.

Odd usage patterns (that is, almost any sort of running binary that
wasn't a well-known numerical task associated with one of the groups,
logins by anyone who wasn't a known user) would have been noticed by any
of a half-dozen people, one of whom was me, almost immediately.  The
kernel was "barely stable" as it was and couldn't easily have been
replaced with a hacker kernel (to e.g. erase /proc trace) without a VERY
high probability that the hacker kernel would crash the system and
reveal the hacker on the first try. xmlsysd reads all sorts of stuff
from all over /proc and was custom code that I was working on and
periodically updating, even while Seth was working on yum and updating
THAT.  Somebody would have had to literally custom craft some very
advanced C code to stay hidden on the cluster and even then would have
been revealed by e.g. an update of xmlsysd unless they were a bit beyond
even Ubercracker status.

In general, though, it is very good advice to stay with an updated OS.
My real point was that WITH yum and a bit of prototyping once every
12-24 months, it is really pretty easy to ride the FC wave on MANY
clusters, where the tradeoff is better support for new hardware and more
advanced/newer libraries against any library issues that one may or may
not encounter depending on just what the cluster is doing.  Freezing FC
(or anything else) long past its support boundary is obviously less
desireable.  However, it is also often unnecessary.

On clusters that add new hardware, usually bleeding edge, every four to
six months as research groups hit grant year boundaries and buy their
next bolus of nodes, FC really does make sense as Centos probably won't
"work" on those nodes in some important way and you'll be stuck
backporting kernels or worse on top of your key libraries e.g. the GSL.
Just upgrade FC regularly across the cluster, probably on an "every
other release" schedule like the one we use.

On clusters (or sub-clusters) with a 3 year replacement cycle, Centos or
other stable equivalent is a no-brainer -- as long as it installs on
your nodes in the first place (recall my previous comment about the
"stars needing to be right" to install RHEL/Centos -- the latest release
has to support the hardware you're buying) you're good to go
indefinitely, with the warm fuzzy knowledge that your nodes will update
from a "supported" repo most of their 3+ year lifetime, although for the
bulk of that time the distro will de-facto be frozen except for whatever
YOU choose to backport and maintain.

And really, there isn't much stopping folks from adopting a range of
"mixed" strategies -- running FC-whatever on new nodes for a year or
whatever as needed in order to support their hardware or use new
libraries, then reinstalling them with Centos/RHEL (which is basically
FC-even-current-at-release-time frozen and supported or so it seems
recently anyway) as Centos support catches up with the hardware by
syncing with an FC-current on a new release.

Nowadays, with PXE/Kickstart/Yum (or Debian equivalents, or the OS of
your choice with warewulf, or...) reinstalling OR upgrading a cluster
node is such a non-event in terms of sysadmin time and effort that it
can pretty much be done at will.  Except for pathological cases (like
the Tyans) we're talking at most a few days of sysadmin time to set up a
prototyping node or four, flash over to the new distro via a discrete
node reboot (unattended automated reinstall or a new node diskless
image), and let selected users whack on it for a week or two.  If it
proves invisibly stable and satisfactory -- the rule rather than the
exception -- crank it on up across the cluster.  Even if it "fails" on
some untested pathway after you do this, it costs you at most a reboot
(again to a reinstall/replacement of a node image) to put things back as
they were while you fix things.

The worst thing that such a strategy might require is a rebuild of user
applications for both distros, but with shared libraries to my own
admittedly anecdotal experience this "usually" isn't needed going from
older to newer (that is, an older Centos built binary will "probably"
still work on a current FC node, although this obviously depends on the
precise libraries it uses and how rapidly they are changing).  It's a
bit harder to take binaries from newer to older, especially in packaged
form.  There you almost certainly need an rpmbuild --rebuild and a bit
of luck.

Truthfully, cluster installation and administration has never been
simpler.

    rgb

-- 
Robert G. Brown	                       http://www.phy.duke.edu/~rgb/
Duke University Dept. of Physics, Box 90305
Durham, N.C. 27708-0305
Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu


From rgb at phy.duke.edu  Wed Jan  3 06:59:47 2007
From: rgb at phy.duke.edu (Robert G. Brown)
Date: Wed, 3 Jan 2007 09:59:47 -0500 (EST)
Subject: [Beowulf] picking out a job scheduler
In-Reply-To: <24120637.1167830671803.JavaMail.ocsadmin@jcs-mid-prod.jax.org>
References: <24120637.1167830671803.JavaMail.ocsadmin@jcs-mid-prod.jax.org>
Message-ID: <Pine.LNX.4.64.0701030954170.9199@lilith.rgb.private.net>

On Wed, 3 Jan 2007, Glen Beane wrote:

> If you are doing mostly MPI, I would strongly reccoment TORQUE (a
> free, open source, OpenPBS fork with *many* enhancements).  I would not
> reccoment OpenPBS, as Altair no longer updates it and hasn't for quite
> some time.  TORQUE has great integration with mpich by using mpiexec
> from Pete at (http://www.osc.edu/~pw/mpiexec/index.php).  LAM and
> OpenMPI have native PBS (and TORQUE) support as well.

FWIW (and to me it is worth a lot:-) torque appears to be in FC 6
extras, ready to install and run.  This may or may not mean that FC is
being used as (one of) its primary development/maintenance platform(s)
-- this is often the case.  I'll have to give it a try along with condor
and yes, ruby queue.

We're more interested in using it for simple EP job distribution,
though.  Not so many people here do MPI or PVM computations, it is more
parallel simulations or parametric explorations.

   rgb

>
> Glen L. Beane
> The Jackson Laboratory
> Software Engineer II
> Phone 207-288-6153
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf
>

-- 
Robert G. Brown	                       http://www.phy.duke.edu/~rgb/
Duke University Dept. of Physics, Box 90305
Durham, N.C. 27708-0305
Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu


From hahn at physics.mcmaster.ca  Wed Jan  3 07:53:35 2007
From: hahn at physics.mcmaster.ca (Mark Hahn)
Date: Wed, 3 Jan 2007 10:53:35 -0500 (EST)
Subject: [Beowulf] running MPICH on AMD Opteron Dual Core Processor
	Cluster( 72 Cpu's)
In-Reply-To: <9fe360270612290226yb9e3ccbua77a1febf4123fc6@mail.gmail.com>
References: <9fe360270612290226yb9e3ccbua77a1febf4123fc6@mail.gmail.com>
Message-ID: <Pine.LNX.4.64.0701031049390.12893@coffee.psychology.mcmaster.ca>

> "  p1_8544: p4_error: Timeout in Establishing connection to remote process:
> 0  "
> rm_l_1_8667: (359.417969) net_send: could not write to fd=5, errno=104
>
> We have been trying the same for the past two days and we didnt get any
> solution for the above.

but what have you tried?  I would guess that this is a simple rsh config
problem, nothing to do with mpich.

> Also we downloaded the Latest MPICH 1.2.7p1 and configured the same. now for

but why do you think the problem lies with mpich?

> The same testing with LAM/MPI and OPENMPI are working fine.

lam being mostly just a previous version of lam, and I think inheriting
lam's agent-based process-starting, no?

personally, I'm pretty convinced that MPI implementations should stay
out of the jobstarter business, and go with straight agentless (ssh-based)
job spawning.


From hahn at physics.mcmaster.ca  Wed Jan  3 08:05:00 2007
From: hahn at physics.mcmaster.ca (Mark Hahn)
Date: Wed, 3 Jan 2007 11:05:00 -0500 (EST)
Subject: [Beowulf] SW Giaga, what kind?
In-Reply-To: <1bef2ce30612300153o4d1ae055n1374b976e846d258@mail.gmail.com>
References: <1bef2ce30612272322p4a0d1807m3c6d9ea615f58873@mail.gmail.com>
	<4594E95E.6060102@streamline-computing.com>
	<4594F4B3.4070602@gmail.com>
	<4594FB6D.6070402@streamline-computing.com>
	<45957713.2080001@gmail.com>
	<1bef2ce30612300153o4d1ae055n1374b976e846d258@mail.gmail.com>
Message-ID: <Pine.LNX.4.64.0701031054010.12893@coffee.psychology.mcmaster.ca>

> But, originally, my question was about the quality and reliability of the
> brand of *LevelOne* SW (Unmanaged, Gigabit ports), in comparison to its

I've never heard "level one" used in this context.  the closest would be 
"layer 2", which refers to mac-based switching, and might be what you mean.

> fairly low price, on one hand, and the brand of *3COM* SW (Unmanaged,
> Gigabit ports) on the other hand.

3com is not an exceptional switch vendor, except in the historic sense.
they make OK stuff, but I don't think I'd give them special credit against
any of the well-known brands (dlink, netgear for commodity, hp-procurve,
cisco and some others for higher-end, enterpriseish stuff.)

> The number of nodes in our initial plan is 6 nodes, AMD DualCore, desktop
> type systems.

a dime-store, no-name 8-port gigabit switch would serve just fine.
at gigabit latencies (with a normal stack, etc: ~50 us), internal details
of the switch are basically irrelevant.  small switches like this are 
probably single-chip, giving appliance-like in reliability, insensitivity
to the name on the case, and probably line speed.


From malallen at indiana.edu  Wed Jan  3 08:52:40 2007
From: malallen at indiana.edu (Matt Allen)
Date: Wed, 03 Jan 2007 11:52:40 -0500
Subject: [Beowulf] running MPICH on AMD Opteron Dual Core
	Processor	Cluster( 72 Cpu's)
In-Reply-To: <Pine.LNX.4.64.0701031049390.12893@coffee.psychology.mcmaster.ca>
References: <9fe360270612290226yb9e3ccbua77a1febf4123fc6@mail.gmail.com>
	<Pine.LNX.4.64.0701031049390.12893@coffee.psychology.mcmaster.ca>
Message-ID: <459BDF58.9010205@indiana.edu>

Mark Hahn wrote:
> personally, I'm pretty convinced that MPI implementations should stay
> out of the jobstarter business, and go with straight agentless (ssh-based)
> job spawning.

I'm curious about your reasoning, Mark.  We've had nightmare situations
for years with ssh-based job spawning.  The most common case is where
sshd processes terminate on nodes without the child mpi processes
exiting.  Then we have orphaned mpi processes, owned by init, scattered
throughout the cluster.  If any of these processes are using limited
resources (like Myrinet adapters), subsequent jobs can (more likely,
will) exit immediately upon dispatch to the node.

We've found ways around this with prolog/epilog scripts, and scheduling
policy, but the slickest solutions so far, in my opinion, have been
mpiexec (admittedly not part of an MPI implementation) and lam/openmpi.
 Allowing the resource manager to completely handle job spawning has
provided better post-job cleanup, and more complete job statistics
(cpu-time, mostly) for us.

Do you not have to deal with these sorts of issues?  If not, lay some
wisdom on me; I could use it.

Matt

-- 
Matt Allen            |  Systems Analyst
malallen at indiana.edu  |  Research and Technical Services
812-855-7318          |  Indiana University


From mathog at caltech.edu  Wed Jan  3 09:27:04 2007
From: mathog at caltech.edu (David Mathog)
Date: Wed, 03 Jan 2007 09:27:04 -0800
Subject: [Beowulf] RE: OT: Announcing MPI-HMMER
Message-ID: <E1H29tI-0006PF-DL@mendel.bio.caltech.edu>


>   Short OT break.  http://code.google.com/p/mpihmmer/  an MPI
> implementation of HMMer 2.3.2.


There's also my PVM version of 2.3.2 from 2003/2004, with a few
minor fixes since then rolled up into the lastest distribution:

  ftp://saf.bio.caltech.edu/pub/software/molbio/parallelhmmer.tar.gz

Joe and I aparently exist in parallel software universes ;-).

Regards,

David Mathog
mathog at caltech.edu
Manager, Sequence Analysis Facility, Biology Division, Caltech


From ed at eh3.com  Wed Jan  3 09:30:13 2007
From: ed at eh3.com (Ed Hill)
Date: Wed, 3 Jan 2007 12:30:13 -0500
Subject: [Beowulf] picking out a job scheduler
In-Reply-To: <Pine.LNX.4.64.0701030954170.9199@lilith.rgb.private.net>
References: <24120637.1167830671803.JavaMail.ocsadmin@jcs-mid-prod.jax.org>
	<Pine.LNX.4.64.0701030954170.9199@lilith.rgb.private.net>
Message-ID: <20070103123013.50def508@ernie>

On Wed, 3 Jan 2007 09:59:47 -0500 (EST) "Robert G. Brown"
<rgb at phy.duke.edu> wrote:

> On Wed, 3 Jan 2007, Glen Beane wrote:
> 
> > If you are doing mostly MPI, I would strongly reccoment TORQUE (a
> > free, open source, OpenPBS fork with *many* enhancements).  I would
> > not reccoment OpenPBS, as Altair no longer updates it and hasn't
> > for quite some time.  TORQUE has great integration with mpich by
> > using mpiexec from Pete at
> > (http://www.osc.edu/~pw/mpiexec/index.php).  LAM and OpenMPI have
> > native PBS (and TORQUE) support as well.
> 
> FWIW (and to me it is worth a lot:-) torque appears to be in FC 6
> extras, ready to install and run.  This may or may not mean that FC is
> being used as (one of) its primary development/maintenance platform(s)
> -- this is often the case.  I'll have to give it a try along with
> condor and yes, ruby queue.


TORQUE packages have been available in Fedora Extras since April 2006.
Since then, versions have been built and pushed for Fedora Extras 3, 4,
5, and 6.  And if you run into any problems with the Fedora torque
packages then please create a Fedora bugzilla entry!

Ed

-- 
Edward H. Hill III, PhD  |  ed at eh3.com  |  http://eh3.com/
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070103/ac8d28c2/attachment.sig>

From landman at scalableinformatics.com  Wed Jan  3 10:29:24 2007
From: landman at scalableinformatics.com (Joe Landman)
Date: Wed, 03 Jan 2007 13:29:24 -0500
Subject: [Beowulf] RE: OT: Announcing MPI-HMMER
In-Reply-To: <E1H29tI-0006PF-DL@mendel.bio.caltech.edu>
References: <E1H29tI-0006PF-DL@mendel.bio.caltech.edu>
Message-ID: <459BF604.1090006@scalableinformatics.com>

David Mathog wrote:
>>   Short OT break.  http://code.google.com/p/mpihmmer/  an MPI
>> implementation of HMMer 2.3.2.
> 
> 
> There's also my PVM version of 2.3.2 from 2003/2004, with a few
> minor fixes since then rolled up into the lastest distribution:
> 
>   ftp://saf.bio.caltech.edu/pub/software/molbio/parallelhmmer.tar.gz
> 
> Joe and I aparently exist in parallel software universes ;-).

Heh... :)

Will pull it down and look.  My fault David, I was not aware of this, or 
it never mentally clicked.

-- 

Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics LLC,
email: landman at scalableinformatics.com
web  : http://www.scalableinformatics.com
phone: +1 734 786 8423
fax  : +1 734 786 8452 or +1 866 888 3112
cell : +1 734 612 4615


From ntmoore at gmail.com  Tue Jan  2 20:55:10 2007
From: ntmoore at gmail.com (Nathan Moore)
Date: Tue, 2 Jan 2007 22:55:10 -0600
Subject: [Beowulf] picking out a job scheduler
In-Reply-To: <200701031223.46333.csamuel@vpac.org>
References: <58106678-33A8-40D7-BA1E-4DF128F1A7FC@gmail.com>
	<DD7E1CEA-0993-430F-954A-9D936293EC64@sonsorol.org>
	<200701031223.46333.csamuel@vpac.org>
Message-ID: <9988D8B1-94CF-4C93-AD1D-54274993C00C@gmail.com>

Torque was really easy to install, but it seems like my /etc/hosts  
file must be screwed up, as I can't get the cluster nodes to  
respond.  Specifically, within a cluster of 3 machines, each having  
an /etc/hosts file of:

	127.0.0.1       localhost.localdomain   localhost
	199.17.152.17   runner
	199.17.152.135  muscovey
	199.17.152.13   pekin
	(( other workstations follow ))

Now, when I have the pbs_server running on runner, and the pbs_mom  
daemons running on muscovey, pekin, and runner, I et the following  
status message,

	[root at runner torque-2.1.6]# pbsnodes -a
	pekin
	     state = down
	     np = 1
	     ntype = cluster

	muscovey
	     state = down
	     np = 1
	     ntype = cluster

	runner
	     state = down	
	     np = 1
	     ntype = cluster

I realize this is a pretty low-level question, but what the heck is  
wrong with my /etc/hosts file?

regards,

NT


ps,  the trouble shooting message given by torque is,

	[root at runner torque-2.1.6]# momctl -d 3

	Host: runner/runner   Version: 2.1.6
	WARNING:  server not specified (set $pbsserver)
	PID:                    30531
	HomeDirectory:          /var/spool/torque/mom_priv
	MOM active:             2518 seconds
	Server Update Interval: 45 seconds
	LOGLEVEL:               0 (use SIGUSR1/SIGUSR2 to adjust)
	Communication Model:    RPP
	TCP Timeout:            20 seconds
	NOTE:  no prolog configured
	Alarm Time:             0 of 10 seconds
	Trusted Client List:    199.17.152.17,127.0.0.1
	Configured to use /usr/bin/scp -rpB
	NOTE:  no local jobs detected

	diagnostics complete


- - - - - - - - - - - - - - - - - - - - - - -

Nathan Moore
Physics
Winona State University
nmoore at winona.edu
AIM:nmoorewsu

- - - - - - - - - - - - - - - - - - - - - - -


On Jan 2, 2007, at 7:23 PM, Chris Samuel wrote:

On Wednesday 03 January 2007 08:06, Chris Dagdigian wrote:

> Both should be fine although if you are considering *PBS you should
> look at both Torque (a fork of OpenPBS I think)

That's correct, it (and ANU-PBS, another fork) seem to be the defacto  
queuing
systems in the state and national HPC centers down here.

Torque is just *so* much better than OpenPBS used to be (not that it was
particularly hard).

cheers,
Chris
-- 
  Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
  Victorian Partnership for Advanced Computing http://www.vpac.org/
  Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia

_______________________________________________
Beowulf mailing list, Beowulf at beowulf.org
To change your subscription (digest mode or unsubscribe) visit http:// 
www.beowulf.org/mailman/listinfo/beowulf


From m.janssens at opencfd.co.uk  Wed Jan  3 04:08:31 2007
From: m.janssens at opencfd.co.uk (Mattijs Janssens)
Date: Wed, 3 Jan 2007 12:08:31 +0000
Subject: [Beowulf] cluster trips power switch
In-Reply-To: <9fe360270612290226yb9e3ccbua77a1febf4123fc6@mail.gmail.com>
References: <9fe360270612290226yb9e3ccbua77a1febf4123fc6@mail.gmail.com>
Message-ID: <200701031208.31933.m.janssens@opencfd.co.uk>

When I switch off our small (16 node) cluster it trips the power switch. Guess 
there is a temporary power surge. Are there any devices (line conditioners?) 
that will prevent this? Experiences?

Mattijs

-- 

Mattijs Janssens


From wharman at prism.net  Wed Jan  3 09:59:11 2007
From: wharman at prism.net (wharman at prism.net)
Date: Wed, 3 Jan 2007 12:59:11 -0500 (EST)
Subject: [Beowulf] picking out a job scheduler
In-Reply-To: <20070103123013.50def508@ernie>
References: <24120637.1167830671803.JavaMail.ocsadmin@jcs-mid-prod.jax.org>
	<Pine.LNX.4.64.0701030954170.9199@lilith.rgb.private.net>
	<20070103123013.50def508@ernie>
Message-ID: <52586.10.238.10.70.1167847151.webmail@10.238.10.70>


or download TORQUE: http://www.clusterresources.com/pages/products.php

-Bill


-----Original Message-----
From: "Ed Hill" <ed at eh3.com>
Sent: Wednesday, January 3, 2007 12:30 pm
To: "Robert G. Brown" <rgb at phy.duke.edu>
Cc: "beowulf at beowulf.org" <beowulf at beowulf.org>
Subject: Re: [Beowulf] picking out a job scheduler

On Wed, 3 Jan 2007 09:59:47 -0500 (EST) "Robert G. Brown"
<rgb at phy.duke.edu> wrote:

> On Wed, 3 Jan 2007, Glen Beane wrote:
>
> > If you are doing mostly MPI, I would strongly reccoment TORQUE (a
> > free, open source, OpenPBS fork with *many* enhancements).  I would
> > not reccoment OpenPBS, as Altair no longer updates it and hasn't
> > for quite some time.  TORQUE has great integration with mpich by
> > using mpiexec from Pete at
> > (http://www.osc.edu/~pw/mpiexec/index.php).  LAM and OpenMPI have
> > native PBS (and TORQUE) support as well.
>
> FWIW (and to me it is worth a lot:-) torque appears to be in FC 6
> extras, ready to install and run.  This may or may not mean that FC is
> being used as (one of) its primary development/maintenance platform(s)
> -- this is often the case.  I'll have to give it a try along with
> condor and yes, ruby queue.


TORQUE packages have been available in Fedora Extras since April 2006.
Since then, versions have been built and pushed for Fedora Extras 3, 4,
5, and 6.  And if you run into any problems with the Fedora torque
packages then please create a Fedora bugzilla entry!

Ed

-- 
Edward H. Hill III, PhD  |  ed at eh3.com  |  http://eh3.com/
_______________________________________________
Beowulf mailing list, Beowulf at beowulf.org
To change your subscription (digest mode or unsubscribe) visit
http://www.beowulf.org/mailman/listinfo/beowulf


From csamuel at vpac.org  Wed Jan  3 14:39:36 2007
From: csamuel at vpac.org (Chris Samuel)
Date: Thu, 4 Jan 2007 09:39:36 +1100
Subject: [Beowulf] picking out a job scheduler
In-Reply-To: <9988D8B1-94CF-4C93-AD1D-54274993C00C@gmail.com>
References: <58106678-33A8-40D7-BA1E-4DF128F1A7FC@gmail.com>
	<200701031223.46333.csamuel@vpac.org>
	<9988D8B1-94CF-4C93-AD1D-54274993C00C@gmail.com>
Message-ID: <200701040939.36395.csamuel@vpac.org>

On Wednesday 03 January 2007 15:55, Nathan Moore wrote:

> ????????WARNING: ?server not specified (set $pbsserver)

This has already been answered on the Torque list, but for the folks on the 
Beowulf list this was the issue.

cheers!
Chris
-- 
 Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
 Victorian Partnership for Advanced Computing http://www.vpac.org/
 Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070104/c63e7880/attachment.sig>

From csamuel at vpac.org  Wed Jan  3 14:50:06 2007
From: csamuel at vpac.org (Chris Samuel)
Date: Thu, 4 Jan 2007 09:50:06 +1100
Subject: [Beowulf] picking out a job scheduler
In-Reply-To: <E33E3B97-CCCA-41E8-BF14-981D598D4AAA@staff.uni-marburg.de>
References: <58106678-33A8-40D7-BA1E-4DF128F1A7FC@gmail.com>
	<200701031223.46333.csamuel@vpac.org>
	<E33E3B97-CCCA-41E8-BF14-981D598D4AAA@staff.uni-marburg.de>
Message-ID: <200701040950.06887.csamuel@vpac.org>

On Wednesday 03 January 2007 23:01, Reuti wrote:

> - Do you need support for Tight Integrated Linda (I think this will ?
> most often mean Gaussian) (and PVM) parallel jobs: use SGE

Interesting, why so ?   I know a number of sites around Australia (including a 
1900+ CPU cluster) run Gaussian using PBS (I don't know how much pain, if 
any, they went through for that but my understanding is that anything else 
that involves Gaussian involves pain and many dead chickens).

My memory is that the one time I've had to set up a PVM dependant application 
(TGICL) it wasn't particularly hard to get going.  Mind you PVM seems pretty 
dead, TGICL was the only application we've ever had requested that needed it 
and that was a couple of years ago and was only for a couple of weeks work.

cheers!
Chris
-- 
 Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
 Victorian Partnership for Advanced Computing http://www.vpac.org/
 Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070104/8344377a/attachment.sig>

From csamuel at vpac.org  Wed Jan  3 14:59:58 2007
From: csamuel at vpac.org (Chris Samuel)
Date: Thu, 4 Jan 2007 09:59:58 +1100
Subject: [Beowulf] running MPICH on AMD Opteron Dual Core Processor
	Cluster( 72 Cpu's)
In-Reply-To: <Pine.LNX.4.64.0701031049390.12893@coffee.psychology.mcmaster.ca>
References: <9fe360270612290226yb9e3ccbua77a1febf4123fc6@mail.gmail.com>
	<Pine.LNX.4.64.0701031049390.12893@coffee.psychology.mcmaster.ca>
Message-ID: <200701040959.58626.csamuel@vpac.org>

On Thursday 04 January 2007 02:53, Mark Hahn wrote:

> personally, I'm pretty convinced that MPI implementations should stay
> out of the jobstarter business, and go with straight agentless (ssh-based)
> job spawning.

Noooooo...  please not ssh again, make the pain go away!

Seriously though, this is what the PBS TM interface is for (not used SGE, so I 
don't know if it has a similar interface, I'd be surprised if it didn't)..

The TM interface is important as it means Torque can keep a close beady eye on 
the MPI processes spawned and kill off the processes when needed (which all 
too often get left behind otherwise and need hacks like epilogue scripts to 
fix).

It also stops users changing their previous 32 CPU job script to ask for 4 
CPUs in the queue and then forgetting to change the -np parameter for mpirun 
as well.   Nodes don't like that sort of load.

cheers!
Chris
-- 
 Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
 Victorian Partnership for Advanced Computing http://www.vpac.org/
 Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070104/6866f1bf/attachment.sig>

From csamuel at vpac.org  Wed Jan  3 15:23:10 2007
From: csamuel at vpac.org (Chris Samuel)
Date: Thu, 4 Jan 2007 10:23:10 +1100
Subject: [Beowulf] RE: OT: Announcing MPI-HMMER
In-Reply-To: <E1H29tI-0006PF-DL@mendel.bio.caltech.edu>
References: <E1H29tI-0006PF-DL@mendel.bio.caltech.edu>
Message-ID: <200701041023.10843.csamuel@vpac.org>

On Thursday 04 January 2007 04:27, David Mathog wrote:

> Joe and I aparently exist in parallel software universes ;-).

Being MPI means it can take advantage of high speed interconnects (e.g. 
building it with MPICH-GM to use native Myrinet).  Of course whether that 
would benefit HMMER is something I don't know!

cheers,
Chris
-- 
 Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
 Victorian Partnership for Advanced Computing http://www.vpac.org/
 Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070104/593a8923/attachment.sig>

From landman at scalableinformatics.com  Wed Jan  3 17:48:20 2007
From: landman at scalableinformatics.com (Joe Landman)
Date: Wed, 03 Jan 2007 20:48:20 -0500
Subject: [Beowulf] RE: OT: Announcing MPI-HMMER
In-Reply-To: <200701041023.10843.csamuel@vpac.org>
References: <E1H29tI-0006PF-DL@mendel.bio.caltech.edu>
	<200701041023.10843.csamuel@vpac.org>
Message-ID: <459C5CE4.1040005@scalableinformatics.com>


Chris Samuel wrote:
> On Thursday 04 January 2007 04:27, David Mathog wrote:
> 
>> Joe and I aparently exist in parallel software universes ;-).
> 
> Being MPI means it can take advantage of high speed interconnects (e.g. 
> building it with MPICH-GM to use native Myrinet).  Of course whether that 
> would benefit HMMER is something I don't know!

It does.  The code does very nicely with low latency (and low OS 
jitter).  XD1's, while discontinued, are sweet boxes.


-- 
Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics LLC,
email: landman at scalableinformatics.com
web  : http://www.scalableinformatics.com
phone: +1 734 786 8423
fax  : +1 734 786 8452
cell : +1 734 612 4615


From reuti at staff.uni-marburg.de  Thu Jan  4 03:16:07 2007
From: reuti at staff.uni-marburg.de (Reuti)
Date: Thu, 4 Jan 2007 12:16:07 +0100
Subject: [Beowulf] picking out a job scheduler
In-Reply-To: <200701040950.06887.csamuel@vpac.org>
References: <58106678-33A8-40D7-BA1E-4DF128F1A7FC@gmail.com>
	<200701031223.46333.csamuel@vpac.org>
	<E33E3B97-CCCA-41E8-BF14-981D598D4AAA@staff.uni-marburg.de>
	<200701040950.06887.csamuel@vpac.org>
Message-ID: <BD5691DE-0939-423B-88DB-15C65931A3D6@staff.uni-marburg.de>

Am 03.01.2007 um 23:50 schrieb Chris Samuel:

> On Wednesday 03 January 2007 23:01, Reuti wrote:
>
>> - Do you need support for Tight Integrated Linda (I think this will
>> most often mean Gaussian) (and PVM) parallel jobs: use SGE
>
> Interesting, why so ?   I know a number of sites around Australia  
> (including a
> 1900+ CPU cluster) run Gaussian using PBS (I don't know how much  
> pain, if
> any, they went through for that but my understanding is that  
> anything else
> that involves Gaussian involves pain and many dead chickens).

Linda and PVM* need some kind of rsh/ssh between the nodes, and I  
didn't get a clue up to now to convince Linda to use the PBS TM of  
Torque. As you mentioned in your other post about keeping control of  
MPI processes, the similar thing to TM is the qrsh command in SGE,  
which will replace rsh/ssh and SGE is controlling this way these  
spawned processes on the nodes. I'm also always looking in a cluster  
setup, without any common rsh/ssh between the nodes at all, where  
users could by accident start processes out of control of the queuing  
system on the nodes.

-- Reuti

* I'm aware, that PVM can be started interactively without any rsh/ 
ssh between the nodes, by supplying some strings to the daemons and  
its response back to the startup process.


>
> My memory is that the one time I've had to set up a PVM dependant  
> application
> (TGICL) it wasn't particularly hard to get going.  Mind you PVM  
> seems pretty
> dead, TGICL was the only application we've ever had requested that  
> needed it
> and that was a couple of years ago and was only for a couple of  
> weeks work.
>
> cheers!
> Chris
> -- 
>  Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
>  Victorian Partnership for Advanced Computing http://www.vpac.org/
>  Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit  
> http://www.beowulf.org/mailman/listinfo/beowulf


From mathog at caltech.edu  Thu Jan  4 08:54:44 2007
From: mathog at caltech.edu (David Mathog)
Date: Thu, 04 Jan 2007 08:54:44 -0800
Subject: [Beowulf] RE: OT: Announcing MPI-HMMER
Message-ID: <E1H2VrY-0007BE-RM@mendel.bio.caltech.edu>


> There's also my PVM version of 2.3.2 from 2003/2004

I ought to clarify that slightly.  Sean Eddy's original code had
a PVM mode.  My variant retained the original but defaults to
a new "database sliced" mode - where each query
runs on every node against a fraction of the database.  (Similar to
what Joe's MPI-BLAST and my PVM parallel BLAST do.)  To support that
hmmfetch was also modified to work over PVM, which wasn't required in
SE's original since the master node always had a complete database,
so there was no need to fetch entries back from the database slices
on the compute nodes.

Now back to your regularly scheduled topics...

Regards,

David Mathog
mathog at caltech.edu
Manager, Sequence Analysis Facility, Biology Division, Caltech


From robl at mcs.anl.gov  Thu Jan  4 09:46:11 2007
From: robl at mcs.anl.gov (Robert Latham)
Date: Thu, 4 Jan 2007 11:46:11 -0600
Subject: [Beowulf] mpiJava + MPICH
In-Reply-To: <45929555.5090108@duke.edu>
References: <45929555.5090108@duke.edu>
Message-ID: <20070104174610.GY24143@mcs.anl.gov>

On Wed, Dec 27, 2006 at 10:46:29AM -0500, Sean Dilda wrote:
> I'm working on setting up mpiJava for a cluster user.   I'm compiling it 
> against Sun's Java 1.5.0 and MPICH 1.2.5, on a cluster running CentOS 4

Is mpiJava dependent on a specific vesion of MPI?  Not only is
MPICH-1.2.5 a quite old version of the MPICH release, the entire
series has been superceeded by MPICH2 (unless you require
heterogenerous support).   does mpijava work with MPICH2-1.0.5 ?

> I've also noticed that when a normal MPI program runs, the process tree 
> shows mpirun, which has a child of your program.  That child of mpirun 
> then has a child that's your program running locally, and a bunch of 
> children that are all the 'rsh' command for launching remote copies. 
> Whenever I run a mpiJava program, the only thing in the process tree is 
> mpirun and a single child of mpirun.
> 
> Has anyone run across this, or have any ideas of what I could do to fix 
> this problem?

The job management in MPICH is a little hairy.  Things have been
cleaned up somewhat in MPICH2.  For example, running MPICH2 progams
under valgrind is a lot easier than doing so with MPICH1-based
programs.  Perhaps the same will hold for mpijava, though I know
nothing about that project.

==rob

-- 
Rob Latham
Mathematics and Computer Science Division    A215 0178 EA2D B059 8CDF
Argonne National Lab, IL USA                 B29D F333 664A 4280 315B


From csamuel at vpac.org  Thu Jan  4 16:09:52 2007
From: csamuel at vpac.org (Chris Samuel)
Date: Fri, 5 Jan 2007 11:09:52 +1100
Subject: [Beowulf] picking out a job scheduler
In-Reply-To: <BD5691DE-0939-423B-88DB-15C65931A3D6@staff.uni-marburg.de>
References: <58106678-33A8-40D7-BA1E-4DF128F1A7FC@gmail.com>
	<200701040950.06887.csamuel@vpac.org>
	<BD5691DE-0939-423B-88DB-15C65931A3D6@staff.uni-marburg.de>
Message-ID: <200701051109.56867.csamuel@vpac.org>

On Thursday 04 January 2007 22:16, Reuti wrote:

> Linda and PVM* need some kind of rsh/ssh between the nodes, and I ?
> didn't get a clue up to now to convince Linda to use the PBS TM of ?
> Torque.

Torque provides a pbsdsh command that uses the TM interface and acts like the 
various DSH variants.  What it doesn't appear to be able to do (which I've 
just discovered) is to be able to only run once per node in the job. Hmm..

> As you mentioned in your other post about keeping control of ? 
> MPI processes, the similar thing to TM is the qrsh command in SGE, ?
> which will replace rsh/ssh and SGE is controlling this way these ?
> spawned processes on the nodes.

Sounds very similar to pbsdsh in the way it works.

> I'm also always looking in a cluster setup, without any common rsh/ssh
> between the nodes at all, where users could by accident start processes out
> of control of the queuing system on the nodes.

Exactly.  What we do here is a hack in the /etc/profile that checks for the 
existence of $PBS_ENVIRONMENT and kicks them off with a message about only 
being permitted to access the node if you have a job on it.  Ugly, but it 
works.

Newer versions of Torque have a PAM module contributed by Jim Prewett which 
will check the user against the current list of Torque jobs on a node and 
only permit access if they have a job on the node.

We prefer to only allow access via a PBS jobs which is why we still use our 
hack, but the PAM module might be a handy backstop for us.

cheers!
Chris
-- 
 Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
 Victorian Partnership for Advanced Computing http://www.vpac.org/
 Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070105/877e54aa/attachment.sig>

From reuti at staff.uni-marburg.de  Fri Jan  5 04:58:28 2007
From: reuti at staff.uni-marburg.de (Reuti)
Date: Fri, 5 Jan 2007 13:58:28 +0100
Subject: [Beowulf] picking out a job scheduler
In-Reply-To: <200701051109.56867.csamuel@vpac.org>
References: <58106678-33A8-40D7-BA1E-4DF128F1A7FC@gmail.com>
	<200701040950.06887.csamuel@vpac.org>
	<BD5691DE-0939-423B-88DB-15C65931A3D6@staff.uni-marburg.de>
	<200701051109.56867.csamuel@vpac.org>
Message-ID: <440DA641-65DA-4B9D-B975-F440B4851367@staff.uni-marburg.de>

Am 05.01.2007 um 01:09 schrieb Chris Samuel:

> On Thursday 04 January 2007 22:16, Reuti wrote:
>
>> Linda and PVM* need some kind of rsh/ssh between the nodes, and I
>> didn't get a clue up to now to convince Linda to use the PBS TM of
>> Torque.
>
> Torque provides a pbsdsh command that uses the TM interface and  
> acts like the
> various DSH variants.  What it doesn't appear to be able to do  
> (which I've
> just discovered) is to be able to only run once per node in the  
> job. Hmm..

You can run it once per node with the -n option. Trying to simulate  
rsh would simply mean to map the hostname of the requested machine to  
an index in the list of granted machines - no big deal. The bigger  
problem seems to be, that there is no real environment on the nodes  
where the slave tasks are started. I.e. no environment variables set.

-- Reuti


>> As you mentioned in your other post about keeping control of
>> MPI processes, the similar thing to TM is the qrsh command in SGE,
>> which will replace rsh/ssh and SGE is controlling this way these
>> spawned processes on the nodes.
>
> Sounds very similar to pbsdsh in the way it works.
>
>> I'm also always looking in a cluster setup, without any common rsh/ 
>> ssh
>> between the nodes at all, where users could by accident start  
>> processes out
>> of control of the queuing system on the nodes.
>
> Exactly.  What we do here is a hack in the /etc/profile that checks  
> for the
> existence of $PBS_ENVIRONMENT and kicks them off with a message  
> about only
> being permitted to access the node if you have a job on it.  Ugly,  
> but it
> works.
>
> Newer versions of Torque have a PAM module contributed by Jim  
> Prewett which
> will check the user against the current list of Torque jobs on a  
> node and
> only permit access if they have a job on the node.
>
> We prefer to only allow access via a PBS jobs which is why we still  
> use our
> hack, but the PAM module might be a handy backstop for us.
>
> cheers!
> Chris
> -- 
>  Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
>  Victorian Partnership for Advanced Computing http://www.vpac.org/
>  Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit  
> http://www.beowulf.org/mailman/listinfo/beowulf


From amacater at galactic.demon.co.uk  Sun Jan  7 03:22:30 2007
From: amacater at galactic.demon.co.uk (Andrew M.A. Cater)
Date: Sun, 7 Jan 2007 11:22:30 +0000
Subject: [Beowulf] Which distro for the cluster?
In-Reply-To: <Pine.LNX.4.64.0701030828480.9199@lilith.rgb.private.net>
References: <1d151d3b0612270946m2b09039ct538339e487cad6e8@mail.gmail.com>
	<Pine.LNX.4.64.0612281156120.25888@lilith.rgb.private.net>
	<200612290939.59593.csamuel@vpac.org>
	<20061229005749.GA13471@galactic.demon.co.uk>
	<Pine.LNX.4.64.0612290247590.9199@lilith.rgb.private.net>
	<m3hcv8z8hg.fsf@unna.nsc.liu.se>
	<Pine.LNX.4.64.0701030828480.9199@lilith.rgb.private.net>
Message-ID: <20070107112230.GA7654@galactic.demon.co.uk>

On Wed, Jan 03, 2007 at 09:51:44AM -0500, Robert G. Brown wrote:
> On Wed, 3 Jan 2007, Leif Nixon wrote:
> 
> >"Robert G. Brown" <rgb at phy.duke.edu> writes:
> >
>   b) If an attacker has compromised a user account on one of these
> workstations, IMO the security battle is already largely lost.  They
> have a choice of things to attack or further evil they can try to wreak.
> Attacking the cluster is one of them, and as discussed if the cluster is
> doing real parallel code it is likely to be quite vulnerable regardless
> of whether or not its software is up to date because network security is
> more or less orthogonal to fine-grained code network performance.
> 

Amen, brother :)

> 
> BTW, the cluster's servers were not (and I would not advise that servers
> ever be) running the old distro -- we use a castle keep security model
> where servers have extremely limited access, are the most tightly
> monitored, and are kept aggressively up to date on a fully supported
> distro like Centos.  The idea is to give humans more time to detect
> intruders that have successfully compromised an account at the
> workstation LAN level and squash them like the nasty little dung beetles
> that they are.
> 

Can I quote you for Security 101 when I need to explain this stuff for 
senior management ?

> 
> And we didn't do this "willingly" and aren't that likely to repeat it
> ourselves.  We had some pretty specific reasons to freeze the node
> distro -- the cluster nodes in question were the damnable Tyan dual
> Athlon systems that were an incredible PITA to stabilize in the first
> place (they had multiple firmware bugs and load-based stability issues
> under the best of circumstances).  Once we FINALLY got them set up with
> a functional kernel and library set so that they wouldn't crash, we were
> extremely loathe to mess with it.  So we basically froze it and locked
> down the nodes so they weren't easily accessible except from inside the
> department, and then monitored them with xmlsysd and wulfstat in
> addition to the usual syslog-ng and friends admin tools.
> 

It is _always_ worth browsing the archives of this list. Somebody, 
somewhere has inevitably already seen it/done it/get the scars and is 
able to explain stuff lucidly. I can't recommend this list highly enough 
both for it's high signal/noise ratio and it's smart people [rgb 1-8 
inclusive, for example]
> 
> In general, though, it is very good advice to stay with an updated OS.
> My real point was that WITH yum and a bit of prototyping once every
> 12-24 months, it is really pretty easy to ride the FC wave on MANY
> clusters, where the tradeoff is better support for new hardware and more
> advanced/newer libraries against any library issues that one may or may
> not encounter depending on just what the cluster is doing.  Freezing FC
> (or anything else) long past its support boundary is obviously less
> desireable.  However, it is also often unnecessary.
> 

Fedora Legacy just closed its doors - if you take a couple of months 
to get your Uebercluster up and running, you're 1/3 of the way through 
your FC cycle :( It doesn't square. Fedora looks set to lose its way 
again for Fedora 7 as they merge Fedora Core and Extras and grow to 
n-000 packages again - the fast upgrade cycle, lack of maintainers and 
lack of structure do not bode well. They're apparently moving to a 13 month 
upgrade cycle - so your Fedora odd releases could well be three years apart. 
The answer is to take a stable distribution, install the minimum and work 
with it OR build your own custom infrastructure as far as I can see. 
Neither Red Hat nor Novell are cluster-aware in any detail - they'll 
support their install and base programs but don't have the depth of 
expertise to go further :(

> On clusters that add new hardware, usually bleeding edge, every four to
> six months as research groups hit grant year boundaries and buy their
> next bolus of nodes, FC really does make sense as Centos probably won't
> "work" on those nodes in some important way and you'll be stuck
> backporting kernels or worse on top of your key libraries e.g. the GSL.
> Just upgrade FC regularly across the cluster, probably on an "every
> other release" schedule like the one we use.
> 

Chances are that anything Red Hat Enterprise based just won't work. New 
hardware is always hard. 

> On clusters (or sub-clusters) with a 3 year replacement cycle, Centos or
> other stable equivalent is a no-brainer -- as long as it installs on
> your nodes in the first place (recall my previous comment about the
> "stars needing to be right" to install RHEL/Centos -- the latest release
> has to support the hardware you're buying) you're good to go
> indefinitely, with the warm fuzzy knowledge that your nodes will update
> from a "supported" repo most of their 3+ year lifetime, although for the
> bulk of that time the distro will de-facto be frozen except for whatever
> YOU choose to backport and maintain.
> 

Absolutely.

> 
> Nowadays, with PXE/Kickstart/Yum (or Debian equivalents, or the OS of
> your choice with warewulf, or...) reinstalling OR upgrading a cluster
> node is such a non-event in terms of sysadmin time and effort that it
> can pretty much be done at will.  

I've had the pleasure/pain of watching cluster admins from a distance
as they worked on a fully commercial cluster from major vendors. For 
most on this list, its a no-brainer. I wish I had seen the same.

> The worst thing that such a strategy might require is a rebuild of user
> applications for both distros, but with shared libraries to my own
> admittedly anecdotal experience this "usually" isn't needed going from
> older to newer (that is, an older Centos built binary will "probably"
> still work on a current FC node, although this obviously depends on the
> precise libraries it uses and how rapidly they are changing).  It's a
> bit harder to take binaries from newer to older, especially in packaged
> form.  There you almost certainly need an rpmbuild --rebuild and a bit
> of luck.
> 

I use Debian - I've never had to learn about more than one repository 
and one distribution for everything I need. What is this "rebuilding" of 
which you speak :)

> Truthfully, cluster installation and administration has never been
> simpler.
> 

I think you underestimate your expertise - and the expertise on this 
list. My mantra is that cluster administration should be simple and 
straightforward: in reality, it's seldom so.

>    rgb

Andy


From ed at eh3.com  Sun Jan  7 05:50:21 2007
From: ed at eh3.com (Ed Hill)
Date: Sun, 7 Jan 2007 08:50:21 -0500
Subject: [Beowulf] Which distro for the cluster?
In-Reply-To: <20070107112230.GA7654@galactic.demon.co.uk>
References: <1d151d3b0612270946m2b09039ct538339e487cad6e8@mail.gmail.com>
	<Pine.LNX.4.64.0612281156120.25888@lilith.rgb.private.net>
	<200612290939.59593.csamuel@vpac.org>
	<20061229005749.GA13471@galactic.demon.co.uk>
	<Pine.LNX.4.64.0612290247590.9199@lilith.rgb.private.net>
	<m3hcv8z8hg.fsf@unna.nsc.liu.se>
	<Pine.LNX.4.64.0701030828480.9199@lilith.rgb.private.net>
	<20070107112230.GA7654@galactic.demon.co.uk>
Message-ID: <20070107085021.6af6efab@ernie>

On Sun, 7 Jan 2007 11:22:30 +0000 amacater at galactic.demon.co.uk (Andrew
M.A. Cater) wrote:
> 
> Fedora Legacy just closed its doors - if you take a couple of months 
> to get your Uebercluster up and running, you're 1/3 of the way
> through your FC cycle :( It doesn't square. 


Speaking of which, I recently built a tiny (<20 nodes) cluster using:

  Sun X2200 (2X Opteron 2210)
  InfiniBand
  Fedora Core 6 for x86_64

and it was remarkably easy to get MPI working with IB using OpenMPI,
libibverbs, and libmthca (the latter two being available in Fedora
Extras and installed with 'yum install ...').

I can certainly appreciate how long it takes to build medium to large
clusters for larger and more diverse types of users.  But I don't see
why there is a pressing need to upgrade the compute nodes as soon as a
particular Fedora release is no longer current.  If your setup is
working then its working -- its no less valid just because your Fedora
version is one (or perhaps even three) behind the latest.

As others have thoughtfully described, cluster security is typically a
gateway or other "choke point" that mostly divorces it from the actual
compute nodes.  So, once you have things working nicely, you should
have vanishingly few reasons to waste time chasing after the latest 'n
greatest distro version.

It least, that's my view...  :-)

Ed


-- 
Edward H. Hill III, PhD  |  ed at eh3.com  |  http://eh3.com/
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070107/d6b42f3d/attachment.sig>

From landman at scalableinformatics.com  Sun Jan  7 07:06:17 2007
From: landman at scalableinformatics.com (Joe Landman)
Date: Sun, 07 Jan 2007 10:06:17 -0500
Subject: [Beowulf] Which distro for the cluster?
In-Reply-To: <20070107112230.GA7654@galactic.demon.co.uk>
References: <1d151d3b0612270946m2b09039ct538339e487cad6e8@mail.gmail.com>	<Pine.LNX.4.64.0612281156120.25888@lilith.rgb.private.net>	<200612290939.59593.csamuel@vpac.org>	<20061229005749.GA13471@galactic.demon.co.uk>	<Pine.LNX.4.64.0612290247590.9199@lilith.rgb.private.net>	<m3hcv8z8hg.fsf@unna.nsc.liu.se>	<Pine.LNX.4.64.0701030828480.9199@lilith.rgb.private.net>
	<20070107112230.GA7654@galactic.demon.co.uk>
Message-ID: <45A10C69.4030908@scalableinformatics.com>


Andrew M.A. Cater wrote:
> On Wed, Jan 03, 2007 at 09:51:44AM -0500, Robert G. Brown wrote:
>> On Wed, 3 Jan 2007, Leif Nixon wrote:
>>
>>> "Robert G. Brown" <rgb at phy.duke.edu> writes:
>>>
>>   b) If an attacker has compromised a user account on one of these
>> workstations, IMO the security battle is already largely lost.  They

s/largely/completely/g

At least for this user, if they have single factor passwordless login
set up between workstation and cluster.

Of course if they are using a malware-ridden, keylogger hosting machine,
they have ... uh ... somewhat worse things to deal with than just their
accounts on the cluster being open to attack.

The solution to this is simple.  Never let this happen.  Which means,
don't use a system which is significantly vulnerable to malware or
keylogger insertion.  It is left as an exercise to the reader to figure
out which platforms are more vulnerable.

>> have a choice of things to attack or further evil they can try to wreak.
>> Attacking the cluster is one of them, and as discussed if the cluster is
>> doing real parallel code it is likely to be quite vulnerable regardless
>> of whether or not its software is up to date because network security is
>> more or less orthogonal to fine-grained code network performance.
>>
> 
> Amen, brother :)
> 
>> BTW, the cluster's servers were not (and I would not advise that servers
>> ever be) running the old distro -- we use a castle keep security model
>> where servers have extremely limited access, are the most tightly
>> monitored, and are kept aggressively up to date on a fully supported
>> distro like Centos.  The idea is to give humans more time to detect
>> intruders that have successfully compromised an account at the
>> workstation LAN level and squash them like the nasty little dung beetles
>> that they are.

Yup.  Even better is never letting the users log in to admin machines.
Provide machines for them to log into, submit and run jobs from.  Just
not the admin nodes.

[...]

>> In general, though, it is very good advice to stay with an updated OS.

... on threat-facing systems, yes, I agree.

For what I call production cycle shops, those places which have to churn
out processing 24x7x365, you want as little "upgrading" as possible, and
it has to be tested/functional with everything.  Ask your favorite CIO
if they would consider upgrading their most critical systems nightly.

It all boils down to a CBA (as everything does).  Upgrading carries
risk, no matter who does it, and how carefully things are packaged.  The
CBA equation should look something like this:

	value_of_upgrade = positive_benefits_of_upgrade -
			   potential_risks_of_upgrade

And if the value_of_upgrade is not strongly positive, you probably
should not do it if you are supplying a service to a user base.  Sure,
you can do it on your own personal cluster.  I appreciate that people on
this list do this for their systems.  Regardless of this, you need to be
of the (somewhat paranoid) mindset when looking at an upgrade, and the
potential for loss of time/data/...

A (not so great) example would be someone packaging up a recent 2.6.19
kernel with that oh-so-nice ext3-vm interaction which gave us
compromised files.  It hit mmap based files from what I could see.  All
you need is an end user with a corner case that happens to tickle the
trigger and whammo.  You are now spending time fixing their problem
(which requires downgrading/upgrading).

You have a perfectly valid reason to upgrade threat facing nodes.  Keep
them as minimal and as up-to-date as possible.  The non-threat facing
nodes, this makes far less sense.  If you are doing single factor
authentication, and have enabled passwordless access within the cluster:
 ssh keys or certificates or ssh-agent based, once a machine that holds
these has been compromised, the game is over.  Multi-factor
authentication for launching cluster runs is still a challenge, as
queuing systems may schedule jobs to start at 3am local time, and no one
wants to wait around for job start to enter additional factors.

You want to test any upgrade, and only upgrade what needs upgrading.
Just like other aspects of security 101, threat facing nodes need to be
running as little (important) stuff as possible, and need as limited
access as you can give them.  Upgrades can and do carry their own bugs
and security holes, and you really don't want to be chasing those as well.

>> My real point was that WITH yum and a bit of prototyping once every
>> 12-24 months, it is really pretty easy to ride the FC wave on MANY
>> clusters, where the tradeoff is better support for new hardware and more
>> advanced/newer libraries against any library issues that one may or may
>> not encounter depending on just what the cluster is doing.  Freezing FC
>> (or anything else) long past its support boundary is obviously less
>> desireable.  However, it is also often unnecessary.
>>
> 
> Fedora Legacy just closed its doors - if you take a couple of months 
> to get your Uebercluster up and running, you're 1/3 of the way through 
> your FC cycle :( It doesn't square. Fedora looks set to lose its way 
> again for Fedora 7 as they merge Fedora Core and Extras and grow to 

Hmmm.  Fedora is the testing framework for RHEL.  We know this.  I like
6, it looks to be a fine test distro, and has lots of nice things in it.
 Works on lots of hardware.  If I were building a cluster on it, I would
not upgrade the compute nodes. Once they are set, unless there is a good
reason to upgrade (newer packages that do not add needed or missing
features is not a valid reason IMO), I would leave the compute nodes
alone.  Probably the head node as well.  The login nodes are a different
story.  Upgrade them (security patches) as quickly as possible.

> n-000 packages again - the fast upgrade cycle, lack of maintainers and 
> lack of structure do not bode well. They're apparently moving to a 13 month 
> upgrade cycle - so your Fedora odd releases could well be three years apart. 
> The answer is to take a stable distribution, install the minimum and work 
> with it OR build your own custom infrastructure as far as I can see. 
> Neither Red Hat nor Novell are cluster-aware in any detail - they'll 
> support their install and base programs but don't have the depth of 
> expertise to go further :(

Both are happy to sell licenses to the unwary.  At the end of the day,
if you are going to build a RHEL cluster, use Centos/Scientific Linux
unless you absolutely wish to pay RH for security patches.  With SuSE,
use OpenSuSE.  If you are going to settle on Fedora, pick a distro, and
remember that it will be out of support in a year, which shouldn't
matter to the compute/head node once they are up.

>> On clusters that add new hardware, usually bleeding edge, every four to
>> six months as research groups hit grant year boundaries and buy their
>> next bolus of nodes, FC really does make sense as Centos probably won't
>> "work" on those nodes in some important way and you'll be stuck
>> backporting kernels or worse on top of your key libraries e.g. the GSL.
>> Just upgrade FC regularly across the cluster, probably on an "every
>> other release" schedule like the one we use.
>>
> 
> Chances are that anything Red Hat Enterprise based just won't work. New 
> hardware is always hard. 

Heh.  Try to point this out to a purchasing agent on an RFP which
demands a) newest possible hardware and b) RHEL 4 support.  You get to
pick one or the other, not both.  Which one do you want?  Hint: "b" is
far less valuable.

The other (not-so-funny) aspect of this is when we deliver new hardware
with an OS load that supports the newer hardware and someone wants to
pull it back to the "corporate standard".  In doing so, they give up
stability, performance, and often file system support.  Or in the case
of our JackRabbit unit, when we deliver 30TB of 5U system and we get the
"ext3 is almost as good as xfs" line.  Uh.... er.... no.   Those who
really insist upon this must only want 16TB units with no possibility to
ever grow beyond this (we have a design cooked up to show how to do a 1
PB in 4 racks as a single file system, or better, an HA 1 PB in 9 racks
as a single file system).  16TB is great for some folks, but it is a
fundamental ext3 limit.  You need the untried-in-the-real-world ext4 to
break that limit.  Or xfs and jfs.


-- 
Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics LLC,
email: landman at scalableinformatics.com
web  : http://www.scalableinformatics.com
phone: +1 734 786 8423
fax  : +1 734 786 8452 or +1 866 888 3112
cell : +1 734 612 4615


From rgb at phy.duke.edu  Sun Jan  7 08:41:57 2007
From: rgb at phy.duke.edu (Robert G. Brown)
Date: Sun, 7 Jan 2007 11:41:57 -0500 (EST)
Subject: [Beowulf] Which distro for the cluster?
In-Reply-To: <20070107112230.GA7654@galactic.demon.co.uk>
References: <1d151d3b0612270946m2b09039ct538339e487cad6e8@mail.gmail.com>
	<Pine.LNX.4.64.0612281156120.25888@lilith.rgb.private.net>
	<200612290939.59593.csamuel@vpac.org>
	<20061229005749.GA13471@galactic.demon.co.uk>
	<Pine.LNX.4.64.0612290247590.9199@lilith.rgb.private.net>
	<m3hcv8z8hg.fsf@unna.nsc.liu.se>
	<Pine.LNX.4.64.0701030828480.9199@lilith.rgb.private.net>
	<20070107112230.GA7654@galactic.demon.co.uk>
Message-ID: <Pine.LNX.4.64.0701070931101.5894@lilith.rgb.private.net>

On Sun, 7 Jan 2007, Andrew M.A. Cater wrote:

>> BTW, the cluster's servers were not (and I would not advise that servers
>> ever be) running the old distro -- we use a castle keep security model
>> where servers have extremely limited access, are the most tightly
>> monitored, and are kept aggressively up to date on a fully supported
>> distro like Centos.  The idea is to give humans more time to detect
>> intruders that have successfully compromised an account at the
>> workstation LAN level and squash them like the nasty little dung beetles
>> that they are.
>>
>
> Can I quote you for Security 101 when I need to explain this stuff for
> senior management ?

Sure.  If you arrange to get me there and pay me an exorbitant fee, I'll
bring along my sucker rod and explain it to them myself on your behalf.

(I wouldn't try to extort the exorbitant fee for it except that to my
experience, if management isn't paying you $150/hour plus expenses for
your expertise, they devalue it.  Besides, I've still got to get
SOMEBODY to pay for my kids' Nintendo W(here )i(s )i(t) -- once I
actually find one to purchase;-)

> It is _always_ worth browsing the archives of this list. Somebody,
> somewhere has inevitably already seen it/done it/get the scars and is
> able to explain stuff lucidly. I can't recommend this list highly enough
> both for it's high signal/noise ratio and it's smart people [rgb 1-8
> inclusive, for example]

Make that $200/hour...;-)

> Fedora Legacy just closed its doors - if you take a couple of months
> to get your Uebercluster up and running, you're 1/3 of the way through
> your FC cycle :( It doesn't square. Fedora looks set to lose its way
> again for Fedora 7 as they merge Fedora Core and Extras and grow to
> n-000 packages again - the fast upgrade cycle, lack of maintainers and
> lack of structure do not bode well. They're apparently moving to a 13 month
> upgrade cycle - so your Fedora odd releases could well be three years apart.
> The answer is to take a stable distribution, install the minimum and work
> with it OR build your own custom infrastructure as far as I can see.
> Neither Red Hat nor Novell are cluster-aware in any detail - they'll
> support their install and base programs but don't have the depth of
> expertise to go further :(

Yeah, well, if RH asked me to be on their board of directors, I could
probably do something about a lot of this.  Their business plan is very
conservative and is "working" in that they are making money and keeping
their investors and the community simultaneously at bay, but they are
also missing multiple opportunities to really solidify and attack
Microsoft head on.  Fedora is working very well in many respects for
them -- I've been very happy with it since roughly FC 2 (FC 1 sucked,
partly because of the emergence of x86_64 and partly for other reasons).

I mean what the heck, they're right down the road, right?  I could
probably drive to board meetings and they could pay me in options...;-)

> Chances are that anything Red Hat Enterprise based just won't work. New
> hardware is always hard.

Yeah, starting right here.  There are several boats RH is missing, but
this is the biggest one.  RH can freeze all sorts of things in any given
distro and just maintain them, but the kernel and its associated
hardware layer of support tools (and applications that directly access
them) are NOT AMONG THEM!  Having a fixed release and support cycles in
terms of months or years is just silly.  Release cycles should be
determined by the way the product itself evolves, which in turn is a
marvelous and somewhat erratic function of the rapidly changing hardware
market and the whims of the toplevel developers (kernel, compiler, main
unix libraries, X).  Application space needs to be DECOUPLED from some
sort of sane base.

This is what they haven't yet grokked -- it is long since past time for
linux in general to separate into two distinct pieces, e.g. Fedora
>>core<< (which should be really minimal but well maintained for a "long
time") and Fedora >>applications<< which should be be entirely separate.
Precisely the same split should be visible in e.g. RHEL -- a core that
is large enough to support commercial applications with aggressive
kernel and hardware-layer updates and number of distinct layers of
applicationware -- X all by itself (separate from the core for RHEL
since servers don't need it and it really isn't desireable there as it's
more to validate, more to secure), DB ware, userspace applications in
general, etc.

With yum, all the work of being able to support partitioned maintenance
on the server or workstation itself is DONE, but the num-nums don't seem
to realize it.  Microsoft would go mad (and go broke) if they tried to
enforce a clean rebuild of every application in the Universe for every
new OS version they release.  And of course they can't -- even though
they've been systematically engulfing makers of WinXX software for years
as rapidly as antitrust laws permit them to do so there are still so
many companies out there that make hardware with device drivers on disk
or standalone software packages that they pretty much have to distribute
a core OS and leave it up to the user to break the hell out of things
with Installshield and battling libraries from ill-built or out of date
software packages.

This is where RH missed the boat entirely.  Faced with a resource
problem as they tried to do the undoable and given a space of possible
solutions, they opted for one of the simplest, but least efficient, of
those solutions.  What they NEEDED to do -- and still need to do -- is
think long and hard about just how to reorganize support of RHE linuces
(and or FC) so it is BOTH efficient enough to remain within their means
and the abilities of their software people to deliver AND capable of
both staying up to date on the kernel/core across the board.  I can then
think of all sorts of ways they could choose to layer successive updates
of application space.  In fact, "Fedora" could refer ONLY to the
aggressiveness of updates in the application layer.

At any rate, I empirically have found Centos to be nearly useless for
roughly 1/2 of each upgrade cycle on whole classes of hardware.  On
laptops it is a joke (except for one 6 month window perhaps right after
it comes out).  On x86_64 hardware it has been a crap shoot.  Even on
i386 hardware, one has the usual problem with this device or that
device, especially in a desktop environment where users DO want their
onboard video or sound or network to work (on server class hardware and
apps it is more likely to work).  Even FC makes me wait on laptops and
some desktop hardware.

THIS is one of two or three places where Lin still suffers relative to
Win -- Windows "always" works on any platform you buy because it is
"always" preinstalled and vendors experience pain and suffering if it
doesn't preinstall in a functional state.  Lin requires me to spend a
quiet hour of moderately expert time googling and reading stuff from
specialized sites to determine which (if any) firewire PCMCIA cards are
known to work before I dare to buy one, which cameras are likely to
work, which video adapters or sound cards are supported, which
motherboard CHIPSETS are known to work.

Bitch, bitch, bitch.  Sigh.

>> Nowadays, with PXE/Kickstart/Yum (or Debian equivalents, or the OS of
>> your choice with warewulf, or...) reinstalling OR upgrading a cluster
>> node is such a non-event in terms of sysadmin time and effort that it
>> can pretty much be done at will.
>
> I've had the pleasure/pain of watching cluster admins from a distance
> as they worked on a fully commercial cluster from major vendors. For
> most on this list, its a no-brainer. I wish I had seen the same.

Rather than say it is a no-brainer, perhaps it is fairer to say that
once one makes a relatively modest investment in training the brain to
learn how to use certain well-supported toolsets and ideas, it becomes
easy and the investment is paid back tenfold.  We're not quite to where
we have a "build-a-bear" GUI front end for cluster building or a
complete "cluster package" in any of the major distros, as far as I
know, although the warewulf folks and maybe the scyld folks and possibly
some others are getting there in their own distinct ways.

Again this is fairly silly.  Installing a cluster in this way and
installing a workstation or office LAN in this way (via PXE/KS/Y) are
really pretty much the same general task -- they differ only in package
selection and possibly -- I say possibly -- in the way workstations or
office systems are named.

Imagine a Red Hat sales rep walking into an office with a laptop (with a
gigE interface, an 8 port gigE switch with cables, and a halfway decent
fast disk).  He sets up the laptop and "borrows" four or five office
desktops and cables them into the switch.  He powers them on and sets
their BIOS to boot from the network first, with a standard 3 second or
whatever timeout.  They boot up, and -- magic! -- they are running
RHEL-whatever, with ooffice etc installed and ready to run.  WinXX is
still untouched on their native disks.  Everything is bulletproof and
automaintaining, with a clear partition between userspace and rootspace,
full control over user accounts and access, etc.

He removes the systems from the switch and puts them back on their
native LAN and reboots them to WinXX, and points out that installing and
maintaining Lin is just that easy.  He could have them set up with a Lin
server that support WinXX clients and Lin clients that boot just that
way overnight, and that permit the office staff to gradually convert to
Lin as they learn that it is mostly virusproof, that ooffice pretty much
just "works" like msoffice, that a browser is a browser and firefox is a
decent one, that there are several hundred free nifty desktop games to
while away those tedious cubicle hours when nobody's looking intead of
three.  At a cost of $50/seat and they can get rid of 2/3 of their admin
staff at the same time because one admin can easily support 100-200
desktop seats...

Hey, I can dream, can't I?

>> The worst thing that such a strategy might require is a rebuild of user
>> applications for both distros, but with shared libraries to my own
>> admittedly anecdotal experience this "usually" isn't needed going from
>> older to newer (that is, an older Centos built binary will "probably"
>> still work on a current FC node, although this obviously depends on the
>> precise libraries it uses and how rapidly they are changing).  It's a
>> bit harder to take binaries from newer to older, especially in packaged
>> form.  There you almost certainly need an rpmbuild --rebuild and a bit
>> of luck.
>>
>
> I use Debian - I've never had to learn about more than one repository
> and one distribution for everything I need. What is this "rebuilding" of
> which you speak :)

Ha.  I remember well the time that we considered Debian in our
department and rejected it because its stable distro suffered from
precisely the same problem that RHEL/Centos suffer from now.  It was
very stable, it worked excellently well, and it was way, way behind the
hardware curve in libraries and kernel support.  It may well be that
they've done a better job than RH at recognizing this as a core user
requirement in pretty much any environment so that the stable release
tracks the kernel and new hardware better (dealing with libraries and
dependencies as required).  It would be pretty easy to do.  It's just
unfortunate that Linux has never QUITE managed to turn the corner and
create clean layers of separation between the hardware and kernel, the
core libraries and compiler, and application space.  Hence the need for
distributions at all per se, hence the need for distribution "releases"
with applications pretty much all rebuilt just for the functional core
in question.

The weird thing is that in principle, both rpm and apt permit one to do
much better.  This is really a problem in computer science and software
design and OS organization that is SOLVABLE.  Packaging schema contain
the hooks required to do so, and the open source community has worked
out truly awesome methodology for maintaining a >>huge<< collection of
packages (I just grabbed images of FC 6 for i386 and x86_64 from Duke's
repo, and they ate close to 30 GB of disk for just the binary RPMs!)
The problem is all in the partitioning -- creating "independently"
maintainable layers.

I have modest hopes for HAL -- it was something of a joke previous to FC
5 or 6, but in 6 it actually works perfectly and transparently a lot of
time.  This is the kind of thing that is necessary -- with enough
abstraction it might be possible to maintain a kernel snapshot
"indefinitely" by simply updating its collection of modules and hal
itself, so that applications "just work" with new hardware without
having to upgrade to an unstable/rawhide release.

>> Truthfully, cluster installation and administration has never been
>> simpler.
>>
>
> I think you underestimate your expertise - and the expertise on this
> list. My mantra is that cluster administration should be simple and
> straightforward: in reality, it's seldom so.

It depends on the paradigm you adopt, and how lucky you are in terms of
hardware matching the capabilities of your distro/release.  Which
perhaps "shouldn't" be a matter of luck, but often is as there is
nothing that can protect you from "lemon" hardware but buying from a
vendor that will if necessary completely replace it.  (Even prototyping
won't always reveal a problem -- it just "probably" will.)

IF you select hardware from a vendor that guarantees hardware
compatibility with any of the current/mainline distros -- and there are
several that do -- AND you select one of those mainline (well-supported
and automagically installable) distros AND you learn to master its
automagic installation techniques, then managing any sort of linux
operation from a single machine to an organization-spanning LAN
consisting of an arbitrary mixture of servers, workstation/office LANs,
and clusters has never been simpler.  That is a true statement.  A
single repo mirror set, a single homemade package repo, and PXE permit a
single individual to provide ALL the software installation and
maintenance support required by a large company under these
circumstances.  Individuals can install linux on their own hardware (at
their own risk) at will from the repo(s), departments that follow the
hardware rules can install and maintain standardized systems any of a
number of ways, and in all cases a pro-class distribution updates all of
these systems in a fully automatic way e.g. nightly to the current repo
update level, making it easy to install new software or update old
software.

Cluster admins have it even easier, as their (linux distro compatible)
nodes are likely to be all IDENTICAL (in groups, at least, over several
generations) and homogeneity is the friend of the administrator just as
heterogeneity is Evil Incarnate.  Give me a switch and cables and a
rackful of Penguin boxes (please!:-), one equipped with a row of
hot-swappable disks and a tape library, and I'll take my laptop and its
currently FC6-full backpack disk and return you a functional cluster in
the amount of time required to physically assemble the nodes plus less
than a day to (re)install them with a perfectly reasonable cluster
configuration, very nearly independent of the number of nodes or racks.
Give me a couple or three days and I can probably arrange to install the
cluster a couple or three different ways -- diskful, diskless, mixed,
scyldified.

Not ALL cluster needs would be satisfied by this of course.  That's the
basic problem described in detail above.  If the cluster "required"
RHEL/Centos release X so it could run commercial package Y (and it
didn't just run anyway on FC6, which it probably would do:-) and the
penguin hardware "required" FC6 because older RHEL/Centos kernels just
don't support the network device or dual core dual CPU AMD x86_64 BIOS,
then yeah, you enter one form of Linux Hell from which there is no easy
escape but to not get the unsupported hardware in the first place, no
matter how much your users beg for bleeding edge hardware, OR getting
your #&!@ software vendor that you are PAYING to REBUILD their damn
application for FC6 (and in the process, package the thing up so it
autobuilds as RPMs or whatever) at least as well as all the maintainers
of the 6000-odd FREE packages in FC6 manage to package them up (grrr) OR
backporting kernels and key libraries from FC6 to RHEL/Centos whatever
-- maybe, possibly, don't hold your breath.  Hell.

Yup, then yum and friends, permitted RPM-derived linuces to emerge from
the long night of software dependency hell (where Debian had long since
stepped into the light).  It is time to really focus on hardware
dependency hell and conditional provisioning trees, both of which are
well within the capabilities of modern packaging systems and the general
linux design.  Conditional provisioning trees, in particular, could
really revolutionize things and perhaps make it possible to get away
from the notion of the "complete distribution release".  The current
paradigm, which worked amazingly well for order of a few hundred
packages, does not scale to a few thousand particularly well, and we're
well on our way to 10 Kpkg and up distribution releases, which will be a
maintenance nightmare under the current scheme.  I think, anyway.

The future should be interesting... as always.  It would be funny, in a
sick sort of way, if Windows manages to hold on in the face of linux
because it supports LESS software (but all of the hardware, nearly
perfectly).  Most people don't need more than a few hundred of the ~10
Kpkgs available.

    rgb

-- 
Robert G. Brown	                       http://www.phy.duke.edu/~rgb/
Duke University Dept. of Physics, Box 90305
Durham, N.C. 27708-0305
Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu


From rgb at phy.duke.edu  Sun Jan  7 12:49:50 2007
From: rgb at phy.duke.edu (Robert G. Brown)
Date: Sun, 7 Jan 2007 15:49:50 -0500 (EST)
Subject: [Beowulf] Which distro for the cluster?
In-Reply-To: <45A10C69.4030908@scalableinformatics.com>
References: <1d151d3b0612270946m2b09039ct538339e487cad6e8@mail.gmail.com>
	<Pine.LNX.4.64.0612281156120.25888@lilith.rgb.private.net>
	<200612290939.59593.csamuel@vpac.org>
	<20061229005749.GA13471@galactic.demon.co.uk>
	<Pine.LNX.4.64.0612290247590.9199@lilith.rgb.private.net>
	<m3hcv8z8hg.fsf@unna.nsc.liu.se>
	<Pine.LNX.4.64.0701030828480.9199@lilith.rgb.private.net>
	<20070107112230.GA7654@galactic.demon.co.uk>
	<45A10C69.4030908@scalableinformatics.com>
Message-ID: <Pine.LNX.4.64.0701071543170.5894@lilith.rgb.private.net>

On Sun, 7 Jan 2007, Joe Landman wrote:

>>> BTW, the cluster's servers were not (and I would not advise that servers
>>> ever be) running the old distro -- we use a castle keep security model
>>> where servers have extremely limited access, are the most tightly
>>> monitored, and are kept aggressively up to date on a fully supported
>>> distro like Centos.  The idea is to give humans more time to detect
>>> intruders that have successfully compromised an account at the
>>> workstation LAN level and squash them like the nasty little dung beetles
>>> that they are.
>
> Yup.  Even better is never letting the users log in to admin machines.
> Provide machines for them to log into, submit and run jobs from.  Just
> not the admin nodes.

That would be the "servers have extremely limited access" part -- as in
sysadmins only.

> For what I call production cycle shops, those places which have to churn
> out processing 24x7x365, you want as little "upgrading" as possible, and
> it has to be tested/functional with everything.  Ask your favorite CIO
> if they would consider upgrading their most critical systems nightly.
>
> It all boils down to a CBA (as everything does).  Upgrading carries
> risk, no matter who does it, and how carefully things are packaged.  The
> CBA equation should look something like this:
>
> 	value_of_upgrade = positive_benefits_of_upgrade -
> 			   potential_risks_of_upgrade

I completely agree with this.  As I pointed out earlier in the thread,
companies such as banks make "conservative" seem downright radical when
it comes to OS upgrades.  They have to do a complete, thorough,
comprehensive security audit to change ANYTHING on their machines -- as
a requirement in federal law, IIRC.  To get them to take you seriously,
you MUST be prepared to support the OS they install on (once it is
successfully audited) forever -- until the hardware itself falls apart
into itty-bitty bits.

>>> On clusters that add new hardware, usually bleeding edge, every four to
>>> six months as research groups hit grant year boundaries and buy their
>>> next bolus of nodes, FC really does make sense as Centos probably won't
>>> "work" on those nodes in some important way and you'll be stuck
>>> backporting kernels or worse on top of your key libraries e.g. the GSL.
>>> Just upgrade FC regularly across the cluster, probably on an "every
>>> other release" schedule like the one we use.
>>>
>>
>> Chances are that anything Red Hat Enterprise based just won't work. New
>> hardware is always hard.
>
> Heh.  Try to point this out to a purchasing agent on an RFP which
> demands a) newest possible hardware and b) RHEL 4 support.  You get to
> pick one or the other, not both.  Which one do you want?  Hint: "b" is
> far less valuable.
>
> The other (not-so-funny) aspect of this is when we deliver new hardware
> with an OS load that supports the newer hardware and someone wants to
> pull it back to the "corporate standard".  In doing so, they give up
> stability, performance, and often file system support.  Or in the case
> of our JackRabbit unit, when we deliver 30TB of 5U system and we get the
> "ext3 is almost as good as xfs" line.  Uh.... er.... no.   Those who
> really insist upon this must only want 16TB units with no possibility to
> ever grow beyond this (we have a design cooked up to show how to do a 1
> PB in 4 racks as a single file system, or better, an HA 1 PB in 9 racks
> as a single file system).  16TB is great for some folks, but it is a
> fundamental ext3 limit.  You need the untried-in-the-real-world ext4 to
> break that limit.  Or xfs and jfs.

Proving once again that Joe's company provides a valuable service,
because companies like this fill in an important gap between e.g. FC and
a customer's conservative needs.  However, I'll bet Joe is still just as
vulnerable to the other problem -- customer wants to run commercial
package X (which "requires" RHEL) but ALSO wants to run it on bleeding
edge hardware.  I'll bet you really earn your keep on those ones...

   ;-)

      rgb

-- 
Robert G. Brown	                       http://www.phy.duke.edu/~rgb/
Duke University Dept. of Physics, Box 90305
Durham, N.C. 27708-0305
Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu


From landman at scalableinformatics.com  Sun Jan  7 13:25:17 2007
From: landman at scalableinformatics.com (Joe Landman)
Date: Sun, 07 Jan 2007 16:25:17 -0500
Subject: [Beowulf] Which distro for the cluster?
In-Reply-To: <Pine.LNX.4.64.0701071543170.5894@lilith.rgb.private.net>
References: <1d151d3b0612270946m2b09039ct538339e487cad6e8@mail.gmail.com>
	<Pine.LNX.4.64.0612281156120.25888@lilith.rgb.private.net>
	<200612290939.59593.csamuel@vpac.org>
	<20061229005749.GA13471@galactic.demon.co.uk>
	<Pine.LNX.4.64.0612290247590.9199@lilith.rgb.private.net>
	<m3hcv8z8hg.fsf@unna.nsc.liu.se>
	<Pine.LNX.4.64.0701030828480.9199@lilith.rgb.private.net>
	<20070107112230.GA7654@galactic.demon.co.uk>
	<45A10C69.4030908@scalableinformatics.com>
	<Pine.LNX.4.64.0701071543170.5894@lilith.rgb.private.net>
Message-ID: <45A1653D.4060605@scalableinformatics.com>

Robert G. Brown wrote:

> Proving once again that Joe's company provides a valuable service,

Well thank you (the check will be in the mail :) )

> because companies like this fill in an important gap between e.g. FC and
> a customer's conservative needs.  However, I'll bet Joe is still just as
> vulnerable to the other problem -- customer wants to run commercial
> package X (which "requires" RHEL) but ALSO wants to run it on bleeding
> edge hardware.  I'll bet you really earn your keep on those ones...

... and some rather deep scars, traumatic head wounds, and related ... 
Still have most of my fingers ...

Humor aside, we have a little download area where we point our customers 
to (http://downloads.scalableinformatics.com).  Each file there usually 
represents a solved problem ... though some have caused problems on 
their own (the areca drivers and xfs bits for RHEL based distros are 
there ... I am loath to rewrite anyones initrd without asking them, 
nicely, and giving them a way to recover should it go horribly wrong ... 
I still want to come up with a good solution ... ).

> 
>   ;-)
> 
>      rgb
> 


-- 

Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics LLC,
email: landman at scalableinformatics.com
web  : http://www.scalableinformatics.com
phone: +1 734 786 8423
fax  : +1 734 786 8452 or +1 866 888 3112
cell : +1 734 612 4615


From landman at scalableinformatics.com  Sun Jan  7 19:49:55 2007
From: landman at scalableinformatics.com (Joe Landman)
Date: Sun, 07 Jan 2007 22:49:55 -0500
Subject: [Beowulf] Any Gaussian users out there?
Message-ID: <45A1BF63.4010203@scalableinformatics.com>

I found a neat ... feature ... of Linux while getting g03 running in SMP 
on cluster nodes.  Long story, but the folks I am doing this for don't 
have/want to use Linda.  They asked us to help them get g03 operational 
in SMP parallel.  This wasn't painful.  Have it integrated into SGE and 
our SICE interface now as well.

Basic idea is that we are getting a kernel exception in the VFS layer 
only when running with 2 or more CPUs on an SMP node.  Shows up only on 
SuSE 9.3 nodes.  The other nodes are RHEL 3 based (2.4 kernel, but hey, 
its really stable).

I don't want to post a nasty-looking trap here.

The problem occurs with both xfs and jfs.  Haven't had the chance to try 
ext3 yet, though if the issue is in the vfs layer, I can't see how 
changing the underlying block device is going to alter the layers (VFS) 
above it.

The net effect of this is that it runs great on the 2.4 based machines, 
but gets SIGKILLs when running on the 2.6 based SuSE 9.3 machines. 
Looks like the app is tickling the OS bug.  I can repeatably cause this 
trap, though it seems to occur at "random" places, well, not really. 
The way Gaussian runs, it has "links" which are binary modules which 
execute a particular portion of the calculation (its pretty neat 
really).  Each link is read in from the disk.  This VFS bug gets 
triggered regardless of local or remote FS.

Any Gaussian users out there see that?  Does a kernel upgrade fix it? 
Inquiring minds want to know ...

-- 

Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics LLC,
email: landman at scalableinformatics.com
web  : http://www.scalableinformatics.com
phone: +1 734 786 8423
fax  : +1 734 786 8452 or +1 866 888 3112
cell : +1 734 612 4615


From nixon at nsc.liu.se  Mon Jan  8 03:03:00 2007
From: nixon at nsc.liu.se (Leif Nixon)
Date: Mon, 08 Jan 2007 12:03:00 +0100
Subject: [Beowulf] Which distro for the cluster?
In-Reply-To: <45A10C69.4030908@scalableinformatics.com> (Joe Landman's message
	of "Sun, 07 Jan 2007 10:06:17 -0500")
References: <1d151d3b0612270946m2b09039ct538339e487cad6e8@mail.gmail.com>
	<Pine.LNX.4.64.0612281156120.25888@lilith.rgb.private.net>
	<200612290939.59593.csamuel@vpac.org>
	<20061229005749.GA13471@galactic.demon.co.uk>
	<Pine.LNX.4.64.0612290247590.9199@lilith.rgb.private.net>
	<m3hcv8z8hg.fsf@unna.nsc.liu.se>
	<Pine.LNX.4.64.0701030828480.9199@lilith.rgb.private.net>
	<20070107112230.GA7654@galactic.demon.co.uk>
	<45A10C69.4030908@scalableinformatics.com>
Message-ID: <m3y7odzspn.fsf@unna.nsc.liu.se>

Joe Landman <landman at scalableinformatics.com> writes:

> Andrew M.A. Cater wrote:
>> On Wed, Jan 03, 2007 at 09:51:44AM -0500, Robert G. Brown wrote:
>>> On Wed, 3 Jan 2007, Leif Nixon wrote:
>>>
>>>> "Robert G. Brown" <rgb at phy.duke.edu> writes:
>>>>
>>>   b) If an attacker has compromised a user account on one of these
>>> workstations, IMO the security battle is already largely lost.  They
>
> s/largely/completely/g
>
> At least for this user, if they have single factor passwordless login
> set up between workstation and cluster.

Of course. But you want to contain the intrusion to that single user,
as far as possible. If your security hinges on no user passwords ever
being stolen, you can very easily wind up in a situation that
traditionally is said to involve creeks, but not paddles. I have two
thick binders sitting on my desk, containing stolen passwords from an
impressive range of commercial, academic and military institutions. 

>>> In general, though, it is very good advice to stay with an updated OS.
>
> ... on threat-facing systems, yes, I agree.
>
> For what I call production cycle shops, those places which have to churn
> out processing 24x7x365, you want as little "upgrading" as possible, and
> it has to be tested/functional with everything.  Ask your favorite CIO
> if they would consider upgrading their most critical systems nightly.

I see this in hospitals a lot. Some healthcare systems can't be
patched without reapplying for FDA approval, which is of course a
hideously complicated process. So hospitals wind up running software
which you can push over with a feather. Theoretically, they should be
running on an isolated network ("It's no problem, we have
firewalls!!!"), but it only takes a single mistake: somebody plugs in
an infected laptop, or somebody misconfigures a VLAN. Our local
hospital has fallen over due to worm infestations a couple of times.

> It all boils down to a CBA (as everything does).  Upgrading carries
> risk, no matter who does it, and how carefully things are packaged.  The
> CBA equation should look something like this:
>
> 	value_of_upgrade = positive_benefits_of_upgrade -
> 			   potential_risks_of_upgrade

With the security benefits being really hard to quantify. 

> You have a perfectly valid reason to upgrade threat facing nodes.  Keep
> them as minimal and as up-to-date as possible.  The non-threat facing
> nodes, this makes far less sense.  If you are doing single factor
> authentication, and have enabled passwordless access within the cluster:
>  ssh keys or certificates or ssh-agent based, once a machine that holds
> these has been compromised, the game is over.

I don't get this. What's the point of having a "secure" frontend if
the systems behind it are insecure? OK, there's one big point -
hopefully you can buy some time - but other than that? 

The goal is to be able to contain user level intrusions. If you can do
this, the game *isn't* over even if you have an intrusion spreading to
a cluster machine. A user level intrusion isn't too hard to deal with,
but a cluster-wide root intrusion... isn't much fun. Sure, you can
probably reinstall the entire cluster in an hour. To a vulnerable
state. Hooray.

-- 
Leif Nixon                       -            Systems expert
------------------------------------------------------------
National Supercomputer Centre    -      Linkoping University
------------------------------------------------------------


From landman at scalableinformatics.com  Mon Jan  8 07:20:34 2007
From: landman at scalableinformatics.com (Joe Landman)
Date: Mon, 08 Jan 2007 10:20:34 -0500
Subject: [Beowulf] Which distro for the cluster?
In-Reply-To: <m3y7odzspn.fsf@unna.nsc.liu.se>
References: <1d151d3b0612270946m2b09039ct538339e487cad6e8@mail.gmail.com>	<Pine.LNX.4.64.0612281156120.25888@lilith.rgb.private.net>	<200612290939.59593.csamuel@vpac.org>	<20061229005749.GA13471@galactic.demon.co.uk>	<Pine.LNX.4.64.0612290247590.9199@lilith.rgb.private.net>	<m3hcv8z8hg.fsf@unna.nsc.liu.se>	<Pine.LNX.4.64.0701030828480.9199@lilith.rgb.private.net>	<20070107112230.GA7654@galactic.demon.co.uk>	<45A10C69.4030908@scalableinformatics.com>
	<m3y7odzspn.fsf@unna.nsc.liu.se>
Message-ID: <45A26142.9050408@scalableinformatics.com>

I am posting before coffee (PBC) so if I ramble more than usual, my
apologies.

Leif Nixon wrote:
> Joe Landman <landman at scalableinformatics.com> writes:

>>>>   b) If an attacker has compromised a user account on one of these
>>>> workstations, IMO the security battle is already largely lost.  They
>> s/largely/completely/g
>>
>> At least for this user, if they have single factor passwordless login
>> set up between workstation and cluster.
> 
> Of course. But you want to contain the intrusion to that single user,
> as far as possible.

I think there are two different issues.  First: security is meant to be
an access control and thottle/choke point.  Second: is how you view your
cluster.  Is it "one-big-machine" in some sense (not necessarily Scyld,
but with a security model such that if you are on the access node you
are on the machine), or is it really a collection of individual machines
each with their own administrative domain?  One of these models works
really well for "cluster" use.

> If your security hinges on no user passwords ever
> being stolen, you can very easily wind up in a situation that
> traditionally is said to involve creeks, but not paddles. 

Your security model should mirror your intended usage model as indicated
above.  If you are using a cluster, security is the front door.  If you
are using something else, the security model may be different.  Since we
are into analogies, why not look at it like the front of a very
exclusive club.  If you get in, you are in.  If you want, you can even
implement different security room to room, which very quickly causes
your club members to leave as it gets hard to move room to room.

Security is in part about containment.  Containment is not necessarily
putting a lock on every door, and a different required key or three to
unlock.

More importantly, security is about minimizing the maximum damage an
attack can do.  A different lock on every door may stop the casual
attacker, but as you have large binders of stolen passwords (the
authorites might wish to ask you how you got them :( ), I have some not
so nice log files of years of hackers, some script kiddies, and some
very good ones, beating on everything but the front door.

Put another way, I've been mimicing a few others for the better part of
a decade, saying security is a process, not a product.  Making a process
hard doesn't necessarily make it secure.  Making sure that when the
process breaks down, and it will, the damage as a result of that
breakage is as low as you can make it.

> I have two
> thick binders sitting on my desk, containing stolen passwords from an
> impressive range of commercial, academic and military institutions. 
> 
>>>> In general, though, it is very good advice to stay with an updated OS.
>> ... on threat-facing systems, yes, I agree.
>>
>> For what I call production cycle shops, those places which have to churn
>> out processing 24x7x365, you want as little "upgrading" as possible, and
>> it has to be tested/functional with everything.  Ask your favorite CIO
>> if they would consider upgrading their most critical systems nightly.
> 
> I see this in hospitals a lot.

I see this in every single production cycle shop we have been in.  Not
just FDA-regulated.  So much so that they have a process that involves
building a second (or Nth) test machine, called a sandbox, specifically
to test things until they believe them to work before deploying them.

Back to this in a second.

> Some healthcare systems can't be
> patched without reapplying for FDA approval, which is of course a
> hideously complicated process. So hospitals wind up running software
> which you can push over with a feather. Theoretically, they should be
> running on an isolated network ("It's no problem, we have
> firewalls!!!"), but it only takes a single mistake: somebody plugs in
> an infected laptop, or somebody misconfigures a VLAN. Our local
> hospital has fallen over due to worm infestations a couple of times.

The analogy fails to hold up.  Zero-day viruses and malware on fully
patched windows systems burns through the desktop/laptop population of
many.  What is terrifying to me is that my government still
mandates/allows the use of systems which are easily compromised in its
most sensitive inner reaches.  Specifically in the military and related
areas.  I don't know details, only heard faint mutterings online, but
something like this appears to have knocked some portion of government
computers in a highly sensitive area offline for several days very recently.

As indicated before, security is not a product (e.g. an updated patch),
it is a process (minimizing the maximum damage).  If you act otherwise,
the zero day virus' and malware are going to wreak havoc.  Or if you
think your systems are secure because you use multifactor access control
with long random passwords and secure id cards, you somehow (mistakenly)
believe your systems are secure, and you don't pay attention to some ...
misfeatures that are being exercised by people of nefarious intent.

If all I ever do is send random garbage to port 22 after doing the
handshaking, and eventually blow ssh out of the water, it really doesnt
matter if you have multifactor authentication running.  I would be in as
the user running the daemon.  Hence privilege separation.  Change the
code so that if there is a break in, the maximum damage that can be done
is done as the sshd_daemon user.  Since they are no longer root user,
and they are isolated, in their own group, the damage they can do is
contained.  Minimizing the maximum damage.

> 
>> It all boils down to a CBA (as everything does).  Upgrading carries
>> risk, no matter who does it, and how carefully things are packaged.  The
>> CBA equation should look something like this:
>>
>> 	value_of_upgrade = positive_benefits_of_upgrade -
>> 			   potential_risks_of_upgrade
> 
> With the security benefits being really hard to quantify. 

Not really.  If you have a huge gaping hole that needs patching (OpenSSL
off-by-one or weakness), the benefits are easy.  Again, it is a process.
 You test the upgrade, and if it breaks nothing else, you do it.  In
fact, this suggests (usually) doing upgrades in smaller incremental bits
rather than large complex bits.  A huge bolus of patches and fixes often
has a few new (mis)features (I could name a company here, but they know
who they are) which are unfortunately potentially exploitable.

To keep risks low, make as few changes as possible.  To keep benefits
high, update important threat facing things.  To keep risks lower, do
not introduce more changes than absolutely needed.  Patches should not
include new (mis)features.

>> You have a perfectly valid reason to upgrade threat facing nodes.  Keep
>> them as minimal and as up-to-date as possible.  The non-threat facing
>> nodes, this makes far less sense.  If you are doing single factor
>> authentication, and have enabled passwordless access within the cluster:
>>  ssh keys or certificates or ssh-agent based, once a machine that holds
>> these has been compromised, the game is over.
> 
> I don't get this. What's the point of having a "secure" frontend if
> the systems behind it are insecure? OK, there's one big point -
> hopefully you can buy some time - but other than that? 

Its the model of how you use the machine.  If you lock all the doors
tight with impenetrable seals, and the attacker goes through the weaker
windows, those impenetrable seals haven't done much for you.

The idea is you minimize the exposed footprint of the machine to threat
facing access.  This is why lots of the secure sites are disabling USB
ports on the motherboards (but mistakenly then running systems which can
install keyloggers and other malware ... ).  If the USB does not
electrically work, it is not a possible attack vector.

You can always take the approach of compartmentalization; locking
*everything* down.  Put those impenetrable seals up.  Have one port
exposed.  Allow no back channels whatsoever.  No shared storage.  No
single factor authentication.  This is not a cluster computing model
that I have heard of.  Would break too many things.  Yeah yeah, grid
this and that.

> The goal is to be able to contain user level intrusions. If you can do

I disagree.  I think the goal is to minimize the maximum damage.  I do
not think it is possible to completely contain a smart and resourceful
attacker with multiple attack vectors.  I know lots of security folks
who used to think that their firewalls could, and then watched said
resourceful hackers go through them.

> this, the game *isn't* over even if you have an intrusion spreading to
> a cluster machine. A user level intrusion isn't too hard to deal with,
> but a cluster-wide root intrusion... isn't much fun. Sure, you can
> probably reinstall the entire cluster in an hour. To a vulnerable
> state. Hooray.

Again, I disagree.  I do not believe patching is a magic solution.  A
well designed security model that, in the event that the assumptions of
the model break down (say all the doors and ports suddenly, magically
spring open, because the attacker muttered the appropriate phrase into
the wire), still limits the damage that can be done, might be an
approach worth considering.

Again, I watch (in horror) as military organizations, with some really
nice rules and procedures behind them designed to contain and control
these bits, proceed to use systems that are known to be insecure by
design.  If your system can be keylogged, it should never ever be on a
network, anywhere.  Or change your system so that keylogging is
harder/impossible.  Security is a process.  Like never downloading
important information to a laptop only to let it be stolen / lost later
on.  The current fad is encrypting the disk, and this might prevent some
attacks, or slow the rate of information release.  Or not.

The point is that if the maximum damage an attacker can do is contained
or minimized, then you can gather valuable threat information from their
attack.  Part of the rationale for honeypots is putting systems without
anything important out there, in order to observe attacks, find
vulnerabilities, and learn how to defend against them.  This is done by
not co-locating the honeypot on a useful system.  By containing what is
in there, and what it has access to.


-- 
Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics LLC,
email: landman at scalableinformatics.com
web  : http://www.scalableinformatics.com
phone: +1 734 786 8423
fax  : +1 734 786 8452 or +1 866 888 3112
cell : +1 734 612 4615


From diep at xs4all.nl  Mon Jan  8 08:06:04 2007
From: diep at xs4all.nl (Vincent Diepeveen)
Date: Mon, 8 Jan 2007 17:06:04 +0100
Subject: [Beowulf] Which distro for the cluster?
References: <1d151d3b0612270946m2b09039ct538339e487cad6e8@mail.gmail.com><Pine.LNX.4.64.0612281156120.25888@lilith.rgb.private.net><200612290939.59593.csamuel@vpac.org><20061229005749.GA13471@galactic.demon.co.uk><Pine.LNX.4.64.0612290247590.9199@lilith.rgb.private.net><m3hcv8z8hg.fsf@unna.nsc.liu.se>
	<Pine.LNX.4.64.0701030828480.9199@lilith.rgb.private.net>
Message-ID: <006201c7333e$e9393d60$0300a8c0@gourmandises>


----- Original Message ----- 
From: "Robert G. Brown" <rgb at phy.duke.edu>
To: "Leif Nixon" <nixon at nsc.liu.se>
Cc: <beowulf at beowulf.org>
Sent: Wednesday, January 03, 2007 3:51 PM
Subject: Re: [Beowulf] Which distro for the cluster?


> On Wed, 3 Jan 2007, Leif Nixon wrote:
>
>> "Robert G. Brown" <rgb at phy.duke.edu> writes:
>>
>>> Also, plenty of folks on this list have done just fine running "frozen"
>>> linux distros "as is" for years on cluster nodes.  If they aren't broke,
>>> and live behind a firewall so security fixes aren't terribly important,
>>> why fix them?
>>
>> Because your users will get their passwords stolen.
>>
>> If your cluster is accessible remotely, that firewall doesn't really
>> help you very much. The attacker can simply login as a legitimate user
>> and proceed to walk through your wide-open local security holes.
>
> So:
>
>   a) Our cluster wasn't remotely accessible.  In fact, it was on a
> 192.168 network and in order to even touch it, one had to login to an up
> to date, carefully defended desktop workstation login server in the
> department.

Of course it is the layman talking here:

Anything that is connected to a network is going to get hacked if there is 
more than interesting information to find.

The real question is: "what type of software/data do you store?"

Depending upon the answer hackers will get in or will not even try.

Guided missile data? Oh boy... (even oh boy when it is a total average 
university project that is going to draw 0 interesting conclusions,
with the usual fraud of data modifying in order to let the results look 
better, rather than fix the software)

Most hackers seem to just automatically collect everything in order to keep 
busy.

There is weird holes in all kind of distributions.
A few weeks ago i installed a debian server here and for some weird reason 
some kind of email client opened a port default (port 25).

>   b) If an attacker has compromised a user account on one of these
> workstations, IMO the security battle is already largely lost.  They

If it is possible to login somehow and store files on some kind of 
harddrive, then you're already rooted in no-time.
It is too easy in UNIX/IRIX/LINUX to READ data.

Not necessarily modify data, but READING other persons data is far easier 
than modifying data.

Think of caches that store data and so on.

In the long run the real big problem is not taking care that no one from 
outside can get into your machine.

The problem is all the different types of software that users run on their 
accounts. Like my debian router/firewall is a joke of a firewall,
because the windows clients behind it can nearly freely access the internet.

Just 1 bad program that has some sort of spyware that goes outside, and 
hackers can use that same spyware channel to get in.

With respect to remotely accessing clusters over the internet, you can call 
those of course semi-secure, because the way you
access them is not secure. Some SSH type connection is enough to get rid of 
a few unorganized criminals who usually cannot tap
the entire conversation stream.

In case of PGP when the first bits arrive, just flip a protocol bit, after 
which the entire response goes unencrypted, and just before
shipping it to the receiving client, encrypt it for that client. He won't 
notice.

However that is all very paranoia thinking.
When using default security, things are pretty safe, because no one can tap 
the entire conversation.

> have a choice of things to attack or further evil they can try to wreak.
> Attacking the cluster is one of them, and as discussed if the cluster is
> doing real parallel code it is likely to be quite vulnerable regardless
> of whether or not its software is up to date because network security is
> more or less orthogonal to fine-grained code network performance.

> Still, a cluster is paradoxically one of the best monitored parts of a
> network.  Although it would make a gangbusters DoS platform, network

You don't hack a cluster in order to start a DoS attack.

If a cluster gets hacked it'll be for the software that runs on it and the 
output data.

> traffic on the cluster, cpu consumption on the cluster, user access to
> the cluster are all relatively carefully monitored.  The cluster
> installation is likely to be different enough and "odd" enough to make
> standard rootkit encapsulations fail for anyone but the legendary
> Ubercracker (who can always do whatever they want anyway, right?;-) In
> an organization that tightly monitors everything all the time on general
> security principles (first line of defense, really, as one can NEVER be
> sure all exploitable holes are closed even with a yum-updated, stable,
> currently supported distro and human eyes are better at picking up
> anomalies in system operation than any automated tool) I think it is
> pretty likely that any attempt to take over a cluster and use it for
> diabolical ends would be almost instantly detected.

I feel the real problem is not so much misusage of your hardware by the 
Uebercracker,
as well as that some companies fear that their data can get read by others. 
Years of original
development of your idea and hardware, stolen by a simple hackattempt.

Most importantly giving your competitor new ideas on how to progress, even 
more important than
that they can "reproduce" your original idea.

But basically paranoia only applies to software/hardware that falls under 
category 5 of the wassenaar treaty
( www.wassenaar.org ), in about every other case the average person in ICT 
has too much paranoia, whereas
there is nothing wrong and no one is stealing his data.

btw that doesn't apply to collegues of me, as i detected they install 
together with their software spyware (like shredder classic does do,
a weird program called wuw.exe or windows update wizard that mcafee didn't 
detect), of course as usual, that is windows software.

> BTW, the cluster's servers were not (and I would not advise that servers
> ever be) running the old distro -- we use a castle keep security model
> where servers have extremely limited access, are the most tightly
> monitored, and are kept aggressively up to date on a fully supported
> distro like Centos.  The idea is to give humans more time to detect
> intruders that have successfully compromised an account at the
> workstation LAN level and squash them like the nasty little dung beetles
> that they are.

"Castle keep security model".

You mean it has been airgapped?

> FWIW, our department is entirely linux at the server level, and almost
> entirely linux at the workstation level.  A very few experimental groups
> and individuals run either Windows boxes (usually to be able to use some
> particular software package) or Macs (because they are, umm, "that kind
> of user":-).  I'm guessing that the ratio is something like 4:1 linux to
> Win at the workstation level (Macs down there in the noise) and maybe
> 10:1 linux to win if you include cluster nodes, whatever OS they might
> be running.

What if someone installs on a windoze box for example shredder classic,
which spyware communicates to outside with the great security of 32 bits 
RSA,
it gotta run fast on a 32 bits machine of course,
meaning that if some clever student, who just got a job, manages to crack 
that,
he can take over the communication will manage to root your network and get 
all
data he wants from it.

Of course this is just a paranoia thought experiment, as your uni doesn't 
have of course
anything interesting to anybody, let alone that you can make money with it, 
let alone that
it is interesting.

> Since Seth introduced yup on top of RH (maybe 7-8 years ago?  How time
> flies...), and then proceeded to write yum to replace yup for RPM
> distros in general, we haven't had a single successful promotion to root
> in the department.  Nothing done locally can prevent some grad student's
> password from being trapped as they login from some compromised
> win-based system in their hometown over fall break, but the very few of
> these that have occurred have been quickly detected and quickly squashed
> without further compromise.

> In that same interval, we had a WinXX system compromised and turned into
> a pile of festering warez rot something like twice a year.  Pretty
> amazing given that they are kept up to date as best as possible and they
> make up only 10-20% of our total system count.

"How bad is it Humphrey?"

"Yes Minister, only 10% of our organization has been infected,
so we do not need to start some commie hunt within our organisation at all,
as 90% of it is clean, so not a SINGLE file could have been possibly taken
away, as those 90% would have noticed it; besides a file needs 12 stamps 
from
6 different departments before it can get out".

>> But you know this already.
>
> Oh yeah;-)
>
> And we didn't do this "willingly" and aren't that likely to repeat it
> ourselves.  We had some pretty specific reasons to freeze the node
> distro -- the cluster nodes in question were the damnable Tyan dual
> Athlon systems that were an incredible PITA to stabilize in the first
> place (they had multiple firmware bugs and load-based stability issues
> under the best of circumstances).  Once we FINALLY got them set up with
> a functional kernel and library set so that they wouldn't crash, we were
> extremely loathe to mess with it.  So we basically froze it and locked
> down the nodes so they weren't easily accessible except from inside the
> department, and then monitored them with xmlsysd and wulfstat in
> addition to the usual syslog-ng and friends admin tools.
>
> Odd usage patterns (that is, almost any sort of running binary that
> wasn't a well-known numerical task associated with one of the groups,
> logins by anyone who wasn't a known user) would have been noticed by any
> of a half-dozen people, one of whom was me, almost immediately.  The
> kernel was "barely stable" as it was and couldn't easily have been
> replaced with a hacker kernel (to e.g. erase /proc trace) without a VERY
> high probability that the hacker kernel would crash the system and
> reveal the hacker on the first try. xmlsysd reads all sorts of stuff
> from all over /proc and was custom code that I was working on and
> periodically updating, even while Seth was working on yum and updating
> THAT.  Somebody would have had to literally custom craft some very
> advanced C code to stay hidden on the cluster and even then would have
> been revealed by e.g. an update of xmlsysd unless they were a bit beyond
> even Ubercracker status.
>
> In general, though, it is very good advice to stay with an updated OS.
> My real point was that WITH yum and a bit of prototyping once every
> 12-24 months, it is really pretty easy to ride the FC wave on MANY
> clusters, where the tradeoff is better support for new hardware and more
> advanced/newer libraries against any library issues that one may or may
> not encounter depending on just what the cluster is doing.  Freezing FC
> (or anything else) long past its support boundary is obviously less
> desireable.  However, it is also often unnecessary.
>
> On clusters that add new hardware, usually bleeding edge, every four to
> six months as research groups hit grant year boundaries and buy their
> next bolus of nodes, FC really does make sense as Centos probably won't
> "work" on those nodes in some important way and you'll be stuck
> backporting kernels or worse on top of your key libraries e.g. the GSL.
> Just upgrade FC regularly across the cluster, probably on an "every
> other release" schedule like the one we use.
>
> On clusters (or sub-clusters) with a 3 year replacement cycle, Centos or
> other stable equivalent is a no-brainer -- as long as it installs on
> your nodes in the first place (recall my previous comment about the
> "stars needing to be right" to install RHEL/Centos -- the latest release
> has to support the hardware you're buying) you're good to go
> indefinitely, with the warm fuzzy knowledge that your nodes will update
> from a "supported" repo most of their 3+ year lifetime, although for the
> bulk of that time the distro will de-facto be frozen except for whatever
> YOU choose to backport and maintain.
>
> And really, there isn't much stopping folks from adopting a range of
> "mixed" strategies -- running FC-whatever on new nodes for a year or
> whatever as needed in order to support their hardware or use new
> libraries, then reinstalling them with Centos/RHEL (which is basically
> FC-even-current-at-release-time frozen and supported or so it seems
> recently anyway) as Centos support catches up with the hardware by
> syncing with an FC-current on a new release.
>
> Nowadays, with PXE/Kickstart/Yum (or Debian equivalents, or the OS of
> your choice with warewulf, or...) reinstalling OR upgrading a cluster
> node is such a non-event in terms of sysadmin time and effort that it
> can pretty much be done at will.  Except for pathological cases (like
> the Tyans) we're talking at most a few days of sysadmin time to set up a
> prototyping node or four, flash over to the new distro via a discrete
> node reboot (unattended automated reinstall or a new node diskless
> image), and let selected users whack on it for a week or two.  If it
> proves invisibly stable and satisfactory -- the rule rather than the
> exception -- crank it on up across the cluster.  Even if it "fails" on
> some untested pathway after you do this, it costs you at most a reboot
> (again to a reinstall/replacement of a node image) to put things back as
> they were while you fix things.
>
> The worst thing that such a strategy might require is a rebuild of user
> applications for both distros, but with shared libraries to my own
> admittedly anecdotal experience this "usually" isn't needed going from
> older to newer (that is, an older Centos built binary will "probably"
> still work on a current FC node, although this obviously depends on the
> precise libraries it uses and how rapidly they are changing).  It's a
> bit harder to take binaries from newer to older, especially in packaged
> form.  There you almost certainly need an rpmbuild --rebuild and a bit
> of luck.
>
> Truthfully, cluster installation and administration has never been
> simpler.
>
>    rgb
>
> -- 
> Robert G. Brown                        http://www.phy.duke.edu/~rgb/
> Duke University Dept. of Physics, Box 90305
> Durham, N.C. 27708-0305
> Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu
>
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit 
> http://www.beowulf.org/mailman/listinfo/beowulf
> 


From kus at free.net  Mon Jan  8 08:30:31 2007
From: kus at free.net (Mikhail Kuzminsky)
Date: Mon, 08 Jan 2007 19:30:31 +0300
Subject: [Beowulf] Any Gaussian users out there?
In-Reply-To: <45A1BF63.4010203@scalableinformatics.com>
Message-ID: <web-1257758@free.net>

In message from Joe Landman <landman at scalableinformatics.com> (Sun, 07 
Jan 2007 22:49:55 -0500):
>I found a neat ... feature ... of Linux while getting g03 running in 
>SMP on cluster nodes.  Long story, but the folks I am doing this for 
>don't have/want to use Linda.  They asked us to help them get g03 
>operational in SMP parallel.  This wasn't painful.  Have it 
>integrated into SGE and our SICE interface now as well.
>
>Basic idea is that we are getting a kernel exception in the VFS layer 
>only when running with 2 or more CPUs on an SMP node.  Shows up only 
>on SuSE 9.3 nodes.  The other nodes are RHEL 3 based (2.4 kernel, but 
>hey, its really stable).
   We have working g03 C02 w/SMP parallelization under SuSE 9.0 for 
x86-64 (2.6 kernel, but more old than f0r 9.3 ). In particular, xfs 
works OK.

Yours
Mikhail


>
>I don't want to post a nasty-looking trap here.
>
>The problem occurs with both xfs and jfs.  Haven't had the chance to 
>try ext3 yet, though if the issue is in the vfs layer, I can't see 
>how changing the underlying block device is going to alter the layers 
>(VFS) above it.
>
>The net effect of this is that it runs great on the 2.4 based 
>machines, but gets SIGKILLs when running on the 2.6 based SuSE 9.3 
>machines. Looks like the app is tickling the OS bug.  I can 
>repeatably cause this trap, though it seems to occur at "random" 
>places, well, not really. The way Gaussian runs, it has "links" which 
>are binary modules which execute a particular portion of the 
>calculation (its pretty neat really).  Each link is read in from the 
>disk.  This VFS bug gets triggered regardless of local or remote FS.
>
>Any Gaussian users out there see that?  Does a kernel upgrade fix it? 
>Inquiring minds want to know ...
>
>-- 
>
>Joseph Landman, Ph.D
>Founder and CEO
>Scalable Informatics LLC,
>email: landman at scalableinformatics.com
>web  : http://www.scalableinformatics.com
>phone: +1 734 786 8423
>fax  : +1 734 786 8452 or +1 866 888 3112
>cell : +1 734 612 4615
>
>_______________________________________________
>Beowulf mailing list, Beowulf at beowulf.org
>To change your subscription (digest mode or unsubscribe) visit 
>http://www.beowulf.org/mailman/listinfo/beowulf


From rgb at phy.duke.edu  Mon Jan  8 12:43:18 2007
From: rgb at phy.duke.edu (Robert G. Brown)
Date: Mon, 8 Jan 2007 15:43:18 -0500 (EST)
Subject: [Beowulf] Which distro for the cluster?
In-Reply-To: <45A26142.9050408@scalableinformatics.com>
References: <1d151d3b0612270946m2b09039ct538339e487cad6e8@mail.gmail.com>
	<Pine.LNX.4.64.0612281156120.25888@lilith.rgb.private.net>
	<200612290939.59593.csamuel@vpac.org>
	<20061229005749.GA13471@galactic.demon.co.uk>
	<Pine.LNX.4.64.0612290247590.9199@lilith.rgb.private.net>
	<m3hcv8z8hg.fsf@unna.nsc.liu.se>
	<Pine.LNX.4.64.0701030828480.9199@lilith.rgb.private.net>
	<20070107112230.GA7654@galactic.demon.co.uk>
	<45A10C69.4030908@scalableinformatics.com>
	<m3y7odzspn.fsf@unna.nsc.liu.se>
	<45A26142.9050408@scalableinformatics.com>
Message-ID: <Pine.LNX.4.64.0701081510540.26424@lilith.rgb.private.net>

On Mon, 8 Jan 2007, Joe Landman wrote:

> The idea is you minimize the exposed footprint of the machine to threat
> facing access.  This is why lots of the secure sites are disabling USB
> ports on the motherboards (but mistakenly then running systems which can
> install keyloggers and other malware ... ).  If the USB does not
> electrically work, it is not a possible attack vector.

(Ignoring the rest of Joe's quite excellent security summary, which for
the most part I completely agree with although I'm much more willing to
say the word "Microsoft" than he is, apparently.  Microsoft.  Microsoft.
Microsoft <poof, they disappear>:-)

This part reminds me of parts of Neal Stepheson's "Cryptonomicon" and
trans-ubercrackerdom.  In principle, every time you type a key, you
generate a tiny electrical signal with an associated EM pulse signature.
Some portions of the energy associated with the signal are immediately
radiated into the surrounding environment, where they are e.g. absorbed
by components on the motherboard, others cause tiny fluctuations in the
power draw.  In all cases there exist amplifiers and feedback loops that
can cause those signals to modulate existing signals and noise.  Indeed,
if you run your system's microphone on at high gain (whether or not the
microphone is plugged in) and listen to audio noise, you can usually
actually hear some of the noise modulation produced by your typing as
you do so.

In principle those modulations can be isolated from the generic noise
and signal mix on e.g. the power lines, ambient phone lines, external
high gain EM antennae, and so on.  Or in another of my favorite spy
methologies, one can bounce lasers off the external windows or
microwaves off of the walls of a house, do a fairly simple
autocorrelative deconvolution of the reflected signal, and pick up e.g.
human conversation or the noise of keyboarding from inside.  Since
humans tend to type keys in patterns and frequencies that can (with some
effort) be stochastically analyzed and matched to keystrokes, if
somebody REALLY REALLY WANTS TO they can very likely snoop on your
system activity in some pretty extraordinary ways.  Ditto in principle
one can often recover whole histories of read/write behavior from hard
disks by working hard enough on analyzing the residual magnetization
distribution of magnetic domains.  The "physics" of systems isn't really
designed to be secure, it is designed not to annoy people or other
hardware devices with EM noise above a certain intensity in certain
frequency ranges.  BELOW those intensity ranges there is a wide expanse
of in-principle detectable.

So who wants to this badly (cracking and snooping at this level isn't
cheap)?  Bad people where there is a lot of money at stake are one
possibility -- maybe it is time for another Neal Stephenson novel where
the world's largest bank heist takes out the fortune of a well-known
multibillionaire computer geek who foolishly allows his online access to
enormous amounts of money to be keylogged in many different ways, or
where bank officers or bank IT systems are systematically compromised in
this way.  Banks tend to be paranoid enough to completely isolate their
core systems -- NO external network, careful filtering of all power
supplies, NO windows, NO external walls, checks on checks at all human
levels.  Also the military and government, where some secrets are worth
more than money on both sides -- as the cracker (of e.g. al queda
systems, if any are known) and defending against crackers.  Again, I'm
fairly certain that most of the NSA's systems are locked down against
all of this sort of thing and still more, with systems people that are
paranoid even by the borderline personality standards of that insanely
paranoid profession...

SO it isn't just keeping good passwords or being a boy scout or
monitoring a system carefully.  Der Ubercracker is, almost by
definition, always one leg up on you.  The only thing that stops them
from cracking you is the investment in time and other resources
involved, or the risk of negative penalties if they are discovered
trying (which can be minimized by investing more heavily in the effort,
etc.).  I absolutely agree with Joe's basic approach -- inform everybody
that avoiding data theft is a matter of investment and CBA on BOTH sides
of the line -- you have to protect the data on the basis of what it is
worth.  Beyond that, the smart thing to do is engineer the system so
that if you are cracked, in some sense you do not care.  Your data is
backed up (multiply, redundantly, over a long enough time interval that
you can go back before the cracker entered and work forward cleaning as
you go).  Your systems can be reinstalled "instantly" (see previous
discussion on automated scalable install and maintenance).  Your servers
are sufficiently tough and you watch things sufficiently carefully that
you probably didn't get cracked there and if you did, well, you
reinstall them from scratch too (off the network, taking real care to
clean up the primary means of entry as you do so).

A good design can make being cracked ALMOST a non-event for less than
ubercrackers (who are so good at encapsulation that you may never know
they are there, or can only tell that they are there by passively
monitoring raw network traffic from an uncompromised box).  They get in,
you catch them quickly, reinstall the compromised machine and freeze the
compromised account (pending a talk with the user, sucker rod in
hand:-), and go on with life.

   rgb

-- 
Robert G. Brown	                       http://www.phy.duke.edu/~rgb/
Duke University Dept. of Physics, Box 90305
Durham, N.C. 27708-0305
Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu


From csamuel at vpac.org  Mon Jan  8 14:38:06 2007
From: csamuel at vpac.org (Chris Samuel)
Date: Tue, 9 Jan 2007 09:38:06 +1100
Subject: [Beowulf] Which distro for the cluster?
In-Reply-To: <m3y7odzspn.fsf@unna.nsc.liu.se>
References: <1d151d3b0612270946m2b09039ct538339e487cad6e8@mail.gmail.com>
	<45A10C69.4030908@scalableinformatics.com>
	<m3y7odzspn.fsf@unna.nsc.liu.se>
Message-ID: <200701090938.13323.csamuel@vpac.org>

On Monday 08 January 2007 22:03, Leif Nixon wrote:

> I see this in hospitals a lot. Some healthcare systems can't be
> patched without reapplying for FDA approval, which is of course a
> hideously complicated process.

I've seen this with accredited firewalls where any patches require the systems 
to be re-accredited.    Consequently I've seen places put a non-accredited 
firewall in front of their mandated accredited one to protect it. :-)

cheers,
Chris
-- 
 Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
 Victorian Partnership for Advanced Computing http://www.vpac.org/
 Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070109/bad00369/attachment.sig>

From csamuel at vpac.org  Mon Jan  8 15:01:35 2007
From: csamuel at vpac.org (Chris Samuel)
Date: Tue, 9 Jan 2007 10:01:35 +1100
Subject: [Beowulf] OT: Software RAID & Multipath
Message-ID: <200701091001.36100.csamuel@vpac.org>

This is about a storage node for a cluster, so it's partly on topic.. :-)

Through happy coincidence we now have a box with two FC cards going to a SAN 
switch and thence into each side of an IBM FAStT 600 (doing H/W RAID5).  The 
FAStT is partitioned into two 1.6TB lumps and each FC card can see both 
controllers on the FAStT (for failover).

Booting a live CD shows me that the multipath-tools package automatically 
detects it has two paths and sets this up appropriately (very nice).

Now, if I wanted to stripe accesses to the FAStT down each controller I seem 
to have two options:

1) Use software RAID-0 with MD.  My concern then is that I don't know whether 
the RAID-0 will kick in *before* the multipath and think it can stripe over 4 
drives (which would be bad).

2) Add both multipath'd partitions as PV's to LVM2 and then every time we 
create a new logical volume we MUST remember to specify the stripe option for 
lvcreate for it to work.  I also don't know how efficient or reliable it is..

Thoughts anyone ?

cheers!
Chris
-- 
 Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
 Victorian Partnership for Advanced Computing http://www.vpac.org/
 Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070109/11cd933e/attachment.sig>

From gmpc at sanger.ac.uk  Tue Jan  9 01:31:44 2007
From: gmpc at sanger.ac.uk (Guy Coates)
Date: Tue, 09 Jan 2007 09:31:44 +0000
Subject: [Beowulf] OT: Software RAID & Multipath
In-Reply-To: <200701091001.36100.csamuel@vpac.org>
References: <200701091001.36100.csamuel@vpac.org>
Message-ID: <45A36100.4080906@sanger.ac.uk>

Chris Samuel wrote:
> This is about a storage node for a cluster, so it's partly on topic.. :-)
> 
> Through happy coincidence we now have a box with two FC cards going to a SAN 
> switch and thence into each side of an IBM FAStT 600 (doing H/W RAID5).  The 
> FAStT is partitioned into two 1.6TB lumps and each FC card can see both 
> controllers on the FAStT (for failover).


> Booting a live CD shows me that the multipath-tools package automatically 
> detects it has two paths and sets this up appropriately (very nice).
> 
> Now, if I wanted to stripe accesses to the FAStT down each controller I seem 
> tohave two options:

You should be able to do this entirely within the  dm-multipath layer. It can
deal with multiple controllers as well as multiple fabrics.

In dual controller/dual fabric setups, multipath should create  four paths for
each lun  (2 fabrics * 2 controllers = 4 paths).

However, not all dual controller arrays are active-active, some are active-passive.


If the controller really is active-active, then the dm-multipath should
round-robbin IO across all 4 paths (you can check that with iostat).

(Example multipath output from an HP EVA8000)

[size=3726 GB][features=1 queue_if_no_path][hwhandler=0]
\_ round-robin 0 [prio=4][enabled]
 \_ 1:0:2:1 sdab 65:176 [active][ready]          <--- Green Fabric, controller A
 \_ 0:0:2:1 sdaa 65:160 [active][ready]          <----Red fabric, controller A
 \_ 0:0:3:1 sdac 65:192 [active][ready]          <----Red fabric, controller B
 \_ 1:0:3:1 sdz  65:144 [active][ready]          <----Green fabric, controller B


If the controller is active-passive, then you will see 2 active paths, and 2
passive paths. (this example is from an HP EVA5000, where passive paths are
labelled "ghost." The output may be different for other models of controller)

[size=100 GB][features=1 queue_if_no_path][hwhandler=0]
\_ round-robin 0 [prio=2][enabled]
 \_ 0:0:0:2 sdb  8:16   [active][ready]    <---green fabric, controller A
 \_ 1:0:1:2 sdk  8:160  [active][ready]    <---red fabric, controller A
\_ round-robin 0 [prio=2][enabled]
 \_ 0:0:1:2 sde  8:64   [active][ghost]    <---green fabric, controller B
 \_ 1:0:0:2 sdh  8:112  [active][ghost]    <----red fabric, controller B


If you are seeing something different, you need to mess with the
path_grouping_policy and path_checker values in the multipath.conf file.  The
exact values depends on the exact model disk-controller you are using.  I don't
have any IBM storage, so can't help you with the exact values I'm afraid.

The people on the dm-devel list should be able to help you with the actual
values to use. http://sources.redhat.com/dm/


Cheers,

Guy

-- 
Dr. Guy Coates,  Informatics System Group
The Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1HH, UK
Tel: +44 (0)1223 834244 x 6925
Fax: +44 (0)1223 496802


From nixon at nsc.liu.se  Tue Jan  9 04:30:46 2007
From: nixon at nsc.liu.se (Leif Nixon)
Date: Tue, 09 Jan 2007 13:30:46 +0100
Subject: [Beowulf] Which distro for the cluster?
In-Reply-To: <45A26142.9050408@scalableinformatics.com> (Joe Landman's message
	of "Mon, 08 Jan 2007 10:20:34 -0500")
References: <1d151d3b0612270946m2b09039ct538339e487cad6e8@mail.gmail.com>
	<Pine.LNX.4.64.0612281156120.25888@lilith.rgb.private.net>
	<200612290939.59593.csamuel@vpac.org>
	<20061229005749.GA13471@galactic.demon.co.uk>
	<Pine.LNX.4.64.0612290247590.9199@lilith.rgb.private.net>
	<m3hcv8z8hg.fsf@unna.nsc.liu.se>
	<Pine.LNX.4.64.0701030828480.9199@lilith.rgb.private.net>
	<20070107112230.GA7654@galactic.demon.co.uk>
	<45A10C69.4030908@scalableinformatics.com>
	<m3y7odzspn.fsf@unna.nsc.liu.se>
	<45A26142.9050408@scalableinformatics.com>
Message-ID: <m3bql8gz61.fsf@unna.nsc.liu.se>

Joe Landman <landman at scalableinformatics.com> writes:

> I think there are two different issues.  First: security is meant to be
> an access control and thottle/choke point.  Second: is how you view your
> cluster.  Is it "one-big-machine" in some sense (not necessarily Scyld,
> but with a security model such that if you are on the access node you
> are on the machine), or is it really a collection of individual machines
> each with their own administrative domain?  One of these models works
> really well for "cluster" use.

I don't think it's quite that black and white. You can have the
cluster appear as a single security domain to the user, while still
maintaining some internal barriers. Even if you have passwordless ssh
access across the cluster for ordinary users, you probably should have
restrictions for root access - even if an attacker can root the login
node, he shouldn't be able to just ssh as root to any machine. Yes, if
he can root the login node, he can probably root the other nodes as
well, but let him work for it.

>>> It all boils down to a CBA (as everything does).  Upgrading carries
>>> risk, no matter who does it, and how carefully things are packaged.  The
>>> CBA equation should look something like this:
>>>
>>> 	value_of_upgrade = positive_benefits_of_upgrade -
>>> 			   potential_risks_of_upgrade
>> 
>> With the security benefits being really hard to quantify. 
>
> Not really.  If you have a huge gaping hole that needs patching (OpenSSL
> off-by-one or weakness), the benefits are easy.

The security benefits of the upgrade (or, rather, the costs of *not*
performing the upgrade) is something like

  benefits = potential_damages_from_exploit * risk_of_exploit

Trying to estimate the risk of somebody exploiting a particular
vulnerability can be very hard. 

>> I don't get this. What's the point of having a "secure" frontend if
>> the systems behind it are insecure? OK, there's one big point -
>> hopefully you can buy some time - but other than that? 
>
> Its the model of how you use the machine.  If you lock all the doors
> tight with impenetrable seals, and the attacker goes through the weaker
> windows, those impenetrable seals haven't done much for you.

Exactly. But you seem to propose to seal the login node tight, but
leave the windows on the compute nodes ajar.

> The idea is you minimize the exposed footprint of the machine to threat
> facing access.

Yeah. But we seem to have different opinions on where the threat is.
It isn't just the Internet connected login node that is exposed to
threat. Even if you think you can trust your users, each and every
remote login session might actually be a hijacked account.

It's all very well saying that "If your system can be keylogged, it
should never ever be on a network, anywhere.", but I'm afraid that's
just wishful thinking. In actuality a huge proportion of Windows
systems are malware infested *AND* there have been large password
theft attacks against Unix systems in the last few years, using ssh
trojans and X-based keyloggers. This is what reality looks like, and
we have to deal with it.

We all know it's impossible to lock down a system completely. You
always have to make trade-offs and risk assessments. I'm not arguing
for a system where the users have to turn up in person and deliver
their jobs on punchcards. Rather, I think my two main points are:

a) Defense-in-depth. Relying on perimeter defense is so 20th century.
The Windows world is starting to discover this, and I think we should
learn from them. Putting all your effort into one big barrier is the
wrong way to build security. The attackers should have an uphill
struggle *all* the way - "we shall fight on the beaches, we shall
fight on the landing grounds, we shall fight in the fields and in the
streets, we shall fight in the hills; we shall never surrender"

b) The main threat has changed. You still have to protect yourself
against remote exploits, but for a cluster that exposes few services
this is no big problem. Instead, our main headache is now protection
against *local* attacks through identity theft.

-- 
Leif Nixon                       -            Systems expert
------------------------------------------------------------
National Supercomputer Centre    -      Linkoping University
------------------------------------------------------------


From nixon at nsc.liu.se  Tue Jan  9 04:35:32 2007
From: nixon at nsc.liu.se (Leif Nixon)
Date: Tue, 09 Jan 2007 13:35:32 +0100
Subject: [Beowulf] OT: Software RAID & Multipath
In-Reply-To: <200701091001.36100.csamuel@vpac.org> (Chris Samuel's message of
	"Tue, 9 Jan 2007 10:01:35 +1100")
References: <200701091001.36100.csamuel@vpac.org>
Message-ID: <m37ivwgyy3.fsf@unna.nsc.liu.se>

Chris Samuel <csamuel at vpac.org> writes:

> Now, if I wanted to stripe accesses to the FAStT down each controller I seem 
> to have two options:
>
> 1) Use software RAID-0 with MD. My concern then is that I don't know
> whether the RAID-0 will kick in *before* the multipath and think it
> can stripe over 4 drives (which would be bad).

I suspect this is deprecated these days, but I have handled situations
like this by using the *MD* multipath support instead. Then you can
explicitly define your multipath devices and stripe them together, all
in /etc/mdadm.conf.

-- 
Leif Nixon                       -            Systems expert
------------------------------------------------------------
National Supercomputer Centre    -      Linkoping University
------------------------------------------------------------


From landman at scalableinformatics.com  Tue Jan  9 07:14:59 2007
From: landman at scalableinformatics.com (Joe Landman)
Date: Tue, 09 Jan 2007 10:14:59 -0500
Subject: [Beowulf] Which distro for the cluster?
In-Reply-To: <m3bql8gz61.fsf@unna.nsc.liu.se>
References: <1d151d3b0612270946m2b09039ct538339e487cad6e8@mail.gmail.com>	<Pine.LNX.4.64.0612281156120.25888@lilith.rgb.private.net>	<200612290939.59593.csamuel@vpac.org>	<20061229005749.GA13471@galactic.demon.co.uk>	<Pine.LNX.4.64.0612290247590.9199@lilith.rgb.private.net>	<m3hcv8z8hg.fsf@unna.nsc.liu.se>	<Pine.LNX.4.64.0701030828480.9199@lilith.rgb.private.net>	<20070107112230.GA7654@galactic.demon.co.uk>	<45A10C69.4030908@scalableinformatics.com>	<m3y7odzspn.fsf@unna.nsc.liu.se>	<45A26142.9050408@scalableinformatics.com>
	<m3bql8gz61.fsf@unna.nsc.liu.se>
Message-ID: <45A3B173.5060205@scalableinformatics.com>


Leif Nixon wrote:
> Joe Landman <landman at scalableinformatics.com> writes:
> 
>> I think there are two different issues.  First: security is meant to be
>> an access control and thottle/choke point.  Second: is how you view your
>> cluster.  Is it "one-big-machine" in some sense (not necessarily Scyld,
>> but with a security model such that if you are on the access node you
>> are on the machine), or is it really a collection of individual machines
>> each with their own administrative domain?  One of these models works
>> really well for "cluster" use.
> 
> I don't think it's quite that black and white. You can have the
> cluster appear as a single security domain to the user, while still
> maintaining some internal barriers. Even if you have passwordless ssh
> access across the cluster for ordinary users, you probably should have
> restrictions for root access - even if an attacker can root the login
> node, he shouldn't be able to just ssh as root to any machine. Yes, if
> he can root the login node, he can probably root the other nodes as
> well, but let him work for it.

I think we are delving into design areas now.

Login nodes are not and should not be administrative nodes.  That is, do
not trust login nodes for non-end-user accounts.  This is a nice idea,
and sadly, not implemented in most practice.  Rocks and other cluster
distros happily enable end user login to the cluster administrative
node.  A login node is like a compute node, though bad-people (tm) can
get to it.  Which means you should trust it less.  If you fuse this with
an admin node, then you increase your risk.

>>>> It all boils down to a CBA (as everything does).  Upgrading carries
>>>> risk, no matter who does it, and how carefully things are packaged.  The
>>>> CBA equation should look something like this:
>>>>
>>>> 	value_of_upgrade = positive_benefits_of_upgrade -
>>>> 			   potential_risks_of_upgrade
>>> With the security benefits being really hard to quantify. 
>> Not really.  If you have a huge gaping hole that needs patching (OpenSSL
>> off-by-one or weakness), the benefits are easy.
> 
> The security benefits of the upgrade (or, rather, the costs of *not*
> performing the upgrade) is something like
> 
>   benefits = potential_damages_from_exploit * risk_of_exploit

Yes, exactly.  This is precisely correct.

> 
> Trying to estimate the risk of somebody exploiting a particular
> vulnerability can be very hard. 

No.  Follow the cert/secunia/... lists.  See what is being exploited in
the wild.  Won't be perfect, but if it is not being exploited (not that
cert et al are perfect or reliable ahead of time), or is very hard if
not impossible to exploit (e.g. the cache based back channel attack on
SMP systems), then your risk is low.  Risk is inversely proportional to
the ease of exploit.  The easier it is to exploit, the higher the risk.


>>> I don't get this. What's the point of having a "secure" frontend if
>>> the systems behind it are insecure? OK, there's one big point -
>>> hopefully you can buy some time - but other than that? 
>> Its the model of how you use the machine.  If you lock all the doors
>> tight with impenetrable seals, and the attacker goes through the weaker
>> windows, those impenetrable seals haven't done much for you.
> 
> Exactly. But you seem to propose to seal the login node tight, but
> leave the windows on the compute nodes ajar.

Nope, the analogy is incorrect and inaccurate, as is your
characterization of what I am writing.  What I have pointed out is that
no matter how good you *think* your security model is,

a) it isnt that good
b) it is attackable
c) it is being attacked
d) it is being attacked in a way you didn't consider
e) your super-duper-ultra-fantastic model X security system on the door
does absolutely nothing for you if they come in through the air duct.

Sooner or later, unless you cut the tx line on the network card, someone
is going to compromise your system.  The only way around that is

a) never to transmit anything back to the user
b) never allow writing of bits anywhere for any reason.

Again, I have seen people guffaw at security threats relying heavily
upon a single or several measures, all of them similar (e.g. firewalls)
with a good appreciation that there are many more attack vectors than
through the front door.

The idea is, again, don't over-fortify one section believing that it
will be the only method of getting in.  Reduce your security footprint
an vulnerability.  Keep your risks low.  Understand that they will
eventually beat what you have in place, so your only real option is to
minimize the damage they can do.

>> The idea is you minimize the exposed footprint of the machine to threat
>> facing access.
> 
> Yeah. But we seem to have different opinions on where the threat is.
> It isn't just the Internet connected login node that is exposed to
> threat. Even if you think you can trust your users, each and every
> remote login session might actually be a hijacked account.

Yes.  It may be.  We might have different opinions of where the threat
is.  My belief is that the threat can come from any possible attack
vector.  This suggests one should contain the potential attack vectors
if possible.  If rsh is an external attack vector, then ask yourself if
you really need it.  If your ssh is being hit by dictionary attacks day
in and day out, ask yourself how it is being used, and contain the
operational modes to the minimum set you can (no ssh1, ...)

Hijacked accounts do happen.  Used to be from telnet/ftp/rsh access to
remote systems.  Now a-days it is from windows malware and keyloggers.

Worse is when it is purposeful insider attacks.  You cannot protect
against all attack vectors, you can protect against destruction of data
or configuration.  Data theft is harder to protect against.

Perimeter defenses do little for the insider attacks.

> It's all very well saying that "If your system can be keylogged, it
> should never ever be on a network, anywhere.", but I'm afraid that's
> just wishful thinking. In actuality a huge proportion of Windows
> systems are malware infested *AND* there have been large password

Yes, a huge proportion are infested.  Wishful thinking, no.  Security
begins with good security practices, and again, limiting damage
potential at the local level.  This in part means running in
least-privilege mode.  Unfortunately for some of the systems, it is not
possible to do this, and have a useful system, due in large part to its
(mis)design.

> theft attacks against Unix systems in the last few years, using ssh
> trojans and X-based keyloggers. This is what reality looks like, and
> we have to deal with it.

Heck, most of the password theft against unix systems comes from open
telnet, ftp, pop, and imap servers, and a little network sniffer.  As
late as mid last year, I was asked to show a customer what happens if
you stick a little packet sniffer on their net.  Doesn't even need an
IP.  Just have that going while they are on it, and then have them look
at the screen as they log into their mail.  They were convinced their
big (switch manufacturers name elided) switch would save them due to its
advanced security features.

You rely upon a perimeter defense and the (intelligent) attackers will
choose non-perimeter vectors.  This has the impact of rendering your
perimeter defense useless.

I like telling people that systems designed to fail often do.

Perimeter defenses are Maginot lines
(http://en.wikipedia.org/wiki/Maginot_Line).  They are the definition,
the poster child, of a failed *total* defense design.  A perimeter
defense is a speed bump to a determined hacker, it is a defensive
element, not a defense in and of itself.  As long as you accept that
they *will* get through these, you have to think about adding depth to
the speed bump.  My point is, adding additional perimeters may not be
the best approach to add this depth.  I think this is where we disagree,
as the sense I get is that you may believe that additional perimeters
are great for defensive depth.

> We all know it's impossible to lock down a system completely. You
> always have to make trade-offs and risk assessments. I'm not arguing

Yes.  This is/was my point.  You cannot *ever* lock it down.  You must
assume it will be compromised at some point.

> for a system where the users have to turn up in person and deliver
> their jobs on punchcards. Rather, I think my two main points are:
> 
> a) Defense-in-depth. Relying on perimeter defense is so 20th century.

Yes.  Agreed.  I am not arguing perimeter defenses.  I am pointing out
that enabling attack vectors by increasing your exposure footprint is
anathema to your ability to contain your risk.  Keep your perimeter as
small as possible.  Keep your attackable footprint as small as possible.
 Don't firewall rsh/telnet/etc... don't install them.  If they are not
there, they cannot be used as attack targets.

> The Windows world is starting to discover this, and I think we should
> learn from them. Putting all your effort into one big barrier is the
> wrong way to build security. The attackers should have an uphill

[scratch scratch]  Who is arguing for building a heavy door?  I am
arguing for minimizing the maximum damage.  In part you do this by
reducing your exposed surface area.  Keep as few threat-facing systems
as possible, and keep them patched, up to date, and don't trust them.

> struggle *all* the way - "we shall fight on the beaches, we shall
> fight on the landing grounds, we shall fight in the fields and in the
> streets, we shall fight in the hills; we shall never surrender"

Uh...  ok.   You seemed (maybe I misread or misunderstood you) that
multiple perimeters are the way to go.  I disagree with this.  I am also
of the opinion that "force" as it were, is best applied where it makes
the most sense.  Making the end users slog through using a system along
with the nasties seems not to be a solution that most would like.  There
are other alternatives, some very good, that limit the maximum possible
damage a user can do.

> b) The main threat has changed. You still have to protect yourself
> against remote exploits, but for a cluster that exposes few services
> this is no big problem. 

Complacency or a lack of profound paranoia is the first step down the
slope of (mistakenly) believing your systems are secure.  Keeping as few
services on the exposed net limits the attack vectors.  But it does not
make it secure.  Limiting the damage that can be done also doesn't make
it secure.  It just reduces the impact of the cleanup.

> Instead, our main headache is now protection
> against *local* attacks through identity theft.

Yeah...  well, it is my understanding that the insider attacks
(committed and trusted people with nefarious intent) as well as identity
theft are the fastest growing crimes.  Basing a security model upon an
identity that can be stolen (say from a USB key inserted into a
compromised machine, or keylogged, or ...) is problematic.  Including
additional factors that require possession of multiple critical
elements, including those that are never linked together (SecureID and
alike cards), is much better.  Unfortunately you cannot use such things
to protect against the determined internal attacker.  Your employee is
annoyed that someone else got a raise/promotion/ata-boy, so they decide
to steal and sell your design for super-duper-widget to your competitor.
 This employee is trusted.  How do you prevent this?  Or, you use single
factor (ssh key) authentication to get in.  Someone's keys are stolen
through a USB fob they think is secure that they run putty from, yet was
inserted into a zombified PC at a university.  Now the bad-guys (tm)
have access in.  They can use resources, delete files, alter content.

If someone can explain precisely how to protect against these scenarios
without using multi-factor (disconnected) authentication methods, I
would love to hear it.  But even if the new protection scheme fails (and
it will), how do you limit the damage?

If you can blast past the defenses, and you have ownership of the
cluster, the game is over.  If you prevent this from happening by
limiting the maximum damage that can ever be done (no, won't be perfect,
but a heck-of-a-lot better than not having it).

Again, it sounds like we may be agreeing more than we disagree.  I am
not advocating a perimeter model.  I am advocating forcing attacks to
use fewer numbers of vectors (smaller defense perimeter).  I am also
advocating reducing the potential damage an attacker who breaks through
can do.  Force them to channel their attacks, and limit their prize
should they win.  Sun Tzu explained this in his book, and it is worth
taking into consideration.

-- 
Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics LLC,
email: landman at scalableinformatics.com
web  : http://www.scalableinformatics.com
phone: +1 734 786 8423
fax  : +1 734 786 8452 or +1 866 888 3112
cell : +1 734 612 4615


From nixon at nsc.liu.se  Tue Jan  9 08:41:20 2007
From: nixon at nsc.liu.se (Leif Nixon)
Date: Tue, 09 Jan 2007 17:41:20 +0100
Subject: [Beowulf] Which distro for the cluster?
In-Reply-To: <45A3B173.5060205@scalableinformatics.com> (Joe Landman's message
	of "Tue, 09 Jan 2007 10:14:59 -0500")
References: <1d151d3b0612270946m2b09039ct538339e487cad6e8@mail.gmail.com>
	<Pine.LNX.4.64.0612281156120.25888@lilith.rgb.private.net>
	<200612290939.59593.csamuel@vpac.org>
	<20061229005749.GA13471@galactic.demon.co.uk>
	<Pine.LNX.4.64.0612290247590.9199@lilith.rgb.private.net>
	<m3hcv8z8hg.fsf@unna.nsc.liu.se>
	<Pine.LNX.4.64.0701030828480.9199@lilith.rgb.private.net>
	<20070107112230.GA7654@galactic.demon.co.uk>
	<45A10C69.4030908@scalableinformatics.com>
	<m3y7odzspn.fsf@unna.nsc.liu.se>
	<45A26142.9050408@scalableinformatics.com>
	<m3bql8gz61.fsf@unna.nsc.liu.se>
	<45A3B173.5060205@scalableinformatics.com>
Message-ID: <m3zm8sf8zz.fsf@unna.nsc.liu.se>

Joe Landman <landman at scalableinformatics.com> writes:

> Login nodes are not and should not be administrative nodes.  That is, do
> not trust login nodes for non-end-user accounts.  This is a nice idea,
> and sadly, not implemented in most practice.  Rocks and other cluster
> distros happily enable end user login to the cluster administrative
> node.  A login node is like a compute node, though bad-people (tm) can
> get to it.  Which means you should trust it less.  If you fuse this with
> an admin node, then you increase your risk.

Full agreement here.

>> Trying to estimate the risk of somebody exploiting a particular
>> vulnerability can be very hard. 
>
> No.  Follow the cert/secunia/... lists.  See what is being exploited in
> the wild.  Won't be perfect, but if it is not being exploited (not that
> cert et al are perfect or reliable ahead of time), or is very hard if
> not impossible to exploit (e.g. the cache based back channel attack on
> SMP systems), then your risk is low.  Risk is inversely proportional to
> the ease of exploit.  The easier it is to exploit, the higher the risk.

OK, let's say it's just hard for me, then. 8^) I think there are
obvious high-risk vulnerabilities (remote exploits in sshd) and
obvious low-risk ones (like your example), and then a sea of in-betweens.

>>>> I don't get this. What's the point of having a "secure" frontend if
>>>> the systems behind it are insecure? OK, there's one big point -
>>>> hopefully you can buy some time - but other than that? 
>>> Its the model of how you use the machine.  If you lock all the doors
>>> tight with impenetrable seals, and the attacker goes through the weaker
>>> windows, those impenetrable seals haven't done much for you.
>> 
>> Exactly. But you seem to propose to seal the login node tight, but
>> leave the windows on the compute nodes ajar.
>
> Nope, the analogy is incorrect and inaccurate, as is your
> characterization of what I am writing.

Sorry, not intentional. Reading the rest of your post it seems we are
mostly in agreement.

I might have been reading too much into something you wrote a bit
earlier in the thread:

| You have a perfectly valid reason to upgrade threat facing nodes. Keep
| them as minimal and as up-to-date as possible. The non-threat facing
| nodes, this makes far less sense. If you are doing single factor
| authentication, and have enabled passwordless access within the
| cluster: ssh keys or certificates or ssh-agent based, once a machine
| that holds these has been compromised, the game is over.

I interpreted this as saying "compute node vulnerabilities aren't that
important as long as the login node is secure", and this is what I've
been arguing against. Basically, there *aren't* any non-threat facing
nodes. But I guess I'm misunderstanding you.

> You seemed (maybe I misread or misunderstood you) that
> multiple perimeters are the way to go.  I disagree with this.  I am also
> of the opinion that "force" as it were, is best applied where it makes
> the most sense.  Making the end users slog through using a system along
> with the nasties seems not to be a solution that most would like.  There
> are other alternatives, some very good, that limit the maximum possible
> damage a user can do.

I'm not sure what you mean by "multiple perimeters", but I suspect I'm
not proposing them. 8^) 

There is a mindset (which I'm carefully not accusing you of sharing)
which leads people to say things like "There's no point in fixing
$VULNERABILITY, because to exploit it the attacker must have root on
$MACHINE_X, and then we are already screwed". This irritates me.
Instead, assume $MACHINE_X *will* be rooted and try to limit the
damage the attacker can cause and make him work uphill (thus my
Churchill quote about beaches and so forth). But this is of course
exactly what you are saying.

-- 
Leif Nixon                       -            Systems expert
------------------------------------------------------------
National Supercomputer Centre    -      Linkoping University
------------------------------------------------------------


From buccaneer at rocketmail.com  Tue Jan  9 12:26:54 2007
From: buccaneer at rocketmail.com (Buccaneer for Hire.)
Date: Tue, 9 Jan 2007 12:26:54 -0800 (PST)
Subject: [Beowulf] Which distro for the cluster?
In-Reply-To: <45A3B173.5060205@scalableinformatics.com>
Message-ID: <20070109202654.12482.qmail@web30608.mail.mud.yahoo.com>

> a) it isnt that good
> b) it is attackable
> c) it is being attacked
> d) it is being attacked in a way you didn't consider
> e) your super-duper-ultra-fantastic model X security
> system on the door
> does absolutely nothing for you if they come in
> through the air duct.

Or if they are already sitting at your kitchen table.

__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


From landman at scalableinformatics.com  Tue Jan  9 12:31:39 2007
From: landman at scalableinformatics.com (Joe Landman)
Date: Tue, 09 Jan 2007 15:31:39 -0500
Subject: [Beowulf] Which distro for the cluster?
In-Reply-To: <20070109202654.12482.qmail@web30608.mail.mud.yahoo.com>
References: <20070109202654.12482.qmail@web30608.mail.mud.yahoo.com>
Message-ID: <45A3FBAB.8050208@scalableinformatics.com>

Buccaneer for Hire. wrote:
>> a) it isnt that good
>> b) it is attackable
>> c) it is being attacked
>> d) it is being attacked in a way you didn't consider
>> e) your super-duper-ultra-fantastic model X security
>> system on the door
>> does absolutely nothing for you if they come in
>> through the air duct.
> 
> Or if they are already sitting at your kitchen table.

Yup.  My understanding is that some sizable fraction of threat comes 
from disgruntled (as opposed to gruntled?) employees.

I wrote later in that same post

>> Worse is when it is purposeful insider attacks.  You cannot protect
>> against all attack vectors, you can protect against destruction of data
>> or configuration.  Data theft is harder to protect against.

>> Perimeter defenses do little for the insider attacks.


-- 

Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics LLC,
email: landman at scalableinformatics.com
web  : http://www.scalableinformatics.com
phone: +1 734 786 8423
fax  : +1 734 786 8452 or +1 866 888 3112
cell : +1 734 612 4615


From csamuel at vpac.org  Tue Jan  9 15:42:51 2007
From: csamuel at vpac.org (Chris Samuel)
Date: Wed, 10 Jan 2007 10:42:51 +1100
Subject: [Beowulf] OT: Software RAID & Multipath
In-Reply-To: <200701091001.36100.csamuel@vpac.org>
References: <200701091001.36100.csamuel@vpac.org>
Message-ID: <200701101042.51839.csamuel@vpac.org>

On Tuesday 09 January 2007 10:01, Chris Samuel wrote:

> 1) Use software RAID-0 with MD. ?My concern then is that I don't know
> whether the RAID-0 will kick in *before* the multipath and think it can
> stripe over 4 drives (which would be bad).

I gave it a go myself and the easiest solution is the above, just used mdadm 
to create a pair of multipath RAID devices (md0 and md1) for each path to the 
visible LUNs, then mdadm'd a RAID-0 device (md2) over md0 and md1.

Then pvcreate /dev/md2, vgcreate RAID /dev/md2 and away you go with sharpened 
mallets..

I cribbed the MD multipath stuff from:

  http://oss.gonicus.de/openpower/index.php/Dm-multipath-vscsi

which I found after posting the original email.

I did try using multipath-tools, but LVM was starting up before multipath (as 
I was using LVM for the Ubuntu install proper) and it was grabbing the raw 
devices, noticing the duplicate pv labels and only picking one SCSI device of 
each pair to use.

I could have hacked around it, but I wanted something that just worked and 
wouldn't catch us out later after a dist-upgrade and reboot..

cheers,
Chris
-- 
 Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
 Victorian Partnership for Advanced Computing http://www.vpac.org/
 Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070110/e569f48e/attachment.sig>

From csamuel at vpac.org  Tue Jan  9 15:44:58 2007
From: csamuel at vpac.org (Chris Samuel)
Date: Wed, 10 Jan 2007 10:44:58 +1100
Subject: [Beowulf] OT: Software RAID & Multipath
In-Reply-To: <m37ivwgyy3.fsf@unna.nsc.liu.se>
References: <200701091001.36100.csamuel@vpac.org>
	<m37ivwgyy3.fsf@unna.nsc.liu.se>
Message-ID: <200701101044.58753.csamuel@vpac.org>

On Tuesday 09 January 2007 23:35, Leif Nixon wrote:

> I suspect this is deprecated these days, but I have handled situations
> like this by using the *MD* multipath support instead. Then you can
> explicitly define your multipath devices and stripe them together, all
> in /etc/mdadm.conf.

I don't think it's deprecated at all, but I don't believe that mdadm.conf is 
used like that these days, it's generally just done with the mdadm --create 
command lines I believe..

cheers!
Chris
-- 
 Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
 Victorian Partnership for Advanced Computing http://www.vpac.org/
 Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070110/fc94511d/attachment.sig>

From nixon at nsc.liu.se  Wed Jan 10 00:45:45 2007
From: nixon at nsc.liu.se (Leif Nixon)
Date: Wed, 10 Jan 2007 09:45:45 +0100
Subject: [Beowulf] OT: Software RAID & Multipath
In-Reply-To: <200701101044.58753.csamuel@vpac.org> (Chris Samuel's message of
	"Wed, 10 Jan 2007 10:44:58 +1100")
References: <200701091001.36100.csamuel@vpac.org>
	<m37ivwgyy3.fsf@unna.nsc.liu.se> <200701101044.58753.csamuel@vpac.org>
Message-ID: <m3vejffex2.fsf@unna.nsc.liu.se>

Chris Samuel <csamuel at vpac.org> writes:

> On Tuesday 09 January 2007 23:35, Leif Nixon wrote:
>
>> I suspect this is deprecated these days, but I have handled situations
>> like this by using the *MD* multipath support instead. Then you can
>> explicitly define your multipath devices and stripe them together, all
>> in /etc/mdadm.conf.
>
> I don't think it's deprecated at all, but I don't believe that mdadm.conf is 
> used like that these days, it's generally just done with the mdadm --create 
> command lines I believe..

For some combinations of kernels, HBA drivers and Red Hat initscripts,
mdadm's automatic assembly has failed for me, so I tend to define RAID
devices explicitly.

-- 
Leif Nixon                       -            Systems expert
------------------------------------------------------------
National Supercomputer Centre    -      Linkoping University
------------------------------------------------------------


From scheinin at crs4.it  Wed Jan 10 01:48:54 2007
From: scheinin at crs4.it (Alan Louis Scheinine)
Date: Wed, 10 Jan 2007 10:48:54 +0100
Subject: [Beowulf] OT: Software RAID & Multipath
In-Reply-To: <200701101044.58753.csamuel@vpac.org>
References: <200701091001.36100.csamuel@vpac.org>	<m37ivwgyy3.fsf@unna.nsc.liu.se>
	<200701101044.58753.csamuel@vpac.org>
Message-ID: <45A4B686.8030602@crs4.it>

I just did RAID once with mdadm and I noticed that everything
could be done with a command line.  If I recall correctly, there
is an option for having the current configuration written-out in
the syntax that can be used for mdadm.conf.  With regard to the
comment "it's generally just done with the mdadm --create
command lines I believe ..." I would like to point out that
creating an mdadm.conf after configuring would be useful for
recovery from a hardware failure.

Chris Samuel wrote:
> On Tuesday 09 January 2007 23:35, Leif Nixon wrote:
>> I suspect this is deprecated these days, but I have handled situations
>> like this by using the *MD* multipath support instead. Then you can
>> explicitly define your multipath devices and stripe them together, all
>> in /etc/mdadm.conf.
> I don't think it's deprecated at all, but I don't believe that mdadm.conf is 
> used like that these days, it's generally just done with the mdadm --create 
> command lines I believe..
> cheers!
> Chris


From jakob at unthought.net  Wed Jan 10 02:10:17 2007
From: jakob at unthought.net (Jakob Oestergaard)
Date: Wed, 10 Jan 2007 11:10:17 +0100
Subject: [Beowulf] OT: Software RAID & Multipath
In-Reply-To: <45A4B686.8030602@crs4.it>
References: <200701091001.36100.csamuel@vpac.org>
	<m37ivwgyy3.fsf@unna.nsc.liu.se>
	<200701101044.58753.csamuel@vpac.org> <45A4B686.8030602@crs4.it>
Message-ID: <20070110101016.GA2645@unthought.net>

On Wed, Jan 10, 2007 at 10:48:54AM +0100, Alan Louis Scheinine wrote:
> I just did RAID once with mdadm and I noticed that everything
> could be done with a command line.  If I recall correctly, there
> is an option for having the current configuration written-out in
> the syntax that can be used for mdadm.conf.  With regard to the
> comment "it's generally just done with the mdadm --create
> command lines I believe ..." I would like to point out that
> creating an mdadm.conf after configuring would be useful for
> recovery from a hardware failure.

Scans for array components and prints out an mdadm.conf:

mdadm --misc -D -s

-- 

 / jakob


From atp at piskorski.com  Wed Jan 10 06:21:33 2007
From: atp at piskorski.com (Andrew Piskorski)
Date: Wed, 10 Jan 2007 09:21:33 -0500
Subject: no 'commodity' OS is 'secure' Re: [Beowulf] Which distro for the
	cluster?
In-Reply-To: <Pine.LNX.4.64.0701071543170.5894@lilith.rgb.private.net>
References: <Pine.LNX.4.64.0701071543170.5894@lilith.rgb.private.net>
Message-ID: <20070110142133.GA63012@tehun.pair.com>

On Sun, Jan 07, 2007 at 03:49:50PM -0500, Robert G. Brown wrote:

> I completely agree with this.  As I pointed out earlier in the thread,
> companies such as banks make "conservative" seem downright radical when
> it comes to OS upgrades.  They have to do a complete, thorough,
> comprehensive security audit to change ANYTHING on their machines -- as
> a requirement in federal law, IIRC.  To get them to take you seriously,
> you MUST be prepared to support the OS they install on (once it is
> successfully audited) forever -- until the hardware itself falls apart
> into itty-bitty bits.

And yet these same hyper-'secure' organizations are running Microsoft
Windows, Linux, and/or Unix on these super important, super 'secure',
mission-critical boxes?  Frankly, that's oxymoronic.  It sounds
suspiciously like decision making driven by what the rules and
paperwork says you're supposed to do (aka, CYA), and/or general
myopia, rather than a sound assessment of what the right solution to
the real problem actually is.

We all know that Windows is (much) less secure than Linux, and Linux
is presumably less secure than OpenBSD.  But if you take a step back
and look at the bigger picture, OpenBSD and MS Windows are both in the
same bin, and that bin is labeled, "inherently unreliable and insecure
operating systems".

OpenBSD calls itself "ultra-secure", which is like calling the most
advanced World War II piston-engined fighter planes "ultra-fast".
Yes, it's true, more or less - as long as you're only talking about
other piston engined aircraft, and are content to ignore the existence
of jets and rockets.

It's not something I know much about, but I am told that much more
reliable and secure operating systems do exist, and have been
commercially successfull in niche markets, both now and in the past.
Niche markets like, say, the OS that runs your advanced pacemaker,
some network routers, or aerospace systems.

Now, I assume that using any such non-mainstream system is probably
(so far, to date) significantly more painful, annoying, and thus
expensive than just running Linux.  (And thus is unlikely to be
appropriate for a Beowulf cluster.)

But if you're a huge organization already throwing millions of dollars
into horribly painful manual re-audits of even trivial updates to
"commodity" operating systems for mission-critical "highly secure"
applications, then I strongly suspect that you're already well into
the same cost range where investing those $millions into the use of
secure-by-design systems might well make much more sense.

At some point, no matter how much you like Otto-cycle engines, putting
more and more money and effort into carefully tuning and inspecting
your turbo-supercharged, nitrous oxide injected, hand polished and
streamlined, piston-engined aircraft simply no longer makes sense.  If
you care that much, you should be looking into jets...

Like I said, I don't really know much about such secure-by-design
systems, but I've come across thought provoking discussion in various
places, including:

  http://www.coyotos.org/docs/osverify-2004/osverify-2004.html
  http://www.coyotos.org/docs/misc/linus-rebuttal.html
  http://www.eros-os.org/pipermail/cap-talk/2001-July/000604.html
  http://www.erights.org/talks/captp4omg/captp4omg/sld008.htm
  http://zesty.ca/capmyths/

-- 
Andrew Piskorski <atp at piskorski.com>
http://www.piskorski.com/


From jmdavis1 at vcu.edu  Wed Jan 10 06:58:58 2007
From: jmdavis1 at vcu.edu (Mike Davis)
Date: Wed, 10 Jan 2007 09:58:58 -0500
Subject: no 'commodity' OS is 'secure' Re: [Beowulf] Which distro for
	the	cluster?
In-Reply-To: <20070110142133.GA63012@tehun.pair.com>
References: <Pine.LNX.4.64.0701071543170.5894@lilith.rgb.private.net>
	<20070110142133.GA63012@tehun.pair.com>
Message-ID: <45A4FF32.7020308@vcu.edu>

1. Any OS can be made more secure.
2. Good Security is "Security in depth."
3. The perfect is the enemy of the "good enough."

I would note that turbocharged piston engine aircraft are still in use 
militarily, commercially, and recreationally. One of the reasons for the 
fact that the C-130 is approaching an operational life of 50 years is 
that it can do things that C-141's, C5's, and C-20's can't. The same is 
true for linux and even (Ugh) windows.

The only secure computer is the one in the vault, with dedicated power 
and its HD stored in a safe when not in use. This is not the most 
practical approach for either a business or a research institution. So, 
we design for security at the border, subnet, and host levels. We test 
and audit. We monitor, we mirror data online and on tape. We do many 
other things as well. This is one of the things that admins get paid for.

Now, if the question is "can I compromise one of the systems?", the 
answer is yes. I've been using unix for more than 20 years and used 
mainframes and minis before that. Some of the same methods used to gain 
mainframe access will still work with a few modifications. But,.my 
abilities do not inherently make these systems insecure.


Mike Davis

Andrew Piskorski wrote:

>On Sun, Jan 07, 2007 at 03:49:50PM -0500, Robert G. Brown wrote:
>
>  
>
>>I completely agree with this.  As I pointed out earlier in the thread,
>>companies such as banks make "conservative" seem downright radical when
>>it comes to OS upgrades.  They have to do a complete, thorough,
>>comprehensive security audit to change ANYTHING on their machines -- as
>>a requirement in federal law, IIRC.  To get them to take you seriously,
>>you MUST be prepared to support the OS they install on (once it is
>>successfully audited) forever -- until the hardware itself falls apart
>>into itty-bitty bits.
>>    
>>
>
>And yet these same hyper-'secure' organizations are running Microsoft
>Windows, Linux, and/or Unix on these super important, super 'secure',
>mission-critical boxes?  Frankly, that's oxymoronic.  It sounds
>suspiciously like decision making driven by what the rules and
>paperwork says you're supposed to do (aka, CYA), and/or general
>myopia, rather than a sound assessment of what the right solution to
>the real problem actually is.
>
>We all know that Windows is (much) less secure than Linux, and Linux
>is presumably less secure than OpenBSD.  But if you take a step back
>and look at the bigger picture, OpenBSD and MS Windows are both in the
>same bin, and that bin is labeled, "inherently unreliable and insecure
>operating systems".
>
>OpenBSD calls itself "ultra-secure", which is like calling the most
>advanced World War II piston-engined fighter planes "ultra-fast".
>Yes, it's true, more or less - as long as you're only talking about
>other piston engined aircraft, and are content to ignore the existence
>of jets and rockets.
>
>It's not something I know much about, but I am told that much more
>reliable and secure operating systems do exist, and have been
>commercially successfull in niche markets, both now and in the past.
>Niche markets like, say, the OS that runs your advanced pacemaker,
>some network routers, or aerospace systems.
>
>Now, I assume that using any such non-mainstream system is probably
>(so far, to date) significantly more painful, annoying, and thus
>expensive than just running Linux.  (And thus is unlikely to be
>appropriate for a Beowulf cluster.)
>
>But if you're a huge organization already throwing millions of dollars
>into horribly painful manual re-audits of even trivial updates to
>"commodity" operating systems for mission-critical "highly secure"
>applications, then I strongly suspect that you're already well into
>the same cost range where investing those $millions into the use of
>secure-by-design systems might well make much more sense.
>
>At some point, no matter how much you like Otto-cycle engines, putting
>more and more money and effort into carefully tuning and inspecting
>your turbo-supercharged, nitrous oxide injected, hand polished and
>streamlined, piston-engined aircraft simply no longer makes sense.  If
>you care that much, you should be looking into jets...
>
>Like I said, I don't really know much about such secure-by-design
>systems, but I've come across thought provoking discussion in various
>places, including:
>
>  http://www.coyotos.org/docs/osverify-2004/osverify-2004.html
>  http://www.coyotos.org/docs/misc/linus-rebuttal.html
>  http://www.eros-os.org/pipermail/cap-talk/2001-July/000604.html
>  http://www.erights.org/talks/captp4omg/captp4omg/sld008.htm
>  http://zesty.ca/capmyths/
>
>  
>


From hahn at physics.mcmaster.ca  Wed Jan 10 07:04:03 2007
From: hahn at physics.mcmaster.ca (Mark Hahn)
Date: Wed, 10 Jan 2007 10:04:03 -0500 (EST)
Subject: [Beowulf] machineroom design
Message-ID: <Pine.LNX.4.64.0701100949050.32736@coffee.psychology.mcmaster.ca>

I just had an episode in my machineroom where a small perturbation
in the heat load (turning off a handful of machines) caused a 30T
chiller to ice up and become mostly nonfunctional.  this was doubly
perplexing because there's plenty of load to keep it working.
the main factor was that we recently (well, month or to ago) 
remove some nasty plastic tarps which were trapping cold air inside
a cold aisle.  apparently this altered airflow enough to push a lot
of cold air towards the chiller.  these Liebert units have just one 
intake sensor, and it's nearly on a corner - out of the cold flow.

easy thing to fix, but reinforced to me once again that the canonical
hot/cold aisle approach is actually _not_ a good idea.  at the very 
least, you want something blocking the path from the cold aisle to 
the chiller intakes.

a 2-plenum design (say, cold underfloor and drop ceiling for return)
could certainly avoid this problem, assuming there's no large gap
between the ceiling and top-of-rack.

but in our case, it would have made a lot more sense to simply make 
a single row of compute racks, with their hot little bums mooning 
the row of chillers along one wall.  that's the only airflow that 
really matters, and the real problem with the current hot/cold setup
is the numerous possible bypass and counter-flows.

in short: don't build hot/cold machinerooms unless you control
both the cold outflow and hot intake locations quite carefully.
at the very least, plan to block off the end(s) of cold aisles,
since any flow out of them that doesn't go through machines is wasted,
and quite possibly problematic.

for raised floor, a simple row of machines facing away from chillers 
is a lot nicer behaved.  if you can't fit the row, consider folding 
it into sort of a W-shaped structure that still puts racks between 
cold air outflow and chiller intakes.  most servers these days generate
a pretty powerful jet of air out the back, and pointing them at least
partly towards the chiller intakes is certainly helpful.

regards, mark hahn.


From gerry.creager at tamu.edu  Wed Jan 10 07:22:40 2007
From: gerry.creager at tamu.edu (Gerry Creager N5JXS)
Date: Wed, 10 Jan 2007 09:22:40 -0600
Subject: no 'commodity' OS is 'secure' Re: [Beowulf] Which distro for
	the	cluster?
In-Reply-To: <45A4FF32.7020308@vcu.edu>
References: <Pine.LNX.4.64.0701071543170.5894@lilith.rgb.private.net>	<20070110142133.GA63012@tehun.pair.com>
	<45A4FF32.7020308@vcu.edu>
Message-ID: <45A504C0.9070505@tamu.edu>

Just to whine a bit for the sake of accuracy, the C-130 isn't a piston 
aircraft and never has been. It's a turboprop... a turbine-powered 
aircraft where the shaft drives a propeller.

Your first three points are good ones, though.

gerry

Mike Davis wrote:
> 1. Any OS can be made more secure.
> 2. Good Security is "Security in depth."
> 3. The perfect is the enemy of the "good enough."
> 
> I would note that turbocharged piston engine aircraft are still in use 
> militarily, commercially, and recreationally. One of the reasons for the 
> fact that the C-130 is approaching an operational life of 50 years is 
> that it can do things that C-141's, C5's, and C-20's can't. The same is 
> true for linux and even (Ugh) windows.
> 
> The only secure computer is the one in the vault, with dedicated power 
> and its HD stored in a safe when not in use. This is not the most 
> practical approach for either a business or a research institution. So, 
> we design for security at the border, subnet, and host levels. We test 
> and audit. We monitor, we mirror data online and on tape. We do many 
> other things as well. This is one of the things that admins get paid for.
> 
> Now, if the question is "can I compromise one of the systems?", the 
> answer is yes. I've been using unix for more than 20 years and used 
> mainframes and minis before that. Some of the same methods used to gain 
> mainframe access will still work with a few modifications. But,.my 
> abilities do not inherently make these systems insecure.
> 
> 
> Mike Davis
> 
> Andrew Piskorski wrote:
> 
>> On Sun, Jan 07, 2007 at 03:49:50PM -0500, Robert G. Brown wrote:
>>
>>  
>>
>>> I completely agree with this.  As I pointed out earlier in the thread,
>>> companies such as banks make "conservative" seem downright radical when
>>> it comes to OS upgrades.  They have to do a complete, thorough,
>>> comprehensive security audit to change ANYTHING on their machines -- as
>>> a requirement in federal law, IIRC.  To get them to take you seriously,
>>> you MUST be prepared to support the OS they install on (once it is
>>> successfully audited) forever -- until the hardware itself falls apart
>>> into itty-bitty bits.
>>>   
>>
>>
>> And yet these same hyper-'secure' organizations are running Microsoft
>> Windows, Linux, and/or Unix on these super important, super 'secure',
>> mission-critical boxes?  Frankly, that's oxymoronic.  It sounds
>> suspiciously like decision making driven by what the rules and
>> paperwork says you're supposed to do (aka, CYA), and/or general
>> myopia, rather than a sound assessment of what the right solution to
>> the real problem actually is.
>>
>> We all know that Windows is (much) less secure than Linux, and Linux
>> is presumably less secure than OpenBSD.  But if you take a step back
>> and look at the bigger picture, OpenBSD and MS Windows are both in the
>> same bin, and that bin is labeled, "inherently unreliable and insecure
>> operating systems".
>>
>> OpenBSD calls itself "ultra-secure", which is like calling the most
>> advanced World War II piston-engined fighter planes "ultra-fast".
>> Yes, it's true, more or less - as long as you're only talking about
>> other piston engined aircraft, and are content to ignore the existence
>> of jets and rockets.
>>
>> It's not something I know much about, but I am told that much more
>> reliable and secure operating systems do exist, and have been
>> commercially successfull in niche markets, both now and in the past.
>> Niche markets like, say, the OS that runs your advanced pacemaker,
>> some network routers, or aerospace systems.
>>
>> Now, I assume that using any such non-mainstream system is probably
>> (so far, to date) significantly more painful, annoying, and thus
>> expensive than just running Linux.  (And thus is unlikely to be
>> appropriate for a Beowulf cluster.)
>>
>> But if you're a huge organization already throwing millions of dollars
>> into horribly painful manual re-audits of even trivial updates to
>> "commodity" operating systems for mission-critical "highly secure"
>> applications, then I strongly suspect that you're already well into
>> the same cost range where investing those $millions into the use of
>> secure-by-design systems might well make much more sense.
>>
>> At some point, no matter how much you like Otto-cycle engines, putting
>> more and more money and effort into carefully tuning and inspecting
>> your turbo-supercharged, nitrous oxide injected, hand polished and
>> streamlined, piston-engined aircraft simply no longer makes sense.  If
>> you care that much, you should be looking into jets...
>>
>> Like I said, I don't really know much about such secure-by-design
>> systems, but I've come across thought provoking discussion in various
>> places, including:
>>
>>  http://www.coyotos.org/docs/osverify-2004/osverify-2004.html
>>  http://www.coyotos.org/docs/misc/linus-rebuttal.html
>>  http://www.eros-os.org/pipermail/cap-talk/2001-July/000604.html
>>  http://www.erights.org/talks/captp4omg/captp4omg/sld008.htm
>>  http://zesty.ca/capmyths/
>>
>>  
>>
> 
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit 
> http://www.beowulf.org/mailman/listinfo/beowulf

-- 
Gerry Creager -- gerry.creager at tamu.edu
Texas Mesonet -- AATLT, Texas A&M University	
Cell: 979.229.5301 Office: 979.458.4020 FAX: 979.862.3983
Office: 1700 Research Parkway Ste 160, TAMU, College Station, TX 77843


From rgb at phy.duke.edu  Wed Jan 10 08:04:49 2007
From: rgb at phy.duke.edu (Robert G. Brown)
Date: Wed, 10 Jan 2007 11:04:49 -0500 (EST)
Subject: no 'commodity' OS is 'secure'  Re: [Beowulf] Which distro for
	the cluster?
In-Reply-To: <20070110142133.GA63012@tehun.pair.com>
References: <Pine.LNX.4.64.0701071543170.5894@lilith.rgb.private.net>
	<20070110142133.GA63012@tehun.pair.com>
Message-ID: <Pine.LNX.4.64.0701100934460.6590@lilith.rgb.private.net>

On Wed, 10 Jan 2007, Andrew Piskorski wrote:

> On Sun, Jan 07, 2007 at 03:49:50PM -0500, Robert G. Brown wrote:
>
>> I completely agree with this.  As I pointed out earlier in the thread,
>> companies such as banks make "conservative" seem downright radical when
>> it comes to OS upgrades.  They have to do a complete, thorough,
>> comprehensive security audit to change ANYTHING on their machines -- as
>> a requirement in federal law, IIRC.  To get them to take you seriously,
>> you MUST be prepared to support the OS they install on (once it is
>> successfully audited) forever -- until the hardware itself falls apart
>> into itty-bitty bits.
>
> And yet these same hyper-'secure' organizations are running Microsoft
> Windows, Linux, and/or Unix on these super important, super 'secure',
> mission-critical boxes?  Frankly, that's oxymoronic.  It sounds
> suspiciously like decision making driven by what the rules and
> paperwork says you're supposed to do (aka, CYA), and/or general
> myopia, rather than a sound assessment of what the right solution to
> the real problem actually is.

CYA doesn't begin to describe it -- try federal law.  And it isn't
really crazy to do things the way that they do them -- look at all the
phishing out there for bank information.  Ten years ago script kiddies
who cracked some Sun workstations on our campus used them as a jumping
off place to attack both banks and the FBI, and they were doing it just
for fun, not because they seriously expected success.  Those were also
the days when there was a real ubercracker in one of the campus's Unix
networks -- one that encapsulated so precisely that the only way you
could tell he or she was there was with a completely passive box on the
same wire monitoring the actual packet traffic created during the times
they came in through there completely invisible back door.  No log
traces, no sign of binaries being tampered with -- every piece of
software that might have revealed their presence was replaced.  (This
was relatively easy with SunOS in the old days, as it was a very slowly
moving target and installed "identically" from tape to system in most
organizations).

I have little doubt that some of those attacks on banks and possibly
other facilities succeeded (forcing FDIC payout or painful losses) and
were seriously hushed up.  Mainframers dominated the scene then, of
course, and I'm equally certain that much FUD was spread about to help
those COBOL coders continue working.  Some very hardnosed individuals
then decided that Unixoid systems could succeed, but only if they were
kept up to a much higher security standard than most sysadmins knew how
to accomplish.  Hence laws that are likely derived from the older
mainframe laws plus a measure of common sense from the banks themselves
(who just HATE to lose money, after all).

> We all know that Windows is (much) less secure than Linux, and Linux
> is presumably less secure than OpenBSD.  But if you take a step back
> and look at the bigger picture, OpenBSD and MS Windows are both in the
> same bin, and that bin is labeled, "inherently unreliable and insecure
> operating systems".
>
> OpenBSD calls itself "ultra-secure", which is like calling the most
> advanced World War II piston-engined fighter planes "ultra-fast".
> Yes, it's true, more or less - as long as you're only talking about
> other piston engined aircraft, and are content to ignore the existence
> of jets and rockets.
>
> It's not something I know much about, but I am told that much more
> reliable and secure operating systems do exist, and have been
> commercially successfull in niche markets, both now and in the past.
> Niche markets like, say, the OS that runs your advanced pacemaker,
> some network routers, or aerospace systems.

Any OS can be made secure.  Even Windows.  It just requires a competent
sysadmin and audit team and a fairly rapid closed development loop
between the OS folks and the implementation/audit folks.  There are
Windows uberadmins and systems engineers out there too, don't forget,
and MS pays its coders lavishly and gets some of the best that there
are.  They just put their money and development effort where the profits
are.  Consumers are sysadmin idiots in ALL cases for ALL OS's because a
modern networking OS is a nontrivial thing to administer with lots of
moving and breakable parts and because they install software from dozens
of sources including ones with absolutely no line of responsibility or
trust.

There is a world of difference between a Windows server set up in a bank
environment, where they are running only a fully patched variant of
Windows that has been really throroughly audited for holes, in a
completely minimal installation (no gorp as all gorp must be audited and
increases risk) with only certain very specific ports open and those
watchdogged and externally firewalled, running software that only MS has
written and debugged top to bottom, being administered by REAL MCSE's --
not the ones that pick up their degrees from an online training program,
but people with masters level CPS degrees AND MCSEs AND credentials from
multiple additional training courses AND ten years of experience in the
trenches.

In this sort of environment, Windows is remarkably stable (surprise
surprise) and not at all easy to crack because People Are Watching the
Software that is Watching the People that are Watching the Software that
is Watching the Computer...(iterate to some sort of convergence).
Problems that emerge are quickly and quietly fixed, and the whole thing
re-audited which is possible because of the minimal configuration thing.
Costs an ocean of money to do things this way, of course, but to a bank
or a government secret org or a major R&D company with secrets to
protect, it is worth it.

The real observation that you are making is that (as is often the case)
"worth it" isn't the same as "cost effective compared to alternatives".
I would guess that it is a hell of a lot easier to secure almost any
unixoid OS in a server configuration, where again one can secure even
things like RH 7.3 or AIX or MacOS IF you are willing to pay what it
takes to close the audit/debugging process and invest the human
resources to configure and run the thing intelligently.  A system (or
internal network) with a single port open to the outside world, with
guardian daemons and humans constantly watching the doors inside and
outside, where physical presence sitting at a local terminal with things
like magstripe cards and/or bioscans needed to authenticate, where those
very physical presences are required to pee into cups and take regular
polygraphs -- it isn't really that easy to crack from the outside, even
for the ubercracker.

Basically they have to find a hole in the daemon that manages the one
open port (whose source has been micro-audited for e.g. leaks and buffer
problems outside of the usual development stream and which may not even
be the same source as what is in the open distribution version) AND
figure out a way to slip inside without getting eaten by any of the
automatic or human cereberus's that guard the door.  The idea that this
occurs and folks succeed makes for a great film idea, of course, but
I'll bet that nearly every successful attempt at a core system protected
in depth like this is made EITHER with penetrations through HARDWARE or
FIRMWARE holes -- tapping that good old powerline or the like to snoop
keys -- or by insiders or with their knowing or unknowing collusion
(snitching their magstripe card, bugging their bedroom where they talk
in their sleep from all of the jolt cola they drink on the job:-).

> Now, I assume that using any such non-mainstream system is probably
> (so far, to date) significantly more painful, annoying, and thus
> expensive than just running Linux.  (And thus is unlikely to be
> appropriate for a Beowulf cluster.)
>
> But if you're a huge organization already throwing millions of dollars
> into horribly painful manual re-audits of even trivial updates to
> "commodity" operating systems for mission-critical "highly secure"
> applications, then I strongly suspect that you're already well into
> the same cost range where investing those $millions into the use of
> secure-by-design systems might well make much more sense.

Ah, a believer in rational decisioning, CBA, minimal TCO.  Don't you
see, man, that you're up against a whole world of people that don't,
actually, understand the rational process?  A world where 1/2 of its
members have IQ's under 100, and where 100 \pm 10 is usually a bit iffy
when it comes to being able to actually analyze things logically or
mathematically?  A world which is additionally so incredibly
cost-nonlinear that it may well BE cheaper to continue using a very
expensive WinXX network that everybody knows how to use and manage and
that very rarely breaks compared to the HUGE costs of conversion, a fact
that is appreciated by every salesperson of computer software or
services in the universe because it works for them (when a fish is
landed) and against them (when they are trying to land a fish that is
already in somebody else's net).  And then there is FUD, ignorance, pure
cupidity and kickback schemes (not kickSTART schemes, where are
different:-), training costs (everybody already knows how to use X
already, so even though it is an ancient and cumbersome legacy
application, you have to look at days, weeks, months of retraining and
loss of productivity throughout that period, where a lot of the folks
being trained are those ~100 IQ people that are NOT happy learning new
things).

Did I mention the immense inertia of "standard" mission-critical
software packages in monopolistically universal use that become in and
of themselves a criterion for that rational decisioning?  In >>state
law<< in many cases (yes, I'm ashamed to say that NC is one of many
states that test high school students on the use of >>MS Office<<
components, not generic office software tools.  How powerful is that?
Imagine if all NC driving tests required that you take them in a Ford,
and that people were forbidden by law to make cars with certain Ford
features like the ability to burn Ford gasoline which curiously enough
was the only gas to be found in 90% of the pumps?  Something like that,
of course, would spark torch-and-pitchfork activity because the
automobile industry is actually not a monopoly and companies actually
compete.  In the software industry it creates nary a ripple... because
who is there who will complain, rouse the rabble, hang Bill Gates in
effigy?

I tell you, the greatest and most cut-throat monopoly the world has ever
seen, in control of our INFORMATION flow -- western society must have a
death wish.

So, all that stands between that current reality and a future of
rational decisioning in software is a little-bitty paradigm shift.  The
current way of doing business is a clear attractor in anybody's economic
benefit space -- change is always a cost barrier and the designers of
software deliberately do everything in their considerable power to keep
that barrier as high as they possibly can.  To create a change, a new
attractor (CBA basin) has to emerge that is a) much lower than the
first; b) broad enough that there is good phase space overlap with the
operational needs of a large fraction of the customer population; c) a
"valley through the hills", real or perceived, to minimize that cost of
getting from there to here; d) advertising like all hell to get the word
out about the valley and the green hills available on the other side
where the rivers run in milk and cows grow on trees; e) companies and
government officials whose pension funds aren't all tied up in stock in
the company that "owns" the original less good basin; f) something to
trigger a stampede -- a stampede basically tramples down those
mountains real fast.

Arguably linux is a) already, is working (too blindly and slowly) on b),
utterly lacks c), is missing d) altogether, barring a handful of ads
from IBM, is totally f&*!ed by e) (seriously!), and has yet to produce
an f) largely because even if some part of it was capable of doing so
nobody knows about it (see b) and c) and d):-(.

Nothing a good business plan couldn't fix, actually.  Nothing that
nature won't fix on its own, eventually, because a) isn't going away and
is a LOT lower, and b) tends to grow over time, and these two alone will
probably eventually fix c) and maybe f) to the extent that d) and e)
don't matter as much.  Of course MS may manage to create f) all on their
own. One really, really serious security hit -- say a major bank that IS
knocked over to the tune of an unhidable multi-billion dollar loss
because of a flaw in its design -- followed by a change in government
banking policy and public perception -- might be all that it takes to
trigger at least a modest stampede.  Of course linux and the rest are
vulnerable to this sort of thing as well, and MS actually HAS a sales
force that makes FUD a fine art and amplifies even the tiniest incident
into massive perceived risk...

    rgb

-- 
Robert G. Brown	                       http://www.phy.duke.edu/~rgb/
Duke University Dept. of Physics, Box 90305
Durham, N.C. 27708-0305
Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu


From James.P.Lux at jpl.nasa.gov  Wed Jan 10 10:02:02 2007
From: James.P.Lux at jpl.nasa.gov (Jim Lux)
Date: Wed, 10 Jan 2007 10:02:02 -0800
Subject: no 'commodity' OS is 'secure'  Re: [Beowulf] Which distro
	for the cluster?
In-Reply-To: <Pine.LNX.4.64.0701100934460.6590@lilith.rgb.private.net>
References: <Pine.LNX.4.64.0701071543170.5894@lilith.rgb.private.net>
	<20070110142133.GA63012@tehun.pair.com>
	<Pine.LNX.4.64.0701100934460.6590@lilith.rgb.private.net>
Message-ID: <6.2.3.4.2.20070110093838.0300d650@mail.jpl.nasa.gov>

At 08:04 AM 1/10/2007, Robert G. Brown wrote:
>On Wed, 10 Jan 2007, Andrew Piskorski wrote:
>
>>On Sun, Jan 07, 2007 at 03:49:50PM -0500, Robert G. Brown wrote:
>>
>>>I completely agree with this.  As I pointed out earlier in the thread,
>>>companies such as banks make "conservative" seem downright radical when
>>>it comes to OS upgrades.  They have to do a complete, thorough,
>>>comprehensive security audit to change ANYTHING on their machines -- as
>>>a requirement in federal law, IIRC.  To get them to take you seriously,
>>>you MUST be prepared to support the OS they install on (once it is
>>>successfully audited) forever -- until the hardware itself falls apart
>>>into itty-bitty bits.


<snip>

There is a world of difference between a Windows server set up in a bank
>environment, where they are running only a fully patched variant of
>Windows that has been really throroughly audited for holes, in a
>completely minimal installation (no gorp as all gorp must be audited and
>increases risk) with only certain very specific ports open and those
>watchdogged and externally firewalled, running software that only MS has
>written and debugged top to bottom, being administered by REAL MCSE's --
>not the ones that pick up their degrees from an online training program,
>but people with masters level CPS degrees AND MCSEs AND credentials from
>multiple additional training courses AND ten years of experience in the
>trenches.
<snip>


>Basically they have to find a hole in the daemon that manages the one
>open port (whose source has been micro-audited for e.g. leaks and buffer
>problems outside of the usual development stream and which may not even
>be the same source as what is in the open distribution version) AND
>figure out a way to slip inside without getting eaten by any of the
>automatic or human cereberus's that guard the door.  The idea that this
>occurs and folks succeed makes for a great film idea, of course, but
>I'll bet that nearly every successful attempt at a core system protected
>in depth like this is made EITHER with penetrations through HARDWARE or
>FIRMWARE holes -- tapping that good old powerline or the like to snoop
>keys -- or by insiders or with their knowing or unknowing collusion
>(snitching their magstripe card, bugging their bedroom where they talk
>in their sleep from all of the jolt cola they drink on the job:-).
>
>>Now, I assume that using any such non-mainstream system is probably
>>(so far, to date) significantly more painful, annoying, and thus
>>expensive than just running Linux.  (And thus is unlikely to be
>>appropriate for a Beowulf cluster.)
>>
>>But if you're a huge organization already throwing millions of dollars
>>into horribly painful manual re-audits of even trivial updates to
>>"commodity" operating systems for mission-critical "highly secure"
>>applications, then I strongly suspect that you're already well into
>>the same cost range where investing those $millions into the use of
>>secure-by-design systems might well make much more sense.
>
>Ah, a believer in rational decisioning, CBA, minimal TCO.  Don't you
>see, man, that you're up against a whole world of people that don't,
>actually, understand the rational process?  A world where 1/2 of its
>members have IQ's under 100, and where 100 \pm 10 is usually a bit iffy
>when it comes to being able to actually analyze things logically or
>mathematically?

<snip>

Banks, IT, and security..  My wife is a senior IT manager in a big 
bank, so I get to hear quite a bit about what's involved in this.

They take it quite seriously (backed up by federal and state 
regulations and laws)

First off... tons of money are spent on it.  As rgb pointed out, 
they're not out hiring kids out of highschool as sysadmins.  These 
folks get paid reasonably well and are quite skilled and competent.

Second.. there are many levels of checking and cross checking.  Not 
only is there a whole second independent group of people through whom 
all software changes must flow, but there's a third independent group 
of auditors making life a miserable hell for the aforementioned first 
two groups. And, within these groups, there are multiple levels of 
approval required to even contemplate making the change in the first 
place.  You'd have to suborn and coopt a lot of people to "sneak 
something in", and those people are paid quite well so it ain't going 
to be the "slip someone a few hundred bucks under the table to leave 
the door unlocked" sort of thing.

Third.. systems are designed to require multiple people to be 
involved in any significant transaction or event.  And, there are 
rules that require those people to take vacations and be 
"disconnected", so that there are always new/fresh eyes looking at 
the day to day operations.  This is basic accounting 101... be 
suspicious of clerical employees who never take a vacation, and have 
a different person write the checks vs checking the statement from 
the bank. (I learned that one the hard way)

Fourth.. there are big time criminal penalties involved.  That's a 
much bigger club than some civil action or a "theft of services" sort 
of prosecution.  The police WILL get involved, the FBI and Secret 
Service WILL get involved.

Fifth.. Everybody working in a position of trust has to have an 
Office of the Comptroller of Currency background check and 
pass.  Lots of bright people don't pass the check because of some 
crippling problem or stupid indiscretion in their deep dark 
past.  The guidelines are out on the OCC website somewhere, and most 
companies have their own list of infractions.  It's done by a sort of 
point count scheme.  I would imagine (but do not know) that having 
been involved in ANY sort of fraud or scam (whether computer related 
or not) is sufficient to immediately disqualify you.  The stories you 
hear about high-school or college hackers seeing the light and being 
hired to help secure things are just that.. stories.  They might hire 
a "black-hat" consultant to give advice or do a penetration attempt, 
but they're going to be well firewalled (as in physically separate 
locations, no connectivity, etc.) from actual operations.  They'd 
never get a job as a coder, thence to lie in wait as they get 
promoted over 15 years to a position where they could actually be 
able to do some damage.

Sixth.. these are financial transactions, and they can always be 
reversed.  This is sort of the ultimate "checkpoint/restore" 
mechanism.  There have been compromises and mistakes (hey, if you're 
processing millions of transactions a day, run of the mill software 
errors crop up) and the people affected always get "made 
whole".  Sometimes it might take some time, but it gets fixed 
eventually. (to the point where there are opportunists who wait for 
the inevitable mistakes and cash in on the penalties... Taking 
recording of mortage pay-offs as an example, if you record the 
document late or improperly (where the time line is defined by 
statute), the borrower gets some sort of compensation (as well as 
getting the transaction fixed). )


So, the actual cost and security status of the OS involved is 
insignificant in comparison to the enormous people and infrastructure 
costs already being spent.  Furthermore, the pecularities or not of 
the OS don't really have an effect.  You've got a huge staff of 
people who are very experienced in those peculiarities, whatever they 
are.  The whole system architecture (including the people 
architecture) is specifically designed to make security sort of 
automatic.  It's tedious, it's expensive, and it works fairly well.


James Lux, P.E.
Spacecraft Radio Frequency Subsystems Group
Flight Communications Systems Section
Jet Propulsion Laboratory, Mail Stop 161-213
4800 Oak Grove Drive
Pasadena CA 91109
tel: (818)354-2075
fax: (818)393-6875 


From csamuel at vpac.org  Wed Jan 10 19:53:56 2007
From: csamuel at vpac.org (Chris Samuel)
Date: Thu, 11 Jan 2007 14:53:56 +1100
Subject: no 'commodity' OS is 'secure' Re: [Beowulf] Which distro for the
	cluster?
In-Reply-To: <20070110142133.GA63012@tehun.pair.com>
References: <Pine.LNX.4.64.0701071543170.5894@lilith.rgb.private.net>
	<20070110142133.GA63012@tehun.pair.com>
Message-ID: <200701111453.56555.csamuel@vpac.org>

On Thursday 11 January 2007 01:21, Andrew Piskorski wrote:

> It sounds suspiciously like decision making driven by what the rules and
> paperwork says you're supposed to do

I knew an organisation (not this one) that had the rule that every system had 
to run a full virus scan once a day.

The security folks insisted that this rule applied to their new Linux/AIX 
cluster and so they dutifully paid for a commercial Linux A/V package and set 
up cron to scan the user and project data (mounted from an AIX box) once a 
day.

Only problem was once they had more than a trivial amount of data it took more 
than 24 hours for the scan to run, so the first one was still running when 
the second one was kicked off by cron.  This slowed both of them down, so the 
first one still hadn't finished by the next day, which slowed all 3 of them 
down further, so the next day.. well, you get the picture.

I gave them a hand with this tricky problem and eventually they managed to 
persuade their higher ups that they could get away with running ClamAV on the 
NFS server (as there was no commercial AV for AIX, unsurprisingly) and the 
problems went away.

cheers,
Chris
-- 
 Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
 Victorian Partnership for Advanced Computing http://www.vpac.org/
 Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070111/f191fd2d/attachment.sig>

From steve_heaton at iinet.net.au  Wed Jan 10 22:16:42 2007
From: steve_heaton at iinet.net.au (Steve Heaton)
Date: Thu, 11 Jan 2007 17:16:42 +1100
Subject: no 'commodity' OS is 'secure' Re: [Beowulf] Which distro for
	the cluster?
In-Reply-To: <200701101501.l0AF0otR011571@bluewest.scyld.com>
References: <200701101501.l0AF0otR011571@bluewest.scyld.com>
Message-ID: <45A5D64A.5090807@iinet.net.au>

G'day all

I agree with Andrew et al.

Having spent a short sentence inside a major financial institution's 
security section I just thought I'd add a bit more.

They don't run Linux for *anything* related to security (although it's 
starting to do well elsewhere). Everything is from 'major software 
vendors'. The big boys in *NIX OS and apps.

In their opinion there is no Linux vendor (or associated financial 
support) that could cover the risk. This place has bigger financial 
teeth than most countries.

Nothing from M$, Apple or anyone else is allowed anywhere near the live 
perimeter. No exceptions. Ever. They regularly get approached directly 
and indirectly on the Evil Empire's behalf, as I'm sure you can imagine. 
They also find this a regular source of mirth.

While I agree they're conservative they also run "relatively" recent aka 
'stable' releases. Their test suite is awesome... and they have two 
mirrors of the live environment: development and testing. 'Dev' is the 
same platforms but typically less storage. The 'test' is an *exact* copy 
of what is a huge environment. (Completely separate DR/BC as well).
They don't do squat without it having run through the test process.

This really blew me away... an >exact< copy of the whole live 
environment. Platforms, versions, BIOS the whole shabang. (Rumour has it 
even patch lead lengths). It was then pointed out that they're a bank. 
Money is what they do. Money is what they have. Yours! :)


Cheers
Stevo


From john.hearns at streamline-computing.com  Thu Jan 11 00:05:50 2007
From: john.hearns at streamline-computing.com (John Hearns)
Date: Thu, 11 Jan 2007 08:05:50 +0000
Subject: no 'commodity' OS is 'secure' Re: [Beowulf] Which distro for
	the cluster?
In-Reply-To: <45A5D64A.5090807@iinet.net.au>
References: <200701101501.l0AF0otR011571@bluewest.scyld.com>
	<45A5D64A.5090807@iinet.net.au>
Message-ID: <45A5EFDE.1020206@streamline-computing.com>

Steve Heaton wrote:
>
> 
> This really blew me away... an >exact< copy of the whole live 
> environment. Platforms, versions, BIOS the whole shabang. (Rumour has it 
> even patch lead lengths). It was then pointed out that they're a bank. 
> Money is what they do. Money is what they have. Yours! :)

Someone who worked in the City once told me "When you (a bank) handle 
lots and lots of money, some rubs off on your fingers"

And re. Jim Lux's point about software errors and financial transactions 
being "made good", this did happen recently in the UK.
I listen to Radio 4's 'Moneybox' programme, and a big UK bank had a 
failure in the batch processing system, which led to many standing 
orders being paid late. Anyone who lost money due to this was recompensed.


-- 
      John Hearns
      Senior HPC Engineer
      Streamline Computing,
      The Innovation Centre, Warwick Technology Park,
      Gallows Hill, Warwick CV34 6UW
      Office: 01926 623130 Mobile: 07841 231235


From hchsiao at csie.ncku.edu.tw  Thu Jan  4 04:14:54 2007
From: hchsiao at csie.ncku.edu.tw (Hung-Chang Hsiao)
Date: Thu, 4 Jan 2007 20:14:54 +0800
Subject: [Beowulf] CFP: ICPADS 2007
Message-ID: <003f01c72ff9$f50a65d0$2201a8c0@yourdbc2d4fabb>


***********************************************************
We apology if you receive multiple copies of this CFP.
***********************************************************

Call for Paper
The 13th International Conference on Parallel and Distributed Systems
(ICPADS'2007)
December 5 - 7, 2007
National Tsing Hua University, Hsinchu, Taiwan
http://www.ccrc.nthu.edu.tw/icpads2007

PURPOSE AND SCOPE
The conference provides an international forum for scientists, engineers,
and users to exchange and share their experiences, new ideas, and latest
research results on all aspects of parallel and distributed systems. All
accepted papers will appear in the Proceedings of ICPADS 2007, which is to
be published by the IEEE Computer Society Press.
Topics of interest include, but are not limited to:
* Parallel and Distributed Applications and Algorithms
* High Performance Computational Biology and Bioinformatics
* Multicore and Multithreaded Architectures
* Power-aware Computing
* Distributed and Parallel Operating Systems
* Resource Management and Scheduling
* Peer-to-Peer Computing
* Cluster and Grid Computing
* Web-based Computing and Service-Oriented Architecture
* Communication and Networking Systems
* Wireless and Mobile Computing
* Ad Hoc and Sensor Networks
* Security and Privacy
* Dependable and Trustworthy Computing and Systems
* Real-Time and Multimedia Systems
* Performance Modeling and Evaluation

IMPORTANT DATES
Workshop Proposal Due: March 2, 2007
Paper Submission Due: May 20, 2007
Author Notification: August 3, 2007
Final Manuscript Due: September 2, 2007

PAPER SUBMISSION
Papers presenting original and unpublished work are invited and will be
evaluated based on originality, significance, technical soundness, and
clarity of exposition. Submitted papers should be formatted in a two-column
IEEE Computer Society format (URL:
http://computer.org/cspress/instruct.htm) and should not exceed eight pages
including figures and references. Submissions will be via the conference web
site: http://www.ccrc.nthu.edu.tw/icpads2007

WORKSHOPS
Proposals for the workshops are solicited. Workshop proposals should be no
longer than 2 pages, and should include a summary describing the themes of
the workshop, scope and motivation, topics of interest, and brief
biographies of the organizers. The proposals should be submitted to the
Workshop Chair (clwang at cs.hku.hk) by March 2, 2007.

For further information of the conference, please contact Prof. Chung-Ta
King at king at cs.nthu.edu.tw

ORGANIZING COMMITTEE
Honorary Chair
   Wen-Tsuen Chen, National Tsing Hua University, Taiwan General Chair
   Lionel M. Ni, Hong Kong University of Science and Technology, Hong Kong
Program Chair
   Chung-Ta King, National Tsing Hua University, Taiwan Program Vice Chairs
1. Parallel Algorithms and Applications
   Wanlei Zhou, Deakin University, Australia 2. Parallel and Distributed
Architecture
   Wei-Chung Hsu, University of Minnesota, USA 3. Resource Management and
Scheduling
   Tai-Yi Huang, National Tsing Hua University, Taiwan 4. Cluster and Grid
Computing
   Mark Baker, University of Reading, UK
   Daniel Katz, Louisiana State University, USA 5. Web and Peer-to-peer
Systems
   Yunhao Liu, Hong Kong University of Science and Technology, Hong Kong 6.
Mobile and Ubiquitous Computing
   Matt Mutka, Michigan State University, USA 7. Dependability and
Trustworthy Computing
   Bobby Bhattacharjee, University of Maryland, USA Workshop Chair
   Cho-Li Wang, University of Hong Kong, Hong Kong Award Chair
   Makoto Takizawa, Tokyo Denki University, Japan Publication Chair
   Yeh-Ching Chung, National Tsing Hua University, Taiwan Finance Chair
   Chiu-Ting Hsu, National Tsing Hua University, Taiwan Publicity Chair
   Hung-Chang Hsiao, National Cheng Kung Univerity, Taiwan Registration
Chair
   Shiao-Li Tsao, National Chiao Tung University, Taiwan Local Arrangements
Chair
   Ming-Jer Tsai, National Tsing Hua University, Taiwan Steering Committee
Chair
   Wen-Tsuen Chen, National Tsing Hua University, Taiwan


From rafapa at us.es  Tue Jan  9 00:33:42 2007
From: rafapa at us.es (Rafael R. Pappalardo)
Date: Tue, 9 Jan 2007 09:33:42 +0100
Subject: [Beowulf] Any Gaussian users out there?
In-Reply-To: <45A1BF63.4010203@scalableinformatics.com>
References: <45A1BF63.4010203@scalableinformatics.com>
Message-ID: <200701090933.42357.rafapa@us.es>

On Monday 08 January 2007 04:49, Joe Landman wrote:
> I found a neat ... feature ... of Linux while getting g03 running in SMP
> on cluster nodes.  Long story, but the folks I am doing this for don't
> have/want to use Linda.  They asked us to help them get g03 operational
> in SMP parallel.  This wasn't painful.  Have it integrated into SGE and
> our SICE interface now as well.
>
> Basic idea is that we are getting a kernel exception in the VFS layer
> only when running with 2 or more CPUs on an SMP node.  Shows up only on
> SuSE 9.3 nodes.  The other nodes are RHEL 3 based (2.4 kernel, but hey,
> its really stable).
>
> I don't want to post a nasty-looking trap here.
>
> The problem occurs with both xfs and jfs.  Haven't had the chance to try
> ext3 yet, though if the issue is in the vfs layer, I can't see how
> changing the underlying block device is going to alter the layers (VFS)
> above it.
>
> The net effect of this is that it runs great on the 2.4 based machines,
> but gets SIGKILLs when running on the 2.6 based SuSE 9.3 machines.
> Looks like the app is tickling the OS bug.  I can repeatably cause this
> trap, though it seems to occur at "random" places, well, not really.
> The way Gaussian runs, it has "links" which are binary modules which
> execute a particular portion of the calculation (its pretty neat
> really).  Each link is read in from the disk.  This VFS bug gets
> triggered regardless of local or remote FS.
>
> Any Gaussian users out there see that?  Does a kernel upgrade fix it?
> Inquiring minds want to know ...

Don't know if it's threads related but... Sometimes setting
LD_ASSUME_KERNEL to 2.4.1 in the environment solves this kind of problems.
There are other possible values, you can have a look at:
http://people.redhat.com/drepper/assumekernel.html

Best regards,

Rafael
-- 
Dr. Rafael R. Pappalardo
Dept. Physical Chemistry, Univ. de Sevilla (Spain)
e-mail: rafapa at us.es


From yong.li at rioh.cn  Tue Jan  9 19:08:31 2007
From: yong.li at rioh.cn (=?gb2312?B?wO7Twg==?=)
Date: Wed, 10 Jan 2007 11:08:31 +0800
Subject: [Beowulf] lam6.5.9 & dyna_mpp970 
Message-ID: <20070110030831.26218.qmail@mail.rioh.cn>

beowulf:
hi,I am a engineer from beijing china,recently I am in trouble when I was setting up my HPC beowulf last week. I am trying to run LS-DYNA on a 8 nodes cluster .The problem is following.
[hpc at node1 ~]$ lamboot -v

LAM 6.5.9/MPI 2 C++/ROMIO - Indiana University

Executing hboot on n0 (node1 - 1 CPU)...
Executing hboot on n1 (node2 - 1 CPU)...
Executing hboot on n2 (node3 - 1 CPU)...
Executing hboot on n3 (node4 - 1 CPU)...
Executing hboot on n4 (node5 - 1 CPU)...
Executing hboot on n5 (node6 - 1 CPU)...
Executing hboot on n6 (node7 - 1 CPU)...
Executing hboot on n7 (node8 - 1 CPU)...
topology done
[hpc at node1 ~]$ mpirun -np 8 mpp970.exe info
-----------------------------------------------------------------------------
It seems that [at least] one of processes that was started with mpirun
did not invoke MPI_INIT before quitting (it is possible that more than
one process did not invoke MPI_INIT -- mpirun was only notified of the
first one, which was on node n0).

mpirun can *only* be used with MPI programs (i.e., programs that
invoke MPI_INIT and MPI_FINALIZE).  You can use the "lamexec" program
to run non-MPI programs over the lambooted nodes.
-----------------------------------------------------------------------------
forrtl: info: Fortran error message number is 78.
forrtl: warning: Could not open message catalog: ifcore_msg.cat.
forrtl: info: Check environment variable NLSPATH and protection of /usr/lib/ifcore_msg.cat.
[hpc at node1 ~]$
 
ps: lam version is lam-6.5.9
    dyna version is mpp970_s_intelsse_linux_lam659.exe
    

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070110/602f1a79/attachment.html>

From glen at callident.com  Wed Jan 10 15:44:30 2007
From: glen at callident.com (Glen Otero)
Date: Wed, 10 Jan 2007 15:44:30 -0800
Subject: [Beowulf] HPC Consortium and Cell Hack-a-thon
Message-ID: <5297539C-689C-4162-B561-B56FC0D82AAF@callident.com>

Folks-

Very interesting announcements from Terra Soft with regard to Cell  
and HPC. The Hack-a-thon sounds awesome, and free of charge.

Glen

Terra Soft Launches HPC Consortium
http://www.terrasoftsolutions.com/news/2007/2007-01-10a.shtml

Terra Soft Hosts Cell Hack-a-thon
http://www.terrasoftsolutions.com/news/2007/2007-01-10b.shtml


From gdjacobs at gmail.com  Fri Jan 12 00:57:57 2007
From: gdjacobs at gmail.com (Geoff Jacobs)
Date: Fri, 12 Jan 2007 02:57:57 -0600
Subject: [Beowulf] HPC Consortium and Cell Hack-a-thon
In-Reply-To: <5297539C-689C-4162-B561-B56FC0D82AAF@callident.com>
References: <5297539C-689C-4162-B561-B56FC0D82AAF@callident.com>
Message-ID: <45A74D95.7010100@gmail.com>

Glen Otero wrote:
> Folks-
> 
> Very interesting announcements from Terra Soft with regard to Cell and
> HPC. The Hack-a-thon sounds awesome, and free of charge.
> 
> Glen
> 
> Terra Soft Launches HPC Consortium
> http://www.terrasoftsolutions.com/news/2007/2007-01-10a.shtml
> 
> Terra Soft Hosts Cell Hack-a-thon
> http://www.terrasoftsolutions.com/news/2007/2007-01-10b.shtml

Please be more forthcoming about professional associations when making
announcements like this. Forgetting to note a business relationship
between Terrasoft and yourself could be construed by the suspicious as
astroturfing.

-- 
Geoffrey D. Jacobs


From reuti at staff.uni-marburg.de  Fri Jan 12 01:55:02 2007
From: reuti at staff.uni-marburg.de (Reuti)
Date: Fri, 12 Jan 2007 10:55:02 +0100
Subject: [Beowulf] lam6.5.9 & dyna_mpp970
In-Reply-To: <20070110030831.26218.qmail@mail.rioh.cn>
References: <20070110030831.26218.qmail@mail.rioh.cn>
Message-ID: <CE960EFD-4E3F-4E4E-9A21-B33C5AF4BE0D@staff.uni-marburg.de>

Am 10.01.2007 um 04:08 schrieb ??:

> beowulf:
> hi,I am a engineer from beijing china,recently I am in trouble when  
> I was setting up my HPC beowulf last week. I am trying to run LS- 
> DYNA on a 8 nodes cluster .The problem is following.
> [hpc at node1 ~]$ lamboot -v
>
> LAM 6.5.9/MPI 2 C++/ROMIO - Indiana University
>
> Executing hboot on n0 (node1 - 1 CPU)...
> Executing hboot on n1 (node2 - 1 CPU)...
> Executing hboot on n2 (node3 - 1 CPU)...
> Executing hboot on n3 (node4 - 1 CPU)...
> Executing hboot on n4 (node5 - 1 CPU)...
> Executing hboot on n5 (node6 - 1 CPU)...
> Executing hboot on n6 (node7 - 1 CPU)...
> Executing hboot on n7 (node8 - 1 CPU)...
> topology done

Fine.

> [hpc at node1 ~]$ mpirun -np 8 mpp970.exe info
> ---------------------------------------------------------------------- 
> -------
> It seems that [at least] one of processes that was started with mpirun
> did not invoke MPI_INIT before quitting (it is possible that more than
> one process did not invoke MPI_INIT -- mpirun was only notified of the
> first one, which was on node n0).
>
> mpirun can *only* be used with MPI programs (i.e., programs that
> invoke MPI_INIT and MPI_FINALIZE). You can use the "lamexec" program
> to run non-MPI programs over the lambooted nodes.
> ---------------------------------------------------------------------- 
> -------
> forrtl: info: Fortran error message number is 78.
> forrtl: warning: Could not open message catalog: ifcore_msg.cat.
> forrtl: info: Check environment variable NLSPATH and protection of / 
> usr/lib/ifcore_msg.cat.

This is the error message, that the file with the error messages  
can't be found. If you install ifc in a proper way, it should be  
somewhere in /opt/intel_fc_80. To me it looks like a programming  
error before any MPI statement.

I would recommend to use LAM 7.1.2 anyway.

-- Reuti

> [hpc at node1 ~]$
>
> ps: lam version is lam-6.5.9
> dyna version is mpp970_s_intelsse_linux_lam659.exe
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit  
> http://www.beowulf.org/mailman/listinfo/beowulf


From jlb17 at duke.edu  Fri Jan 12 06:32:32 2007
From: jlb17 at duke.edu (Joshua Baker-LePain)
Date: Fri, 12 Jan 2007 09:32:32 -0500 (EST)
Subject: [Beowulf] lam6.5.9 & dyna_mpp970 
In-Reply-To: <20070110030831.26218.qmail@mail.rioh.cn>
References: <20070110030831.26218.qmail@mail.rioh.cn>
Message-ID: <alpine.LRH.0.81.0701120930210.15491@chaos.egr.duke.edu>

On Wed, 10 Jan 2007 at 11:08am, ?? wrote


> hi,I am a engineer from beijing china,recently I am in trouble when I 
> was setting up my HPC beowulf last week. I am trying to run LS-DYNA on a 
> 8 nodes cluster .The problem is following. [hpc at node1 ?]$ lamboot -v

What Linux distribution are you using?

> [hpc at node1 ?]$ mpirun -np 8 mpp970.exe info

Just for grins, try specifying the *full* path to the mpp970 executable in 
that command line.

> ps: lam version is lam-6.5.9
>    dyna version is mpp970_s_intelsse_linux_lam659.exe

What *exact* version of DYNA are you using?  There are several point 
releases in the 970 series (and there's now the 971 series as well).

-- 
Joshua Baker-LePain
Department of Biomedical Engineering
Duke University

From deadline at clustermonkey.net  Fri Jan 12 09:38:00 2007
From: deadline at clustermonkey.net (Douglas Eadline)
Date: Fri, 12 Jan 2007 12:38:00 -0500 (EST)
Subject: [Beowulf] Cluster Design Rules 
Message-ID: <55886.192.168.1.1.1168623480.squirrel@mail.eadline.org>

I just posted an article about the Cluster Design Rules
from the Aggregate.org.  Bill Dieter gave me a demo
of at SC06. Now he and Hank Dietz have written
and article on this amazing tool. You *really*
want to have a look at this project.

  A Web-Based Tool for Optimized Cluster Design

  http://www.clustermonkey.net//content/view/181/33/

And while you are visiting the Monkey, check out RGB's

  Cluster Networking: The Dark Side of IP over Ethernet

  http://www.clustermonkey.net//content/view/180/32/

--
Doug


From csamuel at vpac.org  Sat Jan 13 17:27:15 2007
From: csamuel at vpac.org (Chris Samuel)
Date: Sun, 14 Jan 2007 12:27:15 +1100
Subject: [Beowulf] OneSis experiences ?
Message-ID: <200701141227.15875.csamuel@vpac.org>

Hi folks,

Anyone here played with OneSis ?   It looks like a variation on the WareWulf 
theme..

http://www.onesis.org/

A thin, role-based, single image system for scalable cluster management. 
oneSIS is a simple, flexible method for managing one or more clusters from a 
single filesystem image. It is easy to deploy, and it simplifies the task of 
cluster administration.

Their intro page says:

oneSIS allows you to have diskless nodes, diskfull nodes, or any combination 
in-between. Simple setups require hardly any configuration at all, but with 
oneSIS even complex environments become easy to manage.
[...]

Supported distributions: 

  RedHat 7.1, 7.3, 8.0, 9.0
  RedHat Enterprise WS,AS 3.0
  RedHat Enterprise AS 4.0
  Fedora Core 2
  SuSe 9.1
  Debian
  Gentoo
  More to come ...

-- 
 Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
 Victorian Partnership for Advanced Computing http://www.vpac.org/
 Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia


From becker at scyld.com  Mon Jan 15 10:44:35 2007
From: becker at scyld.com (Donald Becker)
Date: Mon, 15 Jan 2007 10:44:35 -0800 (PST)
Subject: [Beowulf] BayBUG 2007 meeting plans
Message-ID: <Pine.LNX.4.44.0701151038510.29452-100000@bluewest.scyld.com>


The BayBUG is moving to Bi-monthly meetings in 2007.

The first 2007 meeting will be in February, with two talks:
  Industry Standard Clusters with Clearspeed's Accelerators
and a general talk on site-wide and cluster schedulers from Cluster 
Resources.


--------

"Bay Area Beowulf User Group (BayBUG)
Moving to bi-monthly in 2007
First 2007 date:
February 20, 2007
2:30 - 5:00 p.m.
AMD headquarters Common Building,
Room C-6/7/8
991 Stewart Drive, Sunnyvale

Speakers: [black bold]
- Massimiliano Fatica, Sr. Solutions Architect at ClearSpeed Technology 
[black bold]
- A senior engineer from Cluster Resources Inc [black bold]

Join moderator and Beowulf cluster co-inventor Donald Becker for food and 
drinks and to learn from and network with other Linux HPC professionals."

-- 
Donald Becker				becker at scyld.com
Scyld Software	 			Scyld Beowulf cluster systems
914 Bay Ridge Road, Suite 220		www.scyld.com
Annapolis MD 21403			410-990-9993


From deadline at clustermonkey.net  Mon Jan 15 14:51:19 2007
From: deadline at clustermonkey.net (Douglas Eadline)
Date: Mon, 15 Jan 2007 17:51:19 -0500 (EST)
Subject: [Beowulf] Sun Releases Fortress Implementation
Message-ID: <47787.192.168.1.1.1168901479.squirrel@mail.eadline.org>


I just posted a short news item on Cluster Monkey about
Sun's release of a Fortress interpreter (BSD License).
Fortress is a parallel programming language for
HPC. The post has a bunch of links for the
curious.

http://www.clustermonkey.net//content/view/182/1/

It looks interesting. Does anyone have any
experience with Fortress design/ideas/implementation?

--
Doug


From mike at etek.chalmers.se  Mon Jan 15 11:00:17 2007
From: mike at etek.chalmers.se (Mikael Fredriksson)
Date: Mon, 15 Jan 2007 20:00:17 +0100
Subject: [Beowulf] SGI to offer Windows on clusters
Message-ID: <45ABCF41.3020501@etek.chalmers.se>

Hi all of You Beowulfers.


This article: "SGI to offer Windows on clusters" might be worth reading...


http://www.computerworld.com/action/article.do?command=viewArticleBasic&articleId=9007859&source=NLT_PM&nlid=8


Any comments?


Regards
MF


From eric-shook at uiowa.edu  Tue Jan 16 14:54:44 2007
From: eric-shook at uiowa.edu (Eric Shook)
Date: Tue, 16 Jan 2007 16:54:44 -0600
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <45ABCF41.3020501@etek.chalmers.se>
References: <45ABCF41.3020501@etek.chalmers.se>
Message-ID: <45AD57B4.9060106@uiowa.edu>

I talked to our SGI rep about this yesterday and he told me they are not 
really targeting "hard-core" university research where Linux/UNIX 
already has a strong foot hold.  Instead this is for the Business sector 
where simplified workflows and having easy HPC integration into an 
already 100% Windows Infrastructure is more appealing.

This was his take and it seemed reasonable to me.

Eric Shook
-- 
Eric Shook (319) 335-6714
Technical Lead, Systems and Operations - GROW
http://grow.uiowa.edu


Mikael Fredriksson wrote:
> Hi all of You Beowulfers.
> 
> 
> This article: "SGI to offer Windows on clusters" might be worth reading...
> 
> 
> http://www.computerworld.com/action/article.do?command=viewArticleBasic&articleId=9007859&source=NLT_PM&nlid=8 
> 
> 
> 
> Any comments?
> 
> 
> 
> Regards
> MF
> 
> 
> 
> 
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit 
> http://www.beowulf.org/mailman/listinfo/beowulf


From mwill at penguincomputing.com  Tue Jan 16 15:16:50 2007
From: mwill at penguincomputing.com (Michael Will)
Date: Tue, 16 Jan 2007 15:16:50 -0800
Subject: [Beowulf] SGI to offer Windows on clusters
Message-ID: <433093DF7AD7444DA65EFAFE3987879C351384@orca.penguincomputing.com>

That is an interesting article... So I assume they went chapter 11 and
microsoft ressurrected them as a vehicle to market their cluster
solution? Just wait for the Zombie processes ;-)

Michael 

-----Original Message-----
From: beowulf-bounces at beowulf.org [mailto:beowulf-bounces at beowulf.org]
On Behalf Of Mikael Fredriksson
Sent: Monday, January 15, 2007 11:00 AM
To: Beowulf at beowulf.org
Subject: [Beowulf] SGI to offer Windows on clusters

Hi all of You Beowulfers.


This article: "SGI to offer Windows on clusters" might be worth
reading...


http://www.computerworld.com/action/article.do?command=viewArticleBasic&
articleId=9007859&source=NLT_PM&nlid=8


Any comments?


Regards
MF


_______________________________________________
Beowulf mailing list, Beowulf at beowulf.org To change your subscription
(digest mode or unsubscribe) visit
http://www.beowulf.org/mailman/listinfo/beowulf


From steve_heaton at iinet.net.au  Tue Jan 16 18:13:06 2007
From: steve_heaton at iinet.net.au (Steve Heaton)
Date: Wed, 17 Jan 2007 13:13:06 +1100
Subject: [Beowulf] RAID for dummies
In-Reply-To: <200701162001.l0GK08J5018999@bluewest.scyld.com>
References: <200701162001.l0GK08J5018999@bluewest.scyld.com>
Message-ID: <45AD8632.8080709@iinet.net.au>

Came across this :)

http://www.epidauros.be/raid.jpg

I'm sure we can pick faults in the analogy... seeing most 'wulf keepers 
have a decent physics background but hey. Fun pic :)

Cheers
Stevo


From kus at free.net  Wed Jan 17 06:06:50 2007
From: kus at free.net (Mikhail Kuzminsky)
Date: Wed, 17 Jan 2007 17:06:50 +0300
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <45ABCF41.3020501@etek.chalmers.se>
Message-ID: <web-1279309@free.net>

In message from Mikael Fredriksson <mike at etek.chalmers.se> (Mon, 15 
Jan 2007 20:00:17 +0100):
>This article: "SGI to offer Windows on clusters" might be worth 
>reading...
>
>http://www.computerworld.com/action/article.do?command=viewArticleBasic&articleId=9007859&source=NLT_PM&nlid=8
>Any comments?
   In any case it's bad news :-( 
SGI has solid reputation in HPC and university world, and may be 
somebody will be tempted. But it's interesting, w/which prices SGI 
will sell their clusters ? Hope that the price will be much more 
higher than for SGI Linux clusters ;-)

Mikhail Kuzminsky
Zelinsky Institute of Organic Chemistry
Moscow


From landman at scalableinformatics.com  Wed Jan 17 07:09:29 2007
From: landman at scalableinformatics.com (Joe Landman)
Date: Wed, 17 Jan 2007 10:09:29 -0500
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <web-1279309@free.net>
References: <web-1279309@free.net>
Message-ID: <45AE3C29.1090809@scalableinformatics.com>

Mikhail Kuzminsky wrote:
> In message from Mikael Fredriksson <mike at etek.chalmers.se> (Mon, 15 Jan 
> 2007 20:00:17 +0100):
>> This article: "SGI to offer Windows on clusters" might be worth 
>> reading...
>>
>> http://www.computerworld.com/action/article.do?command=viewArticleBasic&articleId=9007859&source=NLT_PM&nlid=8 
>>
>> Any comments?
>   In any case it's bad news :-( SGI has solid reputation in HPC and 
> university world, and may be somebody will be tempted. But it's 
> interesting, w/which prices SGI will sell their clusters ? Hope that the 
> price will be much more higher than for SGI Linux clusters ;-)

My understanding of pricing (for the windows portion) is that it adds 
(as an OS) $500USD to each node.  So for a 32 node machine, this is an 
extra $16k USD "tax" added on.  Doesn't include the absolutely necessary 
antivirus, anti-spyware, ...  Calling all that roughly $4k USD (roughly 
$125/node), we are looking at something closer to $20k extra per 32 
nodes.  So for 128 nodes, this adds $80k USD.  For 1024 nodes, this adds 
$640k USD.

My question has been on the CBA side.  What do you get for that extra 
tax that you don't get now?

Microsoft could simply be subsidizing this for SGI.  Others have (cough 
cough) for them.

Joe

(former SGI person)

-- 

Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics LLC,
email: landman at scalableinformatics.com
web  : http://www.scalableinformatics.com
phone: +1 734 786 8423
fax  : +1 734 786 8452 or +1 866 888 3112
cell : +1 734 612 4615


From ctierney at hypermall.net  Wed Jan 17 07:26:01 2007
From: ctierney at hypermall.net (Craig Tierney)
Date: Wed, 17 Jan 2007 08:26:01 -0700
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <45AE3C29.1090809@scalableinformatics.com>
References: <web-1279309@free.net> <45AE3C29.1090809@scalableinformatics.com>
Message-ID: <45AE4009.6030808@hypermall.net>

Joe Landman wrote:
> Mikhail Kuzminsky wrote:
>> In message from Mikael Fredriksson <mike at etek.chalmers.se> (Mon, 15 
>> Jan 2007 20:00:17 +0100):
>>> This article: "SGI to offer Windows on clusters" might be worth 
>>> reading...
>>>
>>> http://www.computerworld.com/action/article.do?command=viewArticleBasic&articleId=9007859&source=NLT_PM&nlid=8 
>>>
>>> Any comments?
>>   In any case it's bad news :-( SGI has solid reputation in HPC and 
>> university world, and may be somebody will be tempted. But it's 
>> interesting, w/which prices SGI will sell their clusters ? Hope that 
>> the price will be much more higher than for SGI Linux clusters ;-)
> 
> My understanding of pricing (for the windows portion) is that it adds 
> (as an OS) $500USD to each node.  So for a 32 node machine, this is an 
> extra $16k USD "tax" added on.  Doesn't include the absolutely necessary 
> antivirus, anti-spyware, ...  Calling all that roughly $4k USD (roughly 
> $125/node), we are looking at something closer to $20k extra per 32 
> nodes.  So for 128 nodes, this adds $80k USD.  For 1024 nodes, this adds 
> $640k USD.


> 
> My question has been on the CBA side.  What do you get for that extra 
> tax that you don't get now?

You mean like the "tax" that vendors who sell Redhat put on their
systems because it adds an extra cost to each node?  What do you get for
that?

I think that you would get a system that fits well into an existing
MS environment.  I also see getting a system where you don't have to
go through driver hell to get things working when the vendors don't
(or can't) get their drivers in the kernel.  I know of some very large
and smart organizations that cannot get their IB, perfctr, and
lustre patches working together correctly.  Why do I have to
have a kernel engineer on staff to make this stuff work?

I see the MS solution attractive to the ISVs where they only
have to build their and test their code once.  No building
for RH and Novell, actively ignoring Fedora, Debian, Gentoo,
and Unbutu, and then worrying about the interconnect and
version of MPI that happens to be used.

Most everyone on this list is smart and talented enough to solve
these problems.  MS isn't selling to us.

And no, I don't have any interesting in building an MS cluster
for all of the other problems it introduces.

> 
> Microsoft could simply be subsidizing this for SGI.  Others have (cough 
> cough) for them.
> 

You wouldn't?  Anyone trying to crack into a new market
would do so.

Craig

> Joe
> 
> (former SGI person)
> 


From landman at scalableinformatics.com  Wed Jan 17 08:54:16 2007
From: landman at scalableinformatics.com (Joe Landman)
Date: Wed, 17 Jan 2007 11:54:16 -0500
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <45AE4009.6030808@hypermall.net>
References: <web-1279309@free.net> <45AE3C29.1090809@scalableinformatics.com>
	<45AE4009.6030808@hypermall.net>
Message-ID: <45AE54B8.80001@scalableinformatics.com>

Craig Tierney wrote:

>> My understanding of pricing (for the windows portion) is that it adds 
>> (as an OS) $500USD to each node.  So for a 32 node machine, this is an 
>> extra $16k USD "tax" added on.  Doesn't include the absolutely 
>> necessary antivirus, anti-spyware, ...  Calling all that roughly $4k 
>> USD (roughly $125/node), we are looking at something closer to $20k 
>> extra per 32 nodes.  So for 128 nodes, this adds $80k USD.  For 1024 
>> nodes, this adds $640k USD.
> 
> 
>>
>> My question has been on the CBA side.  What do you get for that extra 
>> tax that you don't get now?
> 
> You mean like the "tax" that vendors who sell Redhat put on their
> systems because it adds an extra cost to each node? 

That is correct.

> What do you get for
> that?

Good question.  Why don't you ask the people that do this.

> I think that you would get a system that fits well into an existing
> MS environment.  I also see getting a system where you don't have to

Curiously, well designed and implemented Linux clusters also integrate 
quite nicely into MS environments.  They have for years.

> go through driver hell to get things working when the vendors don't
> (or can't) get their drivers in the kernel.  I know of some very large

[scratch scratch]  so you have cluster vendors delivering things where 
there are no drivers?  And you pay them for this?  Or am I missing what 
you are saying here?

If you are talking about driver hell, I presume you have not ever 
installed MS WinNT/2k/XP?

> and smart organizations that cannot get their IB, perfctr, and
> lustre patches working together correctly.  Why do I have to
> have a kernel engineer on staff to make this stuff work?

Ah.... ok.  You are talking about getting drivers into the kernel.  This 
is different.  Small/large windows vendors also never (ever) get their 
drivers into the kernel.  They are built as DLLs (a.k.a. kernel 
modules).  Are you blaming Linux for not being able to enable code which 
cannot be built as a driver, but requires kernel patches to be 
responsible for it not being able to be built as a driver?  You are not 
blaming the driver authors?  I can build xfs as a kernel module.  Works 
fine on RH and similar systems where it is not included natively.

Which IB BTW?  IB is in the kernel now.

Looking up Perfctr inclusion, see http://lwn.net/Articles/203731/ at 
bottom.  It might be that the author does not wish to go through the 
process again.

> I see the MS solution attractive to the ISVs where they only
> have to build their and test their code once.  No building

Oddly enough the ISVs tend to follow where the customers are going.  We 
haven't seen many customers ask for an all windows shop (even on 
computing systems).  They (ISVs) went to Linux as their customers asked 
them to.  In the process they were able to whittle away OSes that have 
effectively died from the perspective of HPC purchasing.  This had the 
net effect of reducing the ISVs costs (reduction in supported platforms 
and testing).  Most of the ISVs we have spoken with are aiming at 2 
platforms going forward.  Many had been burned in the past by being on a 
single platform when that platform fell out of favor.

> for RH and Novell, actively ignoring Fedora, Debian, Gentoo,
> and Unbutu, and then worrying about the interconnect and

This is a problem with all Linux now.  One I am personally frustrated 
with.  Linux != Redhat, despite RH's best efforts (and SuSEs, but that 
is another story).  If we can get people to write to the standards 
(LSB), things will work nicely.  Right now they are not doing that, 
which means that Linux is rapidly becoming RH in the eyes of the 
customers.  And RH uses positively ancient kernels.  New system support 
is painful if you use RH.  SATA anyone?  NUMA?

We are currently working with two different accelerator cards that work 
wonderfully under RH and related distros, and not at all under late 
model distros (including FCx where x>3).  It has to do with how they 
wrote it.  They built in lots or RH-isms.  Which warms the cockles of 
RH's heart.

(n.b.  I have nothing against RH.  I simply disagree with their choices 
to ignore good file systems in the face of ones that don't work as well 
for large volumes/systems/high speed/highly reliable IO.  That and that 
they have positively ancient kernels which tends to have all the bugs 
and few of the fixes of the old kernels ... which has been explained to 
me before, but did not make business/technological sense then or now.  I 
do like and use RHEL4 and free variants when appropriate).

> version of MPI that happens to be used.

This is a problem, and one I have complained about before.  So many 
MPIs.  Completely missing binary compatibility.  Massively exploding 
test matrix.  Settle on one and move forward.  This causes *everyone* 
grief.  And it costs money.  We have MPICH, MPICH2, LAM, mvapich, 
mvapich2, OpenMPI, ... .  Then you can build each of these with 
different compilers (gcc 3.x, 4.x, intel, PGI, PathScale).  This all 
before you hit the commercial variants (Scali, ...).

This is nuts folks.  We need one binary interface and specification. 
Once set of libraries to link to.  Been muttering about this for years 
... :(

> Most everyone on this list is smart and talented enough to solve
> these problems.  MS isn't selling to us.

I would like to say of course not, but from what I have seen, they are 
going after the places that quite a few of us on this list work at/with.

I do believe they can add value.  I am just not convinced they are going 
about it the right way.  Their HPC efforts appear to be just a tactic in 
the extension of the "crush linux" strategy.  We (my company) do believe 
that closer integration of HPC resources is important, and enabling end 
users easier use and management of HPC from their desktops, laptops, and 
PDA-phones is a good thing.  We agree with Microsoft on that part.

> And no, I don't have any interesting in building an MS cluster
> for all of the other problems it introduces.

We follow our customers requests.  Haven't had any for windows clusters 
to date.  Might happen, and if it does, we will execute against it. 
Sort of like the Solaris 10 clusters.


>> Microsoft could simply be subsidizing this for SGI.  Others have 
>> (cough cough) for them.
>>
> 
> You wouldn't?  Anyone trying to crack into a new market
> would do so.

Not anyone.  SGI has been subsidized by others before (including, 
briefly in the past, MSFT).  I haven't seen it ever result in anything 
other than a disaster for the subsidizer.  Then again, SGI is under 
mostly new leadership, so hopefully the mistakes of the past are 
actually in the past.


-- 

Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics LLC,
email: landman at scalableinformatics.com
web  : http://www.scalableinformatics.com
phone: +1 734 786 8423
fax  : +1 734 786 8452 or +1 866 888 3112
cell : +1 734 612 4615


From landman at scalableinformatics.com  Wed Jan 17 09:32:24 2007
From: landman at scalableinformatics.com (Joe Landman)
Date: Wed, 17 Jan 2007 12:32:24 -0500
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <45AE54B8.80001@scalableinformatics.com>
References: <web-1279309@free.net>
	<45AE3C29.1090809@scalableinformatics.com>	<45AE4009.6030808@hypermall.net>
	<45AE54B8.80001@scalableinformatics.com>
Message-ID: <45AE5DA8.7090307@scalableinformatics.com>

Joe Landman wrote:

gaak...

> Ah.... ok.  You are talking about getting drivers into the kernel.  This 
> is different.  Small/large windows vendors also never (ever) get their 
> drivers into the kernel.  They are built as DLLs (a.k.a. kernel 
> modules).  Are you blaming Linux for not being able to enable code which 

this is ok, but for below ...

s/driver/module/g

> cannot be built as a driver, but requires kernel patches to be 
> responsible for it not being able to be built as a driver?  You are not 
> blaming the driver authors?  I can build xfs as a kernel module.  Works 
> fine on RH and similar systems where it is not included natively.

PBC (posted before coffee)

-- 

Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics LLC,
email: landman at scalableinformatics.com
web  : http://www.scalableinformatics.com
phone: +1 734 786 8423
fax  : +1 734 786 8452 or +1 866 888 3112
cell : +1 734 612 4615


From James.P.Lux at jpl.nasa.gov  Wed Jan 17 10:12:46 2007
From: James.P.Lux at jpl.nasa.gov (Jim Lux)
Date: Wed, 17 Jan 2007 10:12:46 -0800
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <45AE3C29.1090809@scalableinformatics.com>
References: <web-1279309@free.net> <45AE3C29.1090809@scalableinformatics.com>
Message-ID: <6.2.3.4.2.20070117101040.0337f2b8@mail.jpl.nasa.gov>

At 07:09 AM 1/17/2007, Joe Landman wrote:
>Mikhail Kuzminsky wrote:
>>In message from Mikael Fredriksson <mike at etek.chalmers.se> (Mon, 15 
>>Jan 2007 20:00:17 +0100):
>>>This article: "SGI to offer Windows on clusters" might be worth reading...
>>>
>>>http://www.computerworld.com/action/article.do?command=viewArticleBasic&articleId=9007859&source=NLT_PM&nlid=8 
>>>
>>>Any comments?
>>   In any case it's bad news :-( SGI has solid reputation in HPC 
>> and university world, and may be somebody will be tempted. But 
>> it's interesting, w/which prices SGI will sell their clusters ? 
>> Hope that the price will be much more higher than for SGI Linux clusters ;-)
>
>My understanding of pricing (for the windows portion) is that it 
>adds (as an OS) $500USD to each node.  So for a 32 node machine, 
>this is an extra $16k USD "tax" added on.  Doesn't include the 
>absolutely necessary antivirus, anti-spyware, ...

Probably wouldn't be that expensive, especially if you boot the same 
image on all nodes of the cluster.  Update one, update all.

Site licenses for AV and AS software are heavily discounted from 
retail, as well.

Is this Windows clustering version, too?


>   Calling all that roughly $4k USD (roughly $125/node), we are 
> looking at something closer to $20k extra per 32 nodes.  So for 128 
> nodes, this adds $80k USD.  For 1024 nodes, this adds $640k USD.
>
>My question has been on the CBA side.  What do you get for that 
>extra tax that you don't get now?
>
>Microsoft could simply be subsidizing this for SGI.  Others have 
>(cough cough) for them.


From mathog at caltech.edu  Wed Jan 17 10:20:04 2007
From: mathog at caltech.edu (David Mathog)
Date: Wed, 17 Jan 2007 10:20:04 -0800
Subject: [Beowulf] SGI to offer Windows on clusters
Message-ID: <E1H7FOG-0006T0-6W@mendel.bio.caltech.edu>

Mikael Fredriksson wrote
> 
> This article: "SGI to offer Windows on clusters" might be worth reading...
> 
http://www.computerworld.com/action/article.do?command=viewArticleBasic&articleId=9007859&source=NLT_PM&nlid=8
> 
> 
> Any comments?

Seems like it's going to cost $$$ in extra work to keep a cluster like
that running.

For starters it would need to have the equivalent of a Windows site
license on the cluster, or it would be pure hell to activate, image,
and so forth.  Security patches are also going to be interesting,
since the target market is Windows servers talking to Windows clients
one can't just firewall off the cluster and ignore it.  But do you
really want your production cluster running automatic updates?  I
think not!  There's also the general issues of managing so
many machines.  I guess you could set up sshd on them and run 
a lot of scripts, but historically I've always found that there's some
damn piece of Windows that's only accessible through a GUI, and
who wants to point and click a hundred times, once per node, to do the
same thing on every node? Finally there's the basic question of:
"how is this a cluster"?  Sure you can have N nodes splitting a load
under windows, typically by just shunting jobs around at the network
level, but in terms of working together as even a loosely coupled
whole how is that implemented?  Especially if the nodes are just running
off the shelf, single node type software.

For these reasons I suspect that running a big cluster of windows
machines isn't going to be an option unless there's a whole lot of
magic rolled into "Windows Compute Cluster".  And if they can roll
in that much magic they really should target that software differently,
as "Windows Multiple Workstation/Server Management Console".  That is
a tool I'd like to have now, not for clusters, but for groups of
workstations, which invariably all need the same software installs
and/or tweaks but which would take hours to reimage.

Regards,

David Mathog
mathog at caltech.edu
Manager, Sequence Analysis Facility, Biology Division, Caltech


From landman at scalableinformatics.com  Wed Jan 17 10:33:18 2007
From: landman at scalableinformatics.com (Joe Landman)
Date: Wed, 17 Jan 2007 13:33:18 -0500
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <6.2.3.4.2.20070117101040.0337f2b8@mail.jpl.nasa.gov>
References: <web-1279309@free.net> <45AE3C29.1090809@scalableinformatics.com>
	<6.2.3.4.2.20070117101040.0337f2b8@mail.jpl.nasa.gov>
Message-ID: <45AE6BEE.9050001@scalableinformatics.com>

Jim Lux wrote:

>> My understanding of pricing (for the windows portion) is that it adds 
>> (as an OS) $500USD to each node.  So for a 32 node machine, this is an 
>> extra $16k USD "tax" added on.  Doesn't include the absolutely 
>> necessary antivirus, anti-spyware, ...
> 
> Probably wouldn't be that expensive, especially if you boot the same 

It is.  List is 479$US per compute node.

> image on all nodes of the cluster.  Update one, update all.

I agree.  If it were 479$ per cluster (of any size, sorta like you can 
do with Linux for $0 per cluster), this would be interesting.

> Site licenses for AV and AS software are heavily discounted from retail, 
> as well.

$125 per node is guess.  Even if it is $60, or $20.  Basic idea is the same.

Costs scale per node for this path.  It adds cost.  Whether or not the 
benefit one derives from these costs is worth it is important, as is 
whether or not the benefit one derives from the alternatives are higher 
or lower.

> Is this Windows clustering version, too?

Yes.


-- 

Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics LLC,
email: landman at scalableinformatics.com
web  : http://www.scalableinformatics.com
phone: +1 734 786 8423
fax  : +1 734 786 8452 or +1 866 888 3112
cell : +1 734 612 4615


From dag at sonsorol.org  Wed Jan 17 10:58:56 2007
From: dag at sonsorol.org (Chris Dagdigian)
Date: Wed, 17 Jan 2007 13:58:56 -0500
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <E1H7FOG-0006T0-6W@mendel.bio.caltech.edu>
References: <E1H7FOG-0006T0-6W@mendel.bio.caltech.edu>
Message-ID: <8FA7EE9E-6613-41B6-AB49-BFA36ADA3439@sonsorol.org>


There was an interesting thread on MS HPC on this list in the past,  
rather than retype I'll post the URL to my older post:

http://www.beowulf.org/archive/2006-June/015721.html

For those that don't want to follow the link -- there is probably a  
decent market for the MS HPC product, especially for small or  
dedicated systems that may be servicing data collection devices like  
lab instruments or imaging systems that live in non-datacenter  
environments where single point of contact support from an ISV who  
knows your particular domain/market/field cold is essential.  There  
are many non-HCP markets now which need significant compute power  
that is accessible and usable by non HPC specialists and if MS can  
line up the proper systems integrators, consultants and resellers who  
can service the specialized markets then this could be pretty  
successful.

My company has had a Rocketcalc system running MS Cluster Server 2003  
in our colo cage for quite some time now, the feedback from the  
individual actively using it has been very positive.  Regardless of  
what happens it will be an interesting thing to watch.

Regards,
Chris


On Jan 17, 2007, at 1:20 PM, David Mathog wrote:

> Mikael Fredriksson wrote
>>
>> This article: "SGI to offer Windows on clusters" might be worth  
>> reading...
>>
> http://www.computerworld.com/action/article.do? 
> command=viewArticleBasic&articleId=9007859&source=NLT_PM&nlid=8
>>
>>
>> Any comments?
>
> Seems like it's going to cost $$$ in extra work to keep a cluster like
> that running.
>
> For starters it would need to have the equivalent of a Windows site
> license on the cluster, or it would be pure hell to activate, image,
> and so forth.  Security patches are also going to be interesting,
> since the target market is Windows servers talking to Windows clients
> one can't just firewall off the cluster and ignore it.  But do you
> really want your production cluster running automatic updates?  I
> think not!  There's also the general issues of managing so
> many machines.  I guess you could set up sshd on them and run
> a lot of scripts, but historically I've always found that there's some
> damn piece of Windows that's only accessible through a GUI, and
> who wants to point and click a hundred times, once per node, to do the
> same thing on every node? Finally there's the basic question of:
> "how is this a cluster"?  Sure you can have N nodes splitting a load
> under windows, typically by just shunting jobs around at the network
> level, but in terms of working together as even a loosely coupled
> whole how is that implemented?  Especially if the nodes are just  
> running
> off the shelf, single node type software.
>
> For these reasons I suspect that running a big cluster of windows
> machines isn't going to be an option unless there's a whole lot of
> magic rolled into "Windows Compute Cluster".  And if they can roll
> in that much magic they really should target that software  
> differently,
> as "Windows Multiple Workstation/Server Management Console".  That is
> a tool I'd like to have now, not for clusters, but for groups of
> workstations, which invariably all need the same software installs
> and/or tweaks but which would take hours to reimage.
>
> Regards,
>
> David Mathog
> mathog at caltech.edu
> Manager, Sequence Analysis Facility, Biology Division, Caltech
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit  
> http://www.beowulf.org/mailman/listinfo/beowulf


From mike at etek.chalmers.se  Tue Jan 16 23:50:39 2007
From: mike at etek.chalmers.se (Mikael Fredriksson)
Date: Wed, 17 Jan 2007 08:50:39 +0100
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <433093DF7AD7444DA65EFAFE3987879C351384@orca.penguincomputing.com>
References: <433093DF7AD7444DA65EFAFE3987879C351384@orca.penguincomputing.com>
Message-ID: <45ADD54F.3090202@etek.chalmers.se>

Michael Will wrote:
> That is an interesting article... So I assume they went chapter 11 and
> microsoft ressurrected them as a vehicle to market their cluster
> solution?

Jepp.  It was no secret that SGI was missmanaged under a lot of years
and thus went bankrup.  But, their reputation is *still* better than
Microsofts...  ;-)


> Just wait for the Zombie processes ;-)

A whole cluster full... :-)


MF


From mike at etek.chalmers.se  Tue Jan 16 23:50:08 2007
From: mike at etek.chalmers.se (Mikael Fredriksson)
Date: Wed, 17 Jan 2007 08:50:08 +0100
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <45AD57B4.9060106@uiowa.edu>
References: <45ABCF41.3020501@etek.chalmers.se> <45AD57B4.9060106@uiowa.edu>
Message-ID: <45ADD530.90000@etek.chalmers.se>

Eric Shook wrote:
> I talked to our SGI rep about this yesterday and he told me they are not 
> really targeting "hard-core" university research where Linux/UNIX 
> already has a strong foot hold.  Instead this is for the Business sector 
> where simplified workflows and having easy HPC integration into an 
> already 100% Windows Infrastructure is more appealing.
> 
> This was his take and it seemed reasonable to me.

Yes, it is.  And more so if this cluster/LAN can also utilize som type
of "MOSIX" system.  This will substatially increase the throughput of
"standard serial" processes.  But as stated in a previous thread, the
"hard-core" systems are fairly specialized.

MF


From ashley at quadrics.com  Thu Jan 18 02:30:21 2007
From: ashley at quadrics.com (Ashley Pittman)
Date: Thu, 18 Jan 2007 10:30:21 +0000
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <45ADD530.90000@etek.chalmers.se>
References: <45ABCF41.3020501@etek.chalmers.se> <45AD57B4.9060106@uiowa.edu>
	<45ADD530.90000@etek.chalmers.se>
Message-ID: <1169116222.4365.16.camel@localhost.localdomain>

On Wed, 2007-01-17 at 08:50 +0100, Mikael Fredriksson wrote:
> Eric Shook wrote:
> > I talked to our SGI rep about this yesterday and he told me they are not 
> > really targeting "hard-core" university research where Linux/UNIX 
> > already has a strong foot hold.  Instead this is for the Business sector 
> > where simplified workflows and having easy HPC integration into an 
> > already 100% Windows Infrastructure is more appealing.
> > 
> > This was his take and it seemed reasonable to me.
> 
> Yes, it is.  And more so if this cluster/LAN can also utilize som type
> of "MOSIX" system.  This will substatially increase the throughput of
> "standard serial" processes.

I find this statement hard to comprehend, how can any OS substantially
improve throughput of jobs unless what it replaces is incredibly
deficient in some way?  The limiting factor on clusters is the speed of
the hardware, even if some OS magically manages to be say 50% more
efficient doing it's bit than another OS it's still only a tiny percent
of time used, substantial improvements in job throughput can only come
about from better parallel algorithms, better code or faster hardware.

> But as stated in a previous thread, the "hard-core" systems are fairly specialized.

I think you may be surprised if you actually used a hard-core system,
whilst it's true that they are more than the sum of their parts the
parts are mostly that of a bog standard Linux distribution.

I suppose it could be true that changing to OS to Windows would make
them less specialised however that probably says more about Windows than
it does about "hard-core" clusters, I've no idea if this statement would
be true any more however, I've not used Windows in a number of years.

Ashley,


From diep at xs4all.nl  Thu Jan 18 04:43:26 2007
From: diep at xs4all.nl (Vincent Diepeveen)
Date: Thu, 18 Jan 2007 13:43:26 +0100
Subject: [Beowulf] SGI to offer Windows on clusters
References: <433093DF7AD7444DA65EFAFE3987879C351384@orca.penguincomputing.com>
	<45ADD54F.3090202@etek.chalmers.se>
Message-ID: <004601c73afe$41b0d2c0$0300a8c0@gourmandises>

It is a powerful combination microsoft + sgi.

Not because of the mouse saying to the elephant when crossing a river over a 
wooden bridge: "We make a lot of noise don't we?"

But rather because SGI has something microsoft can't deliver. Machine that 
to a limited number of cpu's provide a shared memory environment to windows 
users meanwhile having a real fast latency.

They can together offer a product that scales real well for users up to a 
brick or 8 == 32 sockets,
perhaps even 16 bricks. That'll be like 128+ cores.

Very powerful.
.
That will kick butt for windows software that is not embarrassingly parallel 
but with a little rewrite works fine
using shared memory.
Of course windows will require to rewrite the virtual adress space 
management in the kernel completely to get all such
software work well on such a machine.
But imagine the extra computing power it gives to companies for a relative 
small extra investment.
What will a 64 socket node cost now, like half a million or so?

Not peanuts, but price of SGI will go down for those machines bigtime when 
they start selling them by the zillion,
which will increase their market share even more. They could already take 
the risk of lowering the price of those machine.

Of course assuming they have some kind of monopoly on delivering a shared 
memory machine, and assuming some
8 socket K8 machine isn't a lot cheaper / faster.

Now their only problem will be letting windows customers figure out they can 
run their application faster on SGI,
but in that respect the elephant will make enough noise on its own already 
to get that done.

If SGI can deliver this package using some K8L chip they would dominate soon 
the highend market, because of delivering it
at a relative competative price.

Vincent

----- Original Message ----- 
From: "Mikael Fredriksson" <mike at etek.chalmers.se>
To: <Beowulf at beowulf.org>
Sent: Wednesday, January 17, 2007 8:50 AM
Subject: Re: [Beowulf] SGI to offer Windows on clusters


> Michael Will wrote:
>> That is an interesting article... So I assume they went chapter 11 and
>> microsoft ressurrected them as a vehicle to market their cluster
>> solution?
>
> Jepp.  It was no secret that SGI was missmanaged under a lot of years
> and thus went bankrup.  But, their reputation is *still* better than
> Microsofts...  ;-)
>
>
>
>> Just wait for the Zombie processes ;-)
>
> A whole cluster full... :-)
>
>
>
> MF
>
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit 
> http://www.beowulf.org/mailman/listinfo/beowulf
> 


From rbw at ahpcrc.org  Thu Jan 18 06:17:34 2007
From: rbw at ahpcrc.org (Richard Walsh)
Date: Thu, 18 Jan 2007 08:17:34 -0600
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <1169116222.4365.16.camel@localhost.localdomain>
References: <45ABCF41.3020501@etek.chalmers.se>
	<45AD57B4.9060106@uiowa.edu>	<45ADD530.90000@etek.chalmers.se>
	<1169116222.4365.16.camel@localhost.localdomain>
Message-ID: <45AF817E.7020009@ahpcrc.org>

Ashley Pittman wrote:
> On Wed, 2007-01-17 at 08:50 +0100, Mikael Fredriksson wrote
>> Yes, it is.  And more so if this cluster/LAN can also utilize som type
>> of "MOSIX" system.  This will substatially increase the throughput of
>> "standard serial" processes.
>>     
>
> I find this statement hard to comprehend, how can any OS substantially
> improve throughput of jobs unless what it replaces is incredibly
> deficient in some way?  The limiting factor on clusters is the speed of
> the hardware, even if some OS magically manages to be say 50% more
> efficient doing it's bit than another OS it's still only a tiny percent
> of time used, substantial improvements in job throughput can only come
> about from better parallel algorithms, better code or faster hardware.
>
>   
     While I agree with this argument, especially at small scale, at 
very large scale operating
     system derived load imbalance (so-called skew, due to the random 
nature of system
     call driven interrupts) can destroy scalability, and thus 
efficiency.  This is worth mentioning,
     although I would not expect Windows to improve on Linux in this 
context.  You need
     a light-weight kernel like Catamount to reduce skew.

     There is a very good paper showing the effects of skew at scale by 
Kerberyson, et al from
     Sandia. 

     rbw

-- 

Richard B. Walsh

"The world is given to me only once, not one existing and one
 perceived. The subject and object are but one."

Erwin Schroedinger

Project Manager
Network Computing Services, Inc.
Army High Performance Computing Research Center (AHPCRC)
rbw at ahpcrc.org  |  612.337.3467

-----------------------------------------------------------------------
This message (including any attachments) may contain proprietary or
privileged information, the use and disclosure of which is legally
restricted.  If you have received this message in error please notify
the sender by reply message, do not otherwise distribute it, and delete
this message, with all of its contents, from your files.
----------------------------------------------------------------------- 


From eric-shook at uiowa.edu  Thu Jan 18 06:53:31 2007
From: eric-shook at uiowa.edu (Eric Shook)
Date: Thu, 18 Jan 2007 08:53:31 -0600
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <1169116222.4365.16.camel@localhost.localdomain>
References: <45ABCF41.3020501@etek.chalmers.se>
	<45AD57B4.9060106@uiowa.edu>	<45ADD530.90000@etek.chalmers.se>
	<1169116222.4365.16.camel@localhost.localdomain>
Message-ID: <45AF89EB.40600@uiowa.edu>


Ashley Pittman wrote:
> On Wed, 2007-01-17 at 08:50 +0100, Mikael Fredriksson wrote:
>> Eric Shook wrote:
>>> I talked to our SGI rep about this yesterday and he told me they are not 
>>> really targeting "hard-core" university research where Linux/UNIX 
>>> already has a strong foot hold.  Instead this is for the Business sector 
>>> where simplified workflows and having easy HPC integration into an 
>>> already 100% Windows Infrastructure is more appealing.
>>>
>>> This was his take and it seemed reasonable to me.
>> Yes, it is.  And more so if this cluster/LAN can also utilize som type
>> of "MOSIX" system.  This will substatially increase the throughput of
>> "standard serial" processes.
> 
> I find this statement hard to comprehend, how can any OS substantially
> improve throughput of jobs unless what it replaces is incredibly
> deficient in some way?  The limiting factor on clusters is the speed of
> the hardware, even if some OS magically manages to be say 50% more
> efficient doing it's bit than another OS it's still only a tiny percent
> of time used, substantial improvements in job throughput can only come
> about from better parallel algorithms, better code or faster hardware.
> 


Actually there are a few case studies floating around comparing Linux to 
Windows (not sure about UNIX).  That when running on identical hardware 
and the same code you can lose up to 30% efficiency running on Windows. 
  I am too lazy to try to find my supporting evidence but for the last 2 
years at SC05/06 there have been such studies on the show floor (I don't 
think they made it to the papers/posters section of the conference, so I 
cannot comment on the quality of the study).

Eric

>> But as stated in a previous thread, the "hard-core" systems are fairly specialized.
> 
> I think you may be surprised if you actually used a hard-core system,
> whilst it's true that they are more than the sum of their parts the
> parts are mostly that of a bog standard Linux distribution.
> 
> I suppose it could be true that changing to OS to Windows would make
> them less specialised however that probably says more about Windows than
> it does about "hard-core" clusters, I've no idea if this statement would
> be true any more however, I've not used Windows in a number of years.
> 

> Ashley,
> 
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf

-- 
Eric Shook (319) 335-6714
Technical Lead, Systems and Operations - GROW
http://grow.uiowa.edu


From peter.st.john at gmail.com  Thu Jan 18 07:00:51 2007
From: peter.st.john at gmail.com (Peter St. John)
Date: Thu, 18 Jan 2007 10:00:51 -0500
Subject: [Beowulf] array-processing programming languages
Message-ID: <e4d4fd070701180700p5e126308pebba6e5bb67fc1f5@mail.gmail.com>

Hello folks,

Does anybody use the array-processing language ZPL? The last reference I can
find to it is about two years ago; for example, the latest nightly "Cutting
Edge" build was: *"Last reflected modification: Tue Nov 16 18:55:55 PST 2004
"*  (from http://www.cs.washington.edu/research/zpl/download/download.html).


I will have to rewrite some of my own stuff to get a better trade-off of
space for time, than made sense when I originally wrote it in the mid-90's.
I'd be interested in any suggestions for language choices.

Peter St.John

P.S. I'm a software developer new to this list. My interest comes from
parallelizable combinatorial optimization algorthms (I have my own nutty way
to do genetic algorithms) and the possibility of building my own small
cluster.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070118/38bd740d/attachment.html>

From ashley at quadrics.com  Thu Jan 18 07:20:22 2007
From: ashley at quadrics.com (Ashley Pittman)
Date: Thu, 18 Jan 2007 15:20:22 +0000
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <45AF817E.7020009@ahpcrc.org>
References: <45ABCF41.3020501@etek.chalmers.se> <45AD57B4.9060106@uiowa.edu>
	<45ADD530.90000@etek.chalmers.se>
	<1169116222.4365.16.camel@localhost.localdomain>
	<45AF817E.7020009@ahpcrc.org>
Message-ID: <1169133623.4365.65.camel@localhost.localdomain>

On Thu, 2007-01-18 at 08:17 -0600, Richard Walsh wrote:
> Ashley Pittman wrote:
> > On Wed, 2007-01-17 at 08:50 +0100, Mikael Fredriksson wrote
> >> Yes, it is.  And more so if this cluster/LAN can also utilize som type
> >> of "MOSIX" system.  This will substatially increase the throughput of
> >> "standard serial" processes.
> >>     
> >
> > I find this statement hard to comprehend, how can any OS substantially
> > improve throughput of jobs unless what it replaces is incredibly
> > deficient in some way?  The limiting factor on clusters is the speed of
> > the hardware, even if some OS magically manages to be say 50% more
> > efficient doing it's bit than another OS it's still only a tiny percent
> > of time used, substantial improvements in job throughput can only come
> > about from better parallel algorithms, better code or faster hardware.
> >
> >   
>      While I agree with this argument, especially at small scale, at 
> very large scale operating
>      system derived load imbalance (so-called skew, due to the random 
> nature of system
>      call driven interrupts) can destroy scalability, and thus 
> efficiency.  This is worth mentioning,
>      although I would not expect Windows to improve on Linux in this 
> context.  You need
>      a light-weight kernel like Catamount to reduce skew.

Absolutely, it's one of the more interesting challenges facing large
scale cluster development currently.  Large scale in this context is
>1000 nodes however, I don't think this is the market Microsoft is
targeting.

I could of course argue that the answer to the skew problem is going to
be a algorithmic one which would mean my previous statement holds.

Ashley,


From rgb at phy.duke.edu  Thu Jan 18 07:42:30 2007
From: rgb at phy.duke.edu (Robert G. Brown)
Date: Thu, 18 Jan 2007 10:42:30 -0500 (EST)
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <45AF817E.7020009@ahpcrc.org>
References: <45ABCF41.3020501@etek.chalmers.se> <45AD57B4.9060106@uiowa.edu>
	<45ADD530.90000@etek.chalmers.se>
	<1169116222.4365.16.camel@localhost.localdomain>
	<45AF817E.7020009@ahpcrc.org>
Message-ID: <Pine.LNX.4.64.0701180945290.26071@lilith.rgb.private.net>

On Thu, 18 Jan 2007, Richard Walsh wrote:

> Ashley Pittman wrote:
>> On Wed, 2007-01-17 at 08:50 +0100, Mikael Fredriksson wrote
>>> Yes, it is.  And more so if this cluster/LAN can also utilize som type
>>> of "MOSIX" system.  This will substatially increase the throughput of
>>> "standard serial" processes.
>>> 
>> 
>> I find this statement hard to comprehend, how can any OS substantially
>> improve throughput of jobs unless what it replaces is incredibly
>> deficient in some way?  The limiting factor on clusters is the speed of
>> the hardware, even if some OS magically manages to be say 50% more
>> efficient doing it's bit than another OS it's still only a tiny percent
>> of time used, substantial improvements in job throughput can only come
>> about from better parallel algorithms, better code or faster hardware.
>>
>>
>    While I agree with this argument, especially at small scale, at very 
> large scale operating
>    system derived load imbalance (so-called skew, due to the random nature 
> of system
>    call driven interrupts) can destroy scalability, and thus efficiency. 
> This is worth mentioning,
>    although I would not expect Windows to improve on Linux in this context. 
> You need
>    a light-weight kernel like Catamount to reduce skew.
>
>    There is a very good paper showing the effects of skew at scale by 
> Kerberyson, et al from
>    Sandia.

It also isn't the point.  Nobody, and I mean nobody in this universe,
analyzes and compares WinXX and Linux from a performance point of view.
WinXX isn't engineered for performance per se, it is engineered for its
marketability first and foremost, its real or perceived
user-friendliness at the desktop (as a major component of its
marketability), its ability to permit less-than-genius admins to manage
a complex, networked, operating environment (marketability), and its
real or perceived availability of corporate support for same.
Performance is a distant last place, way down there beneath stability
(and look how well they do with stability) and ensuring that their
perception as the only platform that makes sense for the development of
commercial software remains unchallenged.

Stability is important to sales, especially in a server environment and
as I previously noted pro-grade systems people can make a WinXX Server
setup sufficiently stable for production purposes.  The (correct)
perception by developers that it is corporate suicide to develop for a
presumed corporate linux network that extends to the desktop is also
critical to MS's business plan and continued success.  Performance is a
competitive metric, and would only matter if they had any competition or
if their performance isn't "adequate".

Linux remains around the 1% level in US desktop occupancy, and even in
the pacific rim where its numbers are the best it only makes it to 3% or
so (the rest are doubtless mostly bootleg WinXX).  The largest monopoly
ever to exist in the history of the world laughs at these numbers.  On
the broader server market Linux fares better, but it is still very much
David against Goliath where David may appear sometimes to be winning,
but Goliath has yet to be hit in the head with any kind of stone.

At this point, with massive investment in MS stock on the part of
corporate retirement and pension accounts, hitting MS in the head with a
killer stone would probably trigger a nationwide panic or even a
depression.  It could still happen, but I really expect that killing
Goliath may take years of nibbling at his heels and not a single blow,
with Goliath fighting back and changing form all the way to try to avoid
his fate.

Basically, MS's cluster product is almost certainly designed to do two
things.  One is provide them with a credible presence in the cluster
market not because it is particularly important to them as a profit
center but because hurting linux and the other unices strengthens their
position in the general server market in many ways.  They do not want
corporate admins or execs to be forced into installing a linux cluster
in a WinXX server environment as it is way too likely that they'll learn
that Linux can take over their WinXX server responsibilities and save
them a bundle, once they've already invested in linux admins and gotten
over the startup "hump".  The other is to defend against the possibility
that a "killer app" might emerge in that marketplace, or that the apps
in that marketplace might attract enough software companies into the at
least part time linux development market that they, too, might start
looking at porting their software in general to linux.

>From what I can see, there are various things gradually lowering the
barrier between linux development and Windows development -- making it
easier to port Windows code directly to linux with a recompile, making
it easier to run Windows code directly within linux without a Windows
license.  Wine/cedega, vmware, win4lin and others on the one hand,
cross-architecture development libraries on the other hand.  Microsoft
has a strong interest in maintaining those barriers and doubtless moves
things around to keep code porting difficult (compare how easy it is to
move code between unices to how difficult it is to move between linux
and MS, and how much expense that adds to the task of maintaining a code
base in both worlds).

So the MS Cluster will doubtless run shrink-wrapped software in
corporate WinXX-only server rooms, permitting them to pay MS $500 a box
(or whatever) rather than hire another two or three $80K/year employees
to run a linux cluster, and control the cluster, policy, and the
application from the familiar Windows toplevel GUI.  For a single task,
and a cluster with only 16 or so nodes, that's a total bargain.  Even
linux cluster consulting companies would be hard-pressed to provide a
turnkey cluster and remote manage it (necessary to avoid hiring those
local linux admins) for only $8-10K in margin.  For larger numbers of
nodes, of course, at some point this argument doesn't scale, but even
there you come up against MS's formidable sales force, who will whack
you on the head with all sorts of TCO FUD, support FUD, ease of use FUD,
and ultimately may well convince you that it is easier and cheaper to
just pay them a 20-40% "tax" per node out to hundreds of nodes than it
is to take that shiftless, irresponsible, hippie-supported idealistic
"movement" that is linux into their pristine server rooms.

They'll win some of these arguments, lose others.  And not care much
either way.  Their purpose is served.  What they continue to risk is the
gradual movement of BIG companies away from Windows-dominated server
rooms and the even gradualler movement of a very few of those companies
towards a corporate linux desktop.  The last thing in the world they
want is companies publically trumpeting major savings from converting to
linux on an enterprise level.  Yet the barrier to doing such a
conversion is gradually coming down, and the number of big companies
with a significant linux presence grows.

So it isn't about performance, it is about presence.  SGI is a big, high
visibility deal and provides them with FUD-war ammunition, a way to slow
the bleed of server sites to linux, and which will almost certainly pay
for itself even if it is little more than a batch queue program with a
nice GUI and perhaps a STABLE API -- something linux could certainly
use.

    rgb

-- 
Robert G. Brown	                       http://www.phy.duke.edu/~rgb/
Duke University Dept. of Physics, Box 90305
Durham, N.C. 27708-0305
Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu


From deadline at eadline.org  Thu Jan 18 07:43:22 2007
From: deadline at eadline.org (Douglas Eadline)
Date: Thu, 18 Jan 2007 10:43:22 -0500 (EST)
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <6.2.3.4.2.20070117101040.0337f2b8@mail.jpl.nasa.gov>
References: <web-1279309@free.net> <45AE3C29.1090809@scalableinformatics.com>
	<6.2.3.4.2.20070117101040.0337f2b8@mail.jpl.nasa.gov>
Message-ID: <59012.192.168.1.1.1169135002.squirrel@mail.eadline.org>

 snip

>>
>>My understanding of pricing (for the windows portion) is that it
>>adds (as an OS) $500USD to each node.  So for a 32 node machine,
>>this is an extra $16k USD "tax" added on.  Doesn't include the
>>absolutely necessary antivirus, anti-spyware, ...
>
> Probably wouldn't be that expensive, especially if you boot the same
> image on all nodes of the cluster.  Update one, update all.
>
> Site licenses for AV and AS software are heavily discounted from
> retail, as well.
>
> Is this Windows clustering version, too?

Here is a data point.

I just finished writing an article about the Tyan PSC
for Linux magazine. Actually have the demo model sitting
next to me. It has 10 quad-core Xeons, GigE, Infiniband,
a KVM all in small 21" x 14" x 28" (52.7 cm x 35.6 cm x 70 cm)
package (it weighs 150 pounds).

They are selling a optional five node version of WCCS for
$2,345 ($469/node)

About WCCS. I believe MS is concentrating on bundled applications
to sell the WCCS. I think they (and others) see a market for small
single application bundled clusters. The PSC will eventually hold up
to 40 cores and plug into a wall outlet. The OS requirements
for these types of systems is much different than a 128 node
(512P with dual core) multi-user HPC cluster.

i.e. Ask a person in an office setting what version of
Windows is running under Word and you get the idea.

--
Doug

>
>
>
>
>>   Calling all that roughly $4k USD (roughly $125/node), we are
>> looking at something closer to $20k extra per 32 nodes.  So for 128
>> nodes, this adds $80k USD.  For 1024 nodes, this adds $640k USD.
>>
>>My question has been on the CBA side.  What do you get for that
>>extra tax that you don't get now?
>>
>>Microsoft could simply be subsidizing this for SGI.  Others have
>>(cough cough) for them.
>
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit
> http://www.beowulf.org/mailman/listinfo/beowulf
>
> !DSPAM:45ae6756299111446633523!
>


-- 
Doug


From rbw at ahpcrc.org  Thu Jan 18 07:55:02 2007
From: rbw at ahpcrc.org (Richard Walsh)
Date: Thu, 18 Jan 2007 09:55:02 -0600
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <Pine.LNX.4.64.0701180945290.26071@lilith.rgb.private.net>
References: <45ABCF41.3020501@etek.chalmers.se> <45AD57B4.9060106@uiowa.edu>
	<45ADD530.90000@etek.chalmers.se>
	<1169116222.4365.16.camel@localhost.localdomain>
	<45AF817E.7020009@ahpcrc.org>
	<Pine.LNX.4.64.0701180945290.26071@lilith.rgb.private.net>
Message-ID: <45AF9856.70607@ahpcrc.org>

Robert G. Brown wrote:
> On Thu, 18 Jan 2007, Richard Walsh wrote:
>
>> Ashley Pittman wrote:
>>> On Wed, 2007-01-17 at 08:50 +0100, Mikael Fredriksson wrote
>>>> Yes, it is.  And more so if this cluster/LAN can also utilize som type
>>>> of "MOSIX" system.  This will substatially increase the throughput of
>>>> "standard serial" processes.
>>>>
>>>
>>> I find this statement hard to comprehend, how can any OS substantially
>>> improve throughput of jobs unless what it replaces is incredibly
>>> deficient in some way?  The limiting factor on clusters is the speed of
>>> the hardware, even if some OS magically manages to be say 50% more
>>> efficient doing it's bit than another OS it's still only a tiny percent
>>> of time used, substantial improvements in job throughput can only come
>>> about from better parallel algorithms, better code or faster hardware.
>>>
>>>
>>    While I agree with this argument, especially at small scale, at 
>> very large scale operating
>>    system derived load imbalance (so-called skew, due to the random 
>> nature of system
>>    call driven interrupts) can destroy scalability, and thus 
>> efficiency. This is worth mentioning,
>>    although I would not expect Windows to improve on Linux in this 
>> context. You need
>>    a light-weight kernel like Catamount to reduce skew.
>>
>>    There is a very good paper showing the effects of skew at scale by 
>> Kerberyson, et al from
>>    Sandia.
>
> It also isn't the point.  Nobody, and I mean nobody in this universe,
> analyzes and compares WinXX and Linux from a performance point of view.
     Robert that universe includes me ... ;-) ... I was only qualifying 
Ashley's statement about operating
     system impact on HPC cluster efficiency/performance.   The 
Windows/Linux cluster market analysis
     can  continue ...

     rbw

-- 

Richard B. Walsh

"The world is given to me only once, not one existing and one
 perceived. The subject and object are but one."

Erwin Schroedinger

Project Manager
Network Computing Services, Inc.
Army High Performance Computing Research Center (AHPCRC)
rbw at ahpcrc.org  |  612.337.3467

-----------------------------------------------------------------------
This message (including any attachments) may contain proprietary or
privileged information, the use and disclosure of which is legally
restricted.  If you have received this message in error please notify
the sender by reply message, do not otherwise distribute it, and delete
this message, with all of its contents, from your files.
----------------------------------------------------------------------- 


From cap at nsc.liu.se  Thu Jan 18 08:08:24 2007
From: cap at nsc.liu.se (Peter Kjellstrom)
Date: Thu, 18 Jan 2007 17:08:24 +0100
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <45AF817E.7020009@ahpcrc.org>
References: <45ABCF41.3020501@etek.chalmers.se>
	<1169116222.4365.16.camel@localhost.localdomain>
	<45AF817E.7020009@ahpcrc.org>
Message-ID: <200701181708.29205.cap@nsc.liu.se>

On Thursday 18 January 2007 15:17, Richard Walsh wrote:
...
>      While I agree with this argument, especially at small scale, at
> very large scale operating
>      system derived load imbalance (so-called skew, due to the random
> nature of system
>      call driven interrupts) can destroy scalability, and thus
> efficiency.  This is worth mentioning,
>      although I would not expect Windows to improve on Linux in this
> context.  You need
>      a light-weight kernel like Catamount to reduce skew.

Or you could sacrifice some cores and use some kind of cpu isolation strategy. 
This is getting more and more resonable for each new cpu model (you don't 
need to and/or you can't use all cores for your application anyway).

/Peter

...
>      rbw
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070118/3294f773/attachment.sig>

From hahn at physics.mcmaster.ca  Thu Jan 18 08:16:31 2007
From: hahn at physics.mcmaster.ca (Mark Hahn)
Date: Thu, 18 Jan 2007 11:16:31 -0500 (EST)
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <45ADD530.90000@etek.chalmers.se>
References: <45ABCF41.3020501@etek.chalmers.se> <45AD57B4.9060106@uiowa.edu>
	<45ADD530.90000@etek.chalmers.se>
Message-ID: <Pine.LNX.4.64.0701181106300.8989@coffee.psychology.mcmaster.ca>

> Yes, it is.  And more so if this cluster/LAN can also utilize som type
> of "MOSIX" system.  This will substatially increase the throughput of
> "standard serial" processes.

hmm, curious argument - why do you think MOSIX will be more efficient
than a "normal" cluster when confronted with a trivial/serial workload?
is there something about a typical pbs/sge/lsf/etc system that you think
can't handle serial jobs?


From rgb at phy.duke.edu  Thu Jan 18 09:10:05 2007
From: rgb at phy.duke.edu (Robert G. Brown)
Date: Thu, 18 Jan 2007 12:10:05 -0500 (EST)
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <45AF9856.70607@ahpcrc.org>
References: <45ABCF41.3020501@etek.chalmers.se> <45AD57B4.9060106@uiowa.edu>
	<45ADD530.90000@etek.chalmers.se>
	<1169116222.4365.16.camel@localhost.localdomain>
	<45AF817E.7020009@ahpcrc.org>
	<Pine.LNX.4.64.0701180945290.26071@lilith.rgb.private.net>
	<45AF9856.70607@ahpcrc.org>
Message-ID: <Pine.LNX.4.64.0701181137140.27199@lilith.rgb.private.net>

On Thu, 18 Jan 2007, Richard Walsh wrote:

>    Robert that universe includes me ... ;-) ... I was only qualifying 
> Ashley's statement about operating
>    system impact on HPC cluster efficiency/performance.   The Windows/Linux 
> cluster market analysis
>    can  continue ...

Sure, Richard, but you (forgive me) are doubtless a geek.  I am a geek.
95% or more of the participants on this list are geeks, and the 5% that
aren't and are corporate are either geek wannabes or have close friends
or subordinates that are geeks.  There have even been one or two borgs
lurking on the list in years past, but they tend to get sprayed with
toxic substances when they emerge and hence they are usually content to
just lurk while plotting their nefarious plan to perpetuate their World
Domination.

Geeks do not live in the Universe to which I was referring -- the one
where corporate shirts are doing CBA and risk assessments (including
their own personal risk associated with rocking boats to force major
changes) and then making decisions in which they have to compare things
like short run comfort to a POSSIBLE long term ROI should they invest in
a product that they KNOW will a) piss some people off because it doesn't
work for them for their favorite bit of software pie; b) piss other
people off just because they have to change and learn knew things; c)
piss STILL other people off because it doesn't work on their particular
piece of hardware, forcing the company to invest some unspecified dollar
amount in rebuilding things so that they will work.

No, we geeks live in a parallel universe, one where people make rational
decisions based (paradoxically enough) on ideals such as "better
performance", "cheaper", "more scalable", "better design", "more stable"
instead of on more socially aware and concrete things like "more likely
to get me fired when my boss gets pissed off that his system no longer
can run Quickbooks, Turbo Tax, WoW, and the nifty app that synchronizes
his personal favorite calendar program with his PDA", or "more likely to
get me fired when I have to replace all the NICs and GAs in the older
office PCs in order to have linux recognize them", "more likely to get
me fired when it fails to work for all of our cameras in marketing, mp3
players in development, cheap lexmark printers wherever, A2D converters
and labware in the lab..."

There are of course, a few humans that actually manage to do the "Tavern
Between the Worlds" thing and pass freely between the geek Universe and
the corporate Universe, scarfing a beer as they pass through.  You can
usually tell them by their odd costumery when they've recently passed
through the Gate -- those coats and ties clash with all the tee shirts
with penguins on them -- but you can also tell that they truly belong if
you try to hold an intelligent conversation with them.

And finally there are the Borgs, who pass between the Universes but do
so in their Master's Keep, entering through one gate dressed like the
soulless automata that they truly are and emerging from another wearing
double-knit polos with Borg logos and knife-edged khaki pants and
loafers, so they can pretend that they are just "one of the geeks" at
some Geek Faire being held in Geekworld.  If you fall into their
clutches at one of these events they will simply grab at your collar
(they're searching for your non-existent tie or lapels) and intone
"resistance is futile" until somebody comes along and makes fun of them,
causing them to scurry back into the protective embrace of their control
droid at the biggest, most overdone booth at the Faire.  Sometimes The
Borg Himself shows up to deliver a keynote address or otherwise assert
his Geek Roots, but no real geeks are ever seen in his Company (in
either sense of the term) so it cannot be said with certainty if it
really is The Borg or just one of his many clones.

So to clarify, nobody in the Universe under discussion -- the one that
isn't already using linux clusters because of its vast performance and
cost advantages in their mileau -- could care less about performance per
se as long as it is "good enough" and the risk/ROI/CYA equations work
out right.  Places where The Borg already Owns Their Soul, in other
words.

"Resistance is Futile..."

(Gawds, there's bound to be a video game in here somewhere, isn't
there?:-)

   rgb

-- 
Robert G. Brown	                       http://www.phy.duke.edu/~rgb/
Duke University Dept. of Physics, Box 90305
Durham, N.C. 27708-0305
Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu


From James.P.Lux at jpl.nasa.gov  Thu Jan 18 09:37:30 2007
From: James.P.Lux at jpl.nasa.gov (Jim Lux)
Date: Thu, 18 Jan 2007 09:37:30 -0800
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <45AF89EB.40600@uiowa.edu>
References: <45ABCF41.3020501@etek.chalmers.se> <45AD57B4.9060106@uiowa.edu>
	<45ADD530.90000@etek.chalmers.se>
	<1169116222.4365.16.camel@localhost.localdomain>
	<45AF89EB.40600@uiowa.edu>
Message-ID: <6.2.3.4.2.20070118091835.03390e30@mail.jpl.nasa.gov>

At 06:53 AM 1/18/2007, Eric Shook wrote:


>Ashley Pittman wrote:
>>On Wed, 2007-01-17 at 08:50 +0100, Mikael Fredriksson wrote:
>>>Eric Shook wrote:
>>>>I talked to our SGI rep about this yesterday and he told me they 
>>>>are not really targeting "hard-core" university research where 
>>>>Linux/UNIX already has a strong foot hold.  Instead this is for 
>>>>the Business sector where simplified workflows and having easy 
>>>>HPC integration into an already 100% Windows Infrastructure is more appealing.
>>>>
>>>>This was his take and it seemed reasonable to me.
>>>Yes, it is.  And more so if this cluster/LAN can also utilize som type
>>>of "MOSIX" system.  This will substatially increase the throughput of
>>>"standard serial" processes.
>>I find this statement hard to comprehend, how can any OS substantially
>>improve throughput of jobs unless what it replaces is incredibly
>>deficient in some way?  The limiting factor on clusters is the speed of
>>the hardware, even if some OS magically manages to be say 50% more
>>efficient doing it's bit than another OS it's still only a tiny percent
>>of time used, substantial improvements in job throughput can only come
>>about from better parallel algorithms, better code or faster hardware.
>
>
>Actually there are a few case studies floating around comparing 
>Linux to Windows (not sure about UNIX).  That when running on 
>identical hardware and the same code you can lose up to 30% 
>efficiency running on Windows.

Hmmm..  I would imagine that there are not-quite-pathological cases 
where this is true.  Certainly, I would expect this kind of 
differential installing essentially the same application on box stock 
installs of each, just because the stock install of Win tends to have 
a lot of other stuff added in to "enhance the user experience" as 
well as providing a "Windows Genuine Advantage" so that your vast 
music and video collection "Plays for Sure".

On a stripped down WinXP system, I'd not be so sure.  Once you've 
gotten rid of all the little helpful stuff that grabs a cycle here 
and grabs a cycle there (automatic software updates in the background 
are a particuarly annoying case) it's going to be pretty similar.. 
the underlying kernel to do things like disk i/o and start/stop 
processes doesn't consume a huge fraction of the overall CPU or bus 
bandwidth in either Win or Linux, so even if Win's an absolute dog, 
performance wise, the overall impact isn't going to be that 
big.  (And, of course, the kernel in Win isn't all that inefficient.. 
a) it's not that hard to make it "reasonable" and b)it's a worthwhile 
investment for MS to make it decent)

There's a whole literature on tweaking WinXP for realtime performance 
(folks doing things like DSP for radios, audio recording, video 
editing, gaming, etc. have figured all this out), just as there is 
for Linux.    And, that tweaking is essentially a one time 
"installation" sort of effort.  Spend the couple days getting rid of 
unneeded processes and utilities (hmm sort of like customizing your 
init scripts) setting process priorities, etc. and you're in good shape.

Yes, there is ALWAYS the risk that some update goes and sets things 
back, just to be helpful, but, in general, that's pretty rare.

A real time waster for WinXP has to do with network access to remote 
disk drives.. there are pathological cases where it can spend a LOT 
of time apparently spinning in the kernel waiting for a response that 
never comes.  But I've only encountered this on a large heterogeneous 
network (JPLnet can fairly be called large and heterogenous, I 
suppose) with both machines having a lot of oddball network access 
drivers and services installed (e.g. AFS Client, Tivoli, NearSpace, 
Timbuktu, etc.), some of which I am certain are mutually incompatible.

On a smallish (<10 machines) network with me controlling all the 
nodes, I've never had the big kernel waits.


James Lux, P.E.
Spacecraft Radio Frequency Subsystems Group
Flight Communications Systems Section
Jet Propulsion Laboratory, Mail Stop 161-213
4800 Oak Grove Drive
Pasadena CA 91109
tel: (818)354-2075
fax: (818)393-6875 


From mathog at caltech.edu  Thu Jan 18 12:46:25 2007
From: mathog at caltech.edu (David Mathog)
Date: Thu, 18 Jan 2007 12:46:25 -0800
Subject: [Beowulf] Re: SGI to offer Windows on clusters
Message-ID: <E1H7e9R-0000mJ-Bo@mendel.bio.caltech.edu>

> The PSC will eventually hold up
> to 40 cores and plug into a wall outlet. 

That's going to be quite a challenge with 40 Xeon cores.  Even if
they can beat the maximum powerconsumption down to 50W per
core that's still not going to work on a 20A wall outlet unless
there's no other electrical load in the system whatsoever (no disks,
no memory...).

Maybe they use mobile CPUs?

What product did they send you?  Their web site lists only one PSC,
the B2881, which has 8 Opterons.

David Mathog
mathog at caltech.edu
Manager, Sequence Analysis Facility, Biology Division, Caltech


From deadline at eadline.org  Thu Jan 18 13:27:51 2007
From: deadline at eadline.org (Douglas Eadline)
Date: Thu, 18 Jan 2007 16:27:51 -0500 (EST)
Subject: [Beowulf] Re: SGI to offer Windows on clusters
In-Reply-To: <E1H7e9R-0000mJ-Bo@mendel.bio.caltech.edu>
References: <E1H7e9R-0000mJ-Bo@mendel.bio.caltech.edu>
Message-ID: <40025.192.168.1.1.1169155671.squirrel@mail.eadline.org>

>> The PSC will eventually hold up
>> to 40 cores and plug into a wall outlet.
>
> That's going to be quite a challenge with 40 Xeon cores.  Even if
> they can beat the maximum powerconsumption down to 50W per
> core that's still not going to work on a 20A wall outlet unless
> there's no other electrical load in the system whatsoever (no disks,
> no memory...).

I had a pre-release PSC 650 version with quad cores, but
it did not have the low power quads. The PSC 630 uses dual
cores i.e. Dual Intel? Xeon? 5148 2.33GHz/1333Hz FSB LV processor,
MAX. TDP 40 Watts (pasted from the web page)

>
> Maybe they use mobile CPUs?
>
> What product did they send you?  Their web site lists only one PSC,
> the B2881, which has 8 Opterons.

Did you go here:

http://www.tyanpsc.com/

 --
 Doug
>
> !DSPAM:45afdcaa17283122173853!
>


From rbw at ahpcrc.org  Thu Jan 18 13:51:09 2007
From: rbw at ahpcrc.org (Richard Walsh)
Date: Thu, 18 Jan 2007 15:51:09 -0600
Subject: [Beowulf] Re: SGI to offer Windos on clusters ---> Skew/Jitter paper
In-Reply-To: <20070118211424.27F2935A594@mail.scali.no>
References: <20070118211424.27F2935A594@mail.scali.no>
Message-ID: <45AFEBCD.9010007@ahpcrc.org>

H?kon Bugge wrote:
> You wrote:
>> There is a very good paper showing the effects of skew at scale by 
>> Kerberyson, et al from Sandia.
>
> Googling on Kerberyson gave zero hits. Since I am working on a 
> solution for the problem; would be nice to have that paper. You have a 
> reference (or pdf)?
>
     Hakon,

     I am sorry. Spelling error ... ;-) ... I just searched on using the 
authors and correct spelling:

     Kerbyson Petrini Pakin

     And got it.  The title is:

     "The Case of the Missing Supercomputing Performance"

     They also have a nice paper on their model for predicting/modeling 
system
     performance prior to purchase.

     Hope that does it.

     Regards,

     rbw

-- 

Richard B. Walsh

"The world is given to me only once, not one existing and one
 perceived. The subject and object are but one."

Erwin Schroedinger

Project Manager
Network Computing Services, Inc.
Army High Performance Computing Research Center (AHPCRC)
rbw at ahpcrc.org  |  612.337.3467

-----------------------------------------------------------------------
This message (including any attachments) may contain proprietary or
privileged information, the use and disclosure of which is legally
restricted.  If you have received this message in error please notify
the sender by reply message, do not otherwise distribute it, and delete
this message, with all of its contents, from your files.
----------------------------------------------------------------------- 


From mathog at caltech.edu  Thu Jan 18 14:03:14 2007
From: mathog at caltech.edu (David Mathog)
Date: Thu, 18 Jan 2007 14:03:14 -0800
Subject: [Beowulf] Re: SGI to offer Windows on clusters
Message-ID: <E1H7fLm-0000tR-Dl@mendel.bio.caltech.edu>

> Did you go here:
> 
> http://www.tyanpsc.com/

No here:

   http://www.tyan.com/products/html/clusterservers.html

Looks like Tyan has  a few more links to put into their main website.

Regards,

David Mathog
mathog at caltech.edu
Manager, Sequence Analysis Facility, Biology Division, Caltech


From James.P.Lux at jpl.nasa.gov  Thu Jan 18 14:06:30 2007
From: James.P.Lux at jpl.nasa.gov (Jim Lux)
Date: Thu, 18 Jan 2007 14:06:30 -0800
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <Pine.LNX.4.64.0701180945290.26071@lilith.rgb.private.net>
References: <45ABCF41.3020501@etek.chalmers.se> <45AD57B4.9060106@uiowa.edu>
	<45ADD530.90000@etek.chalmers.se>
	<1169116222.4365.16.camel@localhost.localdomain>
	<45AF817E.7020009@ahpcrc.org>
	<Pine.LNX.4.64.0701180945290.26071@lilith.rgb.private.net>
Message-ID: <6.2.3.4.2.20070118135045.0327f388@mail.jpl.nasa.gov>

At 07:42 AM 1/18/2007, Robert G. Brown wrote:
>On Thu, 18 Jan 2007, Richard Walsh wrote:
>
>>Ashley Pittman wrote:
>>>On Wed, 2007-01-17 at 08:50 +0100, Mikael Fredriksson wrote
>>>>Yes, it is.  And more so if this cluster/LAN can also utilize som type
>>>>of "MOSIX" system.  This will substatially increase the throughput of
>>>>"standard serial" processes.
>><snip>
>
>It also isn't the point.  Nobody, and I mean nobody in this universe,
>analyzes and compares WinXX and Linux from a performance point of view.

Well, at least not in a rigorous way.  People compare Win and Linux 
performance all the time, but it's never a real head to head 
comparison with comparable and controlled environments and configurations.

However, that's the whole "benchmarking magic" argument...

Stability is important to sales, especially in a server environment and
>as I previously noted pro-grade systems people can make a WinXX Server
>setup sufficiently stable for production purposes.

And likewise, WinXP on the desktop.  A company with 20,000 WinXP 
desktops cannot tolerate BSODs and mystery hangs on a significant 
fraction of those desktops at any frequency.  When your call center 
operators are being timed to the second, the sysadmin folks know 
INSTANTLY when there are problems.

But, just as in the server application, the configurations are 
rigorously controlled and tested.  It's certainly not the usual home 
computer with umpty-five downloaded widgets, etc.


>Linux remains around the 1% level in US desktop occupancy, and even in
>the pacific rim where its numbers are the best it only makes it to 3% or
>so (the rest are doubtless mostly bootleg WinXX).  The largest monopoly
>ever to exist in the history of the world laughs at these numbers.  On
>the broader server market Linux fares better, but it is still very much
>David against Goliath where David may appear sometimes to be winning,
>but Goliath has yet to be hit in the head with any kind of stone.

More like grains of sand being dribbled about the feet.


>At this point, with massive investment in MS stock on the part of
>corporate retirement and pension accounts, hitting MS in the head with a
>killer stone would probably trigger a nationwide panic or even a
>depression.  It could still happen, but I really expect that killing
>Goliath may take years of nibbling at his heels and not a single blow,
>with Goliath fighting back and changing form all the way to try to avoid
>his fate.

And, bizarre as it may seem, Goliath might change into a service 
provider via webservices running on some sort of serverfarm which 
could conceivably be anything.  Imagine.. paying a nickle per 
document page to use the WebServices version of MSWord with whatever 
browser you care to use.  Hey, we're willing to pay that for a 
photocopier, and it doesn't even do spell checking.


>Basically, MS's cluster product is almost certainly designed to do two
>things.  One is provide them with a credible presence in the cluster
>market not because it is particularly important to them as a profit
>center but because hurting linux and the other unices strengthens their
>position in the general server market in many ways.  They do not want
><snip>

I think it's also to support the turnkey software vendors who need a 
platform with more compute crunch for their existing Windows 
application.  Think finite element models of one kind or another.  If 
your application costs $50K/seat, a kilobuck or two for an OS isn't a big deal.


> From what I can see, there are various things gradually lowering the
>barrier between linux development and Windows development -- making it
>easier to port Windows code directly to linux with a recompile, making
>it easier to run Windows code directly within linux without a Windows
>license.

Mind you, MS does make a moving target here, and not necessarily to 
make it hard to do this, but just because they choose the way they 
want to go for their own interests. Kind of like fleas on a dog that 
decides to roll over.

>   Wine/cedega, vmware, win4lin and others on the one hand,
>cross-architecture development libraries on the other hand.  Microsoft
>has a strong interest in maintaining those barriers and doubtless moves
>things around to keep code porting difficult (compare how easy it is to
>move code between unices to how difficult it is to move between linux
>and MS, and how much expense that adds to the task of maintaining a code
>base in both worlds).

But the difficulty of moving between Windows and *nix compared to 
among *nix is more like the difference between translating between 
German and English vs translating between English dialects.  There's 
common roots, and the grammar is similar, for the former, but the 
latter is mostly a matter of vocabulary and pronounciation.

It is further complicated by the fact that unlike in the 
language/dialect case, the two OSes (and their corresponding 
development models and environments) are both changing, windows more 
rapidly than *nix for the most part.


>So it isn't about performance, it is about presence.  SGI is a big, high
>visibility deal and provides them with FUD-war ammunition, a way to slow
>the bleed of server sites to linux, and which will almost certainly pay
>for itself even if it is little more than a batch queue program with a
>nice GUI and perhaps a STABLE API -- something linux could certainly
>use.

Jim Lux 


From ashley at quadrics.com  Thu Jan 18 14:29:28 2007
From: ashley at quadrics.com (ashley at quadrics.com)
Date: Thu, 18 Jan 2007 22:29:28 -0000
Subject: [Beowulf] Re: SGI to offer Windos on clusters ---> Skew/Jitter
	paper
References: <20070118211424.27F2935A594@mail.scali.no>
	<45AFEBCD.9010007@ahpcrc.org>
Message-ID: <30062B7EA51A9045B9F605FAAC1B4F621A54FE@exch01.quadrics.com>

>     And got it.  The title is:
>
>    "The Case of the Missing Supercomputing Performance"

I wondered if you were talking about that paper but it's from lanl not sandia, it should be essential reading for everyone working with large clusters.

Ashley,


From laytonjb at charter.net  Thu Jan 18 15:21:39 2007
From: laytonjb at charter.net (laytonjb at charter.net)
Date: Thu, 18 Jan 2007 15:21:39 -0800
Subject: [Beowulf] Re: SGI to offer Windows on clusters
Message-ID: <1752357766.1169162499546.JavaMail.root@fepweb14>

---- David Mathog <mathog at caltech.edu> wrote: 
> > Did you go here:
> > 
> > http://www.tyanpsc.com/
> 
> No here:
> 
>    http://www.tyan.com/products/html/clusterservers.html
> 
> Looks like Tyan has  a few more links to put into their main website.

Remember that the PSC on the Tyan website is the first generation PSC. With
this new version, they created a new "company" called TyanPSC to sell, market,
and I think support it. But it would be a good idea to put a big link on their
main page to www.tyanpsc.com.

Jeff


From James.P.Lux at jpl.nasa.gov  Thu Jan 18 15:33:20 2007
From: James.P.Lux at jpl.nasa.gov (Jim Lux)
Date: Thu, 18 Jan 2007 15:33:20 -0800
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <Pine.LNX.4.64.0701181137140.27199@lilith.rgb.private.net>
References: <45ABCF41.3020501@etek.chalmers.se> <45AD57B4.9060106@uiowa.edu>
	<45ADD530.90000@etek.chalmers.se>
	<1169116222.4365.16.camel@localhost.localdomain>
	<45AF817E.7020009@ahpcrc.org>
	<Pine.LNX.4.64.0701180945290.26071@lilith.rgb.private.net>
	<45AF9856.70607@ahpcrc.org>
	<Pine.LNX.4.64.0701181137140.27199@lilith.rgb.private.net>
Message-ID: <6.2.3.4.2.20070118151702.032b8440@mail.jpl.nasa.gov>

At 09:10 AM 1/18/2007, Robert G. Brown wrote:
... while plotting their nefarious plan to perpetuate their World
>Domination.

Is there a higher calling than that?


>Geeks do not live in the Universe to which I was referring -- the one
>where corporate shirts are doing CBA and risk assessments...

Sure they do.. they're kept in carpeted cages where the damage they 
can do is minimized.

>No, we geeks live in a parallel universe, one where people make rational
>decisions based (paradoxically enough) on ideals such as "better
>performance", "cheaper", "more scalable", "better design", "more stable"

Of course, that corporate CBA folk might say they are doing exactly 
the same thing, just that their metric for performance or cheaper 
isn't defined in the same way as yours, for environmental reasons. If 
I want to keep my office at a comfortable temperature, the equation 
relating actual temperature in objective terms to "comfort factor" 
depends a lot on what my attire is.


>There are of course, a few humans that actually manage to do the "Tavern
>Between the Worlds" thing and pass freely between the geek Universe and
>the corporate Universe, scarfing a beer

is that beer free?

>  as they pass through.  You can
>usually tell them by their odd costumery when they've recently passed
>through the Gate -- those coats and ties clash with all the tee shirts
>with penguins on them -- but you can also tell that they truly belong if
>you try to hold an intelligent conversation with them.
>
>And finally there are the Borgs, who pass between the Universes but do
>so in their Master's Keep, entering through one gate dressed like the
>soulless automata that they truly are and emerging from another wearing
>double-knit polos with Borg logos and knife-edged khaki pants and
>loafers,

But, now, through the miracle of online ordering and automated 
manufacturing, YOU too can order such things from Land's end and have 
them delivered to your doorstep in days.  Then, you could enter their 
world (should you consider it useful) in a form of disguise.  You can 
even have your favorite logo embroidered in the appropriate 
place.  (it's called "business outfitter")  (e.g. Slackware shirts 
are done by Lands End)

( I note that Lands End has hired unix sysadmins and Java folk in the 
past, so procuring the concealing attire might even be supporting a 
geek) (Of course, if you search for Linux at the Lands End site,you 
get a page with 6 women's swimsuits.. odd, but who knows what 
peculiarities lurk within the search engine)


>  so they can pretend that they are just "one of the geeks" at
>some Geek Faire being held in Geekworld.  If you fall into their
>clutches at one of these events they will simply grab at your collar
>(they're searching for your non-existent tie or lapels) and intone
>"resistance is futile" until somebody comes along and makes fun of them,
>
>So to clarify, nobody in the Universe under discussion -- the one that
>isn't already using linux clusters because of its vast performance and
>cost advantages in their mileau -- could care less about performance per
>se as long as it is "good enough" and the risk/ROI/CYA equations work
>out right.  Places where The Borg already Owns Their Soul, in other
>words.
>
>"Resistance is Futile..."
>
>(Gawds, there's bound to be a video game in here somewhere, isn't
>there?:-)


No, but there is a "uniform look with variety and consistency"...

Jim


From rgb at phy.duke.edu  Thu Jan 18 16:41:31 2007
From: rgb at phy.duke.edu (Robert G. Brown)
Date: Thu, 18 Jan 2007 19:41:31 -0500 (EST)
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <6.2.3.4.2.20070118135045.0327f388@mail.jpl.nasa.gov>
References: <45ABCF41.3020501@etek.chalmers.se> <45AD57B4.9060106@uiowa.edu>
	<45ADD530.90000@etek.chalmers.se>
	<1169116222.4365.16.camel@localhost.localdomain>
	<45AF817E.7020009@ahpcrc.org>
	<Pine.LNX.4.64.0701180945290.26071@lilith.rgb.private.net>
	<6.2.3.4.2.20070118135045.0327f388@mail.jpl.nasa.gov>
Message-ID: <Pine.LNX.4.64.0701181905490.29882@lilith.rgb.private.net>

On Thu, 18 Jan 2007, Jim Lux wrote:

> And likewise, WinXP on the desktop.  A company with 20,000 WinXP desktops 
> cannot tolerate BSODs and mystery hangs on a significant fraction of those 
> desktops at any frequency.  When your call center operators are being timed 
> to the second, the sysadmin folks know INSTANTLY when there are problems.
>
> But, just as in the server application, the configurations are rigorously 
> controlled and tested.  It's certainly not the usual home computer with 
> umpty-five downloaded widgets, etc.

Even with strong controls and an instantly reinstallable system image,
WinXX boxes are corrupted once a month or so in our labs.  Too many
things that can go wrong.  Fortunately, they've dropped the
reinstallation time to almost nothing.

>> Linux remains around the 1% level in US desktop occupancy, and even in
>> the pacific rim where its numbers are the best it only makes it to 3% or
>> so (the rest are doubtless mostly bootleg WinXX).  The largest monopoly
>> ever to exist in the history of the world laughs at these numbers.  On
>> the broader server market Linux fares better, but it is still very much
>> David against Goliath where David may appear sometimes to be winning,
>> but Goliath has yet to be hit in the head with any kind of stone.
>
> More like grains of sand being dribbled about the feet.

Well 1% of a person's height would TECHNICALLY be somewhere between 1
and 2 cm.  And in the server arena, it would be quite a bit higher.
Say, small pebbles to just the kind of rocks one can turn an ankle
on...;-)

I should have used a different metaphor, though.  Microsoft so far has
been to Linux like Fezzik was with Westley in The Princess Bride,
tolerating its occassional blows.  "I just want you to feel you're doing
well.  I have for people to die embarrassed..."

>
>> Basically, MS's cluster product is almost certainly designed to do two
>> things.  One is provide them with a credible presence in the cluster
>> market not because it is particularly important to them as a profit
>> center but because hurting linux and the other unices strengthens their
>> position in the general server market in many ways.  They do not want
>> <snip>
>
> I think it's also to support the turnkey software vendors who need a platform 
> with more compute crunch for their existing Windows application.  Think 
> finite element models of one kind or another.  If your application costs 
> $50K/seat, a kilobuck or two for an OS isn't a big deal.

As I said, they might want to make money, sure, and although they MIGHT
do it at a loss if they thought it was important enough they certainly
would rather not.  I think they'll make money, but probably not a lot.
It's almost like developing a new business, and there is plenty of
competition even though of course they'll exploit their advantages where
they help.

    rgb

-- 
Robert G. Brown	                       http://www.phy.duke.edu/~rgb/
Duke University Dept. of Physics, Box 90305
Durham, N.C. 27708-0305
Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu


From hunting at ix.netcom.com  Thu Jan 18 21:14:38 2007
From: hunting at ix.netcom.com (Michael Huntingdon)
Date: Thu, 18 Jan 2007 21:14:38 -0800
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <Pine.LNX.4.64.0701181905490.29882@lilith.rgb.private.net>
References: <45ABCF41.3020501@etek.chalmers.se> <45AD57B4.9060106@uiowa.edu>
	<45ADD530.90000@etek.chalmers.se>
	<1169116222.4365.16.camel@localhost.localdomain>
	<45AF817E.7020009@ahpcrc.org>
	<Pine.LNX.4.64.0701180945290.26071@lilith.rgb.private.net>
	<6.2.3.4.2.20070118135045.0327f388@mail.jpl.nasa.gov>
	<Pine.LNX.4.64.0701181905490.29882@lilith.rgb.private.net>
Message-ID: <7.0.1.0.2.20070118205401.02376e60@ix.netcom.com>

At 04:41 PM 1/18/2007, Robert G. Brown wrote:
>On Thu, 18 Jan 2007, Jim Lux wrote:
>
>>And likewise, WinXP on the desktop.  A company with 20,000 WinXP 
>>desktops cannot tolerate BSODs and mystery hangs on a significant 
>>fraction of those desktops at any frequency.  When your call center 
>>operators are being timed to the second, the sysadmin folks know 
>>INSTANTLY when there are problems.
>>
>>But, just as in the server application, the configurations are 
>>rigorously controlled and tested.  It's certainly not the usual 
>>home computer with umpty-five downloaded widgets, etc.
>
>Even with strong controls and an instantly reinstallable system image,
>WinXX boxes are corrupted once a month or so in our labs.  Too many
>things that can go wrong.  Fortunately, they've dropped the
>reinstallation time to almost nothing.
>
>>>Linux remains around the 1% level in US desktop occupancy, and even in
>>>the pacific rim where its numbers are the best it only makes it to 3% or
>>>so (the rest are doubtless mostly bootleg WinXX).  The largest monopoly
>>>ever to exist in the history of the world laughs at these numbers.  On
>>>the broader server market Linux fares better, but it is still very much
>>>David against Goliath where David may appear sometimes to be winning,
>>>but Goliath has yet to be hit in the head with any kind of stone.
>>
>>More like grains of sand being dribbled about the feet.
>
>Well 1% of a person's height would TECHNICALLY be somewhere between 1
>and 2 cm.  And in the server arena, it would be quite a bit higher.
>Say, small pebbles to just the kind of rocks one can turn an ankle
>on...;-)
>
>I should have used a different metaphor, though.  Microsoft so far has
>been to Linux like Fezzik was with Westley in The Princess Bride,
>tolerating its occassional blows.  "I just want you to feel you're doing
>well.  I have for people to die embarrassed..."
>
>>
>>>Basically, MS's cluster product is almost certainly designed to do two
>>>things.  One is provide them with a credible presence in the cluster
>>>market not because it is particularly important to them as a profit
>>>center but because hurting linux and the other unices strengthens their
>>>position in the general server market in many ways.  They do not want
>>><snip>

I dunno. I found print somewhere reflecting the overall HPC market 
for 2007 looking like $11.4B growing at nearly 10% per year thru 
2010, with departmental and workgroup clusters estimated at 6.6B. 
Completely aside from putting a dent in the linux armour, I would 
expect that someone within MS had a huge epiphany in terms of how 
they might be able to help dice up that $6.6B. You can bet they've 
been working with hundreds of significant ISV's and they'll be close 
to their projected ROI, whether it's a better mousetrap or not.

>>I think it's also to support the turnkey software vendors who need 
>>a platform with more compute crunch for their existing Windows 
>>application.  Think finite element models of one kind or 
>>another.  If your application costs $50K/seat, a kilobuck or two 
>>for an OS isn't a big deal.
>
>As I said, they might want to make money, sure, and although they MIGHT
>do it at a loss if they thought it was important enough they certainly
>would rather not.  I think they'll make money, but probably not a lot.
>It's almost like developing a new business, and there is plenty of
>competition even though of course they'll exploit their advantages where
>they help.
>
>    rgb
>
>--
>Robert G. Brown                        http://www.phy.duke.edu/~rgb/
>Duke University Dept. of Physics, Box 90305
>Durham, N.C. 27708-0305
>Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu
>
>
>_______________________________________________
>Beowulf mailing list, Beowulf at beowulf.org
>To change your subscription (digest mode or unsubscribe) visit 
>http://www.beowulf.org/mailman/listinfo/beowulf


From hunting at ix.netcom.com  Thu Jan 18 21:31:37 2007
From: hunting at ix.netcom.com (Michael Huntingdon)
Date: Thu, 18 Jan 2007 21:31:37 -0800
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <Pine.LNX.4.64.0701181905490.29882@lilith.rgb.private.net>
References: <45ABCF41.3020501@etek.chalmers.se> <45AD57B4.9060106@uiowa.edu>
	<45ADD530.90000@etek.chalmers.se>
	<1169116222.4365.16.camel@localhost.localdomain>
	<45AF817E.7020009@ahpcrc.org>
	<Pine.LNX.4.64.0701180945290.26071@lilith.rgb.private.net>
	<6.2.3.4.2.20070118135045.0327f388@mail.jpl.nasa.gov>
	<Pine.LNX.4.64.0701181905490.29882@lilith.rgb.private.net>
Message-ID: <7.0.1.0.2.20070118212800.02504318@ix.netcom.com>

HP's CCS announcement from early 11/06:
http://www.hp.com/hpinfo/newsroom/press/2006/061106a.html?jumpid=reg_R1002_USEN

michael


At 04:41 PM 1/18/2007, Robert G. Brown wrote:
>On Thu, 18 Jan 2007, Jim Lux wrote:
>
>>And likewise, WinXP on the desktop.  A company with 20,000 WinXP 
>>desktops cannot tolerate BSODs and mystery hangs on a significant 
>>fraction of those desktops at any frequency.  When your call center 
>>operators are being timed to the second, the sysadmin folks know 
>>INSTANTLY when there are problems.
>>
>>But, just as in the server application, the configurations are 
>>rigorously controlled and tested.  It's certainly not the usual 
>>home computer with umpty-five downloaded widgets, etc.
>
>Even with strong controls and an instantly reinstallable system image,
>WinXX boxes are corrupted once a month or so in our labs.  Too many
>things that can go wrong.  Fortunately, they've dropped the
>reinstallation time to almost nothing.
>
>>>Linux remains around the 1% level in US desktop occupancy, and even in
>>>the pacific rim where its numbers are the best it only makes it to 3% or
>>>so (the rest are doubtless mostly bootleg WinXX).  The largest monopoly
>>>ever to exist in the history of the world laughs at these numbers.  On
>>>the broader server market Linux fares better, but it is still very much
>>>David against Goliath where David may appear sometimes to be winning,
>>>but Goliath has yet to be hit in the head with any kind of stone.
>>
>>More like grains of sand being dribbled about the feet.
>
>Well 1% of a person's height would TECHNICALLY be somewhere between 1
>and 2 cm.  And in the server arena, it would be quite a bit higher.
>Say, small pebbles to just the kind of rocks one can turn an ankle
>on...;-)
>
>I should have used a different metaphor, though.  Microsoft so far has
>been to Linux like Fezzik was with Westley in The Princess Bride,
>tolerating its occassional blows.  "I just want you to feel you're doing
>well.  I have for people to die embarrassed..."
>
>>
>>>Basically, MS's cluster product is almost certainly designed to do two
>>>things.  One is provide them with a credible presence in the cluster
>>>market not because it is particularly important to them as a profit
>>>center but because hurting linux and the other unices strengthens their
>>>position in the general server market in many ways.  They do not want
>>><snip>
>>
>>I think it's also to support the turnkey software vendors who need 
>>a platform with more compute crunch for their existing Windows 
>>application.  Think finite element models of one kind or 
>>another.  If your application costs $50K/seat, a kilobuck or two 
>>for an OS isn't a big deal.
>
>As I said, they might want to make money, sure, and although they MIGHT
>do it at a loss if they thought it was important enough they certainly
>would rather not.  I think they'll make money, but probably not a lot.
>It's almost like developing a new business, and there is plenty of
>competition even though of course they'll exploit their advantages where
>they help.
>
>    rgb
>
>--
>Robert G. Brown                        http://www.phy.duke.edu/~rgb/
>Duke University Dept. of Physics, Box 90305
>Durham, N.C. 27708-0305
>Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu
>
>
>_______________________________________________
>Beowulf mailing list, Beowulf at beowulf.org
>To change your subscription (digest mode or unsubscribe) visit 
>http://www.beowulf.org/mailman/listinfo/beowulf


From James.P.Lux at jpl.nasa.gov  Thu Jan 18 21:31:07 2007
From: James.P.Lux at jpl.nasa.gov (Jim Lux)
Date: Thu, 18 Jan 2007 21:31:07 -0800
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <Pine.LNX.4.64.0701181905490.29882@lilith.rgb.private.net>
References: <45ABCF41.3020501@etek.chalmers.se> <45AD57B4.9060106@uiowa.edu>
	<45ADD530.90000@etek.chalmers.se>
	<1169116222.4365.16.camel@localhost.localdomain>
	<45AF817E.7020009@ahpcrc.org>
	<Pine.LNX.4.64.0701180945290.26071@lilith.rgb.private.net>
	<6.2.3.4.2.20070118135045.0327f388@mail.jpl.nasa.gov>
	<Pine.LNX.4.64.0701181905490.29882@lilith.rgb.private.net>
Message-ID: <6.2.3.4.2.20070118212454.032a5ed0@mail.jpl.nasa.gov>

At 04:41 PM 1/18/2007, Robert G. Brown wrote:
>On Thu, 18 Jan 2007, Jim Lux wrote:
>
>>And likewise, WinXP on the desktop.  A company with 20,000 WinXP 
>>desktops cannot tolerate BSODs and mystery hangs on a significant 
>>fraction of those desktops at any frequency.  When your call center 
>>operators are being timed to the second, the sysadmin folks know 
>>INSTANTLY when there are problems.
>>
>>But, just as in the server application, the configurations are 
>>rigorously controlled and tested.  It's certainly not the usual 
>>home computer with umpty-five downloaded widgets, etc.
>
>Even with strong controls and an instantly reinstallable system image,
>WinXX boxes are corrupted once a month or so in our labs.

But you've got those pesky students to deal with.  Not like a 
corporate environment where everyone boots off the same image from 
the server, they run SMS, and if you muck with the configuration, you 
can get fired.

>Too many
>things that can go wrong.  Fortunately, they've dropped the
>reinstallation time to almost nothing.

Precisely..

I should have used a different metaphor, though.  Microsoft so far has
>been to Linux like Fezzik was with Westley in The Princess Bride,
>tolerating its occassional blows.  "I just want you to feel you're doing
>well.  I have for people to die embarrassed..."

In an interesting coincidence of references, my wife and daughter's 
horse is named "The dread pirate Roberts", the reference to which I 
have found is almost as dating as a former competitor of mine(in 
horse shows, not in engineering) who named her horse "E-Ticket", said 
tickets not having existed for decades. We all knew what it meant, 
but the 12 year olds hanging around the barn didn't. (Of course, they 
didn't understand who "The Stones" were, either.  Such is life in codgerdom)


Jim 


From mike at etek.chalmers.se  Thu Jan 18 22:10:19 2007
From: mike at etek.chalmers.se (Mikael Fredriksson)
Date: Fri, 19 Jan 2007 07:10:19 +0100
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <Pine.LNX.4.64.0701181106300.8989@coffee.psychology.mcmaster.ca>
References: <45ABCF41.3020501@etek.chalmers.se> <45AD57B4.9060106@uiowa.edu>
	<45ADD530.90000@etek.chalmers.se>
	<Pine.LNX.4.64.0701181106300.8989@coffee.psychology.mcmaster.ca>
Message-ID: <45B060CB.1010904@etek.chalmers.se>

Mark Hahn wrote:
>> Yes, it is.  And more so if this cluster/LAN can also utilize som type
>> of "MOSIX" system.  This will substatially increase the throughput of
>> "standard serial" processes.
> 
> 
> hmm, curious argument - why do you think MOSIX will be more efficient
> than a "normal" cluster when confronted with a trivial/serial workload?
> is there something about a typical pbs/sge/lsf/etc system that you think
> can't handle serial jobs?

I probbably expressed my self i a bad way.  What i mean is that with a 
MOSIX extension you can start a large bunch of "serial" processes on one 
node and these will then migrate in a balanced way in the cluster.  A 
MOSIX extension is transparent for the other parallell software.  I hope 
this link will clarify my point:

http://howto.x-tend.be/openMosixWiki/index.php/FAQ

MF


From greg.lindahl at qlogic.com  Thu Jan 18 22:41:41 2007
From: greg.lindahl at qlogic.com (Greg Lindahl)
Date: Thu, 18 Jan 2007 22:41:41 -0800
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <6.2.3.4.2.20070118212454.032a5ed0@mail.jpl.nasa.gov>
References: <45ABCF41.3020501@etek.chalmers.se> <45AD57B4.9060106@uiowa.edu>
	<45ADD530.90000@etek.chalmers.se>
	<1169116222.4365.16.camel@localhost.localdomain>
	<45AF817E.7020009@ahpcrc.org>
	<Pine.LNX.4.64.0701180945290.26071@lilith.rgb.private.net>
	<6.2.3.4.2.20070118135045.0327f388@mail.jpl.nasa.gov>
	<Pine.LNX.4.64.0701181905490.29882@lilith.rgb.private.net>
	<6.2.3.4.2.20070118212454.032a5ed0@mail.jpl.nasa.gov>
Message-ID: <20070119064141.GA5431@localhost.localdomain>

On Thu, Jan 18, 2007 at 09:31:07PM -0800, Jim Lux wrote:

> In an interesting coincidence of references, my wife and daughter's 
> horse is named "The dread pirate Roberts", the reference to which I 
> have found is almost as dating as a former competitor of mine

... you just hang out with the wrong people. The Princess Bride is
required viewing for SCAdians. And the book is even better than the
movie.

-- greg


From kball at pathscale.com  Fri Jan 19 10:36:18 2007
From: kball at pathscale.com (Kevin Ball)
Date: Fri, 19 Jan 2007 10:36:18 -0800
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <6.2.3.4.2.20070118212454.032a5ed0@mail.jpl.nasa.gov>
References: <45ABCF41.3020501@etek.chalmers.se> <45AD57B4.9060106@uiowa.edu>
	<45ADD530.90000@etek.chalmers.se>
	<1169116222.4365.16.camel@localhost.localdomain>
	<45AF817E.7020009@ahpcrc.org>
	<Pine.LNX.4.64.0701180945290.26071@lilith.rgb.private.net>
	<6.2.3.4.2.20070118135045.0327f388@mail.jpl.nasa.gov>
	<Pine.LNX.4.64.0701181905490.29882@lilith.rgb.private.net>
	<6.2.3.4.2.20070118212454.032a5ed0@mail.jpl.nasa.gov>
Message-ID: <1169231778.31830.2.camel@ammonite>

On Thu, 2007-01-18 at 21:31, Jim Lux wrote:
> At 04:41 PM 1/18/2007, Robert G. Brown wrote:
> >On Thu, 18 Jan 2007, Jim Lux wrote:
> >
> >>And likewise, WinXP on the desktop.  A company with 20,000 WinXP 
> >>desktops cannot tolerate BSODs and mystery hangs on a significant 
> >>fraction of those desktops at any frequency.  When your call center 
> >>operators are being timed to the second, the sysadmin folks know 
> >>INSTANTLY when there are problems.
> >>
> >>But, just as in the server application, the configurations are 
> >>rigorously controlled and tested.  It's certainly not the usual 
> >>home computer with umpty-five downloaded widgets, etc.
> >
> >Even with strong controls and an instantly reinstallable system image,
> >WinXX boxes are corrupted once a month or so in our labs.
> 
> But you've got those pesky students to deal with.  Not like a 
> corporate environment where everyone boots off the same image from 
> the server, they run SMS, and if you muck with the configuration, you 
> can get fired.
> 
> >Too many
> >things that can go wrong.  Fortunately, they've dropped the
> >reinstallation time to almost nothing.
> 
> Precisely..
> 
> I should have used a different metaphor, though.  Microsoft so far has
> >been to Linux like Fezzik was with Westley in The Princess Bride,
> >tolerating its occassional blows.  "I just want you to feel you're doing
> >well.  I have for people to die embarrassed..."
> 
> In an interesting coincidence of references, my wife and daughter's 
> horse is named "The dread pirate Roberts", the reference to which I 
> have found is almost as dating as a former competitor of mine(in 
> horse shows, not in engineering) who named her horse "E-Ticket", said 
> tickets not having existed for decades. We all knew what it meant, 
> but the 12 year olds hanging around the barn didn't. (Of course, they 
> didn't understand who "The Stones" were, either.  Such is life in codgerdom)

I'm not sure how dating the princess bride is;  my younger brother is
still in college, but we both grew up with that movie!  'E-Ticket'
though?  Man!

-Kevin


> 
> 
> 
> Jim 
> 
> 
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf


From ryanw at windows.microsoft.com  Thu Jan 18 18:19:50 2007
From: ryanw at windows.microsoft.com (Ryan Waite)
Date: Thu, 18 Jan 2007 18:19:50 -0800
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <45AE6BEE.9050001@scalableinformatics.com>
References: <web-1279309@free.net>
	<45AE3C29.1090809@scalableinformatics.com><6.2.3.4.2.20070117101040.0337f2b8@mail.jpl.nasa.gov>
	<45AE6BEE.9050001@scalableinformatics.com>
Message-ID: <74735BF202608043B11025A9FAA9438904662308@WIN-MSG-21.wingroup.windeploy.ntdev.microsoft.com>

I know some of you aren't, um, tolerant of Microsoft for various reasons
but I thought I'd clear up a couple errors in some of the posts. If you
hate Microsoft at least you now have an email address for when you're
feeling grumpy.


Pricing

Retail pricing for Windows Server is about $750. Retail pricing for
Compute Cluster Server (CCS) is around $470. Most users will get the
product through either an OEM or a volume licensing agreement. In both
cases they pay less than retail. Academic users can purchase CCS for
less than $100.

CCS is comprised of two CDs. The first is Windows Server. The second CD
contains the clustering tools. The second CD has three major features:
1) a job scheduler, 2) systems management tools, and 3) Microsoft's MPI
stack. The majority of HPC systems sold are small (less than 256 nodes)
and we've designed for those customers. So, users get an OS, job
scheduler, management package, and MPI stack for < $500.

Our MPI stack is based on MPICH2 but we've made performance and security
enhancements. The folks at ANL are very talented UNIX developers but
Windows is more efficient using async overlapped I/O. We've made other,
similar changes to our stack and we're providing those changes back to
ANL for incorporation in future MPICH stacks. We're also the first group
at Microsoft making these kinds of sizable contributions back to the
open source community.


SGI

These folks are great and I'm sure they have a lot to teach from their
years in HPC. Also, we've hired people onto our HPC team from places
like Platform Computing, Cray, Silverstorm and other related companies.
While we may be new, and while v1 products may be a little rough, I
think we're going to help the community bring HPC into mainstream
computing.


Thanks,
Ryan Waite
Group Program Manager, HPC
Microsoft

-----Original Message-----
From: beowulf-bounces at beowulf.org [mailto:beowulf-bounces at beowulf.org]
On Behalf Of Joe Landman
Sent: Wednesday, January 17, 2007 10:33 AM
To: Jim Lux
Cc: Mikhail Kuzminsky; Beowulf at beowulf.org; mike at etek.chalmers.se
Subject: Re: [Beowulf] SGI to offer Windows on clusters

Jim Lux wrote:

>> My understanding of pricing (for the windows portion) is that it adds

>> (as an OS) $500USD to each node.  So for a 32 node machine, this is
an 
>> extra $16k USD "tax" added on.  Doesn't include the absolutely 
>> necessary antivirus, anti-spyware, ...
> 
> Probably wouldn't be that expensive, especially if you boot the same 

It is.  List is 479$US per compute node.

> image on all nodes of the cluster.  Update one, update all.

I agree.  If it were 479$ per cluster (of any size, sorta like you can 
do with Linux for $0 per cluster), this would be interesting.

> Site licenses for AV and AS software are heavily discounted from
retail, 
> as well.

$125 per node is guess.  Even if it is $60, or $20.  Basic idea is the
same.

Costs scale per node for this path.  It adds cost.  Whether or not the 
benefit one derives from these costs is worth it is important, as is 
whether or not the benefit one derives from the alternatives are higher 
or lower.

> Is this Windows clustering version, too?

Yes.


-- 

Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics LLC,
email: landman at scalableinformatics.com
web  : http://www.scalableinformatics.com
phone: +1 734 786 8423
fax  : +1 734 786 8452 or +1 866 888 3112
cell : +1 734 612 4615

_______________________________________________
Beowulf mailing list, Beowulf at beowulf.org
To change your subscription (digest mode or unsubscribe) visit
http://www.beowulf.org/mailman/listinfo/beowulf


From mkamranmustafa at gmail.com  Thu Jan 18 20:10:55 2007
From: mkamranmustafa at gmail.com (=?UTF-8?B?4omIz4Fz0YfComjHv+KJiA==?=)
Date: Fri, 19 Jan 2007 09:10:55 +0500
Subject: [Beowulf] Linpack Results...!
Message-ID: <e9da936f0701182010k7cb98dd3k3afc92cc23dc05dc@mail.gmail.com>

Hi All,

I want to calculate the speed or the peak performance of my 50 nodes Linux
based Beowulf cluster. I have successfully configured Linpack over the
Cluster but when I run the following command, the output that I get is very
lengthy and also not understandable to me:

prompt]    *mpirun -np 4 xhpl*
**
As you know, HPL.dat is the input file but I dont know how to make
ammendments in this file to find out the speed of my cluster. Kindly help.

Regards,

Kamran
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070119/9a9aa1e4/attachment.html>

From hahn at mcmaster.ca  Thu Jan 18 20:12:49 2007
From: hahn at mcmaster.ca (Mark Hahn)
Date: Thu, 18 Jan 2007 23:12:49 -0500 (EST)
Subject: [Beowulf] Re: SGI to offer Windos on clusters ---> Skew/Jitter
	paper
In-Reply-To: <30062B7EA51A9045B9F605FAAC1B4F621A54FE@exch01.quadrics.com>
References: <20070118211424.27F2935A594@mail.scali.no>
	<45AFEBCD.9010007@ahpcrc.org>
	<30062B7EA51A9045B9F605FAAC1B4F621A54FE@exch01.quadrics.com>
Message-ID: <Pine.LNX.4.64.0701182242350.16566@coffee.psychology.mcmaster.ca>

>>    "The Case of the Missing Supercomputing Performance"
>
> I wondered if you were talking about that paper but it's from lanl not sandia, it should be essential reading for everyone working with large clusters.

I love this paper.  but it's critical to realize that it's all about
very large, very tightly-coupled, frequent-global-collective-using
applications.  you could easily have a 2k-node cluster (I'd call it large)
dedicated to 1-to-100-core jobs and gleefully ignore jitter.  or be running
an 8k-core montecarlo that never needs any global synchronization, etc.

I'd actually love to see data on whether jitter affects apps 
other than ah, "stockpile stewardship" ;)


From hahn at mcmaster.ca  Fri Jan 19 05:44:27 2007
From: hahn at mcmaster.ca (Mark Hahn)
Date: Fri, 19 Jan 2007 08:44:27 -0500 (EST)
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <45B060CB.1010904@etek.chalmers.se>
References: <45ABCF41.3020501@etek.chalmers.se> <45AD57B4.9060106@uiowa.edu>
	<45ADD530.90000@etek.chalmers.se>
	<Pine.LNX.4.64.0701181106300.8989@coffee.psychology.mcmaster.ca>
	<45B060CB.1010904@etek.chalmers.se>
Message-ID: <Pine.LNX.4.64.0701190818230.1962@coffee.psychology.mcmaster.ca>

>> hmm, curious argument - why do you think MOSIX will be more efficient
>> than a "normal" cluster when confronted with a trivial/serial workload?
>> is there something about a typical pbs/sge/lsf/etc system that you think
>> can't handle serial jobs?
>
> I probbably expressed my self i a bad way.  What i mean is that with a MOSIX 
> extension you can start a large bunch of "serial" processes on one node and 
> these will then migrate in a balanced way in the cluster.  A MOSIX extension

sure, Mosix has been around a while, so is reasonably well-know,
as is Scyld's approach.  what I'm interested in is whether Mosix
functions well in a more-than-toy cluster (say, at least 100p).

I guess I also am uncertain where Mosix's competitive advantage lies.
my experience is that a serial-job workload is so undemanding that 
migration is unnecessary - it's mainly for parallel jobs where you 
really want migration.  (in SHARCnet, there is a strong correlation
for serial jobs to also be relatively short and quite small in memory
use.  not _all_ are, of course, but certainly most.)

regards, mark hahn.


From gdjacobs at gmail.com  Fri Jan 19 13:39:29 2007
From: gdjacobs at gmail.com (Geoff Jacobs)
Date: Fri, 19 Jan 2007 15:39:29 -0600
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <74735BF202608043B11025A9FAA9438904662308@WIN-MSG-21.wingroup.windeploy.ntdev.microsoft.com>
References: <web-1279309@free.net>	<45AE3C29.1090809@scalableinformatics.com><6.2.3.4.2.20070117101040.0337f2b8@mail.jpl.nasa.gov>	<45AE6BEE.9050001@scalableinformatics.com>
	<74735BF202608043B11025A9FAA9438904662308@WIN-MSG-21.wingroup.windeploy.ntdev.microsoft.com>
Message-ID: <45B13A91.7020208@gmail.com>

Ryan Waite wrote:
> I know some of you aren't, um, tolerant of Microsoft for various reasons
> but I thought I'd clear up a couple errors in some of the posts. If you
> hate Microsoft at least you now have an email address for when you're
> feeling grumpy.
> 
> 
> Pricing
> 
> Retail pricing for Windows Server is about $750. Retail pricing for
> Compute Cluster Server (CCS) is around $470. Most users will get the
> product through either an OEM or a volume licensing agreement. In both
> cases they pay less than retail. Academic users can purchase CCS for
> less than $100.
> 
> CCS is comprised of two CDs. The first is Windows Server. The second CD
> contains the clustering tools. The second CD has three major features:
> 1) a job scheduler, 2) systems management tools, and 3) Microsoft's MPI
> stack. The majority of HPC systems sold are small (less than 256 nodes)
> and we've designed for those customers. So, users get an OS, job
> scheduler, management package, and MPI stack for < $500.
What about compilers?

> Our MPI stack is based on MPICH2 but we've made performance and security
> enhancements. The folks at ANL are very talented UNIX developers but
> Windows is more efficient using async overlapped I/O. We've made other,
> similar changes to our stack and we're providing those changes back to
> ANL for incorporation in future MPICH stacks. We're also the first group
> at Microsoft making these kinds of sizable contributions back to the
> open source community.
As much as many of us might have issues with, err, the more aggressive
marketing strategies Microsoft has used in the past, I can certainly
appreciate people such as yourself - wanting to succeed by creating good
software - no matter where they work.

> 
> SGI
> 
> These folks are great and I'm sure they have a lot to teach from their
> years in HPC. Also, we've hired people onto our HPC team from places
> like Platform Computing, Cray, Silverstorm and other related companies.
> While we may be new, and while v1 products may be a little rough, I
> think we're going to help the community bring HPC into mainstream
> computing.
I'm not sure that HPC will ever be mainstream. By definition, HPC
involves making trade-offs and pushing the envelope of what is possible
with modern computer technology. It is also somewhat limited in the
class of problem which it tackles. Mainstream (in my view) is synonymous
with general purpose.

-- 
Geoffrey D. Jacobs


From hahn at MCMASTER.CA  Fri Jan 19 12:23:06 2007
From: hahn at MCMASTER.CA (Mark Hahn)
Date: Fri, 19 Jan 2007 15:23:06 -0500 (EST)
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <74735BF202608043B11025A9FAA9438904662308@WIN-MSG-21.wingroup.windeploy.ntdev.microsoft.com>
References: <web-1279309@free.net>
	<45AE3C29.1090809@scalableinformatics.com><6.2.3.4.2.20070117101040.0337f2b8@mail.jpl.nasa.gov>
	<45AE6BEE.9050001@scalableinformatics.com>
	<74735BF202608043B11025A9FAA9438904662308@WIN-MSG-21.wingroup.windeploy.ntdev.microsoft.com>
Message-ID: <Pine.LNX.4.64.0701191509070.6349@coffee.psychology.mcmaster.ca>

> I know some of you aren't, um, tolerant of Microsoft for various reasons

"tolerant" is the wrong word.  beowulf is, definitionally, open-source.
anyone using *nix has obviously made a decision (aesthetic, commercial, etc)
to avoid the dominant OS; this is rejection rather than ill tolerance...

> and we've designed for those customers. So, users get an OS, job
> scheduler, management package, and MPI stack for < $500.

compared to $0.

in that light, the question is- does the academic cost include some 
form of support?

> Our MPI stack is based on MPICH2 but we've made performance and security
> enhancements. The folks at ANL are very talented UNIX developers but

I cynically guess the "security" enhancements are not really enhancements,
but rather simply making it work in the MSFT universe (domain controller,
etc).  is that correct?


> Windows is more efficient using async overlapped I/O. We've made other,

what _would_ be interesting is to hear about the technical aspects.
that is: how much difference does this make?  is the impetus to use 
async mainly to batch completion events (sort of like mpi_waitall)?
does it affect latency?  how does it compare to normal mpich on the same
hardware under linux?

also, is there any documentation on the job scheduler?

these kind of specifics would, truely,  be most gratefully "tolerated" ;)


> ANL for incorporation in future MPICH stacks. We're also the first group
> at Microsoft making these kinds of sizable contributions back to the
> open source community.

are the contributions entirely specific to the windows API?
(I would not call it a contribution if you've simply ported to the 
architecture you own...)


> think we're going to help the community bring HPC into mainstream
> computing.

I'd be curious to understand what that means.  HPC as implemented by 
beowulves seems entirely mainstream to me.  (lots of places already have 
substituted a GUI button for a queue-submit command...)

regards, mark hahn.


From James.P.Lux at jpl.nasa.gov  Fri Jan 19 14:22:06 2007
From: James.P.Lux at jpl.nasa.gov (Jim Lux)
Date: Fri, 19 Jan 2007 14:22:06 -0800
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <45B13A91.7020208@gmail.com>
References: <web-1279309@free.net> <45AE3C29.1090809@scalableinformatics.com>
	<6.2.3.4.2.20070117101040.0337f2b8@mail.jpl.nasa.gov>
	<45AE6BEE.9050001@scalableinformatics.com>
	<74735BF202608043B11025A9FAA9438904662308@WIN-MSG-21.wingroup.windeploy.ntdev.microsoft.com>
	<45B13A91.7020208@gmail.com>
Message-ID: <6.2.3.4.2.20070119141628.0302db28@mail.jpl.nasa.gov>

At 01:39 PM 1/19/2007, Geoff Jacobs wrote:
>R


> > scheduler, management package, and MPI stack for < $500.
>What about compilers?

Visual Studio 2005 Express Edition of VC++, VC#, VB, etc are free for 
the downloading.  They all include command line compiler/linker tools and make.

Not a straight across port from, e.g. gcc, but not bad, especially 
for console (non GUI) apps.

And, given that this product is aimed at folks who are already 
Windows developers, I would assume they use their existing 
tools.  The MPI stack is accessed (at least as of last summer) by 
what are essentially OS calls (P/Invoke).. It would be way cool if 
Ryan, et al., were to actually make a namespace, function calls, etc. 
so you didn't have to do the P/Invoke thing.

(I've been burned more than once trying to get things like Serial 
Comm working with P/Invoke in a multithreaded process... For me, the 
HUGE advantage of VS2005 was that I hopefully will never have to use 
P/Invoke again)


From James.P.Lux at jpl.nasa.gov  Fri Jan 19 14:38:26 2007
From: James.P.Lux at jpl.nasa.gov (Jim Lux)
Date: Fri, 19 Jan 2007 14:38:26 -0800
Subject: [Beowulf] MS MPI
Message-ID: <6.2.3.4.2.20070119143623.03030158@mail.jpl.nasa.gov>

Correction to my previous comment about needing to use P/Invoke...

There's now a MPI namespace, functions, etc.

And, apparently a parallel debugger inside Visual Studio 2005.

I haven't tried it yet.

James Lux, P.E.
Spacecraft Radio Frequency Subsystems Group
Flight Communications Systems Section
Jet Propulsion Laboratory, Mail Stop 161-213
4800 Oak Grove Drive
Pasadena CA 91109
tel: (818)354-2075
fax: (818)393-6875 


From mike at etek.chalmers.se  Fri Jan 19 23:57:35 2007
From: mike at etek.chalmers.se (Mikael Fredriksson)
Date: Sat, 20 Jan 2007 08:57:35 +0100
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <Pine.LNX.4.64.0701190818230.1962@coffee.psychology.mcmaster.ca>
References: <45ABCF41.3020501@etek.chalmers.se> <45AD57B4.9060106@uiowa.edu>
	<45ADD530.90000@etek.chalmers.se>
	<Pine.LNX.4.64.0701181106300.8989@coffee.psychology.mcmaster.ca>
	<45B060CB.1010904@etek.chalmers.se>
	<Pine.LNX.4.64.0701190818230.1962@coffee.psychology.mcmaster.ca>
Message-ID: <45B1CB6F.5010108@etek.chalmers.se>

Mark Hahn wrote:
> ........ what I'm interested in is whether Mosix
> functions well in a more-than-toy cluster (say, at least 100p).
>
> I guess I also am uncertain where Mosix's competitive advantage lies.
> my experience is that a serial-job workload is so undemanding that 
> migration is unnecessary - it's mainly for parallel jobs where you 
> really want migration.  (in SHARCnet, there is a strong correlation
> for serial jobs to also be relatively short and quite small in memory
> use.  not _all_ are, of course, but certainly most.)

Well, one big advantage is that the cluster will be open to a much wider
range of people who can do some programming but are not parallell
programming experts.  This in it self is a very powerful argument when
you want to motivate the funding you need to buy a (large) cluster.
Several departments may have to share the cluster and not all have the
skill, time or the money to develop specialized parallell software for
their applications.  But what they usually have is a *good* knowledge to
handle, for instance, math programs e.g. MATLAB.  So if they do, for
example, a type of "loop unrolling" in their MATLAB scripts (and also
ingrease detail), they can then start say 100 instances of MATLAB and
let them migrate.  And if necessary, they can implement simple types of
syncronization barriers.  This is not optimal HPC, but it is a way to
adapt to the reality for many organisations.

So, if we return to the SGI/Windows cluster case, with a MOSIX extension
there is a possibility that the cluster can also be used to run e.g.
serial administrative jobs.  Again, with a MOSIX extension you have a
extra argument to motivate the purchase (or selling) of a *good* cluster.


MF


From ashley at quadrics.com  Mon Jan 22 07:39:29 2007
From: ashley at quadrics.com (Ashley Pittman)
Date: Mon, 22 Jan 2007 15:39:29 +0000
Subject: [Beowulf] Re: SGI to offer Windos on clusters ---> Skew/Jitter
	paper
In-Reply-To: <Pine.LNX.4.64.0701182242350.16566@coffee.psychology.mcmaster.ca>
References: <20070118211424.27F2935A594@mail.scali.no>
	<45AFEBCD.9010007@ahpcrc.org>
	<30062B7EA51A9045B9F605FAAC1B4F621A54FE@exch01.quadrics.com>
	<Pine.LNX.4.64.0701182242350.16566@coffee.psychology.mcmaster.ca>
Message-ID: <1169480370.14268.68.camel@localhost.localdomain>

On Thu, 2007-01-18 at 23:12 -0500, Mark Hahn wrote:
> >>    "The Case of the Missing Supercomputing Performance"
> >
> > I wondered if you were talking about that paper but it's from lanl not sandia, it should be essential reading for everyone working with large clusters.
> 
> I love this paper.  but it's critical to realize that it's all about
> very large, very tightly-coupled, frequent-global-collective-using
> applications.  you could easily have a 2k-node cluster (I'd call it large)
> dedicated to 1-to-100-core jobs and gleefully ignore jitter.  or be running
> an 8k-core montecarlo that never needs any global synchronization, etc.
> 
> I'd actually love to see data on whether jitter affects apps 
> other than ah, "stockpile stewardship" ;)

In my experience yes.  Clearly some apps are more susceptible than
others.  At one extreme even embarrassingly parallel apps can suffer
from noise if the job is only considered complete when the last result
is returned.

Any app that performs synchronisation between nodes (even implicitly
with point-to-point comms) will cause delays caused by noise to
propagate across the cluster and unfortunately because of the way these
delays combine the effect gets quite defined at size.

Consider for example a 64 node cluster with one CPU per node, on this
cluster there is a deamon which wakes up once a minute, spins for a
second and goes back to sleep.  Running a single process job you can
expect to see 59/60 seconds elapsed used by the job.  You probably don't
worry about this.  Now assume that you have a 64 way job which performs
a global barrier every two seconds, now in that two second timeframe
statistically *at least one* node will be affected by noise so the
compute time for the process on that node is two seconds for the
application and one for the deamon.  Each timestep now takes three
seconds to achieve two seconds worth of compute time, that's 33% of your
compute time down the drain.  In reality the figures I've given here are
pessimistic, Linux doesn't have *that much* jitter so smaller clusters
are by-and-large unaffected however it's a fairly common problem on 1024
+ way clusters.

In answer to a previous post about using extra CPUS/cores to alleviate
this problem it's not a new idea, IIRC PSC were doing this six or seven
years ago, I'd be interested to see if hyperthreading helps the
situation, it's almost always turned of and any cluster over 32 CPU's
but it might be advantageous to enable it and use something like cpusets
to bind the application to real CPU's whilst letting the resource
manager/Ganglia/sendmail twiddle it's thumbs on the other virtual 20%
CPU.

Ashley,


From cap at nsc.liu.se  Mon Jan 22 09:14:39 2007
From: cap at nsc.liu.se (Peter Kjellstrom)
Date: Mon, 22 Jan 2007 18:14:39 +0100
Subject: [Beowulf] Re: SGI to offer Windos on clusters ---> Skew/Jitter
	paper
In-Reply-To: <1169480370.14268.68.camel@localhost.localdomain>
References: <20070118211424.27F2935A594@mail.scali.no>
	<Pine.LNX.4.64.0701182242350.16566@coffee.psychology.mcmaster.ca>
	<1169480370.14268.68.camel@localhost.localdomain>
Message-ID: <200701221814.39461.cap@nsc.liu.se>

On Monday 22 January 2007 16:39, Ashley Pittman wrote:
...
> In answer to a previous post about using extra CPUS/cores to alleviate
> this problem it's not a new idea, IIRC PSC were doing this six or seven
> years ago,

I was well aware that it wasn't new at all, my main point was that given 
the "core explosion" going on it's getting more interesting and cheaper every 
day.

> I'd be interested to see if hyperthreading helps the 
> situation,

Yes numbers on this would be interesting, my initial guess would be that it 
would be good (assuming that you never schedule more work than you have 
actual cores or course).

/Peter

> it's almost always turned of and any cluster over 32 CPU's 
> but it might be advantageous to enable it and use something like cpusets
> to bind the application to real CPU's whilst letting the resource
> manager/Ganglia/sendmail twiddle it's thumbs on the other virtual 20%
> CPU.
>
> Ashley,
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070122/d912eaf9/attachment.sig>

From ryanw at windows.microsoft.com  Sun Jan 21 17:57:20 2007
From: ryanw at windows.microsoft.com (Ryan Waite)
Date: Sun, 21 Jan 2007 17:57:20 -0800
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <Pine.LNX.4.64.0701191509070.6349@coffee.psychology.mcmaster.ca>
References: <web-1279309@free.net>
	<45AE3C29.1090809@scalableinformatics.com><6.2.3.4.2.20070117101040.0337f2b8@mail.jpl.nasa.gov>
	<45AE6BEE.9050001@scalableinformatics.com>
	<74735BF202608043B11025A9FAA9438904662308@WIN-MSG-21.wingroup.windeploy.ntdev.microsoft.com>
	<Pine.LNX.4.64.0701191509070.6349@coffee.psychology.mcmaster.ca>
Message-ID: <74735BF202608043B11025A9FAA9438904662B97@WIN-MSG-21.wingroup.windeploy.ntdev.microsoft.com>


-----Original Message-----
From: Mark Hahn [mailto:hahn at MCMASTER.CA] 
Sent: Friday, January 19, 2007 12:23 PM
To: Ryan Waite
Cc: Beowulf at beowulf.org
Subject: RE: [Beowulf] SGI to offer Windows on clusters
> 
>> I know some of you aren't, um, tolerant of Microsoft for various
reasons
> 
> "tolerant" is the wrong word.  beowulf is, definitionally,
open-source.
> anyone using *nix has obviously made a decision (aesthetic,
commercial, etc)
> to avoid the dominant OS; this is rejection rather than ill
tolerance...
> 
>> and we've designed for those customers. So, users get an OS, job
>> scheduler, management package, and MPI stack for < $500.
> 
> compared to $0.
> 
> in that light, the question is- does the academic cost include some 
> form of support?

Academic version is provided through MSDN Academic Alliance. Support
depends on the type of agreement. Our dev team also monitors
http://windowshpc.net and provides support there.

> 
>> Our MPI stack is based on MPICH2 but we've made performance and
security
>> enhancements. The folks at ANL are very talented UNIX developers but
> 
> I cynically guess the "security" enhancements are not really
enhancements,
> but rather simply making it work in the MSFT universe (domain
controller,
> etc).  is that correct?

We did some other security work as well. One example is the MPICH2 code
stores password information un-encrypted in the registry. That needed to
be changed. There were a couple similar issues uncovered during our
security review.

When used with our job scheduler we do integrate with Active Directory.
When jobs execute on compute nodes we first create a Windows Job Object
and then log on using the submitting user's credentials. The Job Object
runs with that user's credentials. If the user has their data on a
secured server then they should be able to access that data directly.
Also, if the process running in the Job Object spawns any other
processes they are contained in the same Job Object; if the user's job
is cancelled we can clean up any child processes.

> 
>> Windows is more efficient using async overlapped I/O. We've made
other,
> 
> what _would_ be interesting is to hear about the technical aspects.
> that is: how much difference does this make?  is the impetus to use 
> async mainly to batch completion events (sort of like mpi_waitall)?
> does it affect latency?  how does it compare to normal mpich on the
same
> hardware under linux?
> 

There were a number of changes. If you think it would be interesting I
could arrange a conference call or an online presentation where we
present our changes and the rationale behind those changes.

> also, is there any documentation on the job scheduler?
> 

Yep. Here's a paper with an overview of the scheduler:
http://www.microsoft.com/downloads/details.aspx?FamilyID=c4dd011a-42e0-4
978-b518-dd6cfef7131f&displaylang=en

Here's an article on linking with the CCP Job Scheduling API. Our
partners like Mathworks, Ansys, Schlumberger use these APIs to integrate
their apps directly with the cluster job scheduler. The result is that
users don't have to learn job control languages, they just press the
compute button from inside Fluent, etc.
http://msdn2.microsoft.com/en-us/library/aa578732.aspx


> these kind of specifics would, truely,  be most gratefully "tolerated"
;)
> 
> 
>> ANL for incorporation in future MPICH stacks. We're also the first
group
>> at Microsoft making these kinds of sizable contributions back to the
>> open source community.
> 
> are the contributions entirely specific to the windows API?
> (I would not call it a contribution if you've simply ported to the 
> architecture you own...)
> 
> 
>> think we're going to help the community bring HPC into mainstream
>> computing.
> 
> I'd be curious to understand what that means.  HPC as implemented by 
> beowulves seems entirely mainstream to me.  (lots of places already
have 
> substituted a GUI button for a queue-submit command...)
> 

Yeah, I think HPC is much more mainstream than before. I'd like to get
to a place where an HPC cluster is just another shared network resource.
Just like you can plug a printer into a network and decide to share it
with your colleagues I think setting up an HPC cluster and sharing it
should be just as simple. 

Ideally you could buy a 16 or 32 node cluster for your workgroup, plug
it in, it images itself (or all the nodes are pre-installed), integrates
with your network security (Active Directory or Kerberos or NIS, etc.),
decide who you want to share it with, and it's ready to run jobs. With
applications like Matlab's DCT and Grid Mathematica integrated directly
with the job scheduler it's easy to submit jobs and the results are
returned directly to the application.

So, mainstream, for us isn't about building bigger and bigger clusters
but about building clusters that are easy to manage, integrated directly
with your apps, and that work well with the rest of the software
infrastructure you have in place.

Woah, starting to sound like a marketing person.

> regards, mark hahn.
> 
> 


From ryanw at windows.microsoft.com  Sun Jan 21 18:23:25 2007
From: ryanw at windows.microsoft.com (Ryan Waite)
Date: Sun, 21 Jan 2007 18:23:25 -0800
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <45B13A91.7020208@gmail.com>
References: <web-1279309@free.net>	<45AE3C29.1090809@scalableinformatics.com><6.2.3.4.2.20070117101040.0337f2b8@mail.jpl.nasa.gov>	<45AE6BEE.9050001@scalableinformatics.com>
	<74735BF202608043B11025A9FAA9438904662308@WIN-MSG-21.wingroup.windeploy.ntdev.microsoft.com>
	<45B13A91.7020208@gmail.com>
Message-ID: <74735BF202608043B11025A9FAA9438904662B9D@WIN-MSG-21.wingroup.windeploy.ntdev.microsoft.com>

Hi Geoff, comments below.
 
 -----Original Message-----
From: Geoff Jacobs [mailto:gdjacobs at gmail.com] 
Sent: Friday, January 19, 2007 1:39 PM
To: Ryan Waite
Cc: Joe Landman; Jim Lux; Beowulf at beowulf.org; Mikhail Kuzminsky;
mike at etek.chalmers.se
Subject: Re: [Beowulf] SGI to offer Windows on clusters
> 
> Ryan Waite wrote:
>> I know some of you aren't, um, tolerant of Microsoft for various
reasons
>> but I thought I'd clear up a couple errors in some of the posts. If
you
>> hate Microsoft at least you now have an email address for when you're
>> feeling grumpy.
>> 
>> 
>> Pricing
>> 
>> Retail pricing for Windows Server is about $750. Retail pricing for
>> Compute Cluster Server (CCS) is around $470. Most users will get the
>> product through either an OEM or a volume licensing agreement. In
both
>> cases they pay less than retail. Academic users can purchase CCS for
>> less than $100.
>> 
>> CCS is comprised of two CDs. The first is Windows Server. The second
CD
>> contains the clustering tools. The second CD has three major
features:
>> 1) a job scheduler, 2) systems management tools, and 3) Microsoft's
MPI
>> stack. The majority of HPC systems sold are small (less than 256
nodes)
>> and we've designed for those customers. So, users get an OS, job
>> scheduler, management package, and MPI stack for < $500.
> What about compilers?

Compilers are available from PGI (Fortran and C++), Intel (Fortran and
C++), Lahey (Fortran, don't remember if this was 64-bit support or not),
and Microsoft (C++, etc.). Visual Studio 2005 includes a parallel
debugger and OpenMP support.

> 
>> Our MPI stack is based on MPICH2 but we've made performance and
security
>> enhancements. The folks at ANL are very talented UNIX developers but
>> Windows is more efficient using async overlapped I/O. We've made
other,
>> similar changes to our stack and we're providing those changes back
to
>> ANL for incorporation in future MPICH stacks. We're also the first
group
>> at Microsoft making these kinds of sizable contributions back to the
>> open source community.
> As much as many of us might have issues with, err, the more aggressive
> marketing strategies Microsoft has used in the past, I can certainly
> appreciate people such as yourself - wanting to succeed by creating
good
> software - no matter where they work.
> 
>> 
>> SGI
>> 
>> These folks are great and I'm sure they have a lot to teach from
their
>> years in HPC. Also, we've hired people onto our HPC team from places
>> like Platform Computing, Cray, Silverstorm and other related
companies.
>> While we may be new, and while v1 products may be a little rough, I
>> think we're going to help the community bring HPC into mainstream
>> computing.
> I'm not sure that HPC will ever be mainstream. By definition, HPC
> involves making trade-offs and pushing the envelope of what is
possible
> with modern computer technology. It is also somewhat limited in the
> class of problem which it tackles. Mainstream (in my view) is
synonymous
> with general purpose.

Yep, I think you're right. I'm oversimplifying but I think HPC will have
two divisions in the future. The first are the disciplined (read:
hard-core) HPC users, people who require the fruits that come from
careful and sometimes laborious optimization of their HPC environments.
These are also the people who have the skills to deploy large (>512
node) clusters. These users have sophistication with complex software
packages, development tools, middlewear (schedulers, MPI stacks), and/or
hardware.

The second division will be people who don't have sophistication with
programming or systems management. Instead of using C++ and Fortran they
use very high level environments like R, Matlab, Mathematica, and Excel.
While they aren't classic HPC users they do have a lot of computational
work, work that could be completed quicker on a cluster. In this case
you're right, it's much more general purpose.

> 
> -- 
> Geoffrey D. Jacobs
> 
> 
> 


From landman at scalableinformatics.com  Mon Jan 22 21:44:23 2007
From: landman at scalableinformatics.com (Joe Landman)
Date: Tue, 23 Jan 2007 00:44:23 -0500
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <74735BF202608043B11025A9FAA9438904662308@WIN-MSG-21.wingroup.windeploy.ntdev.microsoft.com>
References: <web-1279309@free.net>
	<45AE3C29.1090809@scalableinformatics.com><6.2.3.4.2.20070117101040.0337f2b8@mail.jpl.nasa.gov>
	<45AE6BEE.9050001@scalableinformatics.com>
	<74735BF202608043B11025A9FAA9438904662308@WIN-MSG-21.wingroup.windeploy.ntdev.microsoft.com>
Message-ID: <45B5A0B7.1040501@scalableinformatics.com>


Ryan Waite wrote:
> I know some of you aren't, um, tolerant of Microsoft for various reasons
> but I thought I'd clear up a couple errors in some of the posts. If you
> hate Microsoft at least you now have an email address for when you're
> feeling grumpy.

As I now have a chance for a response, I think it is warranted.

Intolerance?  No.  Suspicious of ulterior motives?  Hmmm.  Why would 
that be?

There are Microsoft tools you won't get from me without prying from my 
cold dead fingers.  There are Microsoft tools that are extraordinarily 
painful to use/deploy, that don't play well with the mainstream 
currently in use tools.  There are Microsoft tools with a, well, less 
than stellar security record or model.

> Pricing
> 
> Retail pricing for Windows Server is about $750. Retail pricing for
> Compute Cluster Server (CCS) is around $470. Most users will get the
> product through either an OEM or a volume licensing agreement. In both
> cases they pay less than retail. Academic users can purchase CCS for
> less than $100.

Lets be clear about this.  The pricing (470, 100, ...) is *per node* 
correct?  If it is not *per node* and is *per cluster* this is an 
interesting scenario, and somewhat at odds with what people have been 
telling me over the last several months.

> CCS is comprised of two CDs. The first is Windows Server. The second CD
> contains the clustering tools. The second CD has three major features:
> 1) a job scheduler, 2) systems management tools, and 3) Microsoft's MPI
> stack. The majority of HPC systems sold are small (less than 256 nodes)
> and we've designed for those customers. So, users get an OS, job
> scheduler, management package, and MPI stack for < $500.

*per node* or *per cluster*

> Our MPI stack is based on MPICH2 but we've made performance and security
> enhancements. The folks at ANL are very talented UNIX developers but
> Windows is more efficient using async overlapped I/O. We've made other,
> similar changes to our stack and we're providing those changes back to
> ANL for incorporation in future MPICH stacks. We're also the first group
> at Microsoft making these kinds of sizable contributions back to the
> open source community.

This is good.  I am glad to hear this.

As others have noted, the aggressive marketing campaign is a bit over 
the top.  Linux clusters have been "mainstream" and well integrated into 
corporate worlds for a while.  This could explain for example that the 
market has been experiencing explosive growth long before Microsoft ever 
entered it.

Be that as it may, I believe that there are specific areas where it 
would be quite valuable to have Microsoft work, and tap into the huge 
and rapidly growing market.  The replacement strategy represents a risk 
to users IMO if Microsoft tires of this (relatively small for them) 
market, and decides "no more".  There are other areas within HPC where a 
strong Microsoft presence would be good, in terms of interoperability, 
cross platform scripting/development, interface.  Not VBA everywhere. 
Mono with hooks to enable us to bind our languages in.

You might take pains to notice that some of us who are vocal critics of 
your companies actions and products, are also vocal critics of your 
competition.  I have been decrying the MPI binary interface issue on 
Linux for a long time.  This impedes progress IMO.  I have praised the 
DLL approach where one ABI for MPI would be supported (not one chip/os 
ABI, but application level ABI).  You might note that some of us heap 
appropriate scorn upon the poor choices of some Linux vendors (not 
necessarily the ones who sign deals with you).


> Thanks,
> Ryan Waite
> Group Program Manager, HPC
> Microsoft

As I said, there are Microsoft products that we will not stop using.

I personally would like to see some of them on Linux.  And I would pay 
for them on Linux.  As I am sure others would as well.  I pay for 
products that enable me to run them on Linux.  I pay for the additional 
license so that I legally can run them on Linux.

Now why would that be, if I were intolerant of Microsoft and their 
products?  Why would I be working hard to get the specific tools (Excel, 
Powerpoint, and to an extent, Word) available to me on this platform?

I wonder.

-- 
Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics LLC,
email: landman at scalableinformatics.com
web  : http://www.scalableinformatics.com
phone: +1 734 786 8423
fax  : +1 734 786 8452
cell : +1 734 612 4615


From rgb at phy.duke.edu  Tue Jan 23 09:28:33 2007
From: rgb at phy.duke.edu (Robert G. Brown)
Date: Tue, 23 Jan 2007 12:28:33 -0500 (EST)
Subject: [Beowulf] An OT patented rgb editorial rant, skip if you like...
In-Reply-To: <74735BF202608043B11025A9FAA9438904662308@WIN-MSG-21.wingroup.windeploy.ntdev.microsoft.com>
References: <web-1279309@free.net>
	<45AE3C29.1090809@scalableinformatics.com><6.2.3.4.2.20070117101040.0337f2b8@mail.jpl.nasa.gov>
	<45AE6BEE.9050001@scalableinformatics.com>
	<74735BF202608043B11025A9FAA9438904662308@WIN-MSG-21.wingroup.windeploy.ntdev.microsoft.com>
Message-ID: <Pine.LNX.4.64.0701230730120.6292@lilith.rgb.private.net>

On Thu, 18 Jan 2007, Ryan Waite wrote:

> I know some of you aren't, um, tolerant of Microsoft for various reasons
> but I thought I'd clear up a couple errors in some of the posts. If you
> hate Microsoft at least you now have an email address for when you're
> feeling grumpy.

I don't feel grumpy (I've had my coffee:-) about Microsoft, nor do I
hate it.

If anything, I fear it.  And so should you, even as you work for it.

Never in the history of the world has a single company achieved the
level of single-market dominance that Microsoft now has.  Even AT&T at
its peak didn't dominate the WORLD market, and it was a government
regulated monopoly (indeed, it could not have come into existence
without the active help of the government, which more or less
deliberately decided to give it exclusivity in the market in exchange
for accepting government regulation and price control).  J.D.
Rockefeller was a piker, Vanderbilt a wimp in comparison.  Only Ford,
perhaps, enjoyed a similar period of global dominance but then, no,
probably not, as global markets didn't really exist until after he had
competition.

Microsoft, on the other hand, is for all practical purposes completely
unregulated, it faces no serious competition, it routinely engages in
business practices that make it very difficult for serious competition
to ever arise, and it extends all over the world, not just in the United
States.  It has long since surpassed critical mass.  It has demonstrated
conclusively that it is invulnerable to antitrust suits -- it can
cheerfully spend more money defending against them than it stands to
lose, and can stand to lose a billion dollars, and still come out
unimaginably ahead.  After all, its opponents also have to match it
dollar for dollar and politically breaking it up is not an option even
if it is the "obvious" thing to do.

Microsoft has exploited its position to achieve the unthinkable -- it
has become a globe-spanning "hydraulic empire" (water monopoly), the
strongest kind of monopoly there is and one where it has virtually NO
competition and where by virtue of its position it can ensure that NO
competition has any sort of realistic chance to emerge.

This is more than an analogy -- its practices fit this historical model
better, in many ways, than e.g the Chinese empires that were one of
Wittfogel's original examples.  By controlling the basic operating
system (the "water") it has asserted a level of control over the mass
software market for PCs that vastly exceeds any reasonable definition of
a "trust".  Basically, it does whatever it likes in this market, in such
a way that it literally cannot be opposed.  Time and again, when a new
software market has developed in the past, when an entrepreneur has come
up with a good idea and at risk of personal fortune and time created a
new software product, Microsoft has simply written their own version of
the product, shifted the access of their competitor to the "water" of
the operating system to create problems that they (Microsoft) are able
to avoid, and behold! The emperor's troops remain healthy and strong
while those of the upstart warlords are thin and emaciated without the
water to grow rice!  They have then proceeded to take as much of the
market as they liked.  Where is Borland today?  Lotus?  Corel?
Netscape?  Even Apple exists to some extent because Microsoft "needs" a
visible "competitor" lest our government be forced to actually
acknowledge the obvious truth.  OS2 was the last viable candidate for a
competitor, and if it had won IT would doubtless have become the
hydraulic empire and we'd all be railing against IBM.

I could go on (and have gone on in this and other forums in the past:-).
Adam Smith's invisible hand relies on the POSSIBILITY of nucleation and
growth of real competition, but the wonderful (from Microsoft's point of
view) thing about hydraulic empires is that they historically never fall
from within, and even when conquered from without their replacement
starts to "look like" the conquered bureaucracy -- the temptation to
exert abolute control by controlling access to water is just too strong.
Only forces from outside -- foreign barbarian invaders -- tend to be
able to bring about real change.

So when netscape emerges as a viable competitor in one small part of the
Empire -- sorry, no water for you.  Your product will not work, our
competing product cannot be removed and does.  Java?  A clear threat, as
it enables the development of software that does not rely on our supply
of water -- suborn it and insert our own insidious code base to ensure
that future programs written to use it require water from our carefully
controlled and expensive wells.  Make sure that our customers know that
glacial ice melt water provided by penguins, however clear and cold and
free of access, is of limited supply and contains giardia, cholera,
amoebic dysentary and possibly traces of mercury and radioactive
compounds because penguins have unclean habits and never wipe their feet
and should NEVER be used to make java.  We (Microsoft) cannot lose,
because somewhere between 90% and 95% of all desktops already run our
flavor of water (and the exceptions are pretty much confined to graphics
arts workstations or geek machines, both ignorable markets that we still
dominate anyway) and will hence inherit our flavor of Java. Business
developers who choose to fight the trend will simply dry up and blow
away, and if we have to pay Sun a half-billion dollars in "damages" who
cares?  The real "damage" is already done to our advantage and the
markets at stake are tens of billions per year.

Or my favorite -- when assessing and certifying competence on computers
in the state of North Carolina, students are tested on the use of an
integrated office suite.  Which one(s)?  Well, let's see.  Schools have
the choice of Microsoft Office, Microsoft Works (even for -- and this is
not a joke -- DOS 2 or 3) or Apple Works (or again not a joke, Claris).

Hmmm.  Apple has been driven to the edge of extinction several times and
has only been teased back from the brink by the invention of the ipod
and OSX (the latter allowing it to tap into the fast pool of OS software
and solving to some extent Apple's problems finding people outside of
Apple willing to develop for the platform).  And Apple has a certain
appeal in elementary schools in the state, especially with the deals
Apple is willing to cut to remain in the market.  Still, what does this
mean, practically speaking, given the cold hard reality of that 95%+ of
all BUSINESS desktops being Microsoft mentioned above? That the great
state of North Carolina metaphorically tests "driving" -- not of any old
vehicle -- but of a Ford, because if and when you graduate and go on to
work in business, you're gonna be driving a Ford.

Oh, you can use a late model Ford, a used Ford, or even one of those
antique Fords that still use handbrakes and are started with a little
handle up front, but a Ford it must be.  And if not a Ford, we'll
tolerate an "artistic" American Motors, because after all it is modelled
upon the Ford and besides some of us still own stock in it or like the
garish colors of its sporty models.  Don't even think about coming in to
pass your driving test in one of those "open source" autos, that somehow
auto-magically assemble themselves -- God knows if the gearshift even
works, and then don't run on the approved flavor of Water.

Thank you North Carolina (and many, many other states).  Talk about
>>institutionalizing<< a monopoly by >>government mandate<< by training
our children to accept it as the natural state of affairs from their
earliest years...

This globe-spanning supermonopoly is a serious and ongoing threat to our
personal freedom.  This is for a variety of reasons.  For one, the
"water" that is being controlled is the fundamental means of processing
information, and we live in a society where information and its
processing is so tightly integrated with economic, governmental,
military, and research activities that the possibilities of abuse in
this arena are positively nightmarish (and are explored in various
movies and books that make this point).  For another, the monopoly (like
all superpowerful orgainizations, criminal or otherwise) becomes a form
of "shadow government" -- collecting what resembles a tax far more than
a fee for service as an unavoidable cost of doing business, since there
is really no viable alternative to using water from their tightly
controlled and very expensive wells.

The supermonopoly can also directly impact political choice simply
because of its vast resources.  Money has a huge effect on the success
of modern media-based political campaigns, and by directing even tiny
bits of its vast resources -- through completely legal means -- a
supermonopoly can have a disproportionate effect on political campaigns
and political decision making.  We've seen how pervasive this sort of
thing can be in the case of e.g. the tobacco industry and its powerful
and well-funded lobby, that kept it more or less invulnerable to any
sort of rational regulation at the cost of HUNDREDS of millions of LIVES
worldwide over the DECADES from when the scientific evidence of
addiction, mobidity and mortality was completely overwhelming and beyond
any reasonable doubt.  If we can't even act to preserve our lives
against the power and money of the tobacco lobby, who could expect us to
act to preserve something as ephemeral as our informational freedom in
the hands of a supermonopoly that doesn't need a "consortium" of
companies to create a lobby -- it IS the consortium?

Almost by definition, much of the influence exercised in this way is
"invisible" -- it can be uncovered only by means of nearly impossible
detective work, and then usually only surfaces during a scandal of some
sort where the usual protections of cronyism, "unremarkable" memberships
on the board of directors of seemingly disconnected companies, and
untraceable non-cash quid-pro-quo deals break down.  Some of it IS
uncovered, but it turns out (unsurprisingly) that short of a smoking gun
or the crossing of an invisible line somewhere, nobody cares.  So Tom
Delay goes down, perhaps there are connections there back to Microsoft,
perhaps not, but they are quickly explained or hushed and everybody goes
back to their business having seen "nothing".

Why is that?  Well, for one thing in addition to holding a water
monopoly sort of control over competitors that makes it "impossible" for
a serious competitor for any given significant software product it takes
an interest in to emerge WITHIN the confines of its uniquely pervasive
desktop operating system, it gets to rely on a variety of aspects of
human nature to help it maintain a position where people don't CARE if
it maintains its monopoly, or even actively support it.  They are
content, as it were, to accept the risk to their personal freedoms and
to pay the Microsoft tax as long as their own personal computing
environment remains familiar.  Just as was the case for decades with
AT&T.

It is a sad fact that roughly 90% of all humans hate to have to learn
new things (a thing that I constantly struggle with as a teacher and
parent).  Seriously.  Sure, there are exceptions -- all people don't
mind learning some new things, some people would love to be able to
learn all new things, but all people do NOT want to learn all new things
and a significant class doesn't want to have to learn at all.  As a
species, though we live in a perpetual state of what Alvin Toffler once
called "Future Shock" and we just aren't evolved for it.  We especially
hate to have to learn new things (and maybe fail at it!) in order to
keep our jobs, in order to be able to do work we've already figured out
how to do "the old way".  Learning is "expensive".  It costs time and
money.  There is also something mysterious about how it is an
>>unpleasant<< aspect of mental activity for most people -- we are
somehow evolved, one is almost forced to conclude, to >>avoid<< the
particular mental actions and states associated with structured
learning.

As a systems person I've seen this a million times over.  Once a
secretary or office person has by virtue of necessity associated with
the means of making their living overcome all of the pain and invested
all of the time and "mastered" enough of e.g.  Microsoft Office to be
able to do their job with it, they will NOT willingly change.  Change
means threat, it means more work for them, it means an uncomfortable
period of uncertainty -- they will only willingly change if they are
de-facto threatened with dismissal if they fail to change and if they
are supported through the change, at which point they will become just
as adamently opposed to change away from the new product.  [This isn't
just a factor that works in favor of Microsoft products -- for many
years the physics department used (the old toy) Macintoshes
administratively because our then chair was enamored of them.  When a
new chair took over and decided to change away from this system to
Windows based PCs (this was an easy ten years ago and Linux wasn't even
a vaguely possible alternative at that point, and Sun workstations which
were were 2-3x more expensive) there was much pain and resistance and
suffering before the move was accomplished.]

Humans in this state become conservative and defensive about the
provider of the flavor of water they think that they need to survive,
unmolested by the need to change.  They are in a curious way addicted,
trapped in their current way of doing business by many natural and
artificial/perceived barriers to change.

EVEN if many flavors of water were out there, they'd prefer a world with
only the one they are "used to" because they have a hard time coping
with change, with choice, with the "threat" associated with the
possibility that they might be required to learn a new tool that is
finally beyond their abilities to master or that lacks some feature that
they have grown accustomed to in their old toolset.  Remember, computers
in particular are the leading edge, the very shockwave itself, of Future
Shock.  Moore's Law more or less guarantees it.  Five years is enough to
see a complete revolution, change that might have taken a lifetime to
see two hundred and fifty years ago compressed into two hundred and
fifty weeks.

Voice recognition is coming, so are universal convertible tablets, plus
changes as yet unknown, all of them scary, unsettling, expensive.  Not
even industry pundits can predict what the world of computing will be
like five years from now with any real accuracy, and in ten years we
will probably be carrying around fully voice-driven wireless universal
interfaces to "the network" which at long last will indeed be "the
computer" -- and the media delivery channel, and the phone system, and
roughly 90% of our active memory and de facto usable intelligence.  Or
something even more bizarre.

So sure, those humans are actually perfectly happy to worship the
Emperor and bless Him at meals, as it is by the Emperor's good graces
that food arrives on the table -- his water let's their crops of rice
grow and if fools start digging their own wells or diverting the rivers
of free water there will be war and chaos and "interesting times".  It
is better to remain a peasant with rice on the table than to be brave
and perhaps watch one's children starve or to die at the hands of the
barbarians.

Finally, there is Microsoft and pension plans and the general stock
market.  This is perhaps the scariest part of Microsoft's supermonopoly
status, one that a gentleman named Bill Parrish seems to have devoted
himself to uncovering and laying bare to an obviously uncaring world.
Microsoft stock is a rather huge component of stock owned by both
pension plans and individual "S&P Index" investors (and individuals) all
over the world.  If Microsoft stock were to collapse, or even to slip
steadily down in nominal value, the economic consequences would be
catastrophic.  It would make the collapse of Enron look tame by
comparison, because Microsoft is considerably larger at baseline than
Enron ever was.  This creates a HUGE disincentive for individuals and
companies to challenge Microsoft's hydraulic legacy -- Microsoft has
essentially tied the future well being and wealth of an entire
generation of corporate employees and index fund investors to their own
continued success.

Who can doubt the political impact of this astounding fact (and feat)?
What president, what attorney general, would dare to tackle this
supergiant when by doing so he or she would damage the retirement
prospects of tens of millions of (voting) people?  Even traditional
opponents of supermonopolies quail before the damage this would do to
the ordinary workers that are their constituents.  Note that Microsoft
is nearly unique in their status here -- in most other industries a
gradual slippage gives the market time to adjust and reinvest in other
emerging and more profitable businesses in the same sector, including
those that are (in a healthy market economy) the ones that are putting
the hurt on the failing business.

However there ARE no other businesses poised to "become Microsoft", and
there is little sign that anybody really wants a mixed marketplace with
many choices (an argument that was used for years to justify the
perpetuation of AT&T, BTW, although after it was broken up it turned out
that the consumer just LOVED the explosion of competitive alternatives
for their phone service dollar and still are benefitting from them
today).  Apple is still a joke as far as threats go, and could be
quashed more or less at will if it were in Microsoft's real interest to
do so -- they NEED at least one "visible" competitor to trumpet in their
period antitrust suits to help them advance the argument that they don't
need to be broken up like AT&T was, they're just strugging to keep their
head above water folks, really, competition could emerge >>any day
now<<.  So sure, Linux makes steady inroads in the server market and
somehow managed to create a multibillion dollar cluster market all by
itself, other unices are holding their own or slipping a bit, but the
big market, the one that matters, are the hundreds of millions of
desktop computers, not the millions of servers that serve them (that are
STILL overwhelmingly Microsoft servers), and they all use Microsoft
water to grow Microsoft rice that has to be eaten with Microsoft
chopsticks from a Microsoft bowl (where other chopsticks tend to drop
valuable grains of rice, other bowls spill rice on the table) by an
overwhelming margin.

Even if (or rather when, in my opinion) Linux emerges as a viable threat
on the desktop, it will do so in a way that is disasterous for those
pension funds, because it will do so by DEFLATING the incredibly
INFLATED software market back to something approximating true value.
This isn't "just" a matter of Linux being basically free so that
software companies in this market are really service providers and not
software providers, eliminating the high margins of pure profit
associated with having dozens of products developed and maintained by
any ten or even hundred employees that are then resold onto a hundred
million or more desktops.  Microsoft's P/E for years has been one of a
strong growth company and is in no way balanced as a generator of steady
revenues as an income stock.  If (or rather, WHEN) its growth shows
signs of actually peaking, not just bobbling along with the market or
tapering off but actually deflating some with no obvious new markets to
exploit and no more headroom for growth, The P/E bubble will burst and
Microsoft could lose 1/3 to 2/3 of its value in a matter of a year, with
NO company emerging as a suitable reinvestment platform to replace the
money with matching stratospheric growth in the sector.  A hundred
billion dollars will simply vanish from our economy like the paper it
is, dragging with it hundreds of billions more as the complex of debt
structures, pension investments, exchanges of services, and so on comes
crashing down.

Sure, we would survive this, just as we survived the S&L collapse that
caused a few hundred billion paper dollars to disappear, we survived the
dotcom collapse brought about by a lot of ongoing business practices
that inflate apparent value and preserve the illusion of endless growth,
we survived Enron, we survived Tyco, we survived MCI/Worldcom.  However,
what politician wants to be seen as the one that triggers such a
collapse, even the collapse of a rotten, termite-ridden house when that
house shelters millions of voters?  What businessman (or congressman) is
immune to the charm of continuing to buy into Microsoft's empire when
Microsoft's market position makes it so easy and besides, it would be
bad for their own pension plans and their own personal investment
portfolios to do otherwise?

In my opinion, the world is still coming to grips with emerging global
supermonopolies, with intellectual property seen now as a "natural
resource" to be created by individual minds, often with high risks, and
then taken over by corporate supergiants as their bread and butter
resource.  Strong historical, political and economic forces conspire to
protect many of these supermonopolies because they often provide
services or goods that are "necessary" to the functioning of the global
economy.  Also, they necessarily have as an essential part of their
superorganismal nature an urge to grow, to dominate markets, to quash
competition, to make more money for their shareholders and preserve the
power and intersts of their corporate leadership and employees.  They
are indeed little shadow governments, and have interests that are not,
actually, the interests of the general public at heart.

They are opposed, ultimately, not by the forces of communism or
totalitarianism (either of which tend to simply "become" the hydraulic
empire anew under new management) but by the forces of a free society
with the right to regulate business practice and level playing fields
for the common good.  The laws of our free society, however, change
slowly, very slowly compared to the rate at which supermonopolies have
emerged, and at no time have the lawmakers been free from the immense
influence wielded by those supermonopolies via the mechanisms outlined
above.  As a consequence we must suffer each "surprising" collapse, each
"unethical" business practice that is revealed for the pyramid scheme or
shell game that it is when the peak of the pyramid is finally reached
and there is no longer any way to pay off the expectations of all of
those who invested in it.

So no, I don't hate Microsoft, any more than I hate Ford or hate Exxon
or hate Verizon or hate Enron.  I fear Microsoft for the threat it
implies to my own personal political freedom, for the influence it has
had on the last couple of presidential and all ongoing congressional
elections (won, we must recall, by the thinnest of margins and usually
by the candidate with the deepest pockets), for the disaster I see
looming when it can no longer count on growing at a rate that justifies
its shareholders expectations as a "growth stock" and is left in a state
of eternal war to defend a slowly eroding income stream against the tiny
nibbling penguins that ultimately will only go away if Microsoft manages
to stake out some sort of unassailable intellectual property turf, and
for the significant problems I see associated with any company's IP
becoming a de facto standard for information storage and processing,
especially for the government.

So I forsee "interesting times" ahead on all fronts.  As a Microsoft
employee, you can hardly state in print that you share any of these
concerns.  You more or less have to defend the point of view that it is
simply great and wonderful that a single company controls such an
overwhelming share of the world's information technology industry (and
wealth -- more than a rather impressive list of COUNTRIES) because it is
YOUR company and YOU benefit directly from its success.  You have to be
overjoyed to see that yet another possible high growth market will be
usurped and co-opted on behalf of your Emperor because it pays for the
rice that feeds your children and maintains a state of peace in the
Empire.

These are good times, for you.  The barbarian penguins are far away and
weak -- it is easy in this time of plenty to feel the warm joy of a life
well lived and well ordered, where all of humanity worships the Emperor
and eats his the rice that the water that he controls makes possible,
even when it is the peasants themselves that actually grow the rice and
pump the water up from his wells with the strength of their backs.  It
is even possible to learn from these upstart penguins, to observe how
they fight battles and use the profitable weapons they have discovered
back upon them, a strategy that has worked well so many times before.

It is not necessary, nor even desireable, to wipe them out, any more
than it would be a good thing to eliminate the loyal opposition, Apple.
The forms of democracy and "free-market" competition must be observed.
All that is needed is to ensure that no seed may be planted, no twisted
sapling take root, that might one day grow into a vast kudzu-like mass
that could challenge the Emperor, and so the Emperor's ministers remain
vigilant, guarding against these weeds that can grow without the
Emperor's water by crowding them out, buying them out, or planting right
next to them and lavishing such care as to ensure that they grow strong
while the challenger at best lives a blighted existence thereafter.
Perfection is not needed -- good enough is plenty when you rule the
entire world.

As a human being, though, you too must fear the Emperor.  If he fails,
you will be among the first to starve.  His weaknesses are your
weaknesses, and in our society there are always the Gods of Democracy
and Free Trade that stand even over the Emperor and can, with the stroke
of a pen, cast him down. There are always the warring demons of the
stock exchange, ever fickle, that can lose confidence in the strength of
the Emperor and overnight make you a pauper.  There is the chance that
among the penguins will emerge a veritable Ghengis Khan who will overrun
the Empire with a might horde.  To defend against these threats the
Emperor ever seeks to extend his Dominion over even these Gods and
Demons, to arrange matters so that no longer are his ministers and loyal
subjects threatened in this way but instead are protected, aye, are
become one with the Gods themselves.  To have to eat the Emperor's rice
by law, to see it served in all of the schools, surely that is enough to
ensure the immortality of the Emperor and all who support him.

But never forget -- the barbarian penguins have one weapon, one tool,
that the Emperor can never embrace for it would unmake him, and cause
his mighty empire to unravel and turn to dust even as he sought to grasp
it.  A tool stronger than the worst Khan of a penguin of sweaty
nightmares, a weapon greater than any other ever discovered. Everybody
on this list knows well what it is, and why that tool makes it
impossible, ultimately, to wipe out these pesky penguins UNLESS the
Emperor becomes a Dark God and can do so by fiat, unless the Empire is
indeed protected by force of law.

Do you?

     rgb

-- 
Robert G. Brown	                       http://www.phy.duke.edu/~rgb/
Duke University Dept. of Physics, Box 90305
Durham, N.C. 27708-0305
Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu


From simon at thekelleys.org.uk  Tue Jan 23 12:17:49 2007
From: simon at thekelleys.org.uk (Simon Kelley)
Date: Tue, 23 Jan 2007 20:17:49 +0000
Subject: [Beowulf] An OT patented rgb editorial rant, skip if you like...
In-Reply-To: <Pine.LNX.4.64.0701230730120.6292@lilith.rgb.private.net>
References: <web-1279309@free.net>	<45AE3C29.1090809@scalableinformatics.com><6.2.3.4.2.20070117101040.0337f2b8@mail.jpl.nasa.gov>	<45AE6BEE.9050001@scalableinformatics.com>	<74735BF202608043B11025A9FAA9438904662308@WIN-MSG-21.wingroup.windeploy.ntdev.microsoft.com>
	<Pine.LNX.4.64.0701230730120.6292@lilith.rgb.private.net>
Message-ID: <45B66D6D.60208@thekelleys.org.uk>

Robert G. Brown wrote:
> On Thu, 18 Jan 2007, Ryan Waite wrote:
> 
>> I know some of you aren't, um, tolerant of Microsoft for various reasons
>> but I thought I'd clear up a couple errors in some of the posts. If you
>> hate Microsoft at least you now have an email address for when you're
>> feeling grumpy.
> 
> 
> I don't feel grumpy (I've had my coffee:-) about Microsoft, nor do I
> hate it.
> 
> If anything, I fear it.  And so should you, even as you work for it.
> 

< snip tour-de-force >

> Do you?
> 
>     rgb
> 

Bravo! That was almost worthy of Neal Stephenson.

Cheers,

Simon.


From mbernabeu at dsic.upv.es  Tue Jan 23 07:30:28 2007
From: mbernabeu at dsic.upv.es (Miguel =?ISO-8859-1?Q?=D3scar?= Bernabeu i Llinares)
Date: Tue, 23 Jan 2007 16:30:28 +0100
Subject: [Beowulf] middleware for heterogeneous cluster
Message-ID: <1169566228.3276.53.camel@riesling.dsic.upv.es>

Hi all,

I'm new to the list, so let me introduce myself: I work as a researcher
in the Technical University of Valencia (Spain). I've been using Beowulf
Clusters for several years, but I've got little experience in the
installation and administration of them.

We are planning to integrate some legacy hardware (Pentium IV and Xeon
two-processors) with brand new hardware (Itanium II Montecito SMP
boards) in order to build an heterogeneous cluster.

I'm interested in using a middleware like Warewulf, Perceus, Oscar,
Rocks, ... to reduce installation and maintenance overhead, since we
must focus in research tasks. I've been reading some webs and papers
about how dealing with heterogeneity in Warewulf, Perceus and Oscar, but
none of them seem to be the best solution: Oscar forces me to use a ia64
master node
( http://www.mail-archive.com/oscar-users at lists.sourceforge.net/msg06075.html ), while Perceus is not properly documented AFAIK.

Does anybody successfully configured a cluster like this? Any
suggestion?

Regards.

-- 
Miquel ?scar Bernabeu i Llinares <mbernabeu at dsic.upv.es>
Department of Information Systems and Computation.
Technical University of Valencia. Spain.


From rgb at phy.duke.edu  Tue Jan 23 15:23:04 2007
From: rgb at phy.duke.edu (Robert G. Brown)
Date: Tue, 23 Jan 2007 18:23:04 -0500 (EST)
Subject: [Beowulf] An OT patented rgb editorial rant, skip if you like...
In-Reply-To: <OFD98C2949.DEE5EEC5-ON8525726C.006EABCD-8525726C.0073E300@rohmhaas.com>
References: <OFD98C2949.DEE5EEC5-ON8525726C.006EABCD-8525726C.0073E300@rohmhaas.com>
Message-ID: <Pine.LNX.4.64.0701231639280.6292@lilith.rgb.private.net>

On Tue, 23 Jan 2007, Thomas H Dr Pierce wrote:

> Now as for the size of the computer market and Microsoft. Nope, the top 10
> companies in the US are 1. Exxon Mobil 2. Wal-Mart Stores 3. General
> Motors 4. Chevron 5. Ford Motor 6. ConocoPhillips 7. General Electric 8.
> Citigroup  9. AIG  10. IBM  - and one can argue that IBM is not a computer
> company these days: They are a services company to a large degree.
>
> Personally I fear Walmart, but the Oil companies are probably more
> powerful.

I agree.  Oil companies are terribly dangerous and powerful, and in
spite of publications to the contrary I strongly suspect that they have
manipulated the last two presidential elections and dictated a
tremendous amount of US foreign policy from behind the scenes at the
White House.

The point isn't that Microsoft is the world's greatest corporate danger,
it is that it is a unique corporate danger, different from those you
list above in many ways.  The biggest single difference is that there is
real competition in all of the markets represented above, indeed in
nearly all major markets including those that are dominated by very
large companies.  In fact, the top ten list above contains a number of
companies that are in fact competitors.  Some of them are indeed
dangerous -- as I said, the world is still finding ways of dealing with
the multinational supercorp.  Still, there are many oil companies (and
potential alternatives to oil, however much we are addicted to the
stuff).  There are many car companies in many countries, and the barrier
isn't so great that one couldn't conceive of a new one emerging.  Some
of these are banks or holding companies, which are definitely dangerous
but are also strongly regulated (not necessarily effectively, but
regulated).

Microsoft is quite different.  It is basically unopposed in the
marketplace (however much its competitors would like to think otherwise,
the numbers are simply overwhelming), and has agreements in place that
make it nearly impossible for any real competition to arise.  It has
deals with hardware manufacturers (and the practical side effects of its
consumer monopoly) to guarantee that it and only it releases an
operating system product that will run "all PC hardware".  To further
lock this in, it has agreements -- totally legal ones, I would guess --
that lock in the vast, vast majority of computer resellers to offer
exclusively Microsoft Windows as a pre-installed computer operating
system option.  Local vendors that I know of that would LIKE to offer
pre-installed Linux systems cannot do so because if they do, they will
essentially be forced out of business overnight as MS bumps their prices
by more than enough to remove their marginal profitability in this
thin-margin business.  Similarly, desktop software companies that do not
develop products that run on Microsoft's operating system simply have no
chance of surviving.

> And where is Intel in this monopoly?  They own 80% share in the world
> market by some measures. But I do not want to add to the conspiracy

This is a smaller share than Microsoft's, and they have numerous sources
of competition.  Intel dominates CPUs and computer firmware, perhaps,
but they have solid competition there and this is still only one part of
the overall chip market where there is far more competition, including a
great deal of global competition.  Worrisome, perhaps, but consider that
Intel requires raw materials and multibillion dollar chip foundries and
a huge amount of R&D investment to operate in its marketplace.  Chips
have to be built, humans and machines have to build them, they have to
be assembled by humans and machines into devices (usually by an entirely
different company), the devices have to be loaded with ware of one sort
or another, shipped to wholesalers, shipped further to retailers, and
are marked up every step of the way.

In spite of the immense overhead of all of these steps and the immense
investment required to actually build the chips into machines (where
ultimately those chips are commodity items and easily replaced by
functional equivalents subject only to the co-development of firmware
and/or software) what do we find?  A modern PC sells retail for as
little as $500 -- even laptops are now selling for only a bit more.  To
load it with Microsoft Windows XP Pro and Office Pro at full retail
costs MORE than this.  Add antivirus, add any other software at all and
your software costs COULD exceed the cost of the system on which they
run by a factor of two or even more.

This is truly amazing!  The "manufacturing costs" for this software are
on the order of a buck, and more money is probably spent on the box and
manual (such as it is) than on the actual CD(s).  Instead of teams of
hundreds of engineers and billion dollar capital investment foundries
and tens of thousands of employees working in the manufacturing sector
and massive sales and support operations one has teams of hundreds of
(software) engineers, followed by -- sales and support, with as little
of the latter as they can get away with and maintain their market.

They might as well just print money.

So I don't think it is at all fair to compare Intel with Microsoft.  One
has a position of major risk, is constantly required to reinvest huge
blocks of money in ten-billion dollar chunks or lose their market
dominance, and has competition that is doing their best to eat their
lunch.  As a person that screens graduate applications from Chinese
students, let me tell you that Chinese scientific academe from high
school through graduate programs there is overwhelmingly focused on
microelectronics, nanotechnology, and novel information processing
schema.  Intel, AMD, TI, Motorola, Fujitsu, IBM -- none of them are
secure, not from each other and not from new threats being born in the
world marketplace.  In this competition -- even competition between
giants -- world consumers gain tremendous benefits.  That's WHY the
margins on computer systems are so thin -- it is one of the most
cut-throat businesses on the planet.

But Microsoft doesn't care.  Every one of these marginally profitable
machines that is sold is a guaranteed $50, $100, $300 in their pocket
over the lifetime of the machine, the bulk of it pure profit.  They make
more actual post-cost money from the sale of a computer than any other
participant in the process.

> theories. A source of conspiracy theories is when people see short-term
> tactical events that personally affect them that are driven by overlooked
> long-term trends.  Microsoft is benefiting from the trend in personal
> computing. This trend could end with computing becoming entertainment ala
> youtube, or personal cellphones or very powerful personal digital
> assistants or personal networks or something else. With the rate of change
> these days is it unlikely to remain a trend in personal computing.

These examples seem to me to be fairly irrelevant.  They are all
distinct markets and have nothing to do with the viability or necessity
of personal computers in business, nor with the software packages that
will support those desktop business functions.  Microsoft is uniquely
positioned to exploit their market position and maintain dominance in
any new or emerging area as it emerges, as well.

However, the general idea of an emergent challenger is certainly
something to hope for.  However, note well that Microsoft has endured
more or less unchallenged since it betrayed IBM on the OS/2 deal in the
early 90's, and has in that time wiped out OS/2, Netscape (co-opting the
consumer/client side of the web), numerous other software companies,
concepts, products, and survived even their own incompetence -- who else
could sell an operating system that you don't dare to use in a
professional environment without spending money with third parties to
"fix" its huge, glaring security holes? Fifteen years is three computing
eternities already, and it is difficult offhand to see them failing in
the next two eternities UNLESS the long awaited invasion of the penguin
people occurs.

It could -- IBM is pushing it (amazingly clumsily, in my opinion, given
their capacity for core investment).  Novell could possibly manage it,
although they have a history of shooting themselves in the foot.  Red
Hat is pursuing the most conservative of strategies and trying to become
Sun Microsystems, not Microsoft, which is worrisome as they are
duplicating a failed strategy, badly.  Linux "competition" for Microsoft
these days reminds me unfortunately of the three stooges, not of a clear
and coherent view of the challenges involved in achieving world
domination.  However, linux doesn't stand or fall with its corporate
champions -- it is too delocalized, too robust, too free.  And at any
moment, any of its corporate champions could grow a brain, or
demonstrate that I'm wrong and they've had a brain all along and it is I
that is brainless or lacking in subtlety.

> As for Microsoft HPC, well, that could work. I still remember the days
> when everyone said that white collar workers won't type their own memos,
> when people said that no one would buy personal computers because
> operating systems were complex and when people said that home networks
> were too complex to setup and use. ...  Microsoft HPCs have barriers to
> overcome, but if they work for a segment of the poplulation, then let's
> move on to the next issue.

Oh, I expect them to be wildly successful, but that isn't where the real
money is.  Rather they are planting a sprout, and doubtless they will
water it well, so that however well it grows or productive it proves, it
will stunt the growth of everything else in the garden.  Except that
they don't really >>understand<< linux, or the forces that gave rise to
COTS clusters in the first place.  Their cluster effort remains
vulnerable to the same things that old big iron supercomputers were
vulnerable to -- long term price pressure.  Again, Microsoft is counting
on people being willing to spend a signficant fraction of the cost of
their hardware on their software instead.  This means that they have the
eternal choice -- more boxes (and run linux) or fewer boxes (and run
Windows).  At constant investment, they will be spending long term
throughput capacity for the software.

Some groups will be happy enough with this deal.  Specifically, those
who want intermittant "bursts" of HPC and who also have a MS-dominated
operational environment.  However, groups who expect an uptime of close
to 100% on the resource will not be happy losing 20% of their potential
throughput (or more) just so that they can submit through a pretty GUI,
even for smallish clusters, and face it -- building a Linux cluster just
isn't that difficult.  High school students do it successfully.  I've
helped maybe fifty groups around the world get started just with a few
email messages and some guidance, and then there is this list.  And it
is pretty easy to get a "pretty" prebuilt turnkey linux cluster as well,
and the margins are likely to be smaller relative to the hardware.

Basically, as long as MS continues to dominate the desktop market,
they'll sell HPC systems to MS-centric clients.  If their grip on the
desktop market should loosen, or if their share of the server market
starts to slip (which basically can only happen as organizations adopt
linux and lower the management barrier cost for doing linux clusters),
their cluster market will slip along with it.

Or maybe this is just wishful thinking on my part.  Punditry isn't
perfect, and the world is a complex place...;-)

    rgb

>
> ------
> Sincerely,
>
>   Tom Pierce
>

-- 
Robert G. Brown	                       http://www.phy.duke.edu/~rgb/
Duke University Dept. of Physics, Box 90305
Durham, N.C. 27708-0305
Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu


From steve_heaton at iinet.net.au  Tue Jan 23 15:46:51 2007
From: steve_heaton at iinet.net.au (Steve Heaton)
Date: Wed, 24 Jan 2007 10:46:51 +1100
Subject: [Beowulf] Stats based cluster monitoring tool: OVIS
In-Reply-To: <200701231736.l0NHZwbn012724@bluewest.scyld.com>
References: <200701231736.l0NHZwbn012724@bluewest.scyld.com>
Message-ID: <45B69E6B.4030804@iinet.net.au>

G'day Great Minds Collective

Just came across this via LWN. Apologies if this turns out to be a 
duplicate.

https://ovis.ca.sandia.gov/mediawiki/index.php/Main_Page

I wish I had enough nodes for it to be readily applicable ;)

Cheers
Stevo


From diep at xs4all.nl  Wed Jan 24 01:17:25 2007
From: diep at xs4all.nl (Vincent Diepeveen)
Date: Wed, 24 Jan 2007 10:17:25 +0100
Subject: [Beowulf] An OT patented rgb editorial rant, skip if you like...
References: <OFD98C2949.DEE5EEC5-ON8525726C.006EABCD-8525726C.0073E300@rohmhaas.com>
	<Pine.LNX.4.64.0701231639280.6292@lilith.rgb.private.net>
Message-ID: <002c01c73f98$79d85cb0$0300a8c0@gourmandises>

hi Robert,

I have even printed on hardcopy your 2 postings that good i found them.
Keep doing the good posts.

You could bring up some more arguments pro and contra.

Where you as a professor in USA probably look at a good salarycheck each 
month,
and by god you deserve it, majority of planet doesn't. Especially young 
people do not.

As linux is total unusuable by the average grammar- and highschool kid, 
windows is the only alternative to them.

Just to have fully functional machine with all software installed under 
windows, you already lose a fortune to software.
It is not only microsoft charging much. Simple graphics software to draw and 
paint. Boom 600 dollar. Editor, dang 300 dollar.

That is more than the total amount of money that the average income here 
netto can spend a MONTH.
Do you feel that the price policy of microsoft and other software 
manufacturers is bad from educational viewpoint?

In other words, they have no option but to illegal copy if you want to work 
with the 'professional editions' of products.

Now we didn't even discuss the far east, where salaries are considerable 
lower. I calculated, of course as a total layman, that the average factory 
worker in China earns 30 cent an hour.

That's far cheaper than any robot can deliver the same amount of work for, 
as electricity costs are already higher,
according to my uninformed doodling paper.

They aren't ever in their lifetime going to buy legal software. Besides in 
all those countries on the road for like half a dollar you can buy all 
cdroms and dvd's with whatever you want to have at it. 5 dollar for visual 
studio 2005 enterprise edition at a DVD in India on the street.

Are all those electronic media, especially software, making the new 
generations less honest citizens?
Or would we need more than what we do now for them when they grow up?

So i share your concern of price. Microsoft (and many other software 
producers) are way way too expensive selling their software.
For the same reason of course microsoft will take over the highend 
completely. Everything is very expensive there. Microsoft is relative cheap 
and will first outcompete all others and has a long breath. So they 
basically will take over all highend and clusters, except for a few massive 
ones where universities can afford to pay for linux staff, as the personnel 
is way cheaper than the hardware. That won't be many.

With respect to microsoft there is a good argument why they are the only 
popular operating system. Making a user friendly operating system with well 
tested drivers is very hard.

On the other hand i totally agree with some anti-m$ points that I am total 
disgusted by the many hardware devices i bought over the years which next 
version of windows no longer work. An expensive HP scanner, a HP printer, 
pen tablets and so on.

Especially windows XP to SP2 was a big dang in hardware.

Where software works upwards compatible usually, drivers do not. Very very 
disgusting.

Yet making an operating system that supports all that, and also works user 
friendly, is simply very hard. Perhaps it is just too hard to expect that 
for every hardware product there will exist 2 drivers made by the 
manufacturer and that mankind is just capable of maintaining 1 company that 
can do so.

Some brainstorming (now it really becomes totally off topic here):

Not long from now we'll get ways to completely indoctrinate and train people 
with 3d software directly projected into the brain all kind of skills Now 
all kind of research has been forbidden in the EU, so quite possibly less 
friendly regimes towards their people will soon surpass the skills we have 
there and will manage to control their population 100% in all respects with 
just some software and revolutionary biological concepts.

We should not fear that future. At least it allows us to educate and train 
children to become normal human beings with no criminal/terrorristic 
intentions. The difference between nations with Islamic laws and the rest 
will grow of course, because a logical
following of their own rules, laws and interpretations of their holy texts 
just lets them worship death and lobotomize women.

Children in such training programs in the first world, will have a major 
advantage over the islamic and south-american communistic continent. You can 
train them not only much better, but also teach them a lot more and force 
them to focus, giving the teaching entity (most likely with a real living 
teacher nearby), more chances to learn the kids more.

It might be quite questionable whether we want several companies to develop 
a program for that in a commercial manner, instead of combine all efforts 
into one big product that boots and controls our environment. Companies have 
the habit of following the KISS principle everywhere and already release a 
product in order to make money, before it works well. The marketing 
department is making up the rest of the story then.

One big well tested good product that already suffers delays because of 
concerns of politicians, who tend to forget that it really doesn't matter 
too much whether software is involved or not, as in the end it is that same 
teacher in highschool that has most influence at it; all that might be 
better sometimes than when we look to the buggy manner in which most GUI 
software get produced. Total without any form of testing.

Imagine that all that, now that we just discussed one example in the civil 
area, not even the terminator part yet, can easily be adopted by other 
nations and other companies, without need to cooperate with the west, what 
will happen then considering that their laws are not so sophisticated like 
ours (EU goes further in forbidding research than USA/Israel by the way) and 
their intentions quite different from those of the democracies. It is not 
hard for me to imagine that there is at least a number of people who will 
not be able to get to bed with that idea and sleep well.

Is there room for a second effort there?

Isn't it already hard enough to make 1 product that can recognize languages. 
English is easy to recognize, just like Spanish, but did you consider Dutch 
and German?

Not to mention Arabic.

Dutch is a far more polite language than English. We have 2 different words 
for "you". One ("u") that you use for people who you respect or when you 
want to say things in a polite manner, or simply because someone is a lot 
older. And a 'you' ("jij") that you use to adress friends and colleges.

Both translate to 'you' in english.

Arabic has its own unique forms for things when it has to do with either 1 
person, 2 persons, 1 female, 2 females etc.

Just getting all that work correct in the computer is a huge task.

When i did do some small research for my chess program to incorporate voice 
recognition and speech,
i was total shocked by how little the field there advanced past 10 years, 
especially for Dutch.
Not to mention all the constraints and impossibilities. Of course what 
really forced me to dump all those plans
was the huge price to use just one of those products features into my Diep3d 
chessprogram.

In fact it might be easier to teach the next generations a new language that 
they just speak when communicating with hardware.

An OS using that, especially linux, is not going to be able to quickly 
implement that, if linux EVER gets something like that.

Microsoft will manage however.

Vincent

Please note that i'm not working for Microsoft, so i am quite objective in 
this discussion.
At home i run both linux and windows.

----- Original Message ----- 
From: "Robert G. Brown" <rgb at phy.duke.edu>
To: "Thomas H Dr Pierce" <TPierce at rohmhaas.com>
Cc: "Beowulf Mailing List" <Beowulf at beowulf.org>
Sent: Wednesday, January 24, 2007 12:23 AM
Subject: Re: [Beowulf] An OT patented rgb editorial rant, skip if you 
like...


> On Tue, 23 Jan 2007, Thomas H Dr Pierce wrote:
>
>> Now as for the size of the computer market and Microsoft. Nope, the top 
>> 10
>> companies in the US are 1. Exxon Mobil 2. Wal-Mart Stores 3. General
>> Motors 4. Chevron 5. Ford Motor 6. ConocoPhillips 7. General Electric 8.
>> Citigroup  9. AIG  10. IBM  - and one can argue that IBM is not a 
>> computer
>> company these days: They are a services company to a large degree.
>>
>> Personally I fear Walmart, but the Oil companies are probably more
>> powerful.
>
> I agree.  Oil companies are terribly dangerous and powerful, and in
> spite of publications to the contrary I strongly suspect that they have
> manipulated the last two presidential elections and dictated a
> tremendous amount of US foreign policy from behind the scenes at the
> White House.
>
> The point isn't that Microsoft is the world's greatest corporate danger,
> it is that it is a unique corporate danger, different from those you
> list above in many ways.  The biggest single difference is that there is
> real competition in all of the markets represented above, indeed in
> nearly all major markets including those that are dominated by very
> large companies.  In fact, the top ten list above contains a number of
> companies that are in fact competitors.  Some of them are indeed
> dangerous -- as I said, the world is still finding ways of dealing with
> the multinational supercorp.  Still, there are many oil companies (and
> potential alternatives to oil, however much we are addicted to the
> stuff).  There are many car companies in many countries, and the barrier
> isn't so great that one couldn't conceive of a new one emerging.  Some
> of these are banks or holding companies, which are definitely dangerous
> but are also strongly regulated (not necessarily effectively, but
> regulated).
>
> Microsoft is quite different.  It is basically unopposed in the
> marketplace (however much its competitors would like to think otherwise,
> the numbers are simply overwhelming), and has agreements in place that
> make it nearly impossible for any real competition to arise.  It has
> deals with hardware manufacturers (and the practical side effects of its
> consumer monopoly) to guarantee that it and only it releases an
> operating system product that will run "all PC hardware".  To further
> lock this in, it has agreements -- totally legal ones, I would guess --
> that lock in the vast, vast majority of computer resellers to offer
> exclusively Microsoft Windows as a pre-installed computer operating
> system option.  Local vendors that I know of that would LIKE to offer
> pre-installed Linux systems cannot do so because if they do, they will
> essentially be forced out of business overnight as MS bumps their prices
> by more than enough to remove their marginal profitability in this
> thin-margin business.  Similarly, desktop software companies that do not
> develop products that run on Microsoft's operating system simply have no
> chance of surviving.
>
>> And where is Intel in this monopoly?  They own 80% share in the world
>> market by some measures. But I do not want to add to the conspiracy
>
> This is a smaller share than Microsoft's, and they have numerous sources
> of competition.  Intel dominates CPUs and computer firmware, perhaps,
> but they have solid competition there and this is still only one part of
> the overall chip market where there is far more competition, including a
> great deal of global competition.  Worrisome, perhaps, but consider that
> Intel requires raw materials and multibillion dollar chip foundries and
> a huge amount of R&D investment to operate in its marketplace.  Chips
> have to be built, humans and machines have to build them, they have to
> be assembled by humans and machines into devices (usually by an entirely
> different company), the devices have to be loaded with ware of one sort
> or another, shipped to wholesalers, shipped further to retailers, and
> are marked up every step of the way.
>
> In spite of the immense overhead of all of these steps and the immense
> investment required to actually build the chips into machines (where
> ultimately those chips are commodity items and easily replaced by
> functional equivalents subject only to the co-development of firmware
> and/or software) what do we find?  A modern PC sells retail for as
> little as $500 -- even laptops are now selling for only a bit more.  To
> load it with Microsoft Windows XP Pro and Office Pro at full retail
> costs MORE than this.  Add antivirus, add any other software at all and
> your software costs COULD exceed the cost of the system on which they
> run by a factor of two or even more.
>
> This is truly amazing!  The "manufacturing costs" for this software are
> on the order of a buck, and more money is probably spent on the box and
> manual (such as it is) than on the actual CD(s).  Instead of teams of
> hundreds of engineers and billion dollar capital investment foundries
> and tens of thousands of employees working in the manufacturing sector
> and massive sales and support operations one has teams of hundreds of
> (software) engineers, followed by -- sales and support, with as little
> of the latter as they can get away with and maintain their market.
>
> They might as well just print money.
>
> So I don't think it is at all fair to compare Intel with Microsoft.  One
> has a position of major risk, is constantly required to reinvest huge
> blocks of money in ten-billion dollar chunks or lose their market
> dominance, and has competition that is doing their best to eat their
> lunch.  As a person that screens graduate applications from Chinese
> students, let me tell you that Chinese scientific academe from high
> school through graduate programs there is overwhelmingly focused on
> microelectronics, nanotechnology, and novel information processing
> schema.  Intel, AMD, TI, Motorola, Fujitsu, IBM -- none of them are
> secure, not from each other and not from new threats being born in the
> world marketplace.  In this competition -- even competition between
> giants -- world consumers gain tremendous benefits.  That's WHY the
> margins on computer systems are so thin -- it is one of the most
> cut-throat businesses on the planet.
>
> But Microsoft doesn't care.  Every one of these marginally profitable
> machines that is sold is a guaranteed $50, $100, $300 in their pocket
> over the lifetime of the machine, the bulk of it pure profit.  They make
> more actual post-cost money from the sale of a computer than any other
> participant in the process.
>
>> theories. A source of conspiracy theories is when people see short-term
>> tactical events that personally affect them that are driven by overlooked
>> long-term trends.  Microsoft is benefiting from the trend in personal
>> computing. This trend could end with computing becoming entertainment ala
>> youtube, or personal cellphones or very powerful personal digital
>> assistants or personal networks or something else. With the rate of 
>> change
>> these days is it unlikely to remain a trend in personal computing.
>
> These examples seem to me to be fairly irrelevant.  They are all
> distinct markets and have nothing to do with the viability or necessity
> of personal computers in business, nor with the software packages that
> will support those desktop business functions.  Microsoft is uniquely
> positioned to exploit their market position and maintain dominance in
> any new or emerging area as it emerges, as well.
>
> However, the general idea of an emergent challenger is certainly
> something to hope for.  However, note well that Microsoft has endured
> more or less unchallenged since it betrayed IBM on the OS/2 deal in the
> early 90's, and has in that time wiped out OS/2, Netscape (co-opting the
> consumer/client side of the web), numerous other software companies,
> concepts, products, and survived even their own incompetence -- who else
> could sell an operating system that you don't dare to use in a
> professional environment without spending money with third parties to
> "fix" its huge, glaring security holes? Fifteen years is three computing
> eternities already, and it is difficult offhand to see them failing in
> the next two eternities UNLESS the long awaited invasion of the penguin
> people occurs.
>
> It could -- IBM is pushing it (amazingly clumsily, in my opinion, given
> their capacity for core investment).  Novell could possibly manage it,
> although they have a history of shooting themselves in the foot.  Red
> Hat is pursuing the most conservative of strategies and trying to become
> Sun Microsystems, not Microsoft, which is worrisome as they are
> duplicating a failed strategy, badly.  Linux "competition" for Microsoft
> these days reminds me unfortunately of the three stooges, not of a clear
> and coherent view of the challenges involved in achieving world
> domination.  However, linux doesn't stand or fall with its corporate
> champions -- it is too delocalized, too robust, too free.  And at any
> moment, any of its corporate champions could grow a brain, or
> demonstrate that I'm wrong and they've had a brain all along and it is I
> that is brainless or lacking in subtlety.
>
>> As for Microsoft HPC, well, that could work. I still remember the days
>> when everyone said that white collar workers won't type their own memos,
>> when people said that no one would buy personal computers because
>> operating systems were complex and when people said that home networks
>> were too complex to setup and use. ...  Microsoft HPCs have barriers to
>> overcome, but if they work for a segment of the poplulation, then let's
>> move on to the next issue.
>
> Oh, I expect them to be wildly successful, but that isn't where the real
> money is.  Rather they are planting a sprout, and doubtless they will
> water it well, so that however well it grows or productive it proves, it
> will stunt the growth of everything else in the garden.  Except that
> they don't really >>understand<< linux, or the forces that gave rise to
> COTS clusters in the first place.  Their cluster effort remains
> vulnerable to the same things that old big iron supercomputers were
> vulnerable to -- long term price pressure.  Again, Microsoft is counting
> on people being willing to spend a signficant fraction of the cost of
> their hardware on their software instead.  This means that they have the
> eternal choice -- more boxes (and run linux) or fewer boxes (and run
> Windows).  At constant investment, they will be spending long term
> throughput capacity for the software.
>
> Some groups will be happy enough with this deal.  Specifically, those
> who want intermittant "bursts" of HPC and who also have a MS-dominated
> operational environment.  However, groups who expect an uptime of close
> to 100% on the resource will not be happy losing 20% of their potential
> throughput (or more) just so that they can submit through a pretty GUI,
> even for smallish clusters, and face it -- building a Linux cluster just
> isn't that difficult.  High school students do it successfully.  I've
> helped maybe fifty groups around the world get started just with a few
> email messages and some guidance, and then there is this list.  And it
> is pretty easy to get a "pretty" prebuilt turnkey linux cluster as well,
> and the margins are likely to be smaller relative to the hardware.
>
> Basically, as long as MS continues to dominate the desktop market,
> they'll sell HPC systems to MS-centric clients.  If their grip on the
> desktop market should loosen, or if their share of the server market
> starts to slip (which basically can only happen as organizations adopt
> linux and lower the management barrier cost for doing linux clusters),
> their cluster market will slip along with it.
>
> Or maybe this is just wishful thinking on my part.  Punditry isn't
> perfect, and the world is a complex place...;-)
>
>    rgb
>
>>
>> ------
>> Sincerely,
>>
>>   Tom Pierce
>>
>
> -- 
> Robert G. Brown                        http://www.phy.duke.edu/~rgb/
> Duke University Dept. of Physics, Box 90305
> Durham, N.C. 27708-0305
> Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu
>
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit 
> http://www.beowulf.org/mailman/listinfo/beowulf
> 


From deadline at clustermonkey.net  Wed Jan 24 06:40:46 2007
From: deadline at clustermonkey.net (Douglas Eadline)
Date: Wed, 24 Jan 2007 09:40:46 -0500 (EST)
Subject: [Beowulf] An OT patented rgb editorial rant, skip if you like...
In-Reply-To: <Pine.LNX.4.64.0701230730120.6292@lilith.rgb.private.net>
References: <web-1279309@free.net>
	<45AE3C29.1090809@scalableinformatics.com><6.2.3.4.2.20070117101040.0337f2b8@mail.jpl.nasa.gov>
	<45AE6BEE.9050001@scalableinformatics.com>
	<74735BF202608043B11025A9FAA9438904662308@WIN-MSG-21.wingroup.windeploy.ntdev.microsoft.com>
	<Pine.LNX.4.64.0701230730120.6292@lilith.rgb.private.net>
Message-ID: <53535.192.168.1.1.1169649646.squirrel@mail.eadline.org>

Robert,

Excellent screed. One thing:

 "barbarian penguins"

I call first dibs on using that name for a rock band.
(if only I could play an instrument)

 --
 Doug

>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit
> http://www.beowulf.org/mailman/listinfo/beowulf
>
> !DSPAM:45b6462939547511819938!
>


-- 
Doug


From peter.st.john at gmail.com  Wed Jan 24 07:39:55 2007
From: peter.st.john at gmail.com (Peter St. John)
Date: Wed, 24 Jan 2007 10:39:55 -0500
Subject: [Beowulf] An OT patented rgb editorial rant, skip if you like...
In-Reply-To: <53535.192.168.1.1.1169649646.squirrel@mail.eadline.org>
References: <web-1279309@free.net> <45AE3C29.1090809@scalableinformatics.com>
	<6.2.3.4.2.20070117101040.0337f2b8@mail.jpl.nasa.gov>
	<45AE6BEE.9050001@scalableinformatics.com>
	<74735BF202608043B11025A9FAA9438904662308@WIN-MSG-21.wingroup.windeploy.ntdev.microsoft.com>
	<Pine.LNX.4.64.0701230730120.6292@lilith.rgb.private.net>
	<53535.192.168.1.1.1169649646.squirrel@mail.eadline.org>
Message-ID: <e4d4fd070701240739m11830480ld83dfa4af5c3dbc0@mail.gmail.com>

Gentlemen,
The "barbarian" allegory may be more apt than it appears (although it's not
the Penguins). The steppe-nomads who invaded Europe during the latter half
of the first millenium are supposed to have brought sheep with them.
Conquering Europe had the incidental effect of converting arable land from
crops to grazing. Instead of directly growing carbohydrate (wheat), growing
cellulose (grass) and then converting the cellulose to protein (sheep) and
then the protein to calories, is enormously less efficient in calories per
acre. Reducing the ability of the countryside to support the populations in
the cities is said to have been a major cause of the Dark Ages. Nonetheless
we admire Attilla.

Microsoft has swept accross the small computer landscape and conquered the
known world. Incidental to that, "Windows", integrating the GUI with the
OS (as opposed to say VMS, integrating DP with the OS via e.g. thick file
system RMS and shared memory section microcode; or say Unix, integrating the
development environment with the OS, so the archetype unix book is titled
"The Unix Programming Environment" not "The Unix Operating System") cuts the
efficiency (in DP and Development, if not of GUI) on vast stretches of
arable desktop. Putting a GUI OS on a server (e.g. "Exchange Server") is as
counterproductive as grazing sheep on arable land; sheep belong on hillsides
you can't irrigate, and wheat belongs on flat land near rivers.  Bill Gates
is the Attilla The Hun of the Information Age.

Businessmen today admire him the way warriors of old admired Attilla. My
advice to clients is just to point out, that I want to own stock in
McDonald's, I don't want to eat there everyday. Sell Vista to your customers
but don't install it on your servers.

Peter

Peter St.John (consultant, programming more lately in VB than C)


On 1/24/07, Douglas Eadline <deadline at clustermonkey.net> wrote:
>
> Robert,
>
> Excellent screed. One thing:
>
> "barbarian penguins"
>
> I call first dibs on using that name for a rock band.
> (if only I could play an instrument)
>
> --
> Doug
>
> >
> > _______________________________________________
> > Beowulf mailing list, Beowulf at beowulf.org
> > To change your subscription (digest mode or unsubscribe) visit
> > http://www.beowulf.org/mailman/listinfo/beowulf
> >
> > !DSPAM:45b6462939547511819938!
> >
>
>
> --
> Doug
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit
> http://www.beowulf.org/mailman/listinfo/beowulf
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070124/55dc181b/attachment.html>

From James.P.Lux at jpl.nasa.gov  Wed Jan 24 08:16:27 2007
From: James.P.Lux at jpl.nasa.gov (Jim Lux)
Date: Wed, 24 Jan 2007 08:16:27 -0800
Subject: [Beowulf] low cost ethernet controlled power switch
Message-ID: <6.2.3.4.2.20070124081054.033b7348@mail.jpl.nasa.gov>

A amateur radio acquantaince pointed me to this device (of unknown 
quality, provenance, reliability, etc.) which is quite interesting.
8 switched outlets with some sort of ethernet interface for $90
http://www.surpluscomputers.com/store/main.aspx?p=ItemDetail&item=NET10332

I think that this thing is made by digital-loggers inc 
(http://www.digital-loggers.com/)
specifically: http://www.digital-loggers.com/lpc.html

Might be interesting to folks building up a low buck cluster that 
want remote control power switching


James Lux, P.E.
Spacecraft Radio Frequency Subsystems Group
Flight Communications Systems Section
Jet Propulsion Laboratory, Mail Stop 161-213
4800 Oak Grove Drive
Pasadena CA 91109
tel: (818)354-2075
fax: (818)393-6875 


From csamuel at vpac.org  Wed Jan 24 18:14:31 2007
From: csamuel at vpac.org (Chris Samuel)
Date: Thu, 25 Jan 2007 13:14:31 +1100
Subject: [Beowulf] An OT patented rgb editorial rant, skip if you like...
In-Reply-To: <002c01c73f98$79d85cb0$0300a8c0@gourmandises>
References: <OFD98C2949.DEE5EEC5-ON8525726C.006EABCD-8525726C.0073E300@rohmhaas.com>
	<Pine.LNX.4.64.0701231639280.6292@lilith.rgb.private.net>
	<002c01c73f98$79d85cb0$0300a8c0@gourmandises>
Message-ID: <200701251314.32034.csamuel@vpac.org>

On Wednesday 24 January 2007 8:17 pm, Vincent Diepeveen wrote:

> As linux is total unusuable by the average grammar- and highschool kid,
> windows is the only alternative to them.

Fortunately someone forgot to tell the ones in Australia about that!

Some quotes from people I know:

# My daughter turns seven this weekend. She started school last year.  She is
# using my FreeBSD/Gnome2 driven laptop. 


# My 13yo nephew decided to try out Linux on his Mac a couple of months ago,
# so I downloaded him a Ubuntu disc and send him home with it with the promise
# that next time I'm over there I'll help him install.
[...]
# The next time I visit he's not only partitioned his HD and installed Ubunutu
# he's gone to a Ubunutu forum and found and followed instructions to 
# <CRTL>-<ALT> to a console and edit xorg.conf (using vi!) to overcome a DRI 
# incompatability with his old Mac which left him with a blank screen on
# bootup.  


# My daughter (now 13) grew up on a diet of OS9/OSX, Windows and Linux.
# 
# Now she has Ubuntu on her blueberry iMac as Mac OSX was a bit slow and I 
# was not going to pay for Mac programs. (she won the iMac) 

and my favourite:

# Well actually, my son with using Linux to surf the web, and play some 
# games from when he was 2. 
# 
# This was a stock install of Debian with the KDE Desktop. I just put the 
# icons on the desktop, and had it auto login for him. (he had some  
# trouble with a user name and a secure password). 


If we expect children of that age to learn multiple languages (something I've 
never managed to do, I can't even speak my own native tongue of Welsh) then 
why should they find different computer systems any more difficult ?

cheers,
Chris
-- 
 Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
 Victorian Partnership for Advanced Computing http://www.vpac.org/
 Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia


From rgb at phy.duke.edu  Thu Jan 25 04:09:23 2007
From: rgb at phy.duke.edu (Robert G. Brown)
Date: Thu, 25 Jan 2007 07:09:23 -0500 (EST)
Subject: [Beowulf] [Fwd: GlusterFS 1.2-BENKI (GNU Cluster File System) -
 Announcement] (fwd)
Message-ID: <Pine.LNX.4.64.0701250708520.1665@lilith.rgb.private.net>

Done.

    rgb

-- 
Robert G. Brown	                       http://www.phy.duke.edu/~rgb/
Duke University Dept. of Physics, Box 90305
Durham, N.C. 27708-0305
Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu


---------- Forwarded message ----------
Date: Wed, 24 Jan 2007 09:31:12 -0800 (PST)
From: Anand Babu Periasamy <ab at gnu.org.in>
To: rgb at phy.duke.edu
Subject: [Fwd: GlusterFS 1.2-BENKI (GNU Cluster File System) - Announcement]

Robert,
I wrote a mail to Beowulf list announcing GlusterFS cluster filesystem. It
was held for moderator approval and it has not happened yet. Is Becker
still moderating the list?. I see, you are regular poster to the list.
Thats why I am seeking your help. May be you can also forward this
announcement on our behalf.

-- 
Anand Babu
GPG Key ID: 0x62E15A31
Personal Blog  [http://ab.freeshell.org]
The GNU Operating System [http://www.gnu.org]


---------------------------- Original Message ----------------------------
Subject: GlusterFS 1.2-BENKI (GNU Cluster File System) - Announcement
From:    "Anand Babu" <ab at gnu.org.in>
Date:    Tue, January 23, 2007 6:14 pm
To:      beowulf at beowulf.org
--------------------------------------------------------------------------

GlusterFS 1.2-BENKI Public Announcement:

GlusterFS (GNU Cluster File System) is designed to scale to peta-bytes of
data and handle massive aggregated I/O bandwidth. GlusterFS is part of the
Gluster Project (GNU Clustering Platform) and has been released under GNU
GPL v2 (or later) license.

The current release of GlusterFS is running stable and performs
exceedingly well against NFS  please refer to benchmarks at
http://www.gluster.org/docs/index.php/GlusterFS_Benchmarks for
benchmark comparison.

The next release (1.3-BENKI) will have further enhancements such as
"ib-verbs", Infiniband RDMA transport, asynchronous I/O and epoll
leading to much lower latency and more POSIX correctness.

GlusterFS runs commodity storage hardware with GigE or Infiniband
interconnect. It is easy to setup GlusterFS. To get quickly started with
GlusterFS, simply follow the instructions at
http://www.gluster.org/docs/index.php/Getting_Started_with_GlusterFS

GlusterFS download link:
   http://www.gluster.org/glusterfs.php

Documentation link:
  http://gluster.org/docs/index.php/GlusterFS

Please CC your discussions to gluster-devel (at) nongnu.org.

Happy Hacking,
--
Gluster Core Team.

-- 
Anand Babu
GPG Key ID: 0x62E15A31
Blog [http://ab.freeshell.org]
The GNU Operating System [http://www.gnu.org]


From rgb at phy.duke.edu  Thu Jan 25 06:09:38 2007
From: rgb at phy.duke.edu (Robert G. Brown)
Date: Thu, 25 Jan 2007 09:09:38 -0500 (EST)
Subject: [Beowulf] An OT patented rgb editorial rant, skip if you like...
In-Reply-To: <200701251314.32034.csamuel@vpac.org>
References: <OFD98C2949.DEE5EEC5-ON8525726C.006EABCD-8525726C.0073E300@rohmhaas.com>
	<Pine.LNX.4.64.0701231639280.6292@lilith.rgb.private.net>
	<002c01c73f98$79d85cb0$0300a8c0@gourmandises>
	<200701251314.32034.csamuel@vpac.org>
Message-ID: <Pine.LNX.4.64.0701250827270.1665@lilith.rgb.private.net>

On Thu, 25 Jan 2007, Chris Samuel wrote:

> # Well actually, my son with using Linux to surf the web, and play some
> # games from when he was 2.
> #
> # This was a stock install of Debian with the KDE Desktop. I just put the
> # icons on the desktop, and had it auto login for him. (he had some
> # trouble with a user name and a secure password).
>
>
> If we expect children of that age to learn multiple languages (something I've
> never managed to do, I can't even speak my own native tongue of Welsh) then
> why should they find different computer systems any more difficult ?

And FWIW, my three sons all grew up using Linux.  Their only major
complaint has been a) Even with Cedega and kernel-tainting Nvidia
drivers, running WoW (and other WinXX games) was then and remains now
painful.  For kids this is a major problem, alas; b) there have been a
handful of other thorns in the form of small applications -- one son has
friends who use AOLs chat to communicate, and gaim just hasn't worked
well for him as it lags the Win-native AOL client considerably; c) in
the old days, it was hard to find a decent Office-like suite, especially
a decent word processor.  They used Abiword, for example, which worked
"adequately" as a WP but sucked on the printer/driver front.

Since FC4 with a fully functional Open Office, the latter problem has
been resolved, but the game/small client issue remains.  So they are all
perfectly capable of using linux and can manage it for their schoolwork
just fine, but it is still lacking on the entertainment/casual use
front.  FC6 has several hundred games of its own, which is good, but as
far as kids are concerned the inability to "just run" over the counter
Windows-based games is a show stopper, far more important to them than
whether or not OOffice is adequate for writing papers, making graphs, or
doing presentations.

The same is more or less true of my near-luddite wife -- she gets by
with linux about the same way she'd get by with Windows.  In neither
case can she e.g. handle networking or any sort of problem (it had
better work automagically or else I have to step in and resolve the
problem).  As long as it does work automagically at the networking/login
level, hey, a browser is a browser, an email client is an email client,
and all she cares about is that these "just work".  And they do.

The biggest problem we've encountered in her case is similar to that of
the boys -- there are a few applications out there that she needs to run
that just plain require Windows, or Explorer.  Epocrates (a PDA-based
drug database) for example auto-updates through functions in Explorer
and will not work through any of the linux browsers, alas, although
there were rumors that they were going to at long last port to the Mac
(and maybe linux by inheritance).  However, this is too little too late
as her practice is about to install an EMR/PM system that will make this
irrelevant.  Similarly, she sometimes gets CD's containing clinical or
training information that are designed to launch automagically into a
step-through presentation from Windows.  However, a lot of these
recently have actually built the presentations on top of html or pdfs,
so that Linux can actually handle them fine IF one knows where to go to
find the toplevel link.  They lose that autolaunch capability that keeps
me out of the loop, but the loop can be closed.

Linux would in many ways make an excellent platform for schools these
days.  It can be secured and locked down far better than Windows, it is
much less vulnerable to viruses and spyware (the latter a critical issue
in schools where privacy and protection are major issues) and it comes
with pretty much a spanning set of software tools to support most of
what one would like to do OUTSIDE of the vast plethora of e.g. Reader
Rabbit Windows apps.  Again, if linux had an automagic winex/cedega
fully integrated with the distribution so that it would "just install"
Windows apps of this sort and they would "just run", life would be
grand.  Even without this, it is a good choice, but one that requires a
certain degree of expertise to set up and manage, to find the right
software, install it and support it (if necessary under an emulator).

It doesn't, however, provide most schools with much economic advantage.
Microsoft doesn't charge schools diddly for their OS or primary
software.  They don't have to.  If they charge $5 a seat, that's $3 of
pure additional profit (or more -- they usually only have to provide a
single media copy for the school and no longer have to market to the
school) and besides, all the machines the school bought CAME with
Windows so they've ALREADY made their profit at standard rates by
hooking into the supply chain much earlier where the school doesn't
realize that it really getting "nothing much" for free, for all that it
has to pay a lot less than other clients buying the same nothing much.
Still, Linux is likely to cost those schools "more", because offsite
management and setup may well be more expensive for Linux, because linux
does NOT just load up site-licensed learningware applications, and so
on.

Again, a market just begging to be opened.  Linux-based learningware
COULD be set up so that it cost a school FAR LESS to build a linux-only
network.  In fact, it could be made so easy that it cost them nothing
but the offsite management package they'd have to buy anyway, or the
opportunity cost time of a computer-savvy teacher to run it.  Once again
one is trapped by the paradox that to make it happen, somebody has to
make money (at least a living!) doing it, and then it is no longer free.

    rgb

>
> cheers,
> Chris
>

-- 
Robert G. Brown	                       http://www.phy.duke.edu/~rgb/
Duke University Dept. of Physics, Box 90305
Durham, N.C. 27708-0305
Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu


From steve_heaton at iinet.net.au  Thu Jan 25 18:07:31 2007
From: steve_heaton at iinet.net.au (Steve Heaton)
Date: Fri, 26 Jan 2007 13:07:31 +1100
Subject: [Beowulf] An OT patented rgb editorial rant
In-Reply-To: <200701231736.l0NHZwbn012724@bluewest.scyld.com>
References: <200701231736.l0NHZwbn012724@bluewest.scyld.com>
Message-ID: <45B96263.70207@iinet.net.au>

G'day RGB and all

I draw great comfort from another lesson that history has taught us: all 
empires crumble.

All those previous empires also looked invincible.

Sure, some last longer than others but they all ultimately suffer the 
same fate.

Here's a thought, has Microsoft's dominance been supported by Western 
society's swing to the right? To what extent?

Ultimately we'll see the pendulum swing back to the left and freedom 
will again return to the lands. Will such a swing see the rise of the 
Penguins?

I'm sure there's a sociology PhD in there somewhere. At the very least a 
late night of robust red / dark ale ;)

Stevo


From greg.lindahl at qlogic.com  Thu Jan 25 19:30:56 2007
From: greg.lindahl at qlogic.com (Greg Lindahl)
Date: Thu, 25 Jan 2007 19:30:56 -0800
Subject: [Beowulf] An OT patented rgb editorial rant
In-Reply-To: <45B96263.70207@iinet.net.au>
References: <200701231736.l0NHZwbn012724@bluewest.scyld.com>
	<45B96263.70207@iinet.net.au>
Message-ID: <20070126033056.GA4705@localhost.localdomain>

On Fri, Jan 26, 2007 at 01:07:31PM +1100, Steve Heaton wrote:

> Here's a thought, has Microsoft's dominance been supported by Western 
> society's swing to the right? To what extent?

The desire to bust trusts (monopolies) is shared by a fraction of
people on both the left and right of the political spectrum. A
right-wing analysis goes: A market dominated by a monopoly is not a
free market. So in this model, the government is needed to fairly
apply rules to prevent or break up monopolies.

But we're sliding even more off-topic.

-- greg


From deadline at clustermonkey.net  Sat Jan 27 09:42:42 2007
From: deadline at clustermonkey.net (Douglas Eadline)
Date: Sat, 27 Jan 2007 12:42:42 -0500 (EST)
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <74735BF202608043B11025A9FAA9438904662308@WIN-MSG-21.wingroup.windeplo
	y.ntdev.microsoft.com>
References: <web-1279309@free.net>
	<45AE3C29.1090809@scalableinformatics.com><6.2.3.4.2.20070117101040.0337f2b8@mail.jpl.nasa.gov>
	<45AE6BEE.9050001@scalableinformatics.com>
	<74735BF202608043B11025A9FAA9438904662308@WIN-MSG-21.wingroup.windeploy.ntdev.microsoft.com>
Message-ID: <37613.192.168.1.1.1169919762.squirrel@mail.eadline.org>

> I know some of you aren't, um, tolerant of Microsoft for various reasons
> but I thought I'd clear up a couple errors in some of the posts. If you
> hate Microsoft at least you now have an email address for when you're
> feeling grumpy.
>

Like Robert, I don't hate Microsoft. However, the following
URL is a recent example of why many people like myself
fear Microsoft (and other big "We are going to patent obvious and prior
art because you can't afford to defend yourself" technology companies)

http://www.bluej.org/mrt/?p=21

-- 
Doug


From gmkurtzer at gmail.com  Tue Jan 23 14:30:31 2007
From: gmkurtzer at gmail.com (Greg Kurtzer)
Date: Tue, 23 Jan 2007 14:30:31 -0800
Subject: [Beowulf] middleware for heterogeneous cluster
In-Reply-To: <1169566228.3276.53.camel@riesling.dsic.upv.es>
References: <1169566228.3276.53.camel@riesling.dsic.upv.es>
Message-ID: <BD58470D-1FCD-4333-8E6D-309E020422D6@gmail.com>

We have some contributors documenting Perceus as we speak. The  
documentation effort will be posted shortly (I even had a quickstart  
guide that was also just forwarded to me by a gracious user).  
Basically we are aware of the documentation "issues" and they are  
being resolved. Also, I will make myself available for helping anyone  
implement Perceus that will agree to exchange the help for  
documentation. Not to be read as free consulting, but I am very  
understanding to the fact that you can't RTFM without the FM. ;)

Perceus can deal with a great deal of heterogeneous operating system  
images, and is capable for ia32 and x86_64. I believe some people use  
WareCAT (Warewulf+IBM's xCAT=WareCat) on ia64 very successfully, but  
I am not too familiar with the solution.

Good luck with your decision!

Greg

On Jan 23, 2007, at 7:30 AM, Miguel ?scar Bernabeu i Llinares wrote:

> Hi all,
>
> I'm new to the list, so let me introduce myself: I work as a  
> researcher
> in the Technical University of Valencia (Spain). I've been using  
> Beowulf
> Clusters for several years, but I've got little experience in the
> installation and administration of them.
>
> We are planning to integrate some legacy hardware (Pentium IV and Xeon
> two-processors) with brand new hardware (Itanium II Montecito SMP
> boards) in order to build an heterogeneous cluster.
>
> I'm interested in using a middleware like Warewulf, Perceus, Oscar,
> Rocks, ... to reduce installation and maintenance overhead, since we
> must focus in research tasks. I've been reading some webs and papers
> about how dealing with heterogeneity in Warewulf, Perceus and  
> Oscar, but
> none of them seem to be the best solution: Oscar forces me to use a  
> ia64
> master node
> ( http://www.mail-archive.com/oscar-users at lists.sourceforge.net/ 
> msg06075.html ), while Perceus is not properly documented AFAIK.
>
> Does anybody successfully configured a cluster like this? Any
> suggestion?
>
> Regards.
>
> -- 
> Miquel ?scar Bernabeu i Llinares <mbernabeu at dsic.upv.es>
> Department of Information Systems and Computation.
> Technical University of Valencia. Spain.
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit  
> http://www.beowulf.org/mailman/listinfo/beowulf

--
Greg Kurtzer
email: gmkurtzer at gmail.com
aim: gmkurtzer
skype: gmkurtzer
irc: gmk at irc.freenode.net

    I believe the world would be a better place if people didn't  
their believe their beliefs. -- gmk


From ab at gnu.org.in  Tue Jan 23 12:48:37 2007
From: ab at gnu.org.in (Anand Babu)
Date: Tue, 23 Jan 2007 12:48:37 -0800 (PST)
Subject: [Beowulf] GlusterFS 1.2-BENKI (GNU Cluster File System) -
	Announcement
Message-ID: <10960.201.234.224.4.1169585317.squirrel@mail.gnu-india.org>

GlusterFS 1.2-BENKI Public Announcement:

GlusterFS (GNU Cluster File System) is designed to scale to peta-bytes
of data and handle massive aggregated I/O bandwidth. GlusterFS is part
of the Gluster Project (GNU Clustering Platform) and has been released
under GNU GPL v2 (or later) license.

The current release of GlusterFS is running stable and performs
exceedingly well against NFS  please refer to benchmarks at
http://www.gluster.org/docs/index.php/GlusterFS_Benchmarks for
benchmark comparison.

The next release (1.3-BENKI) will have further enhancements such as
"ib-verbs", Infiniband RDMA transport, asynchronous I/O and epoll
leading to much lower latency and more POSIX correctness.

GlusterFS runs commodity storage hardware with GigE or Infiniband
interconnect. It is easy to setup GlusterFS. To get quickly started
with GlusterFS, simply follow the instructions at
http://www.gluster.org/docs/index.php/Getting_Started_with_GlusterFS

GlusterFS download link:
  http://www.gluster.org/glusterfs.php

Documentation link:
 http://gluster.org/docs/index.php/GlusterFS

Happy Hacking,
--
Gluster Core Team.

-- 
Anand Babu
GPG Key ID: 0x62E15A31
Personal Blog  [http://ab.freeshell.org]
The GNU Operating System [http://www.gnu.org]


From Andrew.Cannon at amecnnc.com  Wed Jan 24 01:48:58 2007
From: Andrew.Cannon at amecnnc.com (Cannon, Andrew)
Date: Wed, 24 Jan 2007 09:48:58 -0000
Subject: [Beowulf] RE: Beowulf Digest, Vol 35, Issue 29
Message-ID: <732F361DB3C00E4999AE107665FCEF23048486@kfd-ex56.nnc.co.uk>

The text of this message should be posted on every forum we can.  It was a
fascinating read and needs publishing far and wide.  Robert, do we have your
permission to post this on any forums/bbs/mailing lists that we read
(subject to proper notification that it belongs to you)?

Andrew


Subject: [Beowulf] An OT patented rgb editorial rant, skip if you
	like...


**********************************************************************
AMEC Nuclear Holdings Limited (no. 3725076), AMEC NNC Limited (no. 1120437), National Nuclear Corporation Limited (no. 2290928), STATS-NNC Limited (no. 4339062) and Technica-NNC Limited (no. 235856).  The registered office of each company is at Booths Park, Chelford Road, Knutsford, Cheshire WA16 8QZ except for Technica-NNC Limited whose registered office is at Citygate, Altens Farm Road, Aberdeen, Aberdeenshire, AB12 3LB.  AMEC NNC's main  switchboard number is 01565 633800.  
The AMEC NNC website is www.amecnnc.com

Any request, advice, information or opinion in this message which does not relate to the business of any of the above companies is not authorised by any of the above companies.  Where this message does so relate,  it is sent by the relevant company (as above) and is commercial in confidence and intended for the use of the individual or entity to whom it is addressed.  The content is subject to contract and, unless so stated, does not form part of any contract.  If you have received this e-mail in error please notify the AMEC NNC system manager by email at eadm at amecnnc.com.
**********************************************************************


From coutinho at dcc.ufmg.br  Wed Jan 24 12:51:59 2007
From: coutinho at dcc.ufmg.br (Bruno Rocha Coutinho)
Date: Wed, 24 Jan 2007 18:51:59 -0200
Subject: [Beowulf] middleware for heterogeneous cluster
Message-ID: <45B7C6EF.6000906@dcc.ufmg.br>

 > We are planning to integrate some legacy hardware (Pentium IV and Xeon
 > two-processors) with brand new hardware (Itanium II Montecito SMP
 > boards) in order to build an heterogeneous cluster.
 >
 > I'm interested in using a middleware like Warewulf, Perceus, Oscar,
 > Rocks, ... to reduce installation and maintenance overhead,


I think this middleware can solve your installation problems:
http://wiki.systemimager.org/

You can have several images to be replicated and select a image and a 
install script for each machine.


From award at uda.ad  Wed Jan 24 23:15:53 2007
From: award at uda.ad (Alan Ward)
Date: Thu, 25 Jan 2007 08:15:53 +0100
Subject: RS: [Beowulf] An OT patented rgb editorial rant, skip if you like...
References: <OFD98C2949.DEE5EEC5-ON8525726C.006EABCD-8525726C.0073E300@rohmhaas.com><Pine.LNX.4.64.0701231639280.6292@lilith.rgb.private.net><002c01c73f98$79d85cb0$0300a8c0@gourmandises>
	<200701251314.32034.csamuel@vpac.org>
Message-ID: <FCDCFA11F8533D468EF04D2144C121EB27BC@mail.ua.ad>


Hi.

Here at University of Andorra we had some problems getting the authorization to install Linux in just one of the labs. We installed Ubuntu. 

Six months later, user feedback made it possible to convert all but one of them to Ubuntu.

It should be mentionned that our kids, like yours, are average barely-out-of-highschool, and that computer litteracy depends a whole lot on which school thay have attented previously - and on home income. Even a 500 Euro laptop an be a big strain on some people's economies.

Several of our nursing students have gone straight from zero to KDE *without* getting lost in the murky wilderness of Windows. No pain. ;-) 

-Alan Ward


-----Missatge original-----
De: beowulf-bounces at beowulf.org en nom de Chris Samuel
Enviat el: dj. 25/01/2007 03:14
Per a: beowulf at beowulf.org
Tema: Re: [Beowulf] An OT patented rgb editorial rant, skip if you like...
 
On Wednesday 24 January 2007 8:17 pm, Vincent Diepeveen wrote:

> As linux is total unusuable by the average grammar- and highschool kid,
> windows is the only alternative to them.

Fortunately someone forgot to tell the ones in Australia about that!

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070125/50b8e0ba/attachment.html>

From ruhollah.mb at gmail.com  Sat Jan 27 03:52:16 2007
From: ruhollah.mb at gmail.com (Ruhollah Moussavi Baygi )
Date: Sat, 27 Jan 2007 15:22:16 +0330
Subject: [Beowulf] Mouse pointer disappear
Message-ID: <1bef2ce30701270352y212bcca2lb3d5ddbc9d361e92@mail.gmail.com>

I have already established a Beowulf cluster with six nodes.

Each node with cpu AMD Athlon 64-bit X2 dual core 4200+,

motherb ASUS M2NPV-MX (chipset NVIDIA GeForce 6150 GPU, & NVIDIA nForce 430
MCP), graphic integrated in the NVIDIA GeForce 6

RAM 2G

HDD W/D STATII 320G

OS= FC5 64-bit

I use one mouse/keyboard/monitor, shared by KVM box


 I prefer to work in graphical mode of Linux rather than text mode. But, my
problem is that I cannot see mouse pointer in after Linux loading.

If anyone knows the solution, please give me help.

With best regards,
Ruhollah Moussavi Baygi
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070127/5920dc55/attachment.html>

From diep at xs4all.nl  Mon Jan 29 01:33:14 2007
From: diep at xs4all.nl (Vincent Diepeveen)
Date: Mon, 29 Jan 2007 10:33:14 +0100
Subject: [Beowulf] An OT patented rgb editorial rant, skip if you like...
References: <OFD98C2949.DEE5EEC5-ON8525726C.006EABCD-8525726C.0073E300@rohmhaas.com><Pine.LNX.4.64.0701231639280.6292@lilith.rgb.private.net><002c01c73f98$79d85cb0$0300a8c0@gourmandises><200701251314.32034.csamuel@vpac.org>
	<FCDCFA11F8533D468EF04D2144C121EB27BC@mail.ua.ad>
Message-ID: <004101c74388$83321820$0300a8c0@gourmandises>

RS: [Beowulf] An OT patented rgb editorial rant, skip if you like...And at home they're using a $0.25 copy of windows to play games?

Vincent
  ----- Original Message ----- 
  From: Alan Ward 
  To: Chris Samuel ; beowulf at beowulf.org 
  Sent: Thursday, January 25, 2007 8:15 AM
  Subject: RS: [Beowulf] An OT patented rgb editorial rant, skip if you like...


  Hi.

  Here at University of Andorra we had some problems getting the authorization to install Linux in just one of the labs. We installed Ubuntu.

  Six months later, user feedback made it possible to convert all but one of them to Ubuntu.

  It should be mentionned that our kids, like yours, are average barely-out-of-highschool, and that computer litteracy depends a whole lot on which school thay have attented previously - and on home income. Even a 500 Euro laptop an be a big strain on some people's economies.

  Several of our nursing students have gone straight from zero to KDE *without* getting lost in the murky wilderness of Windows. No pain. ;-)

  -Alan Ward


  -----Missatge original-----
  De: beowulf-bounces at beowulf.org en nom de Chris Samuel
  Enviat el: dj. 25/01/2007 03:14
  Per a: beowulf at beowulf.org
  Tema: Re: [Beowulf] An OT patented rgb editorial rant, skip if you like...

  On Wednesday 24 January 2007 8:17 pm, Vincent Diepeveen wrote:

  > As linux is total unusuable by the average grammar- and highschool kid,
  > windows is the only alternative to them.

  Fortunately someone forgot to tell the ones in Australia about that!


------------------------------------------------------------------------------


  _______________________________________________
  Beowulf mailing list, Beowulf at beowulf.org
  To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070129/cf6e0947/attachment.html>

From rgb at phy.duke.edu  Mon Jan 29 07:00:44 2007
From: rgb at phy.duke.edu (Robert G. Brown)
Date: Mon, 29 Jan 2007 10:00:44 -0500 (EST)
Subject: [Beowulf] Microsoft Rants, Gorification...
In-Reply-To: <732F361DB3C00E4999AE107665FCEF23048486@kfd-ex56.nnc.co.uk>
References: <732F361DB3C00E4999AE107665FCEF23048486@kfd-ex56.nnc.co.uk>
Message-ID: <Pine.LNX.4.64.0701290833300.16688@lilith.rgb.private.net>

On Wed, 24 Jan 2007, Cannon, Andrew wrote:

> The text of this message should be posted on every forum we can.  It was a
> fascinating read and needs publishing far and wide.  Robert, do we have your
> permission to post this on any forums/bbs/mailing lists that we read
> (subject to proper notification that it belongs to you)?

Sure, feel free.  Mind you, it isn't beyond criticism or reproach --
some very valid counterpoints have been raised both on and off list by
various people.  You might want to splice this reply onto the bottom of
those reposts both to record the blanket permission to republish with
attribution and to serve as an "addendum" to some of the subsequent
discussion.

There are a number of things that I left out of the original post.  For
example, I didn't talk much about MS's current tendency to try to lock
in the market by means of software patents as software copyrights have
proven ineffective in protecting a supermonopoly's interest in remaining
a supermonopoly.  It is just too easy for people to reverse engineer
software, clone software, or write brand new software that goes beyond
software.

Unsurprisingly, as a few links that were posted clearly show, MS's
patent reach has already gone far beyond their legal or ethical grasp,
and they are trying to patent other people's inventions or ideas that
have long been in the public domain, counting on their ability to be
able to spend more money on lawsuits than the original inventors or any
who might challenge their right to own other people's ideas.  Honestly
one can see this tendency repeatedly expressed in their takeover of the
"Turbo" IDE idea, the integrated office suite idea, the internet, java,
and so on, but previously they've exploited the ease of legally cloning
clever software ideas mixed with their ability to manipulate the
development environment to their advantage.  Now they're turning around
and exploiting things the other way -- stealing the ideas and legally
cloning successful products and THEN patenting them as their own so
nobody else can clone or use them, including the original inventors.

Way cool, actually.  Ar, matey.  Take no prisoners.  Into the briney
deep with them.  At least we now know where the distant descendants of
Captain Jack Sparrow ended up...

Also, I probably overemphasized the economic influence of pension funds
on decision makers, as MS has already undergone one major correction
(along with the general dotcom collapse) where it lost half its value.
I personally think it has another half or three quarters to give, and
still think that heavy investments on the part of pension funds etc give
them an unnatural influence in the political and business arena, but
sure, their collapse probably wouldn't trigger an actual depression,
just some heavy relative impoverishment of the mostly very rich.

Some people noted that the IT business is so fast paced and cutthroat
competitive that they expect that a paradigm shift, perhaps to cell
phone based devices or something else entirely, will sooner or later
cause even MS's empire to come tumbling down.  I'm not so optimistic
about that -- if the invention of the web wasn't enough of a paradigm
shift to do it (noting that the web came out of the UNIX world and the
internet) what could possibly be?  Microsoft has just as good a chance
as any to hop on any new bandwagons as they appear in the IT landscape,
and they have the legal clout and unassailable position on the desktop
to co-opt it, patent it, and send the actual inventors down to Davey
Jones' Locker as they have so many times before.

Where is Borland today?  Oh, sure, it's big enough that Phillipe Kahn
probably isn't starving.  But it is surviving on the dregs, literally,
of MS's software development business.  Lotus?  That would be a
subsidiary of IBM.  Corel?  Hey, WordPerfect Office actually still
exists!  I'll bet they sell a bunch of it, too.  Not.  They'd "own" Java
if it weren't for the fact that Sun Microsystems still has a few billion
of its own that they can spend on lawsuits.  Netscape won their
antitrust lawsuit, and lost the war -- I've struggled with installing
Netscape in place of Explorer on XP boxes, and let me assure you, it
just breaks things all over the place, I'm sure by design.  Most people
who try it are forced to reselect Explorer as their default browser and
in some cases just plain uninstall Netscape (something that seems to
work perfectly, even where the install does not).  .NET is clearly more
of the same -- html is too open, php and friends ditto, java belongs to
Sun, so we damn sure maybe want a development environment and integrated
browser stuff that we can patent, copyright, and use to gorify* the
active application aspects of the Internet.

[* Gorify:  verb, meaning to "assert the invention of when one really
didn't, honest", as in one "gorifies" the Internet by asserting that one
invented it, one "gorifies" Global Warming by asserting that one
invented THAT.  Here's a nice example of contemporary usage:

"Steven Ballmer recently gorified XML as the core component of .NET by
asserting that only Microsoft has had the vision of extending XML across
both client and server."

(See e.g.

   http://news.zdnet.com/2100-3513_22-961877.html

-- which has some really juicy quotes "There has yet to be any
innovation, new features or new capabilities out of the Linux platform",
for example -- and

   http://www.itwriting.com/dotnet1.php

for a critique of .NET that is remarkably well done.)

Well, shoot, I've got this little application, y'see, called xmlsysd (a
server) and wulfstat (a client) that was released in early 2002, when I
hadn't every HEARD of .NET.  And the idea of languages that are
converted into executable form at runtime, gee, didn't Borland invent
that on the compiler side?  Isn't that pretty much what the scripting
language of your choice does, or Java, or even some of my programmable
graphics UIs do?  Hey, they're gorifying again, but this time in the
patent office -- be very scared...]

Finally, it was pointed out that they are just one medium sized iceberg
in a sea of giant multinational corporations that exercise soullessly
evil influence on government, business, money, and human lives, with
e.g. oil companies, car companies, banking and holding companies, and
some major manufacturing companies all bigger than and potentially
eviller that MS.  Here I agree that there are plenty of other big evil
supergiant companies (large enough to serve as shadow governments in
their own right) that we as citizens should be concerned about, although
those companies also do much good in the sense that they are the
backbone of the US/World economy and for better or worse provide a
living and many comforts and amenities to people all over the world.

However I disagree as well.  The difference between, say, WalMart and
Microsoft, or Ford and Microsoft, is that WalMart is far from being a
supermonopoly.  It has competition from Roses, from K-Mart, from Target,
from Sears, from many other large stores with similar merchandise and
targeted consumers.  Ford has competition from many other car companies,
and is perfectly capable of losing billions of dollars in relative
market share to them if their management does a poor job.  Consumers
have real, stable, economically viable choices that they can make
according to their whim, their pocketbook, and their political or
environmental preferences.  I know a lot of people who refuse to shop at
WalMart, for example, BECAUSE they are destroying competition and choice
and exploiting labor forces at home and abroad to achieve their
low-price edge.  Those people can do this because they HAVE choices --
local merchants, other chains with prices that are nearly as good or
that carry better quality merchandise with less of an exploitative price
tag, owners that are less butt-headed that Sam Walton's children
apparently are.

Adam Smith's good old invisible hand still works for these companies,
even though their size makes them seem invulnerable.  After all, I
remember days when K-Mart WAS the "WalMart" of today, when Roses was
still a great place to shop instead of surviving at the edge of
extinction.  In a way, WalMart's success (and their current
difficulties) are competition in action.  We "vote" in retail with our
choices.

This is not true for Microsoft.  There are few examples even in history
of market dominance like Microsoft's.  95% of the GLOBAL consumer
desktop market, most of that via locked in hardware agreements that
never even present the ILLUSION of choice to the consumer.  The
remaining market divided up between: a software company that has never
quite realized that this is what they are and that persists on
representing itself as a hardware company, with management that is so
quixotic and ego-tonic that in spite of its occasional brilliance and
appeal to the rebels and artists out there, it could never be viewed as
a serious threat; and Linux, which is if anything even more quixotic and
unpredictable even though it has proven to be a threat to take seriously
because it is so difficult to control and so cost effective in certain
contexts.

I would be very, very worried if 95% of the oil in the world were
controlled by a single, basically unregulated company.  I would be very,
very worried if 95% of the cars being driven were Fords and the
remaining "cars" were either somewhat pricey SUVs made by a single
manufacturer or homemade from a build-your-own-car kit.  I would be
absolutely terrified if WalMart controlled 95% of all consumer retail of
any sort, with what is left of Sears controlling 4% of the remainder and
1% consisting of small family businesses struggling to hold on.

So should we all be concerned about a market that controls the flow of
>>information<< that has a major sector where 95% of all business is
controlled by a single company, a company that also controls the lion's
share (by a healthy, although less overwhelming margin) of the other
major sector?  Damn sure you betcha.  When that company has a clear
history of and several "convictions" on record for anticompetitive
business practices, when that company makes side deals with major
foreign countries that more or less enable "thought control" via topdown
management (something that our own government has on more than one
occasion tried to mandate), when that company has cleverly arranged
things so that it makes MORE money in actual marginal profit than the
companies that made the hardware, than the businesses that sell the
hardware and software alike -- well, forgive me for think of them as an
unwelcome hand reaching into my purse and a potential threat to my
political liberty combined.

    rgb

P.S. -- to the rest of the beowulf list, that's it for this thread, I
quit, I'm done, got work to do gorifying cluster monitoring tools like
xmlsysd and working on my newly gorified dieharder application, not to
mention gorifying Maxwell-- I mean "Brown's Equations" for my physics
class.  I just reinvent the notation a bit, that's all that one really
needs to do, right?  Suppose I use \vec{F} for the (electric )F(ield)
instead of \vec{E}, that ought to do it...hmmmm.

So, anybody can use my immortal prose -- without gorifying it -- and we
can let this thread die die die.  I'm sure some of you already wish I
would die die die as it is...;-)

-- 
Robert G. Brown	                       http://www.phy.duke.edu/~rgb/
Duke University Dept. of Physics, Box 90305
Durham, N.C. 27708-0305
Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu


From rgb at phy.duke.edu  Mon Jan 29 07:13:04 2007
From: rgb at phy.duke.edu (Robert G. Brown)
Date: Mon, 29 Jan 2007 10:13:04 -0500 (EST)
Subject: [Beowulf] Mouse pointer disappear
In-Reply-To: <1bef2ce30701270352y212bcca2lb3d5ddbc9d361e92@mail.gmail.com>
References: <1bef2ce30701270352y212bcca2lb3d5ddbc9d361e92@mail.gmail.com>
Message-ID: <Pine.LNX.4.64.0701291001580.16688@lilith.rgb.private.net>

On Sat, 27 Jan 2007, Ruhollah Moussavi Baygi  wrote:

> I have already established a Beowulf cluster with six nodes.
>
> Each node with cpu AMD Athlon 64-bit X2 dual core 4200+,
>
> motherb ASUS M2NPV-MX (chipset NVIDIA GeForce 6150 GPU, & NVIDIA nForce 430
> MCP), graphic integrated in the NVIDIA GeForce 6
>
> RAM 2G
>
> HDD W/D STATII 320G
>
> OS= FC5 64-bit
>
> I use one mouse/keyboard/monitor, shared by KVM box
>
>
>
> I prefer to work in graphical mode of Linux rather than text mode. But, my
> problem is that I cannot see mouse pointer in after Linux loading.
>
> If anyone knows the solution, please give me help.

Mice are funny.  Some mice have to be reset when you boot, or when X
starts up, in order to work.  If you have a KVM box it is actually
pretty easy to "freeze" the mouse.

To test whether or not this is the problem, try plugging the mouse
directly into your GUI node and booting.  The mouse should "just work".
If it does, then try putting the mouse back into the KVM chain and
booting just that node without switching the KVM box (so that the mouse
signal is sustained through the boot and startup of X11). Since the KVM
is basically nothing but an extender cable in this case, it should just
work.

Then try switching the mouse around from system to system.  If the mouse
stops working, then you need a better KVM box, one that keeps power on
to the mouse through the switching process -- Belkin makes some
good/cheap ones, so do a few other companies.  Unpowered parallel rotary
switch boxes (which I've tried) are notorious for dropping the mouse,
and often the mouse won't come back until you send it the kind of reset
information that usually only is sent in a boot or X11 startup.

Sometimes you can get away with just adding a better mouse, as well.
Some of the newer e.g. USB mice can tolerate disconnection/reconnection
better than others.

Note that this is a very idiosyncratic problem -- some people will have
it because of their COMBINATION of hardware while others with some
overlap of that hardware will not.  You just need to find a combination
that works, probably beginning with a better KVM.

Oh, I forgot to say that if the mouse doesn't work when you first boot
straight through with no KVM in at all, either your mouse itself is
broken or you haven't correctly installed the mouse support in X.  I
don't know what the approved way of installing/configuring mice from a
tty interface is these days -- they "just work" for me pretty
consistently so I haven't had to do this for three or four revisions now
-- but somebody on list probably does.

    rgb

>
> With best regards,
> Ruhollah Moussavi Baygi
>

-- 
Robert G. Brown	                       http://www.phy.duke.edu/~rgb/
Duke University Dept. of Physics, Box 90305
Durham, N.C. 27708-0305
Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu


From peter.st.john at gmail.com  Mon Jan 29 07:24:11 2007
From: peter.st.john at gmail.com (Peter St. John)
Date: Mon, 29 Jan 2007 10:24:11 -0500
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <37613.192.168.1.1.1169919762.squirrel@mail.eadline.org>
References: <web-1279309@free.net> <45AE3C29.1090809@scalableinformatics.com>
	<6.2.3.4.2.20070117101040.0337f2b8@mail.jpl.nasa.gov>
	<45AE6BEE.9050001@scalableinformatics.com>
	<74735BF202608043B11025A9FAA9438904662308@WIN-MSG-21.wingroup.windeploy.ntdev.microsoft.com>
	<37613.192.168.1.1.1169919762.squirrel@mail.eadline.org>
Message-ID: <e4d4fd070701290724g4ba1ac2cvdf845fd65943b916@mail.gmail.com>

I would suggest the BlueJ folks stroll across the campus and introduce
themselves to the Law facutly. If some lawyer wanted to build a cluster,
we'd think him a fool not to ask us about it, wouldn't we? And they have
some serious documentation searching projects :-)
By "us" of course I mean "you". I'm just an algorithmist, lurking on this
list looking for wisdom about building a tiny beowulf. I've already strolled
to my nearest campus to talk to EE guys.
Peter


On 1/27/07, Douglas Eadline <deadline at clustermonkey.net> wrote:
>
> > I know some of you aren't, um, tolerant of Microsoft for various reasons
> > but I thought I'd clear up a couple errors in some of the posts. If you
> > hate Microsoft at least you now have an email address for when you're
> > feeling grumpy.
> >
>
> Like Robert, I don't hate Microsoft. However, the following
> URL is a recent example of why many people like myself
> fear Microsoft (and other big "We are going to patent obvious and prior
> art because you can't afford to defend yourself" technology companies)
>
> http://www.bluej.org/mrt/?p=21
>
> --
> Doug
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit
> http://www.beowulf.org/mailman/listinfo/beowulf
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070129/812366e5/attachment.html>

From peter.st.john at gmail.com  Mon Jan 29 07:40:50 2007
From: peter.st.john at gmail.com (Peter St. John)
Date: Mon, 29 Jan 2007 10:40:50 -0500
Subject: [Beowulf] middleware for heterogeneous cluster
In-Reply-To: <45B7C6EF.6000906@dcc.ufmg.br>
References: <45B7C6EF.6000906@dcc.ufmg.br>
Message-ID: <e4d4fd070701290740k3d372b68n50c9059ca0724fb2@mail.gmail.com>

Bruno,
Has anybody tried dynamically reconfiguring the nodes to optimize for an
application, maybe trimming the kernels down to bare minima for quick
swichovers? I'm interested in optimization algorithms more than in
application hosting per se, so it would not need to be real fast for
expermienting.
Peter


On 1/24/07, Bruno Rocha Coutinho <coutinho at dcc.ufmg.br> wrote:
>
> > We are planning to integrate some legacy hardware (Pentium IV and Xeon
> > two-processors) with brand new hardware (Itanium II Montecito SMP
> > boards) in order to build an heterogeneous cluster.
> >
> > I'm interested in using a middleware like Warewulf, Perceus, Oscar,
> > Rocks, ... to reduce installation and maintenance overhead,
>
>
> I think this middleware can solve your installation problems:
> http://wiki.systemimager.org/
>
> You can have several images to be replicated and select a image and a
> install script for each machine.
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit
> http://www.beowulf.org/mailman/listinfo/beowulf
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070129/b6095bad/attachment.html>

From hahn at mcmaster.ca  Mon Jan 29 10:06:55 2007
From: hahn at mcmaster.ca (Mark Hahn)
Date: Mon, 29 Jan 2007 13:06:55 -0500 (EST)
Subject: [Beowulf] GlusterFS 1.2-BENKI (GNU Cluster File System) -
	Announcement
In-Reply-To: <10960.201.234.224.4.1169585317.squirrel@mail.gnu-india.org>
References: <10960.201.234.224.4.1169585317.squirrel@mail.gnu-india.org>
Message-ID: <Pine.LNX.4.64.0701291304590.16089@coffee.psychology.mcmaster.ca>

> http://www.gluster.org/docs/index.php/GlusterFS_Benchmarks for

nice graph.  but how does it look if you compare a single glusterfs 
brick with a single NFS brick?

> The next release (1.3-BENKI) will have further enhancements such as
> "ib-verbs", Infiniband RDMA transport, asynchronous I/O and epoll
> leading to much lower latency and more POSIX correctness.

but the page only mentions bandwidth; in what cases does the latency
matter so much?


From deadline at clustermonkey.net  Mon Jan 29 14:59:50 2007
From: deadline at clustermonkey.net (Douglas Eadline)
Date: Mon, 29 Jan 2007 17:59:50 -0500 (EST)
Subject: [Beowulf] SGI to offer Windows on clusters
In-Reply-To: <37613.192.168.1.1.1169919762.squirrel@mail.eadline.org>
References: <web-1279309@free.net>
	<45AE3C29.1090809@scalableinformatics.com><6.2.3.4.2.20070117101040.0337f2b8@mail.jpl.nasa.gov>
	<45AE6BEE.9050001@scalableinformatics.com>
	<74735BF202608043B11025A9FAA9438904662308@WIN-MSG-21.wingroup.windeploy.ntdev.microsoft.com>
	<37613.192.168.1.1.1169919762.squirrel@mail.eadline.org>
Message-ID: <54456.192.168.1.1.1170111590.squirrel@mail.eadline.org>


And sometimes Microsoft can do the right thing:

http://listserv.acm.org/scripts/wa.exe?A2=ind0701d&L=sigcse-members&F=&S=&P=2285


 --
 Doug


>> I know some of you aren't, um, tolerant of Microsoft for various reasons
>> but I thought I'd clear up a couple errors in some of the posts. If you
>> hate Microsoft at least you now have an email address for when you're
>> feeling grumpy.
>>
>
> Like Robert, I don't hate Microsoft. However, the following
> URL is a recent example of why many people like myself
> fear Microsoft (and other big "We are going to patent obvious and prior
> art because you can't afford to defend yourself" technology companies)
>
> http://www.bluej.org/mrt/?p=21
>
> --
> Doug
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit
> http://www.beowulf.org/mailman/listinfo/beowulf
>
> !DSPAM:45bb8fbb151061396896698!
>


--
Doug


From 06002352 at brookes.ac.uk  Mon Jan 29 16:17:02 2007
From: 06002352 at brookes.ac.uk (Mitchell Wisidagamage)
Date: Tue, 30 Jan 2007 00:17:02 +0000
Subject: [Beowulf] massive parallel processing application required
Message-ID: <45BE8E7E.4010808@brookes.ac.uk>

Hi all,
  As part of my dissertation, I'm looking for "raw data" which will be 
used for massive parallel processing using Beuwulf cluster (with the use 
of PVM or MPI). I tried looking for e-science raw data (and the 
computations required on it) such as bioinformatics, fluid dynamics, 
etc. but without any luck.

Anyone has any idea of getting some raw data so I can give compute 
intensive "work" to the nodes?

Any pointers/hints/tips would be very much appriciated.

Best wishes,
  Mitchell


From mark.kosmowski at gmail.com  Mon Jan 29 19:11:23 2007
From: mark.kosmowski at gmail.com (Mark Kosmowski)
Date: Mon, 29 Jan 2007 22:11:23 -0500
Subject: [Beowulf] building my first cluster
Message-ID: <c84311bb0701291911i3dd0870fl6530a34db34e8c39@mail.gmail.com>

Dear Beowulf Community:

I'm building my first cluster and wanted to express thankfulness for
the community resources here.  I've been reading about clustering here
and at clustermonkey.net and my hardware is arriving this week.  In
fact, my power supplies and CPUs arrived today.

I'm going to be adding 2 nodes of dual opteron machines to my existing
dual opteron workstation to make a 3 node 6 processor (all single
core) cluster.  This is being done from personal funds and will be
used to support my doctoral research in computational chemistry.
However, as much as I'd like to learn clustering, the research has to
come first and I'm prepared to fall back to a 3 workstation setup if I
fail miserably at cluster implementation.  Each machine will have 4 Gb
RAM, and I may upgrade RAM as budget permits (which means each machine
will have 4 Gb RAM, at least until I've finished my PhD program).  In
order to be able to fall back to the three workstations if needed,
each machine will have a 80 - 120 Gb hard drive.

I'm probably going to use SUSE Professional 9.1 as my distribution (my
PGI 5.2 compilers don't run on OpenSUSE 10.2 - I've tried the
suggestions to fix this from pgroup.com forums but they didn't work -
I'd be happy for further ideas regarding this issue).

Other than just saying hello here, I do have a question.  For this
modest sized cluster, how important is it to have a cluster manager
like OSCAR?  Will it be just as time consuming to just learn a flavor
of MPI as it would be to learn to use OSCAR (for example)?  I will
primarily be using CPMD for my calculations, but may want to try out
abinit and DFT++.

Thanks,

Mark Kosmowski
Syracuse University
Syracuse, NY
US


From coutinho at dcc.ufmg.br  Tue Jan 30 06:01:41 2007
From: coutinho at dcc.ufmg.br (Bruno Rocha Coutinho)
Date: Tue, 30 Jan 2007 12:01:41 -0200
Subject: [Beowulf] middleware for heterogeneous cluster
Message-ID: <45BF4FC5.3040702@dcc.ufmg.br>

I don't heard of anybody doing that, but:

The entire client install process is controlled by the install script, 
so you can customize it to do only what you really need ant can 
configure systemimager to use your trimmed down kernel in the install 
process to have a working client faster.

I think that the speed of the install will depend of your network 
bandwidth. SystemImager has three transports to image a client: rsync, 
multicast and bitturrent. If you use small images and tweak the install 
script, the install can be faster that the reboot itself.


2007/1/29, Peter St. John <peter.st.john at gmail.com>:

    Bruno,
    Has anybody tried dynamically reconfiguring the nodes to optimize
    for an application, maybe trimming the kernels down to bare minima
    for quick swichovers? I'm interested in optimization algorithms more
    than in application hosting per se, so it would not need to be real
    fast for expermienting.
    Peter

     
    On 1/24/07, *Bruno Rocha Coutinho* <coutinho at dcc.ufmg.br
    <mailto:coutinho at dcc.ufmg.br>> wrote:

         > We are planning to integrate some legacy hardware (Pentium IV
        and Xeon
         > two-processors) with brand new hardware (Itanium II Montecito
        SMP
         > boards) in order to build an heterogeneous cluster.
         >
         > I'm interested in using a middleware like Warewulf, Perceus,
        Oscar,
         > Rocks, ... to reduce installation and maintenance overhead,


        I think this middleware can solve your installation problems:
        http://wiki.systemimager.org/

        You can have several images to be replicated and select a image
        and a
        install script for each machine.
        _______________________________________________
        Beowulf mailing list, Beowulf at beowulf.org
        <mailto:Beowulf at beowulf.org>
        To change your subscription (digest mode or unsubscribe) visit
        http://www.beowulf.org/mailman/listinfo/beowulf
        <http://www.beowulf.org/mailman/listinfo/beowulf>


From olivier.crameri at gmail.com  Tue Jan 30 06:04:41 2007
From: olivier.crameri at gmail.com (Olivier Crameri)
Date: Tue, 30 Jan 2007 15:04:41 +0100
Subject: [Beowulf] Survey about software upgrades
Message-ID: <b54af0b00701300604i277fabdu26977237d2f6e977@mail.gmail.com>

(please apologize if you receive multiple copies of this message)

Hi all,

in the scope of our research project, we are currently building a
prototype infrastructure to simplify the software upgrade management
cycle.

In order to progress with our study, we are conducting a survey on the
common problems faced by system administrators regarding software
upgrades.

We would highly appreciate if you could help us in completing the
survey. This should not take you more than 10 to 15 minutes. The
survey can be found at this address:
http://survey.epfl.ch/?form=Soft_upgrade_survey

Note that our research project is a joint effort from two different
laboratories at EPFL (http://labos.epfl.ch and http://nsl.epfl.ch in
Switzerland). The project is not affiliated with or sponsored by any
commercial organization. We will share the survey results with the
practitioner and research communities through scientific papers.

Also, in order to recognize your effort in providing the testimony, we
will hold a lottery to select four winners who will each receive a $50
(50 american dollars) amazon.com gift certificate.

We thank you very much for your help,

With best regards,

Olivier Crameri,
Ph.D. Student
Operating Systems Laboratory (http://labos.epfl.ch)
EPFL, Switzerland


From enverever at hotmail.com  Tue Jan 30 07:28:43 2007
From: enverever at hotmail.com (enver ever)
Date: Tue, 30 Jan 2007 15:28:43 +0000
Subject: [Beowulf] failure rates
Message-ID: <BAY101-F13A3268A2A60F32451BBCBA1A60@phx.gbl>

Hello there

I am a PhD student working on mathematical looking to the availability of  
Beowulf  clusters.

I was looking whether or not it is possible to take exponential failure 
rates fot the nodes.

Thats the case in these publications:

1- "A Realistic Evaluation of Consistency Algorithms for Replicated 
Files"Annual Simulation Symposium  archive Proceedings of the 21st annual 
symposium on Simulation table of contents Tampa, Florida, United States  
Pages: 121 - 130   Year of Publication: 1988  ISBN:0-8186-0845-5

2-"Availability Modeling and Analysis on High Performance ClusterComputing 
Systems"Availability, Reliability and Security, 2006. ARES 2006. The First 
International Conference on Publication Date: 20-22 April 2006

3-"A Failure Predictive and Policy-Based High Availability Strategy for 
Linux High Performance Computing Cluster" Chokchai Leangsuksun1, Tong Liu1, 
Tirumala Rao1, Stephen L. Scott2, and Richard Libby Linux.com | LCI 5th 
International Linux Cluster Conference.

I think it can be taken as exponentially distributed since in many 
multi-server systems this was the approach followed.

I would appreciate if you could add any comments

Many Regards

_________________________________________________________________
MSN Hotmail is evolving ? check out the new Windows Live Mail 
http://ideas.live.com


From deadline at clustermonkey.net  Tue Jan 30 12:31:26 2007
From: deadline at clustermonkey.net (Douglas Eadline)
Date: Tue, 30 Jan 2007 15:31:26 -0500 (EST)
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <45BE8E7E.4010808@brookes.ac.uk>
References: <45BE8E7E.4010808@brookes.ac.uk>
Message-ID: <36397.192.168.1.1.1170189086.squirrel@mail.eadline.org>


You may want to quantify "massively parallel"
and define "raw data" (i.e. how many
processors and data for which specific application?)


  --
  Doug


> Hi all,
>   As part of my dissertation, I'm looking for "raw data" which will be
> used for massive parallel processing using Beuwulf cluster (with the use
> of PVM or MPI). I tried looking for e-science raw data (and the
> computations required on it) such as bioinformatics, fluid dynamics,
> etc. but without any luck.
>
> Anyone has any idea of getting some raw data so I can give compute
> intensive "work" to the nodes?
>
> Any pointers/hints/tips would be very much appriciated.
>
> Best wishes,
>   Mitchell
>
>
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit
> http://www.beowulf.org/mailman/listinfo/beowulf
>
> !DSPAM:45bf9f33277683326710967!
>


--
Doug


From deadline at clustermonkey.net  Tue Jan 30 12:31:44 2007
From: deadline at clustermonkey.net (Douglas Eadline)
Date: Tue, 30 Jan 2007 15:31:44 -0500 (EST)
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <45BE8E7E.4010808@brookes.ac.uk>
References: <45BE8E7E.4010808@brookes.ac.uk>
Message-ID: <36398.192.168.1.1.1170189104.squirrel@mail.eadline.org>


You may want to quantify "massively parallel"
and define "raw data" (i.e. how many
processors and data for which specific application?)


  --
  Doug


> Hi all,
>   As part of my dissertation, I'm looking for "raw data" which will be
> used for massive parallel processing using Beuwulf cluster (with the use
> of PVM or MPI). I tried looking for e-science raw data (and the
> computations required on it) such as bioinformatics, fluid dynamics,
> etc. but without any luck.
>
> Anyone has any idea of getting some raw data so I can give compute
> intensive "work" to the nodes?
>
> Any pointers/hints/tips would be very much appriciated.
>
> Best wishes,
>   Mitchell
>
>
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit
> http://www.beowulf.org/mailman/listinfo/beowulf
>
> !DSPAM:45bf9f33277683326710967!
>


--
Doug


From hahn at mcmaster.ca  Tue Jan 30 17:18:27 2007
From: hahn at mcmaster.ca (Mark Hahn)
Date: Tue, 30 Jan 2007 20:18:27 -0500 (EST)
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <45BE8E7E.4010808@brookes.ac.uk>
References: <45BE8E7E.4010808@brookes.ac.uk>
Message-ID: <Pine.LNX.4.64.0701302015200.16343@coffee.psychology.mcmaster.ca>

> As part of my dissertation, I'm looking for "raw data" which will be used 
> for massive parallel processing using Beuwulf cluster (with the use of PVM

my "massive", so you mean "embarassingly parallel" (aka loosely coupled)?
if so, I'd probably go with password cracking ;)

> MPI). I tried looking for e-science raw data (and the computations required 
> on it) such as bioinformatics, fluid dynamics, etc. but without any luck.

most EP is distinguished by having very little input; 
sometimes none at all.


From john.hearns at streamline-computing.com  Wed Jan 31 00:14:33 2007
From: john.hearns at streamline-computing.com (John Hearns)
Date: Wed, 31 Jan 2007 08:14:33 +0000
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <45BE8E7E.4010808@brookes.ac.uk>
References: <45BE8E7E.4010808@brookes.ac.uk>
Message-ID: <45C04FE9.5050502@streamline-computing.com>

Mitchell Wisidagamage wrote:
> Hi all,
>  As part of my dissertation, I'm looking for "raw data" which will be 
> used for massive parallel processing using Beuwulf cluster (with the use 
> of PVM or MPI). I tried looking for e-science raw data (and the 
> computations required on it) such as bioinformatics, fluid dynamics, 
> etc. but without any luck.
> 
> Anyone has any idea of getting some raw data so I can give compute 
> intensive "work" to the nodes?

Mitchell,
   how about running the NIST Fire Dynamics simulation?
http://www.fire.nist.gov/fds/
It simulates the spread of smoke and fire in buildings.
There are some sample input models for download.

The Smokeview program visualizes the output, which will be a nice 
demonstration for your tutor.


But why not just go across to the Oxford E-science centre?
I know for sure they have one cluster there for handling large datasets!
Ask them for help in getting a suitable dataset for your project.

Drop me an email if I can give you any advice, you're in my neck of the 
woods.


-- 
      John Hearns
      Senior HPC Engineer
      Streamline Computing,
      The Innovation Centre, Warwick Technology Park,
      Gallows Hill, Warwick CV34 6UW
      Office: 01926 623130 Mobile: 07841 231235


From eugen at leitl.org  Wed Jan 31 02:20:41 2007
From: eugen at leitl.org (Eugen Leitl)
Date: Wed, 31 Jan 2007 11:20:41 +0100
Subject: [eugen@leitl.org: Re: [Beowulf] Re: Selling computation time]
Message-ID: <20070131102041.GA1755@leitl.org>


(resending the message, since it never got posted)

On Fri, Dec 29, 2006 at 09:17:29AM +1100, Chris Samuel wrote:

> > It is a reasonable assumption that Sun did their homework. I wonder who
> > they are targeting.
> 
> http://www.channelregister.co.uk/2005/10/25/sun_grid_slip/
> 
> Sun's grid: lights on, no customers   (October 2005)

Compare this to Amazon (EC2, S3).

----- End forwarded message -----
-- 
Eugen* Leitl <a href="http://leitl.org">leitl</a> http://leitl.org
______________________________________________________________
ICBM: 48.07100, 11.36820            http://www.ativel.com
8B29F6BE: 099D 78BA 2FD3 B014 B08A  7779 75B0 2443 8B29 F6BE


From eugen at leitl.org  Wed Jan 31 02:21:41 2007
From: eugen at leitl.org (Eugen Leitl)
Date: Wed, 31 Jan 2007 11:21:41 +0100
Subject: [Beowulf] [eugen@leitl.org: accelerator cards (German)]
Message-ID: <20070131102141.GB1755@leitl.org>

(this is also a resend)

A nice summary of recent products and price/performance;
unfortunately German-only:

http://www.heise.de/newsticker/meldung/83902

----- End forwarded message -----
-- 
Eugen* Leitl <a href="http://leitl.org">leitl</a> http://leitl.org
______________________________________________________________
ICBM: 48.07100, 11.36820            http://www.ativel.com
8B29F6BE: 099D 78BA 2FD3 B014 B08A  7779 75B0 2443 8B29 F6BE


From gerry.creager at tamu.edu  Wed Jan 31 06:37:02 2007
From: gerry.creager at tamu.edu (Gerry Creager)
Date: Wed, 31 Jan 2007 08:37:02 -0600
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <Pine.LNX.4.64.0701302015200.16343@coffee.psychology.mcmaster.ca>
References: <45BE8E7E.4010808@brookes.ac.uk>
	<Pine.LNX.4.64.0701302015200.16343@coffee.psychology.mcmaster.ca>
Message-ID: <45C0A98E.4000508@tamu.edu>

Numerical weather prediction?  Uses a fair bit of initial boundary 
condition data from other models...

Climate code, especially when coupling between atmosphere and ocean models?

Both tend to be embarassingly parallel and run w/ MPI.
gerry

Mark Hahn wrote:
>> As part of my dissertation, I'm looking for "raw data" which will be 
>> used for massive parallel processing using Beuwulf cluster (with the 
>> use of PVM
> 
> my "massive", so you mean "embarassingly parallel" (aka loosely coupled)?
> if so, I'd probably go with password cracking ;)
> 
>> MPI). I tried looking for e-science raw data (and the computations 
>> required on it) such as bioinformatics, fluid dynamics, etc. but 
>> without any luck.
> 
> most EP is distinguished by having very little input; sometimes none at 
> all.
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit 
> http://www.beowulf.org/mailman/listinfo/beowulf

-- 
Gerry Creager -- gerry.creager at tamu.edu
Texas Mesonet -- AATLT, Texas A&M University	
Cell: 979.229.5301 Office: 979.458.4020 FAX: 979.862.3983
Office: 1700 Research Parkway Ste 160, TAMU, College Station, TX 77843


From hahn at mcmaster.ca  Wed Jan 31 06:42:13 2007
From: hahn at mcmaster.ca (Mark Hahn)
Date: Wed, 31 Jan 2007 09:42:13 -0500 (EST)
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <45C0A98E.4000508@tamu.edu>
References: <45BE8E7E.4010808@brookes.ac.uk>
	<Pine.LNX.4.64.0701302015200.16343@coffee.psychology.mcmaster.ca>
	<45C0A98E.4000508@tamu.edu>
Message-ID: <Pine.LNX.4.64.0701310938150.22216@coffee.psychology.mcmaster.ca>

> Climate code, especially when coupling between atmosphere and ocean models?

seems like it would require some nontrivial physics, not to mention
realistic input data.  don't most climate codes also depend on huge
multi-dimensional FFT's where the transpose is coded as all-to-all?

here's an alternative: nbody physics.  just put a bunch of particles in some
empty space and see what they do as they interact through gravity.  of
course, gravity is all-to-all, but then again in a nontrivial sense, less 
coupled problems are less interesting...


From rgb at phy.duke.edu  Wed Jan 31 07:55:31 2007
From: rgb at phy.duke.edu (Robert G. Brown)
Date: Wed, 31 Jan 2007 10:55:31 -0500 (EST)
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <Pine.LNX.4.64.0701310938150.22216@coffee.psychology.mcmaster.ca>
References: <45BE8E7E.4010808@brookes.ac.uk>
	<Pine.LNX.4.64.0701302015200.16343@coffee.psychology.mcmaster.ca>
	<45C0A98E.4000508@tamu.edu>
	<Pine.LNX.4.64.0701310938150.22216@coffee.psychology.mcmaster.ca>
Message-ID: <Pine.LNX.4.64.0701311051490.16420@lilith.rgb.private.net>

On Wed, 31 Jan 2007, Mark Hahn wrote:

>> Climate code, especially when coupling between atmosphere and ocean models?
>
> seems like it would require some nontrivial physics, not to mention
> realistic input data.  don't most climate codes also depend on huge
> multi-dimensional FFT's where the transpose is coded as all-to-all?
>
> here's an alternative: nbody physics.  just put a bunch of particles in some
> empty space and see what they do as they interact through gravity.  of
> course, gravity is all-to-all, but then again in a nontrivial sense, less 
> coupled problems are less interesting...

Or another simple physics problem -- simulate e.g. the Ising problem, or
any of a number of problems in magnetism.  Nearest neighbor
interactions, "known results".

And if it is just a matter of a nifty demo, don't forget the always
useful parallel mandelbrot set packages and/or rendering packages
(povray).  I'm pretty sure both are still around -- I still use xep
to demo PVM, although I have hacked it a bit because it is now too easy
to get to the "bottom" of floating point resolution even on a single
processor, which actually kind of sucks as the display breaks down just
when you get way down into the set where things get very odd and spiky.
Spikier.  Whatever.

    rgb

> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit 
> http://www.beowulf.org/mailman/listinfo/beowulf
>

-- 
Robert G. Brown	                       http://www.phy.duke.edu/~rgb/
Duke University Dept. of Physics, Box 90305
Durham, N.C. 27708-0305
Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu


From eugen at leitl.org  Wed Jan 31 08:43:04 2007
From: eugen at leitl.org (Eugen Leitl)
Date: Wed, 31 Jan 2007 17:43:04 +0100
Subject: [Beowulf] clusters in gaming
Message-ID: <20070131164304.GB21677@leitl.org>


I've been looking at Second Life recently, which does most
things server-side (in fact, running a distributed world
with game physics) unlike games like WoW, where the intelligence
(and crunch) is mostly in the client. Linden Labs run a
large cluster to host the game world, which is segmented
by virtual machines. I don't know which network topology
they use, but a contiguous game world maps well to a 2d mesh
or a torus.

What I didn't like is that most of the game is purportedly
based on a byte-compiled language, with some long-term plans
to switch to .Net (Mono, actually), which should result in
much improved performance. Current performance is 
rather ridiculous, even high-priority simulations like
private islands only tolerate few 10 avatars before severe
performance degradation, and even crashes.

While I do see what a usual C/C++ MPI approach wouldn't
be probably enough for a highly dynamic and flexible virtual
environment, the result still strikes me as inelegant,
and killing architectural deficiences by throwing enough
hardware at it (not necessary always wrong, mark, just
not in this case).

Can things be compiled in realtime by passing code snippets
in conventional compiled languages, or is this always limited
to highly dynamic environments like Smalltalk (which OpenCroquet
is based on) or Lisp (with sbcl and cmucl there are now great
compilers for Lisp, though I don't know about MPI support)?

-- 
Eugen* Leitl <a href="http://leitl.org">leitl</a> http://leitl.org
______________________________________________________________
ICBM: 48.07100, 11.36820            http://www.ativel.com
8B29F6BE: 099D 78BA 2FD3 B014 B08A  7779 75B0 2443 8B29 F6BE


From 06002352 at brookes.ac.uk  Tue Jan 30 16:23:27 2007
From: 06002352 at brookes.ac.uk (Mitchell Wisidagamage)
Date: Wed, 31 Jan 2007 00:23:27 +0000
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <36397.192.168.1.1.1170189086.squirrel@mail.eadline.org>
References: <45BE8E7E.4010808@brookes.ac.uk>
	<36397.192.168.1.1.1170189086.squirrel@mail.eadline.org>
Message-ID: <45BFE17F.5080901@brookes.ac.uk>

Thank you for the reply. what I mean by massively parallel processing is 
"compute intensive" problem where I will be able to process on many 
hosts independently in parallel (by parallelizing the algorithms). 
Apologies if I'm repeating what you already know. Currently there are 9 
hosts on Beowulf cluster in Lab running on UltraSPARC 5 (yes, very old 
machines!).

"raw data" I'm looking for are the input files required for the 
application, so I can "process" them. I do not have any specific 
application in mind. I will be developing the different components of 
the application such as job scheduling, fault tolerance, fault recovery, 
etc for my dissertation. I will be useing PVM or MPI for message passing 
and programming in c.

But I do have any specific application in mind and I don't mind any 
application. I can come up with say, "data mining" but it's pointless 
unless I have the "raw data" files to process and know the patterns I'm 
looking for.

Application design and programming model all depends on the type of 
application I get my hands on. I'm quite desperate for a "problem" since 
I have to submit my proposal end of next week. Any help would he greatly 
appreciated. I'm sure everyone here has lot of experience in distributed 
processing and hopefully can be of some help. Sorry for the long post.


Thanks,
  Mitchell


Douglas Eadline wrote:
> You may want to quantify "massively parallel"
> and define "raw data" (i.e. how many
> processors and data for which specific application?)
> 
> 
>   --
>   Doug
> 
> 
>> Hi all,
>>   As part of my dissertation, I'm looking for "raw data" which will be
>> used for massive parallel processing using Beuwulf cluster (with the use
>> of PVM or MPI). I tried looking for e-science raw data (and the
>> computations required on it) such as bioinformatics, fluid dynamics,
>> etc. but without any luck.
>>
>> Anyone has any idea of getting some raw data so I can give compute
>> intensive "work" to the nodes?
>>
>> Any pointers/hints/tips would be very much appriciated.
>>
>> Best wishes,
>>   Mitchell
>>
>>
>>
>> _______________________________________________
>> Beowulf mailing list, Beowulf at beowulf.org
>> To change your subscription (digest mode or unsubscribe) visit
>> http://www.beowulf.org/mailman/listinfo/beowulf
>>
>> !DSPAM:45bf9f33277683326710967!
>>
> 
> 
> --
> Doug


From 06002352 at brookes.ac.uk  Wed Jan 31 03:10:20 2007
From: 06002352 at brookes.ac.uk (Mitchell Wisidagamage)
Date: Wed, 31 Jan 2007 11:10:20 +0000
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <Pine.LNX.4.64.0701302015200.16343@coffee.psychology.mcmaster.ca>
References: <45BE8E7E.4010808@brookes.ac.uk>
	<Pine.LNX.4.64.0701302015200.16343@coffee.psychology.mcmaster.ca>
Message-ID: <45C0791C.5080904@brookes.ac.uk>

Any kind system is fine. It all depends on the type of the "application" 
I get.

I thought of password cracking. But there wouldn't be much substance in 
the project in it.

Any ideas? There's got be lots of transaction processing.

When I was working at an ISP we calculated dial-up internet usage by 
useing the large RADIUS log files. The program took quite took quite a 
while to process. And I remember there were many other applications that 
took a while to process such as the bandwidth manager, anti-spam/anti 
virus gateway, traffic monitoring software, etc. Unfortunately I no I 
know anyone who's still working at the company.

Any other real world applications where I can atleast simulate input data?

thanks,
   Mitchell


Mark Hahn wrote:
>> As part of my dissertation, I'm looking for "raw data" which will be 
>> used for massive parallel processing using Beuwulf cluster (with the 
>> use of PVM
> 
> my "massive", so you mean "embarassingly parallel" (aka loosely coupled)?
> if so, I'd probably go with password cracking ;)
> 
>> MPI). I tried looking for e-science raw data (and the computations 
>> required on it) such as bioinformatics, fluid dynamics, etc. but 
>> without any luck.
> 
> most EP is distinguished by having very little input; sometimes none at 
> all.


From 06002352 at brookes.ac.uk  Wed Jan 31 03:17:56 2007
From: 06002352 at brookes.ac.uk (Mitchell Wisidagamage)
Date: Wed, 31 Jan 2007 11:17:56 +0000
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <45C04FE9.5050502@streamline-computing.com>
References: <45BE8E7E.4010808@brookes.ac.uk>
	<45C04FE9.5050502@streamline-computing.com>
Message-ID: <45C07AE4.1020508@brookes.ac.uk>

Thank you very much for the fire dynamics idea. I will have a look at it.

I did try to contact many e-science projects including some researchers 
at Oxford. But I got no reply. Then I went to get some contacts from a 
tutor who worked at a e-science project himself. He told me people, 
especially scientists are "very jealous" of their data. And not replying 
is a kind way of saying "no". And there's the problem of "who's this guy 
wanting my data", "what will he do with it?".

I have given up the e-science idea. Now looking for other real world 
applications.

Thanks,
Mitchell


John Hearns wrote:
> Mitchell Wisidagamage wrote:
>> Hi all,
>>  As part of my dissertation, I'm looking for "raw data" which will be 
>> used for massive parallel processing using Beuwulf cluster (with the 
>> use of PVM or MPI). I tried looking for e-science raw data (and the 
>> computations required on it) such as bioinformatics, fluid dynamics, 
>> etc. but without any luck.
>>
>> Anyone has any idea of getting some raw data so I can give compute 
>> intensive "work" to the nodes?
> 
> Mitchell,
>   how about running the NIST Fire Dynamics simulation?
> http://www.fire.nist.gov/fds/
> It simulates the spread of smoke and fire in buildings.
> There are some sample input models for download.
> 
> The Smokeview program visualizes the output, which will be a nice 
> demonstration for your tutor.
> 
> 
> But why not just go across to the Oxford E-science centre?
> I know for sure they have one cluster there for handling large datasets!
> Ask them for help in getting a suitable dataset for your project.
> 
> Drop me an email if I can give you any advice, you're in my neck of the 
> woods.
> 
> 


From 06002352 at brookes.ac.uk  Wed Jan 31 08:29:44 2007
From: 06002352 at brookes.ac.uk (06002352)
Date: Wed, 31 Jan 2007 16:29:44 +0000
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <Pine.LNX.4.64.0701310858500.22216@coffee.psychology.mcmaster.ca>
References: <45BE8E7E.4010808@brookes.ac.uk>
	<Pine.LNX.4.64.0701302015200.16343@coffee.psychology.mcmaster.ca>
	<45C0791C.5080904@brookes.ac.uk>
	<Pine.LNX.4.64.0701310858500.22216@coffee.psychology.mcmaster.ca>
Message-ID: <45C0C3F8.20608@brookes.ac.uk>

> 
> the reason this is such a weird query is that parallel processing is 
> normally
> motivated by already having something that needs it.  I have a hard time
> imagining what work you've already done that has somehow managed to be not
> driven by having a compute-intensive job at hand. 

I have to come up with my own problem. I worked on a Beowulf cluster for
  my assignment and I quite liked it. And I'm interested in HPC.

> it's about like when people build a cluster as an end in itself, then 
>ask for advice on what to use it for.
>

That's true and it's embarassing. I liked working on the Beowulf cluster
and want to do project on "cluster computing" to solve a compute
intensive problem. So I know how I will go on implementing it but don't
know what I will implement. :o)


From joelja at bogus.com  Wed Jan 31 09:17:42 2007
From: joelja at bogus.com (Joel Jaeggli)
Date: Wed, 31 Jan 2007 09:17:42 -0800
Subject: [Beowulf] clusters in gaming
In-Reply-To: <20070131164304.GB21677@leitl.org>
References: <20070131164304.GB21677@leitl.org>
Message-ID: <45C0CF36.6030507@bogus.com>

Eugen Leitl wrote:
> What I didn't like is that most of the game is purportedly
> based on a byte-compiled language, with some long-term plans
> to switch to .Net (Mono, actually), which should result in
> much improved performance. Current performance is 
> rather ridiculous, even high-priority simulations like
> private islands only tolerate few 10 avatars before severe
> performance degradation, and even crashes.

you're talking about an environment where whole regions are routinely
crashed by self replicating objects. The thing is a huge kludge.


From DaveWolfson at mail.maricopa.gov  Wed Jan 31 09:40:02 2007
From: DaveWolfson at mail.maricopa.gov (Dave Wolfson - MCDOTX)
Date: Wed, 31 Jan 2007 10:40:02 -0700
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <45C0A98E.4000508@tamu.edu>
References: <45BE8E7E.4010808@brookes.ac.uk><Pine.LNX.4.64.0701302015200.16343@coffee.psychology.mcmaster.ca>
	<45C0A98E.4000508@tamu.edu>
Message-ID: <C8E34300C8A8CA4889603F514EAF6F6E07AB21@EVS4.enterprise.maricopa.gov>


There is a highly developed multi-organizational effort under the
umbrella of the Weather Research and Forecasting Model (WRF) group, home
page http://www.wrf-model.org/index.php .  This is the same basic model
that is operational on massively parallel systems at NCAR and NCEP.
Considerable resources and documentation are available from this link.

As with the operational weather models the availability and
incorporation of real-time observational data is essential and highly
refined.  There are versions and subsets of these models that are run on
1..n systems.  As one would expect from such a non-trivial application,
the implementation on one's own system is not trivial; however there are
many individuals and low-budget sites getting usable results from these
models today. 

David Wolfson, Senior Transportation Analyst 
Maricopa County Department of Transportation 
2901 West Durango Street 
Phoenix, Arizona 85009 
ph: 602-506-6950 
fax: 602-506-4882 
email: DaveWolfson at mail.maricopa.gov 
also at: dwolfson at inficad.com 
ph: 602-881-3799 

-----Original Message-----
From: beowulf-bounces at beowulf.org [mailto:beowulf-bounces at beowulf.org]
On Behalf Of Gerry Creager
Sent: Wednesday, January 31, 2007 7:37 AM
To: Mark Hahn
Cc: beowulf at beowulf.org
Subject: [SPAM:] - Re: [Beowulf] massive parallel processing application
required - Email has different SMTP TO: and MIME TO: fields in the email
addresses

Numerical weather prediction?  Uses a fair bit of initial boundary 
condition data from other models...

Climate code, especially when coupling between atmosphere and ocean
models?

Both tend to be embarassingly parallel and run w/ MPI.
gerry

Mark Hahn wrote:
>> As part of my dissertation, I'm looking for "raw data" which will be 
>> used for massive parallel processing using Beuwulf cluster (with the 
>> use of PVM
> 
> my "massive", so you mean "embarassingly parallel" (aka loosely
coupled)?
> if so, I'd probably go with password cracking ;)
> 
>> MPI). I tried looking for e-science raw data (and the computations 
>> required on it) such as bioinformatics, fluid dynamics, etc. but 
>> without any luck.
> 
> most EP is distinguished by having very little input; sometimes none
at 
> all.
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit 
> http://www.beowulf.org/mailman/listinfo/beowulf

-- 
Gerry Creager -- gerry.creager at tamu.edu
Texas Mesonet -- AATLT, Texas A&M University	
Cell: 979.229.5301 Office: 979.458.4020 FAX: 979.862.3983
Office: 1700 Research Parkway Ste 160, TAMU, College Station, TX 77843
_______________________________________________
Beowulf mailing list, Beowulf at beowulf.org
To change your subscription (digest mode or unsubscribe) visit
http://www.beowulf.org/mailman/listinfo/beowulf


From deadline at eadline.org  Wed Jan 31 10:28:32 2007
From: deadline at eadline.org (Douglas Eadline)
Date: Wed, 31 Jan 2007 13:28:32 -0500 (EST)
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <45C0791C.5080904@brookes.ac.uk>
References: <45BE8E7E.4010808@brookes.ac.uk>
	<Pine.LNX.4.64.0701302015200.16343@coffee.psychology.mcmaster.ca>
	<45C0791C.5080904@brookes.ac.uk>
Message-ID: <60632.192.168.1.1.1170268112.squirrel@mail.eadline.org>


If you want to do a little development and impress your friends,
try playing with pgapack (Parallel Genetic Algorithm Library)

http://www-fp.mcs.anl.gov/CCST/research/reports_pre1998/comp_bio/stalk/pgapack.html

You can develop a GA on single computer then run it on
a cluster.

 --
 Doug

> Any kind system is fine. It all depends on the type of the "application"
> I get.
>
> I thought of password cracking. But there wouldn't be much substance in
> the project in it.
>
> Any ideas? There's got be lots of transaction processing.
>
> When I was working at an ISP we calculated dial-up internet usage by
> useing the large RADIUS log files. The program took quite took quite a
> while to process. And I remember there were many other applications that
> took a while to process such as the bandwidth manager, anti-spam/anti
> virus gateway, traffic monitoring software, etc. Unfortunately I no I
> know anyone who's still working at the company.
>
> Any other real world applications where I can atleast simulate input data?
>
> thanks,
>    Mitchell
>
>
>
> Mark Hahn wrote:
>>> As part of my dissertation, I'm looking for "raw data" which will be
>>> used for massive parallel processing using Beuwulf cluster (with the
>>> use of PVM
>>
>> my "massive", so you mean "embarassingly parallel" (aka loosely
>> coupled)?
>> if so, I'd probably go with password cracking ;)
>>
>>> MPI). I tried looking for e-science raw data (and the computations
>>> required on it) such as bioinformatics, fluid dynamics, etc. but
>>> without any luck.
>>
>> most EP is distinguished by having very little input; sometimes none at
>> all.
>
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit
> http://www.beowulf.org/mailman/listinfo/beowulf
>
> !DSPAM:45c0d96397956865219710!
>


--
Doug


From peter.st.john at gmail.com  Wed Jan 31 10:36:57 2007
From: peter.st.john at gmail.com (Peter St. John)
Date: Wed, 31 Jan 2007 13:36:57 -0500
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <45C07AE4.1020508@brookes.ac.uk>
References: <45BE8E7E.4010808@brookes.ac.uk>
	<45C04FE9.5050502@streamline-computing.com>
	<45C07AE4.1020508@brookes.ac.uk>
Message-ID: <e4d4fd070701311036oe3b6f61se736c5709a5a7780@mail.gmail.com>

Mitchell,
I advocate building your own data, which is trivial in mathematics
applications. One of the first uses for distributing hard computations to
volunteers on the net with idle CPU time was number theory (primality
testing, finding primes, for cryptography, for example).

A site that provides various source code to do some of these things is
ECMNET ("Elliptic Curves Method'),
http://www.loria.fr/~zimmerma/records/ecmnet.html
It isn't necessary to do any number theoretic research yourself, but you'd
want to be comfortable with sentences like "use <this algorithm> to factor a
candidate Fermat Number". In this example, a Fermat Number is a particular
number that is "probably" prime. It's hard to factor, to find out if it is.
If it is, you can use it to generate public keys for cryptography. If it
isn't, somebody will read your mail. So you have to look up the formula for
Fermat numbers and appy the algortithm to try and factor it. You don't need
to do any math yourself besides elementary algebra, and there are software
packages for everything.

An example from the site: "Peter Montgomery found in November 1995 a factor
of 47 digits of 5^256+1". The exponential thing, a huge power of a prime,
plus one, would be the possible prime; it was proven not to be prime, by
finding a 47 digit prime factor. Nowadays 47 digits is chump change; I don't
know the current records. But you can burn up all your nodes by asking each
of them to do a few factorizations of this nature, and you can become famous
by finding a new largest prime.

A somewhat prettier site about Elliptic Curves is
http://www.math.utah.edu/~jfernand/elliptic/

There are probably contributors to the Cunningham Project (mentioned by
ECMnet) that would love to help you implement the app on your cluster, in
exchange for access to your cluster for their apps, which in this case would
amount to the same thing.

Peter


On 1/31/07, Mitchell Wisidagamage <06002352 at brookes.ac.uk> wrote:
>
> Thank you very much for the fire dynamics idea. I will have a look at it.
>
> I did try to contact many e-science projects including some researchers
> at Oxford. But I got no reply. Then I went to get some contacts from a
> tutor who worked at a e-science project himself. He told me people,
> especially scientists are "very jealous" of their data. And not replying
> is a kind way of saying "no". And there's the problem of "who's this guy
> wanting my data", "what will he do with it?".
>
> I have given up the e-science idea. Now looking for other real world
> applications.
>
> Thanks,
> Mitchell
>
>
> John Hearns wrote:
> > Mitchell Wisidagamage wrote:
> >> Hi all,
> >>  As part of my dissertation, I'm looking for "raw data" which will be
> >> used for massive parallel processing using Beuwulf cluster (with the
> >> use of PVM or MPI). I tried looking for e-science raw data (and the
> >> computations required on it) such as bioinformatics, fluid dynamics,
> >> etc. but without any luck.
> >>
> >> Anyone has any idea of getting some raw data so I can give compute
> >> intensive "work" to the nodes?
> >
> > Mitchell,
> >   how about running the NIST Fire Dynamics simulation?
> > http://www.fire.nist.gov/fds/
> > It simulates the spread of smoke and fire in buildings.
> > There are some sample input models for download.
> >
> > The Smokeview program visualizes the output, which will be a nice
> > demonstration for your tutor.
> >
> >
> > But why not just go across to the Oxford E-science centre?
> > I know for sure they have one cluster there for handling large datasets!
> > Ask them for help in getting a suitable dataset for your project.
> >
> > Drop me an email if I can give you any advice, you're in my neck of the
> > woods.
> >
> >
>
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit
> http://www.beowulf.org/mailman/listinfo/beowulf
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070131/0ea1dc31/attachment.html>

From rgb at phy.duke.edu  Wed Jan 31 14:03:46 2007
From: rgb at phy.duke.edu (Robert G. Brown)
Date: Wed, 31 Jan 2007 17:03:46 -0500 (EST)
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <45C07AE4.1020508@brookes.ac.uk>
References: <45BE8E7E.4010808@brookes.ac.uk>
	<45C04FE9.5050502@streamline-computing.com>
	<45C07AE4.1020508@brookes.ac.uk>
Message-ID: <Pine.LNX.4.64.0701311519060.16420@lilith.rgb.private.net>

On Wed, 31 Jan 2007, Mitchell Wisidagamage wrote:

> Thank you very much for the fire dynamics idea. I will have a look at it.
>
> I did try to contact many e-science projects including some researchers at 
> Oxford. But I got no reply. Then I went to get some contacts from a tutor who 
> worked at a e-science project himself. He told me people, especially 
> scientists are "very jealous" of their data. And not replying is a kind way 
> of saying "no". And there's the problem of "who's this guy wanting my data", 
> "what will he do with it?".
>
> I have given up the e-science idea. Now looking for other real world 
> applications.

Remember, NASA puts all (or at least a lot) of its e.g. weather data
online.  And there are many things one can do with it.  Look for the
NOAA sites.  You can get sunspot data, proxy temperature data, and much
more, and build your very own climate model.  If you do, don't be
surprised if it fails to agree with the current one (due to be
re-released today, IIRC, from the IPCC).

    rgb

>
> Thanks,
> Mitchell
>
>
> John Hearns wrote:
>> Mitchell Wisidagamage wrote:
>>> Hi all,
>>>  As part of my dissertation, I'm looking for "raw data" which will be used 
>>> for massive parallel processing using Beuwulf cluster (with the use of PVM 
>>> or MPI). I tried looking for e-science raw data (and the computations 
>>> required on it) such as bioinformatics, fluid dynamics, etc. but without 
>>> any luck.
>>> 
>>> Anyone has any idea of getting some raw data so I can give compute 
>>> intensive "work" to the nodes?
>> 
>> Mitchell,
>>   how about running the NIST Fire Dynamics simulation?
>> http://www.fire.nist.gov/fds/
>> It simulates the spread of smoke and fire in buildings.
>> There are some sample input models for download.
>> 
>> The Smokeview program visualizes the output, which will be a nice 
>> demonstration for your tutor.
>> 
>> 
>> But why not just go across to the Oxford E-science centre?
>> I know for sure they have one cluster there for handling large datasets!
>> Ask them for help in getting a suitable dataset for your project.
>> 
>> Drop me an email if I can give you any advice, you're in my neck of the 
>> woods.
>> 
>> 
>
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit 
> http://www.beowulf.org/mailman/listinfo/beowulf
>

-- 
Robert G. Brown	                       http://www.phy.duke.edu/~rgb/
Duke University Dept. of Physics, Box 90305
Durham, N.C. 27708-0305
Phone: 1-919-660-2567  Fax: 919-660-2525     email:rgb at phy.duke.edu


From hahn at mcmaster.ca  Wed Jan 31 14:37:41 2007
From: hahn at mcmaster.ca (Mark Hahn)
Date: Wed, 31 Jan 2007 17:37:41 -0500 (EST)
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <45C07AE4.1020508@brookes.ac.uk>
References: <45BE8E7E.4010808@brookes.ac.uk>
	<45C04FE9.5050502@streamline-computing.com>
	<45C07AE4.1020508@brookes.ac.uk>
Message-ID: <Pine.LNX.4.64.0701311714380.6132@coffee.psychology.mcmaster.ca>

> worked at a e-science project himself. He told me people, especially

is this "e-science" term more popular where you are?  I don't really
hear it here (Canada, probably NA in general.)  many (most) branches of 
science are so dependent on computers that it seems redundant and archaic.

> scientists are "very jealous" of their data. And not replying is a kind way 
> of saying "no". And there's the problem of "who's this guy wanting my data", 
> "what will he do with it?".

sure - academia is all about publishing, so you don't want someone else 
to scoop you.  but often concerns are much more mundane, like:

 	- how long will it transfer this 50GB blob of data?

 	- if I provide neurophysiology data from my lab, can I
 	be sued if the subjects' privacy is violated?

 	- do I have time to spend explaining how the data is encoded?

regards, mark hahn.


From 06002352 at brookes.ac.uk  Wed Jan 31 17:33:12 2007
From: 06002352 at brookes.ac.uk (Mitchell Wisidagamage)
Date: Thu, 01 Feb 2007 01:33:12 +0000
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <Pine.LNX.4.64.0701311714380.6132@coffee.psychology.mcmaster.ca>
References: <45BE8E7E.4010808@brookes.ac.uk>
	<45C04FE9.5050502@streamline-computing.com>
	<45C07AE4.1020508@brookes.ac.uk>
	<Pine.LNX.4.64.0701311714380.6132@coffee.psychology.mcmaster.ca>
Message-ID: <45C14358.8030800@brookes.ac.uk>


> is this "e-science" term more popular where you are?  I don't really
> hear it here (Canada, probably NA in general.)  many (most) branches of 
> science are so dependent on computers that it seems redundant and archaic.
> 

"e-science" is huge here, in universities and scientific research . It's
associated with grid computing in the science area.


>     - how long will it transfer this 50GB blob of data?
> 
>     - if I provide neurophysiology data from my lab, can I
>     be sued if the subjects' privacy is violated?
> 
>     - do I have time to spend explaining how the data is encoded?
> 

and
  - hassle of going up the ladder to get permission from bosses
  - explain the formulas and how to process the data.


From 06002352 at brookes.ac.uk  Wed Jan 31 17:51:20 2007
From: 06002352 at brookes.ac.uk (Mitchell Wisidagamage)
Date: Thu, 01 Feb 2007 01:51:20 +0000
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <45C0E921.7090600@tempemusic.com>
References: <45BE8E7E.4010808@brookes.ac.uk>	<36397.192.168.1.1.1170189086.squirrel@mail.eadline.org>
	<45BFE17F.5080901@brookes.ac.uk> <45C0E921.7090600@tempemusic.com>
Message-ID: <45C14798.9040304@brookes.ac.uk>

lenoxx at tempemusic.com wrote:
> You can try:
> CMAQ
> www.cmascenter.org
> MM5
> http://www.mmm.ucar.edu/mm5/
> WRF
> http://www.wrf-model.org/index.php
> 
> 
> Any of the above could be used and input data can be gathered from many 
> sources. If you want some WRF inputs, let me know. I don't currently 
> have any CMAQ or MM5 inputs though.
> 
> 
> 
Thank you for the links. Wonder how I didn't find them before. I spend 
lots of time searching for sites like that.

Now I'm not sure what to do with these data sets. I should program my 
own application. But how should I be processing them?...without the 
algorithms for processing I'm lost.  :o)


From gerry.creager at tamu.edu  Wed Jan 31 18:44:58 2007
From: gerry.creager at tamu.edu (Gerry Creager)
Date: Wed, 31 Jan 2007 20:44:58 -0600
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <45C14358.8030800@brookes.ac.uk>
References: <45BE8E7E.4010808@brookes.ac.uk>	<45C04FE9.5050502@streamline-computing.com>	<45C07AE4.1020508@brookes.ac.uk>	<Pine.LNX.4.64.0701311714380.6132@coffee.psychology.mcmaster.ca>
	<45C14358.8030800@brookes.ac.uk>
Message-ID: <45C1542A.4030701@tamu.edu>

No... "e-Science" has been co-opted by "grid" as their own. 
Collaborative computational science has been around for quite some time. 
  Determining the appropriate degree of distributed computational 
interprocess communication requires more than trivial examination of the 
codes in use.

Please don't fall into the trap of thinking "e-Science" requires a tie 
to the Globus Toolkit to be valid.

gerry

Mitchell Wisidagamage wrote:
> 
>> is this "e-science" term more popular where you are?  I don't really
>> hear it here (Canada, probably NA in general.)  many (most) branches 
>> of science are so dependent on computers that it seems redundant and 
>> archaic.
>>
> 
> "e-science" is huge here, in universities and scientific research . It's
> associated with grid computing in the science area.
> 
> 
>>     - how long will it transfer this 50GB blob of data?
>>
>>     - if I provide neurophysiology data from my lab, can I
>>     be sued if the subjects' privacy is violated?
>>
>>     - do I have time to spend explaining how the data is encoded?
>>
> 
> and
>  - hassle of going up the ladder to get permission from bosses
>  - explain the formulas and how to process the data.
> 
> 
> 
> 
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit 
> http://www.beowulf.org/mailman/listinfo/beowulf

-- 
Gerry Creager -- gerry.creager at tamu.edu
Texas Mesonet -- AATLT, Texas A&M University	
Cell: 979.229.5301 Office: 979.458.4020 FAX: 979.862.3983
Office: 1700 Research Parkway Ste 160, TAMU, College Station, TX 77843


From gerry.creager at tamu.edu  Wed Jan 31 18:49:50 2007
From: gerry.creager at tamu.edu (Gerry Creager)
Date: Wed, 31 Jan 2007 20:49:50 -0600
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <45C14798.9040304@brookes.ac.uk>
References: <45BE8E7E.4010808@brookes.ac.uk>	<36397.192.168.1.1.1170189086.squirrel@mail.eadline.org>	<45BFE17F.5080901@brookes.ac.uk>
	<45C0E921.7090600@tempemusic.com> <45C14798.9040304@brookes.ac.uk>
Message-ID: <45C1554E.9000400@tamu.edu>

Don't forget the Weather Research and  Forecasting model at 
http://wrf-model.org/users/users.php

Mitchell Wisidagamage wrote:
> lenoxx at tempemusic.com wrote:
>> You can try:
>> CMAQ
>> www.cmascenter.org
>> MM5
>> http://www.mmm.ucar.edu/mm5/
>> WRF
>> http://www.wrf-model.org/index.php
>>
>>
>> Any of the above could be used and input data can be gathered from 
>> many sources. If you want some WRF inputs, let me know. I don't 
>> currently have any CMAQ or MM5 inputs though.
>>
>>
>>
> Thank you for the links. Wonder how I didn't find them before. I spend 
> lots of time searching for sites like that.
> 
> Now I'm not sure what to do with these data sets. I should program my 
> own application. But how should I be processing them?...without the 
> algorithms for processing I'm lost.  :o)
> 
> 
> 
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit 
> http://www.beowulf.org/mailman/listinfo/beowulf

-- 
Gerry Creager -- gerry.creager at tamu.edu
Texas Mesonet -- AATLT, Texas A&M University	
Cell: 979.229.5301 Office: 979.458.4020 FAX: 979.862.3983
Office: 1700 Research Parkway Ste 160, TAMU, College Station, TX 77843


From James.P.Lux at jpl.nasa.gov  Wed Jan 31 21:18:36 2007
From: James.P.Lux at jpl.nasa.gov (Jim Lux)
Date: Wed, 31 Jan 2007 21:18:36 -0800
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <45C07AE4.1020508@brookes.ac.uk>
References: <45BE8E7E.4010808@brookes.ac.uk>
	<45C04FE9.5050502@streamline-computing.com>
	<45C07AE4.1020508@brookes.ac.uk>
Message-ID: <6.2.3.4.2.20070131211133.03400140@mail.jpl.nasa.gov>

At 03:17 AM 1/31/2007, Mitchell Wisidagamage wrote:
>Thank you very much for the fire dynamics idea. I will have a look at it.
>
>I did try to contact many e-science projects including some 
>researchers at Oxford. But I got no reply. Then I went to get some 
>contacts from a tutor who worked at a e-science project himself. He 
>told me people, especially scientists are "very jealous" of their 
>data. And not replying is a kind way of saying "no". And there's the 
>problem of "who's this guy wanting my data", "what will he do with it?".
>
>I have given up the e-science idea. Now looking for other real world 
>applications.


Optimum path routing of ships and/or airplanes, taking into account 
the winds, currents, sea state, temperatures, etc.

Large realtime and climatological databases are available.
The path optimization algorithms are simple and fairly well known (A 
and A-star are two to start with).  The challenge is in suitable 
heuristics to prune the search space.

You can optimize for minimum time in transit, or minimum fuel cost, 
or minimum probability of delay, etc.

You can burn a lot of compute cycles even doing a fairly simple route 
(say, Los Angeles to Yokohama by ship or New York to Los Angeles by 
air), because the search space is quite dense (probably don't want to 
change course too often, but that's still hundreds of waypoints).. 
and then, after you've found the route, you should (either by looking 
at what you calculated during route finding, or as a post process 
step) do a sensitivity analysis to see how critical the routing is 
(if small variations in climate/weather cause huge changes in time, 
that's a bad thing)


>Thanks,
>Mitchell

James Lux, P.E.
Spacecraft Radio Frequency Subsystems Group
Flight Communications Systems Section
Jet Propulsion Laboratory, Mail Stop 161-213
4800 Oak Grove Drive
Pasadena CA 91109
tel: (818)354-2075
fax: (818)393-6875 


From James.P.Lux at jpl.nasa.gov  Wed Jan 31 21:29:46 2007
From: James.P.Lux at jpl.nasa.gov (Jim Lux)
Date: Wed, 31 Jan 2007 21:29:46 -0800
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <Pine.LNX.4.64.0701311519060.16420@lilith.rgb.private.net>
References: <45BE8E7E.4010808@brookes.ac.uk>
	<45C04FE9.5050502@streamline-computing.com>
	<45C07AE4.1020508@brookes.ac.uk>
	<Pine.LNX.4.64.0701311519060.16420@lilith.rgb.private.net>
Message-ID: <6.2.3.4.2.20070131212014.0304b408@mail.jpl.nasa.gov>

At 02:03 PM 1/31/2007, Robert G. Brown wrote:
>On Wed, 31 Jan 2007, Mitchell Wisidagamage wrote:
>
>>Thank you very much for the fire dynamics idea. I will have a look at it.
>>
>>I did try to contact many e-science projects including some 
>>researchers at Oxford. But I got no reply. Then I went to get some 
>>contacts from a tutor who worked at a e-science project himself. He 
>>told me people, especially scientists are "very jealous" of their 
>>data. And not replying is a kind way of saying "no". And there's 
>>the problem of "who's this guy wanting my data", "what will he do with it?".
>>
>>I have given up the e-science idea. Now looking for other real 
>>world applications.
>
>Remember, NASA puts all (or at least a lot) of its e.g. weather data
>online.

Well.. not exactly NASA.. operational "weather" data is the province 
of NOAA.  NASA does research, not operational, data, so there's 
typically a time lag, especially for processed and calibrated data.

By and large, most environmental data collected by NASA winds up in 
DAACs (Distributed Active Archiving Centers). Physical Oceanography 
data, for instance, winds up at PO-DAAC... 
http://www-podaac.jpl.nasa.gov/ which has data for sea surface 
temperature, sea surface topography, and ocean vector winds acquired 
by NASA instruments.  This whole process is very well documented, and 
the data moves through the various levels of processing and into the 
archives in a regular and stately fashion.

But, for instance, the live data from a single instrument (e.g. 
QuikSCAT for ocean winds, on which I worked) also gets fed to a 
realtime process at NOAA within about an hour after it's received on 
the ground every 100 minutes, and thence to folks like NCAR who run 
numerical models, which then winds up at the NWS and makes the 
weather predictions more accurate on the evening news.  This is a bit 
harder to find in a reliable online source, especially if you want 
things gridded into standard geographic grids, etc.   It's all out 
there, but since the funding stream for distribution is more tenuous 
(NOAA doesn't have as much money as NASA for this sort of thing, but 
they do have "real time" requirements), the data tends to be a bit 
more "raw" or idiosyncratic, and not necessarily in HDF files, 
etc.  It tends to be in whatever format is convenient for them, which 
may or may not be convenient for you.


>  And there are many things one can do with it.  Look for the
>NOAA sites.  You can get sunspot data, proxy temperature data, and much
>more, and build your very own climate model.  If you do, don't be
>surprised if it fails to agree with the current one (due to be
>re-released today, IIRC, from the IPCC).

James Lux, P.E.
Spacecraft Radio Frequency Subsystems Group
Flight Communications Systems Section
Jet Propulsion Laboratory, Mail Stop 161-213
4800 Oak Grove Drive
Pasadena CA 91109
tel: (818)354-2075
fax: (818)393-6875 


From James.P.Lux at jpl.nasa.gov  Wed Jan 31 21:37:23 2007
From: James.P.Lux at jpl.nasa.gov (Jim Lux)
Date: Wed, 31 Jan 2007 21:37:23 -0800
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <Pine.LNX.4.64.0701311714380.6132@coffee.psychology.mcmaste r.ca>
References: <45BE8E7E.4010808@brookes.ac.uk>
	<45C04FE9.5050502@streamline-computing.com>
	<45C07AE4.1020508@brookes.ac.uk>
	<Pine.LNX.4.64.0701311714380.6132@coffee.psychology.mcmaster.ca>
Message-ID: <6.2.3.4.2.20070131213007.0307aa18@mail.jpl.nasa.gov>

At 02:37 PM 1/31/2007, Mark Hahn wrote:
>>worked at a e-science project himself. He told me people, especially
>
>is this "e-science" term more popular where you are?  I don't really
>hear it here (Canada, probably NA in general.)  many (most) branches 
>of science are so dependent on computers that it seems redundant and archaic.
>
>>scientists are "very jealous" of their data. And not replying is a 
>>kind way of saying "no". And there's the problem of "who's this guy 
>>wanting my data", "what will he do with it?".
>
>sure - academia is all about publishing, so you don't want someone 
>else to scoop you.


This is an interesting aspect.  All the latest Announcements of 
Opportunity for space research (think cameras taking pictures of 
Mars, or analyzing rocks etc.) have fairly fast time lines(weeks, not 
months or years) for relase of data to general public, and you have 
to put your plans and budgets for public distribution in the 
proposal.  No more holding onto the data, "recalibrating and 
reprocessing",  until everyone gets their dissertations done.

The Dead Sea Scrolls are probably a pretty notorious example of 
"we're still working on it.. when we're done, we'll release 
it"...Amazing what a computer and a concordance can do.

>   but often concerns are much more mundane, like:
>
>         - how long will it transfer this 50GB blob of data?

Think terabytes, for many data sets.  QuikSCAT gives you wind vectors 
for 90% of the world's ice free ocean twice a day on a 25 km 
grid.  Raw radar data coming down from the spacecraft before 
calibration is basically 200 odd measurements/second (each 
measurement consisting of 10 numbers) continuously 24/7.  And, that's 
a low volume sensor.  Something like a SAR doing radar imaging is 
orders of magnitude greater. Sipping from a firehose indeed.


>         - if I provide neurophysiology data from my lab, can I
>         be sued if the subjects' privacy is violated?
>
>         - do I have time to spend explaining how the data is encoded?

That's a biggie... Budgets are always limited, so they tend to do 
just what's needed for the immediate purpose.


>regards, mark hahn.
>_______________________________________________
>Beowulf mailing list, Beowulf at beowulf.org
>To change your subscription (digest mode or unsubscribe) visit 
>http://www.beowulf.org/mailman/listinfo/beowulf

James Lux, P.E.
Spacecraft Radio Frequency Subsystems Group
Flight Communications Systems Section
Jet Propulsion Laboratory, Mail Stop 161-213
4800 Oak Grove Drive
Pasadena CA 91109
tel: (818)354-2075
fax: (818)393-6875 


From James.P.Lux at jpl.nasa.gov  Wed Jan 31 21:40:40 2007
From: James.P.Lux at jpl.nasa.gov (Jim Lux)
Date: Wed, 31 Jan 2007 21:40:40 -0800
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <45C14798.9040304@brookes.ac.uk>
References: <45BE8E7E.4010808@brookes.ac.uk>
	<36397.192.168.1.1.1170189086.squirrel@mail.eadline.org>
	<45BFE17F.5080901@brookes.ac.uk> <45C0E921.7090600@tempemusic.com>
	<45C14798.9040304@brookes.ac.uk>
Message-ID: <6.2.3.4.2.20070131213744.03074e00@mail.jpl.nasa.gov>

At 05:51 PM 1/31/2007, Mitchell Wisidagamage wrote:
>lenoxx at tempemusic.com wrote:
>>You can try:
>>CMAQ
>>www.cmascenter.org
>>MM5
>>http://www.mmm.ucar.edu/mm5/
>>WRF
>>http://www.wrf-model.org/index.php
>>
>>Any of the above could be used and input data can be gathered from 
>>many sources. If you want some WRF inputs, let me know. I don't 
>>currently have any CMAQ or MM5 inputs though.
>>
>Thank you for the links. Wonder how I didn't find them before. I 
>spend lots of time searching for sites like that.
>
>Now I'm not sure what to do with these data sets. I should program 
>my own application. But how should I be processing them?...without 
>the algorithms for processing I'm lost.  :o)


http://www.ocean-systems.com/VOSS.htm
www.weather.navy.mil/paoweb/starsams.ppt
http://realdistance.com/


>_______________________________________________
>Beowulf mailing list, Beowulf at beowulf.org
>To change your subscription (digest mode or unsubscribe) visit 
>http://www.beowulf.org/mailman/listinfo/beowulf

James Lux, P.E.
Spacecraft Radio Frequency Subsystems Group
Flight Communications Systems Section
Jet Propulsion Laboratory, Mail Stop 161-213
4800 Oak Grove Drive
Pasadena CA 91109
tel: (818)354-2075
fax: (818)393-6875 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20070131/ab973b63/attachment.html>

From James.P.Lux at jpl.nasa.gov  Wed Jan 31 21:45:09 2007
From: James.P.Lux at jpl.nasa.gov (Jim Lux)
Date: Wed, 31 Jan 2007 21:45:09 -0800
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <45BE8E7E.4010808@brookes.ac.uk>
References: <45BE8E7E.4010808@brookes.ac.uk>
Message-ID: <6.2.3.4.2.20070131214141.0342e818@mail.jpl.nasa.gov>

At 04:17 PM 1/29/2007, Mitchell Wisidagamage wrote:
>Hi all,
>  As part of my dissertation, I'm looking for "raw data" which will 
> be used for massive parallel processing using Beuwulf cluster (with 
> the use of PVM or MPI). I tried looking for e-science raw data (and 
> the computations required on it) such as bioinformatics, fluid 
> dynamics, etc. but without any luck.

An interesting computationally intensive task would be constructing a 
3-D model of a pot from multiple 2-d images of pot shards, especially 
if there are missing or extra pieces.

This would be quite a boon for archaeologists, who currently do it by 
hand, as a 3D jigsaw puzzle.

And, of course, it has almost no commercial value.. there are 
commercial archaeologists, but they don't get paid to reassemble broken pots.

A similar application might be to take a 3D tomographic image of a 
block of tar from the La Brea tarpits and extract all the bone 
images, and attempt to match them up.


James Lux, P.E.
Spacecraft Radio Frequency Subsystems Group
Flight Communications Systems Section
Jet Propulsion Laboratory, Mail Stop 161-213
4800 Oak Grove Drive
Pasadena CA 91109
tel: (818)354-2075
fax: (818)393-6875 


From steve_heaton at iinet.net.au  Wed Jan 31 22:29:33 2007
From: steve_heaton at iinet.net.au (Steve Heaton)
Date: Thu, 01 Feb 2007 17:29:33 +1100
Subject: [Beowulf] massive parallel processing application,	required
In-Reply-To: <200702010539.l115cTsQ000851@bluewest.scyld.com>
References: <200702010539.l115cTsQ000851@bluewest.scyld.com>
Message-ID: <45C188CD.4050400@iinet.net.au>

G'day Jim and all

Two interesting space examples would be the original Viking lander pics. 
The story goes that they were in such a rush to get the images to the 
press conference that they 'made the sky blue and the ground red'.

Later, more careful analysis showed the sky a more dusty pink/brown and 
the surface a truer brown (stronger purple component).

At the other end of the scale, the general public 'scooped' the Huygen's 
team by processing the raw landing approach images using the likes of 
Photoshop and had them on the Web in advance of the official release.

I'm sure, Jim, you likely have more inside details than I ;)

On the topic of met models, if anyone's looking for some good 'toy' code 
to start from, have a look at PUMA:
http://www.mi.uni-hamburg.de/PUMA.215.0.html

I recommend GrADS for the viz: http://www.iges.org/grads/

The PSU/NCAR MM5 code scares the willys out of me! ;)

(There *is* a train of thought here... I've used PUMA many moons ago to 
build my own Martian and Titan models. A certain perverse pleasure in 
watching your model shake itself to pieces!)

Cheers
Stevo

=====

Date: Wed, 31 Jan 2007 21:37:23 -0800
From: Jim Lux <James.P.Lux at jpl.nasa.gov>
Subject: Re: [Beowulf] massive parallel processing application
	required
To: Mark Hahn <hahn at mcmaster.ca>,	Mitchell Wisidagamage
	<06002352 at brookes.ac.uk>
Cc: beowulf at beowulf.org
Message-ID: <6.2.3.4.2.20070131213007.0307aa18 at mail.jpl.nasa.gov>
Content-Type: text/plain; charset="us-ascii"; format=flowed

At 02:37 PM 1/31/2007, Mark Hahn wrote:

 > >sure - academia is all about publishing, so you don't want someone
 > >else to scoop you.


This is an interesting aspect.  All the latest Announcements of
Opportunity for space research (think cameras taking pictures of
Mars, or analyzing rocks etc.) have fairly fast time lines(weeks, not
months or years) for relase of data to general public, and you have
to put your plans and budgets for public distribution in the
proposal.  No more holding onto the data, "recalibrating and
reprocessing",  until everyone gets their dissertations done.


From lenoxx at tempemusic.com  Wed Jan 31 11:08:17 2007
From: lenoxx at tempemusic.com (lenoxx at tempemusic.com)
Date: Wed, 31 Jan 2007 12:08:17 -0700
Subject: [Beowulf] massive parallel processing application required
In-Reply-To: <45BFE17F.5080901@brookes.ac.uk>
References: <45BE8E7E.4010808@brookes.ac.uk>	<36397.192.168.1.1.1170189086.squirrel@mail.eadline.org>
	<45BFE17F.5080901@brookes.ac.uk>
Message-ID: <45C0E921.7090600@tempemusic.com>

You can try:
CMAQ
www.cmascenter.org
MM5
http://www.mmm.ucar.edu/mm5/
WRF
http://www.wrf-model.org/index.php


Any of the above could be used and input data can be gathered from many 
sources. If you want some WRF inputs, let me know. I don't currently 
have any CMAQ or MM5 inputs though.


Mitchell Wisidagamage wrote:

> Thank you for the reply. what I mean by massively parallel processing 
> is "compute intensive" problem where I will be able to process on many 
> hosts independently in parallel (by parallelizing the algorithms). 
> Apologies if I'm repeating what you already know. Currently there are 
> 9 hosts on Beowulf cluster in Lab running on UltraSPARC 5 (yes, very 
> old machines!).
>
> "raw data" I'm looking for are the input files required for the 
> application, so I can "process" them. I do not have any specific 
> application in mind. I will be developing the different components of 
> the application such as job scheduling, fault tolerance, fault 
> recovery, etc for my dissertation. I will be useing PVM or MPI for 
> message passing and programming in c.
>
> But I do have any specific application in mind and I don't mind any 
> application. I can come up with say, "data mining" but it's pointless 
> unless I have the "raw data" files to process and know the patterns 
> I'm looking for.
>
> Application design and programming model all depends on the type of 
> application I get my hands on. I'm quite desperate for a "problem" 
> since I have to submit my proposal end of next week. Any help would he 
> greatly appreciated. I'm sure everyone here has lot of experience in 
> distributed processing and hopefully can be of some help. Sorry for 
> the long post.
>
>
> Thanks,
>  Mitchell
>
>
> Douglas Eadline wrote:
>
>> You may want to quantify "massively parallel"
>> and define "raw data" (i.e. how many
>> processors and data for which specific application?)
>>
>>
>>   --
>>   Doug
>>
>>
>>> Hi all,
>>>   As part of my dissertation, I'm looking for "raw data" which will be
>>> used for massive parallel processing using Beuwulf cluster (with the 
>>> use
>>> of PVM or MPI). I tried looking for e-science raw data (and the
>>> computations required on it) such as bioinformatics, fluid dynamics,
>>> etc. but without any luck.
>>>
>>> Anyone has any idea of getting some raw data so I can give compute
>>> intensive "work" to the nodes?
>>>
>>> Any pointers/hints/tips would be very much appriciated.
>>>
>>> Best wishes,
>>>   Mitchell
>>>
>>>
>>>
>>> _______________________________________________
>>> Beowulf mailing list, Beowulf at beowulf.org
>>> To change your subscription (digest mode or unsubscribe) visit
>>> http://www.beowulf.org/mailman/listinfo/beowulf
>>>
>>> !DSPAM:45bf9f33277683326710967!
>>>
>>
>>
>> -- 
>> Doug
>
>
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org
> To change your subscription (digest mode or unsubscribe) visit 
> http://www.beowulf.org/mailman/listinfo/beowulf
>


From angelv at iac.es  Wed Jan 31 11:29:48 2007
From: angelv at iac.es (Angel de Vicente)
Date: 31 Jan 2007 19:29:48 +0000
Subject: [Beowulf] building my first cluster
In-Reply-To: <c84311bb0701291911i3dd0870fl6530a34db34e8c39@mail.gmail.com>
References: <c84311bb0701291911i3dd0870fl6530a34db34e8c39@mail.gmail.com>
Message-ID: <82ejpbc7v7.fsf@kohji.angelv.es>

Hi Mark,

> Other than just saying hello here, I do have a question.  For this
> modest sized cluster, how important is it to have a cluster manager
> like OSCAR?  Will it be just as time consuming to just learn a flavor
> of MPI as it would be to learn to use OSCAR (for example)?  I will
> primarily be using CPMD for my calculations, but may want to try out
> abinit and DFT++.

If this is a personal cluster, and you are going to be the only user,
installing cluster management software would probably be not
worthwhile. If you make sure you install the same software in all the
nodes, connect them with a switch, perhaps install NFS for your home
directory and install a version of MPICH, you probably have all you
need for many happy hours of parallel computation.

Cheers,
Angel de Vicente