Archives


- Beowulf
- Beowulf Announce
- Scyld-users
- Beowulf on Debian

[Beowulf] Should I go for diskless or not?

Many of your questions may have already been answered in earlier discussions or in the FAQ. The search results page will indicate current discussions as well as past list serves, articles, and papers.

Search

Douglas Eadline deadline at eadline.org
Fri May 15 05:26:36 PDT 2009


You will note that I used sufficient wiggle words "usually" and
"generally" because in my experience it always depends.
And of course my comments are from my personal
experience. I have found that diskless allows for
the entire cluster to be "re provisioned" without
have into re-image disks. Reboots are quicker
(for the hardware I use) and since I use ram
disk approach (Warewulf/Perceus) I find that things
are a bit faster, also the diskless image has
minimal services running vs a disk-full distribution image.
(I fully understand the good admin can trim a
disk-full distribution)

There are plenty of arguments either way.  Back in 2006
I did a mini-poll on node disk space usage:

http://www.clustermonkey.net//component/option,com_poll/task,results/id,18/

So "in general, it varies". YMMV

--
Doug



>
> Doug:  Diskless provisioning is usually easier to manage.
>
> Hmm, not sure I buy that one. Pretty much any decent cluster distribution
> should:
> * allow you to add a compute node without much more than plugging it in
>   and telling it to PXE boot (diskless or diskfull)
> * Allow you to push a configuration cluster wide
> * allow you to reinstall/reboot all nodes.
>
> Sure installing 100 diskfull nodes takes more network bandwidth than
> booting
> 100 diskless nodes.  The flip side is booting 100 diskless nodes takes
> more
> bandwidth then 100 disk nodes.  For practical uses of clusters either way
> booting/installation is approximately 0% of the annual network bandwidth.
> Certainly installing 1000 nodes from a single fileserver would take quite
> awhile, but various technologies (bit-torrent and broadcast) remove that
> bottleneck.  I'm kinda curious since diskless nodes don't really install
> how
> do they handle heterogeneous hardware.  Say a dead motherboard comes back
> with
> a new pci-id or 3?
>
> Doug:  In general diskless is faster.
>
> At what?  Diskless booting is faster than diskfull booting or diskfull
> installation.  Application performance should be pretty much the same.
> The
> worst case scenarios like installing 1000 compute nodes from the head node
> are
> usually dealt with by using broadcasts or bit-torrent.
>
> So it depends, my current thinking is that it's not worth the man hours to
> do
> it yourself unless you have a larger cluster.  If it's supported by your
> cluster distribution it could easily be worth it.  In the whole scheme of
> things I'd worry about diskless last.  Decide on your cluster distribution
> based on your application and user needs, systems administrator
> experience,
> and budget.
>
>
> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org sponsored by Penguin Computing
> To change your subscription (digest mode or unsubscribe) visit
> http://www.beowulf.org/mailman/listinfo/beowulf
>
> --
> This message has been scanned for viruses and
> dangerous content by MailScanner, and is
> believed to be clean.
>


--
Doug

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.




More information about the Beowulf mailing list