Archives


- Beowulf
- Beowulf Announce
- Scyld-users
- Beowulf on Debian

[Beowulf] RAID for home beowulf

Many of your questions may have already been answered in earlier discussions or in the FAQ. The search results page will indicate current discussions as well as past list serves, articles, and papers.

Search

Mark Hahn hahn at mcmaster.ca
Sat Oct 3 14:19:30 PDT 2009


> It depends on your workload. RAID5 is good for large sequential writes,
> but sucks at small sequential writes because for every write it has to
> do a read to compare parity.

well, it's bad at small random writes.  small _sequential_ writes would 
be able to avoid reads for all but the first transaction.

IMO, raid5 is often unappealing because raid10 avoids the write penalty,
and raid6 is a lot more survivable.  ultimately it depends on your taste
in trading off performance, space efficiency, risk.

>> 2) I want to put the /home at the beginning of the disks go get faster
>> write/seek speeds, if the partitions are the same, software RAID doesn't
>> care where they are?
>
> I don't think this will buy you much performance. There probably is a
> measurable difference, but I don't think it's enough to worry about.

inner tracks are normally about 60% of the speed of outer tracks - 
that's for a normal density-optimized disk, not a latency-optimized 
(and therefore inherently small) "enterprise" disk.

>> 3) I'll leave the /boot partition on one of the 3 disks and it will NOT
>> be included in the RAID array, is this ok?
>
> Sure, but /boot is actually trivial to mirror. Just make sure your boot
> loader is on each disk in the mirror and that each disk is partitioned
> identically, and all you have to do if a drive dies is change the device
> you boot off of if a drive dies.

or better yet, don't bother booting of the local disk.  simply make your 
head/admin/master server reliable and net-boot.  it's likley that nodes 
won't be functional without the master server anyway, and net-booting 
doesn't mean you can't use the local disk for swap/scratch/...

>> 4) I've read about setting up parallel swaping via priority given to
>> swap partitions in fstab, but also how it would be ok to create RAID 1
>> array of swap partitions for the HA of the cluster. What should I choose?
>
> Any swapping at all will kill performance. I would get enough RAM to
> make sure you don't swap.

well, using swap space is harmless as long as you're not actually swapping
_in_ any nontrivial amount.

unless you have some very extreme parameters (uncheckpointable long jobs,
flakey hardware or power, banking-level reliability expectations),
I wouldn't bother raiding swap.



More information about the Beowulf mailing list