Archives


- Beowulf
- Beowulf Announce
- Scyld-users
- Beowulf on Debian

[Beowulf] RAID for home beowulf

Many of your questions may have already been answered in earlier discussions or in the FAQ. The search results page will indicate current discussions as well as past list serves, articles, and papers.

Search

Tomislav Maric tomislav.maric at gmx.com
Sat Oct 3 15:02:27 PDT 2009


Mark Hahn wrote:
>> It depends on your workload. RAID5 is good for large sequential writes,
>> but sucks at small sequential writes because for every write it has to
>> do a read to compare parity.
> 
> well, it's bad at small random writes.  small _sequential_ writes would 
> be able to avoid reads for all but the first transaction.
> 
> IMO, raid5 is often unappealing because raid10 avoids the write penalty,
> and raid6 is a lot more survivable.  ultimately it depends on your taste
> in trading off performance, space efficiency, risk.

I can't wrap my mind around the RAID config, because I'm using software
RAID: it supports linear mode, and RAID level 0,1,4 or 5. Since it acts
as if a partition is a device, this gives me way too much freedom (more
to think about). :))

So, maybe the bold question to ask would be: what would be the best RAID
config for 3 HDDS and a max 6 node HPC cluster? Should I just use RAID 1
for the system partitions on one disk,  and RAID 0 for the simulation
data placed on the same partitions on other two disks: after
post-processing, the data is gone anyway... and with a good backup
strategy, I don't have to worry about RAID0 not recovering from a disk
fail...

> 
>>> 2) I want to put the /home at the beginning of the disks go get faster
>>> write/seek speeds, if the partitions are the same, software RAID doesn't
>>> care where they are?
>> I don't think this will buy you much performance. There probably is a
>> measurable difference, but I don't think it's enough to worry about.
> 
> inner tracks are normally about 60% of the speed of outer tracks - 
> that's for a normal density-optimized disk, not a latency-optimized 
> (and therefore inherently small) "enterprise" disk.
> 
>>> 3) I'll leave the /boot partition on one of the 3 disks and it will NOT
>>> be included in the RAID array, is this ok?
>> Sure, but /boot is actually trivial to mirror. Just make sure your boot
>> loader is on each disk in the mirror and that each disk is partitioned
>> identically, and all you have to do if a drive dies is change the device
>> you boot off of if a drive dies.
> 
> or better yet, don't bother booting of the local disk.  simply make your 
> head/admin/master server reliable and net-boot.  it's likley that nodes 
> won't be functional without the master server anyway, and net-booting 
> doesn't mean you can't use the local disk for swap/scratch/...
> 

Well, I want to configure the net boot for all diskless nodes and use
the master node and it's RAID for a performance gains with writing CFD
simulation data against network communication and to be able to scale
more easily.

>>> 4) I've read about setting up parallel swaping via priority given to
>>> swap partitions in fstab, but also how it would be ok to create RAID 1
>>> array of swap partitions for the HA of the cluster. What should I choose?
>> Any swapping at all will kill performance. I would get enough RAM to
>> make sure you don't swap.
> 
> well, using swap space is harmless as long as you're not actually swapping
> _in_ any nontrivial amount.
> 
> unless you have some very extreme parameters (uncheckpointable long jobs,
> flakey hardware or power, banking-level reliability expectations),
> I wouldn't bother raiding swap.

Excellent, thank you very much!

Best regards,
Tomislav

> _______________________________________________
> Beowulf mailing list, Beowulf at beowulf.org sponsored by Penguin Computing
> To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf
> 




More information about the Beowulf mailing list