[Beowulf] RAID for home beowulf

Tony Travis a.travis at abdn.ac.uk
Sat Oct 3 15:30:39 PDT 2009


Tomislav Maric wrote:
> [...]
> I've seen Centos mentioned a lot in connection to HPC, am I making a
> mistake with Ubuntu??

Hello, Tomislav.

[Just let me put my flame-proof trousers on...]

I know a lot of HPC people on this list use RH-based distros, but I use 
Ubuntu for HPC and I think it's very good. In fact I started a thread on 
the Ubuntu forums about EasyUbuntuClustering:

   http://ubuntuforums.org/showthread.php?t=1030849

I used RH6-9, and Fedora core2, but I switched to Debian and now Ubuntu.

>> You also need to be aware that RAID5 is not so good when writing to the
>> disk, because parity has to be calculated and written to the disk. In
>> fact this performance penalty has lead to a campaign against RAID5:
>>
>>    http://www.baarf.com/
> 
> Okaay. :) There's war going on against it.

This campaign really made me think twice about what I was doing using 
RAID5. I lied to you (a bit) because I've bought more 3ware 8006-2's to 
put /home on RAID10 for our Beowulf servers. I must admit that hot-swap 
is one of the main reasons, but BAARF did come into it as well.

>[...]
> Yeah, but isn't RAID1 used for disk mirroring? How then would I get any
> speedup? From what I've read so far, data stripping is where I get the
> performance boost when using RAID: there's no real parallel
> writing/seeking applied to single data stream in RAID1...

You don't get a speedup when writing, but you avoid the performance 
penalty of writing to RAID5. Writing to a RAID1 is essentially the same 
speed as writing to a single disk. However, you do get a performance 
benefit when reading from RAID1, and you decouple disk access between 
the 'system' disk and /home on the RAID1 if you follow my suggestion.

On COTS motherboards the main bottleneck is the PCI bus anyway, not the 
SATA disks. Have you benchmarked the disk i/o performance that your 
hardware is capable of?

> [...]
> Thanks, my only problem is that I've reached my financial limits for my
> home project so I have to work with what I have. :) I'll definitely save
> this e-mail in my "importants" folder.

I set out with similar ideas to yours, but in the end you get what you 
pay for. My four-disk software RAID systems work fine and they survive 
single disk failures without crashing or losing any data. However, we've 
had a couple of near double disk failures so I decided to put the system 
and /backups on hardware RAID1 instead. I'm still using software RAID5 
for /home, and I think this is a reasonable compromise between cost, HA 
and performance.

Good luck!

   Tony.
-- 
Dr. A.J.Travis, University of Aberdeen, Rowett Institute of Nutrition
and Health, Greenburn Road, Bucksburn, Aberdeen AB21 9SB, Scotland, UK
tel +44(0)1224 712751, fax +44(0)1224 716687, http://www.rowett.ac.uk
mailto:a.travis at abdn.ac.uk, http://bioinformatics.rri.sari.ac.uk/~ajt



More information about the Beowulf mailing list