hard disk reliability

Joel Jaeggli joelja@darkwing.uoregon.edu
Thu, 3 Jun 1999 11:45:33 -0400


On a 13-node cluster: 2 bad power supplies and one dead disk (a Western
Digital 2 GB IDE) in a year and a half.

On Thu, 3 Jun 1999, Christoph Wasshuber wrote:

> Some days ago someone mentioned that one of
> the big benefits of running a diskless cluster
> is the increased reliability. Hard disks are
> the most unreliable part in PCs. Does anybody
> have manufacturer numbers like MTBF (mean time
> between failure)?
> 
> I would also be interested in comments from
> people running beowulfs with 100 or more
> nodes, where every node has a hard disk. Do
> you guys exchange a hard disk every month?
> Or even every week?
> 
> How serious is the hard disk reliability issue
> in reality?

Probably not as serious as the cheapo Chinese power supply reliability
issue.
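
For a rough sense of scale, the 13-node numbers can be extrapolated to a
100-node cluster. This is only a back-of-the-envelope sketch: it assumes
one disk and one power supply per node, a constant failure rate, and
independent failures, and one or two observed failures is a very small
sample. The function names are made up for illustration.

```python
# Back-of-the-envelope extrapolation from the 13-node figures.
# Assumptions (not manufacturer data): one disk and one power supply per
# node, constant failure rate, independent failures.

HOURS_PER_YEAR = 24 * 365


def observed_mtbf_hours(units, years, failures):
    """Point estimate of MTBF: total unit-hours of service per failure."""
    return units * years * HOURS_PER_YEAR / failures


def expected_failures_per_year(units, mtbf_hours):
    """Expected failures per year for `units` parts with the given MTBF."""
    return units * HOURS_PER_YEAR / mtbf_hours


# 13 nodes, 1.5 years: 1 dead disk, 2 bad power supplies.
disk_mtbf = observed_mtbf_hours(units=13, years=1.5, failures=1)
psu_mtbf = observed_mtbf_hours(units=13, years=1.5, failures=2)

for nodes in (13, 100):
    disks = expected_failures_per_year(nodes, disk_mtbf)
    psus = expected_failures_per_year(nodes, psu_mtbf)
    print(f"{nodes:3d} nodes: ~{disks:.1f} disks/year, ~{psus:.1f} power supplies/year")
```

Under those assumptions a 100-node cluster would lose about 5 disks a
year, roughly one every two to three months, and about twice as many
power supplies.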


> Chris....
> 

-------------------------------------------------------------------------- 
Joel Jaeggli				       joelja@darkwing.uoregon.edu    
Academic User Services			     consult@gladstone.uoregon.edu
     PGP Key Fingerprint: 1DE9 8FCA 51FB 4195 B42A 9C32 A30D 121E
--------------------------------------------------------------------------
It is clear that the arm of criticism cannot replace the criticism of
arms.  Karl Marx -- Introduction to the critique of Hegel's Philosophy of
the right, 1843.