lindahl at pbm.com
Fri Apr 3 13:48:12 PDT 2009
On Thu, Apr 02, 2009 at 09:05:26PM -0700, Ellis Wilson wrote:
> Though entertainingly put, it would be an error to say "ECC is a
> requirement" for everyone in a "cluster group". I can think of more
> than just a few purposes for clusters that truly do not require the
> accuracy guaranteed by ECC RAM.
The only big cluster I can think of built without ECC was built by a
guy whose research area was making computations reliable by doing
additional inexpensive computations to check the answer. While that
was really clever, the cluster was intended to be a general purpose
machine, and this answer-checking thing can only be efficiently done
for a subset of algorithms. Oddly enough, the cluster was subsequently
upgraded to ECC.
I have never run into a situation where a cluster would be improved by
leaving ECC off. I buy ECC for desktops, too, it's a small price to
pay to avoid engineer downtime.
More information about the Beowulf