D-Link switch and ecc-memory.
glindahl at hpti.com
Tue Jan 16 09:57:23 PST 2001
> My best estimate is that our system corrects one single bit error (SBE)
> per week in 37.5 GB of ECC memory. This translates into SBE event
> intervals of about 9 months per GB of RAM. Your mileage may vary...
Josip neglected to mention that he is at sea level. If you are at a higher
altitude, you will see more errors, because the cosmic-ray flux that causes
many of these soft errors increases with altitude.
CPlant's 2000 CPUs have a total of something like 500 GB of RAM. I
haven't computed the errors/GB/month for it (although we do monitor SBEs,
because doing so detects bad motherboards), but at Josip's rate, that would
be an interrupt about every 12 hours.
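
For anyone who wants to redo the arithmetic for their own cluster, here is a
minimal sketch in Python. It assumes the SBE rate scales linearly with the
amount of memory and takes the rough 500 GB figure at face value. The
function and variable names are made up for illustration and do not come
from any monitoring tool.

```python
# Back-of-the-envelope SBE scaling, assuming errors scale linearly with RAM.
HOURS_PER_WEEK = 7 * 24
WEEKS_PER_MONTH = 365.25 / 12 / 7  # average month, about 4.35 weeks


def sbe_rate_per_gb_week(errors_per_week, memory_gb):
    """Observed error rate normalized to errors per GB per week."""
    return errors_per_week / memory_gb


def weeks_between_errors(rate_per_gb_week, memory_gb):
    """Expected interval between errors for a given amount of memory."""
    return 1.0 / (rate_per_gb_week * memory_gb)


# Josip's estimate: one SBE per week in 37.5 GB of ECC memory.
rate = sbe_rate_per_gb_week(1.0, 37.5)

# Interval for a single GB, in months.
months_per_gb = weeks_between_errors(rate, 1.0) / WEEKS_PER_MONTH

# Interval for roughly 500 GB (the approximate CPlant total), in hours.
hours_cplant = weeks_between_errors(rate, 500.0) * HOURS_PER_WEEK

print(f"{months_per_gb:.1f} months per GB, {hours_cplant:.1f} hours for 500 GB")
```

This gives about 8.6 months per GB and 12.6 hours for 500 GB, in line with
the "about 9 months" and "every 12 hours" figures above. A site at altitude
would need its own measured rate in place of Josip's.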