[Beowulf] ECC support on motherboards?

Kimitoshi Takahashi ktaka at clustcom.com
Tue Jun 17 02:16:58 PDT 2008


Hi Perry,

> So another question is, how can you reliably test any of this stuff?
> It isn't like you can reliably induce single bit errors and see if the
> hardware catches them. (A special memory module that let you test
> would be a wonderful thing, but I've never even heard of such a thing.)
> 

We scrathed off one of the plates to reliably induce single bit errors, so that we could submit EDAC patch for i3000.

http://www.clustcom.com/content/view/89/32/
It's not in English, but I hope you'll know what you can do.

Using the same way, my collegue is writing edac code for i3200. 

Of course, there is non destructive way:
http://bluesmoke.sourceforge.net/testing.html

But we wanted to be sure if it's really causing bit errors. If both edac and   memtest86 didn't support the tested chipset, we wouldn't be able to tell if we really masked right pins and hence reliably causing bit errors.
That was the case for i3000 and i3200.

Please note that, pin arrangement is different for different type of memories. And we are not sure if this technique applies to different types of memory.

-- 
Kimitoshi Takahashi





More information about the Beowulf mailing list