[Beowulf] ECC support on motherboards?
Kimitoshi Takahashi ktaka at clustcom.com
Tue Jun 17 02:16:58 PDT 2008
- Previous message: [Beowulf] June New York/Jersey HPC users meeting
- Next message: [Beowulf] Nvidia, cuda, tesla and... where's my double floating point?
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
Hi Perry,

> So another question is, how can you reliably test any of this stuff?
> It isn't like you can reliably induce single bit errors and see if the
> hardware catches them. (A special memory module that let you test
> would be a wonderful thing, but I've never even heard of such a thing.)

We scratched off one of the plates to reliably induce single-bit errors, so that we could submit an EDAC patch for the i3000.

http://www.clustcom.com/content/view/89/32/

It's not in English, but I hope you can see from it what to do. Using the same method, my colleague is writing EDAC code for the i3200.

Of course, there is a non-destructive way:

http://bluesmoke.sourceforge.net/testing.html

But we wanted to be sure that it was really causing bit errors. If neither EDAC nor memtest86 supported the chipset under test, we would not be able to tell whether we had really masked the right pins and hence were reliably causing bit errors. That was the case for the i3000 and i3200.

Please note that the pin arrangement differs between memory types, and we are not sure whether this technique applies to other types of memory.

--
Kimitoshi Takahashi
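
To check whether induced errors are actually being counted, one option is to read the per-controller counters that the Linux EDAC core exposes in sysfs. The sketch below is only an illustration and is not taken from the patch discussed above: it assumes a loaded EDAC driver for the chipset, the usual /sys/devices/system/edac/mc/mcN/ce_count and ue_count files, and a helper name (read_edac_counts) made up for this example.

```python
from pathlib import Path


def read_edac_counts(root="/sys/devices/system/edac/mc"):
    """Return {controller: {"ce_count": int|None, "ue_count": int|None}}.

    `root` is assumed to be the EDAC memory-controller sysfs directory;
    an empty dict is returned if it does not exist (no EDAC driver loaded).
    """
    counts = {}
    base = Path(root)
    if not base.is_dir():
        return counts
    # Each memory controller appears as mc0, mc1, ...
    for mc in sorted(base.glob("mc[0-9]*")):
        entry = {}
        for name in ("ce_count", "ue_count"):
            try:
                entry[name] = int((mc / name).read_text().strip())
            except (OSError, ValueError):
                # Counter missing or unreadable on this kernel/driver.
                entry[name] = None
        counts[mc.name] = entry
    return counts


if __name__ == "__main__":
    for controller, entry in read_edac_counts().items():
        print(controller, entry)
```

If the corrected-error count (ce_count) increases while the modified module is exercised, the driver is seeing the single-bit errors; if nothing appears, either the wrong pin is masked or the chipset is not supported, which is the ambiguity described above.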
