[Beowulf] PowerEdge SC 1435: Unexplained Crashes.
Many of your questions may have already been answered in earlier discussions or in the FAQ. The search results page will indicate current discussions as well as past list serves, articles, and papers.
Nifty niftyompi Mitch niftyompi at niftyegg.comFri Oct 17 18:20:53 PDT 2008
- Previous message: [Beowulf] PowerEdge SC 1435: Unexplained Crashes.
- Next message: [Beowulf] kdump / kexec to optain crash dumps from randomly crashing nodes.
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
On Fri, Oct 17, 2008 at 10:37:17AM -0500, Rahul Nabar wrote: ....... > > Stability has already changed. After I swapped motherboard+cpu. No > more dead nodes in over 2 weeks now (yay!) But I just want to make > sure this won't be a recurring problem with these SC1435's before we > go in for our next expansion. If hardware updates help then it is most likely HW.... Keep a good log... dmidecode will often give you hooks to track hardware with scripts. -- T o m M i t c h e l l Found me a new hat, now what?
- Previous message: [Beowulf] PowerEdge SC 1435: Unexplained Crashes.
- Next message: [Beowulf] kdump / kexec to optain crash dumps from randomly crashing nodes.
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
More information about the Beowulf mailing list
