Mysterious kernel hangs

Pfenniger Daniel daniel.pfenniger at obs.unige.ch
Thu Mar 15 06:18:34 PST 2001


Felix Rauch wrote:
> 
> We recently bought a new 16 node cluster with dual 1 GHz PentiumIII
> nodes, but machines mysteriously freeze :-(
...
If this depends neither on the kernel nor on communications, I would
suspect a relation with temperature.  Typically, during a compute-intensive
task the temperature can rise by a few degrees, and components such as
memory or the processor can stop working without any error message!

A further action might be to increase the node cooling. 
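
Before changing the cooling, one way to test the temperature idea is to
log the readings on each node and sync them to disk, so that the last
values written before a freeze survive the reboot.  Below is a minimal
sketch in Python.  It assumes a Linux kernel that exposes readings as
/sys/class/thermal/thermal_zone*/temp in millidegrees Celsius; older
kernels or other sensor drivers will need a different path.  The function
names and parameters are made up for illustration.

```python
import glob
import os
import time


def read_zone_temps(base="/sys/class/thermal"):
    """Return {zone_name: degrees_C} for each readable thermal zone under base.

    Assumption: each zone is a directory thermal_zone*/ containing a file
    'temp' that holds an integer in millidegrees Celsius.
    """
    temps = {}
    for path in sorted(glob.glob(os.path.join(base, "thermal_zone*", "temp"))):
        zone = os.path.basename(os.path.dirname(path))
        try:
            with open(path) as f:
                temps[zone] = int(f.read().strip()) / 1000.0
        except (OSError, ValueError):
            continue  # skip zones that cannot be read or parsed
    return temps


def log_temps(logfile, interval=10.0, samples=None, base="/sys/class/thermal"):
    """Append timestamped readings to logfile every `interval` seconds.

    Each line is flushed and fsync'ed so it reaches the disk even if the
    node freezes right afterwards. With samples=None the loop runs until
    the process is stopped.
    """
    count = 0
    with open(logfile, "a") as out:
        while samples is None or count < samples:
            temps = read_zone_temps(base)
            readings = " ".join(f"{z}={t:.1f}" for z, t in temps.items())
            out.write(f"{time.strftime('%Y-%m-%d %H:%M:%S')} {readings}\n")
            out.flush()
            os.fsync(out.fileno())
            count += 1
            if samples is None or count < samples:
                time.sleep(interval)
```

Calling log_temps("node_temps.log") on each node while the compute job
runs gives a per-node history.  If the frozen nodes consistently show
the highest last readings, that supports the temperature explanation.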

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 Dr Daniel Pfenniger 			  | Daniel.Pfenniger at obs.unige.ch
 Geneva Observatory, University of Geneva | tel: +41 (22) 755 2611 
 CH-1290 Sauverny, Switzerland		  | fax: +41 (22) 755 3983
__________________________________________________________________________




