[Beowulf] Re: RRDtools graphs of temp from IPMI
csamuel at vpac.org
Tue Nov 11 12:08:21 PST 2008
----- "Dave Love" <d.love at liverpool.ac.uk> wrote:
> Chris Samuel <csamuel at vpac.org> writes:
> > The reason it worries about high load is that we
> > used to see processes hang trying to read from the
> > IPMI device, but haven't seen that with more recent
> > kernels.
> How recent? We've seen similar trouble on Supermicros with a SuSE
> 10.3 (220.127.116.11) kernel, hence doing it out-of-band, as I just posted.
I think they went away somewhere around 2.6.27.
> (Sorry I basically duplicated the in-band one of yours.) It involves
> the kipmi0 kernel thread going CPU-bound and sometimes getting a huge
> load average from failed ipmitool instances hanging around.
Sounds very much like what we were seeing on ours!
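
For anyone polling in-band from a script, here is a minimal sketch of the
two guards discussed in this thread: skip the read when the load average is
already high, and put a timeout on the ipmitool call so a stuck instance is
killed rather than left hanging around. The helper name run_with_timeout
and the default values are illustrative assumptions, not what our script
actually does. One caveat: a process blocked in uninterruptible sleep in the
kernel may not die on SIGKILL, so the timeout will not help in that case.

```python
import os
import subprocess


def run_with_timeout(cmd, timeout_s=10.0, max_load=None):
    """Run cmd and return its stdout, or None if skipped, timed out or failed.

    run_with_timeout, timeout_s and max_load are hypothetical names and
    values chosen for this sketch.
    """
    # Skip the read entirely if the 1-minute load average is already high,
    # so we do not add another reader to a machine that is struggling.
    if max_load is not None and os.getloadavg()[0] > max_load:
        return None
    try:
        # subprocess.run kills the child if it exceeds the timeout.
        result = subprocess.run(
            cmd, capture_output=True, text=True, timeout=timeout_s
        )
    except (subprocess.TimeoutExpired, OSError):
        return None
    if result.returncode != 0:
        return None
    return result.stdout
```

It could be called as, say,
run_with_timeout(["ipmitool", "sdr", "type", "Temperature"], timeout_s=15, max_load=8.0)
with the thresholds tuned to the machine in question.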
Christopher Samuel - (03) 9925 4751 - Systems Manager
The Victorian Partnership for Advanced Computing
P.O. Box 201, Carlton South, VIC 3053, Australia
VPAC is a not-for-profit Registered Research Agency