[Beowulf] Odd AMD quad core SuperMicro power off issues

Chris Samuel csamuel at vpac.org
Mon Jul 6 22:20:36 PDT 2009


----- "Jason Clinton" <jclinton at advancedclustering.com> wrote:

Hi Jason,

> We saw a similar power-off issue on a customer of ours who upgraded
> from 2220's to Barcelona's on a similar board; it was reproducible at
> the same failure rate on approximately 160 nodes. After trying just
> about everything under the sun, we wholesale replaced all the memory
> in the entire cluster. The power-offs ceased immediately thereafter
> and have not returned.

We saw that with Barcelona's, but instead going to the
2.3GHz (75W) Shanghai's solved the issue for us - we were
rather surprised to see it reappear with the 2.4GHz (55W)
Shanghai. :-(

cheers,
Chris
-- 
Christopher Samuel - (03) 9925 4751 - Systems Manager
 The Victorian Partnership for Advanced Computing
 P.O. Box 201, Carlton South, VIC 3053, Australia
VPAC is a not-for-profit Registered Research Agency



More information about the Beowulf mailing list