HPL residual check failure

연규정 kjyoun at netstech.com
Mon Sep 3 05:22:41 PDT 2001


Hi
When I was doing HPL benchmark test using big matrix(bigger than 20,000 ) with many linux server(more than 20), sometimes I got residual check error as attached. 
When I got residual check error, I turned off my linux servers for several hours and then tried again. And usually it worked - I don't know the reason.
Heat is suspicious. But, is it really heat problem?
Is there anybody who have experienced similar problem or know the reason?
please help me.

Thanks in advance! 

Keaton


HPL result files------------------------------------------------------------

============================================================================
T/V                N    NB     P     Q               Time             Gflops
----------------------------------------------------------------------------
W11R2C4        21000   200     6     6             702.80          8.786e+00
----------------------------------------------------------------------------
||Ax-b||_oo / ( eps * ||A||_1  * N        ) =        0.0272768 ...... PASSED
||Ax-b||_oo / ( eps * ||A||_1  * ||x||_1  ) =        0.0140749 ...... PASSED
||Ax-b||_oo / ( eps * ||A||_oo * ||x||_oo ) =        0.0026585 ...... PASSED
============================================================================
T/V                N    NB     P     Q               Time             Gflops
----------------------------------------------------------------------------
W11R2C4        23000   200     6     6             866.35          9.364e+00
----------------------------------------------------------------------------
||Ax-b||_oo / ( eps * ||A||_1  * N        ) =     3255.3898794 ...... FAILED
||Ax-b||_oo / ( eps * ||A||_1  * ||x||_1  ) =     7833.1904572 ...... FAILED
||Ax-b||_oo / ( eps * ||A||_oo * ||x||_oo ) =     1364.3123654 ...... FAILED
||Ax-b||_oo  . . . . . . . . . . . . . . . . . =           0.000049
||A||_oo . . . . . . . . . . . . . . . . . . . =        5827.145943
||A||_1  . . . . . . . . . . . . . . . . . . . =        5836.795619
||x||_oo . . . . . . . . . . . . . . . . . . . =           2.390054




More information about the Beowulf mailing list