mpptest report errors on linux

bal bal at morelinux.com
Wed Mar 28 21:12:05 PST 2001


hi everybody

I am trying to build a four node linux cluster. All machines are PIII
500 Mhz 
with 10/100 rtl 8139 ethrnet card connected using a 100Mbps switch.
mpich-1.2.1 is installed from source. mpptest reports following problem

./runmpptest -long -blocking -bisect  -fname long-blocking-bisect 
-gnuplot -np
Bisection tests-blocking
Exceeded 900.000000 seconds, aborting
[0] MPI Abort by user Aborting program !
[0] Aborting program!
p0_1458:  p4_error: : 1
bm_list_1459:  p4_error: interrupt SIGINT: 2
rm_l_2_21356:  p4_error: interrupt SIGINT: 2
rm_l_3_12637:  p4_error: interrupt SIGINT: 2
rm_l_1_14220:  p4_error: interrupt SIGINT: 2
p3_12636:  p4_error: interrupt SIGINT: 2
p2_21355:  p4_error: interrupt SIGINT: 2
p1_14219:  p4_error: interrupt SIGINT: 2
/usr/local/beowulf/mpich-1.2.1/bin/mpirun: line 1:  1458 Broken pipe

I have tested it on kernel 2.2.18 with mosix and without mosix, also on
kernel 2.2.17
Do I have to apply some patches to kernel?

It seems the problem is releted with increased message length.
Problem even appears when using option short in place of long for some
tests.

Thanks
bal at morelinux.com





More information about the Beowulf mailing list