broken pipe at MPI startup

Daniel Ridge newt at scyld.com
Thu Jan 11 08:23:59 PST 2001


What version of MPI are you running and on what platform?

On Wed, 10 Jan 2001, Qian Peng wrote:

> I had a small cluster of 6 duals and recently doubled the size of it.  A
> program once ran fine on the 12 processors.  Now when I run it on all 24
> processors, I will occasionally get "Command terminated on signal 13" error
> at the mpirun level.  The broken pipe is when mpirun is trying to start the
> executables.  If I only use 12 processors, whether from 6 nodes or use one
> processor each from all 12 nodes, I cannot make this error happen.  I'm
> using mpirun with ssh.  It seems to be random when and on which node this
> error occurs.  Any insights on what may the possible causes be?  Thanks,

Regards,
	Dan Ridge
	Scyld Computing Corporation





More information about the Beowulf mailing list