[Beowulf] Beowulf cluster mpi problem


chandru chandu_dreams2001 at yahoo.co.in
Fri Jan 6 04:04:54 PST 2006


Hi,
I have built a Linux cluster with a master and a
slave. The communication medium is ssh. ssh works
without any passwords; this has been achieved using
public keys. I have installed MPICH 1.2.7p1. I am
using Fedora Core 2.

When I run an MPI program on the master:

If I run mpirun with 1 processor, everything works
fine.

[root@master basic]# mpirun -np 1 cpi
Process 0 of 1 on master.mydomain.com
pi is approximately 3.1415926544231341, Error is 0.0000000008333410
wall clock time = 0.000634

But if I run it with 2 processors, it gives me this
error message:

[root@master basic]# mpirun -np 2 cpi
rm_3771:  p4_error: rm_start: net_conn_to_listener failed: 32902
p0_4293:  p4_error: Child process exited while making connection to remote process on client1.mydomain.com: 0
p0_4293: (10.253768) net_send: could not write to fd=4, errno = 32
[root@master basic]#

I do not know where the problem is. I have updated the
machines.LINUX file; it contains the client's host name,
"client1.mydomain.com". An SSH connection exists between
master and client.
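
For reference, the machines file entries could be sanity-checked with
something like the sketch below. It is only an illustration, not part of
MPICH: the function names parse_machines and check_hosts are made up
here, and it assumes the file lists one host per line, optionally as
"host:ncpus", with "#" starting a comment. It only shows what address
each listed name resolves to on the machine where it is run.

```python
import socket


def parse_machines(text):
    """Return (hostname, ncpus) pairs from machines-file text.

    Assumed format: one host per line, optional ":ncpus" suffix,
    "#" starts a comment. ncpus defaults to 1 when omitted.
    """
    entries = []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and blank lines
        if not line:
            continue
        host, _, count = line.partition(":")
        count = count.strip()
        entries.append((host.strip(), int(count) if count else 1))
    return entries


def check_hosts(entries, resolve=socket.gethostbyname):
    """Map each hostname to its resolved address, or None if lookup fails."""
    results = {}
    for host, _ in entries:
        try:
            results[host] = resolve(host)
        except OSError:  # socket.gaierror is a subclass of OSError
            results[host] = None
    return results
```

Running check_hosts(parse_machines(open("machines.LINUX").read())) on
both the master and the client would show whether each side resolves
the other's name, and to which address.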

Please help 

Thanks,
Chandu



		



