Archives


- Beowulf
- Beowulf Announce
- Scyld-users
- Beowulf on Debian

[Beowulf] LAM_MPI problem on PBS

Many of your questions may have already been answered in earlier discussions or in the FAQ. The search results page will indicate current discussions as well as past list serves, articles, and papers.

Search

Onur Destanoğlu odestanoglu at gmail.com
Tue Aug 23 01:30:24 PDT 2005


Hi,

this is my PBS script;
#PBS -N firstscp
#PBS -l nodes=1:ppn=2
#PBS -l mem=4mb
#PBS -l walltime=1:00:00
#PBS -V
#PBS -m bea
PATH=/usr/kerberos/sbin:/usr/kerberos/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/usr/X11R6/bin:/root/bin
export PATH
lamboot -v
mpirun -v C first
lamhalt -v

my systems /home directory is nfs shared between all nodes, so there
is onl one hosts file in user niyazi's home directory, this is the
hosts file;

node00
node01
node02
node03
node04
node05

node00 is not my execution node it only runs pbs_server and pbs_sched.

when i run the script i encounter some problems like these;

one error file;

n-1<2289> ssi:boot:base:linear: booting n0 (localhost)
n-1<2289> ssi:boot:base:linear: finished

one output file:

LAM 7.1.1/MPI 2 C++/ROMIO - Indiana University

2294 first running on n0 (o)
Hello, I am 0 of the nodes : 1 

LAM 7.1.1/MPI 2 C++/ROMIO - Indiana University

Shutting down LAM
hreq: received HALT_ACK from n0 (bee01.bee-hive)
LAM halted

so what's is going wrong?




More information about the Beowulf mailing list