mafunk at nmsu.edu
Mon Apr 9 10:30:32 PDT 2007
i hope this is the right mailing list to post to...
Anyway, i was wondering if i could get some advice/direction on how to debug
my mpich program. I am running on a scyld configuration. What i am trying
right now is the following:
mpirun -dbg=gdb -nolocal -np 32 exec
which starts the debugger in which i go
which then start the program. However, it doesn't get very far until it just
sits there. When i ps all the processes are defunced.
When i do the same thing except mpirun -dbg=gdb -nolocal -np 1 exec
and run it in the debugger, the program starts running well.
The reason i want to run on 32 processor though, is that it takes (on 32
procs) several hours till my program crashes. Also, i would like to be able
to keep the conditions under which it crashes intact as much as possible
(i.e. run on 32 procs rather than 1).
Does anyone have any advice? I am open to try out other things as well if
possible. I am just starting to learn debugger techniques for a parallel
More information about the Beowulf