[Beowulf] Kill zombies after a parallel run
Many of your questions may have already been answered in earlier discussions or in the FAQ. The search results page will indicate current discussions as well as past list serves, articles, and papers.
mg mg.mailing-list at laposte.netTue May 2 00:49:18 PDT 2006
- Previous message: [Beowulf] 512 nodes Myrinet cluster Challanges
- Next message: [Beowulf] Kill zombies after a parallel run
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
Hi all, I use MPICH-1.2.5.2 to generate and run an FEM parallel application. During a parallel run, one process can crash, leaving the other processes run and OS commands have to be used for kill these zombies. So, does someone have a solution to avoid zombies after a failed parallel run: can the crashed process kill the other processes? Thanks, Mathieu
- Previous message: [Beowulf] 512 nodes Myrinet cluster Challanges
- Next message: [Beowulf] Kill zombies after a parallel run
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
More information about the Beowulf mailing list
