[Beowulf] Problems with a JS21 - Ah, the networking...
Many of your questions may have already been answered in earlier discussions or in the FAQ. The search results page will indicate current discussions as well as past list serves, articles, and papers.
Bruce Allen ballen at gravity.phys.uwm.eduFri Sep 28 22:02:10 PDT 2007
- Previous message: [Beowulf] Problems with a JS21 - Ah, the networking...
- Next message: [Beowulf] Problems with a JS21 - Ah, the networking...
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
> The myrinet connection was working right, but sometimes a user program > just got stuck - one of the processes was sleeping, and all others were > running. Then, the program hangs. > > Any suggestions? I can provide any log necessary. Ivan, you probably already know this, but if not it can be very useful. If your cluster is Linux based, then you can often use the 'strace' utility on the stuck user program to understand why it is sleeping, for example what message is it waiting for that is not arriving. This might help you in diagnosing the problem. Cheers, Bruce
- Previous message: [Beowulf] Problems with a JS21 - Ah, the networking...
- Next message: [Beowulf] Problems with a JS21 - Ah, the networking...
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
More information about the Beowulf mailing list
