[Beowulf] HPCC "intel_mpi" error

gossips J polk678 at gmail.com
Mon Mar 9 02:08:25 PDT 2009


Hi,

We are using ICR validation.

We are facing following problem while running below command:

cluster-check --debug --include_only intel_mpi /root/sample.xml


Problem is:

Output of cluster checker shows us that "intel_mpi" FAILED, where as by
looking into debug.out file it is seen that "Hello World" is returned from
all nodes.


I have 16 nodes configuration and we are running 8 proc/node.

Above behavior is observed with even 1 proc/node, 2 proc/node, 4 proc/node
as well. I also tried "rdma" and "rdssm" as a DEVICE in XML file but no luck.

If anyone can shed some light on this issue, it would be great help.

Another thing I would like to know is:

Is there a way to specify "-env RDMA_TRANSLATION_CACHE" option with
Intel Cluster Checker?

Awaiting for kind response,


Thanks in advance,

Polk.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.beowulf.org/pipermail/beowulf/attachments/20090309/44deb969/attachment.html>


More information about the Beowulf mailing list