[Beowulf] HPL Runtime error
Many of your questions may have already been answered in earlier discussions or in the FAQ. The search results page will indicate current discussions as well as past list serves, articles, and papers.
Langford, Lester Lester.Langford at ssc.nasa.govThu Apr 20 08:11:44 PDT 2006
- Previous message: [Beowulf] Determining NFS usage by user on a cluster
- Next message: [Beowulf] HPL Runtime error
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
Hello all, I am kind of new to cluster operation, but have built 4 small sized (16 48 nodes) for other researchers. Now, I¹m trying to run HPL on our 48-node dual Opteron 246 cluster. Got it to build the xhpl file, but when I try a test run I get the following error: a48:/local-io/linux_bench/hpl/bin/Linux_Op_246 # mpirun -np 4 xhpl /local-io/linux_bench/hpl/bin/Linux_Op_246/xhpl: Command not found. p0_4305: p4_error: Child process exited while making connection to remote process on ath64: 0 p0_4305: (46.070312) net_send: could not write to fd=4, errno = 32 ath64 is a workstation I am not using for this task. Any and all help on this matter would be greatly appreciated. Thanks, Les Langford Lester Langford Technology Development & Transfer NASA Test Operations Group Jacobs Sverdrup/ERC Bldg 8306 Stennis Space Center, MS 39529 - Lester.Langford at ssc.nasa.gov ( (228) 688-7221 Fax (228) 688-1106 -------------- next part -------------- An HTML attachment was scrubbed... URL: http://www.scyld.com/pipermail/beowulf/attachments/20060420/ec89c6eb/attachment.html
- Previous message: [Beowulf] Determining NFS usage by user on a cluster
- Next message: [Beowulf] HPL Runtime error
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
More information about the Beowulf mailing list
