Archives


- Beowulf
- Beowulf Announce
- Scyld-users
- Beowulf on Debian

[Beowulf] Need some advise: Sun storage' management server hangs repeatedly

Many of your questions may have already been answered in earlier discussions or in the FAQ. The search results page will indicate current discussions as well as past list serves, articles, and papers.

Search

Sangamesh B forum.san at gmail.com
Mon Jan 18 01:13:05 PST 2010


Hello all,

     Thanks for your suggestions.
     But we lost the access to the cluster because of the delay.

    But I got useful information to debug next time.

Thanks,
Sangamesh
On Thu, Jan 14, 2010 at 10:38 AM, Skylar Thompson <skylar at cs.earlham.edu>wrote:

> Sangamesh B wrote:
> > Hi HPC experts,
> >
> >      I seek your advise/suggestion to resolve a storage(NAS) server'
> > repeated hanging problem.
> >
> >      We've a 23 nodes Rocks-5.1 HPC cluster. The Sun storage of
> > capacity 12 TB is connected to a management server Sun Fire X4150
> > installed with RHEL 5.3 and this server is connected to a Gigabit
> > switch which provides cluster private network. The home directories on
> > the cluster are NFS mounted from storage partitions across all nodes
> > including the master.
> >
> >    This server gets hanged repeatedly. As an initial troubleshooting
> > we installed Ganglia, to check network utilization. But its normal.
> > We're not getting how to troubleshoot it and resolve the problem. Can
> > anybode help us resolve this issue?
> Is there anything amiss according to the service processor?
>
> --
> -- Skylar Thompson (skylar at cs.earlham.edu)
> -- http://www.cs.earlham.edu/~skylar/<http://www.cs.earlham.edu/%7Eskylar/>
>
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.scyld.com/pipermail/beowulf/attachments/20100118/144631da/attachment.html


More information about the Beowulf mailing list