[Beowulf] anyone using SALT on your clusters?

Joe Landman landman at scalableinformatics.com
Tue Jul 2 06:08:20 PDT 2013


On 07/02/2013 04:32 AM, Hearns, John wrote:
>
>
>
>
> -----Original Message-----
>
>  > Our NoSQL database uses pub-sub for cluster membership, and we found
>  > that the hosts with screwed up RAID controllers could easily stay in
>  > the cluster even if they were really screwed up. We had to add some extra
>  > watchdogs and tests that the system disk is working.
>
> I've seen that before with a RAID controller when one disk fails.
> System pings, OS and TCP-IP stack are up, but the system disk has been
> marked write-only.
> I'm still pretty amazed that the Linux OS soldiers on in this state.
> One to watch out for!

I have a feeling its going to be big some day ...

This is one of the reasons why we like diskless/stateless boot.

-- 
Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics, Inc.
email: landman at scalableinformatics.com
web  : http://scalableinformatics.com
        http://scalableinformatics.com/siflash
phone: +1 734 786 8423 x121
fax  : +1 866 888 3112
cell : +1 734 612 4615


More information about the Beowulf mailing list