[Beowulf] Lustre failover
bernd.schubert at fastmail.fm
Wed Sep 10 05:37:12 PDT 2008
On Wednesday 10 September 2008 13:41:18 andrew holway wrote:
> >From the Lustre manual:-
> With OST servers it is possible to have a load-balanced active/active
> Each node is the primary node for a group of OSTs, and the failover
> node for other
> groups. To expand the simple two-node example, we add ost2 which is primary
> on nodeB, and is on the LUNs nodeB:/dev/sdc1 and nodeA:/dev/sdd1. This
> demonstrates that the /dev/ identity can differ between nodes, but both
> devices must map to the same physical LUN. In this type of failover
> configuration, you can
> mount two OSTs on two different nodes, and format them from either node.
> With failover, two OSSs provide the same service to the Lustre network in
> parallel. In case
> of disaster or a failure in one of the nodes, the other OSS can
> provide uninterrupted
> filesystem services.
> For an active/active configuration, mount one OST on one node and another
> OST on the other node. You can format them from either node.
> Anyone done this on a production system?
Yes, sure we do this all the time.
> Experiances? Comments?
You should use either careful manual failover or heartbeat + stonith to
prevent accidental double mounts. Usually we have a setup like this:
Q-Leap Networks GmbH
More information about the Beowulf