Jerome Etienne wrote:
>
> lets see if i get it this time :)
>
> when i want to modify a shared resource, i check i have the lock for it.
> if not, i stop the whole cluster via a fencing mecanism and i execute the
> quorum algorithm. if i have the quorum, i lock the resource and start
> modifying it. correct ?
>
> note that a write lock can be owned by only one computer and
> that this kind of lock requires to stop the whole cluster to be
> modified so it is slow and not scalable.
> it seems a good idea to have kind of hierachical lock/quorum to avoid
> to stop the whole cluster each time a lock is modified.
Getting closer:
step 1: if you have the lock, you are OK.
step 2: If you don't have the lock, try to get it from
whoever has the lock. If you can, you are OK.
step 3: If you can't get the lock, because you can't talk to
the node that has it, you may be in a partition.
Check quorum and do the right thing.
step 4: If you can't get the lock, but there is no partition or
quorum problem, and the node that has the lock ought to
be giving it to you, file a bug against the locking system,
or dig into the code and figure it out.
Your locking system may hide the partition and reconfig from
you completely, by blocking you until you can proceed.
It might also signal a reconfig event, indicating you
should retry. It might also never return, and be the
place where your node is taken down because of a reconfig
or a fence.
Right now is a good time to be playing with locking systems.
The GFS people have one model, and Peter Braam was looking at
reimplementing the VAX/VMS model in a linux context. I haven't
heard that Peter is making any progress, and the GFS dlock daemon
still isn't quite what you really want.
-dB
should retry your lock acquisition.
>
> _______________________________________________________
> Linux-HA-Dev: Linux-HA-Dev@lists.tummy.com
> http://lists.tummy.com/mailman/listinfo/linux-ha-dev
> Home Page: http://linux-ha.org/