[Pkg-ganeti-devel] [ganeti] 99/165: Add design document about N+1 redundancy for shared storage

Tue Aug 11 13:53:17 UTC 2015

This is an automated email from the git hooks/post-receive script.

apoikos pushed a commit to branch master
in repository ganeti.

commit a8491e64ed08ca80d3d99643f637535cecef9f9d
Author: Klaus Aehlig <aehlig at google.com>
Date:   Fri Mar 27 10:00:00 2015 +0100

    Add design document about N+1 redundancy for shared storage
    
    Add a design document describing how Ganeti will take care of N+1
    redundancy in the presence of shared storage. As opposed to DRBD,
    instances using shared storage can be started on any node. Therefore,
    at no particular node we have to reserve memory---however, in total
    enough memory for any node to fail has to be reserved nevertheless.
    
    Signed-off-by: Klaus Aehlig <aehlig at google.com>
    Reviewed-by: Petr Pudlak <pudlak at google.com>
---
 Makefile.am                              |  1 +
 doc/design-draft.rst                     |  1 +
 doc/design-shared-storage-redundancy.rst | 73 ++++++++++++++++++++++++++++++++
 3 files changed, 75 insertions(+)

diff --git a/Makefile.am b/Makefile.am
index 17620bc..68107f5 100644
--- a/Makefile.am
+++ b/Makefile.am
@@ -709,6 +709,7 @@ docinput = \
 	doc/design-resource-model.rst \
 	doc/design-restricted-commands.rst \
 	doc/design-shared-storage.rst \
+	doc/design-shared-storage-redundancy.rst \
 	doc/design-ssh-ports.rst \
 	doc/design-storagetypes.rst \
 	doc/design-sync-rate-throttling.rst \
diff --git a/doc/design-draft.rst b/doc/design-draft.rst
index 88f16ec..c589b56 100644
--- a/doc/design-draft.rst
+++ b/doc/design-draft.rst
@@ -27,6 +27,7 @@ Design document drafts
    design-multi-storage-htools.rst
    design-dedicated-allocation.rst
    design-allocation-efficiency.rst
+   design-shared-storage-redundancy.rst
 
 .. vim: set textwidth=72 :
 .. Local Variables:
diff --git a/doc/design-shared-storage-redundancy.rst b/doc/design-shared-storage-redundancy.rst
new file mode 100644
index 0000000..14e8bc1
--- /dev/null
+++ b/doc/design-shared-storage-redundancy.rst
@@ -0,0 +1,73 @@
+=================================
+N+1 redundancy for shared storage
+=================================
+
+.. contents:: :depth: 4
+
+This document describes how N+1 redundancy is achieved
+for instances using shared storage.
+
+
+Current state and shortcomings
+==============================
+
+For instances with DRBD as disk template, if their primary node
+fails, there is only one node where the instance can be restarted
+immediately. Therefore, ``htools`` reserve enough memory on that
+node to cope with the failure of a single node.
+Instances using shared storage, however, can be restarted on any
+node, which implies that memory does not have to be reserved on
+any particular node. This led to the current state, where no
+memory is reserved at all, so even a large cluster can run out
+of capacity.
+
+Proposed changes
+================
+
+Definition of N+1 redundancy in the presence of shared storage
+--------------------------------------------------------------
+
+A cluster is considered N+1 redundant if, for every node, all
+DRBD instances can be migrated out and then all shared-storage
+instances can be relocated to other nodes without moving
+instances on other nodes. This is precisely the operation carried
+out after a node fails. Obviously, simulating failure and evacuation
+for every single node is an expensive operation.
+
+Basic Considerations
+--------------------
+
+For DRBD, keeping N+1 redundancy is affected by moving instances and
+balancing the cluster. Moreover, taking it into account for balancing
+can help :doc:`design-allocation-efficiency`. Hence, N+1 redundancy
+for DRBD is to be taken into account for all choices affecting instance
+location, including instance allocation and balancing.
+
+Shared-storage instances can move anywhere within the node group.
+So, in practice, this is mainly a question of capacity planning,
+especially if most instances have the same size. Nevertheless, offcuts
+left when instances don't fill a node entirely must not be ignored.
+
+
+Modifications to existing tools
+-------------------------------
+
+- ``hail`` will compute and rank possible allocations as usual. However,
+  before returning a choice it will filter out allocations that are
+  not N+1 redundant.
+
+- Normal ``gnt-cluster verify`` will not be changed; in particular,
+  it will still check for DRBD N+1 redundancy, but not for shared
+  storage N+1 redundancy. However, ``hcheck`` will verify shared storage
+  N+1 redundancy and report if that fails.
+
+- ``hbal`` will consider and rank moves as usual. However, before deciding
+  on the next move, it will filter out those moves that lead from a
+  shared storage N+1 redundant configuration into one that isn't.
+
+- ``hspace`` computing the capacity for DRBD instances will be unchanged.
+  For shared storage instances, however, it will first evacuate one node
+  and then compute capacity as normal, pretending that node was offline.
+  While this technically deviates from iteratively doing what hail does,
+  it should still give a reasonable estimate of the cluster capacity without
+  significantly increasing the algorithmic complexity.
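
The definition of shared-storage N+1 redundancy in the document above can be illustrated with a minimal Python sketch. It is not taken from the commit and is not how ``htools`` implement the check. The function names and the data layout (free memory per node, instance memory sizes per node) are hypothetical, only memory is modelled, DRBD instances are ignored, and the greedy first-fit-decreasing placement is an assumption that may reject a cluster which an exhaustive search would accept.

```python
from typing import Dict, List


def can_evacuate(failed: str,
                 free_mem: Dict[str, int],
                 instances: Dict[str, List[int]]) -> bool:
    """Greedily try to restart all instances of `failed` on the other nodes.

    Hypothetical helper: `free_mem` maps node name to free memory and
    `instances` maps node name to the memory sizes of its shared-storage
    instances (same unit, e.g. MiB).
    """
    # Free memory on the surviving nodes; the failed node is excluded.
    remaining = {n: m for n, m in free_mem.items() if n != failed}
    # Place the largest instances first (first-fit decreasing heuristic).
    for size in sorted(instances.get(failed, []), reverse=True):
        target = next(
            (n for n in sorted(remaining) if remaining[n] >= size), None)
        if target is None:
            return False  # no surviving node can host this instance
        remaining[target] -= size
    return True


def is_n_plus_one_redundant(free_mem: Dict[str, int],
                            instances: Dict[str, List[int]]) -> bool:
    """Simulate the failure of every single node in turn."""
    return all(can_evacuate(node, free_mem, instances) for node in free_mem)


# Offcuts matter: the survivors have 3072 MiB free in total, which equals
# the memory needed, but each of them can host only one 1024 MiB instance.
free = {"node1": 1536, "node2": 1536, "node3": 1536}
insts = {"node1": [1024, 1024, 1024], "node2": [], "node3": []}
redundant = is_n_plus_one_redundant(free, insts)  # False for this layout
```

Simulating every node failure in this way is the expensive operation the design mentions, which is why the proposed ``hspace`` change evacuates only one node before computing capacity.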

-- 
Alioth's /usr/local/bin/git-commit-notice on /srv/git.debian.org/git/pkg-ganeti/ganeti.git