Write-affinity aware object deletion
When deleting objects in a multi-region Swift deployment with write affinity configured, users always get a 404 when deleting an object before it has been replicated to the appropriate nodes.

This patch adds a config item 'write_affinity_handoff_delete_count' so that operators can define how many local handoff nodes Swift should send requests to in order to get more candidates for the final response, or by default leave it to Swift to calculate the appropriate number.

Change-Id: Ic4ef82e4fc1a91c85bdbc6bf41705a76f16d1341
Closes-Bug: #1503161
parent 2d18ecdf4b
commit 831eb6e3ce
@ -1663,9 +1663,9 @@ ionice_priority None I/O scheduling p
|
|||||||
|
|
||||||
[proxy-server]
|
[proxy-server]
|
||||||
|
|
||||||
============================ =============== =====================================
|
====================================== =============== =====================================
|
||||||
Option Default Description
|
Option Default Description
|
||||||
---------------------------- --------------- -------------------------------------
|
-------------------------------------- --------------- -------------------------------------
|
||||||
use Entry point for paste.deploy for
|
use Entry point for paste.deploy for
|
||||||
the proxy server. For most
|
the proxy server. For most
|
||||||
cases, this should be
|
cases, this should be
|
||||||
@ -1843,7 +1843,27 @@ write_affinity_node_count 2 * replicas The number of local (as governed
|
|||||||
for the request. This option may be
|
for the request. This option may be
|
||||||
overridden in a per-policy
|
overridden in a per-policy
|
||||||
configuration section.
|
configuration section.
|
||||||
============================ =============== =====================================
|
write_affinity_handoff_delete_count auto The number of local (as governed by
|
||||||
|
the write_affinity setting) handoff
|
||||||
|
nodes to attempt to contact on
|
||||||
|
deletion, in addition to primary
|
||||||
|
nodes. Example: in geographically
|
||||||
|
distributed deployment, if replicas=3,
|
||||||
|
sometimes there may be 1 primary node
|
||||||
|
and 2 local handoff nodes in one region
|
||||||
|
holding the object after uploading but
|
||||||
|
before object replicated to the
|
||||||
|
appropriate locations in other regions.
|
||||||
|
In this case, include these handoff
|
||||||
|
nodes to send request when deleting
|
||||||
|
object could help make correct decision
|
||||||
|
for the response. The default value 'auto'
|
||||||
|
means Swift will calculate the number
|
||||||
|
automatically; the default value is
|
||||||
|
(replicas - len(local_primary_nodes)).
|
||||||
|
This option may be overridden in a
|
||||||
|
per-policy configuration section.
|
||||||
|
====================================== =============== =====================================
|
||||||
|
|
||||||
.. _proxy_server_per_policy_config:
|
.. _proxy_server_per_policy_config:
|
||||||
|
|
||||||
@ -1858,6 +1878,7 @@ options are:
|
|||||||
- ``read_affinity``
|
- ``read_affinity``
|
||||||
- ``write_affinity``
|
- ``write_affinity``
|
||||||
- ``write_affinity_node_count``
|
- ``write_affinity_node_count``
|
||||||
|
- ``write_affinity_handoff_delete_count``
|
||||||
|
|
||||||
The per-policy config section name must be of the form::
|
The per-policy config section name must be of the form::
|
||||||
|
|
||||||
@ -1887,6 +1908,7 @@ policy with index ``3``::
|
|||||||
read_affinity = r2=1
|
read_affinity = r2=1
|
||||||
write_affinity = r2
|
write_affinity = r2
|
||||||
write_affinity_node_count = 1 * replicas
|
write_affinity_node_count = 1 * replicas
|
||||||
|
write_affinity_handoff_delete_count = 2
|
||||||
|
|
||||||
.. note::
|
.. note::
|
||||||
|
|
||||||
|
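The 'auto' default described above, (replicas - len(local_primary_nodes)), can be illustrated with a minimal sketch. The function name, the node dictionaries, and the region layout below are assumptions made for illustration; only the formula comes from the patch.

```python
def auto_handoff_delete_count(primary_nodes, is_local):
    """Return replicas - len(local_primary_nodes) for one partition.

    Hypothetical helper: it mirrors the formula quoted in the docs,
    not the exact code path inside the proxy server.
    """
    local_primaries = [node for node in primary_nodes if is_local(node)]
    return len(primary_nodes) - len(local_primaries)


# Assumed 3-replica partition: one primary in region 1, two in region 2.
primaries = [
    {'id': 0, 'region': 1},
    {'id': 1, 'region': 2},
    {'id': 2, 'region': 2},
]

# Treat region 1 as "local", as write_affinity = r1 would.
count = auto_handoff_delete_count(primaries, lambda node: node['region'] == 1)
print(count)
```

With one local primary out of three, the sketch asks for two local handoff nodes, which matches the "1 primary node and 2 local handoff nodes" example in the option description.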
@ -82,9 +82,9 @@ Note that read_affinity only affects the ordering of primary nodes
|
|||||||
(see ring docs for definition of primary node), not the ordering of
|
(see ring docs for definition of primary node), not the ordering of
|
||||||
handoff nodes.
|
handoff nodes.
|
||||||
|
|
||||||
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
|
~~~~~~~~~~~~~~
|
||||||
write_affinity and write_affinity_node_count
|
write_affinity
|
||||||
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
|
~~~~~~~~~~~~~~
|
||||||
|
|
||||||
This setting makes the proxy server prefer local backend servers for
|
This setting makes the proxy server prefer local backend servers for
|
||||||
object PUT requests over non-local ones. For example, it may be
|
object PUT requests over non-local ones. For example, it may be
|
||||||
@ -97,9 +97,15 @@ the object won't immediately have any replicas in NY. However,
|
|||||||
replication will move the object's replicas to their proper homes in
|
replication will move the object's replicas to their proper homes in
|
||||||
both SF and NY.
|
both SF and NY.
|
||||||
|
|
||||||
Note that only object PUT requests are affected by the write_affinity
|
One potential issue with write_affinity is that end users may get a 404 error when
|
||||||
setting; POST, GET, HEAD, DELETE, OPTIONS, and account/container PUT
|
deleting objects before replication. The write_affinity_handoff_delete_count
|
||||||
requests are not affected.
|
setting is used together with write_affinity in order to solve that issue.
|
||||||
|
With its default configuration, Swift will calculate the proper number of
|
||||||
|
handoff nodes to send requests to.
|
||||||
|
|
||||||
|
Note that only object PUT/DELETE requests are affected by the write_affinity
|
||||||
|
setting; POST, GET, HEAD, OPTIONS, and account/container PUT requests are
|
||||||
|
not affected.
|
||||||
|
|
||||||
This setting lets you trade data distribution for throughput. If
|
This setting lets you trade data distribution for throughput. If
|
||||||
write_affinity is enabled, then object replicas will initially be
|
write_affinity is enabled, then object replicas will initially be
|
||||||
|
@ -236,6 +236,20 @@ use = egg:swift#proxy
|
|||||||
# This option may be overridden in a per-policy configuration section.
|
# This option may be overridden in a per-policy configuration section.
|
||||||
# write_affinity_node_count = 2 * replicas
|
# write_affinity_node_count = 2 * replicas
|
||||||
#
|
#
|
||||||
|
# The number of local (as governed by the write_affinity setting) handoff nodes
|
||||||
|
# to attempt to contact on deletion, in addition to primary nodes.
|
||||||
|
#
|
||||||
|
# Example: in geographically distributed deployment of 2 regions, if
|
||||||
|
# replicas=3, sometimes there may be 1 primary node and 2 local handoff nodes
|
||||||
|
# in one region holding the object after uploading but before object replicated
|
||||||
|
# to the appropriate locations in other regions. In this case, include these
|
||||||
|
# handoff nodes to send request when deleting object could help make correct
|
||||||
|
# decision for the response. The default value 'auto' means Swift will
|
||||||
|
# calculate the number automatically; the default value is
|
||||||
|
# (replicas - len(local_primary_nodes)). This option may be overridden in a
|
||||||
|
# per-policy configuration section.
|
||||||
|
# write_affinity_handoff_delete_count = auto
|
||||||
|
#
|
||||||
# These are the headers whose values will only be shown to swift_owners. The
|
# These are the headers whose values will only be shown to swift_owners. The
|
||||||
# exact definition of a swift_owner is up to the auth system in use, but
|
# exact definition of a swift_owner is up to the auth system in use, but
|
||||||
# usually indicates administrative responsibilities.
|
# usually indicates administrative responsibilities.
|
||||||
@ -264,6 +278,7 @@ use = egg:swift#proxy
|
|||||||
# read_affinity =
|
# read_affinity =
|
||||||
# write_affinity =
|
# write_affinity =
|
||||||
# write_affinity_node_count =
|
# write_affinity_node_count =
|
||||||
|
# write_affinity_handoff_delete_count =
|
||||||
|
|
||||||
[filter:tempauth]
|
[filter:tempauth]
|
||||||
use = egg:swift#tempauth
|
use = egg:swift#tempauth
|
||||||
|
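The per-policy override can be pictured as a lookup that falls back from a policy section to [proxy-server] and finally to 'auto'. The sketch below uses Python's standard configparser and a hypothetical helper, resolve_handoff_delete_count; it is not how Swift itself loads the file, and the sample values are made up for the example.

```python
import configparser

# Sample configuration assumed for illustration only.
SAMPLE_CONF = """
[proxy-server]
write_affinity = r2
write_affinity_handoff_delete_count = 2

[proxy-server:policy:0]
write_affinity = r1
write_affinity_handoff_delete_count = 3

[proxy-server:policy:1]
write_affinity = r3
"""

OPTION = 'write_affinity_handoff_delete_count'


def resolve_handoff_delete_count(parser, policy_index):
    """Hypothetical helper: per-policy value, else [proxy-server], else 'auto'."""
    section = 'proxy-server:policy:%d' % policy_index
    if parser.has_option(section, OPTION):
        return parser.get(section, OPTION)
    return parser.get('proxy-server', OPTION, fallback='auto')


parser = configparser.ConfigParser()
parser.read_string(SAMPLE_CONF)

# Policy 0 overrides the option; policy 1 inherits it from [proxy-server].
resolved = {index: resolve_handoff_delete_count(parser, index)
            for index in (0, 1)}
print(resolved)
```

The values stay strings here; turning 'auto' or a digit string into the value the proxy uses is a separate step.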
@ -1596,7 +1596,8 @@ class Controller(object):
|
|||||||
{'method': method, 'path': path})
|
{'method': method, 'path': path})
|
||||||
|
|
||||||
def make_requests(self, req, ring, part, method, path, headers,
|
def make_requests(self, req, ring, part, method, path, headers,
|
||||||
query_string='', overrides=None):
|
query_string='', overrides=None, node_count=None,
|
||||||
|
node_iterator=None):
|
||||||
"""
|
"""
|
||||||
Sends an HTTP request to multiple nodes and aggregates the results.
|
Sends an HTTP request to multiple nodes and aggregates the results.
|
||||||
It attempts the primary nodes concurrently, then iterates over the
|
It attempts the primary nodes concurrently, then iterates over the
|
||||||
@ -1613,11 +1614,16 @@ class Controller(object):
|
|||||||
:param query_string: optional query string to send to the backend
|
:param query_string: optional query string to send to the backend
|
||||||
:param overrides: optional return status override map used to override
|
:param overrides: optional return status override map used to override
|
||||||
the returned status of a request.
|
the returned status of a request.
|
||||||
|
:param node_count: optional number of nodes to send request to.
|
||||||
|
:param node_iterator: optional node iterator.
|
||||||
:returns: a swob.Response object
|
:returns: a swob.Response object
|
||||||
"""
|
"""
|
||||||
start_nodes = ring.get_part_nodes(part)
|
nodes = GreenthreadSafeIterator(
|
||||||
nodes = GreenthreadSafeIterator(self.app.iter_nodes(ring, part))
|
node_iterator or self.app.iter_nodes(ring, part)
|
||||||
pile = GreenAsyncPile(len(start_nodes))
|
)
|
||||||
|
node_number = node_count or len(ring.get_part_nodes(part))
|
||||||
|
pile = GreenAsyncPile(node_number)
|
||||||
|
|
||||||
for head in headers:
|
for head in headers:
|
||||||
pile.spawn(self._make_request, nodes, part, method, path,
|
pile.spawn(self._make_request, nodes, part, method, path,
|
||||||
head, query_string, self.app.logger.thread_locals)
|
head, query_string, self.app.logger.thread_locals)
|
||||||
@ -1628,7 +1634,7 @@ class Controller(object):
|
|||||||
continue
|
continue
|
||||||
response.append(resp)
|
response.append(resp)
|
||||||
statuses.append(resp[0])
|
statuses.append(resp[0])
|
||||||
if self.have_quorum(statuses, len(start_nodes)):
|
if self.have_quorum(statuses, node_number):
|
||||||
break
|
break
|
||||||
# give any pending requests *some* chance to finish
|
# give any pending requests *some* chance to finish
|
||||||
finished_quickly = pile.waitall(self.app.post_quorum_timeout)
|
finished_quickly = pile.waitall(self.app.post_quorum_timeout)
|
||||||
@ -1637,7 +1643,7 @@ class Controller(object):
|
|||||||
continue
|
continue
|
||||||
response.append(resp)
|
response.append(resp)
|
||||||
statuses.append(resp[0])
|
statuses.append(resp[0])
|
||||||
while len(response) < len(start_nodes):
|
while len(response) < node_number:
|
||||||
response.append((HTTP_SERVICE_UNAVAILABLE, '', '', ''))
|
response.append((HTTP_SERVICE_UNAVAILABLE, '', '', ''))
|
||||||
statuses, reasons, resp_headers, bodies = zip(*response)
|
statuses, reasons, resp_headers, bodies = zip(*response)
|
||||||
return self.best_response(req, statuses, reasons, bodies,
|
return self.best_response(req, statuses, reasons, bodies,
|
||||||
|
@ -128,7 +128,8 @@ class BaseObjectController(Controller):
|
|||||||
self.container_name = unquote(container_name)
|
self.container_name = unquote(container_name)
|
||||||
self.object_name = unquote(object_name)
|
self.object_name = unquote(object_name)
|
||||||
|
|
||||||
def iter_nodes_local_first(self, ring, partition, policy=None):
|
def iter_nodes_local_first(self, ring, partition, policy=None,
|
||||||
|
local_handoffs_first=False):
|
||||||
"""
|
"""
|
||||||
Yields nodes for a ring partition.
|
Yields nodes for a ring partition.
|
||||||
|
|
||||||
@ -141,6 +142,9 @@ class BaseObjectController(Controller):
|
|||||||
|
|
||||||
:param ring: ring to get nodes from
|
:param ring: ring to get nodes from
|
||||||
:param partition: ring partition to yield nodes for
|
:param partition: ring partition to yield nodes for
|
||||||
|
:param policy: optional, an instance of :class:`BaseStoragePolicy`
|
||||||
|
:param local_handoffs_first: optional, if True prefer primaries and
|
||||||
|
local handoff nodes first before looking elsewhere.
|
||||||
"""
|
"""
|
||||||
policy_options = self.app.get_policy_options(policy)
|
policy_options = self.app.get_policy_options(policy)
|
||||||
is_local = policy_options.write_affinity_is_local_fn
|
is_local = policy_options.write_affinity_is_local_fn
|
||||||
@ -148,23 +152,38 @@ class BaseObjectController(Controller):
|
|||||||
return self.app.iter_nodes(ring, partition, policy=policy)
|
return self.app.iter_nodes(ring, partition, policy=policy)
|
||||||
|
|
||||||
primary_nodes = ring.get_part_nodes(partition)
|
primary_nodes = ring.get_part_nodes(partition)
|
||||||
|
handoff_nodes = ring.get_more_nodes(partition)
|
||||||
|
all_nodes = itertools.chain(primary_nodes, handoff_nodes)
|
||||||
|
|
||||||
|
if local_handoffs_first:
|
||||||
|
num_locals = policy_options.write_affinity_handoff_delete_count
|
||||||
|
if num_locals is None:
|
||||||
|
local_primaries = [node for node in primary_nodes
|
||||||
|
if is_local(node)]
|
||||||
|
num_locals = len(primary_nodes) - len(local_primaries)
|
||||||
|
|
||||||
|
first_local_handoffs = list(itertools.islice(
|
||||||
|
(node for node in handoff_nodes if is_local(node)), num_locals)
|
||||||
|
)
|
||||||
|
preferred_nodes = primary_nodes + first_local_handoffs
|
||||||
|
else:
|
||||||
num_locals = policy_options.write_affinity_node_count_fn(
|
num_locals = policy_options.write_affinity_node_count_fn(
|
||||||
len(primary_nodes))
|
len(primary_nodes)
|
||||||
|
)
|
||||||
|
preferred_nodes = list(itertools.islice(
|
||||||
|
(node for node in all_nodes if is_local(node)), num_locals)
|
||||||
|
)
|
||||||
|
# refresh it; it moved when we computed preferred_nodes
|
||||||
|
handoff_nodes = ring.get_more_nodes(partition)
|
||||||
|
all_nodes = itertools.chain(primary_nodes, handoff_nodes)
|
||||||
|
|
||||||
all_nodes = itertools.chain(primary_nodes,
|
node_iter = itertools.chain(
|
||||||
ring.get_more_nodes(partition))
|
preferred_nodes,
|
||||||
first_n_local_nodes = list(itertools.islice(
|
(node for node in all_nodes if node not in preferred_nodes)
|
||||||
(node for node in all_nodes if is_local(node)), num_locals))
|
)
|
||||||
|
|
||||||
# refresh it; it moved when we computed first_n_local_nodes
|
return self.app.iter_nodes(ring, partition, node_iter=node_iter,
|
||||||
all_nodes = itertools.chain(primary_nodes,
|
policy=policy)
|
||||||
ring.get_more_nodes(partition))
|
|
||||||
local_first_node_iter = itertools.chain(
|
|
||||||
first_n_local_nodes,
|
|
||||||
(node for node in all_nodes if node not in first_n_local_nodes))
|
|
||||||
|
|
||||||
return self.app.iter_nodes(
|
|
||||||
ring, partition, node_iter=local_first_node_iter, policy=policy)
|
|
||||||
|
|
||||||
def GETorHEAD(self, req):
|
def GETorHEAD(self, req):
|
||||||
"""Handle HTTP GET or HEAD requests."""
|
"""Handle HTTP GET or HEAD requests."""
|
||||||
@ -589,10 +608,12 @@ class BaseObjectController(Controller):
|
|||||||
raise NotImplementedError()
|
raise NotImplementedError()
|
||||||
|
|
||||||
def _delete_object(self, req, obj_ring, partition, headers):
|
def _delete_object(self, req, obj_ring, partition, headers):
|
||||||
"""
|
"""Delete object considering write-affinity.
|
||||||
send object DELETE request to storage nodes. Subclasses of
|
|
||||||
the BaseObjectController can provide their own implementation
|
When deleting object in write affinity deployment, also take configured
|
||||||
of this method.
|
handoff nodes number into consideration, instead of just sending
|
||||||
|
requests to primary nodes. Otherwise (write-affinity is disabled),
|
||||||
|
go with the same way as before.
|
||||||
|
|
||||||
:param req: the DELETE Request
|
:param req: the DELETE Request
|
||||||
:param obj_ring: the object ring
|
:param obj_ring: the object ring
|
||||||
@ -600,11 +621,37 @@ class BaseObjectController(Controller):
|
|||||||
:param headers: system headers to storage nodes
|
:param headers: system headers to storage nodes
|
||||||
:return: Response object
|
:return: Response object
|
||||||
"""
|
"""
|
||||||
# When deleting objects treat a 404 status as 204.
|
policy_index = req.headers.get('X-Backend-Storage-Policy-Index')
|
||||||
|
policy = POLICIES.get_by_index(policy_index)
|
||||||
|
|
||||||
|
node_count = None
|
||||||
|
node_iterator = None
|
||||||
|
|
||||||
|
policy_options = self.app.get_policy_options(policy)
|
||||||
|
is_local = policy_options.write_affinity_is_local_fn
|
||||||
|
if is_local is not None:
|
||||||
|
primaries = obj_ring.get_part_nodes(partition)
|
||||||
|
node_count = len(primaries)
|
||||||
|
|
||||||
|
local_handoffs = policy_options.write_affinity_handoff_delete_count
|
||||||
|
if local_handoffs is None:
|
||||||
|
local_primaries = [node for node in primaries
|
||||||
|
if is_local(node)]
|
||||||
|
local_handoffs = len(primaries) - len(local_primaries)
|
||||||
|
|
||||||
|
node_count += local_handoffs
|
||||||
|
|
||||||
|
node_iterator = self.iter_nodes_local_first(
|
||||||
|
obj_ring, partition, policy=policy, local_handoffs_first=True
|
||||||
|
)
|
||||||
|
|
||||||
status_overrides = {404: 204}
|
status_overrides = {404: 204}
|
||||||
resp = self.make_requests(req, obj_ring,
|
resp = self.make_requests(req, obj_ring,
|
||||||
partition, 'DELETE', req.swift_entity_path,
|
partition, 'DELETE', req.swift_entity_path,
|
||||||
headers, overrides=status_overrides)
|
headers, overrides=status_overrides,
|
||||||
|
node_count=node_count,
|
||||||
|
node_iterator=node_iterator)
|
||||||
|
|
||||||
return resp
|
return resp
|
||||||
|
|
||||||
def _post_object(self, req, obj_ring, partition, headers):
|
def _post_object(self, req, obj_ring, partition, headers):
|
||||||
@ -725,8 +772,20 @@ class BaseObjectController(Controller):
|
|||||||
else:
|
else:
|
||||||
req.headers['X-Timestamp'] = Timestamp(time.time()).internal
|
req.headers['X-Timestamp'] = Timestamp(time.time()).internal
|
||||||
|
|
||||||
|
# Include local handoff nodes if write-affinity is enabled.
|
||||||
|
node_count = len(nodes)
|
||||||
|
policy = POLICIES.get_by_index(policy_index)
|
||||||
|
policy_options = self.app.get_policy_options(policy)
|
||||||
|
is_local = policy_options.write_affinity_is_local_fn
|
||||||
|
if is_local is not None:
|
||||||
|
local_handoffs = policy_options.write_affinity_handoff_delete_count
|
||||||
|
if local_handoffs is None:
|
||||||
|
local_primaries = [node for node in nodes if is_local(node)]
|
||||||
|
local_handoffs = len(nodes) - len(local_primaries)
|
||||||
|
node_count += local_handoffs
|
||||||
|
|
||||||
headers = self._backend_requests(
|
headers = self._backend_requests(
|
||||||
req, len(nodes), container_partition, container_nodes)
|
req, node_count, container_partition, container_nodes)
|
||||||
return self._delete_object(req, obj_ring, partition, headers)
|
return self._delete_object(req, obj_ring, partition, headers)
|
||||||
|
|
||||||
|
|
||||||
|
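The node ordering that the DELETE path builds when local_handoffs_first is set can be sketched on plain lists. This is a simplified illustration: delete_node_order and the node dictionaries are assumptions, and the real code works on ring iterators and passes the result through self.app.iter_nodes.

```python
import itertools


def delete_node_order(primary_nodes, handoff_nodes, is_local,
                      num_local_handoffs):
    """Order nodes as: primaries, first N local handoffs, then the rest.

    Hypothetical stand-in for the local_handoffs_first branch of
    iter_nodes_local_first, using lists instead of ring iterators.
    """
    first_local_handoffs = list(itertools.islice(
        (node for node in handoff_nodes if is_local(node)),
        num_local_handoffs))
    preferred_nodes = primary_nodes + first_local_handoffs
    remaining = [node for node in handoff_nodes
                 if node not in preferred_nodes]
    return preferred_nodes + remaining


# Assumed layout: region 1 is local; node ids are arbitrary.
primaries = [{'id': 0, 'region': 1},
             {'id': 1, 'region': 2},
             {'id': 2, 'region': 2}]
handoffs = [{'id': 3, 'region': 2},
            {'id': 4, 'region': 1},
            {'id': 5, 'region': 1},
            {'id': 6, 'region': 1}]

order = delete_node_order(primaries, handoffs,
                          lambda node: node['region'] == 1, 2)
order_ids = [node['id'] for node in order]
print(order_ids)
```

All primaries keep their place at the front regardless of region, so the extra local handoffs add candidates for the response without displacing any primary.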
@ -35,7 +35,7 @@ from swift.common.ring import Ring
|
|||||||
from swift.common.utils import cache_from_env, get_logger, \
|
from swift.common.utils import cache_from_env, get_logger, \
|
||||||
get_remote_client, split_path, config_true_value, generate_trans_id, \
|
get_remote_client, split_path, config_true_value, generate_trans_id, \
|
||||||
affinity_key_function, affinity_locality_predicate, list_from_csv, \
|
affinity_key_function, affinity_locality_predicate, list_from_csv, \
|
||||||
register_swift_info, readconf
|
register_swift_info, readconf, config_auto_int_value
|
||||||
from swift.common.constraints import check_utf8, valid_api_version
|
from swift.common.constraints import check_utf8, valid_api_version
|
||||||
from swift.proxy.controllers import AccountController, ContainerController, \
|
from swift.proxy.controllers import AccountController, ContainerController, \
|
||||||
ObjectControllerRouter, InfoController
|
ObjectControllerRouter, InfoController
|
||||||
@ -130,13 +130,18 @@ class ProxyOverrideOptions(object):
|
|||||||
'Invalid write_affinity_node_count value: %r' %
|
'Invalid write_affinity_node_count value: %r' %
|
||||||
(' '.join(value)))
|
(' '.join(value)))
|
||||||
|
|
||||||
|
self.write_affinity_handoff_delete_count = config_auto_int_value(
|
||||||
|
get('write_affinity_handoff_delete_count', 'auto'), None
|
||||||
|
)
|
||||||
|
|
||||||
def __repr__(self):
|
def __repr__(self):
|
||||||
return '%s({}, {%s})' % (self.__class__.__name__, ', '.join(
|
return '%s({}, {%s})' % (self.__class__.__name__, ', '.join(
|
||||||
'%r: %r' % (k, getattr(self, k)) for k in (
|
'%r: %r' % (k, getattr(self, k)) for k in (
|
||||||
'sorting_method',
|
'sorting_method',
|
||||||
'read_affinity',
|
'read_affinity',
|
||||||
'write_affinity',
|
'write_affinity',
|
||||||
'write_affinity_node_count')))
|
'write_affinity_node_count',
|
||||||
|
'write_affinity_handoff_delete_count')))
|
||||||
|
|
||||||
def __eq__(self, other):
|
def __eq__(self, other):
|
||||||
if not isinstance(other, ProxyOverrideOptions):
|
if not isinstance(other, ProxyOverrideOptions):
|
||||||
@ -145,7 +150,8 @@ class ProxyOverrideOptions(object):
|
|||||||
'sorting_method',
|
'sorting_method',
|
||||||
'read_affinity',
|
'read_affinity',
|
||||||
'write_affinity',
|
'write_affinity',
|
||||||
'write_affinity_node_count'))
|
'write_affinity_node_count',
|
||||||
|
'write_affinity_handoff_delete_count'))
|
||||||
|
|
||||||
|
|
||||||
class Application(object):
|
class Application(object):
|
||||||
|
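The option is read with config_auto_int_value(..., None), so 'auto' becomes None and the proxy then computes the count itself. A minimal stand-in for that parsing step is sketched below; parse_auto_int is a hypothetical name, and the error message and exact edge-case handling are assumptions rather than Swift's implementation.

```python
def parse_auto_int(value, default):
    """Return `default` for None or 'auto', otherwise an int.

    Hypothetical stand-in that mimics how the patch uses
    config_auto_int_value; it is not the Swift utility itself.
    """
    if value is None or (isinstance(value, str) and value.lower() == 'auto'):
        return default
    try:
        return int(value)
    except (TypeError, ValueError):
        raise ValueError('expected an integer or "auto", got %r' % (value,))


# 'auto' maps to None, which the DELETE path treats as "calculate it".
auto_result = parse_auto_int('auto', None)
explicit_result = parse_auto_int('4', None)
print(auto_result, explicit_result)
```

This matches the expectations in the config-loading tests, where the default section yields None and an explicit setting of 4 yields the integer 4.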
@ -279,6 +279,86 @@ class BaseObjectControllerMixin(object):
|
|||||||
self.assertEqual(len(all_nodes), len(local_first_nodes))
|
self.assertEqual(len(all_nodes), len(local_first_nodes))
|
||||||
self.assertEqual(sorted(all_nodes), sorted(local_first_nodes))
|
self.assertEqual(sorted(all_nodes), sorted(local_first_nodes))
|
||||||
|
|
||||||
|
def test_iter_nodes_local_handoff_first_noops_when_no_affinity(self):
|
||||||
|
# this test needs a stable node order - most don't
|
||||||
|
self.app.sort_nodes = lambda l, *args, **kwargs: l
|
||||||
|
controller = self.controller_cls(
|
||||||
|
self.app, 'a', 'c', 'o')
|
||||||
|
policy = self.policy
|
||||||
|
self.app.get_policy_options(policy).write_affinity_is_local_fn = None
|
||||||
|
object_ring = policy.object_ring
|
||||||
|
all_nodes = object_ring.get_part_nodes(1)
|
||||||
|
all_nodes.extend(object_ring.get_more_nodes(1))
|
||||||
|
|
||||||
|
local_first_nodes = list(controller.iter_nodes_local_first(
|
||||||
|
object_ring, 1, local_handoffs_first=True))
|
||||||
|
|
||||||
|
self.maxDiff = None
|
||||||
|
|
||||||
|
self.assertEqual(all_nodes, local_first_nodes)
|
||||||
|
|
||||||
|
def test_iter_nodes_handoff_local_first_default(self):
|
||||||
|
controller = self.controller_cls(
|
||||||
|
self.app, 'a', 'c', 'o')
|
||||||
|
policy_conf = self.app.get_policy_options(self.policy)
|
||||||
|
policy_conf.write_affinity_is_local_fn = (
|
||||||
|
lambda node: node['region'] == 1)
|
||||||
|
|
||||||
|
object_ring = self.policy.object_ring
|
||||||
|
primary_nodes = object_ring.get_part_nodes(1)
|
||||||
|
handoff_nodes_iter = object_ring.get_more_nodes(1)
|
||||||
|
all_nodes = primary_nodes + list(handoff_nodes_iter)
|
||||||
|
handoff_nodes_iter = object_ring.get_more_nodes(1)
|
||||||
|
local_handoffs = [n for n in handoff_nodes_iter if
|
||||||
|
policy_conf.write_affinity_is_local_fn(n)]
|
||||||
|
|
||||||
|
prefered_nodes = list(controller.iter_nodes_local_first(
|
||||||
|
object_ring, 1, local_handoffs_first=True))
|
||||||
|
|
||||||
|
self.assertEqual(len(all_nodes), self.replicas() +
|
||||||
|
POLICIES.default.object_ring.max_more_nodes)
|
||||||
|
|
||||||
|
first_primary_nodes = prefered_nodes[:len(primary_nodes)]
|
||||||
|
self.assertEqual(sorted(primary_nodes), sorted(first_primary_nodes))
|
||||||
|
|
||||||
|
handoff_count = self.replicas() - len(primary_nodes)
|
||||||
|
first_handoffs = prefered_nodes[len(primary_nodes):][:handoff_count]
|
||||||
|
self.assertEqual(first_handoffs, local_handoffs[:handoff_count])
|
||||||
|
|
||||||
|
def test_iter_nodes_handoff_local_first_non_default(self):
|
||||||
|
# Obviously this test doesn't work if we're testing 1 replica.
|
||||||
|
# In that case, we don't have any failovers to check.
|
||||||
|
if self.replicas() == 1:
|
||||||
|
return
|
||||||
|
|
||||||
|
controller = self.controller_cls(
|
||||||
|
self.app, 'a', 'c', 'o')
|
||||||
|
policy_conf = self.app.get_policy_options(self.policy)
|
||||||
|
policy_conf.write_affinity_is_local_fn = (
|
||||||
|
lambda node: node['region'] == 1)
|
||||||
|
policy_conf.write_affinity_handoff_delete_count = 1
|
||||||
|
|
||||||
|
object_ring = self.policy.object_ring
|
||||||
|
primary_nodes = object_ring.get_part_nodes(1)
|
||||||
|
handoff_nodes_iter = object_ring.get_more_nodes(1)
|
||||||
|
all_nodes = primary_nodes + list(handoff_nodes_iter)
|
||||||
|
handoff_nodes_iter = object_ring.get_more_nodes(1)
|
||||||
|
local_handoffs = [n for n in handoff_nodes_iter if
|
||||||
|
policy_conf.write_affinity_is_local_fn(n)]
|
||||||
|
|
||||||
|
prefered_nodes = list(controller.iter_nodes_local_first(
|
||||||
|
object_ring, 1, local_handoffs_first=True))
|
||||||
|
|
||||||
|
self.assertEqual(len(all_nodes), self.replicas() +
|
||||||
|
POLICIES.default.object_ring.max_more_nodes)
|
||||||
|
|
||||||
|
first_primary_nodes = prefered_nodes[:len(primary_nodes)]
|
||||||
|
self.assertEqual(sorted(primary_nodes), sorted(first_primary_nodes))
|
||||||
|
|
||||||
|
handoff_count = policy_conf.write_affinity_handoff_delete_count
|
||||||
|
first_handoffs = prefered_nodes[len(primary_nodes):][:handoff_count]
|
||||||
|
self.assertEqual(first_handoffs, local_handoffs[:handoff_count])
|
||||||
|
|
||||||
def test_connect_put_node_timeout(self):
|
def test_connect_put_node_timeout(self):
|
||||||
controller = self.controller_cls(
|
controller = self.controller_cls(
|
||||||
self.app, 'a', 'c', 'o')
|
self.app, 'a', 'c', 'o')
|
||||||
@ -369,6 +449,36 @@ class BaseObjectControllerMixin(object):
|
|||||||
resp = req.get_response(self.app)
|
resp = req.get_response(self.app)
|
||||||
self.assertEqual(resp.status_int, 204)
|
self.assertEqual(resp.status_int, 204)
|
||||||
|
|
||||||
|
def test_DELETE_write_affinity_before_replication(self):
|
||||||
|
policy_conf = self.app.get_policy_options(self.policy)
|
||||||
|
policy_conf.write_affinity_handoff_delete_count = self.replicas() // 2
|
||||||
|
policy_conf.write_affinity_is_local_fn = (
|
||||||
|
lambda node: node['region'] == 1)
|
||||||
|
handoff_count = policy_conf.write_affinity_handoff_delete_count
|
||||||
|
|
||||||
|
req = swift.common.swob.Request.blank('/v1/a/c/o', method='DELETE')
|
||||||
|
codes = [204] * self.replicas() + [404] * handoff_count
|
||||||
|
with set_http_connect(*codes):
|
||||||
|
resp = req.get_response(self.app)
|
||||||
|
|
||||||
|
self.assertEqual(resp.status_int, 204)
|
||||||
|
|
||||||
|
def test_DELETE_write_affinity_after_replication(self):
|
||||||
|
policy_conf = self.app.get_policy_options(self.policy)
|
||||||
|
policy_conf.write_affinity_handoff_delete_count = self.replicas() // 2
|
||||||
|
policy_conf.write_affinity_is_local_fn = (
|
||||||
|
lambda node: node['region'] == 1)
|
||||||
|
handoff_count = policy_conf.write_affinity_handoff_delete_count
|
||||||
|
|
||||||
|
req = swift.common.swob.Request.blank('/v1/a/c/o', method='DELETE')
|
||||||
|
codes = ([204] * (self.replicas() - handoff_count) +
|
||||||
|
[404] * handoff_count +
|
||||||
|
[204] * handoff_count)
|
||||||
|
with set_http_connect(*codes):
|
||||||
|
resp = req.get_response(self.app)
|
||||||
|
|
||||||
|
self.assertEqual(resp.status_int, 204)
|
||||||
|
|
||||||
def test_POST_non_int_delete_after(self):
|
def test_POST_non_int_delete_after(self):
|
||||||
t = str(int(time.time() + 100)) + '.1'
|
t = str(int(time.time() + 100)) + '.1'
|
||||||
req = swob.Request.blank('/v1/a/c/o', method='POST',
|
req = swob.Request.blank('/v1/a/c/o', method='POST',
|
||||||
|
@ -1366,16 +1366,19 @@ class TestProxyServerConfigLoading(unittest.TestCase):
|
|||||||
read_affinity = r1=100
|
read_affinity = r1=100
|
||||||
write_affinity = r1
|
write_affinity = r1
|
||||||
write_affinity_node_count = 1 * replicas
|
write_affinity_node_count = 1 * replicas
|
||||||
|
write_affinity_handoff_delete_count = 4
|
||||||
"""
|
"""
|
||||||
expected_default = {"read_affinity": "",
|
expected_default = {"read_affinity": "",
|
||||||
"sorting_method": "shuffle",
|
"sorting_method": "shuffle",
|
||||||
"write_affinity": "",
|
"write_affinity": "",
|
||||||
"write_affinity_node_count_fn": 6}
|
"write_affinity_node_count_fn": 6,
|
||||||
|
"write_affinity_handoff_delete_count": None}
|
||||||
exp_options = {None: expected_default,
|
exp_options = {None: expected_default,
|
||||||
POLICIES[0]: {"read_affinity": "r1=100",
|
POLICIES[0]: {"read_affinity": "r1=100",
|
||||||
"sorting_method": "affinity",
|
"sorting_method": "affinity",
|
||||||
"write_affinity": "r1",
|
"write_affinity": "r1",
|
||||||
"write_affinity_node_count_fn": 3},
|
"write_affinity_node_count_fn": 3,
|
||||||
|
"write_affinity_handoff_delete_count": 4},
|
||||||
POLICIES[1]: expected_default}
|
POLICIES[1]: expected_default}
|
||||||
exp_is_local = {POLICIES[0]: [({'region': 1, 'zone': 2}, True),
|
exp_is_local = {POLICIES[0]: [({'region': 1, 'zone': 2}, True),
|
||||||
({'region': 2, 'zone': 1}, False)],
|
({'region': 2, 'zone': 1}, False)],
|
||||||
@ -1387,7 +1390,8 @@ class TestProxyServerConfigLoading(unittest.TestCase):
|
|||||||
self.assertEqual(
|
self.assertEqual(
|
||||||
"ProxyOverrideOptions({}, {'sorting_method': 'shuffle', "
|
"ProxyOverrideOptions({}, {'sorting_method': 'shuffle', "
|
||||||
"'read_affinity': '', 'write_affinity': '', "
|
"'read_affinity': '', 'write_affinity': '', "
|
||||||
"'write_affinity_node_count': '2 * replicas'})",
|
"'write_affinity_node_count': '2 * replicas', "
|
||||||
|
"'write_affinity_handoff_delete_count': None})",
|
||||||
repr(default_options))
|
repr(default_options))
|
||||||
self.assertEqual(default_options, eval(repr(default_options), {
|
self.assertEqual(default_options, eval(repr(default_options), {
|
||||||
'ProxyOverrideOptions': default_options.__class__}))
|
'ProxyOverrideOptions': default_options.__class__}))
|
||||||
@ -1396,7 +1400,8 @@ class TestProxyServerConfigLoading(unittest.TestCase):
|
|||||||
self.assertEqual(
|
self.assertEqual(
|
||||||
"ProxyOverrideOptions({}, {'sorting_method': 'affinity', "
|
"ProxyOverrideOptions({}, {'sorting_method': 'affinity', "
|
||||||
"'read_affinity': 'r1=100', 'write_affinity': 'r1', "
|
"'read_affinity': 'r1=100', 'write_affinity': 'r1', "
|
||||||
"'write_affinity_node_count': '1 * replicas'})",
|
"'write_affinity_node_count': '1 * replicas', "
|
||||||
|
"'write_affinity_handoff_delete_count': 4})",
|
||||||
repr(policy_0_options))
|
repr(policy_0_options))
|
||||||
self.assertEqual(policy_0_options, eval(repr(policy_0_options), {
|
self.assertEqual(policy_0_options, eval(repr(policy_0_options), {
|
||||||
'ProxyOverrideOptions': policy_0_options.__class__}))
|
'ProxyOverrideOptions': policy_0_options.__class__}))
|
||||||
@ -1411,6 +1416,7 @@ class TestProxyServerConfigLoading(unittest.TestCase):
|
|||||||
use = egg:swift#proxy
|
use = egg:swift#proxy
|
||||||
sorting_method = affinity
|
sorting_method = affinity
|
||||||
write_affinity_node_count = 1 * replicas
|
write_affinity_node_count = 1 * replicas
|
||||||
|
write_affinity_handoff_delete_count = 3
|
||||||
|
|
||||||
[proxy-server:policy:0]
|
[proxy-server:policy:0]
|
||||||
read_affinity = r1=100
|
read_affinity = r1=100
|
||||||
@ -1419,12 +1425,14 @@ class TestProxyServerConfigLoading(unittest.TestCase):
|
|||||||
expected_default = {"read_affinity": "",
|
expected_default = {"read_affinity": "",
|
||||||
"sorting_method": "affinity",
|
"sorting_method": "affinity",
|
||||||
"write_affinity": "",
|
"write_affinity": "",
|
||||||
"write_affinity_node_count_fn": 3}
|
"write_affinity_node_count_fn": 3,
|
||||||
|
"write_affinity_handoff_delete_count": 3}
|
||||||
exp_options = {None: expected_default,
|
exp_options = {None: expected_default,
|
||||||
POLICIES[0]: {"read_affinity": "r1=100",
|
POLICIES[0]: {"read_affinity": "r1=100",
|
||||||
"sorting_method": "affinity",
|
"sorting_method": "affinity",
|
||||||
"write_affinity": "r1",
|
"write_affinity": "r1",
|
||||||
"write_affinity_node_count_fn": 3},
|
"write_affinity_node_count_fn": 3,
|
||||||
|
"write_affinity_handoff_delete_count": 3},
|
||||||
POLICIES[1]: expected_default}
|
POLICIES[1]: expected_default}
|
||||||
exp_is_local = {POLICIES[0]: [({'region': 1, 'zone': 2}, True),
|
exp_is_local = {POLICIES[0]: [({'region': 1, 'zone': 2}, True),
|
||||||
({'region': 2, 'zone': 1}, False)],
|
({'region': 2, 'zone': 1}, False)],
|
||||||
@ -1440,29 +1448,35 @@ class TestProxyServerConfigLoading(unittest.TestCase):
|
|||||||
read_affinity = r2=10
|
read_affinity = r2=10
|
||||||
write_affinity_node_count = 1 * replicas
|
write_affinity_node_count = 1 * replicas
|
||||||
write_affinity = r2
|
write_affinity = r2
|
||||||
|
write_affinity_handoff_delete_count = 2
|
||||||
|
|
||||||
[proxy-server:policy:0]
|
[proxy-server:policy:0]
|
||||||
read_affinity = r1=100
|
read_affinity = r1=100
|
||||||
write_affinity = r1
|
write_affinity = r1
|
||||||
write_affinity_node_count = 5
|
write_affinity_node_count = 5
|
||||||
|
write_affinity_handoff_delete_count = 3
|
||||||
|
|
||||||
[proxy-server:policy:1]
|
[proxy-server:policy:1]
|
||||||
read_affinity = r1=1
|
read_affinity = r1=1
|
||||||
write_affinity = r3
|
write_affinity = r3
|
||||||
write_affinity_node_count = 4
|
write_affinity_node_count = 4
|
||||||
|
write_affinity_handoff_delete_count = 4
|
||||||
"""
|
"""
|
||||||
exp_options = {None: {"read_affinity": "r2=10",
|
exp_options = {None: {"read_affinity": "r2=10",
|
||||||
"sorting_method": "affinity",
|
"sorting_method": "affinity",
|
||||||
"write_affinity": "r2",
|
"write_affinity": "r2",
|
||||||
"write_affinity_node_count_fn": 3},
|
"write_affinity_node_count_fn": 3,
|
||||||
|
"write_affinity_handoff_delete_count": 2},
|
||||||
POLICIES[0]: {"read_affinity": "r1=100",
|
POLICIES[0]: {"read_affinity": "r1=100",
|
||||||
"sorting_method": "affinity",
|
"sorting_method": "affinity",
|
||||||
"write_affinity": "r1",
|
"write_affinity": "r1",
|
||||||
"write_affinity_node_count_fn": 5},
|
"write_affinity_node_count_fn": 5,
|
||||||
|
"write_affinity_handoff_delete_count": 3},
|
||||||
POLICIES[1]: {"read_affinity": "r1=1",
|
POLICIES[1]: {"read_affinity": "r1=1",
|
||||||
"sorting_method": "affinity",
|
"sorting_method": "affinity",
|
||||||
"write_affinity": "r3",
|
"write_affinity": "r3",
|
||||||
"write_affinity_node_count_fn": 4}}
|
"write_affinity_node_count_fn": 4,
|
||||||
|
"write_affinity_handoff_delete_count": 4}}
|
||||||
exp_is_local = {POLICIES[0]: [({'region': 1, 'zone': 2}, True),
|
exp_is_local = {POLICIES[0]: [({'region': 1, 'zone': 2}, True),
|
||||||
({'region': 2, 'zone': 1}, False)],
|
({'region': 2, 'zone': 1}, False)],
|
||||||
POLICIES[1]: [({'region': 3, 'zone': 2}, True),
|
POLICIES[1]: [({'region': 3, 'zone': 2}, True),
|
||||||
@ -1533,18 +1547,21 @@ class TestProxyServerConfigLoading(unittest.TestCase):
|
|||||||
None: {"read_affinity": "r1=100",
|
None: {"read_affinity": "r1=100",
|
||||||
"sorting_method": "shuffle",
|
"sorting_method": "shuffle",
|
||||||
"write_affinity": "r0",
|
"write_affinity": "r0",
|
||||||
"write_affinity_node_count_fn": 6},
|
"write_affinity_node_count_fn": 6,
|
||||||
|
"write_affinity_handoff_delete_count": None},
|
||||||
# policy 0 read affinity is r2, dictated by policy 0 section
|
# policy 0 read affinity is r2, dictated by policy 0 section
|
||||||
POLICIES[0]: {"read_affinity": "r2=100",
|
POLICIES[0]: {"read_affinity": "r2=100",
|
||||||
"sorting_method": "affinity",
|
"sorting_method": "affinity",
|
||||||
"write_affinity": "r2",
|
"write_affinity": "r2",
|
||||||
"write_affinity_node_count_fn": 6},
|
"write_affinity_node_count_fn": 6,
|
||||||
|
"write_affinity_handoff_delete_count": None},
|
||||||
# policy 1 read_affinity is r0, dictated by DEFAULT section,
|
# policy 1 read_affinity is r0, dictated by DEFAULT section,
|
||||||
# overrides proxy server section
|
# overrides proxy server section
|
||||||
POLICIES[1]: {"read_affinity": "r0=100",
|
POLICIES[1]: {"read_affinity": "r0=100",
|
||||||
"sorting_method": "affinity",
|
"sorting_method": "affinity",
|
||||||
"write_affinity": "r0",
|
"write_affinity": "r0",
|
||||||
"write_affinity_node_count_fn": 6}}
|
"write_affinity_node_count_fn": 6,
|
||||||
|
"write_affinity_handoff_delete_count": None}}
|
||||||
exp_is_local = {
|
exp_is_local = {
|
||||||
# default write_affinity is r0, dictated by DEFAULT section
|
# default write_affinity is r0, dictated by DEFAULT section
|
||||||
None: [({'region': 0, 'zone': 2}, True),
|
None: [({'region': 0, 'zone': 2}, True),
|
||||||
|
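The expectations in these tests suggest two behaviours: an unset (or `auto`) `write_affinity_handoff_delete_count` is represented as `None`, and a `[proxy-server:policy:N]` section falls back to the `[proxy-server]` value when it does not set the option itself. A minimal sketch of that resolution follows. The helper names `parse_handoff_delete_count` and `resolve_per_policy` are hypothetical and are not Swift's own functions; what the proxy does with a `None` value is outside this sketch.

```python
def parse_handoff_delete_count(raw_value):
    """Hypothetical parser: 'auto' or a missing value becomes None, else an int."""
    if raw_value is None:
        return None
    text = str(raw_value).strip()
    if text.lower() == 'auto':
        return None
    return int(text)  # raises ValueError for non-numeric input


def resolve_per_policy(proxy_conf, policy_confs):
    """Return {policy_key: count}; the key None holds the [proxy-server] default.

    Assumption: a policy section that omits the option inherits the
    [proxy-server] value, as the expected dicts in the tests imply.
    """
    key = 'write_affinity_handoff_delete_count'
    default = parse_handoff_delete_count(proxy_conf.get(key))
    resolved = {None: default}
    for policy, conf in policy_confs.items():
        if key in conf:
            resolved[policy] = parse_handoff_delete_count(conf[key])
        else:
            resolved[policy] = default
    return resolved


# Values mirror the config with [proxy-server], policy:0 and policy:1 sections.
counts = resolve_per_policy(
    {'write_affinity_handoff_delete_count': '2'},
    {0: {'write_affinity_handoff_delete_count': '3'},
     1: {'write_affinity_handoff_delete_count': '4'}})
print(counts)  # {None: 2, 0: 3, 1: 4}
```

With an empty `[proxy-server]` dict and a policy section that omits the option, both entries resolve to `None`, which matches the `None` values expected in the last hunk.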