Storage architecture ==================== OpenStack has multiple storage realms to consider: * Block Storage (cinder) * Object Storage (swift) * Image storage (glance) * Ephemeral storage (nova) Block Storage (cinder) ~~~~~~~~~~~~~~~~~~~~~~ The Block Storage (cinder) service manages volumes on storage devices in an environment. In a production environment, the device presents storage via a storage protocol (for example, NFS, iSCSI, or Ceph RBD) to a storage network (``br-storage``) and a storage management API to the management network (``br-mgmt``). Instances are connected to the volumes via the storage network by the hypervisor on the Compute host. The following diagram illustrates how Block Storage is connected to instances. .. figure:: ../figures/production-storage-cinder.png :width: 600px The diagram shows the following steps. +----+---------------------------------------------------------------------+ | 1. | A volume is created by the assigned ``cinder-volume`` service | | | using the appropriate `cinder driver`_. The volume is created by | | | using an API that is presented to the management network. | +----+---------------------------------------------------------------------+ | 2. | After the volume is created, the ``nova-compute`` service connects | | | the Compute host hypervisor to the volume via the storage network. | +----+---------------------------------------------------------------------+ | 3. | After the hypervisor is connected to the volume, it presents the | | | volume as a local hardware device to the instance. | +----+---------------------------------------------------------------------+ .. important:: The `LVMVolumeDriver`_ is designed as a reference driver implementation, which we do not recommend for production usage. The LVM storage back-end is a single-server solution that provides no high-availability options. If the server becomes unavailable, then all volumes managed by the ``cinder-volume`` service running on that server become unavailable. Upgrading the operating system packages (for example, kernel or iSCSI) on the server causes storage connectivity outages because the iSCSI service (or the host) restarts. Because of a `limitation with container iSCSI connectivity`_, you must deploy the ``cinder-volume`` service directly on a physical host (not into a container) when using storage back ends that connect via iSCSI. This includes the `LVMVolumeDriver`_ and many of the drivers for commercial storage devices. .. note:: The ``cinder-volume`` service does not run in a highly available configuration. When the ``cinder-volume`` service is configured to manage volumes on the same back end from multiple hosts or containers, one service is scheduled to manage the life cycle of the volume until an alternative service is assigned to do so. This assignment can be made through the `cinder-manage CLI tool`_. This configuration might change if `cinder volume active-active support spec`_ is implemented. .. _cinder driver: http://docs.openstack.org/developer/cinder/drivers.html .. _LVMVolumeDriver: http://docs.openstack.org/developer/cinder/drivers.html#lvmvolumedriver .. _limitation with container iSCSI connectivity: https://bugs.launchpad.net/ubuntu/+source/lxc/+bug/1226855 .. _cinder-manage CLI tool: http://docs.openstack.org/developer/cinder/man/cinder-manage.html#cinder-volume .. _cinder volume active-active support spec: https://specs.openstack.org/openstack/cinder-specs/specs/mitaka/cinder-volume-active-active-support.html Object Storage (swift) ~~~~~~~~~~~~~~~~~~~~~~ The Object Storage (swift) service implements a highly available, distributed, eventually consistent object/blob store that is accessible via HTTP/HTTPS. The following diagram illustrates how data is accessed and replicated. .. figure:: ../figures/production-storage-swift.png :width: 600px The ``swift-proxy`` service is accessed by clients via the load balancer on the management network (``br-mgmt``). The ``swift-proxy`` service communicates with the Account, Container, and Object services on the Object Storage hosts via the storage network(``br-storage``). Replication between the Object Storage hosts is done via the replication network (``br-repl``). Image storage (glance) ~~~~~~~~~~~~~~~~~~~~~~ The Image service (glance) can be configured to store images on a variety of storage back ends supported by the `glance_store drivers`_. .. important:: When the File System store is used, the Image service has no mechanism of its own to replicate the image between Image service hosts. We recommend using a shared storage back end (via a file system mount) to ensure that allĀ ``glance-api`` services have access to all images. Doing so prevents losing access to images when an infrastructure (control plane) host is lost. The following diagram illustrates the interactions between the Image service, the storage device, and the ``nova-compute`` service when an instance is created. .. figure:: ../figures/production-storage-glance.png :width: 600px The diagram shows the following steps. +----+---------------------------------------------------------------------+ | 1 | When a client requests an image, the ``glance-api`` service | | | accesses the appropriate store on the storage device over the | | | storage network (``br-storage``) and pulls it into its cache. When | | | the same image is requested again, it is given to the client | | | directly from the cache. | +----+---------------------------------------------------------------------+ | 2 | When an instance is scheduled for creation on a Compute host, the | | | ``nova-compute`` service requests the image from the ``glance-api`` | | | service over the management network (``br-mgmt``). | +----+---------------------------------------------------------------------+ | 3 | After the image is retrieved, the ``nova-compute`` service stores | | | the image in its own image cache. When another instance is created | | | with the same image, the image is retrieved from the local base | | | image cache. | +----+---------------------------------------------------------------------+ .. _glance_store drivers: http://docs.openstack.org/developer/glance_store/drivers/ Ephemeral storage (nova) ~~~~~~~~~~~~~~~~~~~~~~~~ When the flavors in the Compute service are configured to provide instances with root or ephemeral disks, the ``nova-compute`` service manages these allocations using its ephemeral disk storage location. In many environments, the ephemeral disks are stored on the Compute host's local disks, but for production environments we recommend that the Compute hosts be configured to use a shared storage subsystem instead. A shared storage subsystem allows quick, live instance migration between Compute hosts, which is useful when the administrator needs to perform maintenance on the Compute host and wants to evacuate it. Using a shared storage subsystem also allows the recovery of instances when a Compute host goes offline. The administrator is able to evacuate the instance to another Compute host and boot it up again. The following diagram illustrates the interactions between the storage device, the Compute host, the hypervisor, and the instance. .. figure:: ../figures/production-storage-nova.png :width: 600px The diagram shows the following steps. +----+---------------------------------------------------------------------+ | 1 | The Compute host is configured with access to the storage device. | | | The Compute host accesses the storage space via the storage network | | | (``br-storage``) by using a storage protocol (for example, NFS, | | | iSCSI, or Ceph RBD). | +----+---------------------------------------------------------------------+ | 2 | The ``nova-compute`` service configures the hypervisor to present | | | the allocated instance disk as a device to the instance. | +----+---------------------------------------------------------------------+ | 3 | The hypervisor presents the disk as a device to the instance. | +----+---------------------------------------------------------------------+