iconimage - Fotolia

Feature

Spread of Kubernetes spurs backup and disaster recovery products

Growth of containerised applications brings with it the need to back up and provide disaster recovery. But what do you need to protect, and who provides the products?

Antony Adshead, Storage Editor

Published: 22 Jan 2020

With the rise in use of containers – and their orchestration via Kubernetes (also known as K8s or Kube) – the application landscape is changing rapidly.

“Traditional” applications, monolithic even, are obviously present in huge numbers, but for web-based use cases, particularly where scaling up and down rapidly is an issue, containers come into their own.

But what about backup for containers and disaster recovery?

After all, your container infrastructure and, quite possibly, the data it creates needs to be protected. So what are the key options for container backup and disaster recovery?

Here we look at what needs protecting, and how to do it, with a short survey of the key suppliers that offer container disaster recovery.

Container architecture: New challenges

The “traditional” application architecture is well known. Applications run in the datacentre on their server with their operating system (OS), while data resides on the same hardware or in shared storage.

All that – even if, or especially if, virtual servers are used – makes it pretty simple to spin up a fresh deployment in case of a disaster that takes down your primary infrastructure.

At its simplest, you can take backups of virtual machine images and associated data and recover from those, to a known state.

Containers pose new challenges. Although virtual servers are relatively easy to create, use and destroy, the lifespan of containerised applications is mayfly-like in comparison, with most – about 85%, according to a 2019 survey by Sysdig – lasting less than a day.

This reflects the massive advantage of containers. They are easily built components of an application architecture that can be spun up, scaled up and decommissioned easily and quickly – in the cloud as well as on-premise – in most cases today using the Kubernetes orchestration environment.

Why is this good for application development and deployment? In theory, apps can be built and tested on a laptop and reproduced rapidly in the production environment after that.

Core Kubernetes backup and recovery

If you are a small-scale operation, you could, in theory, rebuild your Kubernetes deployments from their git repos. It is totally within the DevOps ethos to rebuild from code.

That, of course, does not take care of any container configuration information held in etcd or any data created by your containers that might need to be recovered.

You could, in theory, back up ectd from within Kubernetes by running and scheduling CronJobs and run restores on those snapshots.

But none of that will be terribly practical in a small and medium-sized enterprise (SME) or enterprise environment, so a number of specialist startups have emerged that supply Kubernetes management with backup and disaster recovery, based on application programming interface (API) connections to the container orchestration platform.

Here is a look at some of those that have emerged.

Specialist Kubernetes management products

Kasten’s K10 data management platform is claimed to be purpose-built for Kubernetes. When deployed, it scans for Kubernetes components that need protection and provides customer-defined, policy-based data protection, mobility, backup and restore, and disaster recovery. It runs in its own namespace in your Kubernetes cluster and its reach extends to state related to persistent storage volumes and databases. Granular restore is possible to point-in-time and application subset. It supports AWS, Azure, GCP and IBM clouds and works on-premise.

Portworx was a pioneer of persistent storage for containers. Late last year, it added its PX-Backup software to the Portworx Enterprise storage management platform. The new functionality protects individual containers, groups of containers or a full Kubernetes namespace with a single command to any S3-compatible object storage. PX-Backup does not require the primary Portworx Enterprise storage management software platform.

Rancher provides a container management environment that includes access control and allows DevOps engineers to manage application workloads without in-depth knowledge of Kubernetes concepts. The Rancher API server is built on top of the Kubernetes API server and etcd database. In the Rancher UI, etcd backup and recovery for Rancher Kubernetes clusters can be performed. Snapshots of the etcd database are taken and saved locally onto etcd nodes or to a S3 compatible target, which can be remote and used to restore the cluster.

Velero is an open source tool that can back up, recover and migrate Kubernetes clusters and associated persistent storage on-premise and in a public cloud. Velero consists of a server process that runs in the Kubernetes cluster and a command-line interface (CLI) that allows administrators to configure scheduled backups, trigger ad-hoc backups, perform restores, and so on. It uses the Kubernetes API to capture the state of cluster resources down to granular levels.

Next Steps

Veeam acquisition of Kasten boosts Kubernetes backup

Kubernetes backup demands new IT practices and tools

Trilio and Portworx upgrade Kubernetes backup at KubeCon

Vendors push Kubernetes backup as a must for production

Spread of Kubernetes spurs backup and disaster recovery products

Growth of containerised applications brings with it the need to back up and provide disaster recovery. But what do you need to protect, and who provides the products?

Container architecture: New challenges

Read more on container data protection

Core Kubernetes backup and recovery

Specialist Kubernetes management products

Next Steps

Read more on Storage management and strategy

Backup technology explained: The fundamentals of enterprise backup

Top 10 Kubernetes and storage stories of 2024

Kubernetes disaster recovery: Five key questions

Storage technology explained: Kubernetes, containers and persistent storage