Kubernetes Can Make Data Protection Complicated

Image credit: iStockphoto/francescoch

While containers have been around since the 1970s, their popularity exploded with Docker in 2013.

Now, container-orchestration tools like Kubernetes are rapidly transforming how applications are developed and deployed. They’re doing it by enabling developers to focus exclusively on building software and delivering value.

It is not an overstatement to say that containers are sparking a revolution in software development. But if you’re going to embrace containers, you must also get your arms around the issue of data protection.

A modern concept

A container is an application that bundles all its dependencies, libraries, and configuration files into a single package. This bundling makes it easier to spin up new container instances and seamlessly move containers from one computing environment to another. 

That is a significant advantage in a lot of ways. For instance, developers typically use containers to move an application from a testing environment, such as their laptops, to a live production environment. The use of containers is also common when migrating from a physical machine to a cloud-based virtual machine.

Containers are highly beneficial in many different scenarios because differences in operating systems, software versions, etc., do not slow them down. Indeed, containers are extremely flexible and portable, making them a natural fit for many cloud applications. As computing and storage rapidly move to the cloud, containers will likely become a vital technology for every modern organization.

However, while container orchestration tools like Kubernetes are convenient due to their scalability and portability, they fall short in data protection.

The data protection dilemma

Well, for starters, a Kubernetes architecture is exceptionally fluid and dynamic. Containers are rapidly spun up and just as quickly torn down, depending on the developers’ goals and specifications. That means containers are essentially temporary and have a relatively short lifespan. 

What does this mean for data protection? It means that as more enterprises adopt containers, data protection will become an increasingly important issue. The growing number of organizations now using containers in their testing environment before deploying new applications are discovering that unexpected things can happen to the data during that migration and deployment.

The takeaway: properly backing up your data is particularly important — and will only become more critical in the months and years ahead.

Data protection also needs to go beyond the container lifespan.

As organizations use more containers, they will create more data that will need to be backed up and stored. Since testing and development are the primary uses of containers, the containers’ lifespan is usually shorter than the data they create. For compliance and other reasons, this data needs to be stored and protected long after a particular container is decommissioned or destroyed.

It is also important to note that backing up container data is not a time-based process, with data backed up every few minutes or hours. With containers, backup is more event-driven. For example, if you create a new container and you do not get the result you are hoping for, you want the ability to return to the previous state quickly. So, you need a backup at that point.

For these reasons, container backup is increasingly a front-burner issue. Even though containers, by their nature, are designed for existing only when needed, organizations are realizing that container data needs to last longer and must ultimately be protected.

Rethinking data protection

There are several steps companies can take to ensure that their container data is stored successfully.

For starters, it is vital to assess the data requirements for each containerized application. Companies should also ensure that there are protocols in place to stop container data from being needlessly overwritten. Further, companies must be aware of every container’s security and access requirements in their environment.

Containerization has provided a significant boost to application development. But organizations need to give serious consideration to storing, backing up, and protecting their containerized data. By tackling the data-protection issue head-on, organizations can truly reap the many benefits of container-based development and move confidently into the future.

Florian Malecki, the international product marketing director at StorageCraft, wrote this article.

The views and opinions expressed in this article are those of the author and do not necessarily reflect those of CDOTrends. Image credit: iStockphoto/francescoch