Deploy != Release (Part 1)

The difference between deploy and release and why it matters.

Art Gillespie

Published in

Turbine Labs

5 min readMay 24, 2017

Q: â€œIs the latest version deployed?â€

A: â€œI deployed animated gif support to production.â€

Q: â€œSo animated gif support is released?â€

A: â€œThe animated gif release is deployed.â€

Q: â€œâ€¦â€

Iâ€™ve worked at many companies where â€œdeployâ€, â€œdeploymentâ€, â€œshipâ€, and â€œreleaseâ€ are used loosely, even interchangeably. As an industry, we havenâ€™t done a great job of standardizing our use of these terms, even though weâ€™ve radically improved operations practices and tooling over the past decade. At Turbine Labs, we use precise definitions of â€œshipâ€, â€œdeployâ€, â€œreleaseâ€, and â€œrollbackâ€, and spend a lot of our time thinking about what the world looks like when you think about â€œreleaseâ€ as an independent phase of your shipping process. In part one of this post, Iâ€™ll share these definitions, describe some common â€œdeploy == releaseâ€ practices, and explain why they have a poor risk profile. In part two, Iâ€™ll describe some of the extremely powerful risk mitigation techniques made possible when â€œdeployâ€ and â€œreleaseâ€ are treated as distinct phases of your software shipping cycle.

Ship

Shipping is your teamâ€™s process of getting a snapshot of your serviceâ€™s code â€” a version â€” from your teamâ€™s source control repository all the way to handling production traffic. I think of the overall shipping process as four distinct groups of smaller, specialized processes: Build, test, deploy, and release. Thanks to technology advances in cloud infrastructure, containers, orchestration frameworks, as well as process advances like twelve-factor, continuous integration, and continuous delivery, itâ€™s never been easier to execute the first three processes â€” build, test, and deploy.

Deploy

Deployment is your teamâ€™s process for installing the new version of your serviceâ€™s code on production infrastructure. When we say a new version of software is deployed, we mean it is running somewhere in your production infrastructure. That could be a newly spun-up EC2 instance on AWS, or a Docker container running in a pod in your data centerâ€™s Kubernetes cluster. Your software has started successfully, passed health checks, and is ready (you hope!) to handle production traffic, but may not actually be receiving any. This is an important point, so Iâ€™ll repeat it using Mediumâ€™s awesome large pull quote format:

Deployment need not expose customers to a new version of your service.

Given this definition, deployment can be an almost zero-risk activity. Sure, a lot can go wrong during deployment, but if a container backs off a crash loop in the woods and no customer gets a 500 status response, did it really happen?

Release

When we say a version of a service is released, we mean that it is responsible for serving production traffic. In verb form, releasing is the process of moving production traffic to the new version. Given this definition, all the risks we associate with shipping a new binary â€” outages, angry customers, snarky write-ups in The Register â€” are related to the release, not deployment, of new software. (At some companies Iâ€™ve heard this phase of shipping referred as rollout. Weâ€™ll stick to release for this post.)

Rollback

Sooner or later, but probably sooner and later, your team is going to ship something broken. Rollback (and its dangerous, unpredictable, stressed-out cousin, roll-forward) is the process of getting production back to a known state, typically by re-releasing the most recent version. Itâ€™s useful to think of rollback as just another deploy and release, the only differences being:

You are shipping a version whose characteristics are known in production,
You are executing your deploy and release process under time pressure, and
You are potentially releasing into a different environment â€” things may have changed during (or been changed by) the failed release.

An example of rollback after a bad release.

Now that weâ€™ve agreed on these definitions for shipping, deployment, release, and rollback, letâ€™s examine some common deploy and release practices.

Release in Place (Or deploy == release)

When your teamâ€™s shipping process involves pushing a new version of your software onto a server running the old version and re-starting the process, youâ€™re releasing in place. Using our definition above, deployment and release occur simultaneously: as soon as the new software is running (deployed), itâ€™s taking all the production traffic the old version was taking a split-second ago (released). In this world, a successful deploy is a successful release, and a bad deploy gets you a partial or complete outage, a bunch of mad users, andâ€”possiblyâ€”a hyperventilating manager.

Release-in-place has the distinction of being the only deploy/release process weâ€™ll discuss here that directly exposes deploy risk to customers. If the new version youâ€™ve just deployed canâ€™t launch â€” maybe it throws an error when it doesnâ€™t find a newly-required secret in an environment variable, maybe thereâ€™s an unmet library dependency, or maybe itâ€™s just not your day â€” there is no old version to take that instanceâ€™s customer traffic. Your service isâ€”at least partiallyâ€”down.

Moreover, if thereâ€™s a user-facing issue or more subtle operational issue with the new version â€” I call this release risk â€” release-in-place will expose every production request to it for each instance that youâ€™ve released to.

In a clustered environment, you might first release-in-place to just one of your instances. This practice, most commonly referred to as canary, can mitigate some risk â€” the percentage of your traffic exposed to deploy and release risk is equal to the number of instances with the new version of your service divided by the total number of instances in your serviceâ€™s cluster.

A canary release: One host in the cluster is running the new version.

Finally, rolling back a broken release-in-place deploy can be problematic. Thereâ€™s no guarantee that you can get back to the previous system state even if you rollback (re-release) the old version. Your rollback deploy is just as likely to fail at startup as your currently broken deploy.

Despite its relatively poor risk management characteristicsâ€”even with canaries, youâ€™re directly exposing some percentage of customersâ€™ requests to deploy riskâ€”release-in-place is still a common way to do business. I think itâ€™s experience with these kinds of systems that leads to the unfortunate use of the terms â€œdeployâ€ and â€œreleaseâ€ interchangeably.

Despair Not

We can do much, much better! In the second part of this post, Iâ€™ll talk about strategies for decoupling deploy from release and some of the powerful workflows you can build on top of a sophisticated release system.

Iâ€™m an engineer at Turbine Labs where weâ€™re building Houston, a service that makes building and monitoring sophisticated, realtime release workflows easy. If youâ€™d like to ship more and worry less, you should definitely get in touch! Weâ€™d love to talk with you.

Thanks to Glen Sanford, Mark McBride, Emily Pinkerton, Brook Shelley, Sara, and Jenn Gillespie for reading drafts of this post.