From Prototype to Production: Deploying in the Cloud
Welcome to Cloud Engineering
Details
From working prototypes to production infrastructure
You built something that works. Now comes the hard part: making it work reliably, at scale.
This course takes you
from developer to cloud engineer
from local services to orchestrated microservices
from Docker to Kubernetes
You have built the app
We all remember RamCoin (a cryptocurrency mining stack)
Services:
rng: generates random data
hasher: hashes the data
worker: coordinates hashing jobs
redis: backend store
webui: frontend dashboard
Now you need to deploy it.
Problem: Prototype != Production
How do you run this system across multiple machines?
How do you recover from failure?
How do you update without downtime?
How do you manage logs, secrets, access controls?
We have options (but not all are equal)
Docker Compose
Great for dev/test
But no self-healing, no multi-node scaling (single node only), no declarative control
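For reference, the RamCoin stack above maps naturally onto a Compose file. This is a hypothetical sketch: the image names and the WebUI port are illustrative assumptions, not the actual course materials.

```yaml
# Hypothetical docker-compose.yml for the RamCoin stack.
# Image names and the published port are assumptions.
services:
  rng:
    image: ramcoin/rng        # generates random data
  hasher:
    image: ramcoin/hasher     # hashes the data
  worker:
    image: ramcoin/worker     # coordinates hashing jobs
    depends_on: [rng, hasher, redis]
  redis:
    image: redis              # backend store
  webui:
    image: ramcoin/webui      # frontend dashboard
    ports:
      - "8000:80"
    depends_on: [redis]
```

This is exactly where Compose shines, and where it stops: `docker compose up` brings the stack up on one machine, but nothing restarts a crashed service on another node or scales `worker` across hosts.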
Docker Swarm
Lightweight orchestration
But nearly deprecated and not widely supported
Kubernetes
Designed for large-scale, production deployments
Industry-standard
Built for resilience, scaling, and management
Details
If Docker Compose is your local test bed, Kubernetes is your data center.
What is Kubernetes?
Open-source container orchestration platform
Maintains desired state using declarative configuration (YAML)
Core resources:
Pods: smallest deployable unit
Services: expose pods
Deployments: manage pod replicas
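To make these resources concrete, here is a hypothetical manifest for one RamCoin service: a Deployment that keeps three replicas of `hasher` running, plus a Service that exposes those pods inside the cluster. The image name and port are illustrative assumptions.

```yaml
# Deployment: declares the desired state (3 hasher replicas);
# Kubernetes works to keep reality matching this spec.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: hasher
spec:
  replicas: 3
  selector:
    matchLabels:
      app: hasher
  template:
    metadata:
      labels:
        app: hasher
    spec:
      containers:
      - name: hasher
        image: ramcoin/hasher   # illustrative image name
        ports:
        - containerPort: 80
---
# Service: gives the hasher pods a stable in-cluster address,
# load-balancing across whichever replicas are currently healthy.
apiVersion: v1
kind: Service
metadata:
  name: hasher
spec:
  selector:
    app: hasher
  ports:
  - port: 80
```

Applying this with `kubectl apply -f hasher.yaml` is declarative: if a pod dies, the Deployment controller replaces it to restore the declared replica count.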
What does orchestrate mean?
Dictionary definition: to arrange or combine so as to achieve a desired or maximum effect
Kubernetes documentation: we tell Kubernetes what the desired state of our system is, and Kubernetes works to maintain that state
Before containerization/virtualization, we had clusters of computers running jobs.
Jobs = applications running on one or more computing nodes
Applications’ dependencies are tied to the supporting operating system on these nodes.
The cluster management system only needs to manage applications.
A container is more than an application.
A lightweight virtualization of an operating system and the components that help an application run, including external libraries.
A running container does not depend on the host computer’s libraries.
Is the management process the same as a cluster management system?
Kubernetes in Context
Kubernetes didn’t emerge from nowhere
It was inspired by Borg, Google’s internal cluster manager
Before Kubernetes: Borg, a cluster management system
Google’s Cluster Management System
First developed in 2003.
Abhishek Verma, Luis Pedrosa, Madhukar Korupolu, David Oppenheimer, Eric Tune, and John Wilkes. Large-scale cluster management at Google with Borg. In Proceedings of the Tenth European Conference on Computer Systems, p. 18. ACM, 2015.
Manages hundreds of thousands of jobs, from many thousands of different applications, across clusters of up to tens of thousands of machines.
Borg is the predecessor of Kubernetes. Understanding Borg helps explain the design decisions behind Kubernetes.
Kubernetes is perhaps the most popular open-source container orchestration system today, in both academia and industry.
Other container orchestration systems either:
Are being deprecated (Docker Swarm)
Integrate container management into an existing framework rather than developing a new management system (UC Berkeley’s Mesos and Twitter’s Aurora)
Borg runs:
Gmail
YouTube
Google Search
Details
Kubernetes is Borg’s spiritual child but open, modular, extensible
Inside Borg (2003-2015)
Manages 10K+ machines per cluster
Schedules hundreds of thousands of jobs
Uses a central master and node-level agents (“Borglets”)
Concepts:
Jobs
Tasks
Allocations
Cells
Borg’s concepts elaborated
Work is submitted to Borg as jobs, each of which has one or more tasks (each task corresponds to a binary).
Each job runs in one Borg cell, a set of machines managed as a single unit.
A Borg allocation defines a reserved set of resources on a machine in which one or more tasks can run.
Job types:
Long-running services that should never go down and that serve short-lived, latency-sensitive requests: Gmail, Google Docs, Web Search …
Batch jobs that take a few seconds to a few days to complete.
Borg cells allow for not just applications, but application frameworks
One master job and one or more worker jobs.
The framework can execute parallel applications itself.
Examples of frameworks running on top of Borg:
MapReduce
FlumeJava: Data-Parallel Pipelines
Millwheel: Fault-tolerant Stream Processing at Internet Scale
Pregel: Large-scale graph processing
Machines in cells belong to a single cluster, defined by the high-performance datacenter-scale network fabric connecting them.
How is this different from the traditional cluster model?
Borg’s architecture
Borg Master
Borglet
Scalability of Borg Master
Reported in the 2015 paper: the authors are unsure of the ultimate scalability limit (flex, anyone?)
A single master can manage many thousands of machines in a cell
Several cells see arrival rates of more than 10,000 tasks per minute
2020 Borg analysis report:
[Muhamad Tirmazi, Adam Barker, Nan Deng, Md E. Haque, Zhijing Gene Qin, Steven Hand, Mor Harchol-Balter, and John Wilkes. “Borg: the next generation.” In Proceedings of the Fifteenth European Conference on Computer Systems](https://dl.acm.org/doi/pdf/10.1145/3342195.3387517)