Cluster

What a cluster is, how it works, and why it forms the foundation for Kubernetes.

Last updated: March 16, 2026

What is a Cluster?

A cluster is a group of computers that work together as a unified system. In the context of Kubernetes, a cluster consists of a control plane that manages the overall state and worker nodes that run the actual application workloads. From the outside, the cluster appears as a single computing environment, even though it spans multiple machines.

What problem does it solve?

A single server has limited CPU, memory, and storage capacity. If that server fails, everything running on it goes down with it. Scaling vertically by adding more resources to one machine eventually hits a ceiling, and there is no redundancy to protect against hardware failures or maintenance downtime.

How does it help?

Clusters distribute workloads across multiple nodes, so the failure of one machine does not take down the entire system. New nodes can be added to increase capacity horizontally without replacing existing hardware. Kubernetes leverages this foundation to schedule containers intelligently, balance traffic across healthy nodes, and automatically reschedule workloads when a node becomes unavailable. This makes clusters the backbone of resilient, scalable infrastructure.