
Kubernetes: Understanding Its Core Architecture

Introduction

I still remember the first time I tried to deploy a side‑project on Kubernetes. I had two dusty laptops on my desk—one hosting the control plane, the other pretending to be a worker. The cluster booted, the fan noise rose to jet‑engine levels, and I stared at a blinking cursor, wondering why my pods weren’t starting. Ten minutes of log‑hunting later, I discovered I’d misspelled a label. That little mishap taught me two things: (1) Kubernetes will always do exactly what you ask, not what you meant, and (2) understanding its architecture is the only antidote to frustration.

This article is the explanation I wish I’d had that day—a conversational walk-through of Kubernetes architecture that swaps acronyms for plain English, sprinkles in real‑world metaphors, and keeps beginners front and center. By the end, you’ll know which part of Kubernetes is the “brain,” which is the “muscle,” and how they work together to run your containers like a well‑rehearsed orchestra. Let’s dive in.

1. The Big Picture: Clusters, Control Plane, and Data Plane

Think of a Kubernetes cluster as a miniature city. Every city has a city hall (planning, zoning, permits) and a set of neighborhoods where people actually live and work. In Kubernetes‑land:

  • Control Plane (Master Node) = City Hall
  • Data Plane (Worker Nodes) = Neighborhoods

Keeping these roles distinct allows Kubernetes to scale gracefully and to survive failures: if a house in one neighborhood burns down, city hall still stands and can coordinate rebuilds elsewhere.


2. Master Node vs. Worker Node: Who Does What?

Master Node—The Brain and City Planner

The Master Node hosts the components that decide what should happen:

  • Accepts your requests (“I need three replicas of my web service.”)
  • Plans where those replicas should live
  • Watches the city map for anything out of place
  • Updates records so every other official stays informed

Worker Nodes—The Hands and Feet

Worker Nodes are ordinary machines (physical or virtual) where your application containers actually run:

  • They receive marching orders from the Master Node
  • They spin up, monitor, and tear down containers
  • They keep your application traffic flowing

A cluster can run with a single Master and a single Worker on a laptop, but production setups usually run several of each for high availability.

3. An Inside Look at the Master Node

Kubernetes fans often say, “The control plane is just five pieces.” That’s like saying a smartphone is just a screen, battery, and CPU—it’s true, but the details matter. Let’s meet each piece.

3.1. API Server: The Friendly Receptionist

Imagine walking into city hall. At the front desk sits the API Server—the single door for every citizen, council member, or inspector. Whether you use kubectl, a CI/CD pipeline, or another internal component, you must file your request through this receptionist.


What it does

  • Validates and sanitizes every request
  • Stores the desired state in a central database (etcd)
  • Serves fresh data to anyone who asks (“Which pods are healthy?”)
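To feel that “single door” for yourself, you can ask the API Server things directly with kubectl. These are standard Kubernetes API paths; only the namespace is an assumption here:

# Ask the API Server whether it is healthy
kubectl get --raw /healthz

# The same endpoint "kubectl get pods" calls under the hood (default namespace assumed)
kubectl get --raw /api/v1/namespaces/default/pods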

Anecdote

On my first real‑world cluster, the API Server logs were my compass. Whenever a deployment stalled, I tailed those logs and—nine times out of ten—found the typo or quota error right there.

3.2. Scheduler: The Matchmaker

Once a new pod shows up at reception, the Scheduler swipes through the cluster’s dating profile, looking for the perfect node match.


How it chooses

  • CPU, memory, and GPU requirements
  • Node labels (“Only run on SSD nodes”)
  • Affinity/anti‑affinity—pods that should or shouldn’t sit together
  • Taints & tolerations—explicit acceptance or rejection criteria

Real‑world example

Suppose you have three worker nodes:

  1. A rugged, GPU‑heavy node
  2. A memory‑rich node
  3. A general‑purpose node

Create a pod that needs a GPU, and the Scheduler will almost certainly pick node #1. Change your YAML to demand 16 GiB of RAM, and node #2 becomes the star attraction.
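Here’s a rough sketch of the GPU pod from that example. The resource name nvidia.com/gpu assumes the NVIDIA device plugin is installed on node #1, and the image name is a placeholder:

apiVersion: v1
kind: Pod
metadata:
  name: gpu-demo                                # illustrative name
spec:
  containers:
  - name: trainer
    image: registry.example.com/trainer:latest  # placeholder image
    resources:
      requests:
        cpu: "500m"
        memory: "2Gi"
      limits:
        nvidia.com/gpu: 1                       # assumes the NVIDIA device plugin advertises this resource

Swap the GPU line for a 16Gi memory request, and the Scheduler’s scoring shifts toward node #2 instead.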

3.3. Controller Manager: The Tireless Watchdog

If the Scheduler is the matchmaker, the Controller Manager is the city inspector walking neighborhoods with a clipboard.


Key controllers

  • DeploymentController: ensures deployments have the right replica count
  • ReplicaSetController: tracks low‑level replica sets
  • NodeController: marks nodes unhealthy if they stop reporting
  • Job & CronJob Controllers: handle one‑off or scheduled tasks

Every controller runs a tight loop: observe → compare → act. When a deployment says “keep three replicas,” and a node dies, the controller immediately spins up a replacement elsewhere—no human intervention required.
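The “desired state” those loops compare against is simply what you declared. A minimal Deployment sketch (name and image are illustrative) looks like this; delete a pod, or lose a node, and the controllers quietly bring the count back to three:

apiVersion: apps/v1
kind: Deployment
metadata:
  name: web                  # illustrative name
spec:
  replicas: 3                # the desired state the control loop keeps enforcing
  selector:
    matchLabels:
      app: web
  template:
    metadata:
      labels:
        app: web
    spec:
      containers:
      - name: web
        image: nginx:1.25    # any image will do; nginx is just an example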

3.4. etcd: The Single Source of Truth

Under the floorboards of city hall sits etcd, a lightning‑fast, distributed key‑value store. Every record—pod names, secrets, ConfigMaps, even the leadership election for other components—lives here.


Why should you care?

  • Lose etcd, and the control plane forgets everything.
  • Back up etcd regularly; it’s surprisingly small (megabytes, not gigabytes) but mission‑critical.
  • Production clusters often run an external etcd cluster for extra durability.
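A typical backup, assuming you can reach an etcd member and have its client certificates handy (the paths below are kubeadm-style defaults; adjust them for your installer), looks roughly like this:

# Take a point-in-time snapshot of etcd
ETCDCTL_API=3 etcdctl snapshot save /var/backups/etcd-snapshot.db \
  --endpoints=https://127.0.0.1:2379 \
  --cacert=/etc/kubernetes/pki/etcd/ca.crt \
  --cert=/etc/kubernetes/pki/etcd/server.crt \
  --key=/etc/kubernetes/pki/etcd/server.key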

I once watched a teammate accidentally point two clusters at the same etcd endpoint. Within seconds, resources from cluster A started showing up in cluster B. Lesson learned: treat etcd endpoints like root passwords.

3.5. Cloud Controller Manager (Optional): The Polyglot Diplomat

Kubernetes prides itself on running “anywhere,” but public clouds wrap nodes in proprietary plumbing—ELBs on AWS, Routes on GCP, Load Balancers on Azure.


The Cloud Controller Manager (CCM) hides these quirks:

  • Node Lifecycle: keeps Node objects in sync with the cloud’s view of your VMs and removes nodes whose machines are gone
  • Load Balancer: allocates native cloud LBs for Service objects
  • Volume Provisioning: provisions EBS, PD, or Disk volumes on demand
  • Routes & Networking: programs subnet routes so pods can talk cluster‑wide

If you run Kubernetes on bare metal or Raspberry Pis, you typically skip the CCM entirely and fill the gaps yourself, for example by using MetalLB to hand out addresses for LoadBalancer Services.
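The easiest way to see the CCM earning its keep is a Service of type LoadBalancer: on a cloud, the CCM provisions a native load balancer for it; on bare metal without something like MetalLB, it sits in “Pending” forever. A minimal sketch, with illustrative names:

apiVersion: v1
kind: Service
metadata:
  name: web-lb               # illustrative name
spec:
  type: LoadBalancer         # the CCM turns this into an ELB / cloud load balancer
  selector:
    app: web
  ports:
  - port: 80                 # the port the cloud load balancer exposes
    targetPort: 8080         # the port your containers listen on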

4. Components of a Worker Node: What Really Keeps Things Running

So far, we’ve talked a lot about the Master Node—the brains of the operation—but what about the brawn? The Worker Nodes are where your actual applications live and breathe. They're not just silent machines running containers; they’re more like mini-operating centers doing a lot under the hood to make sure everything works properly.

Here’s a breakdown of what powers each worker node and what makes it tick.

4.1. Kubelet — The Pod’s Bodyguard

Every node in your cluster runs something called a kubelet. If I had to describe it simply, I’d say it’s like a very strict bodyguard who has a checklist of what needs to be running on your node, and won't rest until every item is ticked off.


Kubelet gets instructions from the control plane (via the API Server), and then takes care of the pods it has been assigned. But it doesn’t just launch them and walk away—it constantly checks in to see if they’re alive, healthy, and running as expected.

Here’s what kubelet handles behind the scenes:

  • Starting and stopping containers according to the specifications in your pod YAML.
  • Monitoring the node’s health and telling the Master Node if something goes wrong.
  • Enforcing resource limits like CPU and memory (as defined in your pod specs).
  • Setting up networking so that your pods can talk to each other and the outside world.
  • Managing volumes—so your pods can access storage as needed.

Personal experience

I once messed up a pod’s resource limits—gave it too little memory—and it kept getting OOMKilled. The kubelet tried to restart it again and again, just like a dedicated teammate doing their job. That little incident taught me the importance of resource requests and limits, and also earned kubelet my respect.
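For reference, here’s roughly what a sane requests/limits block looks like (the numbers are illustrative; size them for your own workload):

apiVersion: v1
kind: Pod
metadata:
  name: api-pod                             # illustrative name
spec:
  containers:
  - name: api
    image: registry.example.com/api:latest  # placeholder image
    resources:
      requests:
        cpu: "250m"          # what the Scheduler reserves on the node
        memory: "256Mi"
      limits:
        cpu: "500m"
        memory: "512Mi"      # exceed this and the container gets OOMKilled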

4.2. Kube‑Proxy — The Network Switchboard Operator

Next on the list is kube-proxy. It’s one of those components that you won’t even notice… until networking breaks.


To simplify: kube-proxy is like a smart switchboard operator. Its job is to make sure traffic inside your cluster goes to the right pod at the right time. It sets up routing rules so that when a request comes into a node—say, someone hitting your web app—it gets sent to one of your running pods, automatically and efficiently.

There are a few ways kube-proxy can handle this routing:

  1. User space mode: It routes traffic by acting like a middleman. It’s slow and rarely used now.
  2. iptables mode: This is the default for most Kubernetes setups. It’s faster and uses the Linux kernel’s iptables to direct traffic.
  3. IPVS mode: Think of this as the upgraded version. It uses the Linux IP Virtual Server for even more efficient load balancing, especially useful when you’re running at scale.

If your services are acting weird or traffic is being dropped, kube-proxy is the first thing you should check.
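The rules kube-proxy programs come from Service objects. A minimal ClusterIP Service, with illustrative names, looks like this; every node’s kube-proxy turns it into iptables or IPVS rules that spread traffic across the matching pods:

apiVersion: v1
kind: Service
metadata:
  name: web-service          # illustrative name
spec:
  selector:
    app: web                 # traffic is balanced across pods carrying this label
  ports:
  - port: 80                 # the Service's ClusterIP port
    targetPort: 8080         # the port the container actually listens on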

4.3. Container Runtime — The Real Container Engine

Here’s the thing: Kubernetes doesn’t run containers directly. That’s where the container runtime comes in. You can think of it as the actual engine behind your pods. Kubernetes tells the container runtime what to do—and the runtime does the heavy lifting.

So what exactly does it handle?

  • Pulling container images from registries like Docker Hub or your private repository.
  • Starting and stopping containers, managing their lifecycle.
  • Networking and volume mounting—hooking containers into the right networks and file systems.
  • Keeping containers isolated and ensuring they follow security and resource boundaries.

There are several runtimes out there:

  • Docker: It used to be the default, but Kubernetes removed its built-in Docker integration (dockershim) in v1.24. Images built with Docker still run fine on the runtimes below.
  • containerd: This is now the go-to choice for most clusters. Lightweight, reliable, and fast.
  • CRI-O: Designed specifically for Kubernetes. It’s minimalist and built for performance.
  • rkt (rocket): Was once promising, but is no longer actively maintained. Still good to know for legacy cases.

Quick tip from experience

If you're ever wondering why your pods are stuck in “ContainerCreating,” it’s often something going wrong at the container runtime level—maybe the image can’t be pulled, or the runtime crashed. Logs from the runtime (like containerd) will sometimes give you more clues than Kubernetes’ own logs.
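When that happens, I check things in roughly this order; the journalctl line assumes containerd managed by systemd (the common setup), and the pod name is a placeholder:

# Kubernetes' view: the Events section usually names the failing image pull
kubectl describe pod <pod-name>

# The runtime's own view (assumes containerd running under systemd)
journalctl -u containerd --since "1 hour ago"

# List containers as the runtime sees them (requires crictl from cri-tools)
crictl ps -a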

In a Nutshell

Each Worker Node isn’t just “a machine that runs containers.” It’s got several key players: kubelet to manage and report, kube-proxy to route traffic, and the container runtime to actually run the containers. All of these work in harmony to keep your applications healthy, reachable, and performant.

Once you understand how these components play together, debugging becomes easier, scaling becomes logical, and your trust in Kubernetes goes way up.

Picture a restaurant kitchen: the kubelet is the head chef taking orders, the container runtime is the stove, and kube‑proxy is the waiter routing dishes to tables. Together, they keep the dining room—your application—humming along.

5. A Day in the Life of a Pod (Practical Walk‑Through)

Let’s ground all this theory with an example. Say you run:

kubectl apply -f web-deployment.yaml

  1. API Server validates the YAML, stores the desired state (3 replicas) in etcd.
  2. Scheduler sees new unscheduled pods and binds each to the most suitable node.
  3. kubelet (Worker) on each selected node pulls the container image, starts the pod, and reports status back.
  4. Controller Manager watches until all three pods are “Running ✓ 1/1 Ready.”
  5. kube‑proxy updates routing so that incoming traffic to the web-service ClusterIP spreads evenly across the three pods.
  6. If a node crashes, the NodeController flags it NotReady, the ReplicaSetController notices only two replicas remain, and the Scheduler places a fresh pod on a healthy node—often before end‑users ever notice.
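If you want to watch steps 2 through 5 happen live, two terminals are enough:

# Terminal 1: pods move from Pending to ContainerCreating to Running
kubectl get pods -w

# Terminal 2: scheduling decisions, image pulls, and restarts show up as events
kubectl get events --sort-by=.metadata.creationTimestamp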

All that control‑plane orchestration took maybe a few hundred milliseconds (the image pull adds a few seconds), yet no human clicked a thing. That’s Kubernetes architecture at work.

6. Why This Separation Matters (Scale, Resilience, Velocity)

  • Scale: You can add dozens of worker nodes without touching the control plane—handy during Black Friday traffic surges.
  • Resilience: The control plane can run replicated across three Masters. Lose one, and Raft consensus inside etcd keeps the cluster safe.
  • Velocity: Developers push declarative YAML; Kubernetes closes the gap between desired and actual state automatically.

I’ve witnessed a team adopt Kubernetes and move from weekly releases to multiple safe deploys per day—solely because they trusted the platform’s self‑healing design.

7. Friendly Tips for Beginners

  1. Start Small: Spin up a local cluster with kind or minikube. Break it on purpose, then fix it; the muscle memory is priceless.
  2. Watch the Logs: kubectl logs, kubectl describe, and the API Server logs are your best teachers.
  3. Backup etcd: Even for dev clusters. A failed laptop taught me that the hard way.
  4. Label Nodes: Give nodes meaningful labels (gpu=true, zone=us‑east‑1a) so the Scheduler can make smarter choices.
  5. Respect Resource Requests/Limits: Omit them, and your pods run with no guarantees (BestEffort), which can bite you hard when a node comes under memory pressure.
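A couple of those tips boil down to one-liners worth keeping handy; the node name and label here are just examples:

# Tip 4: label a node so the Scheduler (via nodeSelector or affinity) can target it
kubectl label nodes worker-1 gpu=true

# Confirm what the Scheduler will see
kubectl get nodes --show-labels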

Conclusion: Architecture Is a Story, Not a Diagram

Kubernetes architecture can look intimidating—dozens of arrows, boxes, and acronyms. But under the hood, it’s a simple story: a thoughtful brain (Master Node) watches a bustling city (Worker Nodes) and constantly nudges it toward the state you asked for. Once you grasp the roles of the API Server, Scheduler, Controller Manager, etcd, and their on‑the‑ground partners (kubelet, container runtime, and kube‑proxy), the magic starts to feel logical—even predictable.

So grab two machines (real or virtual), create a tiny cluster, and repeat my opening experiment—but hopefully without the log‑induced panic. When your pods spin up on cue and traffic flows smoothly, you’ll feel the same “aha!” moment that hooked me years ago. And you’ll know not just how Kubernetes works, but why its architecture matters.