Chapter Five

Scheduling and Scaling

How Kubernetes decides where Pods run and how many — requests and limits, the scheduler's filters, the autoscalers, health probes, and the budgets that protect availability.

6 topics

Two questions decide cluster behavior under load: where does each Pod run, and how many copies exist. Get the inputs wrong and you get evictions, throttling, and pages at 3 a.m.

This chapter covers the levers: requests and limits that drive both scheduling and eviction, the scheduler's filter-and-score logic with affinity and taints, the three autoscalers, the probes that gate traffic and restarts, and the budgets and QoS classes that decide who survives pressure.

Topics in This Chapter