---
title: "The auto-scaling part: VPA, HPA, KEDA, Nodes, How do they dance"
weight: 10
tags:
  - rejekts
---

{{% button href="https://www.youtube.com/watch?v=1US_-3udMDo" style="warning" icon="video" %}}Watch talk on YouTube{{% /button %}}

## Hypothesis

- In 2024, 27% of cloud spend was wasted
- 100ms delay => decrease in sales

## Pod resources

- Requests: Inform the scheduler's decision
  - Too low: Pods get scheduled on strained nodes
  - Too high: Wasted resources
- Limits: Throttles (CPU) or kills (Memory) if reached
- QoS: Determines the eviction priority during resource pressure
  - Guaranteed (Requests = Limits)
  - Burstable (Limits > Requests)
  - BestEffort (Nothing defined)
- Gotcha: CPU throttling can happen before the triggers fire if requests and limits are very close

TODO: Steal table from Slides

Requests | 100m, 256Mi | 100m, 256Mi
Limits | 100m, 256Mi | None or