docs: new talk

2025-03-30 18:16:00 +02:00
parent cb8d7f9d48
commit 17b4407fea
1 changed files with 72 additions and 0 deletions
--- a/content/day-2/10_auto-scale.md
+++ b/content/day-2/10_auto-scale.md
@@ -0,0 +1,72 @@
+---
+title: "The auto-scaling part: VPA, HPQ, KEDA, Nodes, How do they dance"
+weight: 10
+tags:
+ - rejekts
+---
+
+<!-- {{% button href="https://youtu.be/rkteV6Mzjfs" style="warning" icon="video" %}}Watch talk on YouTube{{% /button %}} -->
+
+## Hypothesis
+
+- In 2024 27% of cloud spent was wasted
+- 100ms delay => decrease in sales
+
+## Pod resources
+
+- Requests: Informs scheduler's decision
+    - Too low: Schedule on strained nodes
+    - Too high: Wasted resources
+- Limits: Throttels (CPU) or Kills (Memory) if reached
+- QoS: sort the eviction priority during ressource pressure
+    - Quranteed (request=limits)
+    - Burstable (Limits>Requests)
+    - Best effort (Nothing defined) 
+- Gotcha: CPU throtteling can happen before tirggers happen if requests and limits are very close
+
+TODO: Steal table from Slides
+
+Requests | 100m, 256Mi | 100m, 256Mi
+Limits |100m, 256Mi | None or <limits
+QoS | Gurantee | Burstable | Best effort
+
+## Scalers
+
+- VPA: Moar power aka reccomend requests
+- HPA: Moar moar aka more replicas
+- KEDA: Proxy over HPA
+
+### VPA
+
+Modes:
+- Off: Dry-Run
+- Initial: Applies Reccomendations to new Pods (can be used for finding out)
+- Auto/Recreate: Evicts and restarts pods to update resources
+
+Trigger: Usually Memory
+Tip: `maxAllowed` in order to not exhaust stuff
+
+
+### HPA
+
+- Trigger: Usually cpu (percent of requests)
+- Formula: $1+\frac{usage}{target}$
+- Fun fact: Can not scale to 0
+
+### KeDA
+
+- Basicly automates HPA with flexible metrics (from different soruces)
+- Can scale Jobs
+- Can Scale to 0
+
+## Anti patterns
+
+TODO: Steal from slides
+
+| Pattern | Bad | Better
+| CPI limit = Requests | Throtteling before scale | Set requests only |
+
+
+## Demo
+
+Auto scaling meme generator (see slides/video)