GCP-2：谷歌云GKE的Autoscaling操作

本文链接：https://blog.csdn.net/rookie_orrico/article/details/136706167

本文详细介绍了谷歌云GKE的四种主要自动扩展机制，包括ClusterAutoscaling(CA)、NodeAuto-provisioning(NAP)、HorizontalPodAutoscaling(HPA)和VerticalPodAutoscaling(VPA)，以及它们在创建集群、部署工作负载时的应用和配合策略。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

谷歌云GKE Autoscaling大致分4种：infrastructure有Cluster Autoscaling (CA)和Node Auto-provisioning (NAP)，workload有Horizontal Pod Autoscaling (HPA)和Vertical Pod Autoscaling (VPA)

-谷歌云GKE Cluster的创建有2中：autopilot（只需要考虑HPA，其他autoscaling为default）和standard（需要考虑所有autoscaling）

Infrastructure Autoscaling：

1. Cluster Autoscaling (CA)：可以自动创建相同规格的node（基本上都会用）

#: gcloud container clusters update <cluster> --enable-autoscaling --min-nodes=1 --max-nodes=3

2. Node Auto-provisioning (NAP)：可以自动创建不同规格的node（新的machine type等，即创建新的node pool，适用于价格优化的batch job）

#: gcloud container clusters update <cluster> --enable-autoprovisioning --min-cpu=0.5 --max-cpu=2

-场景：需要deploy的job有resource request和toleration，NAP会自动创建新的node pool和新的machine type来配对job的resource request和node taint来配对job的toleration；当job完成后，NAP会自动关闭新创建的node pool

Workload Autoscaling：

3. Horizontal Pod Autoscaling (HPA)：可以自动创建相同规格的Pod（基本上都会用）

-必须在deployment中设置CPU的resource request用做HPA autoscaling的基准

#: kubectl autoscale deploy nginx --max=3 --cpu-percent=70

#: kubectl get hpa nginx -o yaml（可以查看hpa的spec，也可以进行修改）