integ/kubernetes
Andre Kantek 1c628f50a6 Test kube-api server availability for longer time
It was observed that during dead office recovery (when all nodes
restart simultaneously) that the availability of kube-api-server can
be larger than the current wait loops used to check pod stabilization.
This can lead to a failed execution of important pod operations to
recover them to normal operation.

This change adds an initial specific loop to wait for the API server
availability, in case of failure it exits the script and now the
service is marked as failed which will trigger new attempts to execute
in case of failure (with a 10s interval between service executions).

A 5 minutes time was chosen to have a better chance of finishing the
script without needing to have systemd to restart the service.

The monitoring is done using the instructions in:
https://kubernetes.io/docs/reference/using-api/health-checks/

Test Plan:
install AIO-DX and:
[PASS] execute nodes lock/unlock and observe the service correct
       execution, kube-api is immediately available.
[PASS] execute simultaneous node reboots and observe that the service
       remains waiting for kube-api server for some time (in vbox
       it was 2.5min) and after that it executes the script

Closes-Bug: 2089864

Change-Id: I0d77da1735ecb829ab1da013fe93431688e4cb97
Signed-off-by: Andre Kantek <andrefernandozanella.kantek@windriver.com>
2024-11-29 15:04:33 -03:00
..
chartmuseum/debian Remove CentOS/OpenSUSE build support 2024-05-01 16:39:19 -04:00
cni debian-pkg: Uprev cni plugins 2024-05-23 07:19:47 -04:00
containerd/debian Remove CentOS/OpenSUSE build support 2024-05-01 16:39:19 -04:00
crictl/debian Fix lint errors identified by Zuul pylint job 2023-03-15 12:07:17 +00:00
docker-distribution/debian Remove CentOS/OpenSUSE build support 2024-05-01 16:39:19 -04:00
etcd/debian Merge "Remove CentOS/OpenSUSE build support" 2024-05-22 15:14:42 +00:00
helm Remove MAKEKUBEAPI package 2024-06-03 10:14:14 -03:00
k8s-cni-cache-cleanup Remove CentOS/OpenSUSE build support 2024-05-01 16:39:19 -04:00
k8s-pod-recovery Test kube-api server availability for longer time 2024-11-29 15:04:33 -03:00
kubernetes-1.24.4/debian Modify kube-apiserver cpu resource limits and readiness probes 2024-11-08 16:05:32 -05:00
kubernetes-1.25.3/debian Modify kube-apiserver cpu resource limits and readiness probes 2024-11-08 16:05:32 -05:00
kubernetes-1.26.1/debian Modify kube-apiserver cpu resource limits and readiness probes 2024-11-08 16:05:32 -05:00
kubernetes-1.27.5/debian Modify kube-apiserver cpu resource limits and readiness probes 2024-11-08 16:05:32 -05:00
kubernetes-1.28.4/debian Modify kube-apiserver cpu resource limits and readiness probes 2024-11-08 16:05:32 -05:00
kubernetes-1.29.2/debian Modify kube-apiserver cpu resource limits and readiness probes 2024-11-08 16:05:32 -05:00
kubernetes-unversioned/debian Merge "Remove CentOS/OpenSUSE build support" 2024-05-22 15:14:42 +00:00
n3000/debian Merge "Remove CentOS/OpenSUSE build support" 2024-05-22 15:14:42 +00:00
plugins Merge "Remove CentOS/OpenSUSE build support" 2024-05-22 15:14:42 +00:00
runc/debian Up-rev runc package to 1.1.12 2024-03-20 09:43:36 -04:00