You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When checking for pod status in case of Prometheus deployment the wait_for_resource func currently checks if pods are in Ready state or not and based on current state waits till max retries are exhausted. In certain situations where some Pods don't come up immediately in a given namespace or are in init state and one pod is already available this check skips validating the remaining pod status.
One of the sample run-log from kepler-operator CI:
2024-03-28T10:34:11.7436108Z
2024-03-28T10:34:11.7437530Z �[1m🔆🔆🔆 Waiting for pods in monitoring to be ready 🔆🔆🔆 �[0m
2024-03-28T10:34:11.7451616Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
2024-03-28T10:34:11.7467141Z 10:34:11 🔔 INFO : Waiting for pods to be in Ready state
2024-03-28T10:34:11.7467532Z
2024-03-28T10:34:13.7202430Z pod/prometheus-operator-86c875f999-zgpkm condition met
2024-03-28T10:34:13.7770338Z pod/prometheus-operator-86c875f999-zgpkm condition met
2024-03-28T10:34:13.7771429Z error: condition not met for pods/prometheus-k8s-0
2024-03-28T10:34:13.7790130Z ❌ pods --all -n monitoring failed to be in Ready state
2024-03-28T10:34:13.7790728Z
2024-03-28T10:34:13.7791175Z ❌ Pods below failed to run
2024-03-28T10:34:13.7791565Z
2024-03-28T10:34:13.8257085Z NAME READY STATUS RESTARTS AGE
2024-03-28T10:34:13.8258340Z prometheus-k8s-0 0/2 Init:0/1 0 1s
2024-03-28T10:34:13.8344266Z fail to setup local-dev-cluster
The text was updated successfully, but these errors were encountered:
When checking for pod status in case of Prometheus deployment the wait_for_resource func currently checks if pods are in Ready state or not and based on current state waits till max retries are exhausted. In certain situations where some Pods don't come up immediately in a given namespace or are in init state and one pod is already available this check skips validating the remaining pod status.
One of the sample run-log from kepler-operator CI:
The text was updated successfully, but these errors were encountered: