-
Notifications
You must be signed in to change notification settings - Fork 93
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
OpenCost metrics interfere with OpenShift's "degraded control plane" detection? #249
Comments
Hmmm, this seems to completely break any cost calculation in OpenCost. After setting this, there are no more metrics visible. I enabled the |
Even after re-enabling |
As just stated in #252 I am not sure if this issue should rather go to the opencost repository, as it seems (to me, with the knowledge I have today...) like not just a problem of disabling some things on OpenShift, but a general problem of OpenCost not working on OpenShift without interfering with OpenShift itself? |
Hi @kastl-ars |
Thank you! We are using the latest chart version 1.43.1. |
The more pressing issue would be #252 as a wrong CPU count sounds more problematic. But my guess is they are related... |
Thank you, |
Dear OpenCost maintainers,
since last week we noticed that our OpenShift cluster show a degradation warning, as only 50% of the apiservers are responding.
Turns out this seems to be related to metrics exposed by OpenCost, scraped by Prometheus and then returned by the query used for this degradation detection.
We have explictly disabled the emission of pod annotations, namespace annotations and ksm V1 metrics and the error vanished.
The following lines appeared in the deployment:
I would like to see this added to the documentation that @mittal-ishaan was working on IIRC.
The query that went wrong was this:
Before we introduced the workaround described above, this returned 6 pods, while only three were really running. Hence the degradation warning as only 50% were working...
Kind Regards,
Johannes
The text was updated successfully, but these errors were encountered: