Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Kubernetes Integration] Implement Rate Limit mechanism for Kubernetes API calls #37922

Open
Tracked by #3801
gizas opened this issue Feb 8, 2024 · 1 comment
Open
Tracked by #3801
Labels
enhancement Team:Cloudnative-Monitoring Label for the Cloud Native Monitoring team

Comments

@gizas
Copy link
Contributor

gizas commented Feb 8, 2024

Describe the enhancement:

Kubernetes Integration performs API calls to the Kubernetes API in order to retrieve information for the resources it monitors .It uses the provided information to enrich the metadata of the documents that ingests. The implementation is based on client-go library and this is an example of a watcher that will start monitor namespace events and performs API calls in the background. The mechanism of API calls should have a way to limit the number of calls towards KubeAPi to prevent the exhaustion of the server.

Cases of 429 Too many requets from APi server, should be handled gardually from the code, intially to limit/stop the requests when error occurs (or if possible if in big delays of responses) and perform a retry on a given time period.

Describe a specific use case for the enhancement or feature:

We have seen issues (3991, 4325) that customers complain for the large number of API calls beats or elastic agent performs to the KubeAPI, especially in big scale clusters.

Related Links

@botelastic botelastic bot added the needs_team Indicates that the issue/PR needs a Team:* label label Feb 8, 2024
@gizas gizas added the Team:Cloudnative-Monitoring Label for the Cloud Native Monitoring team label Feb 8, 2024
@botelastic botelastic bot removed the needs_team Indicates that the issue/PR needs a Team:* label label Feb 8, 2024
@anson627
Copy link

we found API server OOM killed where beats list the resources nodes/namespaces/replicasets from the daemonset pods from all nodes, is there any plan to move forward with this issue?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement Team:Cloudnative-Monitoring Label for the Cloud Native Monitoring team
Projects
None yet
Development

No branches or pull requests

2 participants