[Proposal] Efficient and Greener way to use k8s cluster for benchmarking tasks #67

dipankardas011 · 2024-02-22T03:07:18Z

Proposal

Context: Use of autoscaler to scale the cluster up / down

Why: Assuming that benchmarking or other tasks related to specific projects run only for finite intervals, also that the the event of doing this is MP is a release event for all supported projects

Expected outcome: when we have to run specific project benchmark tasks we can use the OpenTOFU to add a node and we can attach node labels, etc. and then we can schedule our workload to it. once done with all the processing of the tasks we can store the results in Grafana or something and then de-provision the node we allocated to free up the node we provisioned before

Achievement: reduced costs, also demonstrates how can we optimize the Tests on each project

Challenges:

@AntonioDiTuri 📔 About adding a node on demand it would be nice for the next release, I guess a tradeoff would be startup time, a new node might take a while to set up.)

rossf7 · 2024-04-16T17:41:25Z

@dipankardas011 Thanks again for creating this proposal. Reducing the footprint of the cluster is important to us as a WG and will be important as we onboard more projects.

We deferred this from the Q2 pipeline automation work we are starting. This is because the scope of that work is already large. However IMO the automation we will be developing in #84 would also help with this in future.

There are some challenges here and I expect it will take around 10 mins to join nodes to the cluster. However none of the metrics we capture are time sensitive and the benchmarking also takes time so I don't see that as a blocker.

dipankardas011 · 2025-02-26T16:47:43Z

Updates

we are planning for subset of kubernetes cluster to be scalable. Worker node where the benchmarking job will work are going to be made scalable to zero

later we can also think of batching benchmarking jobs where we can have N: no of benchmarking jobs we spin up and add the node to the cluster perform the benchmarking of all those N jobs and once its done we can then free up the baremetal (suggestion by @leonardpahlke)

nikimanoledaki added this to TAG-Environmental-Sustainability Feb 28, 2024

dipankardas011 mentioned this issue Apr 16, 2024

[ACTION] Proposal 1: Trigger and Deploy #84

Closed

5 tasks

rossf7 added kind/feature New feature or request board/wg-green-reviews priority/important-longterm area/cluster labels Apr 16, 2024

rossf7 mentioned this issue Aug 28, 2024

update to the README: description, images and roadmap #120

Merged

nikimanoledaki added this to Green Reviews Feb 12, 2025

nikimanoledaki moved this to Backlog in Green Reviews Feb 12, 2025

nikimanoledaki added the help wanted Extra attention is needed label Feb 12, 2025

nikimanoledaki moved this from Backlog to Ready in Green Reviews Feb 13, 2025

rossf7 mentioned this issue Feb 26, 2025

Investigate using Oracle Cloud bare metal for green reviews cluster #166

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Proposal] Efficient and Greener way to use k8s cluster for benchmarking tasks #67

[Proposal] Efficient and Greener way to use k8s cluster for benchmarking tasks #67

dipankardas011 commented Feb 22, 2024

rossf7 commented Apr 16, 2024

dipankardas011 commented Feb 26, 2025

[Proposal] Efficient and Greener way to use k8s cluster for benchmarking tasks #67

[Proposal] Efficient and Greener way to use k8s cluster for benchmarking tasks #67

Comments

dipankardas011 commented Feb 22, 2024

Proposal

rossf7 commented Apr 16, 2024

dipankardas011 commented Feb 26, 2025

Updates