
Add GPU right sizing for GPU slices created by instaslice and apply the recommendations #1449

Open
5 tasks
bharathappali opened this issue Jan 8, 2025 · 0 comments
Labels
enhancement New feature or request

bharathappali commented Jan 8, 2025

Describe the feature

Kruize currently works alongside VPA to apply recommendations, but VPA cannot make the GPU partitioning changes needed to apply the accelerator recommendations that Kruize generates. The Instaslice Operator on OpenShift partitions the GPU on the fly whenever a pod's GPU resources are changed. So if Kruize can track MIG partition usage and come up with MIG right-sizing recommendations, it can apply them via Instaslice: as and when Kruize changes a pod's GPU requests and limits, Instaslice can create the slice and assign it to the pod. This way the accelerator recommendations can also be applied via Kruize.
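
To make the right-sizing idea concrete, here is a minimal sketch of picking the smallest MIG profile that covers observed peak usage. This is not Kruize's actual recommendation logic. The function name, the `headroom` parameter, and the way usage is expressed (peak compute in units of MIG compute slices, peak memory in GB) are all assumptions for illustration, and the profile table only covers the A100 40GB.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class MigProfile:
    name: str
    compute_slices: int  # compute slices out of 7 on an A100
    memory_gb: int


# MIG profiles of an A100 40GB, ordered smallest first.
A100_40GB_PROFILES = [
    MigProfile("1g.5gb", 1, 5),
    MigProfile("2g.10gb", 2, 10),
    MigProfile("3g.20gb", 3, 20),
    MigProfile("4g.20gb", 4, 20),
    MigProfile("7g.40gb", 7, 40),
]


def recommend_profile(
    peak_compute_slices: float,
    peak_memory_gb: float,
    headroom: float = 0.2,  # hypothetical safety margin, not a Kruize setting
) -> Optional[MigProfile]:
    """Return the smallest profile that fits peak usage plus headroom."""
    need_compute = peak_compute_slices * (1 + headroom)
    need_memory = peak_memory_gb * (1 + headroom)
    for profile in A100_40GB_PROFILES:
        if profile.compute_slices >= need_compute and profile.memory_gb >= need_memory:
            return profile
    # Nothing fits: the workload needs more than one full GPU.
    return None
```

For example, a workload that peaks at 1.5 compute slices and 12 GB would not fit `2g.10gb` on memory, so this sketch would pick `3g.20gb`.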

Suggest a solution

Kruize should be able to track the usage of MIG partitions (which are created by Instaslice), come up with accelerator recommendations, and apply them by updating the Kubernetes Deployment / StatefulSet / Job object with the latest requests and limits, letting Instaslice create the GPU slices on the fly.
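
A rough sketch of what "updating the object with latest requests and limits" could look like as a strategic merge patch body for a Deployment. The helper name and its parameters are hypothetical, and the `nvidia.com/mig-<profile>` resource naming is an assumption about how the slices are exposed in the cluster; the resource name Instaslice actually expects should be confirmed before relying on this.

```python
import json
from typing import Optional


def build_gpu_patch(
    container_name: str,
    mig_profile: str,
    previous_profile: Optional[str] = None,
    count: int = 1,
) -> dict:
    """Build a strategic merge patch that switches a container to a new MIG profile."""
    # Assumed resource naming scheme, e.g. "nvidia.com/mig-1g.5gb".
    new_resource = f"nvidia.com/mig-{mig_profile}"
    quantity = str(count)

    # Extended resources must have requests equal to limits.
    requests = {new_resource: quantity}
    limits = {new_resource: quantity}

    if previous_profile and previous_profile != mig_profile:
        # A null value in a merge patch removes the old key,
        # so the pod does not end up asking for both slices.
        old_resource = f"nvidia.com/mig-{previous_profile}"
        requests[old_resource] = None
        limits[old_resource] = None

    return {
        "spec": {
            "template": {
                "spec": {
                    "containers": [
                        {
                            # Containers are merged by name in a strategic merge patch.
                            "name": container_name,
                            "resources": {"requests": requests, "limits": limits},
                        }
                    ]
                }
            }
        }
    }


if __name__ == "__main__":
    patch = build_gpu_patch("trainer", "1g.5gb", previous_profile="3g.20gb")
    print(json.dumps(patch, indent=2))
```

With the official Kubernetes Python client, a body like this could be passed to `AppsV1Api.patch_namespaced_deployment`. Changing the pod template triggers a rollout, and the new pods would then request the smaller slice.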

Checklist of changes to be done
