Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

I have found that the agent component achieves resource oversubscription and mixed online/offline deployment based on CPU and memory resources. Is it possible to achieve these based on GPU resources as well? #3945

Open
sakuradiss opened this issue Dec 31, 2024 · 5 comments
Labels
area/agent Issues or PRs related to volcano agent area/colocation ssues or PRs related to colocation features kind/question Categorizes issue related to a new question

Comments

@sakuradiss
Copy link

Please describe your problem in detail

I have found that the agent component achieves resource oversubscription and mixed online/offline deployment based on CPU and memory resources. Is it possible to achieve these based on GPU resources as well?

Any other relevant information

No response

@sakuradiss sakuradiss added the kind/question Categorizes issue related to a new question label Dec 31, 2024
@Monokaix
Copy link
Member

Can you describe your user case more detail?Currently volcano also support GPU virtualization, you can see https://github.com/volcano-sh/volcano/blob/master/docs/user-guide/how_to_use_gpu_number.md for more detail: )

@Monokaix
Copy link
Member

/area agent

@volcano-sh-bot volcano-sh-bot added the area/agent Issues or PRs related to volcano agent label Dec 31, 2024
@Monokaix
Copy link
Member

/area colocation

@volcano-sh-bot volcano-sh-bot added the area/colocation ssues or PRs related to colocation features label Dec 31, 2024
@sakuradiss
Copy link
Author

Can you describe your user case more detail?Currently volcano also support GPU virtualization, you can see https://github.com/volcano-sh/volcano/blob/master/docs/user-guide/how_to_use_gpu_number.md for more detail: )你能更详细地描述一下你的用户案例吗?目前volcano也支持GPU虚拟化,你可以查看https://github.com/volcano-sh/volcano/blob/master/docs/user-guide/how_to_use_gpu_number.md了解更多细节:)

When I was reviewing the link https://github.com/volcano-sh/volcano/blob/master/docs/design/colocation/Overview.md, I noticed that their resource management and limitations are focused on CPU and memory. However, I would like to use GPU resources and memory for building a complex model training and inference system that supports both online and offline mixed deployment. Is this approach feasible? Thank you for your answer.

@Monokaix
Copy link
Member

Can you describe your user case more detail?Currently volcano also support GPU virtualization, you can see https://github.com/volcano-sh/volcano/blob/master/docs/user-guide/how_to_use_gpu_number.md for more detail: )你能更详细地描述一下你的用户案例吗?目前volcano也支持GPU虚拟化,你可以查看https://github.com/volcano-sh/volcano/blob/master/docs/user-guide/how_to_use_gpu_number.md了解更多细节:)

When I was reviewing the link https://github.com/volcano-sh/volcano/blob/master/docs/design/colocation/Overview.md, I noticed that their resource management and limitations are focused on CPU and memory. However, I would like to use GPU resources and memory for building a complex model training and inference system that supports both online and offline mixed deployment. Is this approach feasible? Thank you for your answer.

Can GPU virtualization meet your need?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/agent Issues or PRs related to volcano agent area/colocation ssues or PRs related to colocation features kind/question Categorizes issue related to a new question
Projects
None yet
Development

No branches or pull requests

3 participants