-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Refactor code in preparation for SGLang PR #5
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Moving some code to hip-attention is a brilliant idea 👍
This should make us to contribute SGLang easier.
So DeepAuto-AI/hip-attention#25 should be merged as well?
I think I need a review from Heejun before deciding to merge. Some changes might not be 100% correct. |
Sure thing! Every members needs to acknowledge code changes in |
It looks very GOOD! However, I have a suggestion:
What do you guys think about it? @daniel-geon-park @kbumsik |
If we pass a non-stable If that happens, the So, I don't think it is wise to make a change where our Instead, we could future-proof it by explicitly passing more values that we might need in the future to |
I am with @daniel-geon-park . We need to avoid a cyclic dependency. |
Thanks for the comment! I now understand it. Agree with @daniel-geon-park |
Thank you! |
Motivation
We need to make our SGLang dev branch fit for making a pull request to the upstream SGLang repo.
Modifications
For maximum flexibility and extensibility once our code is merged to mainstream, I moved most of the HiP Attention-internal logic from this repo into DeepAuto-AI/hip-attention:
Also, I remove the following files, and instead integrate their functionality into the existing SGLang classes:
Checklist