-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Integrating GPU based Vector Search using cuVS #14131
base: main
Are you sure you want to change the base?
Conversation
@@ -22,6 +22,7 @@ plugins { | |||
} | |||
|
|||
repositories { | |||
mavenLocal() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove mavenLocal before merging, if it happens. There will be issues with it - some are very cryptic and hard to diagnose (like different artifact hashes). It's going to be a major headache if it's left in.
@chatman thanks for creating the PR. This looks very interesting. is the idea here is the Lucene library will on a GPU machine and running the CUVS. |
.withNumWriterThreads(cuvsWriterThreads) | ||
.withIntermediateGraphDegree(intGraphDegree) | ||
.withGraphDegree(graphDegree) | ||
.withCagraGraphBuildAlgo(CagraGraphBuildAlgo.NN_DESCENT) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I have some experience with building the Cagra index, and I think NN_DESCENT is faster in cagra index creation but it has a high GPU memory footprint. Should we use IVF_PQ here? Or can we have a hybrid approach where if doc count is < a specific number then we use NN_DESCENT else IVF_PQ.
Description
This is an in-progress PR at the moment. Here is a way to test it out:
TODO:
This work is mainly done by @narangvivek10, @punAhuja and me, along with help from @cjnolet.