-
Notifications
You must be signed in to change notification settings - Fork 476
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
In what scenarios should QuantManagedCollisionEmbeddingCollection
be used?
#2724
Comments
hi, @iamzainhuda @kausv @dstaay-fb @henrylhtsang @PaulZhang12 can you see this problem? |
@tiankongdeguiji QuantManagedEmbeddingCollection is a stronger indication that the underlying EC is quantized. It is also to be used for inference as it implicitly calls reset_inference_mode(). |
thx! I believe that during the inference phase, the MC module will not do profile. Could you please explain the rationale behind setting |
I think reset_inference_mode() should also set self.training to False which we have missed. |
thx! Could you see this problem #2669 ? |
I believe that when we need to use a quantized
ManagedCollisionEmbeddingCollection
, we can configure the mapping fromEmbeddingCollection
toQuantEmbeddingCollection
withinquantize_dynamic
. This makes the existence of a QuantManagedCollisionEmbeddingCollection class unnecessary. In what scenarios shouldQuantManagedCollisionEmbeddingCollection
be used?The text was updated successfully, but these errors were encountered: