-
Notifications
You must be signed in to change notification settings - Fork 83
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support different embedding types of model response #1007
Support different embedding types of model response #1007
Conversation
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## main #1007 +/- ##
============================================
- Coverage 81.74% 81.71% -0.03%
+ Complexity 2509 1255 -1254
============================================
Files 190 95 -95
Lines 8560 4282 -4278
Branches 1436 718 -718
============================================
- Hits 6997 3499 -3498
+ Misses 1006 506 -500
+ Partials 557 277 -280 ☔ View full report in Codecov by Sentry. |
Can you provide sample request/response for each of new embedding type? Is the type configurable on the client side (neural search in this case), or model is smart enough to switch between types? Any changes required from user to use new type(s), for instance do they need to recreated model connector and index mapping? |
Sample of different embedding types:
The type is configurable in model side, in the connector specifically. When user wants to switch to a different embedding type, user needs to recreate a new model with different embedding type in the connector, also user should do a reindex to ensure the embeddings are using the new embedding type result, in this case, a new index should created and after the reindex user can specify alias on original index to new one. We don't consider modifying index mapping as we can only add new field in existing index mapping, not able to modify an existing field. |
@zane-neo Plz resolve the conflicting files. The code looks good to me. |
24cdeeb
to
f021c01
Compare
@zane-neo is this PR ready for review or you expect more changes? |
Signed-off-by: zane-neo <[email protected]>
Signed-off-by: zane-neo <[email protected]>
f021c01
to
264abc4
Compare
Signed-off-by: zane-neo <[email protected]>
Signed-off-by: zane-neo <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The code looks good to me. Nice PR!
The failure BWC tests are flaky, @martin-gaievski @heemin32 This PR is ready for review, thanks. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Overall the PR looks good to me. The bwc issue should be fixed after we 1. bump to 3.0.0-alpha1 and 2. consider loaded
and deployed
as valid model states.
I saw both this PR and #1141 are fixing the model state issue. We should confirm we're not making duplicate fixes. cc @martin-gaievski
@martin-gaievski Do you have a plan when to merge #1141? If it's going to have more changes, maybe we can merge this PR first? |
#1141 is merged now. We can merge the PR after rebase main |
@zane-neo Conflict files need to be resolved |
Signed-off-by: Martin Gaievski <[email protected]>
resolved conflict in CHANGELOG, @zane-neo in case more changes will be required please make sure to pull latest version and rebase on main |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
looks good to me, thank you
Description
Support different embedding types of model response
Related Issues
#1006
Check List
--signoff
.By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.