[FEATURE] Hybrid Search should provide scores of sub queries for understanding/debugging the results. #658

vamshin · 2024-03-31T09:26:24Z

Is your feature request related to a problem?

Hybrid search doesn't return the scores of each individual query, making it difficult to debug why fragments were included/excluded

What solution would you like?

As part of _explain API, we should provide scores of sub queries for understanding/debugging the results.

The text was updated successfully, but these errors were encountered:

smacrakis · 2024-08-09T20:51:47Z

Yes, customers would like to see both scores from hybrid search, both for debugging and for training LTR models.

smacrakis · 2024-08-09T20:53:08Z

We were also hoping for this feature in 2.16 for our own work (with OSC) in tuning hybrid search using LTR.

yuye-aws · 2024-08-12T00:53:22Z

Are we also going to support explain API for KNN queries like: opensearch-project/k-NN#875?

smacrakis · 2024-08-28T19:53:43Z

The documentation on _explain says "The explain API is an expensive operation in terms of both resources and time. On production clusters, we recommend using it sparingly for the purpose of troubleshooting."
If this is true, then returning the subquery scores via _explain is not going to be viable for LTR in production if the subquery scores are being used as features. Do we have a path to returning the scores more efficiently?

zhichao-aws · 2024-08-29T05:45:00Z

The documentation on _explain says "The explain API is an expensive operation in terms of both resources and time. On production clusters, we recommend using it sparingly for the purpose of troubleshooting." If this is true, then returning the subquery scores via _explain is not going to be viable for LTR in production if the subquery scores are being used as features. Do we have a path to returning the scores more efficiently?

I have the same question.
Customers may need the absolute scores as input features for downstream systems. While current hybrid query just normalize the scores and we lose that information.

vamshin added untriaged enhancement and removed untriaged labels Mar 31, 2024

github-actions bot added the untriaged label Mar 31, 2024

vamshin added v2.15.0 v2.16.0 and removed untriaged labels Mar 31, 2024

vamshin added this to Vector Search RoadMap Mar 31, 2024

github-project-automation bot moved this to Backlog in Vector Search RoadMap Mar 31, 2024

vamshin moved this from Backlog to Backlog (Hot) in Vector Search RoadMap Mar 31, 2024

vamshin moved this from Backlog (Hot) to 2.15.0 in Vector Search RoadMap Apr 1, 2024

bbarani added this to Test roadmap format Apr 9, 2024

github-project-automation bot moved this to Planned work items in Test roadmap format Apr 9, 2024

vamshin removed the v2.15.0 label May 31, 2024

vamshin moved this from 2.15.0 to 2.16.0 in Vector Search RoadMap May 31, 2024

martin-gaievski added v2.17.0 and removed v2.16.0 labels Aug 8, 2024

martin-gaievski removed this from Test roadmap format Aug 15, 2024

naveentatikonda added v2.18.0 and removed v2.17.0 labels Aug 29, 2024

naveentatikonda moved this from 2.17.0 to Now(This Quarter) in Vector Search RoadMap Aug 29, 2024

vamshin assigned martin-gaievski Sep 11, 2024

This was referenced Sep 12, 2024

[FEATURE] Provide way of defining configuration for the pipeline #904

Open

[RFC] Explainability for Hybrid query #905

Open

martin-gaievski mentioned this issue Oct 31, 2024

Explainability in hybrid query #970

Merged

5 tasks

martin-gaievski mentioned this issue Nov 1, 2024

[DOC] Enabling explainability for hybrid query opensearch-project/documentation-website#8645

Open

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEATURE] Hybrid Search should provide scores of sub queries for understanding/debugging the results. #658

[FEATURE] Hybrid Search should provide scores of sub queries for understanding/debugging the results. #658

vamshin commented Mar 31, 2024

smacrakis commented Aug 9, 2024

smacrakis commented Aug 9, 2024

yuye-aws commented Aug 12, 2024

smacrakis commented Aug 28, 2024

zhichao-aws commented Aug 29, 2024

[FEATURE] Hybrid Search should provide scores of sub queries for understanding/debugging the results. #658

[FEATURE] Hybrid Search should provide scores of sub queries for understanding/debugging the results. #658

Comments

vamshin commented Mar 31, 2024

Is your feature request related to a problem?

What solution would you like?

smacrakis commented Aug 9, 2024

smacrakis commented Aug 9, 2024

yuye-aws commented Aug 12, 2024

smacrakis commented Aug 28, 2024

zhichao-aws commented Aug 29, 2024