Query::term_weights is not assigned #501

Closed
elshize opened this issue Dec 24, 2022 · 11 comments · Fixed by #561

@elshize
Member

elshize commented Dec 24, 2022

Query::term_weights seems to never be assigned. The --weighted flag in the queries and evaluate_queries programs ignores it; instead, the weights are resolved when creating cursors.

@seanmacavaney

I noticed this recently when working on the Python integration. Fixing this would simplify the integration since we wouldn't need to resort to term repetition to apply the weighting.
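For context, the term-repetition workaround can be sketched roughly like this (a hypothetical helper, not the actual integration code): an integer weight is emulated by repeating the term that many times in the query's term list.

```cpp
#include <string>
#include <utility>
#include <vector>

// Hypothetical sketch of the workaround: since term_weights is ignored,
// an integer weight w is emulated by listing the term w times.
std::vector<std::string> repeat_for_weight(
    const std::vector<std::pair<std::string, int>>& weighted)
{
    std::vector<std::string> terms;
    for (const auto& [term, weight] : weighted) {
        for (int i = 0; i < weight; ++i) {
            terms.push_back(term);
        }
    }
    return terms;
}
```

This only supports integer weights, which is part of why honouring term_weights directly would simplify the integration.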

@elshize
Member Author

elshize commented Dec 31, 2022

@seanmacavaney thanks for letting us know. Just to be clear: by fixing, you mean that term_weights should be used and should affect query processing, right?

I have some other stuff open now that I want to merge first, but I can look closer into this after that.

@seanmacavaney

Yes. I expected that term_weights would be honoured during query processing, but they appear to only be used in intersection.

The issue isn't urgent though. Query term repetition is good enough for the time being.

@elshize
Member Author

elshize commented Dec 31, 2022

> I expected that term_weights would be honoured during query processing

Yeah, I was also surprised :)

@JMMackenzie
Member

I always figured we had term_weights for future-proofing, in case we wanted to support query inputs like hello:20 world:10 or something along those lines.

Would it be more or less painful to have a vector of pairs/structs? Having separate vectors accessed by index seems a bit tedious and error-prone, at least in my opinion.

Thinking something like:

struct query_term {
  uint32_t term_id;
  double weight;
  ...
};
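A minimal sketch of how such a struct could pair with the hello:20 world:10 input idea above; the parser name is hypothetical, and it keeps string tokens rather than term IDs (the lexicon lookup would happen elsewhere). A bare token defaults to weight 1.0.

```cpp
#include <sstream>
#include <string>
#include <vector>

// Hypothetical sketch: a query term pairing a token with its weight.
// In PISA the token would become a uint32_t term_id after lexicon lookup.
struct query_term {
    std::string term;
    double weight;
};

// Parse input like "hello:20 world:10 plain"; a token without a
// ":<number>" suffix gets the default weight of 1.0.
std::vector<query_term> parse_weighted_query(const std::string& input)
{
    std::vector<query_term> terms;
    std::istringstream in(input);
    std::string token;
    while (in >> token) {
        auto colon = token.rfind(':');
        if (colon != std::string::npos && colon + 1 < token.size()) {
            terms.push_back(
                {token.substr(0, colon), std::stod(token.substr(colon + 1))});
        } else {
            terms.push_back({token, 1.0});
        }
    }
    return terms;
}
```

With a single vector of structs, the term and its weight travel together, avoiding the index-alignment hazards of parallel vectors.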

@elshize
Member Author

elshize commented Jan 26, 2023

> Would it be more or less painful to have a vector of pairs/structs?

Possibly. This sounds like a good idea, but I would have to look at the code again to see if there's anything preventing (or discouraging) it.

@elshize elshize self-assigned this Dec 22, 2023
@seanmacavaney

Amazing -- thanks for the fix @elshize!

@elshize
Member Author

elshize commented Jan 16, 2024

@seanmacavaney please note that this is quite a rewrite of query parsing/handling. I'm not sure how much it will affect your Python binding once you upgrade.

We would love to provide better stability in our APIs, but I'm currently actively trying to improve multiple parts of the library, so unfortunately it will get worse before it gets better.

If at any point you have any questions or issues, I'd be more than happy to help with any future upgrades.

@seanmacavaney

Thanks for letting me know. It sounds like it's best to hold off on any changes to the Python integration until the API stabilises a bit.

Are there some specific GitHub issues that you recommend I subscribe to in order to keep an eye on this progress?

@elshize
Member Author

elshize commented Jan 18, 2024

> Are there some specific GitHub issues that you recommend I subscribe to in order to keep an eye on this progress?

Not really, but it may be a good idea to open a tracking issue. Let me think briefly about how best to organize it, and I'll let you know.

@elshize
Member Author

elshize commented Jan 18, 2024

@seanmacavaney I created an issue; there's not much there yet, but you can subscribe to it to get updates: #569
