Query-time boosting affects model execution but not feature value logging #368

tmanabe · 2021-05-10T08:11:39Z

Hi. I have a question about this plugin and query-time boosting.
It appears that query-time boosting affects only the execution of models and does not affect feature value logging.
In my opinion, this behavior may lead to inaccurate modeling.
Is this expected behavior?

A test case to reproduce the behavior:
main...tmanabe:different-feature-values

Thanks!


nomoa · 2021-05-10T12:57:46Z

This is true: boosting only affects the output score of the model. I think this is what is expected when you boost a query: the query score is multiplied by the boost value.

If the query boost were to affect the feature values, I'm not sure how this would work:

- Depending on the model, the output will no longer be what most users expect: output_score != model_score * boost.
- The boost becomes inherent to model training. Running the model with a different boost than the one used to extract feature values might lead to weird behavior, especially for decision trees.

I think it's less error-prone to keep features as independent as possible from their context (the parent query boost here) to reduce possible discrepancies between training and runtime.
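To make the first point concrete, here is a minimal sketch of why boosting feature values is not the same as boosting the model output. The two toy models (`linear`, `stump`), the feature value, and the boost are made up for illustration and are not taken from the plugin.

```java
public class BoostDemo {
    // Hypothetical linear model with no intercept: score = 2 * feature.
    static double linear(double feature) {
        return 2.0 * feature;
    }

    // Hypothetical decision stump whose threshold was learned on unboosted values.
    static double stump(double feature) {
        return feature > 1.0 ? 10.0 : 1.0;
    }

    public static void main(String[] args) {
        double feature = 0.8; // feature value as it would be logged (no boost)
        double boost = 2.0;   // query-time boost

        // Linear model without intercept: boosting the input or the output agrees.
        System.out.println("linear: " + linear(feature * boost)
                + " vs " + linear(feature) * boost);

        // Stump: the boosted feature crosses the threshold, so the two disagree.
        System.out.println("stump:  " + stump(feature * boost)
                + " vs " + stump(feature) * boost);
    }
}
```

For the stump, the boosted feature value lands in a different leaf than the logged value would, which is the kind of training/runtime discrepancy described above.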

tmanabe · 2021-05-12T09:32:07Z

Thanks for your quick reply!

Using a debugger, I compared the feature values used for model execution with those used for feature value logging.
My understanding is that query-time boosting affects the feature values themselves (not only the model output), but only at model execution time.

So these two points are also my concern:

- Depending on the model, output_score != model_score * boost.
- Models can be trained with the original feature values and then executed with boosted feature values.

nomoa · 2021-05-14T06:59:22Z

@tmanabe thanks for digging into this!
You're absolutely correct. I wrongly assumed that RankerQuery did not propagate the boost, but it does, as your debugging session shows.
The culprit seems to be RankerQuery#createWeight at
https://github.com/o19s/elasticsearch-learning-to-rank/blob/main/src/main/java/com/o19s/es/ltr/query/RankerQuery.java#L201, where the top-level boost is passed to the feature queries.

I'd be in favor of forcing the boost to 1 for feature queries and applying the boost a posteriori from the scorer (https://github.com/o19s/elasticsearch-learning-to-rank/blob/main/src/main/java/com/o19s/es/ltr/query/RankerQuery.java#L312).
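As a rough sketch of that idea, independent of the actual RankerQuery code: feature values are always extracted with a neutral boost of 1, and the top-level boost is applied once to the model output. The `FeatureQuery` and `Model` interfaces and the `rank` method below are hypothetical stand-ins, not the plugin's or Lucene's API.

```java
public class NeutralBoostSketch {
    /** Hypothetical stand-in for a feature query: yields a value for a given boost. */
    public interface FeatureQuery {
        double score(float boost);
    }

    /** Hypothetical stand-in for a ranking model over a feature vector. */
    public interface Model {
        double score(double[] features);
    }

    // Feature values are always extracted with boost 1, so the values seen at
    // execution time match the values that would be logged for training.
    public static double[] extractFeatures(FeatureQuery[] queries) {
        double[] values = new double[queries.length];
        for (int i = 0; i < queries.length; i++) {
            values[i] = queries[i].score(1.0f);
        }
        return values;
    }

    // The top-level boost is applied a posteriori, to the model output only.
    public static double rank(FeatureQuery[] queries, Model model, float boost) {
        return model.score(extractFeatures(queries)) * boost;
    }

    public static void main(String[] args) {
        FeatureQuery[] queries = { b -> 0.8 * b };        // toy feature
        Model stump = f -> f[0] > 1.0 ? 10.0 : 1.0;       // toy decision stump
        System.out.println(rank(queries, stump, 2.0f));   // model_score * boost
    }
}
```

With this arrangement output_score == model_score * boost holds for any model, because the model never sees a boosted feature value.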

@worleydl do you have any thoughts on this?

nomoa · 2021-05-14T11:59:16Z

For context, I think the behavior changed with b907213#diff-07788001c91b0b5c03be973de2a368900204bab6c6fc6d3255ec34bcf6184c09L239, where we normalized explicitly with a boost set to 1.0F (elastic 6.1.0 upgrade).

worleydl · 2021-05-14T13:11:42Z

Thanks for the additional info, David. This definitely seems like a regression. Maybe we can add some test cases around it and get the boost set up as you describe?

nathancday · 2021-09-07T19:19:15Z

@worleydl coming back to this issue, should we lay out the unit test requirements and tag this as help wanted (or to be developed)?
