[spark] Make spark model have the same UID with its estimator #9022

WeichenXu123 · 2023-04-11T13:30:36Z

Make spark model have the same UID with its estimator.

Because all other pyspark models obey the rule that uses the same UID with estimator, and the UID is used in param copying (e.g. CrossValidator copies param list to estimator, and UID is used for param owner validation), so we need to make the UID correct.

Signed-off-by: Weichen Xu <[email protected]>

WeichenXu123 · 2023-04-11T13:31:02Z

CC @wbo4958 @trivialfis Would you mind taking a look ? Thank you!

WeichenXu123 · 2023-04-11T13:32:08Z

Can we backport this fix to xgboost 1.7.6 version ?

trivialfis · 2023-04-11T16:30:42Z

I don't have any plan for 1.7.6 at the moment, is this fix significant? In what scenario this can cause issues?

WeichenXu123 · 2023-04-12T00:11:20Z

I don't have any plan for 1.7.6 at the moment, is this fix significant? In what scenario this can cause issues?

Only some very edge case (a pyspark pipeline contains xgboost estimator, and crossvalidator tunes the pipeline, and we tunes some params related to xgboost model prediction).

It is not an critical bug so 1.7.6 release is not urgent for . :)

wbo4958

LGTM

WeichenXu123 · 2023-04-13T00:08:49Z

@trivialfis Could you trigger all CI runs and then merge the PR once CI passed? Thankyou!

WeichenXu123 · 2023-04-13T07:55:10Z

Reminder: Backport this commit to 1.7 branch :)

…tor (dmlc#9022) Signed-off-by: Weichen Xu <[email protected]>

…tor (#9022) (#9285) Signed-off-by: Weichen Xu <[email protected]> Co-authored-by: WeichenXu <[email protected]>

init

5c532ba

Signed-off-by: Weichen Xu <[email protected]>

wbo4958 approved these changes Apr 12, 2023

View reviewed changes

trivialfis approved these changes Apr 13, 2023

View reviewed changes

trivialfis merged commit 191d0aa into dmlc:master Apr 13, 2023

trivialfis mentioned this pull request Jun 8, 2023

1.7.6 Patch Release #9275

Closed

7 tasks

trivialfis pushed a commit to trivialfis/xgboost that referenced this pull request Jun 9, 2023

[backport] [spark] Make spark model have the same UID with its estima…

6903e8d

…tor (dmlc#9022) Signed-off-by: Weichen Xu <[email protected]>

trivialfis pushed a commit to trivialfis/xgboost that referenced this pull request Jun 10, 2023

[backport] [spark] Make spark model have the same UID with its estima…

892d36b

…tor (dmlc#9022) Signed-off-by: Weichen Xu <[email protected]>

trivialfis added a commit that referenced this pull request Jun 11, 2023

[backport] [spark] Make spark model have the same UID with its estima…

e882fb3

…tor (#9022) (#9285) Signed-off-by: Weichen Xu <[email protected]> Co-authored-by: WeichenXu <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[spark] Make spark model have the same UID with its estimator #9022

[spark] Make spark model have the same UID with its estimator #9022

WeichenXu123 commented Apr 11, 2023

WeichenXu123 commented Apr 11, 2023

WeichenXu123 commented Apr 11, 2023

trivialfis commented Apr 11, 2023 •

edited

Loading

WeichenXu123 commented Apr 12, 2023 •

edited

Loading

wbo4958 left a comment

WeichenXu123 commented Apr 13, 2023

WeichenXu123 commented Apr 13, 2023

[spark] Make spark model have the same UID with its estimator #9022

[spark] Make spark model have the same UID with its estimator #9022

Conversation

WeichenXu123 commented Apr 11, 2023

WeichenXu123 commented Apr 11, 2023

WeichenXu123 commented Apr 11, 2023

trivialfis commented Apr 11, 2023 • edited Loading

WeichenXu123 commented Apr 12, 2023 • edited Loading

wbo4958 left a comment

Choose a reason for hiding this comment

WeichenXu123 commented Apr 13, 2023

WeichenXu123 commented Apr 13, 2023

trivialfis commented Apr 11, 2023 •

edited

Loading

WeichenXu123 commented Apr 12, 2023 •

edited

Loading