[python-package] No warning if single alias of num_boost_round is passed #6548

vnherdeiro · 2024-07-16T20:35:34Z

I see a misleading warning log line when I pass a 'num_iterations' argument my training params. There should be no alias warning in that case, only when more than one alias for the number of boosted trees is passed.

By picking the last of the alias in the possibles (available in params), the "tie breaking behaviour" should be conserved.

jameslamb

Thanks for your interest in lightgbm.

Can you please share a minimal, reproducible example that demonstrates the behavior you're claiming this PR fixes? You can use this as a reference: #6014 (comment)

Please also see the significant prior discussion in #6324

vnherdeiro · 2024-07-18T22:04:17Z

Thanks for the welcome, feedback and advice @jameslamb

First, I just pushed a possible fix for the failing tests 🤞

About the aim of the MR, at my work I use LightGBM extensively in some AWS Sagemaker instances (thanks for contributing to this awesome package!). When I set the parameter n_estimators my screen is populated with the UserWarning line (because of repeated training instances). There may some non-repeat warning config missing on that cloud instance (because locally I cannot reproduce the message to show more than one time), but I also find that the warning message is not helpful. I had to go through the docs and code to be sure my parametrization was working as intended (why warn if I am using a single alias?).

Here are the before/after screenshots:

I read the discussion in #6324 with interest. I would defend that a warning should be raised even if the value of two+ aliases is the same. The reason being, on such instance, the user is most likely mis-using the parametrization and should be made aware of the redundant input information.

jmoralez · 2024-07-18T22:14:04Z

In your case the warning is showing up because the train function has an argument num_boost_round, which defaults to 100, so LightGBM is seeing n_estimators=1, num_boost_round=100 and warning you that it's choosing n_estimators. I believe in this case the warning is correct.

vnherdeiro · 2024-07-18T22:17:07Z

@jmoralez I will check it tomorrow =) Will let you know!

jameslamb · 2024-07-19T02:38:11Z

@vnherdeiro there is still quite a bit to do to implement the plan agreed to in #6458. For example, cv() needs to be updated with similar behavior as train(), and tests covering those 2 functions and the scikit-learn interface need to be added.

I would really like to fix this as part of the next release (thank you for reminding us!), and we are under some time pressure to get that release out in the next 2 weeks (see #6522).

In this interest of time, would it be ok with you if I close this and put up a separate pull request addressing this? We would really love to have you contribute to LightGBM, but I think this particular change might just be a difficult first contribution.

vnherdeiro · 2024-07-19T11:28:23Z

@jmoralez Have tested your point, indeed I had tried using different aliases, even 'num_boost_round' but always inside params, hence I couldn't get rid of the warning. Thanks for the help!

@jameslamb I am not familiar with the API and never used cv. I believe you this change needs a bigger scope. Agreed to close this PR. I do want to contribute to this project. Do you have any lower hanging fruit to suggest?

jameslamb · 2024-07-30T05:07:31Z

I believe you this change needs a bigger scope. Agreed to close this PR.

Thanks very much for understanding!

I'm working on this in #6579, so I'll close this PR.

I do want to contribute to this project. Do you have any lower hanging fruit to suggest?

Thanks very much, we appreciate it! #6361 or #6498 would be good places to start. You can @-mention me on those (or on a draft pull request you open) if you want any help.

When you do that, please use a branch other than master on your fork. I think you'll find that easier to work with, especially if you have multiple pull requests open at the same time.

jameslamb · 2024-08-02T18:07:35Z

@vnherdeiro if you're interested in seeing how I implemented + tested this, please see #6579.

Thanks again for your work here and for bringing this issue back to our attention. It's long overdue for a fix.

vnherdeiro · 2024-08-03T07:39:32Z

@jameslamb Thanks for suggesting the changes above as possibilities of contributing. I checked your MR for this issue btw, the new behaviour looks good to me!

No warning if single alias of num_boost_round is passed

59f0051

vnherdeiro requested review from guolinke, jameslamb, shiyu1994, jmoralez and borchero as code owners July 16, 2024 20:35

jameslamb requested changes Jul 16, 2024

View reviewed changes

jameslamb added the in progress label Jul 16, 2024

jameslamb changed the title ~~No warning if single alias of num_boost_round is passed~~ [python-package] No warning if single alias of num_boost_round is passed Jul 16, 2024

jameslamb requested a review from StrikerRUS July 16, 2024 22:10

Fixing IndexError

4e2cf27

Merge branch 'microsoft:master' into master

3463063

jameslamb mentioned this pull request Jul 30, 2024

[python-package] limit when num_boost_round warnings are emitted (fixes #6324) #6579

Merged

jameslamb closed this Jul 30, 2024

vnherdeiro mentioned this pull request Aug 24, 2024

fix some shellcheck warnings #6621

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[python-package] No warning if single alias of num_boost_round is passed #6548

[python-package] No warning if single alias of num_boost_round is passed #6548

vnherdeiro commented Jul 16, 2024 •

edited

Loading

jameslamb left a comment

vnherdeiro commented Jul 18, 2024

jmoralez commented Jul 18, 2024

vnherdeiro commented Jul 18, 2024

jameslamb commented Jul 19, 2024

vnherdeiro commented Jul 19, 2024

jameslamb commented Jul 30, 2024

jameslamb commented Aug 2, 2024

vnherdeiro commented Aug 3, 2024

[python-package] No warning if single alias of num_boost_round is passed #6548

[python-package] No warning if single alias of num_boost_round is passed #6548

Conversation

vnherdeiro commented Jul 16, 2024 • edited Loading

jameslamb left a comment

Choose a reason for hiding this comment

vnherdeiro commented Jul 18, 2024

jmoralez commented Jul 18, 2024

vnherdeiro commented Jul 18, 2024

jameslamb commented Jul 19, 2024

vnherdeiro commented Jul 19, 2024

jameslamb commented Jul 30, 2024

jameslamb commented Aug 2, 2024

vnherdeiro commented Aug 3, 2024

vnherdeiro commented Jul 16, 2024 •

edited

Loading