-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature idea - provide custom validation sets for early stopping #48
Comments
Thanks for the issue! I'm on board. :) Related to tidymodels/parsnip#760, and tidymodels/parsnip#765. My response for the analogous parsnip issues reflects where my thinking is at with this in bonsai as well.
|
Great! Thanks for the quick response. Looks like there's already a PR in |
Whenever you or others here pick this up @simonpcouch , There is a LightGBM-y way to create validation sets that is slightly different from "just subset rows". See https://lightgbm.readthedocs.io/en/latest/R/reference/lgb.Dataset.create.valid.html. |
Hi, Example: validation = 0.3 # default random sample (current solution) validation = c(0.7, 0.9) # alternative solution to select a continuous subset starting from 70% and ending at 90% of the training set. Here the code to replace the internal function after bonsai library 0.3.0 has been loaded. Hope it is useful Regards
|
Thanks for creating this excellent package. I created a similar fork of treesnip but am planning to replace it with
{bonsai}
in all our production models.One feature that I think would be incredibly useful in
{bonsai}
is the ability to provide custom validation sets during early stopping (instead of using a random split of the training data). This would have a few potential benefits:train
,validate
,test
split. Currently,{bonsai}
will further split thetrain
data intotrain subset
andvalidation specifically for early stopping
sets. Instead, it would be ideal to be able to pass thevalidate
set directly. This would mean all oftrain
would be used for training.Let me know if this is out-of-scope for this project. If not, I'm happy to contribute if needed.
The text was updated successfully, but these errors were encountered: