Skip to content

Releases: tidymodels/tune

tune 1.2.1

24 May 17:09
Compare
Choose a tag to compare
  • Addressed issue in int_pctl() where the function would error when parallelized using makePSOCKcluster() (#885).

  • Addressed issue where tuning functions would raise the error object 'iteration' not found with plan(multisession) and the control option parallel_over = "everything" (#888).

Note that this GitHub release is a re-release with the tag v1.2.1, corresponding to the package version that arrived on CRAN on 2024-04-18. The previous GitHub release with this tag did not match the version released to CRAN. Closes #897.

tune 1.2.0

20 Mar 20:06
981a6dd
Compare
Choose a tag to compare

New Features

  • tune now fully supports models in the "censored regression" mode. These models can be fit, tuned, and evaluated like the regression and classification modes. tidymodels.org has more information and tutorials on how to work with survival analysis models.

  • Introduced support for parallel processing using the future framework. The tune package previously supported parallelism with foreach, and users can use either framework for now. In a future release, tune will begin the deprecation cycle for parallelism with foreach, so we encourage users to begin migrating their code now. See the Parallel Processing section in the "Optimizations" article to learn more (#866).

  • Added a type argument to collect_metrics() to indicate the desired output format. The default, type = "long", returns output as before, while type = "wide" pivots the output such that each metric has its own column (#839).

  • Added a new function, compute_metrics(), that allows for computing new metrics after evaluating against resamples. The arguments and output formats are closely related to those from collect_metrics(), but this function requires that the input be generated with the control option save_pred = TRUE and additionally takes a metrics argument with a metric set for new metrics to compute. This allows for computing new performance metrics without requiring users to re-fit and re-predict from each model (#663).

  • A method for rsample's int_pctl() function that will compute percentile confidence intervals on performance metrics for objects produced by fit_resamples(), tune_*(), and last_fit().

  • The Brier score is now part of the default metric set for classification models.

Bug Fixes

  • last_fit() will now error when supplied a fitted workflow (#678).

  • Fixes bug where .notes entries were sorted in the wrong order in tuning results for resampling schemes with IDs that aren't already in alphabetical order (#728).

  • Fixes bug where .config entries in the .extracts column in tune_bayes() output didn't align with the entries they ought to in the .metrics and .predictions columns (#715).

  • Metrics from apparent resamples are no longer included when estimating performance with estimate_tune_results() (and thus with collect_metrics(..., summarize = TRUE) and compute_metrics(..., summarize = TRUE), #714).

  • Handles edge cases for tune_bayes()' iter argument more soundly. For iter = 0, the output of tune_bayes() should match tune_grid(), and tune_bayes() will now error when iter < 0. tune_bayes() will now alter the state of RNG slightly differently, resulting in changed Bayesian optimization search output (#720).

  • augment() methods to tune_results, resample_results, and last_fit objects now always return tibbles (#759).

Other Changes

  • Improved error message when needed packages aren't installed (#727).

  • augment() methods to tune_results, resample_results, and last_fit objects now always returns tibbles (#759).

  • Improves documentation related to the hyperparameters associated with extracted objects that are generated from submodels. See the "Extracting with submodels" section of ?collect_extracts to learn more.

  • eval_time and eval_time_target attribute was added to tune objects. There are also .get_tune_eval_times() and .get_tune_eval_time_target() functions.

  • collect_predictions() now reorders the columns so that all prediction columns come first (#798).

  • augment() methods to tune_results, resample_results, and last_fit objects now return prediction results in the first columns (#761).

  • autoplot() will now meaningfully error if only 1 grid point is present, rather than producing a plot (#775).

  • Added notes on case weight usage to several functions (#805).

  • For iterative optimization routines, autoplot() will use integer breaks when type = "performance" or type = "parameters".

Breaking Changes

  • Several functions gained an eval_time argument for the evaluation time of dynamic metrics for censored regression. The placement of the argument breaks passing-by-position for one or more other arguments to autoplot.tune_results() and the developer-focused check_initial() (#857).

  • Ellipses (...) are now used consistently in the package to require optional arguments to be named. For functions that previously had ellipses at the end of the function signature, they have been moved to follow the last argument without a default value: this applies to augment.tune_results(), collect_predictions.tune_results(), collect_metrics.tune_results(), select_best.tune_results(), show_best.tune_results(), and the developer-focused estimate_tune_results(), load_pkgs(), and encode_set(). Several other functions that previously did not have ellipses in their signatures gained them: this applies to conf_mat_resampled() and the developer-focused check_workflow(). Optional arguments previously passed by position will now error informatively prompting them to be named. These changes don't apply in cases when the ellipses are currently in use to forward arguments to other functions (#863).

tune 1.1.2

24 Aug 14:48
Compare
Choose a tag to compare
  • last_fit() now works with the 3-way validation split objects from rsample::initial_validation_split(). last_fit() and fit_best() now have a new argument add_validation_set to include or exclude the validation set in the dataset used to fit the model (#701).

  • Disambiguates the verbose and verbose_iter control options to better align with documented functionality. The former controls logging for general progress updates, while the latter only does so for the Bayesian search process. (#682)

tune 1.1.1

11 Apr 16:41
fab790e
Compare
Choose a tag to compare
  • Fixed a bug introduced in tune 1.1.0 in collect_() functions where the
    .iter column was dropped.

tune 1.1.0

05 Apr 15:09
110d0b7
Compare
Choose a tag to compare

tune 1.1.0 introduces a number of new features and bug fixes, accompanied by various optimizations that substantially decrease the total evaluation time to tune hyperparameters in the tidymodels.

New features

  • Introduced a new function fit_best() that provides a shorthand interface to fit a final model after parameter tuning. (#586)

  • Refined machinery for logging issues during tuning. Rather than printing out warnings and errors as they appear, the package will now only print unique tuning issues, updating a dynamic summary message that maintains counts of each unique issue. This feature is only enabled for tuning sequentially and can be manually toggled with the verbose option. (#588)

  • Introduced collect_extracts(), a function for collecting extracted objects from tuning results. The format of results closely mirrors collect_notes(), where the extracted objects are contained in a list-column alongside the resample ID and workflow .config. (#579)

Bug fixes

  • Fixed bug in select_by_pct_loss() where the model with the greatest loss within the limit was returned rather than the most simple model whose loss was within the limit. (#543)

  • Fixed bug in tune_bayes() where .Last.tune.result would return intermediate tuning results. (#613)

  • Extended show_best(), select_best(), select_by_one_std_error(), select_by_pct_loss() to accommodate metrics with a target value of zero (notably, yardstick::mpe() and yardstick::msd()). (#243)

Other changes

  • Implemented various optimizations in tune's backend that substantially decrease the total evaluation time to tune hyperparameters with the tidymodels. (#634, #635, #636, #637, #640, #641, #642, #648, #649, #653, #656, #657)

  • Allowed users to supply list-columns in grid arguments. This change allows for manually specifying grid values that must be contained in list-columns, like functions or lists. (#625)

  • Clarified error messages in select_by_* functions. Error messages now only note entries in ... that are likely candidates for failure to arrange(), and those error messages are no longer duplicated for each entry in ....

  • Improved condition handling for errors that occur during extraction from workflows. While messages and warnings were appropriately handled, errors occurring due to misspecified extract() functions being supplied to control_*() functions were silently caught. As with warnings, errors are now surfaced both during execution and at print() (#575).

  • Moved forward with the deprecation of parameters() methods for workflows, model_specs, and recipes. Each of these methods will now warn on every usage and will be defunct in a later release of the package. (#650)

  • Various bug fixes and improvements to documentation.

tune 1.0.1

10 Oct 20:20
c358a47
Compare
Choose a tag to compare
  • last_fit(), fit_resamples(), tune_grid(), and tune_bayes() do not automatically error if the wrong type of control object is passed. If the passed control object is not a superset of the one that is needed, the function will still error. As an example, passing control_grid() to tune_bayes() will fail but passing control_bayes() to tune_grid() will not. (#449)

  • The collect_metrics() method for racing objects was removed (and is now in the finetune package).

  • Improved prompts related to parameter tuning. When tuning parameters are supplied that are not compatible with the given engine, tune_*() functions will now error. (#549)

  • The control_bayes() got a new argument verbose_iter that is used to control the verbosity of the Bayesian calculations. This change means that the verbose argument is being passed to tune_grid() to control its verbosity.

  • The control_last_fit() function gained an argument allow_par that defaults to FALSE. This change addresses failures after last_fit() using modeling engines that require native serialization, and we anticipate little to no increase in time-to-fit resulting from this change. (#539, tidymodels/bonsai#52)

  • show_notes() does a better jobs of... showing notes. (#558)

tune 1.0.0

07 Jul 20:51
b07ebde
Compare
Choose a tag to compare
  • show_notes() is a new function that can better help understand warnings and errors.

  • Logging that occurs using the tuning and resampling functions now show multi-line error messages and warnings in multiple lines.

  • When fit_resamples(), last_fit(), tune_grid(), or tune_bayes() complete without error (even if models fail), the results are also available via .Last.tune.result.

  • last_fit() now accepts a control argument to allow users to control aspects of the last fitting process via control_last_fit() (#399).

  • Case weights are enabled for models that can use them.

  • Some internal functions were exported for use by other packages.

  • A check was added to fit_resamples() and last_fit() to give a more informative error message when a preprocessor or model have parameters marked for tuning.

  • outcome_names() works correctly when recipe has NA roles. (#518)

tune 0.2.0

19 Mar 16:38
139360d
Compare
Choose a tag to compare
  • The .notes column now contains information on the type of note (error or warning), the location where it occurred, and the note. Printing a tune result has different output describing the notes.

  • collect_notes() can be used to gather any notes to a tibble. (#363)

  • Parallel processing with PSOCK clusters is now more efficient, due to carefully avoiding sending extraneous information to each worker (#384, #396).

  • The engine arguments for xgboost alpha, lambda, and scale_pos_weight are now tunable.

  • When the Bayesian optimization data contain missing values, these are removed before fitting the GP model. If all metrics are missing, no GP is fit and the current results are returned. (#432)

  • Moved tune() from tune to hardhat (#442).

  • The parameters() methods for recipe, model_spec, and workflow objects have been soft-deprecated in favor of extract_parameter_set_dials() methods (#428).

tune 0.1.6

21 Jul 17:19
b4ae617
Compare
Choose a tag to compare
  • When using load_pkgs(), packages that use random numbers on start-up do not affect the state of the RNG. We also added more control of the RNGkind to make it consistent with the user's previous value (#389).

  • New extract_*() functions have been added that supersede many of the the existing pull_*() functions. This is part of a larger move across the tidymodels packages towards a family of generic extract_*() functions. Many pull_*() functions have been soft-deprecated, and will eventually be removed. (#378)

tune 0.1.5

23 Apr 15:49
8fe768d
Compare
Choose a tag to compare
  • Fixed a bug where the resampled confusion matrix is transposed when conf_mat_resamped(tidy = FALSE) (#372)