This repository contains a sequence of notebooks that progressively train a large model on tabular data using:
- XGBoost for gradient boosted trees
- Dask for parallel computing
- Optuna for hyperparameter optimization
We find this combination pragmatic for large-scale machine learning problems.
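As a taste of how the pieces fit together, here is a minimal sketch of distributed XGBoost training on a Dask cluster. It assumes a local cluster and synthetic data; the notebooks work with real tabular data and real infrastructure.

```python
# Minimal sketch: distributed XGBoost training with Dask.
# Assumes a LocalCluster and synthetic data, purely for illustration.
import dask.array as da
import xgboost as xgb
from dask.distributed import Client, LocalCluster

if __name__ == "__main__":
    # Spin up a local Dask cluster; in practice this would point
    # at an existing scheduler instead.
    cluster = LocalCluster(n_workers=4)
    client = Client(cluster)

    # Synthetic tabular data, chunked so Dask can distribute it.
    X = da.random.random((100_000, 20), chunks=(10_000, 20))
    y = da.random.randint(0, 2, size=(100_000,), chunks=(10_000,))

    # DaskDMatrix keeps the data distributed across workers.
    dtrain = xgb.dask.DaskDMatrix(client, X, y)

    # xgb.dask.train coordinates one XGBoost worker per Dask worker.
    output = xgb.dask.train(
        client,
        {"objective": "binary:logistic", "tree_method": "hist"},
        dtrain,
        num_boost_round=100,
        evals=[(dtrain, "train")],
    )
    booster = output["booster"]  # the trained model
    print(output["history"])     # per-round evaluation metrics
```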
The notebooks in this repository are progressively more sophisticated. We start by exploring the data and running a simple training job with XGBoost and Dask, much like the sketch above. We then show how to do hyperparameter optimization with XGBoost, Dask, and Optuna (sketched below), and finally we train many models in parallel using all three tools together. This progression helps make clear what each tool does and how best to combine them.
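The hyperparameter optimization step looks roughly like the following sketch. The parameter names and search ranges are illustrative rather than the ones used in the notebooks, and for brevity it trains single-machine XGBoost on synthetic data instead of the Dask-distributed version above.

```python
# Illustrative sketch: Optuna hyperparameter search over XGBoost.
# Search ranges and data are placeholders, not the notebooks' setup.
import numpy as np
import optuna
import xgboost as xgb
from sklearn.metrics import log_loss
from sklearn.model_selection import train_test_split

# Synthetic stand-in data.
rng = np.random.default_rng(0)
X = rng.random((10_000, 20))
y = rng.integers(0, 2, size=10_000)
X_train, X_valid, y_train, y_valid = train_test_split(X, y, random_state=0)

dtrain = xgb.DMatrix(X_train, label=y_train)
dvalid = xgb.DMatrix(X_valid, label=y_valid)

def objective(trial: optuna.Trial) -> float:
    # Optuna samples one candidate configuration per trial.
    params = {
        "objective": "binary:logistic",
        "max_depth": trial.suggest_int("max_depth", 3, 10),
        "learning_rate": trial.suggest_float("learning_rate", 1e-3, 0.3, log=True),
        "subsample": trial.suggest_float("subsample", 0.5, 1.0),
    }
    booster = xgb.train(params, dtrain, num_boost_round=100)
    preds = booster.predict(dvalid)
    # Validation log loss is the value Optuna minimizes.
    return log_loss(y_valid, preds)

study = optuna.create_study(direction="minimize")
study.optimize(objective, n_trials=50)
print(study.best_params)
```

In the later notebooks, the same loop drives the Dask-distributed training shown earlier, and many such trials run in parallel across the cluster.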
We hope that these notebooks serve as a prototype for others to adapt to their needs.