This repo includes all of the IPython notebooks that I will go through in the tutorial. It is targeted to people who have intermediate knowledge in machine learning and wants to learn more advanced features of the Scikit-learn.
It tries to cover the following concepts in Scikit-Learn:
- Pipeline
- Cross-Validation
- Grid-Search
- Randomized Grid Search
- Distance and Scoring Functions
- Feature Unions and Engineering
- Out-of-core Learning (
partial_fit
)
The dependencies are given in the 0th notebook, to reproduce it, make sure you have at least those versions in that notebook. Otherwise, please feel free to open an issue in this repository.
You could browse the IPython notebooks in nbviewer