Python framework to speed up the ML process starting from dataset collection and preprocessing to training and prediction
git clone https://gitlab.g42.ae/uplatform/experimentation-ml-pipeline.git pip install -r requirements.txt
All configs are intended to be tweaked for appropriate model/dataset
If you already have a torch.Dataset skip preprocessing
PREPROCESSING
# collects and transforms the data
python run_preprocessing.py --config-yml=configs/preprocessing.yml
# trains one or multimple models
python run_train.py --config-yml=configs/dense_net.yml
# makes predictions with several models
python run_prediction.py --config-yml=configs/prediction.yml
Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.
Please make sure to update tests as appropriate.