
Project for SenNet + HOA - Hacking the Human Vasculature in 3D Kaggle competition + ML Zoomcamp final project


aaalexlit/hacking-human-vasculature


Project information

This project aims to develop a solution to the Kaggle competition SenNet + HOA - Hacking the Human Vasculature in 3D, which is ongoing at the time of writing (January 2024).

The project runs are logged to Weights&Biases:
https://wandb.ai/aaalex-lit/blood_vessel_segmentation

wandb

Project description

Here's a short summary of the competition description:

The goal of the competition is to segment blood vessels by creating a model trained on 3D Hierarchical Phase-Contrast Tomography (HiP-CT) data from human kidneys, to help complete a picture of vasculature throughout the human body.

This work will improve researchers' understanding of the size, shape, branching angles, and patterning of blood vessels in human tissue.

Please see the detailed description on the competition's page

In this particular project we will set a somewhat humbler goal. We will use only 2D data to train a segmentation model, and we will train it on only a part of the whole dataset. To solve this problem we will apply the YOLOv8 segmentation model from Ultralytics, training it on a selected subset of the full competition data.

YOLOv8 is the latest version of YOLO by Ultralytics. As a cutting-edge, state-of-the-art (SOTA) model, YOLOv8 builds on the success of previous versions, introducing new features and improvements for enhanced performance, flexibility, and efficiency. YOLOv8 supports a full range of vision AI tasks, including detection, segmentation, pose estimation, tracking, and classification.
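
To make this concrete, here's a minimal sketch of loading and running a YOLOv8 segmentation model with the Ultralytics API (the checkpoint and image names are illustrative, not necessarily the ones used in this project):

    from ultralytics import YOLO

    # load a pretrained YOLOv8 segmentation checkpoint (nano variant)
    model = YOLO('yolov8n-seg.pt')

    # run inference on a single image; results[0].masks holds the predicted masks
    results = model('slice.tif')
    results[0].show()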

The dataset

The complete dataset is available on Kaggle. This repo contains a small part of the full dataset's images, along with masks that differ from the original dataset: the original binary masks were converted to the YOLOv8-accepted text format.

Yashvardhan Jain, Katy Borner, Claire Walsh, Nancy Ruschman, Peter D. Lee, Griffin M. Weber, Ryan Holbrook, Addison Howard. (2023). SenNet + HOA - Hacking the Human Vasculature in 3D. Kaggle. https://kaggle.com/competitions/blood-vessel-segmentation
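
For reference, YOLOv8 expects segmentation labels as one text line per instance, class x1 y1 x2 y2 ..., with polygon coordinates normalized to [0, 1]. Below is a hedged sketch of such a conversion (the actual conversion code used for this project lives in the notebooks):

    import cv2
    import numpy as np

    def mask_to_yolo_lines(mask: np.ndarray, class_id: int = 0) -> list[str]:
        """Convert a binary mask to YOLOv8 segmentation label lines."""
        h, w = mask.shape
        contours, _ = cv2.findContours(mask.astype(np.uint8),
                                       cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
        lines = []
        for contour in contours:
            if len(contour) < 3:  # a valid polygon needs at least 3 points
                continue
            # normalize pixel coordinates to [0, 1]
            coords = (contour.reshape(-1, 2) / np.array([w, h])).flatten()
            lines.append(f'{class_id} ' + ' '.join(f'{c:.6f}' for c in coords))
        return lines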

Reproduce the work

Create environment

Disclaimer:
I prefer to use conda because it comes with a Python interpreter of the specified version, whereas with other options like pipenv, poetry, etc. you need a base interpreter of the required version. If you don't want to use conda, you can skip the conda environment setup and either use the provided Pipfile.* to reproduce the environment, or create a virtual environment of your choice (e.g. Python's built-in venv) and install the dependencies from the provided requirements.txt. In the latter case, remember that the base interpreter's Python version must be 3.10, and that 100% reproducibility is likely but not guaranteed.

Below are instructions for conda:

  1. Clone this repo

  2. Create a clean Python 3.10 based environment and activate it

    conda create -n blood-vessel-segmentation python=3.10
    conda activate blood-vessel-segmentation
  3. Install requirements

    pip install -r requirements.txt 

Run the notebooks

Spin up a Jupyter server and use your browser to open a notebook from the notebooks folder by executing:

Note on Windows OS:
Some bash scripts are used in the notebooks. I'm not 100% sure they will execute properly on Windows from within a notebook. If they don't, just follow the existing notebook outputs.

jupyter notebook

Notebooks description:

Full dataset EDA notebook.

Note: Can be run locally (proper execution on Windows is not guaranteed)

The purpose is to explore the whole dataset and the task, and decide how to reduce it for the project

Note: Can be run locally (proper execution on Windows is not guaranteed)

Create a smaller dataset from the full one.
The dataset obtained by executing this notebook is already present in this repo, so the code is provided for reference; it doesn't need to be executed unless you want to change the dataset.

Note: The notebook is meant to be executed on Google Colab

Initial experiments with different image sizes and batch sizes.
Logged to WandB: https://wandb.ai/aaalex-lit/blood_vessel_segmentation

Note: The notebook is meant to be executed on Google Colab

The results of the experiments can be found in the hyperparameter_tuning_results folder

Funnily enough, the best hyperparameters were achieved on the first iteration: Tune Fitness Plot

The best hyperparameters can be found in the best_hyperparameters.yaml

They're used in the following notebook to train the final model.
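
For context, hyperparameter searches like this can be run with Ultralytics' built-in tuner, which saves the best combination it finds to a best_hyperparameters.yaml file; a minimal sketch (epoch/iteration counts and the dataset config name are illustrative):

    from ultralytics import YOLO

    model = YOLO('yolov8n-seg.pt')
    # runs a series of short trainings with mutated hyperparameters and
    # saves the best combination found to a best_hyperparameters.yaml
    model.tune(data='dataset.yaml', epochs=10, iterations=30)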

In the end it turned out that it was in fact a mistake to train with image size 1024 and batch size 4 for longer, even with the best hyperparameters. It's clear from the WandB graphs that image size 1600 with batch size 9 shows much better performance with a shorter training time, so the final model is trained with those parameters. The training of the final model is exported to a separate script.
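
Conceptually, the final training run boils down to something like this sketch (the dataset config name and model size are assumptions; train.py is the authoritative version):

    from ultralytics import YOLO

    model = YOLO('yolov8n-seg.pt')
    model.train(
        data='dataset.yaml',              # assumed dataset config
        imgsz=1600,                       # image size picked from the WandB comparison
        batch=9,
        cfg='best_hyperparameters.yaml',  # the tuned hyperparameters
    )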

As the name suggests, the notebooks in the kaggle_notebooks folder can be found and run on Kaggle. They were used in the competition, but none of them are directly utilized in this project.

Model training script

train.py

Please note that the script is designed to train on a GPU with at least 15 GB of GPU RAM (I ran the training on a Google Colab T4).
If you try to run it on your computer without a GPU (and you're not on a Mac), it will most likely fail with an OOM error.

I tried to run it on my Mac M2 and it worked, but it was still estimated to take about 4 minutes per epoch, which is way too slow compared to ~25 seconds per epoch on Google Colab with a T4.

To run the training, execute (make sure the blood-vessel-segmentation environment is activated):

python train.py

Model deployment

The model is deployed with FastAPI + Uvicorn.
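
A hedged sketch of how a service like predict.py can be structured (best.pt is an assumed weights path, and the RLE helper is simplified: it ignores that YOLOv8 returns masks at the inference resolution rather than the original image size):

    import cv2
    import numpy as np
    import uvicorn
    from fastapi import FastAPI
    from fastapi.responses import Response
    from pydantic import BaseModel
    from ultralytics import YOLO

    app = FastAPI()
    model = YOLO('best.pt')  # assumed path to the trained weights

    class ImageUrl(BaseModel):
        url: str

    def rle_encode(mask: np.ndarray) -> str:
        """Kaggle-style run-length encoding of a binary mask."""
        pixels = np.concatenate([[0], mask.flatten(), [0]])
        runs = np.where(pixels[1:] != pixels[:-1])[0] + 1
        runs[1::2] -= runs[::2]
        return ' '.join(map(str, runs))

    @app.post('/predict_rle_mask')
    def predict_rle_mask(image: ImageUrl):
        result = model(image.url)[0]  # Ultralytics accepts URLs as sources
        if result.masks is None:
            return {'rle': ''}
        # merge the per-instance masks into a single binary mask
        merged = result.masks.data.any(dim=0).cpu().numpy().astype(np.uint8)
        return {'rle': rle_encode(merged)}

    @app.post('/predict_img')
    def predict_img(image: ImageUrl):
        result = model(image.url)[0]
        annotated = result.plot()  # BGR image with masks and boxes drawn
        _, jpg = cv2.imencode('.jpg', annotated)
        return Response(content=jpg.tobytes(), media_type='image/jpeg')

    if __name__ == '__main__':
        uvicorn.run(app, host='0.0.0.0', port=8000)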

Running locally

To run the service locally, execute

python predict.py

That will spin up a Uvicorn server on port 8000 (make sure the port is not occupied).

The API can be tested directly in the browser using the built-in Swagger UI, accessible at http://127.0.0.1:8000/docs.

Swagger UI

There are 2 endpoints in the service:

  1. /predict_rle_mask:
    returns the Run-Length Encoded (RLE) predicted mask.
    This format is required for competition submission.
    It's a lossless encoding that can easily be decoded back into a matrix on the other end (see the decoding sketch after this list).

    Alternatively it can be tested from the command line using curl

    curl -X 'POST' \
    'http://localhost:8000/predict_rle_mask' \
    -H 'accept: application/json' \
    -H 'Content-Type: application/json' \
    -d '{
    "url": "https://github.com/aaalexlit/hacking-human-vasculature/raw/main/dataset/test/images/1505.tif"
    }'
  2. /predict_img:
    returns a JPG image with the predicted masks and boxes.
    Added for the sake of quick visual assessment.

    Alternatively it can be tested from the command line using curl

    curl -X 'POST' \
    'http://localhost:8000/predict_img' \
    -H 'accept: application/json' \
    -H 'Content-Type: application/json' \
    -d '{
    "url": "https://github.com/aaalexlit/hacking-human-vasculature/raw/main/dataset/test/images/1505.tif"
    }' --output result.jpg

    Running this command will save the prediction as a result.jpg file. result.jpg
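
As mentioned in the /predict_rle_mask description, the RLE response can be decoded back into a binary matrix on the client side; a minimal sketch (the caller must know the image shape):

    import numpy as np

    def rle_decode(rle: str, shape: tuple[int, int]) -> np.ndarray:
        """Decode a Kaggle-style RLE string into a binary mask of the given shape."""
        mask = np.zeros(shape[0] * shape[1], dtype=np.uint8)
        runs = list(map(int, rle.split()))
        for start, length in zip(runs[::2], runs[1::2]):
            mask[start - 1 : start - 1 + length] = 1  # RLE starts are 1-indexed
        return mask.reshape(shape)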

Containerization

To build the service's Docker container and spin it up, run
(before doing that, make sure port 80 is not occupied):

docker compose up --build

Then you can test the service through the UI available at

!!!Note the port difference compared to the local version!!!

http://localhost/docs

or by running the following curl (for the RLE endpoint):

curl -X 'POST' \
'http://localhost/predict_rle_mask' \
-H 'accept: application/json' \
-H 'Content-Type: application/json' \
-d '{
"url": "https://github.com/aaalexlit/hacking-human-vasculature/raw/main/dataset/test/images/1505.tif"
}'

To clean up after stopping the container, run:

docker compose down

Deploy on the local K8s cluster

  1. Install kind and kubectl following the instructions

  2. Create K8s cluster with kind and check that it's running

    kind create cluster
    kubectl cluster-info --context kind-kind
  3. After running the service with Docker Compose above, you should have the image hacking_human_vasculature-predict:latest. We need to give it a tag so that it works properly with kind, and then load the image into kind:

    docker tag hacking_human_vasculature-predict:latest blood-vessel-seg:v1
    kind load docker-image blood-vessel-seg:v1
  4. Apply the deployment.yaml

    kubectl apply -f k8s/deployment.yaml 
  5. Perform port-forwarding to test the deployment

    # get the pod name
    kubectl get po
    # use the pod name from the output of the previous command
    kubectl port-forward segment-8587787685-9xgdc 8080:80
    # Curl from the other terminal
    curl -X 'POST' \
        'http://localhost:8080/predict_rle_mask' \
        -H 'accept: application/json' \
        -H 'Content-Type: application/json' \
        -d '{
        "url": "https://github.com/aaalexlit/hacking-human-vasculature/raw/main/dataset/test/images/1505.tif"
        }'
    # terminate port forwarding
  6. Create a K8s service for load balancing by applying service.yaml

    kubectl apply -f k8s/service.yaml
  7. And test it using the port forwarding again

    kubectl port-forward svc/segment 8080:80  
    # Curl from the other terminal
    curl -X 'POST' \
        'http://localhost:8080/predict_rle_mask' \
        -H 'accept: application/json' \
        -H 'Content-Type: application/json' \
        -d '{
        "url": "https://github.com/aaalexlit/hacking-human-vasculature/raw/main/dataset/test/images/1505.tif"
        }'
    # terminate port forwarding

Deploy to AWS EKS

  1. Install eksctl following the instructions

  2. Create EKS cluster by applying eks-config.yaml

    eksctl create cluster -f k8s/eks-config.yaml
  3. Create the ECR repository if it doesn't exist, then tag and push the blood-vessel-seg:v1 image created earlier:

    ./k8s/push-to-ecr.sh blood-vessel-seg:v1 
  4. Check the newly created EKS cluster nodes

    kubectl get nodes
  5. Apply K8s deployment and service

    kubectl apply -f k8s/deployment-eks.yaml
    kubectl apply -f k8s/service.yaml
  6. Test getting the predictions through the load balancer on AWS

    curl -X 'POST' \
    'http://a897730848cb34e6984ed9b1879dc310-720995883.us-west-2.elb.amazonaws.com/predict_rle_mask' \
    -H 'accept: application/json' \
    -H 'Content-Type: application/json' \
    -d '{
    "url": "https://github.com/aaalexlit/hacking-human-vasculature/raw/main/dataset/test/images/1505.tif"
    }'

    prediction with the service deployed on AWS EKS

Visualize the pods on the AWS Console (Optional)

  1. To be able to see the cluster on the AWS Console, the aws-auth ConfigMap needs to be edited:

    kubectl edit configmap aws-auth -n kube-system

    And add the following:

    mapUsers: "- groups: \n  - system:masters\n  userarn: arn:aws:iam::<aws-account-id>:root\n"

EKS deployments on AWS Console

Delete the cluster and clean-up

  1. Delete cluster

    kubectl delete pdb coredns -n kube-system
    eksctl delete cluster --name mlzoomcamp-cluster
  2. Delete the ECR repo

    ./k8s/destroy-infra.sh
  3. Delete the CloudFormation stack created by eksctl

    aws cloudformation delete-stack --stack-name eksctl-mlzoomcamp-cluster-nodegroup-ng-m5-large
    
