
Master's of Data Science final course capstone: a cervical cancer risk detection model, a find-a-provider tool, and a Streamlit web app.


mxa256/Cervical_Cancer_Risk_Assessment

 
 


Cervical_Cancer_Risk_Assessment

Cervical Cancer Image Model

  • CervicalCancerImages_EDA.ipynb:

Cervical Cancer Risk Factors Model

Find a Provider Near You

  • npi_filtering.ipynb: Initial exploration and filtering of the raw NPI data. The data is filtered from 7.6 million entries down to ~400k and from 330 columns down to 32. The result includes only physicians actively practicing in the US who could reasonably be expected to perform Pap smears (i.e., OB/GYN, Family Medicine, and Internal Medicine physicians). Note: raw data for the NPI Registry is available at https://download.cms.gov/nppes/NPI_Files.html

  • prepare_database.ipynb: Sets up the batch geocoding scripts, merges the resulting data, and manipulates columns for formatting, aggregation, and more descriptive content. Finally, the prepared data is pushed into a PostgreSQL database table.

  • geospatial_queries.ipynb: Tests geospatial queries and features for the Streamlit app.

  • streamlit_test.py: Minimal Streamlit app containing all functionality for the find-a-provider feature. Supports searching by address or by current location, and outputs an interactive map and a table.
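
To illustrate the kind of lookup the find-a-provider feature performs, here is a minimal sketch of a radius search around a geocoded point. It is not taken from the notebooks: the repository stores providers in PostgreSQL, and its actual queries may look quite different. The sketch swaps in a plain in-memory haversine distance, and the function names, the `lat`/`lon`/`specialty` keys, and the sample rows are all hypothetical.

```python
import math

EARTH_RADIUS_KM = 6371.0088  # mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def nearest_providers(providers, lat, lon, radius_km=25.0, limit=10):
    """Return providers within radius_km of (lat, lon), closest first."""
    hits = []
    for p in providers:
        d = haversine_km(lat, lon, p["lat"], p["lon"])
        if d <= radius_km:
            # Copy the row and attach the computed distance for display.
            hits.append({**p, "distance_km": round(d, 2)})
    hits.sort(key=lambda p: p["distance_km"])
    return hits[:limit]


# Hypothetical sample rows; real rows would come from the prepared database table.
providers = [
    {"name": "Provider A", "specialty": "OB/GYN", "lat": 40.7580, "lon": -73.9855},
    {"name": "Provider B", "specialty": "Family Medicine", "lat": 40.6782, "lon": -73.9442},
    {"name": "Provider C", "specialty": "Internal Medicine", "lat": 34.0522, "lon": -118.2437},
]

# Search around a geocoded address or the user's current location.
results = nearest_providers(providers, lat=40.7128, lon=-74.0060)
for r in results:
    print(r["name"], r["specialty"], r["distance_km"], "km")
```

With roughly 400k rows, a linear scan like this would likely be too slow for an interactive app, which is one reason to run the distance filter inside the database instead.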

