This GitHub repository is intended for exploration in my capstone research. I apologize for the messy code. The actual paper can be found at the root of the director as Capstone_Project_1__Housing_Price_Predictions_in_San_Francisco.pdf
. Please read this if you are interested in my actual project.
- Data Cleaning
- Contains jupyter notebooks of iterations of data cleaning i performed.
- ENG_DATA
- Contains the data sets after I performed operations on them.
- RAW_DATA
- Contains the data sets i began with
- RESULTS
- Contains some documentation of the results. These were mostly intended for the paper submission.
- Models
- These contain notebooks I ran the models in
- Feature_Visualization_and_Selection
- These contain notebooks where I performed visualizations, selections, and other techniques on the data.
- Data_Cleaner
- This is a python library I intend to build to automatically clean the data. It is heavily still under development and was not used for the paper submission.