This project is essentially a webscrape of an MBA website where a former admissions officer gives people the odds that they'll get into certain programs across the country.
There is an api deployed at https://jcp.pythonanywhere.com/api/v1
and it processes POST requests with the following format.
The following JSON payload would be sent to the api to receive the chances of admission to University of Chicago Booth MBA for an underrepresented minority male with a 3.1
gpa, a 650
gmat, a STEM major, and a degree from a well recognized school (whatever that means).
{"gpa":"3.1",
"gmat":"650",
"major":"1",
"race":"1",
"gender":"0",
"school":"1",
"university":"booth"}
gpa
: your gpa on a 4 point scale
major
: 1 for STEM, 0 for non stem
urm
: 1 for underrepresented minority, 0 for not
gender
: 1 for female, else 0
university
: 1 if you went to a school everyone knows, 0 if its less known
school
: the school you want to predict your MBA admission chances for. choices are ["stanford","harvard","wharton","booth","columbia","sloan","kellogg"]
noting that these features are not high enough resolution to truly represent an applicant, but read the blog post for more on that
-
The
page_parser
is used to build a dataset of applicant profiles from Poets and Quants -
Applicant information will be the features that go into a ML model. probably just linear regression. Not sure yet. Progress on those results can be found here
-
Uses the basic features from the webscrape:
- GMAT or GMAT equivalent (using this conversion tool)
- GPA
- school
- major
- gender
- race
predict someones1 chances of getting into different2 business schools based on their profile3.
1 someone willing to provide the features listed above
2 so poets & quants or their readers really only seem to be interested in the same 6 or 7 schooles. harvard, stanford, yale, ross, kellogg, booth, anderson, and occassionally a few others. so its not any school.
3Yes, a lot more goes into an admission decision, but this is still fun to do. Since I'm assuming people don't want to read all 250+ profiles looking for one with a similar background/stats to them. I think there's real demand for this because the comments in those articles are overflowing with people posting their profiles and waiting for the website to do a feature on them.
** Peep the latest data scrape progress from a more updated version **
old visuals of the webscraper doin its thingy thing.
Its still pretty early but if you have suggestions, thoughts, feedback, criticism, etc feel free to open a PR or submit an Issue.
Thanks in advance 😊
If ya feeling generous, hollr @ the kid ❤️
https://www.paypal.me/hijodelsol
BTC: 3EbMygEoo8gqgPHxmqa631ZVSwgWaoCj3m
ETH: 0x2F2604AA943dB4E7257636793F38dD3B1808A9e7
LTC: MQVgzNDgw43YzyUg3XmH3jQ7L8ndVswmN3