In the Notebook below, I use supervised machine learning algorithms to help a non-profit predict whether an individual makes more or less than $50,000. This information is very important to them because it will inform their fundraising efforts.
I explain the approach to the problem, how to go about pre-processing the data, how to create a pipeline to examine three supervised machine learning models, how to select one and how to analyze its most important features.
This project was an incredible learning experience, and I hope to provide a similar service to a non-profit in the future.
Here is a link to the code in full page mode:
Interesting choice to split the above and below 50K into a binary.